B. FIELDS SEARCHED

Electronic data bases consulted (Name of data base and where practicable terms used): 
GenBank

search terms: structures provided by specifically claimed SEQ ID NO. which encode the claimed polynucleotides. 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search fees
must be paid. 

Group I, claim(s) 1-7, 24-33, drawn to no fewer than 694 different isolated polynucleotides, vectors containing the
polynucleotides, host cells comprising said polynucleotide and methods of producing polypeptides.

Group II, claim(s) 8-11, drawn to no fewer than 682 different polypeptides encoded by the polynucleotides.

Group III, claim(s) 12-20, drawn to compositions comprising a single polypeptide, an antisecretory agent, a bismuth salt
or a combination thereof.

Group IV, claims 21-23, drawn to compositions comprising two Helicobacter polypeptides. 
Group V, claim 34, drawn to a composition comprising antibodies to a polypeptide. 

The polypeptide encoding polynucleotides, vectors containing them, organisms transformed with them and methods of 
polypeptide production are materially different from each other and therefore independent and distinct from the 
polypeptides and compositions of Groups II, III, and IV, as well as the protein antibodies of Group V.

This application contains claims directed to more than one species of generic invention. These species are deemed to 
lack Unity of Invention because they are not so linked as to form a single inventive concept under PCT Rule 13.1. In 
order for more than one species to be examined the appropriate additional examination fees must be paid, the species 
are as follows: Group I contains separate polynucleotide species for each sequence mentioned. Therefore, there is a 
minimum of 694 different species. Group II contains at least one polypeptide for each polynucleotide sequence 
mentioned. Therefore, there is a minimum of 682 species in Group II. For the Group that applicant elects, a total of 
10 (ten) specified sequences will be searched and no more than 4 (four) specified sequences will be searched for each
additional fee paid. The species listed above do not relate to a single inventive concept under PCT Rule 13.1 because, 
under PCT Rule 13.2, the species lack the same or corresponding special technical features for the following reasons: 
There is no relationship between or among the various nucleotide and polypeptide amino acid sequences mentioned in 
the claims. 
Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:
1. [ ] Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely:



2. [ ] Claims Nos.:

because they relate to parts of the international application that do not comply with the prescribed requirements to such
an extent that no meaningful international search can be carried out, specifically:



3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet.



1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all searchable
claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search report covers
only those claims for which fees were paid, specifically claims Nos.:



4. [X] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:
1-7, 24-33; species SEQ ID NO: 26, 56, 198, 212, 264, 278, 490, 838, 1220, 1292



Remark on Protest 



[ ] The additional search fees were accompanied by the applicant's protest.
[ ] No protest accompanied the payment of additional search fees.
A. CLASSIFICATION OF SUBJECT MATTER
IPC(6): A01N 43/04; A61K 31/70

US CL : 514/44 

According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 514/44 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please See Extra Sheet. 



C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category*: X, P

Citation of document, with indication, where appropriate, of the relevant passages:
TOMB, J.-F. et al. The complete genome sequence of the gastric
pathogen Helicobacter pylori. Nature. 07 August 1997, Vol. 388,
pages 539-547, see entire document.

Relevant to claim No.: 1-7, 24-33



[ ] Further documents are listed in the continuation of Box C.    [ ] See patent family annex.



* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance

"E" earlier document published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other means

"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art

"&" document member of the same patent family


Date of the actual completion of the international search 
17 JUNE 1998 


Date of mailing of the international search report 
08 July 1998 (08.07.98) 


Name and mailing address of the ISA/US
Commissioner of Patents and Trademarks
Box PCT
Washington, D.C. 20231
Facsimile No. (703) 305-3230 


Authorized officer

GINNY PORTNER
Telephone No. (703) 308-0196
the compound of claim 8, 9, or 10 admixed with a physiologically acceptable
diluent or carrier. 
29. A composition that comprises a bacterial vector comprising a DNA
molecule of claim 3, said DNA molecule being placed under conditions for 
expression and said bacterial vector being admixed with a physiologically 
acceptable diluent or carrier. 

30. The composition of claim 29, wherein said vector is selected from
the group consisting of Shigella, Salmonella, Vibrio cholerae, Lactobacillus,
Bacille bilie de Calmette-Guerin, and Streptococcus. 

31. The composition of claim 24, wherein said polynucleotide is a DNA
molecule that is inserted in a plasmid that is unable to replicate and to
substantially integrate in a mammalian genome and is placed under conditions
for expression in a mammalian cell. 

32. An expression cassette comprising a DNA molecule of claim 3, said
DNA molecule being placed under conditions for expression in a procaryotic or
eucaryotic cell.

33. A process for producing a compound of claim 8, which comprises
culturing a procaryotic or eucaryotic cell transformed or transfected with an
expression cassette of claim 32, and recovering said compound from the cell
culture.

34. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a
prophylactically or therapeutically effective amount of an antibody that binds to
22. The composition of claim 21, wherein the second Helicobacter
polypeptide is a Helicobacter urease, or a subunit or a derivative thereof.

23. The composition of claim 11, further comprising an adjuvant. 

24. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a
prophylactically or therapeutically effective amount of a polynucleotide of 
claim 1 or 2 admixed with a physiologically acceptable diluent or carrier. 

25. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a
prophylactically or therapeutically effective amount of a polynucleotide of 
claim 4, 5, or 6 admixed with a physiologically acceptable diluent or carrier. 

26. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a
prophylactically or therapeutically effective amount of a polynucleotide of 
claim 7 admixed with a physiologically acceptable diluent or carrier. 

27. A composition comprising a viral vector, in the genome of which is
inserted a DNA molecule of claim 3, said DNA molecule being placed under
conditions for expression in a mammalian cell and said viral vector being 
admixed with a physiologically acceptable diluent or carrier. 

28. The composition of claim 27, wherein said viral vector is a
poxvirus.
14. The composition of claim 12, wherein said bismuth salt is selected
from the group consisting of bismuth subcitrate and bismuth subsalicylate. 

15. The composition of claim 12, wherein said antisecretory agent is a 
proton pump inhibitor. 

16. The composition of claim 15, wherein said proton pump inhibitor is 
selected from the group consisting of omeprazole, lansoprazole, and 
pantoprazole. 



17. The composition of claim 12, wherein said antisecretory agent is an
H2-receptor antagonist.

18. The composition of claim 17, wherein said H2-receptor antagonist is
selected from the group consisting of ranitidine, cimetidine, famotidine,
nizatidine, and roxatidine.



19. The composition of claim 12, wherein said antisecretory agent is a 
prostaglandin analog. 

20. The composition of claim 19, wherein said prostaglandin analog is
misoprostol or enprostil.

21. The composition of claim 11, further comprising a prophylactically
or therapeutically effective amount of a second Helicobacter polypeptide or a
derivative thereof.
GHPO 1723 (SEQ ID NO:1342), GHPO 1732 (SEQ ID NO:1344), GHPO
1739 (SEQ ID NO:1346), GHPO 1741 (SEQ ID NO:1348), GHPO 1747 (SEQ
ID NO:1350), GHPO 1749 (SEQ ID NO:1352), GHPO 1750 (SEQ ID
NO:1354), GHPO 1751 (SEQ ID NO:1356), GHPO 1755 (SEQ ID NO:1358),
GHPO 1771 (SEQ ID NO:1360), GHPO 1786 (SEQ ID NO:1362), and GHPO
1789 (SEQ ID NO:1364); or

(ii) a derivative of said Helicobacter polypeptide. 

9. The compound of claim 8, which is the mature form or a derivative of
a polypeptide encoded by a DNA molecule of claim 4. 

10. The compound of claim 8, which is the mature form or a derivative 
of a polypeptide encoded by a DNA molecule of claim 5. 

11. A pharmaceutical composition for preventing or treating
Helicobacter infection in a mammal, said composition comprising a
prophylactically or therapeutically effective amount of a compound of claim 8, 
9, or 10 admixed with a physiologically acceptable diluent or carrier. 

12. The composition of claim 11, further comprising an antibiotic, an
antisecretory agent, a bismuth salt, or a combination thereof.

13. The composition of claim 12, wherein said antibiotic is selected 
from the group consisting of amoxicillin, clarithromycin, tetracycline, 
metronidazole, and erythromycin.

1351 (SEQ ID NO:1220), GHPO 1354 (SEQ ID NO:1222), GHPO 1363 (SEQ
ID NO:1224), GHPO 1371 (SEQ ID NO:1226), GHPO 1381 (SEQ ID
NO:1228), GHPO 1401 (SEQ ID NO:1230), GHPO 1402 (SEQ ID NO:1232),
GHPO 1403 (SEQ ID NO:1234), GHPO 1408 (SEQ ID NO:1236), GHPO
1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO:1240), GHPO 1428 (SEQ
ID NO:1242), GHPO 1437 (SEQ ID NO:1244), GHPO 1439 (SEQ ID
NO:1246), GHPO 1460 (SEQ ID NO:1248), GHPO 1463 (SEQ ID NO:1250),
GHPO 1472 (SEQ ID NO:1252), GHPO 1474 (SEQ ID NO:1254), GHPO
1484 (SEQ ID NO:1256), GHPO 1489 (SEQ ID NO:1258), GHPO 1494 (SEQ
ID NO:1260), GHPO 1495 (SEQ ID NO:1262), GHPO 1498 (SEQ ID
NO:1264), GHPO 1499 (SEQ ID NO:1266), GHPO 1500 (SEQ ID NO:1268),
GHPO 1503 (SEQ ID NO:1270), GHPO 1504 (SEQ ID NO:1272), GHPO
1510 (SEQ ID NO:1274), GHPO 1518 (SEQ ID NO:1276), GHPO 1533 (SEQ
ID NO:1278), GHPO 1541 (SEQ ID NO:1280), GHPO 1544 (SEQ ID
NO:1282), GHPO 1548 (SEQ ID NO:1284), GHPO 1565 (SEQ ID NO:1286),
GHPO 1575 (SEQ ID NO:1288), GHPO 1582 (SEQ ID NO:1290), GHPO
1595 (SEQ ID NO:1292), GHPO 1597 (SEQ ID NO:1294), GHPO 1599 (SEQ
ID NO:1296), GHPO 1601 (SEQ ID NO:1298), GHPO 1609 (SEQ ID
NO:1300), GHPO 1613 (SEQ ID NO:1302), GHPO 1614 (SEQ ID NO:1304),
GHPO 1626 (SEQ ID NO:1306), GHPO 1628 (SEQ ID NO:1308), GHPO
1639 (SEQ ID NO:1310), GHPO 1640 (SEQ ID NO:1312), GHPO 1641 (SEQ
ID NO:1314), GHPO 1646 (SEQ ID NO:1316), GHPO 1662 (SEQ ID
NO:1318), GHPO 1667 (SEQ ID NO:1320), GHPO 1668 (SEQ ID NO:1322),
GHPO 1670 (SEQ ID NO:1324), GHPO 1671 (SEQ ID NO:1326), GHPO
1672 (SEQ ID NO:1328), GHPO 1678 (SEQ ID NO:1330), GHPO 1684 (SEQ
ID NO:1332), GHPO 1695 (SEQ ID NO:1334), GHPO 1697 (SEQ ID
NO:1336), GHPO 1701 (SEQ ID NO:1338), GHPO 1719 (SEQ ID NO:1340),
ID NO:1098), GHPO 1043 (SEQ ID NO:1100), GHPO 1044 (SEQ ID
NO:1102), GHPO 1051 (SEQ ID NO:1104), GHPO 1058 (SEQ ID NO:1106),
GHPO 1060 (SEQ ID NO:1108), GHPO 1075 (SEQ ID NO:1110), GHPO
1077 (SEQ ID NO:1112), GHPO 1082 (SEQ ID NO:1114), GHPO 1083 (SEQ
ID NO:1116), GHPO 1086 (SEQ ID NO:1118), GHPO 1087 (SEQ ID
NO:1120), GHPO 1090 (SEQ ID NO:1122), GHPO 1097 (SEQ ID NO:1124),
GHPO 1098 (SEQ ID NO:1126), GHPO 1103 (SEQ ID NO:1128), GHPO
1113 (SEQ ID NO:1130), GHPO 1116 (SEQ ID NO:1132), GHPO 1123 (SEQ
ID NO:1134), GHPO 1125 (SEQ ID NO:1136), GHPO 1129 (SEQ ID
NO:1138), GHPO 1130 (SEQ ID NO:1140), GHPO 1134 (SEQ ID NO:1142),
GHPO 1161 (SEQ ID NO:1144), GHPO 1166 (SEQ ID NO:1146), GHPO
1170 (SEQ ID NO:1148), GHPO 1175 (SEQ ID NO:1150), GHPO 1181 (SEQ
ID NO:1152), GHPO 1186 (SEQ ID NO:1154), GHPO 1188 (SEQ ID
NO:1156), GHPO 1191 (SEQ ID NO:1158), GHPO 1193 (SEQ ID NO:1160),
GHPO 1196 (SEQ ID NO:1162), GHPO 1204 (SEQ ID NO:1164), GHPO
1210 (SEQ ID NO:1166), GHPO 1211 (SEQ ID NO:1168), GHPO 1216 (SEQ
ID NO:1170), GHPO 1218 (SEQ ID NO:1172), GHPO 1220 (SEQ ID
NO:1174), GHPO 1223 (SEQ ID NO:1176), GHPO 1226 (SEQ ID NO:1178),
GHPO 1240 (SEQ ID NO:1180), GHPO 1246 (SEQ ID NO:1182), GHPO
1251 (SEQ ID NO:1184), GHPO 1252 (SEQ ID NO:1186), GHPO 1261 (SEQ
ID NO:1188), GHPO 1265 (SEQ ID NO:1190), GHPO 1267 (SEQ ID
NO:1192), GHPO 1278 (SEQ ID NO:1194), GHPO 1282 (SEQ ID NO:1196),
GHPO 1283 (SEQ ID NO:1198), GHPO 1287 (SEQ ID NO:1200), GHPO
1292 (SEQ ID NO:1202), GHPO 1293 (SEQ ID NO:1204), GHPO 1302 (SEQ
ID NO:1206), GHPO 1309 (SEQ ID NO:1208), GHPO 1317 (SEQ ID
NO:1210), GHPO 1318 (SEQ ID NO:1212), GHPO 1321 (SEQ ID NO:1214),
GHPO 1325 (SEQ ID NO:1216), GHPO 1341 (SEQ ID NO:1218), GHPO
NO:970), GHPO 597 (SEQ ID NO:972), GHPO 598 (SEQ ID NO:974), GHPO
604 (SEQ ID NO:976), GHPO 606 (SEQ ID NO:978), GHPO 611 (SEQ ID
NO:980), GHPO 612 (SEQ ID NO:982), GHPO 615 (SEQ ID NO:984), GHPO
632 (SEQ ID NO:986), GHPO 633 (SEQ ID NO:988), GHPO 637 (SEQ ID
NO:990), GHPO 651 (SEQ ID NO:992), GHPO 663 (SEQ ID NO:994), GHPO
686 (SEQ ID NO:996), GHPO 693 (SEQ ID NO:998), GHPO 698 (SEQ ID
NO:1000), GHPO 703 (SEQ ID NO:1002), GHPO 704 (SEQ ID NO:1004),
GHPO 705 (SEQ ID NO:1006), GHPO 707 (SEQ ID NO:1008), GHPO 721
(SEQ ID NO:1010), GHPO 727 (SEQ ID NO:1012), GHPO 728 (SEQ ID
NO:1014), GHPO 733 (SEQ ID NO:1016), GHPO 758 (SEQ ID NO:1018),
GHPO 763 (SEQ ID NO:1020), GHPO 771 (SEQ ID NO:1022), GHPO 774
(SEQ ID NO:1024), GHPO 776 (SEQ ID NO:1026), GHPO 783 (SEQ ID
NO:1028), GHPO 800 (SEQ ID NO:1030), GHPO 806 (SEQ ID NO:1032),
GHPO 807 (SEQ ID NO:1034), GHPO 808 (SEQ ID NO:1036), GHPO 809
(SEQ ID NO:1038), GHPO 811 (SEQ ID NO:1040), GHPO 815 (SEQ ID
NO:1042), GHPO 819 (SEQ ID NO:1044), GHPO 841 (SEQ ID NO:1046),
GHPO 843 (SEQ ID NO:1048), GHPO 846 (SEQ ID NO:1050), GHPO 875
(SEQ ID NO:1052), GHPO 892 (SEQ ID NO:1054), GHPO 902 (SEQ ID
NO:1056), GHPO 904 (SEQ ID NO:1058), GHPO 906 (SEQ ID NO:1060),
GHPO 908 (SEQ ID NO:1062), GHPO 921 (SEQ ID NO:1064), GHPO 923
(SEQ ID NO:1066), GHPO 926 (SEQ ID NO:1068), GHPO 933 (SEQ ID
NO:1070), GHPO 939 (SEQ ID NO:1072), GHPO 940 (SEQ ID NO:1074),
GHPO 943 (SEQ ID NO:1076), GHPO 951 (SEQ ID NO:1078), GHPO 961
(SEQ ID NO:1080), GHPO 965 (SEQ ID NO:1082), GHPO 990 (SEQ ID
NO:1084), GHPO 991 (SEQ ID NO:1086), GHPO 998 (SEQ ID NO:1088),
GHPO 1001 (SEQ ID NO:1090), GHPO 1005 (SEQ ID NO:1092), GHPO
1033 (SEQ ID NO:1094), GHPO 1039 (SEQ ID NO:1096), GHPO 1041 (SEQ
231 (SEQ ID NO:836), GHPO 236 (SEQ ID NO:838), GHPO 239 (SEQ ID
NO:840), GHPO 245 (SEQ ID NO:842), GHPO 246 (SEQ ID NO:844), GHPO
248 (SEQ ID NO:846), GHPO 253 (SEQ ID NO:848), GHPO 265 (SEQ ID
NO:850), GHPO 266 (SEQ ID NO:852), GHPO 271 (SEQ ID NO:854), GHPO
272 (SEQ ID NO:856), GHPO 286 (SEQ ID NO:858), GHPO 291 (SEQ ID
NO:860), GHPO 292 (SEQ ID NO:862), GHPO 297 (SEQ ID NO:864), GHPO
304 (SEQ ID NO:866), GHPO 307 (SEQ ID NO:868), GHPO 324 (SEQ ID
NO:870), GHPO 326 (SEQ ID NO:872), GHPO 331 (SEQ ID NO:874), GHPO
343 (SEQ ID NO:876), GHPO 345 (SEQ ID NO:878), GHPO 346 (SEQ ID
NO:880), GHPO 352 (SEQ ID NO:882), GHPO 355 (SEQ ID NO:884), GHPO
363 (SEQ ID NO:886), GHPO 369 (SEQ ID NO:888), GHPO 376 (SEQ ID
NO:890), GHPO 378 (SEQ ID NO:892), GHPO 388 (SEQ ID NO:894), GHPO
396 (SEQ ID NO:896), GHPO 403 (SEQ ID NO:898), GHPO 410 (SEQ ID
NO:900), GHPO 415 (SEQ ID NO:902), GHPO 421 (SEQ ID NO:904), GHPO
439 (SEQ ID NO:906), GHPO 441 (SEQ ID NO:908), GHPO 443 (SEQ ID
NO:910), GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO
464 (SEQ ID NO:916), GHPO 467 (SEQ ID NO:918), GHPO 468 (SEQ ID
NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO
487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID
NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO
504 (SEQ ID NO:936), GHPO 512 (SEQ ID NO:938), GHPO 517 (SEQ ID
NO:940), GHPO 520 (SEQ ID NO:942), GHPO 528 (SEQ ID NO:944), GHPO
530 (SEQ ID NO:946), GHPO 532 (SEQ ID NO:948), GHPO 548 (SEQ ID
NO:950), GHPO 561 (SEQ ID NO:952), GHPO 564 (SEQ ID NO:954), GHPO
572 (SEQ ID NO:956), GHPO 573 (SEQ ID NO:958), GHPO 574 (SEQ ID
NO:960), GHPO 577 (SEQ ID NO:962), GHPO 579 (SEQ ID NO:964), GHPO
583 (SEQ ID NO:966), GHPO 588 (SEQ ID NO:968), GHPO 593 (SEQ ID
NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706),
GHPO 1654 (SEQ ID NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673
(SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID
NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720),
GHPO 1738 (SEQ ID NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746
(SEQ ID NO:726), GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID
NO:730), GHPO 1795 (SEQ ID NO:732), GHPO 1796 (SEQ ID NO:734),
GHPO 7 (SEQ ID NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID
NO:740), GHPO 10 (SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO
25 (SEQ ID NO:746), GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID
NO:750), GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO
49 (SEQ ID NO:756), GHPO 51 (SEQ ID NO:758), GHPO 54 (SEQ ID
NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO
68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID
NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO
87 (SEQ ID NO:776), GHPO 91 (SEQ ID NO:778), GHPO 92 (SEQ ID
NO:780), GHPO 96 (SEQ ID NO:782), GHPO 97 (SEQ ID NO:784), GHPO
111 (SEQ ID NO:786), GHPO 115 (SEQ ID NO:788), GHPO 117 (SEQ ID
NO:790), GHPO 123 (SEQ ID NO:792), GHPO 124 (SEQ ID NO:794), GHPO
126 (SEQ ID NO:796), GHPO 127 (SEQ ID NO:798), GHPO 128 (SEQ ID
NO:800), GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO
140 (SEQ ID NO:806), GHPO 141 (SEQ ID NO:808), GHPO 145 (SEQ ID
NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO
181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID
NO:820), GHPO 192 (SEQ ID NO:822), GHPO 202 (SEQ ID NO:824), GHPO
204 (SEQ ID NO:826), GHPO 205 (SEQ ID NO:828), GHPO 212 (SEQ ID
NO:830), GHPO 218 (SEQ ID NO:832), GHPO 226 (SEQ ID NO:834), GHPO
NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580),
GHPO 1059 (SEQ ID NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072
(SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID
NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594),
GHPO 1115 (SEQ ID NO:596), GHPO 1159 (SEQ ID NO:598), GHPO 1177
(SEQ ID NO:600), GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID
NO:604), GHPO 1195 (SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608),
GHPO 1225 (SEQ ID NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229
(SEQ ID NO:614), GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID
NO:618), GHPO 1242 (SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622),
GHPO 1270 (SEQ ID NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298
(SEQ ID NO:628), GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID
NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636),
GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332
(SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID
NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650),
GHPO 1394 (SEQ ID NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415
(SEQ ID NO:656), GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID
NO:660), GHPO 1444 (SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664),
GHPO 1465 (SEQ ID NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479
(SEQ ID NO:670), GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID
NO:674), GHPO 1496 (SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678),
GHPO 1536 (SEQ ID NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540
(SEQ ID NO:684), GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID
NO:688), GHPO 1560 (SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692),
GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604
(SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID
342 (SEQ ID NO:442), GHPO 344 (SEQ ID NO:444), GHPO 358 (SEQ ID
NO:446), GHPO 373 (SEQ ID NO:448), GHPO 382 (SEQ ID NO:450), GHPO
384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID
NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO
446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID
NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO
456 (SEQ ID NO:472), GHPO 461 (SEQ ID NO:474), GHPO 476 (SEQ ID
NO:476), GHPO 478 (SEQ ID NO:478), GHPO 491 (SEQ ID NO:480), GHPO
511 (SEQ ID NO:482), GHPO 519 (SEQ ID NO:484), GHPO 526 (SEQ ID
NO:486), GHPO 534 (SEQ ID NO:488), GHPO 536 (SEQ ID NO:490), GHPO
542 (SEQ ID NO:492), GHPO 544 (SEQ ID NO:494), GHPO 576 (SEQ ID
NO:496), GHPO 578 (SEQ ID NO:498), GHPO 580 (SEQ ID NO:500), GHPO
585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID
NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO
654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID
NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO
718 (SEQ ID NO:522), GHPO 726 (SEQ ID NO:524), GHPO 734 (SEQ ID
NO:526), GHPO 740 (SEQ ID NO:528), GHPO 770 (SEQ ID NO:530), GHPO
782 (SEQ ID NO:532), GHPO 786 (SEQ ID NO:534), GHPO 792 (SEQ ID
NO:536), GHPO 797 (SEQ ID NO:538), GHPO 816 (SEQ ID NO:540), GHPO
828 (SEQ ID NO:542), GHPO 839 (SEQ ID NO:544), GHPO 840 (SEQ ID
NO:546), GHPO 842 (SEQ ID NO:548), GHPO 885 (SEQ ID NO:550), GHPO
889 (SEQ ID NO:552), GHPO 903 (SEQ ID NO:554), GHPO 912 (SEQ ID
NO:556), GHPO 946 (SEQ ID NO:558), GHPO 958 (SEQ ID NO:560), GHPO
968 (SEQ ID NO:562), GHPO 987 (SEQ ID NO:564), GHPO 992 (SEQ ID
NO:566), GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO
1002 (SEQ ID NO:572), GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID
798 (SEQ ID NO:310), GHPO 1102 (SEQ ID NO:312), GHPO 1114 (SEQ ID
NO:314), GHPO 1152 (SEQ ID NO:316), GHPO 1272 (SEQ ID NO:318),
GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424
(SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID
NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332),
GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708
(SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID
NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346),
GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310
(SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID
NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO
282 (SEQ ID NO:362), GHPO 1089 (SEQ ID NO:364), GHPO 1141 (SEQ ID
NO:366), GHPO 1280 (SEQ ID NO:368), GHPO 1608 (SEQ ID NO:370),
GHPO 15 (SEQ ID NO:372), GHPO 16 (SEQ ID NO:374), GHPO 36 (SEQ ID
NO:376), GHPO 38 (SEQ ID NO:378), GHPO 52 (SEQ ID NO:380), GHPO
57 (SEQ ID NO:382), GHPO 64 (SEQ ID NO:384), GHPO 79 (SEQ ID
NO:386), GHPO 84 (SEQ ID NO:388), GHPO 86 (SEQ ID NO:390), GHPO
99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID
NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO
138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID
NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO
189 (SEQ ID NO:412), GHPO 229 (SEQ ID NO:414), GHPO 243 (SEQ ID
NO:416), GHPO 244 (SEQ ID NO:418), GHPO 251 (SEQ ID NO:420), GHPO
267 (SEQ ID NO:422), GHPO 269 (SEQ ID NO:424), GHPO 279 (SEQ ID
NO:426), GHPO 284 (SEQ ID NO:428), GHPO 296 (SEQ ID NO:430), GHPO
300 (SEQ ID NO:432), GHPO 305 (SEQ ID NO:434), GHPO 319 (SEQ ID
NO:436), GHPO 330 (SEQ ID NO:438), GHPO 340 (SEQ ID NO:440), GHPO
GHPO 1574 (SEQ ID NO:180), GHPO 1627 (SEQ ID NO:182), GHPO 1657
(SEQ ID NO:184), GHPO 1664 (SEQ ID NO:186), GHPO 1694 (SEQ ID
NO:188), GHPO 1704 (SEQ ID NO:190), GHPO 1763 (SEQ ID NO:192),
GHPO 616 (SEQ ID NO:194), GHPO 76 (SEQ ID NO:196), GHPO 109 (SEQ
ID NO:198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202),
GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445 
(SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID 
NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 
836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID 
NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO 
894 (SEQ ID NO:228), GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID
NO:232), GHPO 1024 (SEQ ID NO:234), GHPO 1084 (SEQ ID NO:236), 
GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346 
(SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID 
NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250),
GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 
(SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID
NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), 
GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 
(SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID 
NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO 
325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID 
NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 
515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID 
NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO 
646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID 
NO:304), GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO 
NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO
1099 (SEQ ID NO:52), GHPO 1151 (SEQ ID NO:54), GHPO 1275 (SEQ ID
NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 1578 (SEQ ID NO:60), GHPO
22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 200 (SEQ ID NO:66),
GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70), GHPO 695 (SEQ
ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ ID NO:76), GHPO
709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO 762 (SEQ ID
NO:82), GHPO 827 (SEQ ID NO:84), GHPO 852 (SEQ ID NO:86), GHPO
1013 (SEQ ID NO:88), GHPO 1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID
NO:92), GHPO 1052 (SEQ ID NO:94), GHPO 1127 (SEQ ID NO:96), GHPO
1149 (SEQ ID NO:98), GHPO 1176 (SEQ ID NO:100), GHPO 1250 (SEQ ID
NO:102), GHPO 1312 (SEQ ID NO:104), GHPO 1358 (SEQ ID NO:106),
GHPO 1490 (SEQ ID NO:108), GHPO 1559 (SEQ ID NO:110), GHPO 1651
(SEQ ID NO:112), GHPO 1726 (SEQ ID NO:114), GHPO 1780 (SEQ ID
NO:116), GHPO 895 (SEQ ID NO:118), GHPO 1447 (SEQ ID NO:120),
GHPO 28 (SEQ ID NO:122), GHPO 86 (SEQ ID NO:124), GHPO 155 (SEQ
ID NO:126), GHPO 157 (SEQ ID NO:128), GHPO 237 (SEQ ID NO:130),
GHPO 290 (SEQ ID NO:132), GHPO 293 (SEQ ID NO:134), GHPO 335
(SEQ ID NO:136), GHPO 374 (SEQ ID NO:138), GHPO 442 (SEQ ID
NO:140), GHPO 480 (SEQ ID NO:142), GHPO 523 (SEQ ID NO:144), GHPO
610 (SEQ ID NO:146), GHPO 675 (SEQ ID NO:148), GHPO 690 (SEQ ID
NO:150), GHPO 829 (SEQ ID NO:152), GHPO 850 (SEQ ID NO:154), GHPO
876 (SEQ ID NO:156), GHPO 984 (SEQ ID NO:158), GHPO 989 (SEQ ID
NO:160), GHPO 1111 (SEQ ID NO:162), GHPO 1145 (SEQ ID NO:164),
GHPO 1256 (SEQ ID NO:166), GHPO 1264 (SEQ ID NO:168), GHPO 1316
(SEQ ID NO:170), GHPO 1368 (SEQ ID NO:172), GHPO 1442 (SEQ ID
NO:174), GHPO 1506 (SEQ ID NO:176), GHPO 1543 (SEQ ID NO:178),
4. The isolated polynucleotide of claim 1, which is a DNA molecule
that can be amplified by polymerase chain reaction from a Helicobacter 
genome. 

5. The isolated DNA molecule of claim 4, which can be amplified by
the polymerase chain reaction from a Helicobacter pylori genome.

6. The isolated polynucleotide of claim 1, which is a DNA molecule 
that encodes the mature form or a derivative of a polypeptide encoded by the 
DNA molecule of claim 4. 

10 

7. The isolated polynucleotide of claim 1, which is a DNA molecule 
that encodes the mature form or a derivative of a polypeptide encoded by the 
DNA molecule of claim 5 . 



8. A compound, in a substantially purified form, that is the mature form
or a derivative of a polypeptide comprising an amino acid sequence that is
homologous to a Helicobacter polypeptide selected from the group consisting
of GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID
NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID NO:10), GHPO 541
(SEQ ID NO:12), GHPO 607 (SEQ ID NO:14), GHPO 635 (SEQ ID NO:16),
GHPO 701 (SEQ ID NO:18), GHPO 712 (SEQ ID NO:20), GHPO 761 (SEQ
ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO 1034 (SEQ ID NO:26),
GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID NO:30), GHPO 1255
(SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO 1389 (SEQ ID
NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID NO:40), GHPO
314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID
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GHPO 1575 (SEQ ID NO: 1288), GHPO 1582 (SEQ ID NO: 1290), GHPO 
1595 (SEQ ID NO:1292), GHPO 1597 (SEQ ID NO:1294), GHPO 1599 (SEQ
ID NO:1296), GHPO 1601 (SEQ ID NO:1298), GHPO 1609 (SEQ ID
NO:1300), GHPO 1613 (SEQ ID NO:1302), GHPO 1614 (SEQ ID NO:1304),
GHPO 1626 (SEQ ID NO:1306), GHPO 1628 (SEQ ID NO:1308), GHPO
1639 (SEQ ID NO:1310), GHPO 1640 (SEQ ID NO:1312), GHPO 1641 (SEQ
ID NO:1314), GHPO 1646 (SEQ ID NO:1316), GHPO 1662 (SEQ ID
NO:1318), GHPO 1667 (SEQ ID NO:1320), GHPO 1668 (SEQ ID NO:1322),
GHPO 1670 (SEQ ID NO:1324), GHPO 1671 (SEQ ID NO:1326), GHPO
1672 (SEQ ID NO:1328), GHPO 1678 (SEQ ID NO:1330), GHPO 1684 (SEQ
ID NO:1332), GHPO 1695 (SEQ ID NO:1334), GHPO 1697 (SEQ ID
NO:1336), GHPO 1701 (SEQ ID NO:1338), GHPO 1719 (SEQ ID NO:1340),
GHPO 1723 (SEQ ID NO:1342), GHPO 1732 (SEQ ID NO:1344), GHPO
1739 (SEQ ID NO:1346), GHPO 1741 (SEQ ID NO:1348), GHPO 1747 (SEQ
ID NO:1350), GHPO 1749 (SEQ ID NO:1352), GHPO 1750 (SEQ ID
NO:1354), GHPO 1751 (SEQ ID NO:1356), GHPO 1755 (SEQ ID NO:1358),
GHPO 1771 (SEQ ID NO:1360), GHPO 1786 (SEQ ID NO:1362), and GHPO
1789 (SEQ ID NO:1364); or

(ii) a derivative of said Helicobacter polypeptide. 

2. The isolated polynucleotide of claim 1, which encodes a mature form 
of said Helicobacter polypeptide. 

3. The isolated polynucleotide of claim 1 or 2, wherein the 
polynucleotide is a DNA molecule. 
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1210 (SEQ ID NO: 11 66), GHPO 1211 (SEQ ID NO: 1 1 68), GHPO 1216 (SEQ 
ID NO:1170), GHPO 1218 (SEQ ID NO:1172), GHPO 1220 (SEQ ID
NO:1174), GHPO 1223 (SEQ ID NO:1176), GHPO 1226 (SEQ ID NO:1178),
GHPO 1240 (SEQ ID NO:1180), GHPO 1246 (SEQ ID NO:1182), GHPO
1251 (SEQ ID NO:1184), GHPO 1252 (SEQ ID NO:1186), GHPO 1261 (SEQ
ID NO:1188), GHPO 1265 (SEQ ID NO:1190), GHPO 1267 (SEQ ID
NO:1192), GHPO 1278 (SEQ ID NO:1194), GHPO 1282 (SEQ ID NO:1196),
GHPO 1283 (SEQ ID NO:1198), GHPO 1287 (SEQ ID NO:1200), GHPO
1292 (SEQ ID NO:1202), GHPO 1293 (SEQ ID NO:1204), GHPO 1302 (SEQ
ID NO:1206), GHPO 1309 (SEQ ID NO:1208), GHPO 1317 (SEQ ID
NO:1210), GHPO 1318 (SEQ ID NO:1212), GHPO 1321 (SEQ ID NO:1214),
GHPO 1325 (SEQ ID NO:1216), GHPO 1341 (SEQ ID NO:1218), GHPO
1351 (SEQ ID NO:1220), GHPO 1354 (SEQ ID NO:1222), GHPO 1363 (SEQ
ID NO:1224), GHPO 1371 (SEQ ID NO:1226), GHPO 1381 (SEQ ID
NO:1228), GHPO 1401 (SEQ ID NO:1230), GHPO 1402 (SEQ ID NO:1232),
GHPO 1403 (SEQ ID NO:1234), GHPO 1408 (SEQ ID NO:1236), GHPO
1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO:1240), GHPO 1428 (SEQ
ID NO:1242), GHPO 1437 (SEQ ID NO:1244), GHPO 1439 (SEQ ID
NO:1246), GHPO 1460 (SEQ ID NO:1248), GHPO 1463 (SEQ ID NO:1250),
GHPO 1472 (SEQ ID NO:1252), GHPO 1474 (SEQ ID NO:1254), GHPO
1484 (SEQ ID NO:1256), GHPO 1489 (SEQ ID NO:1258), GHPO 1494 (SEQ
ID NO:1260), GHPO 1495 (SEQ ID NO:1262), GHPO 1498 (SEQ ID
NO:1264), GHPO 1499 (SEQ ID NO:1266), GHPO 1500 (SEQ ID NO:1268),
GHPO 1503 (SEQ ID NO:1270), GHPO 1504 (SEQ ID NO:1272), GHPO
1510 (SEQ ID NO:1274), GHPO 1518 (SEQ ID NO:1276), GHPO 1533 (SEQ
ID NO:1278), GHPO 1541 (SEQ ID NO:1280), GHPO 1544 (SEQ ID
NO:1282), GHPO 1548 (SEQ ID NO:1284), GHPO 1565 (SEQ ID NO:1286),
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NO:1042), GHPO 819 (SEQ ID NO:1044), GHPO 841 (SEQ ID NO: 1046), 
GHPO 843 (SEQ ID NO:1048), GHPO 846 (SEQ ID NO:1050), GHPO 875
(SEQ ID NO:1052), GHPO 892 (SEQ ID NO:1054), GHPO 902 (SEQ ID
NO:1056), GHPO 904 (SEQ ID NO:1058), GHPO 906 (SEQ ID NO:1060),
GHPO 908 (SEQ ID NO:1062), GHPO 921 (SEQ ID NO:1064), GHPO 923
(SEQ ID NO:1066), GHPO 926 (SEQ ID NO:1068), GHPO 933 (SEQ ID
NO:1070), GHPO 939 (SEQ ID NO:1072), GHPO 940 (SEQ ID NO:1074),
GHPO 943 (SEQ ID NO:1076), GHPO 951 (SEQ ID NO:1078), GHPO 961
(SEQ ID NO:1080), GHPO 965 (SEQ ID NO:1082), GHPO 990 (SEQ ID
NO:1084), GHPO 991 (SEQ ID NO:1086), GHPO 998 (SEQ ID NO:1088),
GHPO 1001 (SEQ ID NO:1090), GHPO 1005 (SEQ ID NO:1092), GHPO
1033 (SEQ ID NO:1094), GHPO 1039 (SEQ ID NO:1096), GHPO 1041 (SEQ
ID NO:1098), GHPO 1043 (SEQ ID NO:1100), GHPO 1044 (SEQ ID
NO:1102), GHPO 1051 (SEQ ID NO:1104), GHPO 1058 (SEQ ID NO:1106),
GHPO 1060 (SEQ ID NO:1108), GHPO 1075 (SEQ ID NO:1110), GHPO
1077 (SEQ ID NO:1112), GHPO 1082 (SEQ ID NO:1114), GHPO 1083 (SEQ
ID NO:1116), GHPO 1086 (SEQ ID NO:1118), GHPO 1087 (SEQ ID
NO:1120), GHPO 1090 (SEQ ID NO:1122), GHPO 1097 (SEQ ID NO:1124),
GHPO 1098 (SEQ ID NO:1126), GHPO 1103 (SEQ ID NO:1128), GHPO
1113 (SEQ ID NO:1130), GHPO 1116 (SEQ ID NO:1132), GHPO 1123 (SEQ
ID NO:1134), GHPO 1125 (SEQ ID NO:1136), GHPO 1129 (SEQ ID
NO:1138), GHPO 1130 (SEQ ID NO:1140), GHPO 1134 (SEQ ID NO:1142),
GHPO 1161 (SEQ ID NO:1144), GHPO 1166 (SEQ ID NO:1146), GHPO
1170 (SEQ ID NO:1148), GHPO 1175 (SEQ ID NO:1150), GHPO 1181 (SEQ
ID NO:1152), GHPO 1186 (SEQ ID NO:1154), GHPO 1188 (SEQ ID
NO:1156), GHPO 1191 (SEQ ID NO:1158), GHPO 1193 (SEQ ID NO:1160),
GHPO 1196 (SEQ ID NO:1162), GHPO 1204 (SEQ ID NO:1164), GHPO
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NO:910), GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO 
464 (SEQ ID NO:916), GHPO 467 (SEQ ID NO:918), GHPO 468 (SEQ ID
NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO
487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID
NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO
504 (SEQ ID NO:936), GHPO 512 (SEQ ID NO:938), GHPO 517 (SEQ ID
NO:940), GHPO 520 (SEQ ID NO:942), GHPO 528 (SEQ ID NO:944), GHPO
530 (SEQ ID NO:946), GHPO 532 (SEQ ID NO:948), GHPO 548 (SEQ ID
NO:950), GHPO 561 (SEQ ID NO:952), GHPO 564 (SEQ ID NO:954), GHPO
572 (SEQ ID NO:956), GHPO 573 (SEQ ID NO:958), GHPO 574 (SEQ ID
NO:960), GHPO 577 (SEQ ID NO:962), GHPO 579 (SEQ ID NO:964), GHPO
583 (SEQ ID NO:966), GHPO 588 (SEQ ID NO:968), GHPO 593 (SEQ ID
NO:970), GHPO 597 (SEQ ID NO:972), GHPO 598 (SEQ ID NO:974), GHPO
604 (SEQ ID NO:976), GHPO 606 (SEQ ID NO:978), GHPO 611 (SEQ ID
NO:980), GHPO 612 (SEQ ID NO:982), GHPO 615 (SEQ ID NO:984), GHPO
632 (SEQ ID NO:986), GHPO 633 (SEQ ID NO:988), GHPO 637 (SEQ ID
NO:990), GHPO 651 (SEQ ID NO:992), GHPO 663 (SEQ ID NO:994), GHPO
686 (SEQ ID NO:996), GHPO 693 (SEQ ID NO:998), GHPO 698 (SEQ ID
NO:1000), GHPO 703 (SEQ ID NO:1002), GHPO 704 (SEQ ID NO:1004),
GHPO 705 (SEQ ID NO:1006), GHPO 707 (SEQ ID NO:1008), GHPO 721
(SEQ ID NO:1010), GHPO 727 (SEQ ID NO:1012), GHPO 728 (SEQ ID
NO:1014), GHPO 733 (SEQ ID NO:1016), GHPO 758 (SEQ ID NO:1018),
GHPO 763 (SEQ ID NO:1020), GHPO 771 (SEQ ID NO:1022), GHPO 774
(SEQ ID NO:1024), GHPO 776 (SEQ ID NO:1026), GHPO 783 (SEQ ID
NO:1028), GHPO 800 (SEQ ID NO:1030), GHPO 806 (SEQ ID NO:1032),
GHPO 807 (SEQ ID NO:1034), GHPO 808 (SEQ ID NO:1036), GHPO 809
(SEQ ID NO:1038), GHPO 811 (SEQ ID NO:1040), GHPO 815 (SEQ ID
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87 (SEQ ID NO:776), GHPO 91 (SEQ ID NO:778), GHPO 92 (SEQ ID 
NO:780), GHPO 96 (SEQ ID NO:782), GHPO 97 (SEQ ID NO:784), GHPO
111 (SEQ ID NO:786), GHPO 115 (SEQ ID NO:788), GHPO 117 (SEQ ID
NO:790), GHPO 123 (SEQ ID NO:792), GHPO 124 (SEQ ID NO:794), GHPO
126 (SEQ ID NO:796), GHPO 127 (SEQ ID NO:798), GHPO 128 (SEQ ID
NO:800), GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO
140 (SEQ ID NO:806), GHPO 141 (SEQ ID NO:808), GHPO 145 (SEQ ID
NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO
181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID
NO:820), GHPO 192 (SEQ ID NO:822), GHPO 202 (SEQ ID NO:824), GHPO
204 (SEQ ID NO:826), GHPO 205 (SEQ ID NO:828), GHPO 212 (SEQ ID
NO:830), GHPO 218 (SEQ ID NO:832), GHPO 226 (SEQ ID NO:834), GHPO
231 (SEQ ID NO:836), GHPO 236 (SEQ ID NO:838), GHPO 239 (SEQ ID
NO:840), GHPO 245 (SEQ ID NO:842), GHPO 246 (SEQ ID NO:844), GHPO
248 (SEQ ID NO:846), GHPO 253 (SEQ ID NO:848), GHPO 265 (SEQ ID
NO:850), GHPO 266 (SEQ ID NO:852), GHPO 271 (SEQ ID NO:854), GHPO
272 (SEQ ID NO:856), GHPO 286 (SEQ ID NO:858), GHPO 291 (SEQ ID
NO:860), GHPO 292 (SEQ ID NO:862), GHPO 297 (SEQ ID NO:864), GHPO
304 (SEQ ID NO:866), GHPO 307 (SEQ ID NO:868), GHPO 324 (SEQ ID
NO:870), GHPO 326 (SEQ ID NO:872), GHPO 331 (SEQ ID NO:874), GHPO
343 (SEQ ID NO:876), GHPO 345 (SEQ ID NO:878), GHPO 346 (SEQ ID
NO:880), GHPO 352 (SEQ ID NO:882), GHPO 355 (SEQ ID NO:884), GHPO
363 (SEQ ID NO:886), GHPO 369 (SEQ ID NO:888), GHPO 376 (SEQ ID
NO:890), GHPO 378 (SEQ ID NO:892), GHPO 388 (SEQ ID NO:894), GHPO
396 (SEQ ID NO:896), GHPO 403 (SEQ ID NO:898), GHPO 410 (SEQ ID
NO:900), GHPO 415 (SEQ ID NO:902), GHPO 421 (SEQ ID NO:904), GHPO
439 (SEQ ID NO:906), GHPO 441 (SEQ ID NO:908), GHPO 443 (SEQ ID
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NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650), 
GHPO 1394 (SEQ ID NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415
(SEQ ID NO:656), GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID
NO:660), GHPO 1444 (SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664),
GHPO 1465 (SEQ ID NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479
(SEQ ID NO:670), GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID
NO:674), GHPO 1496 (SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678),
GHPO 1536 (SEQ ID NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540
(SEQ ID NO:684), GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID
NO:688), GHPO 1560 (SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692),
GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604
(SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID
NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706),
GHPO 1654 (SEQ ID NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673
(SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID
NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720),
GHPO 1738 (SEQ ID NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746
(SEQ ID NO:726), GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID
NO:730), GHPO 1795 (SEQ ID NO:732), GHPO 1796 (SEQ ID NO:734),
GHPO 7 (SEQ ID NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID
NO:740), GHPO 10 (SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO
25 (SEQ ID NO:746), GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID
NO:750), GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO
49 (SEQ ID NO:756), GHPO 51 (SEQ ID NO:758), GHPO 54 (SEQ ID
NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO
68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID
NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO
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NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO 
718 (SEQ ID NO:522), GHPO 726 (SEQ ID NO:524), GHPO 734 (SEQ ID
NO:526), GHPO 740 (SEQ ID NO:528), GHPO 770 (SEQ ID NO:530), GHPO
782 (SEQ ID NO:532), GHPO 786 (SEQ ID NO:534), GHPO 792 (SEQ ID
NO:536), GHPO 797 (SEQ ID NO:538), GHPO 816 (SEQ ID NO:540), GHPO
828 (SEQ ID NO:542), GHPO 839 (SEQ ID NO:544), GHPO 840 (SEQ ID
NO:546), GHPO 842 (SEQ ID NO:548), GHPO 885 (SEQ ID NO:550), GHPO
889 (SEQ ID NO:552), GHPO 903 (SEQ ID NO:554), GHPO 912 (SEQ ID
NO:556), GHPO 946 (SEQ ID NO:558), GHPO 958 (SEQ ID NO:560), GHPO
968 (SEQ ID NO:562), GHPO 987 (SEQ ID NO:564), GHPO 992 (SEQ ID
NO:566), GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO
1002 (SEQ ID NO:572), GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID
NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580),
GHPO 1059 (SEQ ID NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072
(SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID
NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594),
GHPO 1115 (SEQ ID NO:596), GHPO 1159 (SEQ ID NO:598), GHPO 1177
(SEQ ID NO:600), GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID
NO:604), GHPO 1195 (SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608),
GHPO 1225 (SEQ ID NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229
(SEQ ID NO:614), GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID
NO:618), GHPO 1242 (SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622),
GHPO 1270 (SEQ ID NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298
(SEQ ID NO:628), GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID
NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636),
GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332
(SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID
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57 (SEQ ID NO:382), GHPO 64 (SEQ ID NO:384), GHPO 79 (SEQ ID 
NO:386), GHPO 84 (SEQ ID NO:388), GHPO 86 (SEQ ID NO:390), GHPO
99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID
NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO
138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID
NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO
189 (SEQ ID NO:412), GHPO 229 (SEQ ID NO:414), GHPO 243 (SEQ ID
NO:416), GHPO 244 (SEQ ID NO:418), GHPO 251 (SEQ ID NO:420), GHPO
267 (SEQ ID NO:422), GHPO 269 (SEQ ID NO:424), GHPO 279 (SEQ ID
NO:426), GHPO 284 (SEQ ID NO:428), GHPO 296 (SEQ ID NO:430), GHPO
300 (SEQ ID NO:432), GHPO 305 (SEQ ID NO:434), GHPO 319 (SEQ ID
NO:436), GHPO 330 (SEQ ID NO:438), GHPO 340 (SEQ ID NO:440), GHPO
342 (SEQ ID NO:442), GHPO 344 (SEQ ID NO:444), GHPO 358 (SEQ ID
NO:446), GHPO 373 (SEQ ID NO:448), GHPO 382 (SEQ ID NO:450), GHPO
384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID
NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO
446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID
NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO
456 (SEQ ID NO:472), GHPO 461 (SEQ ID NO:474), GHPO 476 (SEQ ID
NO:476), GHPO 478 (SEQ ID NO:478), GHPO 491 (SEQ ID NO:480), GHPO
511 (SEQ ID NO:482), GHPO 519 (SEQ ID NO:484), GHPO 526 (SEQ ID
NO:486), GHPO 534 (SEQ ID NO:488), GHPO 536 (SEQ ID NO:490), GHPO
542 (SEQ ID NO:492), GHPO 544 (SEQ ID NO:494), GHPO 576 (SEQ ID
NO:496), GHPO 578 (SEQ ID NO:498), GHPO 580 (SEQ ID NO:500), GHPO
585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID
NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO
654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID
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GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 
(SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID
NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264),
GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18
(SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID
NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO
325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID
NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO
515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID
NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO
646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID
NO:304), GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO
798 (SEQ ID NO:310), GHPO 1102 (SEQ ID NO:312), GHPO 1114 (SEQ ID
NO:314), GHPO 1152 (SEQ ID NO:316), GHPO 1272 (SEQ ID NO:318),
GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424
(SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID
NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332),
GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708
(SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID
NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346),
GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310
(SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID
NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO
282 (SEQ ID NO:362), GHPO 1089 (SEQ ID NO:364), GHPO 1141 (SEQ ID
NO:366), GHPO 1280 (SEQ ID NO:368), GHPO 1608 (SEQ ID NO:370),
GHPO 15 (SEQ ID NO:372), GHPO 16 (SEQ ID NO:374), GHPO 36 (SEQ ID
NO:376), GHPO 38 (SEQ ID NO:378), GHPO 52 (SEQ ID NO:380), GHPO
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NO:120), GHPO 28 (SEQ ID NO: 122), GHPO 86 (SEQ ID NO:124), GHPO 
155 (SEQ ID NO:126), GHPO 157 (SEQ ID NO:128), GHPO 237 (SEQ ID
NO:130), GHPO 290 (SEQ ID NO:132), GHPO 293 (SEQ ID NO:134), GHPO
335 (SEQ ID NO:136), GHPO 374 (SEQ ID NO:138), GHPO 442 (SEQ ID
NO:140), GHPO 480 (SEQ ID NO:142), GHPO 523 (SEQ ID NO:144), GHPO
610 (SEQ ID NO:146), GHPO 675 (SEQ ID NO:148), GHPO 690 (SEQ ID
NO:150), GHPO 829 (SEQ ID NO:152), GHPO 850 (SEQ ID NO:154), GHPO
876 (SEQ ID NO:156), GHPO 984 (SEQ ID NO:158), GHPO 989 (SEQ ID
NO:160), GHPO 1111 (SEQ ID NO:162), GHPO 1145 (SEQ ID NO:164),
GHPO 1256 (SEQ ID NO:166), GHPO 1264 (SEQ ID NO:168), GHPO 1316
(SEQ ID NO:170), GHPO 1368 (SEQ ID NO:172), GHPO 1442 (SEQ ID
NO:174), GHPO 1506 (SEQ ID NO:176), GHPO 1543 (SEQ ID NO:178),
GHPO 1574 (SEQ ID NO:180), GHPO 1627 (SEQ ID NO:182), GHPO 1657
(SEQ ID NO:184), GHPO 1664 (SEQ ID NO:186), GHPO 1694 (SEQ ID
NO:188), GHPO 1704 (SEQ ID NO:190), GHPO 1763 (SEQ ID NO:192),
GHPO 616 (SEQ ID NO:194), GHPO 76 (SEQ ID NO:196), GHPO 109 (SEQ
ID NO:198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202),
GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445
(SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID
NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO
836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID
NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO
894 (SEQ ID NO:228), GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID
NO:232), GHPO 1024 (SEQ ID NO:234), GHPO 1084 (SEQ ID NO:236),
GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346
(SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID
NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250),
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1 . An isolated polynucleotide that encodes: 

(i) a polypeptide comprising an amino acid sequence that is homologous 
to the amino acid sequence of a Helicobacter polypeptide selected from the 
group consisting of GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), 
GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID
NO:10), GHPO 541 (SEQ ID NO:12), GHPO 607 (SEQ ID NO:14), GHPO
635 (SEQ ID NO:16), GHPO 701 (SEQ ID NO:18), GHPO 712 (SEQ ID
NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO
1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID
NO:30), GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO
1389 (SEQ ID NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID
NO:40), GHPO 314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO
603 (SEQ ID NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID
NO:50), GHPO 1099 (SEQ ID NO:52), GHPO 1151 (SEQ ID NO:54), GHPO
1275 (SEQ ID NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 1578 (SEQ ID
NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 200
(SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70),
GHPO 695 (SEQ ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ
ID NO:76), GHPO 709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO
762 (SEQ ID NO:82), GHPO 827 (SEQ ID NO:84), GHPO 852 (SEQ ID
NO:86), GHPO 1013 (SEQ ID NO:88), GHPO 1020 (SEQ ID NO:90), GHPO
1031 (SEQ ID NO:92), GHPO 1052 (SEQ ID NO:94), GHPO 1127 (SEQ ID
NO:96), GHPO 1149 (SEQ ID NO:98), GHPO 1176 (SEQ ID NO:100), GHPO
1250 (SEQ ID NO:102), GHPO 1312 (SEQ ID NO:104), GHPO 1358 (SEQ ID
NO:106), GHPO 1490 (SEQ ID NO:108), GHPO 1559 (SEQ ID NO:110),
GHPO 1651 (SEQ ID NO:112), GHPO 1726 (SEQ ID NO:114), GHPO 1780
(SEQ ID NO:116), GHPO 895 (SEQ ID NO:118), GHPO 1447 (SEQ ID
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Other embodiments are within the following claims. 



What is claimed is: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 75: 
CGCGGATCCG AGGAAATAGC ATGTTAATAA CC 32

(2) INFORMATION FOR SEQ ID NO: 1376:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1376:
CCGCTCGAGT CACTGCTTGC ATGACTTATT CCA 33
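
Each oligonucleotide entry in this listing states a LENGTH and repeats that count at the end of the sequence line, so the two values can be cross-checked mechanically. The following is a minimal sketch of such a check, not part of the original listing; the function names `parse_sequence_line` and `length_matches` are hypothetical, and the parser assumes that every purely numeric token on the line belongs to the trailing residue count (which lets it tolerate a count split by extraction, such as "3 3").

```python
import re


def parse_sequence_line(line):
    """Split one listing line into (sequence, trailing_count).

    Assumption: tokens made only of A/C/G/T are sequence blocks, and all
    numeric tokens together form the trailing residue count.
    """
    tokens = line.split()
    bases = [t for t in tokens if re.fullmatch(r"[ACGTacgt]+", t)]
    digits = [t for t in tokens if t.isdigit()]
    sequence = "".join(bases).upper()
    count = int("".join(digits)) if digits else None
    return sequence, count


def length_matches(line, declared_length):
    """True if the sequence length agrees with the declared LENGTH field
    and with the trailing count, when one is present."""
    sequence, count = parse_sequence_line(line)
    return len(sequence) == declared_length and count in (None, declared_length)


if __name__ == "__main__":
    # SEQ ID NO: 1376 as printed above, declared LENGTH: 33 base pairs.
    print(length_matches("CCGCTCGAGT CACTGCTTGC ATGACTTATT CCA 33", 33))
```

A mismatch would point to a transcription error in either the sequence or the LENGTH field rather than say which of the two is wrong.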
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GCGGATCCTT TTCTTCAATG TTTG 24 
(2) INFORMATION FOR SEQ ID NO: 1372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1372:
CCGCTCGAGT CAAAGTTTTA AACAAATTC 29



(2) INFORMATION FOR SEQ ID NO: 1373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1373:
CCGAATTCGG TTATAAAGCC CCT 23

(2) INFORMATION FOR SEQ ID NO: 1374:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1374:
CCGCTCGAGT TAAGGCTGAT TTAA 24
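
The short oligonucleotides in this part of the listing begin with hexamers that coincide with well-known restriction enzyme recognition sequences (GGATCC for BamHI, CTCGAG for XhoI, GAATTC for EcoRI). The listing itself does not say that these sites were designed in for cloning, so that reading is an assumption here. The sketch below only locates the hexamers; the `SITES` table and the `find_sites` helper are hypothetical names introduced for illustration.

```python
# Recognition sequences of three common restriction enzymes.
# Assumption: treating these hexamers as intended cloning sites is an
# interpretation; the listing does not state it.
SITES = {
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "EcoRI": "GAATTC",
}


def find_sites(primer):
    """Return {enzyme: 0-based offset} for each site found in the primer.

    Whitespace between the 10-base blocks of the listing is ignored.
    """
    seq = "".join(primer.split()).upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


if __name__ == "__main__":
    print(find_sites("CCGAATTCGG TTATAAAGCC CCT"))   # SEQ ID NO: 1373
    print(find_sites("CCGCTCGAGT TAAGGCTGAT TTAA"))  # SEQ ID NO: 1374
```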

(2) INFORMATION FOR SEQ ID NO: 1375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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CGCGGATCCG AAGACATGTG CAACCGATG 



29 



(2) INFORMATION FOR SEQ ID NO: 1368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1368:
CCGCTCGAGC TAAAAGTTTT GCAAAATCAC 30

(2) INFORMATION FOR SEQ ID NO: 1369:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1369:
CGCGGATCCG ATTTTACTTG AAAAATTTAA AC 32

(2) INFORMATION FOR SEQ ID NO: 1370:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1370:
CCGCTCGAGT TAGAAAGTGT AGTTCAAATA C 31

(2) INFORMATION FOR SEQ ID NO: 1371:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1371:
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(xi) SEQUENCE 

DESCRIPTION: SEQ ID NO: 1364:

Met Leu Gly Ser Lys Thr Tyr Ser Val Leu Arg Ser Tyr Glu Lys Thr
1 5 10 15

Phe Ser Pro Glu Glu Leu Cys Ile Leu Met Gly Lys Thr Tyr Glu Tyr
20 25 30

Pro Ile Met Leu Lys Glu Leu Leu Met Leu Leu Ala Asn Ala Arg Gly
35 40 45

Leu Leu Glu Ala Leu Lys Val Ile Phe Asn Met Leu Gly Leu Ser Lys
50 55 60

Leu Lys Asp Lys Ser Pro Phe Ser Leu Arg Val Leu Ser Ser Phe Lys
65 70 75 80

Glu Ser Lys Arg Pro Ile Thr
85



(2) INFORMATION FOR SEQ ID NO: 1365:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1365:
CTGAATTCGA ATGAAAAGAA TTTTAGTCTC T 31

(2) INFORMATION FOR SEQ ID NO: 1366:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1366:
CCGCTCGAGT TAAAACTCAT AATTCAAAT 29

(2) INFORMATION FOR SEQ ID NO: 1367:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1367: 
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(2) INFORMATION FOR SEQ ID NO: 13 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 30...290
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1363: 

AACCATGCCT TTTCATCGTC TCTATCAAA ATG TTA GGG TCT AAA ACA TAT TCC 53
Met Leu Gly Ser Lys Thr Tyr Ser
1 5

GTT TTA AGA TCG TAT GAA AAA ACA TTC TCG CCT GAA GAG CTT TGC ATT 101
Val Leu Arg Ser Tyr Glu Lys Thr Phe Ser Pro Glu Glu Leu Cys Ile
10 15 20

TTA ATG GGC AAA ACA TAC GAA TAC CCC ATC ATG CTT AAA GAA TTA TTG 149
Leu Met Gly Lys Thr Tyr Glu Tyr Pro Ile Met Leu Lys Glu Leu Leu
25 30 35 40

ATG CTT TTG GCA AAC GCT AGG GGA TTG CTT GAA GCC TTG AAA GTG ATT 197
Met Leu Leu Ala Asn Ala Arg Gly Leu Leu Glu Ala Leu Lys Val Ile
45 50 55

TTC AAC ATG CTT GGC TTG TCA AAA TTA AAA GAC AAA AGC CCG TTT TCT 245
Phe Asn Met Leu Gly Leu Ser Lys Leu Lys Asp Lys Ser Pro Phe Ser
60 65 70

TTG AGA GTG TTG AGC AGT TTC AAG GAA TCC AAA CGC CCC ATT ACA TAGAA 295
Leu Arg Val Leu Ser Ser Phe Lys Glu Ser Lys Arg Pro Ile Thr
75 80 85

AGCCTTACGA TTTTTAAACA AACGCTCTAA AAAAAGCTTG TTCGTATGA 344

(2) INFORMATION FOR SEQ ID NO: 1364:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
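
The coding-sequence LOCATION given for SEQ ID NO: 1363 (positions 30 to 290) and the 87 amino acid LENGTH given for SEQ ID NO: 1364 can be checked against each other, because an inclusive nucleotide range must contain a whole number of codons. The sketch below is an illustration only; `cds_length` and `codon_count` are hypothetical helper names, and the parser assumes the "start...end" notation used in this listing, with optional stray spaces.

```python
import re


def cds_length(location):
    """Number of nucleotides in an inclusive 'start...end' location string."""
    match = re.fullmatch(r"\s*(\d+)\s*\.{2,3}\s*(\d+)\s*", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


def codon_count(location):
    """Number of codons spanned by the location; fails if not a multiple of 3."""
    length = cds_length(location)
    if length % 3 != 0:
        raise ValueError(f"{length} nt is not a whole number of codons")
    return length // 3


if __name__ == "__main__":
    # SEQ ID NO: 1363, LOCATION 30...290, encodes the 87-residue SEQ ID NO: 1364.
    print(codon_count("30...290"))
```

In these entries the stated range stops at the last sense codon, so the codon count equals the protein length directly; the stop codon lies just outside the range.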
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Glu His Glu Gly Leu Ser Lys Lys lie Leu Ala Lys Met Asp Thr lie 

190 195 200 

TTG AGC GTA AAA ATG CGA AGA GAT TTT GAT TCG CTC AAT GTG AGC GTG 676
Leu Ser Val Lys Met Arg Arg Asp Phe Asp Ser Leu Asn Val Ser Val
205 210 215

GCA GCA GGG ATC TTA ATG GAT AAA ATC AAC TAGGTGGTCA ATTGAATGGA ACA 729
Ala Ala Gly Ile Leu Met Asp Lys Ile Asn
220 225

GAATAAA 736

(2) INFORMATION FOR SEQ ID NO: 1362:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 227 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1362:



Met Gln Ala Val Ile Tyr Gly Lys Gln Val Ile Met His Leu Leu Asn
1 5 10 15

Ser His Gln Glu Lys Leu Gln Glu Ile Tyr Leu Ser Lys Glu Ile Asp
20 25 30

Lys Lys Leu Phe Phe Ala Leu Lys Lys Ala Cys Pro Asn Ile Ile Lys
35 40 45

Val Asp Asn Lys Lys Ala Gln Ser Leu Ala Lys Gly Gly Asn His Gln
50 55 60

Gly Val Leu Ala Lys Val Glu Leu Pro Leu Ala Val Ser Leu Lys Glu
65 70 75 80

Val Lys Lys Ala Gln Lys Leu Leu Val Leu Cys Gly Ile Thr Asp Val
85 90 95

Gly Asn Ile Gly Gly Ile Phe Arg Ser Ala Tyr Cys Leu Gly Met Gly
100 105 110

Gly Val Ile Leu Asp Phe Ala Lys Glu Leu Ala Tyr Glu Gly Ile Val
115 120 125

Arg Ser Ser Leu Gly Leu Met Tyr Asp Leu Pro Phe Ser Val Met Pro
130 135 140

Asn Thr Leu Asp Leu Ile Asn Glu Leu Lys Thr Ser Gly Phe Leu Cys
145 150 155 160

Leu Gly Ala Ser Met Gln Gly Ser Ser Gln Ile Glu Asn Leu Ser Leu
165 170 175

Lys Lys Cys Ala Leu Phe Leu Gly Ser Glu His Glu Gly Leu Ser Lys
180 185 190

Lys Ile Leu Ala Lys Met Asp Thr Ile Leu Ser Val Lys Met Arg Arg
195 200 205

Asp Phe Asp Ser Leu Asn Val Ser Val Ala Ala Gly Ile Leu Met Asp
210 215 220

Lys Ile Asn
225
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(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1361: 

AAAAAATCAA TAAAAGGGGT TTAGC ATG CAA GCA GTA ATT TAT GGC AAG CAA 52
Met Gln Ala Val Ile Tyr Gly Lys Gln
1 5

GTG ATT ATG CAC CTT CTA AAC TCT CAT CAA GAA AAA TTG CAA GAA ATC 100
Val Ile Met His Leu Leu Asn Ser His Gln Glu Lys Leu Gln Glu Ile
10 15 20 25

TAT CTT TCT AAA GAA ATA GAC AAG AAA CTT TTT TTC GCG CTC AAA AAA 148
Tyr Leu Ser Lys Glu Ile Asp Lys Lys Leu Phe Phe Ala Leu Lys Lys
30 35 40

GCA TGC CCT AAT ATC ATC AAA GTG GAT AAT AAA AAA GCG CAA AGC TTG 196
Ala Cys Pro Asn Ile Ile Lys Val Asp Asn Lys Lys Ala Gln Ser Leu
45 50 55

GCT AAG GGG GGG AAT CAT CAA GGG GTT TTG GCT AAG GTG GAA CTG CCC 244
Ala Lys Gly Gly Asn His Gln Gly Val Leu Ala Lys Val Glu Leu Pro
60 65 70

TTA GCG GTT TCT TTA AAA GAG GTT AAA AAA GCT CAA AAA CTT TTG GTG 292
Leu Ala Val Ser Leu Lys Glu Val Lys Lys Ala Gln Lys Leu Leu Val
75 80 85

CTT TGC GGG ATT ACG GAT GTG GGG AAT ATT GGA GGT ATT TTT AGG AGC 340
Leu Cys Gly Ile Thr Asp Val Gly Asn Ile Gly Gly Ile Phe Arg Ser
90 95 100 105

GCG TAT TGC TTA GGA ATG GGT GGC GTT ATT TTA GAT TTT GCT AAA GAA 388
Ala Tyr Cys Leu Gly Met Gly Gly Val Ile Leu Asp Phe Ala Lys Glu
110 115 120

TTG GCT TAT GAG GGG ATT GTG CGA TCC AGC TTG GGG CTT ATG TAT GAT 436
Leu Ala Tyr Glu Gly Ile Val Arg Ser Ser Leu Gly Leu Met Tyr Asp
125 130 135

TTG CCT TTT AGC GTT ATG CCT AAC ACG CTG GAT TTA ATC AAT GAA TTG 484
Leu Pro Phe Ser Val Met Pro Asn Thr Leu Asp Leu Ile Asn Glu Leu
140 145 150

AAA ACG AGC GGG TTT TTA TGT TTG GGC GCG AGC ATG CAA GGC TCT AGT 532
Lys Thr Ser Gly Phe Leu Cys Leu Gly Ala Ser Met Gln Gly Ser Ser
155 160 165

CAA ATA GAA AAT CTA TCC TTA AAA AAA TGC GCT CTT TTT TTG GGG AGC 580
Gln Ile Glu Asn Leu Ser Leu Lys Lys Cys Ala Leu Phe Leu Gly Ser
170 175 180 185

GAG CAT GAG GGG TTG TCT AAA AAA ATC CTT GCT AAA ATG GAT ACT ATA 628
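
The paired nucleotide and amino acid lines above can be reproduced by translating each codon with the standard genetic code, whose codon assignments are the ones normally used for bacterial genes. The sketch below is a minimal illustration under that assumption and is not taken from the listing; `CODON_TABLE` and `translate` are hypothetical names, and the output uses one-letter amino acid codes with `*` for a stop codon.

```python
from itertools import product

# Standard genetic code, built in TCAG order for each codon position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon into one-letter amino acids.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First nine codons of the coding sequence of SEQ ID NO: 1361
    # (Met Gln Ala Val Ile Tyr Gly Lys Gln in the listing).
    print(translate("ATG CAA GCA GTA ATT TAT GGC AAG CAA"))
```

Running the same function over a whole coding region gives a quick way to spot codons that disagree with the printed translation.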
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1360:

Met Lys Gln Ser Leu Arg Glu Gln Lys Leu Leu Lys Ile Leu Glu Asn
1 5 10 15

Asp Val Leu Thr Ile Leu Asp Ser Phe Ser Asn Tyr Leu Phe Glu Leu
20 25 30

Arg Glu Glu Leu Asp Phe Ile Glu Glu Glu Met Glu Gly Glu Ile Thr
35 40 45

Glu Gln Asn Leu Thr Ala Leu Tyr Asp Phe Ser Asn Phe Leu Glu Asp
50 55 60

His Val Asn Val Phe Tyr Glu Asn Val Leu Asn Ile Asp Asp Val Lys
65 70 75 80

Thr Glu His Leu Tyr Ser Gly Leu Ile Asp Ser Leu Asn Ala Asn Leu
85 90 95

His Phe Val Lys Ser Phe Leu Ser Asn Gln Asp Leu Asp Phe Arg Phe
100 105 110

Phe Lys Glu Ile Asn Asp Gly Gln Asp Pro Gln Lys Thr Leu Ser Arg
115 120 125

Leu Ile Pro Leu Gln Ser Gly Lys Asn Asp Ala Ser Ser Phe Lys Ala
130 135 140

Asn Asn Ser Phe Val Ser Leu Val Tyr Val Tyr Val Tyr Phe Met Leu
145 150 155 160

Glu Thr Ile Met Gln Ser Tyr Arg Ile Leu Arg Leu Leu Glu Lys Pro
165 170 175

Ile Asn Asn Asn Ile Ser Glu Asp Met Gln Asn Asp Ile Glu Asn Phe
180 185 190

Phe Val Gln Ala Asn Phe Leu Glu Tyr Tyr Val Gln Asn Lys Ile Tyr
195 200 205

Pro Thr Asn His Ala Tyr Asp Phe Thr His Leu Ile Met Asp Ser Ile
210 215 220

Ile Pro Asn Trp Ile Gln Thr Asp Met Ser Val Glu Ala Lys Lys Lys
225 230 235 240

Glu Leu Phe Glu Lys Tyr Phe Gln Asn Ile Asp Glu Val Thr Asn Lys
245 250 255

Met Leu Asp Gln Lys Xaa Gln Asn Lys Ser Asn Asp
260 265

(2) INFORMATION FOR SEQ ID NO: 1361:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 736 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 26...706
AGT CTT AAC GCT AAT CTT CAC TTT GTC AAG TCA TTT CTC AGT AAT CAG 339
Ser Leu Asn Ala Asn Leu His Phe Val Lys Ser Phe Leu Ser Asn Gln
95 100 105

GAT TTA GAC TTC CGC TTT TTT AAG GAA ATA AAC GAT GGG CAA GAT CCC 387
Asp Leu Asp Phe Arg Phe Phe Lys Glu Ile Asn Asp Gly Gln Asp Pro
110 115 120

CAA AAA ACA TTA TCA AGA TTA ATT CCT CTT CAA AGT GGG AAA AAT GAT 435
Gln Lys Thr Leu Ser Arg Leu Ile Pro Leu Gln Ser Gly Lys Asn Asp
125 130 135

GCA AGC TCG TTT AAA GCC AAT AAT TCT TTT GTT TCA TTA GTT TAT GTT 483
Ala Ser Ser Phe Lys Ala Asn Asn Ser Phe Val Ser Leu Val Tyr Val
140 145 150

TAT GTT TAC TTC ATG CTA GAA ACT ATC ATG CAG TCG TAT AGG ATT CTC 531
Tyr Val Tyr Phe Met Leu Glu Thr Ile Met Gln Ser Tyr Arg Ile Leu
155 160 165 170

AGA TTG CTA GAA AAA CCT ATC AAT AAC AAC ATA AGC GAG GAC ATG CAG 579
Arg Leu Leu Glu Lys Pro Ile Asn Asn Asn Ile Ser Glu Asp Met Gln
175 180 185

AAC GAT ATA GAG AAT TTT TTT GTT CAA GCG AAT TTT TTA GAA TAC TAT 627
Asn Asp Ile Glu Asn Phe Phe Val Gln Ala Asn Phe Leu Glu Tyr Tyr
190 195 200

GTT CAG AAC AAA ATA TAC CCA ACC AAT CAT GCC TAT GAC TTC ACG CAT 675
Val Gln Asn Lys Ile Tyr Pro Thr Asn His Ala Tyr Asp Phe Thr His
205 210 215

TTG ATC ATG GAC TCC ATT ATT CCT AAT TGG ATT CAA ACT GAT ATG AGC 723
Leu Ile Met Asp Ser Ile Ile Pro Asn Trp Ile Gln Thr Asp Met Ser
220 225 230

GTT GAA GCT AAA AAG AAA GAG CTT TTT GAA AAA TAT TTT CAA AAC ATT 771
Val Glu Ala Lys Lys Lys Glu Leu Phe Glu Lys Tyr Phe Gln Asn Ile
235 240 245 250

GAT GAA GTA ACA AAC AAA ATG CTC GAT CAA AAA ANT CAA AAC AAA AGT 819
Asp Glu Val Thr Asn Lys Met Leu Asp Gln Lys Xaa Gln Asn Lys Ser
255 260 265

AAC GAT TGAGTGGCGT TAATGCGCTA GAATAGTGCT AAAAATAAGA ATAAAGGAGT CA 877
Asn Asp

(2) INFORMATION FOR SEQ ID NO: 1360:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 
Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gln Ala Ile Asn Gly Ser Met
1650 1655 1660

Gln Ser Ser Ala Gln Met Ser Asn Gln Ile Leu Gly Gln Leu Met Asn
1665 1670 1675 1680

Ile Pro Pro Ser Phe Tyr Lys Asn Glu Gly Asp Ser Ile Lys Ile Leu
1685 1690 1695

Thr Met Asp Asp Ile Asp Phe Ser Gly Val Tyr Asp Val Lys Ile Thr
1700 1705 1710

Asn Lys Ser Val Val Asp Glu Ile Ile Lys Gln Ser Thr Lys Thr Leu
1715 1720 1725

Ser Arg Glu His Glu Glu Ile Thr Thr Ser Pro Lys Gly Gly Asn
1730 1735 1740

(2) INFORMATION FOR SEQ ID NO: 1359:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 877 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22...825
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1359:

AACAAATAAA GGAGTATTAA A ATG AAA CAA AGT TTG CGC GAA CAA AAA TTA 51
Met Lys Gln Ser Leu Arg Glu Gln Lys Leu
1 5 10

TTG AAA ATT TTA GAA AAT GAT GTC TTG ACG ATT TTG GAT AGT TTT TCT 99
Leu Lys Ile Leu Glu Asn Asp Val Leu Thr Ile Leu Asp Ser Phe Ser
15 20 25

AAT TAT CTT TTT GAA CTG AGA GAA GAG TTG GAC TTC ATA GAA GAA GAA 147
Asn Tyr Leu Phe Glu Leu Arg Glu Glu Leu Asp Phe Ile Glu Glu Glu
30 35 40

ATG GAA GGT GAA ATC ACC GAA CAA AAC CTT ACC GCT CTT TAT GAT TTT 195
Met Glu Gly Glu Ile Thr Glu Gln Asn Leu Thr Ala Leu Tyr Asp Phe
45 50 55

TCT AAT TTC TTA GAA GAC CAT GTC AAT GTA TTT TAT GAG AAT GTT TTG 243
Ser Asn Phe Leu Glu Asp His Val Asn Val Phe Tyr Glu Asn Val Leu
60 65 70

AAT ATA GAT GAT GTC AAA ACA GAA CAC CTT TAT TCA GGT CTC ATA GAT 291
Asn Ile Asp Asp Val Lys Thr Glu His Leu Tyr Ser Gly Leu Ile Asp
75 80 85 90
1205 1210 1215

Asn Asp Arg Ala Ala Ile Met Lys Cys Leu Asp Gly Leu Ser Asp Glu
1220 1225 1230

Glu Lys Leu Lys Tyr Leu Gln Glu Ala Arg Glu Lys Ala Val Ala Asp
1235 1240 1245

Cys Leu Ala Met Ala Lys Thr Asp Glu Glu Lys Arg Lys Cys Gln Asn
1250 1255 1260

Leu Tyr Ser Asp Leu Ile Gln Glu Ile Gln Asn Lys Arg Thr Gln Asn
1265 1270 1275 1280

Lys Gln Asn Gln Leu Ser Lys Thr Glu Arg Leu His Gln Ala Ser Glu
1285 1290 1295

Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp Gln Glu Ala Ile Glu Gln
1300 1305 1310

Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg Ala Leu Ile Leu Gly Ile
1315 1320 1325

Lys Arg Gln Ala Asp Glu Val Asp Leu Ile Tyr Ser Asp Leu Arg Asn
1330 1335 1340

Arg Lys Thr Phe Asp Asn Met Ala Ala Lys Gly Tyr Pro Leu Leu Pro
1345 1350 1355 1360

Met Asp Phe Lys Asn Gly Gly Asp Ile Ala Thr Ile Asn Ala Thr Asn
1365 1370 1375

Val Asp Ala Asp Lys Ile Ala Ser Asp Asn Pro Ile Tyr Ala Ser Ile
1380 1385 1390

Glu Pro Asp Ile Ala Lys Gln Tyr Glu Thr Glu Lys Thr Ile Lys Asp
1395 1400 1405

Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala Leu Gly Gly Asn Lys Lys
1410 1415 1420

Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser Thr Ala Glu Ala Lys Ala
1425 1430 1435 1440

Glu Asn Asn Lys Ile Asp Lys Asp Val Ala Glu Thr Ala Lys Asn Ile
1445 1450 1455

Ser Glu Ile Ala Leu Lys Asn Lys Lys Glu Lys Ser Gly Glu Phe Val
1460 1465 1470

Asp Glu Asn Gly Asn Pro Ile Asp Asp Lys Lys Lys Ala Glu Lys Gln
1475 1480 1485

Asp Glu Thr Ser Pro Val Lys Gln Ala Phe Ile Gly Lys Ser Asp Pro
1490 1495 1500

Thr Phe Val Leu Ala Gln Tyr Thr Pro Ile Glu Ile Thr Leu Thr Ser
1505 1510 1515 1520

Lys Val Asp Ala Thr Leu Thr Gly Ile Val Ser Gly Val Val Ala Lys
1525 1530 1535

Asp Val Trp Asn Met Asn Gly Thr Met Ile Leu Leu Asp Lys Gly Thr
1540 1545 1550

Lys Val Tyr Gly Asn Tyr Gln Ser Val Lys Gly Gly Thr Pro Ile Met
1555 1560 1565

Thr Arg Leu Met Ile Val Phe Thr Lys Ala Ile Thr Pro Asp Gly Val
1570 1575 1580

Ile Ile Pro Leu Ala Asn Ala Gln Ala Ala Gly Met Leu Gly Glu Ala
1585 1590 1595 1600

Gly Val Asp Gly Tyr Val Asn Asn His Phe Met Lys Arg Ile Gly Phe
1605 1610 1615

Ala Val Ile Ala Ser Val Val Asn Ser Phe Leu Gln Thr Ala Pro Ile
1620 1625 1630

Ile Ala Leu Asp Lys Leu Ile Gly Leu Gly Lys Gly Arg Ser Glu Arg
1635 1640 1645
Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu
770 775 780

Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Leu Leu Glu Gln Gln
785 790 795 800

Ala Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Lys Glu Arg Lys Lys
805 810 815

Cys Leu Lys Asp Leu Pro Lys Asp Leu Gln Lys Lys Val Leu Ala Lys
820 825 830

Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Gln Ala Lys Thr Glu
835 840 845

Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys
850 855 860

Leu Leu Glu Glu Ala Lys Lys Ser Val Lys Ala Xaa Leu Asp Cys Val
865 870 875 880

Ser Gln Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu
885 890 895

Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys Glu Ser Val Lys
900 905 910

Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu Lys Glu Lys Lys
915 920 925

Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Leu Leu Glu Gln
930 935 940

Gln Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Glu Ala Asp Lys Lys
945 950 955 960

Arg Cys Val Lys Asp Leu Pro Lys Asp Leu Gln Lys Lys Val Leu Ala
965 970 975

Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn
980 985 990

Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys
995 1000 1005

Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys
1010 1015 1020

Leu Ser Gln Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu
1025 1030 1035 1040

Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gln Glu Val Lys Lys Ser
1045 1050 1055

Ile Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu
1060 1065 1070

Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu
1075 1080 1085

Ala Lys Gln Val Leu Asn Cys Leu Glu Lys Ala Gly Asn Glu Glu Glu
1090 1095 1100

Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys Asp Leu Gln Glu Asn Ile
1105 1110 1115 1120

Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys Leu Ser Gln Ala
1125 1130 1135

Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu Leu Thr Pro Glu
1140 1145 1150

Ala Arg Lys Leu Leu Glu Gln Glu Val Lys Lys Ser Val Lys Ala Tyr
1155 1160 1165

Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys
1170 1175 1180

Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu Ala Lys Glu Leu
1185 1190 1195 1200

Gln Gln Lys Asp Lys Ala Ile Lys Asp Cys Leu Lys Asn Ala Asp Pro
325 330 335

Lys Gly Ser Lys Lys Lys Tyr Ile Ile Gly Gly Ile Val Val Ala Val
340 345 350

Leu Ile Val Ile Ile Leu Phe Ser Arg Ser Ile Phe His Tyr Phe Met
355 360 365

Pro Leu Glu Asp Lys Ser Ser Arg Phe Ser Lys Asp Arg Asn Leu Tyr
370 375 380

Val Asn Asp Glu Ile Gln Ile Arg Gln Glu Tyr Asn Arg Leu Leu Lys
385 390 395 400

Glu Arg Asn Glu Lys Gly Asn Met Ile Asp Lys Asn Leu Phe Phe Asn
405 410 415

Asp Asp Pro Asn Arg Thr Leu Tyr Asn Tyr Leu Asn Ile Ala Glu Ile
420 425 430

Glu Asp Lys Asn Pro Leu Arg Ala Phe Tyr Glu Cys Ile Ser Asn Gly
435 440 445

Gly Asn Tyr Glu Glu Cys Leu Lys Leu Ile Lys Asp Lys Lys Leu Gln
450 455 460

Asp Gln Met Lys Lys Thr Leu Glu Ala Tyr Asn Asp Cys Ile Lys Asn
465 470 475 480

Ala Lys Thr Glu Glu Glu Arg Ile Lys Cys Leu Asp Leu Ile Lys Asp
485 490 495

Glu Asn Leu Lys Lys Ser Leu Leu Asn Gln Gln Lys Val Gln Val Ala
500 505 510

Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu Glu Arg Asn Glu Cys
515 520 525

Leu Lys Leu Ile Asn Asp Pro Glu Ile Arg Glu Lys Phe Arg Lys Glu
530 535 540

Leu Glu Leu Gln Lys Glu Leu Gln Glu Tyr Lys Asp Cys Ile Lys Asn
545 550 555 560

Ala Lys Thr Glu Ala Glu Lys Asn Lys Cys Leu Lys Gly Leu Ser Lys
565 570 575

Glu Ala Ile Glu Arg Leu Lys Gln Gln Ala Leu Asp Cys Leu Lys Asn
580 585 590

Ala Lys Thr Asp Glu Glu Arg Asn Glu Cys Leu Lys Asn Ile Pro Gln
595 600 605

Asp Leu Gln Lys Glu Leu Leu Ala Asp Met Ser Val Lys Ala Tyr Lys
610 615 620

Asp Cys Val Ser Lys Ala Arg Asn Glu Lys Glu Lys Gln Glu Cys Glu
625 630 635 640

Lys Leu Leu Thr Pro Glu Ala Arg Lys Lys Leu Glu Gln Gln Val Leu
645 650 655

Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu Glu Arg Lys Lys Cys Leu
660 665 670

Lys Asp Leu Pro Lys Asp Leu Gln Ser Asp Ile Leu Ala Lys Glu Ser
675 680 685

Leu Lys Ala Tyr Lys Asp Cys Val Ser Gln Ala Lys Thr Glu Ala Glu
690 695 700

Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Leu Leu
705 710 715 720

Glu Glu Glu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser
725 730 735

Gln Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr
740 745 750

Pro Glu Ala Lys Lys Lys Leu Glu Glu Ala Lys Lys Ser Val Lys Ala
755 760 765






WO 98/43478



PCT/US98/06371 



TCGGTTCTTT ACAAAATAAA AGACAAAACC AACAACAGGC TCT 5334

(2) INFORMATION FOR SEQ ID NO: 1358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1743 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1358:

Met Asn Glu Glu Asn Asp Lys Leu Glu Thr Ser Lys Lys Ala Gln Gln
1 5 10 15

Asp Ser Pro Gln Asp Leu Ser Asn Glu Glu Ala Thr Glu Ala Asn His
20 25 30

Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu Ser Ser Asp His His Leu
35 40 45

Asp Asn Pro Thr Glu Thr Gln Thr His Phe Asp Gly Asp Lys Ser Glu
50 55 60

Glu Thr Gln Thr Gln Met Asp Ser Glu Gly Asn Glu Thr Ser Glu Ser
65 70 75 80

Ser Asn Gly Ser Leu Ala Asp Lys Leu Phe Lys Lys Ala Arg Lys Leu
85 90 95

Val Asp Asn Lys Lys Pro Phe Thr Gln Gln Lys Asn Leu Asp Glu Glu
100 105 110

Thr Gln Glu Leu Asn Glu Glu Asp Asp Gln Glu Asn Asn Glu Tyr Gln
115 120 125

Glu Glu Thr Gln Thr Asp Leu Ile Asp Asp Glu Thr Ser Lys Lys Thr
130 135 140

Gln Gln His Ser Pro Gln Asp Leu Ser Asn Glu Glu Ala Thr Glu Ala
145 150 155 160

Asn His Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu Ser Ser Asp His
165 170 175

His Leu Asp Asn Pro Thr Glu Thr Gln Thr Asn Phe Asp Gly Asp Lys
180 185 190

Ser Glu Glu Thr Gln Thr Gln Met Asp Ser Glu Gly Asn Glu Thr Ser
195 200 205

Glu Ser Ser Asn Gly Ser Leu Ala Asp Lys Leu Phe Lys Lys Ala Arg
210 215 220

Lys Leu Val Asp Asn Lys Lys Pro Phe Thr Gln Gln Lys Asn Leu Asp
225 230 235 240

Glu Glu Thr Gln Glu Leu Asn Glu Glu Asp Asp Gln Glu Asn Asn Glu
245 250 255

Tyr Gln Glu Glu Thr Gln Thr Asp Leu Ile Asp Asp Glu Thr Ser Lys
260 265 270

Lys Thr Gln Gln His Ser Pro Gln Asp Leu Ser Asn Glu Glu Ala Thr
275 280 285

Glu Ala Asn His Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu Ser Ser
290 295 300

Asp His His Leu Asp Asn Pro Thr Glu Thr Gln Thr Asn Phe Asp Gly
305 310 315 320

Asp Lys Ser Glu Glu Ile Thr Asp Asp Ser Asn Asp Gln Glu Ile Ile
AGT GGG GTT GTA GCC AAA GAT GTA TGG AAC ATG AAC GGC ACT ATG ATC 4659
Ser Gly Val Val Ala Lys Asp Val Trp Asn Met Asn Gly Thr Met Ile
1535 1540 1545

TTA TTA GAC AAA GGC ACT AAG GTG TAT GGG AAT TAT CAA AGC GTG AAA 4707
Leu Leu Asp Lys Gly Thr Lys Val Tyr Gly Asn Tyr Gln Ser Val Lys
1550 1555 1560

GGT GGC ACA CCC ATT ATG ACA CGC TTA ATG ATA GTC TTT ACT AAA GCC 4755
Gly Gly Thr Pro Ile Met Thr Arg Leu Met Ile Val Phe Thr Lys Ala
1565 1570 1575

ATT ACG CCT GAT GGT GTG ATA ATA CCT CTA GCA AAC GCT CAA GCA GCA 4803
Ile Thr Pro Asp Gly Val Ile Ile Pro Leu Ala Asn Ala Gln Ala Ala
1580 1585 1590

GGC ATG TTG GGT GAA GCA GGG GTA GAT GGC TAT GTG AAT AAT CAC TTT 4851
Gly Met Leu Gly Glu Ala Gly Val Asp Gly Tyr Val Asn Asn His Phe
1595 1600 1605 1610

ATG AAG CGC ATA GGC TTT GCT GTG ATA GCA AGC GTG GTT AAT AGC TTC 4899
Met Lys Arg Ile Gly Phe Ala Val Ile Ala Ser Val Val Asn Ser Phe
1615 1620 1625

TTG CAA ACT GCG CCT ATC ATA GCT CTA GAT AAA CTC ATA GGC CTT GGC 4947
Leu Gln Thr Ala Pro Ile Ile Ala Leu Asp Lys Leu Ile Gly Leu Gly
1630 1635 1640

AAA GGT AGA AGT GAA AGG ACA CCT GAA TTT AAT TAC GCT TTG GGT CAA 4995
Lys Gly Arg Ser Glu Arg Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gln
1645 1650 1655

GCT ATC AAT GGT AGC ATG CAA AGT TCA GCT CAG ATG TCT AAT CAA ATT 5043
Ala Ile Asn Gly Ser Met Gln Ser Ser Ala Gln Met Ser Asn Gln Ile
1660 1665 1670

CTA GGG CAA CTG ATG AAT ATC CCC CCA AGT TTT TAC AAA AAC GAG GGC 5091
Leu Gly Gln Leu Met Asn Ile Pro Pro Ser Phe Tyr Lys Asn Glu Gly
1675 1680 1685 1690

GAT AGT ATT AAG ATT CTC ACA ATG GAC GAT ATT GAT TTT AGC GGT GTG 5139
Asp Ser Ile Lys Ile Leu Thr Met Asp Asp Ile Asp Phe Ser Gly Val
1695 1700 1705

TAT GAT GTT AAA ATT ACT AAC AAA TCT GTG GTA GAT GAA ATT ATC AAA 5187
Tyr Asp Val Lys Ile Thr Asn Lys Ser Val Val Asp Glu Ile Ile Lys
1710 1715 1720

CAA AGC ACC AAA ACT TTG TCT AGA GAA CAT GAA GAA ATC ACC ACA AGC 5235
Gln Ser Thr Lys Thr Leu Ser Arg Glu His Glu Glu Ile Thr Thr Ser
1725 1730 1735

CCC AAA GGT GGC AAT TAATTCAAGA GAAAGGATAA AATATATTCA TGTTACTAAA C 5291
Pro Lys Gly Gly Asn
1740
CAA GAG GCC ATA GAG CAA TGT TTA GAG GGC TTG AGC GAT AGT GAA AGG 3987
Gln Glu Ala Ile Glu Gln Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg
1310 1315 1320

GCG CTA ATT CTA GGA ATT AAA CGA CAA GCT GAT GAA GTG GAT CTG ATT 4035
Ala Leu Ile Leu Gly Ile Lys Arg Gln Ala Asp Glu Val Asp Leu Ile
1325 1330 1335

TAT AGC GAT CTA AGA AAC CGT AAA ACC TTT GAT AAC ATG GCG GCT AAA 4083
Tyr Ser Asp Leu Arg Asn Arg Lys Thr Phe Asp Asn Met Ala Ala Lys
1340 1345 1350

GGT TAT CCA TTG TTA CCA ATG GAT TTC AAA AAT GGC GGC GAT ATT GCC 4131
Gly Tyr Pro Leu Leu Pro Met Asp Phe Lys Asn Gly Gly Asp Ile Ala
1355 1360 1365 1370

ACT ATT AAC GCC ACT AAT GTT GAT GCG GAC AAA ATA GCT AGC GAT AAT 4179
Thr Ile Asn Ala Thr Asn Val Asp Ala Asp Lys Ile Ala Ser Asp Asn
1375 1380 1385

CCT ATT TAT GCT TCC ATA GAG CCT GAT ATT GCC AAG CAA TAC GAA ACA 4227
Pro Ile Tyr Ala Ser Ile Glu Pro Asp Ile Ala Lys Gln Tyr Glu Thr
1390 1395 1400

GAA AAA ACC ATT AAG GAT AAG AAT TTA GAA GCT AAA TTA GCT AAG GCT 4275
Glu Lys Thr Ile Lys Asp Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala
1405 1410 1415

TTA GGT GGC AAT AAA AAA GAT GAC GAT AAA GAA AAA AGT AAA AAA TCC 4323
Leu Gly Gly Asn Lys Lys Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser
1420 1425 1430

ACA GCA GAA GCT AAA GCA GAA AAC AAT AAG ATA GAC AAA GAT GTC GCA 4371
Thr Ala Glu Ala Lys Ala Glu Asn Asn Lys Ile Asp Lys Asp Val Ala
1435 1440 1445 1450

GAA ACT GCC AAG AAT ATC AGT GAA ATC GCT CTT AAG AAC AAA AAA GAA 4419
Glu Thr Ala Lys Asn Ile Ser Glu Ile Ala Leu Lys Asn Lys Lys Glu
1455 1460 1465

AAG AGT GGG GAA TTT GTA GAT GAA AAT GGT AAT CCC ATT GAT GAC AAA 4467
Lys Ser Gly Glu Phe Val Asp Glu Asn Gly Asn Pro Ile Asp Asp Lys
1470 1475 1480

AAG AAA GCA GAA AAA CAA GAT GAA ACA AGC CCT GTC AAA CAG GCC TTT 4515
Lys Lys Ala Glu Lys Gln Asp Glu Thr Ser Pro Val Lys Gln Ala Phe
1485 1490 1495

ATA GGC AAG AGT GAT CCC ACA TTT GTT TTA GCG CAA TAC ACC CCC ATT 4563
Ile Gly Lys Ser Asp Pro Thr Phe Val Leu Ala Gln Tyr Thr Pro Ile
1500 1505 1510

GAA ATC ACT CTG ACT TCT AAA GTA GAT GCC ACT CTC ACA GGT ATA GTG 4611
Glu Ile Thr Leu Thr Ser Lys Val Asp Ala Thr Leu Thr Gly Ile Val
1515 1520 1525 1530
GAA GCG AGA AAA TTT TTA GCG AAG CAA GTG CTA AAT TGT TTG GAA AAA 3315
Glu Ala Arg Lys Phe Leu Ala Lys Gln Val Leu Asn Cys Leu Glu Lys
1085 1090 1095

GCT GGA AAT GAA GAA GAA AGA AAA GCA TGT CTT AAA AAT CTC CCT AAA 3363
Ala Gly Asn Glu Glu Glu Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys
1100 1105 1110

GAC TTA CAG GAA AAT ATT TTA GCT AAA GAG AGT CTT AAA GCT TAT AAA 3411
Asp Leu Gln Glu Asn Ile Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys
1115 1120 1125 1130

GAC TGC CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG AGA GCT TGC GAG 3459
Asp Cys Leu Ser Gln Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu
1135 1140 1145

AAA CTA CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG CAA GAA GTT AAG 3507
Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gln Glu Val Lys
1150 1155 1160

AAA AGC GTT AAG GCT TAT TTG GAC TGC GTA TCA AGA GCT AGG AAT GAA 3555
Lys Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu
1165 1170 1175

AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG AGA AAA 3603
Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys
1180 1185 1190

TTT TTA GCG AAA GAA CTC CAA CAA AAA GAT AAA GCG ATC AAA GAT TGC 3651
Phe Leu Ala Lys Glu Leu Gln Gln Lys Asp Lys Ala Ile Lys Asp Cys
1195 1200 1205 1210

TTG AAA AAC GCC GAT CCT AAC GAC AGA GCG GCT ATC ATG AAG TGT TTG 3699
Leu Lys Asn Ala Asp Pro Asn Asp Arg Ala Ala Ile Met Lys Cys Leu
1215 1220 1225

GAT GGT TTG AGC GAT GAA GAG AAG CTC AAA TAC CTG CAA GAA GCT AGA 3747
Asp Gly Leu Ser Asp Glu Glu Lys Leu Lys Tyr Leu Gln Glu Ala Arg
1230 1235 1240

GAA AAG GCT GTT GCG GAT TGT TTG GCT ATG GCT AAA ACC GAT GAA GAA 3795
Glu Lys Ala Val Ala Asp Cys Leu Ala Met Ala Lys Thr Asp Glu Glu
1245 1250 1255

AAA AGG AAA TGC CAA AAC CTT TAT AGC GAT TTG ATC CAA GAA ATC CAA 3843
Lys Arg Lys Cys Gln Asn Leu Tyr Ser Asp Leu Ile Gln Glu Ile Gln
1260 1265 1270

AAT AAA AGG ACA CAA AAC AAA CAA AAT CAA TTG AGT AAA ACA GAA AGG 3891
Asn Lys Arg Thr Gln Asn Lys Gln Asn Gln Leu Ser Lys Thr Glu Arg
1275 1280 1285 1290

TTG CAT CAA GCA AGC GAG TGC TTG GAT AAC TTA GAT GAC CCT ACT GAT 3939
Leu His Gln Ala Ser Glu Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp
1295 1300 1305
ACC CCT GAA GCG AGA AAA CTT TTA GAA GAA GCT AAA AAA AGC GTT AAG 2643
Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys Lys Ser Val Lys
860 865 870

GCT TAY TTG GAT TGC GTA TCT CAA GCC AAA ACT GAA GCT GAG AAA AAA 2691
Ala Xaa Leu Asp Cys Val Ser Gln Ala Lys Thr Glu Ala Glu Lys Lys
875 880 885 890

GAA TGC GAG AAA TTA CTC ACC CCT GAA GCG AGA AAA CTC TTA GAA GAA 2739
Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu
895 900 905

GCT AAA GAG AGC GTT AAA GCT TAT AAA GAC TGC GTA TCA AAA GCT AGG 2787
Ala Lys Glu Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg
910 915 920

AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG 2835
Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala
925 930 935

AAA AAA CTT TTA GAG CAA CAA GTG CTA GAT TGT TTG AAA AAC GCT AAA 2883
Lys Lys Leu Leu Glu Gln Gln Val Leu Asp Cys Leu Lys Asn Ala Lys
940 945 950

ACC GAA GCT GAT AAA AAA AGG TGT GTC AAA GAT CTC CCT AAA GAC TTG 2931
Thr Glu Ala Asp Lys Lys Arg Cys Val Lys Asp Leu Pro Lys Asp Leu
955 960 965 970

CAG AAA AAG GTT TTA GCT AAA GAG AGC GTT AAG GCT TAT TTG GAC TGC 2979
Gln Lys Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys
975 980 985

GTA TCA AGA GCT AGG AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTG 3027
Val Ser Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu
990 995 1000

CTC ACC CCT GAA GCG AAA AAA CTT TTA GAA GAA GCC AAA GAG AGT CTT 3075
Leu Thr Pro Glu Ala Lys Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu
1005 1010 1015

AAA GCT TAT AAA GAC TGC CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG 3123
Lys Ala Tyr Lys Asp Cys Leu Ser Gln Ala Arg Asn Glu Glu Glu Arg
1020 1025 1030

AGA GCT TGC GAG AAA CTA CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG 3171
Arg Ala Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu
1035 1040 1045 1050

CAA GAA GTT AAG AAA AGC ATT AAG GCT TAT TTG GAC TGC GTA TCA AGA 3219
Gln Glu Val Lys Lys Ser Ile Lys Ala Tyr Leu Asp Cys Val Ser Arg
1055 1060 1065

GCT AGG AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT 3267
Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro
1070 1075 1080






GAG AAA CAA GAA TGC GAG AAA TTG CTC ACG CCT GAA GCG AGG AAA AAG 1971
Glu Lys Gln Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Lys
635 640 645 650

TTA GAA CAA CAG GTT CTA GAT TGT TTG AAA AAC GCT AAA ACC GAT GAA 2019
Leu Glu Gln Gln Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu
655 660 665

GAA CGA AAA AAG TGT TTG AAA GAT CTC CCT AAA GAC TTA CAA AGC GAT 2067
Glu Arg Lys Lys Cys Leu Lys Asp Leu Pro Lys Asp Leu Gln Ser Asp
670 675 680

ATT CTA GCC AAA GAG AGC CTG AAA GCT TAT AAA GAC TGC GTA TCT CAA 2115
Ile Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys Val Ser Gln
685 690 695

GCC AAA ACC GAA GCT GAG AAA AAA GAA TGC GAG AAA TTA CTC ACC CCT 2163
Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro
700 705 710

GAA GCG AAA AAA CTT TTA GAA GAA GAA GCC AAA GAG AGC GTT AAG GCT 2211
Glu Ala Lys Lys Leu Leu Glu Glu Glu Ala Lys Glu Ser Val Lys Ala
715 720 725 730

TAT TTG GAT TGC GTA TCT CAA GCC AAA ACC GAA GCT GAG AAA AAA GAA 2259
Tyr Leu Asp Cys Val Ser Gln Ala Lys Thr Glu Ala Glu Lys Lys Glu
735 740 745

TGC GAG AAA TTG CTC ACC CCT GAA GCG AAA AAA AAG TTA GAA GAA GCT 2307
Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys Lys Leu Glu Glu Ala
750 755 760

AAA AAA AGC GTT AAA GCT TAC TTG GAT TGC GTA TCA AGA GCT AGG AAT 2355
Lys Lys Ser Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn
765 770 775

GAA AAA GAG AAA AAA GAA TGC GAG AAA TTG CTC ACC CCT GAA GCG AAA 2403
Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys
780 785 790

AAA CTT TTA GAG CAA CAA GCA CTA GAT TGT TTG AAA AAC GCT AAA ACC 2451
Lys Leu Leu Glu Gln Gln Ala Leu Asp Cys Leu Lys Asn Ala Lys Thr
795 800 805 810

GAT AAA GAA CGA AAA AAG TGT TTG AAA GAT CTC CCT AAA GAC TTG CAG 2499
Asp Lys Glu Arg Lys Lys Cys Leu Lys Asp Leu Pro Lys Asp Leu Gln
815 820 825

AAA AAG GTT TTA GCT AAA GAA AGC GTT AAA GCT TAC TTG GAT TGC GTA 2547
Lys Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val
830 835 840

TCT CAA GCC AAA ACT GAA GCT GAG AAA AAA GAA TGC GAG AAA TTA CTC 2595
Ser Gln Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys Glu Lys Leu Leu
845 850 855
AAG AAT CTT TTC TTC AAT GAC GAT CCC AAT AGA ACC TTA TAC AAC TAT 1299
Lys Asn Leu Phe Phe Asn Asp Asp Pro Asn Arg Thr Leu Tyr Asn Tyr
415 420 425

TTG AAT ATT GCA GAA ATT GAG GAC AAA AAC CCG TTG AGA GCC TTT TAT 1347
Leu Asn Ile Ala Glu Ile Glu Asp Lys Asn Pro Leu Arg Ala Phe Tyr
430 435 440

GAA TGT ATT AGT AAT GGT GGC AAC TAT GAA GAA TGT TTG AAG CTT ATC 1395
Glu Cys Ile Ser Asn Gly Gly Asn Tyr Glu Glu Cys Leu Lys Leu Ile
445 450 455

AAA GAC AAA AAA CTT CAA GAT CAG ATG AAA AAG ACT CTA GAG GCT TAT 1443
Lys Asp Lys Lys Leu Gln Asp Gln Met Lys Lys Thr Leu Glu Ala Tyr
460 465 470

AAC GAC TGC ATC AAA AAT GCC AAA ACT GAA GAA GAA AGG ATC AAG TGT 1491
Asn Asp Cys Ile Lys Asn Ala Lys Thr Glu Glu Glu Arg Ile Lys Cys
475 480 485 490

TTA GAT TTA ATC AAA GAT GAA AAC CTA AAA AAA AGC TTA CTG AAC CAA 1539
Leu Asp Leu Ile Lys Asp Glu Asn Leu Lys Lys Ser Leu Leu Asn Gln
495 500 505

CAA AAA GTT CAA GTG GCG CTA GAT TGT TTG AAA AAC GCT AAA ACC GAT 1587
Gln Lys Val Gln Val Ala Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp
510 515 520

GAA GAA CGA AAC GAG TGC CTA AAA CTC ATA AAT GAC CCT GAG ATT AGA 1635
Glu Glu Arg Asn Glu Cys Leu Lys Leu Ile Asn Asp Pro Glu Ile Arg
525 530 535

GAG AAA TTC CGT AAG GAA TTA GAG CTT CAA AAA GAG CTT CAA GAG TAT 1683
Glu Lys Phe Arg Lys Glu Leu Glu Leu Gln Lys Glu Leu Gln Glu Tyr
540 545 550

AAG GAT TGT ATC AAA AAC GCC AAA ACA GAA GCT GAG AAA AAC AAA TGC 1731
Lys Asp Cys Ile Lys Asn Ala Lys Thr Glu Ala Glu Lys Asn Lys Cys
555 560 565 570

TTG AAA GGC TTG TCT AAA GAA GCT ATA GAG AGA TTG AAA CAG CAA GCG 1779
Leu Lys Gly Leu Ser Lys Glu Ala Ile Glu Arg Leu Lys Gln Gln Ala
575 580 585

CTA GAT TGT TTG AAA AAC GCT AAA ACC GAT GAA GAA CGA AAC GAG TGC 1827
Leu Asp Cys Leu Lys Asn Ala Lys Thr Asp Glu Glu Arg Asn Glu Cys
590 595 600

TTG AAA AAT ATT CCC CAA GAC TTG CAA AAA GAA CTA TTA GCT GAT ATG 1875
Leu Lys Asn Ile Pro Gln Asp Leu Gln Lys Glu Leu Leu Ala Asp Met
605 610 615

AGC GTC AAG GCT TAC AAG GAT TGC GTA TCA AAA GCT AGA AAT GAA AAA 1923
Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu Lys
620 625 630
Asn Phe Asp Gly Asp Lys Ser Glu Glu Thr Gln Thr Gln Met Asp Ser
190 195 200

GAA GGT AAT GAA ACT TCA GAA TCT AGC AAT GGC AGT CTA GCA GAC AAG 675
Glu Gly Asn Glu Thr Ser Glu Ser Ser Asn Gly Ser Leu Ala Asp Lys
205 210 215

TTA TTC AAA AAA GCC AGA AAA TTA GTT GAT AAT AAA AAA CCT TTC ACT 723
Leu Phe Lys Lys Ala Arg Lys Leu Val Asp Asn Lys Lys Pro Phe Thr
220 225 230

CAG CAA AAG AAT TTA GAT GAA GAA ACC CAA GAA CTG AAC GAA GAA GAC 771
Gln Gln Lys Asn Leu Asp Glu Glu Thr Gln Glu Leu Asn Glu Glu Asp
235 240 245 250

GAT CAA GAA AAT AAT GAG TAT CAA GAA GAA ACT CAA ACG GAC TTA ATT 819
Asp Gln Glu Asn Asn Glu Tyr Gln Glu Glu Thr Gln Thr Asp Leu Ile
255 260 265

GAT GAT GAA ACT TCT AAA AAA ACC CAA CAA CAT TCA CCC CAA GAT TTA 867
Asp Asp Glu Thr Ser Lys Lys Thr Gln Gln His Ser Pro Gln Asp Leu
270 275 280

TCC AAT GAA GAA GCA ACA GAA GCC AAT CAT TTT GAA AAT CTT TTA AAA 915
Ser Asn Glu Glu Ala Thr Glu Ala Asn His Phe Glu Asn Leu Leu Lys
285 290 295

GAA TCC AAA GAA AGC TCA GAT CAT CAT CTT GAC AAC CCC ACA GAA ACT 963
Glu Ser Lys Glu Ser Ser Asp His His Leu Asp Asn Pro Thr Glu Thr
300 305 310

CAA ACC AAT TTT GAT GGA GAC AAG TCA GAA GAA ATA ACT GAC GAC TCT 1011
Gln Thr Asn Phe Asp Gly Asp Lys Ser Glu Glu Ile Thr Asp Asp Ser
315 320 325 330

AAC GAT CAA GAG ATT ATC AAA GGA AGC AAA AAG AAA TAT ATT ATT GGT 1059
Asn Asp Gln Glu Ile Ile Lys Gly Ser Lys Lys Lys Tyr Ile Ile Gly
335 340 345

GGC ATT GTA GTC GCT GTT CTT ATC GTG ATT ATT TTA TTT TCT AGA AGC 1107
Gly Ile Val Val Ala Val Leu Ile Val Ile Ile Leu Phe Ser Arg Ser
350 355 360

ATT TTT CAC TAC TTC ATG CCT TTG GAA GAT AAA AGC TCT CGT TTT AGC 1155
Ile Phe His Tyr Phe Met Pro Leu Glu Asp Lys Ser Ser Arg Phe Ser
365 370 375

AAA GAC AGG AAT CTT TAT GTC AAT GAT GAA ATC CAA ATA AGG CAA GAG 1203
Lys Asp Arg Asn Leu Tyr Val Asn Asp Glu Ile Gln Ile Arg Gln Glu
380 385 390

TAT AAC CGA TTG CTG AAA GAA CGG AAT GAA AAA GGC AAT ATG ATC GAT 1251
Tyr Asn Arg Leu Leu Lys Glu Arg Asn Glu Lys Gly Asn Met Ile Asp
395 400 405 410
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1357:

TAAATAAAAA GGCGTTAAGA C ATG AAT GAA GAA AAC GAT AAA CTT GAA ACT 51
Met Asn Glu Glu Asn Asp Lys Leu Glu Thr
1 5 10

TCT AAA AAA GCC CAA CAA GAT TCA CCC CAA GAT TTA TCC AAT GAA GAA 99
Ser Lys Lys Ala Gln Gln Asp Ser Pro Gln Asp Leu Ser Asn Glu Glu
15 20 25

GCA ACA GAA GCC AAT CAT TTT GAA AAT CTT TTA AAA GAA TCC AAA GAA 147
Ala Thr Glu Ala Asn His Phe Glu Asn Leu Leu Lys Glu Ser Lys Glu
30 35 40

AGC TCA GAT CAT CAT CTT GAC AAC CCC ACA GAA ACT CAA ACC CAT TTT 195
Ser Ser Asp His His Leu Asp Asn Pro Thr Glu Thr Gln Thr His Phe
45 50 55

GAT GGA GAC AAG TCA GAA GAA ACC CAA ACT CAA ATG GAT TCT GAA GGT 243
Asp Gly Asp Lys Ser Glu Glu Thr Gln Thr Gln Met Asp Ser Glu Gly
60 65 70

AAT GAA ACT TCA GAA TCT AGC AAT GGC AGT CTA GCA GAC AAG TTA TTC 291
Asn Glu Thr Ser Glu Ser Ser Asn Gly Ser Leu Ala Asp Lys Leu Phe
75 80 85 90

AAA AAA GCC AGA AAA TTA GTT GAT AAT AAA AAA CCT TTC ACT CAG CAA 339
Lys Lys Ala Arg Lys Leu Val Asp Asn Lys Lys Pro Phe Thr Gln Gln
95 100 105

AAG AAT TTA GAT GAA GAA ACC CAA GAA CTG AAC GAA GAA GAC GAT CAA 387
Lys Asn Leu Asp Glu Glu Thr Gln Glu Leu Asn Glu Glu Asp Asp Gln
110 115 120

GAA AAT AAT GAG TAT CAA GAA GAA ACT CAA ACG GAC TTA ATT GAT GAT 435
Glu Asn Asn Glu Tyr Gln Glu Glu Thr Gln Thr Asp Leu Ile Asp Asp
125 130 135

GAA ACT TCT AAA AAA ACC CAA CAA CAT TCA CCC CAA GAT TTA TCC AAT 483
Glu Thr Ser Lys Lys Thr Gln Gln His Ser Pro Gln Asp Leu Ser Asn
140 145 150

GAA GAA GCA ACA GAA GCC AAT CAT TTT GAA AAT CTT TTA AAA GAA TCC 531
Glu Glu Ala Thr Glu Ala Asn His Phe Glu Asn Leu Leu Lys Glu Ser
155 160 165 170

AAA GAA AGC TCA GAT CAT CAT CTT GAC AAC CCC ACA GAA ACT CAA ACC 579
Lys Glu Ser Ser Asp His His Leu Asp Asn Pro Thr Glu Thr Gln Thr
175 180 185

AAT TTT GAT GGA GAC AAG TCA GAA GAA ACC CAA ACT CAA ATG GAT TCT 627

WO 98/43478



PCT/US98/06371 



1 5 10 15

Glu Arg Ala Leu Lys Glu Ala Ala Leu Asn Pro Leu Arg His Ala Thr
20 25 30

Glu Glu Leu Phe Gly Asp Phe Leu Lys Met Glu Asn Ile Thr Glu Ile
35 40 45

Cys Tyr Asn Gly Asn Lys Val Val Trp Val Leu Lys Asn Asn Gly Glu
50 55 60

Trp Gln Pro Phe Asp Val Arg Asp Arg Lys Ala Phe Ser Leu Ser Arg
65 70 75 80

Leu Met His Phe Ala Arg Cys Cys Ala Ser Phe Lys Lys Lys Thr Ile
85 90 95

Asp Asn Tyr Glu Asn Pro Ile Leu Ser Ser Asn Leu Ala Asn Gly Glu
100 105 110

Arg Val Gln Ile Val Leu Ser Pro Val Thr Val Asn Asp Glu Thr Ile
115 120 125

Ser Ile Ser Ile Arg Ile Pro Ser Lys Thr Thr Tyr Pro His Ser Phe
130 135 140

Phe Glu Glu Gln Gly Phe Tyr Asn Leu Leu Asp Asn Lys Glu Gln Ala
145 150 155 160

Ile Ser Ala Ile Lys Asp Gly Ile Ala Ile Gly Lys Asn Val Ile Val
165 170 175

Cys Gly Gly Thr Gly Ser Gly Lys Thr Thr Tyr Ile Lys Ser Ile Met
180 185 190

Glu Phe Ile Pro Lys Glu Glu Arg Ile Ile Ser Ile Glu Asp Thr Glu
195 200 205

Glu Ile Val Phe Lys His His Lys Asn Tyr Thr Gln Leu Phe Phe Gly
210 215 220

Gly Asn Ile Thr Ser Ala Asp Cys Leu Lys Ser Cys Leu Arg Met Arg
225 230 235 240

Pro Asp Arg Ile Ile Leu Gly Glu Leu Arg Ser Ser Glu Ala Tyr Asp
245 250 255

Phe Tyr Asn Val Leu Cys Ser Gly His Lys Gly Thr Leu Thr Thr Leu
260 265 270

His Ala Gly Ser Ser Glu Glu Ala Phe Ile Arg Leu Ala Asn Met Ser
275 280 285

Ser Ser Asn Ser Ala Ala Arg Asn Ile Lys Phe Glu Ser Leu Ile Glu
290 295 300

Gly Phe Lys Asp Leu Ile Asp Met Ile Val His Ile Asn His His Lys
305 310 315 320

Gln Cys Asp Glu Phe Tyr Ile Lys His Arg
325 330

(2) INFORMATION FOR SEQ ID NO: 1357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22...5250
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AAA AAT GTG ATT GTT TGT GGT GGC ACA GGA AGC GGT AAA ACG ACT TAT 579
Lys Asn Val Ile Val Cys Gly Gly Thr Gly Ser Gly Lys Thr Thr Tyr
175 180 185

ATC AAA AGC ATC ATG GAG TTT ATC CCT AAA GAA GAA AGG ATC ATA TCC 627
Ile Lys Ser Ile Met Glu Phe Ile Pro Lys Glu Glu Arg Ile Ile Ser
190 195 200

ATT GAA GAC ACC GAA GAG ATT GTA TTC AAA CAC CAC AAA AAC TAC ACA 675
Ile Glu Asp Thr Glu Glu Ile Val Phe Lys His His Lys Asn Tyr Thr
205 210 215

CAG CTT TTT TTT GGT GGG AAT ATC ACC TCT GCT GAT TGC TTA AAG TCA 723
Gln Leu Phe Phe Gly Gly Asn Ile Thr Ser Ala Asp Cys Leu Lys Ser
220 225 230 235

TGT CTG AGA ATG CGG CCT GAT AGA ATC ATT TTA GGG GAA CTC AGA AGC 771
Cys Leu Arg Met Arg Pro Asp Arg Ile Ile Leu Gly Glu Leu Arg Ser
240 245 250

AGT GAG GCA TAC GAT TTT TAT AAT GTG CTT TGT AGC GGT CAT AAA GGC 819
Ser Glu Ala Tyr Asp Phe Tyr Asn Val Leu Cys Ser Gly His Lys Gly
255 260 265

ACA CTA ACC ACT CTG CAT GCA GGG AGC AGT GAA GAA GCG TTT ATC CGT 867
Thr Leu Thr Thr Leu His Ala Gly Ser Ser Glu Glu Ala Phe Ile Arg
270 275 280

TTG GCG AAC ATG AGT TCA TCT AAT AGC GCA GCA AGG AAT ATC AAG TTT 915
Leu Ala Asn Met Ser Ser Ser Asn Ser Ala Ala Arg Asn Ile Lys Phe
285 290 295

GAA AGT CTT ATT GAG GGC TTT AAA GAT TTG ATT GAT ATG ATT GTC CAT 963
Glu Ser Leu Ile Glu Gly Phe Lys Asp Leu Ile Asp Met Ile Val His
300 305 310 315

ATC AAC CAC CAC AAA CAG TGT GAT GAA TTT TAT ATC AAA CAC AGG TAGTA 1013
Ile Asn His His Lys Gln Cys Asp Glu Phe Tyr Ile Lys His Arg
320 325 330

GGCACAATGG AAGACTTTTT GTAT 1037

(2) INFORMATION FOR SEQ ID NO: 1356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1356: 
Met Thr Glu Asp Arg Leu Ser Ala Glu Asp Lys Lys Phe Leu Glu Val 
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 1008 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1355: 

TAAATTTGGA ATAAGAAC ATG ACT GAA GAC AGA TTG AGT GCA GAA GAT AAA 51
Met Thr Glu Asp Arg Leu Ser Ala Glu Asp Lys
1 5 10

AAG TTT CTA GAA GTA GAA AGA GCT TTA AAA GAA GCG GCA TTA AAT CCT 99
Lys Phe Leu Glu Val Glu Arg Ala Leu Lys Glu Ala Ala Leu Asn Pro
15 20 25

CTA AGG CAT GCT ACT GAA GAA CTT TTT GGT GAT TTT TTA AAA ATG GAA 147
Leu Arg His Ala Thr Glu Glu Leu Phe Gly Asp Phe Leu Lys Met Glu
30 35 40

AAT ATC ACT GAG ATT TGT TAC AAT GGG AAC AAG GTT GTA TGG GTT TTA 195
Asn Ile Thr Glu Ile Cys Tyr Asn Gly Asn Lys Val Val Trp Val Leu
45 50 55

AAA AAT AAT GGC GAA TGG CAA CCA TTT GAT GTG AGA GAC AGG AAA GCC 243
Lys Asn Asn Gly Glu Trp Gln Pro Phe Asp Val Arg Asp Arg Lys Ala
60 65 70 75

TTT AGC CTG TCT CGT TTA ATG CAT TTT GCT CGG TGT TGT GCA AGT TTT 291
Phe Ser Leu Ser Arg Leu Met His Phe Ala Arg Cys Cys Ala Ser Phe
80 85 90

AAG AAA AAA ACA ATA GAC AAC TAT GAA AAT CCT ATT TTG AGC AGC AAT 339
Lys Lys Lys Thr Ile Asp Asn Tyr Glu Asn Pro Ile Leu Ser Ser Asn
95 100 105

TTA GCG AAT GGT GAA AGG GTG CAG ATT GTC CTT TCC CCT GTT ACA GTT 387
Leu Ala Asn Gly Glu Arg Val Gln Ile Val Leu Ser Pro Val Thr Val
110 115 120

AAT GAT GAA ACC ATT TCC ATA TCC ATA AGG ATA CCT AGC AAA ACA ACC 435
Asn Asp Glu Thr Ile Ser Ile Ser Ile Arg Ile Pro Ser Lys Thr Thr
125 130 135

TAT CCT CAT AGC TTC TTT GAA GAG CAA GGT TTT TAT AAT CTA CTA GAC 483
Tyr Pro His Ser Phe Phe Glu Glu Gln Gly Phe Tyr Asn Leu Leu Asp
140 145 150 155

AAC AAA GAA CAA GCG ATC AGC GCG ATT AAA GAT GGT ATT GCT ATT GGT 531
Asn Lys Glu Gln Ala Ile Ser Ala Ile Lys Asp Gly Ile Ala Ile Gly
160 165 170
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Glu Phe Val Lys Arg Lys Lys Ile Ile Met Pro Glu Thr Pro Thr Met
385 390 395 400

Phe Phe Ile Gly Ser Met Ala Ser Gly Ile Asn Leu Ile Asp Glu Asp
405 410 415

Thr Asn Met Glu Lys Val Val Ser Leu Met Glu Phe Phe Gly Gly Glu
420 425 430

Glu Asp Lys Ser Gly Asp Asn Leu Arg Val Leu Ser Pro Ala Thr Arg
435 440 445

Asn Met Trp Asn Ser Phe Lys Thr Met Gly Gly Ala Arg Glu Thr Tyr
450 455 460

Ser Ser Val Gln Gly Val Tyr Thr Ser Ala Phe Ala Pro Tyr Asn Asn
465 470 475 480

Ala Met Ile Arg Asn Phe Thr Ser Ala Asn Asp Phe Asp Phe Arg Arg
485 490 495

Leu Arg Ile Asp Glu Val Ser Ile Gly Val Ile Ala Asn Pro Lys Glu
500 505 510

Ser Thr Ile Val Gly Pro Ile Leu Glu Leu Phe Phe Asn Val Met Ile
515 520 525

Tyr Ser Asn Leu Ile Leu Pro Ile His Asp Pro Gln Cys Lys Arg Ser
530 535 540

Cys Leu Met Leu Met Asp Glu Phe Thr Leu Cys Gly Tyr Leu Glu Thr
545 550 555 560

Phe Val Lys Ala Val Gly Ile Met Ala Glu Tyr Asn Met Arg Pro Ala
565 570 575

Phe Val Phe Gln Ser Lys Ala Gln Leu Glu Asn Asp Pro Pro Leu Gly
580 585 590

Tyr Gly Arg Asn Gly Ala Lys Thr Ile Leu Asp Asn Leu Ser Leu Asn
595 600 605

Met Tyr Tyr Gly Ile Asn Asn Asp Asn Tyr Tyr Glu His Phe Glu Lys
610 615 620

Leu Ser Lys Val Leu Gly Lys Tyr Thr Arg Gln Asp Val Ser Arg Ser
625 630 635 640

Ile Asp Asp Asn Thr Gly Lys Thr Asn Thr Ser Ile Ser Asn Lys Glu
645 650 655

Arg Phe Leu Met Thr Pro Asp Glu Leu Met Thr Met Gly Asp Glu Leu
660 665 670

Ile Ile Leu Glu Asn Thr Leu Lys Pro Ile Lys Cys His Lys Ala Leu
675 680 685

Tyr Tyr Asp Asp Pro Phe Phe Thr Asp Glu Leu Ile Lys Val Ser Pro
690 695 700

Ser Leu Ser Lys Lys Tyr Lys Leu Gly Lys Val Pro Asn Gln Ala Thr
705 710 715 720

Phe Tyr Asp Asp Leu Gln Ala Ala Lys Thr Arg Gly Glu Leu Ser Tyr
725 730 735

Asp Lys Ser Leu Val Pro Val Gly Ser Ser Glu Leu
740 745

(2) INFORMATION FOR SEQ ID NO: 1355:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1354:



Met Glu Asp Phe Leu Tyr Asn Thr Leu Tyr Phe Ile Glu Asp Tyr Lys
1 5 10 15

Leu Val Val Ile Phe Ser Phe Ile Gly Leu Ile Ala Leu Phe Phe Leu
20 25 30

Tyr Lys Phe Ile Lys Ala Gln Lys Lys Ala Phe Lys Asp Lys Ala Asn
35 40 45

Gln Pro Gln Lys Lys Lys Ser Phe Lys Glu Ile Ile Ile Asp Gly Leu
50 55 60

Lys Glu Arg Val Lys Thr Phe Gly Phe Trp Leu Gln Ala Ile Leu Leu
65 70 75 80

Leu Ser Tyr Ser Phe Ile Thr Ser Gly Leu Phe Phe Leu Ile Leu Leu
85 90 95

Gly Asn Phe Tyr Asp Asp Asn Arg Ser Pro Glu Ser Asp Asp Asp Leu
100 105 110

Phe Asp Ile Trp Ile Tyr Ala Ile Gln Asp Phe Pro Asn Tyr Tyr Phe
115 120 125

Lys Ala Leu Gly Phe Ser Ser Leu Lys Ile Tyr Gly Phe Asn Ile Ser
130 135 140

Leu Val Val Tyr Gly Ser Ile Leu Cys Ser Tyr Ile Phe Ile Thr Phe
145 150 155 160

Phe Val Trp Phe Leu Lys Tyr Leu Thr Arg Thr Arg Asp Ile Gly Ala
165 170 175

Asn Lys Lys Val Asp Asp Leu Phe Gly Ser Ala Ser Trp Glu Thr Glu
180 185 190

Glu Lys Met Ile Lys Ala Lys Leu Ile Thr Pro Asn Asn Lys Lys Arg
195 200 205

Ala Phe Asp Lys Arg Glu Val Ile Val Gly Arg Arg Gly Leu Gly Asp
210 215 220

Phe Ile Ala Tyr Ala Gly Gln Ala Phe Ile Gly Leu Ile Ala Pro Thr
225 230 235 240

Arg Ser Gly Lys Gly Val Gly Phe Ile Met Pro Asn Met Ile Asn Tyr
245 250 255

Pro Gln Asn Ile Val Val Phe Asp Pro Lys Ala Asp Thr Met Glu Thr
260 265 270

Cys Gly Lys Ile Arg Glu Lys Arg Phe Asn Gln Lys Val Phe Ile Tyr
275 280 285

Glu Pro Phe Ser Leu Lys Thr His Arg Phe Asn Pro Phe Ala Tyr Val
290 295 300

Asp Phe Gly Asn Asp Val Val Leu Thr Glu Asp Ile Leu Ser Gln Ile
305 310 315 320

Asp Thr Arg Leu Lys Gly His Gly Met Val Ala Ser Gly Gly Asp Phe
325 330 335

Ser Thr Gln Ile Phe Gly Leu Ala Lys Leu Val Phe Pro Glu Arg Pro
340 345 350

Asn Glu Lys Asp Pro Phe Phe Ser Asn Gln Ala Arg Asn Leu Phe Val
355 360 365

Ile Asn Cys Asn Ile Tyr Arg Asp Leu Met Trp Thr Lys Lys Gly Leu
370 375 380
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GCA GAA TAC AAC ATG CGC CCC GCT TTT GTG TTT CAA AGT AAG GCG CAA 1782
Ala Glu Tyr Asn Met Arg Pro Ala Phe Val Phe Gln Ser Lys Ala Gln
570 575 580

CTA GAG AAT GAC CCC CCA CTT GGT TAT GGT AGG AAT GGC GCT AAG ACT 1830
Leu Glu Asn Asp Pro Pro Leu Gly Tyr Gly Arg Asn Gly Ala Lys Thr
585 590 595 600

ATT TTA GAC AAC CTT TCT TTG AAT ATG TAT TAT GGG ATT AAC AAC GAT 1878
Ile Leu Asp Asn Leu Ser Leu Asn Met Tyr Tyr Gly Ile Asn Asn Asp
605 610 615

AAC TAC TAT GAA CAT TTT GAA AAA CTT TCT AAG GTA TTA GGG AAA TAC 1926
Asn Tyr Tyr Glu His Phe Glu Lys Leu Ser Lys Val Leu Gly Lys Tyr
620 625 630

ACA AGG CAA GAC GTG AGC CGA AGC ATT GAT GAT AAT ACA GGT AAG ACC 1974
Thr Arg Gln Asp Val Ser Arg Ser Ile Asp Asp Asn Thr Gly Lys Thr
635 640 645

AAC ACT TCT ATC AGC AAC AAA GAG CGG TTT TTG ATG ACC CCT GAT GAA 2022
Asn Thr Ser Ile Ser Asn Lys Glu Arg Phe Leu Met Thr Pro Asp Glu
650 655 660

TTG ATG ACT ATG GGC GAT GAG CTT ATC ATT CTA GAG AAT ACG CTC AAA 2070
Leu Met Thr Met Gly Asp Glu Leu Ile Ile Leu Glu Asn Thr Leu Lys
665 670 675 680

CCT ATC AAG TGC CAC AAG GCG CTT TAC TAT GAT GAT CCA TTC TTC ACC 2118
Pro Ile Lys Cys His Lys Ala Leu Tyr Tyr Asp Asp Pro Phe Phe Thr
685 690 695

GAT GAA CTC ATT AAG GTA AGT CCA AGC TTG AGC AAG AAA TAC AAA TTG 2166
Asp Glu Leu Ile Lys Val Ser Pro Ser Leu Ser Lys Lys Tyr Lys Leu
700 705 710

GGG AAA GTG CCT AAT CAA GCA ACT TTC TAT GAT GAT TTG CAA GCC GCT 2214
Gly Lys Val Pro Asn Gln Ala Thr Phe Tyr Asp Asp Leu Gln Ala Ala
715 720 725

AAA ACT AGA GGT GAA TTG AGT TAT GAT AAA TCT TTA GTG CCT GTG GGT 2262
Lys Thr Arg Gly Glu Leu Ser Tyr Asp Lys Ser Leu Val Pro Val Gly
730 735 740

TCA AGT GAA CTG TGATTAAGAC AAAATATCTT AACAAAAAGA AAATTAAAAG ATAAT 2319
Ser Ser Glu Leu
745

GATATAAATA 2329

(2) INFORMATION FOR SEQ ID NO:1354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 amino acids

(B) TYPE: amino acid 

AAG CTC GTG TTC CCT GAA AGA CCT AAT GAA AAA GAT CCT TTC TTT AGC 1110
Lys Leu Val Phe Pro Glu Arg Pro Asn Glu Lys Asp Pro Phe Phe Ser
345 350 355 360

AAT CAA GCG CGA AAT CTT TTT GTC ATC AAT TGC AAT ATT TAC AGG GAT 1158
Asn Gln Ala Arg Asn Leu Phe Val Ile Asn Cys Asn Ile Tyr Arg Asp
365 370 375

CTC ATG TGG ACT AAA AAG GGG CTT GAG TTT GTC AAA AGA AAA AAA ATC 1206
Leu Met Trp Thr Lys Lys Gly Leu Glu Phe Val Lys Arg Lys Lys Ile
380 385 390

ATC ATG CCT GAA ACA CCC ACG ATG TTT TTC ATA GGT TCT ATG GCA AGC 1254
Ile Met Pro Glu Thr Pro Thr Met Phe Phe Ile Gly Ser Met Ala Ser
395 400 405

GGG ATC AAC TTG ATT GAT GAA GAC ACA AAC ATG GAA AAA GTC GTG TCT 1302
Gly Ile Asn Leu Ile Asp Glu Asp Thr Asn Met Glu Lys Val Val Ser
410 415 420

TTA ATG GAA TTT TTT GGA GGT GAA GAA GAT AAG AGT GGC GAT AAT CTA 1350
Leu Met Glu Phe Phe Gly Gly Glu Glu Asp Lys Ser Gly Asp Asn Leu
425 430 435 440

AGA GTG CTT AGT CCT GCC ACT AGA AAC ATG TGG AAT AGC TTC AAG ACA 1398
Arg Val Leu Ser Pro Ala Thr Arg Asn Met Trp Asn Ser Phe Lys Thr
445 450 455

ATG GGC GGC GCT AGA GAA ACT TAT AGC TCG GTT CAA GGG GTA TAC ACT 1446
Met Gly Gly Ala Arg Glu Thr Tyr Ser Ser Val Gln Gly Val Tyr Thr
460 465 470

TCA GCC TTT GCG CCT TAT AAT AAC GCA ATG ATT AGA AAT TTC ACG AGC 1494
Ser Ala Phe Ala Pro Tyr Asn Asn Ala Met Ile Arg Asn Phe Thr Ser
475 480 485

GCC AAT GAT TTT GAT TTC AGG CGT TTA AGG ATC GAT GAA GTG AGT ATT 1542
Ala Asn Asp Phe Asp Phe Arg Arg Leu Arg Ile Asp Glu Val Ser Ile
490 495 500

GGT GTG ATC GCT AAT CCT AAA GAA AGC ACT ATT GTT GGA CCG ATA TTA 1590
Gly Val Ile Ala Asn Pro Lys Glu Ser Thr Ile Val Gly Pro Ile Leu
505 510 515 520

GAG CTG TTT TTC AAT GTG ATG ATT TAT AGC AAT TTG ATT CTG CCA ATC 1638
Glu Leu Phe Phe Asn Val Met Ile Tyr Ser Asn Leu Ile Leu Pro Ile
525 530 535

CAT GAT CCA CAG TGC AAA AGA AGT TGC TTG ATG CTC ATG GAC GAA TTC 1686
His Asp Pro Gln Cys Lys Arg Ser Cys Leu Met Leu Met Asp Glu Phe
540 545 550

ACT TTA TGT GGC TAT TTA GAG ACC TTT GTT AAA GCG GTA GGG ATT ATG 1734
Thr Leu Cys Gly Tyr Leu Glu Thr Phe Val Lys Ala Val Gly Ile Met
555 560 565
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CAA GAT TTT CCT AAT TAC TAT TTT AAA GCG CTT GGT TTT AGT TCA CTC 438
Gln Asp Phe Pro Asn Tyr Tyr Phe Lys Ala Leu Gly Phe Ser Ser Leu
125 130 135

AAG ATT TAT GGG TTC AAT ATA TCC TTA GTC GTA TAT GGT TCT ATT TTA 486
Lys Ile Tyr Gly Phe Asn Ile Ser Leu Val Val Tyr Gly Ser Ile Leu
140 145 150

TGC TCT TAT ATC TTC ATT ACC TTT TTT GTG TGG TTC TTA AAA TAC TTA 534
Cys Ser Tyr Ile Phe Ile Thr Phe Phe Val Trp Phe Leu Lys Tyr Leu
155 160 165

ACT CGG ACT AGA GAT ATA GGA GCG AAT AAA AAA GTT GAT GAT CTC TTT 582
Thr Arg Thr Arg Asp Ile Gly Ala Asn Lys Lys Val Asp Asp Leu Phe
170 175 180

GGT AGC GCG AGT TGG GAA ACT GAA GAG AAA ATG ATC AAA GCC AAA CTC 630
Gly Ser Ala Ser Trp Glu Thr Glu Glu Lys Met Ile Lys Ala Lys Leu
185 190 195 200

ATC ACG CCC AAC AAT AAA AAA CGC GCC TTT GAC AAA CGA GAG GTG ATT 678
Ile Thr Pro Asn Asn Lys Lys Arg Ala Phe Asp Lys Arg Glu Val Ile
205 210 215

GTA GGC AGG CGT GGC TTG GGG GAT TTT ATC GCT TAC GCA GGG CAG GCG 726
Val Gly Arg Arg Gly Leu Gly Asp Phe Ile Ala Tyr Ala Gly Gln Ala
220 225 230

TTC ATT GGC TTG ATT GCT CCT ACT AGA AGC GGT AAG GGG GTG GGT TTC 774
Phe Ile Gly Leu Ile Ala Pro Thr Arg Ser Gly Lys Gly Val Gly Phe
235 240 245

ATC ATG CCC AAT ATG ATC AAT TAT CCT CAA AAT ATC GTT GTG TTT GAC 822
Ile Met Pro Asn Met Ile Asn Tyr Pro Gln Asn Ile Val Val Phe Asp
250 255 260

CCT AAA GCT GAC ACT ATG GAG ACT TGC GGA AAA ATC AGA GAA AAA CGC 870
Pro Lys Ala Asp Thr Met Glu Thr Cys Gly Lys Ile Arg Glu Lys Arg
265 270 275 280

TTC AAC CAA AAA GTG TTC ATC TAT GAA CCT TTC TCC TTA AAA ACA CAC 918
Phe Asn Gln Lys Val Phe Ile Tyr Glu Pro Phe Ser Leu Lys Thr His
285 290 295

CGA TTT AAT CCT TTC GCT TAT GTG GAT TTT GGT AAT GAT GTG GTT TTG 966
Arg Phe Asn Pro Phe Ala Tyr Val Asp Phe Gly Asn Asp Val Val Leu
300 305 310

ACC GAA GAC ATA CTC TCT CAA ATT GAC ACA CGC CTA AAA GGG CAT GGC 1014
Thr Glu Asp Ile Leu Ser Gln Ile Asp Thr Arg Leu Lys Gly His Gly
315 320 325

ATG GTG GCT AGT GGA GGG GAT TTT TCC ACT CAA ATC TTT GGA TTA GCA 1062
Met Val Ala Ser Gly Gly Asp Phe Ser Thr Gln Ile Phe Gly Leu Ala
330 335 340
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100 105 110 

Lys Phe Arg Cys Ile Tyr Ser Gln Val Arg Tyr Asn Glu
115 120 125

(2) INFORMATION FOR SEQ ID NO: 1353:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31... 2274 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1353:

ATTTTATATC AAACACAGGT AGTAGGCACA ATG GAA GAC TTT TTG TAT AAC ACC 54
Met Glu Asp Phe Leu Tyr Asn Thr
1 5

TTA TAT TTC ATA GAG GAT TAT AAG TTG GTT GTT ATT TTT AGT TTC ATA 102
Leu Tyr Phe Ile Glu Asp Tyr Lys Leu Val Val Ile Phe Ser Phe Ile
10 15 20

GGG TTA ATA GCG TTA TTT TTT CTT TAC AAA TTC ATA AAA GCT CAA AAA 150
Gly Leu Ile Ala Leu Phe Phe Leu Tyr Lys Phe Ile Lys Ala Gln Lys
25 30 35 40

AAG GCT TTT AAA GAT AAA GCT AAC CAG CCT CAA AAG AAA AAA AGC TTT 198
Lys Ala Phe Lys Asp Lys Ala Asn Gln Pro Gln Lys Lys Lys Ser Phe
45 50 55

AAA GAA ATC ATT ATA GAT GGG CTG AAA GAA AGG GTT AAA ACC TTT GGC 246
Lys Glu Ile Ile Ile Asp Gly Leu Lys Glu Arg Val Lys Thr Phe Gly
60 65 70

TTT TGG TTG CAA GCT ATA CTA TTA CTA TCC TAT TCT TTT ATC ACA TCA 294
Phe Trp Leu Gln Ala Ile Leu Leu Leu Ser Tyr Ser Phe Ile Thr Ser
75 80 85

GGA TTA TTT TTC TTG ATT CTC TTA GGT AAT TTT TAT GAT GAT AAT CGA 342
Gly Leu Phe Phe Leu Ile Leu Leu Gly Asn Phe Tyr Asp Asp Asn Arg
90 95 100

TCG CCT GAG AGC GAT GAT GAT CTT TTT GAT ATA TGG ATC TAT GCG ATA 390
Ser Pro Glu Ser Asp Asp Asp Leu Phe Asp Ile Trp Ile Tyr Ala Ile
105 110 115 120
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CTA GGG TTG AAC AAC AGG CGA GAA AAA TCC CTT AAA GAC ACT TCT TAT 158
Leu Gly Leu Asn Asn Arg Arg Glu Lys Ser Leu Lys Asp Thr Ser Tyr
15 20 25 30

TCC ATG TTT CAT ATC ACC CTA AAC ACC GCT AAA AAA TTC TAC CCT ACC 206
Ser Met Phe His Ile Thr Leu Asn Thr Ala Lys Lys Phe Tyr Pro Thr
35 40 45

TAC TCT AAA ACG CTC CTC AAA TTC AAA TTG CTA AAC GAT GTG GGT TTT 254
Tyr Ser Lys Thr Leu Leu Lys Phe Lys Leu Leu Asn Asp Val Gly Phe
50 55 60

GCG ATC CAA TTA GCC AAA CAA ATT TTA AAA GAA AAT TTT GAT TAT TAC 302
Ala Ile Gln Leu Ala Lys Gln Ile Leu Lys Glu Asn Phe Asp Tyr Tyr
65 70 75

AAA CAA AAA CAC CCC AAC AAA AGC GTG TAT CAA TTA GTA GAA ATG GCA 350
Lys Gln Lys His Pro Asn Lys Ser Val Tyr Gln Leu Val Glu Met Ala
80 85 90

ATA GGC GCT TAC AAT GGG GGA ATG AAA CAC AAC CCT AAT GGC GCT TAC 398
Ile Gly Ala Tyr Asn Gly Gly Met Lys His Asn Pro Asn Gly Ala Tyr
95 100 105 110

GTG AAA AAA TTC CGT TGC ATT TAT TCT CAA GTG CGA TAT AAC GAG TAGAG 448
Val Lys Lys Phe Arg Cys Ile Tyr Ser Gln Val Arg Tyr Asn Glu
115 120 125

CATACTCATT TTATAAGCAA TCTTGATGAC ACACTTCTAC TATCTTATGA ATTTAT 504

(2) INFORMATION FOR SEQ ID NO: 1352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1352: 

Met Cys Arg Thr Leu Ile Ser Ile Ala Leu Leu Glu Ser Ser Leu Gly
1 5 10 15

Leu Asn Asn Arg Arg Glu Lys Ser Leu Lys Asp Thr Ser Tyr Ser Met
20 25 30

Phe His Ile Thr Leu Asn Thr Ala Lys Lys Phe Tyr Pro Thr Tyr Ser
35 40 45

Lys Thr Leu Leu Lys Phe Lys Leu Leu Asn Asp Val Gly Phe Ala Ile
50 55 60

Gln Leu Ala Lys Gln Ile Leu Lys Glu Asn Phe Asp Tyr Tyr Lys Gln
65 70 75 80

Lys His Pro Asn Lys Ser Val Tyr Gln Leu Val Glu Met Ala Ile Gly
85 90 95

Ala Tyr Asn Gly Gly Met Lys His Asn Pro Asn Gly Ala Tyr Val Lys
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(2) INFORMATION FOR SEQ ID NO: 1350:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1350: 



Met Tyr Gly Gly Asn Ala Met Ala Asp Thr Ile Asn Thr Thr Glu Ala
1 5 10 15

Thr His Glu Thr Lys Lys Pro Asn Ala Phe Val Asn Phe Phe Lys Asn
20 25 30

Asn Leu Thr Asp Lys Arg Tyr Asp Ser Leu Gly Leu Ile Gly Ala Gly
35 40 45

Val Leu Cys Cys Val Leu Ser Gly Ala Met Gly Ile Val Gly Ile Ile
50 55 60

Phe Val Ala Ile Gly Ile Phe Leu Ser Phe Ser Asn Ile Asn Leu Val
65 70 75 80

Lys Leu Val Glu Lys Leu Ser Lys Lys Gln Ser Lys Val Pro Thr Thr
85 90 95

Val Asn Asn Glu Thr Gln Lys Ser Gln Ala Thr Ser Val Thr Asn Glu
100 105 110

Pro Thr Glu Ala Lys Glu Thr Lys Asp
115 120



(2) INFORMATION FOR SEQ ID NO: 1351: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 69...443
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1351:

TAACCATTAG TTTCAAGCAG TATGAAAATC TTCTCCATAT CCATCAAAAA GGTTGCGACA 60

ATGAAGTG ATG TGC AGA ACG CTC ATC TCT ATC GCT TTG TTA GAA AGC TCT 110
Met Cys Arg Thr Leu Ile Ser Ile Ala Leu Leu Glu Ser Ser
1 5 10

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PCTAJS98/06371 



(2) INFORMATION FOR SEQ ID NO: 1349:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34...396
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1349:

TGTTTCATAG TAACAAATTG AAAATATACC ATT ATG TAT GGA GGT AAT GCT ATG 54
Met Tyr Gly Gly Asn Ala Met
1 5

GCT GAC ACA ATC AAT ACA ACT GAA GCA ACT CAT GAA ACA AAA AAA CCA 102
Ala Asp Thr Ile Asn Thr Thr Glu Ala Thr His Glu Thr Lys Lys Pro
10 15 20

AAC GCT TTT GTA AAT TTT TTC AAA AAC AAT TTG ACT GAT AAG CGT TAT 150
Asn Ala Phe Val Asn Phe Phe Lys Asn Asn Leu Thr Asp Lys Arg Tyr
25 30 35

GAT TCA TTA GGT CTC ATT GGA GCA GGG GTT TTA TGT TGT GTC TTG AGC 198
Asp Ser Leu Gly Leu Ile Gly Ala Gly Val Leu Cys Cys Val Leu Ser
40 45 50 55

GGT GCT ATG GGG ATT GTT GGG ATA ATC TTT GTC GCA ATA GGA ATC TTT 246
Gly Ala Met Gly Ile Val Gly Ile Ile Phe Val Ala Ile Gly Ile Phe
60 65 70

TTG TCT TTT TCT AAT ATC AAC TTA GTG AAA TTA GTT GAA AAA TTG TCC 294
Leu Ser Phe Ser Asn Ile Asn Leu Val Lys Leu Val Glu Lys Leu Ser
75 80 85

AAA AAA CAA TCT AAA GTG CCA ACA ACT GTC AAT AAC GAA ACT CAA AAA 342
Lys Lys Gln Ser Lys Val Pro Thr Thr Val Asn Asn Glu Thr Gln Lys
90 95 100

TCT CAA GCA ACA AGC GTT ACC AAC GAA CCA ACT GAA GCC AAA GAG ACT 390
Ser Gln Ala Thr Ser Val Thr Asn Glu Pro Thr Glu Ala Lys Glu Thr
105 110 115

AAA GAT TGAGGCAAAA CAACGATTTT GACTGAAGAA AGAATGAGAG AAAATTTCAA AA 448
Lys Asp
120
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GAA GAC AAT AAG ATC GCT GCT ATT GGG AGT GGG GGG AAT TAC GCT TTA 435
Glu Asp Asn Lys Ile Ala Ala Ile Gly Ser Gly Gly Asn Tyr Ala Leu
125 130 135

AGC GCG GCT AGG GCT TTA GAT CAT TTC GCT CAT TTA GAG CCT AGA AAA 483
Ser Ala Ala Arg Ala Leu Asp His Phe Ala His Leu Glu Pro Arg Lys
140 145 150 155

CTT GTA GAA GAG TCC TTA AAA ATC GCA GGG GAT CTT TGC ATT TAC ACC 531
Leu Val Glu Glu Ser Leu Lys Ile Ala Gly Asp Leu Cys Ile Tyr Thr
160 165 170

AAC ACG AAT ATT AAA ATT TTG GAG CTT TAATGTCTAA ATTGAATATG ACCCCAC 585
Asn Thr Asn Ile Lys Ile Leu Glu Leu
175 180

GAGAAATTGT CGC 598

(2) INFORMATION FOR SEQ ID NO: 1348:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1348: 

Met Phe Glu Ala Thr Thr Ile Leu Gly Tyr Arg Gly Glu Leu Asn His
1 5 10 15

Lys Lys Phe Ala Leu Ile Gly Gly Asp Gly Gln Val Thr Leu Gly Asn
20 25 30

Cys Val Val Lys Ala Asn Ala Thr Lys Ile Arg Ser Leu Tyr His Asn
35 40 45

Gln Val Leu Ser Gly Phe Ala Gly Ser Thr Ala Asp Ala Phe Ser Leu
50 55 60

Phe Asp Met Phe Glu Arg Ile Leu Glu Ser Lys Lys Gly Asp Leu Phe
65 70 75 80

Lys Ser Val Val Asp Phe Ser Lys Glu Trp Arg Lys Asp Lys Tyr Leu
85 90 95

Arg Arg Leu Glu Ala Met Met Ile Val Leu Asn Phe Asp His Ile Phe
100 105 110

Ile Leu Ser Gly Met Gly Asp Val Leu Glu Ala Glu Asp Asn Lys Ile
115 120 125

Ala Ala Ile Gly Ser Gly Gly Asn Tyr Ala Leu Ser Ala Ala Arg Ala
130 135 140

Leu Asp His Phe Ala His Leu Glu Pro Arg Lys Leu Val Glu Glu Ser
145 150 155 160

Leu Lys Ile Ala Gly Asp Leu Cys Ile Tyr Thr Asn Thr Asn Ile Lys
165 170 175

Ile Leu Glu Leu
180
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Leu Gly Leu Ile Lys Gln Ala Leu Glu Lys Gln Met Gly Gly Arg Asn
645 650 655

Arg Gln

(2) INFORMATION FOR SEQ ID NO:1347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 558 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1347: 

GTGGTGGCTG AGTAGAAA ATG TTT GAA GCG ACG ACG ATT TTA GGC TAT AGA 51
Met Phe Glu Ala Thr Thr Ile Leu Gly Tyr Arg
1 5 10

GGG GAA TTG AAT CAT AAA AAG TTC GCG CTC ATT GGA GGC GAT GGG CAG 99
Gly Glu Leu Asn His Lys Lys Phe Ala Leu Ile Gly Gly Asp Gly Gln
15 20 25

GTA ACT TTG GGT AAT TGC GTG GTC AAA GCC AAT GCG ACA AAA ATC AGA 147
Val Thr Leu Gly Asn Cys Val Val Lys Ala Asn Ala Thr Lys Ile Arg
30 35 40

AGC TTG TAT CAC AAC CAG GTT TTA AGC GGG TTT GCC GGA AGC ACC GCG 195
Ser Leu Tyr His Asn Gln Val Leu Ser Gly Phe Ala Gly Ser Thr Ala
45 50 55

GAC GCT TTT AGT TTG TTT GAT ATG TTT GAA CGC ATT TTA GAG AGC AAA 243
Asp Ala Phe Ser Leu Phe Asp Met Phe Glu Arg Ile Leu Glu Ser Lys
60 65 70 75

AAG GGG GAT TTG TTT AAA AGC GTG GTG GAT TTC AGT AAA GAA TGG CGC 291
Lys Gly Asp Leu Phe Lys Ser Val Val Asp Phe Ser Lys Glu Trp Arg
80 85 90

AAA GAT AAG TAT TTA CGC CGA CTG GAA GCG ATG ATG ATC GTT TTA AAC 339
Lys Asp Lys Tyr Leu Arg Arg Leu Glu Ala Met Met Ile Val Leu Asn
95 100 105

TTC GAT CAC ATT TTC ATT TTG AGC GGC ATG GGC GAT GTT TTA GAA GCT 387
Phe Asp His Ile Phe Ile Leu Ser Gly Met Gly Asp Val Leu Glu Ala
110 115 120
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195 200 205

His Tyr Leu Thr Ile Lys Ile Gly Lys Ile Pro Asn Glu Lys Arg Val
210 215 220

Tyr Glu Ala Phe Lys Asp Tyr Arg Gln Lys Lys Gly Ile Glu Ile Glu
225 230 235 240

Asp Leu Leu Lys Asp Leu Gln Lys Tyr Cys Gly Tyr Phe Cys Gln Ile
245 250 255

Ala Phe Lys Lys Glu Asp Asp Lys Asp Leu Asn Lys Ala Leu Ser Phe
260 265 270

Leu Val Asn Leu Glu Met Asp Val Ile Tyr Pro Leu Leu Leu Glu Leu
275 280 285

Tyr Ser Asp Tyr Lys Asp Gly Val Leu Ser Lys Gln Asp Phe Ile Pro
290 295 300

Ile Ile Tyr Leu Ile Glu Ser Tyr Ile Cys Arg Arg Ala Val Cys Gly
305 310 315 320

Leu Gly Thr Asn Ser Leu Asn Lys Val Phe Pro Ser Phe Thr Lys His
325 330 335

Ile Gln Lys Asp Glu Tyr Phe Lys Ser Leu Lys Ala His Phe Val Cys
340 345 350

Leu Thr Glu Lys Gln Arg Phe Pro Asn Asn Asp Glu Phe Lys Lys Leu
355 360 365

Phe Ile Thr Ile Asp Phe Tyr Lys Phe Lys Lys Asn Lys Tyr Phe Leu
370 375 380

Glu Arg Leu Glu Asn Phe Asp Thr Lys Glu Pro Val Asp Thr Gln Lys
385 390 395 400

Cys Asn Ile Glu His Ile Met Pro Gln Thr Leu Thr Pro Glu Trp Gln
405 410 415

Arg Asp Leu Gly Glu Asn Phe Gln Ala Ile His Glu Lys Tyr Leu His
420 425 430

Thr Ile Gly Asn Leu Thr Leu Thr Gly Tyr Asn Ser Lys Tyr Ser Asn
435 440 445

Asn Ser Phe Gln Glu Lys Arg Asp Met Glu Lys Gly Phe Lys Gln Ser
450 455 460

Ser Leu Lys Leu Asn Gln Ser Leu Lys Asp Leu Glu Ser Phe Gly Glu
465 470 475 480

Lys Glu Ile Glu Lys Arg Ala Ser Asp Leu Ala Asp Trp Ala Leu Lys
485 490 495

Ile Trp Thr Tyr Pro Ile Leu Glu Ala Glu Thr Leu Glu Glu Tyr Lys
500 505 510

Pro Lys Lys Glu Lys Lys Glu Lys Lys Glu Lys Glu Glu Tyr Lys Leu
515 520 525

Lys Lys Glu Lys Lys Val Tyr Asp Leu Ser Ser Tyr Lys Phe Ser Ser
530 535 540

Asp Ser Arg Glu Leu Phe Asp Ile Leu Arg Glu Lys Ile Lys Ala Leu
545 550 555 560

Asp Glu Arg Ile Thr Glu Lys Phe Asn Gln Lys Tyr Ile Ala Tyr Lys
565 570 575

Phe Cys Lys Ile Ser Phe Val Asp Ile Val Val Gln Glu Lys Gly Leu
580 585 590

Lys Leu Tyr Leu Lys Met Asn Leu Asn Glu Leu Gln Asp Glu Ile Lys
595 600 605

Glu Lys Leu Lys Ile Arg Asp Val Ser Asn Ile Gly Arg Pro Cys Val
610 615 620

Gly Asn Met Glu Val Glu Leu Glu Thr Lys Glu Asn Ile Pro Tyr Cys
625 630 635 640
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TTA AAA ATG AAC TTG AAT GAA TTG CAA GAT GAA ATA AAG GAA AAA CTA 1881
Leu Lys Met Asn Leu Asn Glu Leu Gln Asp Glu Ile Lys Glu Lys Leu
600 605 610

AAA ATT AGA GAC GTT TCT AAT ATC GGT CGT CCA TGC GTT GGA AAC ATG 1929
Lys Ile Arg Asp Val Ser Asn Ile Gly Arg Pro Cys Val Gly Asn Met
615 620 625

GAA GTA GAG CTA GAA ACA AAA GAA AAT ATC CCT TAT TGT TTG GGA TTG 1977
Glu Val Glu Leu Glu Thr Lys Glu Asn Ile Pro Tyr Cys Leu Gly Leu
630 635 640

ATC AAG CAG GCT TTA GAA AAA CAG ATG GGT GGT AGG AAT AGG CAA TAAAA 2027
Ile Lys Gln Ala Leu Glu Lys Gln Met Gly Gly Arg Asn Arg Gln
645 650 655

ACCCAACTTA TTCAAAATAA AGAGTATAAT TACAAATTAC TTAC 2071

(2) INFORMATION FOR SEQ ID NO: 1346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1346: 

Met Asn Gly His Phe Ile Gly Ser Ile Leu Tyr Val Leu Asp Ser Asn
1 5 10 15

Thr His Ser Asn Asn Thr Leu Leu Ile Ile Asp Gly Gln Gln Arg Leu
20 25 30

Thr Thr Ile Thr Leu Leu Leu Ile Ala Leu Arg Asn His Leu Ser Glu
35 40 45

Glu Val Glu Ile Leu Glu Lys Phe Ser Arg Lys Glu Ile Glu Ser Tyr
50 55 60

Leu Ile Asn Ser Asn Lys Asp Gly Asp Lys Lys Phe Arg Leu Ile Leu
65 70 75 80

Ser Glu Ser Asp Lys Asp Thr Leu Leu Ser Leu Ile Asp Lys Asn Lys
85 90 95

Arg Lys Pro Ser Glu Pro Ser Val Lys Ile Val Glu Asn Phe Glu Leu
100 105 110

Phe Glu Lys Trp Ile Ser Glu Asn Thr Asp Lys Leu Glu Thr Ile Phe
115 120 125

Lys Gly Leu Lys Lys Leu Met Ile Val Trp Ile Ser Leu Asp Lys Gly
130 135 140

Lys Asp Asp Pro Gln Leu Ile Phe Glu Ser Met Asn Ser Lys Asp Ile
145 150 155 160

Glu Leu Thr Gln Thr Asp Leu Ile Arg Asn Tyr Ile Val Met Glu Thr
165 170 175

Glu Val Glu Lys Gln Glu Asp Phe Tyr Asn Gln Tyr Trp Arg Ala Met
180 185 190

Glu Glu Arg Phe Glu Gln Asn Glu Thr Leu Phe Asn Arg Phe Val Arg

ATA GAT TTT TAT AAG TTT AAA AAA AAT AAA TAC TTT CTT GAA AGG TTA 1209
Ile Asp Phe Tyr Lys Phe Lys Lys Asn Lys Tyr Phe Leu Glu Arg Leu
375 380 385

GAA AAT TTT GAC ACA AAA GAA CCG GTC GAT ACT CAA AAA TGC AAT ATA 1257
Glu Asn Phe Asp Thr Lys Glu Pro Val Asp Thr Gln Lys Cys Asn Ile
390 395 400

GAA CAT ATA ATG CCT CAA ACC CTT ACT CCA GAA TGG CAA AGG GAT TTG 1305
Glu His Ile Met Pro Gln Thr Leu Thr Pro Glu Trp Gln Arg Asp Leu
405 410 415

GGT GAA AAT TTT CAA GCA ATA CAC GAG AAA TAC CTC CAC ACA ATA GGG 1353
Gly Glu Asn Phe Gln Ala Ile His Glu Lys Tyr Leu His Thr Ile Gly
420 425 430 435

AAT CTC ACT CTA ACC GGT TAT AAC TCT AAG TAT AGC AAC AAT TCT TTC 1401
Asn Leu Thr Leu Thr Gly Tyr Asn Ser Lys Tyr Ser Asn Asn Ser Phe
440 445 450

CAA GAA AAA AGA GAT ATG GAG AAG GGC TTT AAA CAA AGC TCA TTA AAA 1449
Gln Glu Lys Arg Asp Met Glu Lys Gly Phe Lys Gln Ser Ser Leu Lys
455 460 465

CTC AAT CAA AGT TTG AAA GAT TTG GAA TCT TTT GGC GAA AAA GAG ATT 1497
Leu Asn Gln Ser Leu Lys Asp Leu Glu Ser Phe Gly Glu Lys Glu Ile
470 475 480

GAA AAA AGG GCT AGT GAT TTA GCG GAT TGG GCT TTA AAG ATT TGG ACT 1545
Glu Lys Arg Ala Ser Asp Leu Ala Asp Trp Ala Leu Lys Ile Trp Thr
485 490 495

TAC CCA ATT CTA GAG GCA GAA ACA TTA GAG GAA TAT AAA CCC AAA AAA 1593
Tyr Pro Ile Leu Glu Ala Glu Thr Leu Glu Glu Tyr Lys Pro Lys Lys
500 505 510 515

GAA AAG AAA GAA AAG AAA GAA AAA GAG GAG TAT AAA CTC AAG AAA GAA 1641
Glu Lys Lys Glu Lys Lys Glu Lys Glu Glu Tyr Lys Leu Lys Lys Glu
520 525 530

AAA AAG GTT TAT GAT TTA AGC TCT TAT AAG TTT AGC TCT GAT TCA AGG 1689
Lys Lys Val Tyr Asp Leu Ser Ser Tyr Lys Phe Ser Ser Asp Ser Arg
535 540 545

GAA TTG TTT GAT ATT TTA AGA GAA AAG ATT AAA GCT CTT GAT GAA AGG 1737
Glu Leu Phe Asp Ile Leu Arg Glu Lys Ile Lys Ala Leu Asp Glu Arg
550 555 560

ATA ACT GAA AAA TTT AAT CAA AAA TAT ATA GCT TAT AAG TTT TGT AAA 1785
Ile Thr Glu Lys Phe Asn Gln Lys Tyr Ile Ala Tyr Lys Phe Cys Lys
565 570 575

ATA AGT TTT GTG GAT ATT GTT GTG CAA GAA AAA GGC TTA AAA TTG TAT 1833
Ile Ser Phe Val Asp Ile Val Val Gln Glu Lys Gly Leu Lys Leu Tyr
580 585 590 595

CCT CAA CTT ATT TTT GAG AGC ATG AAC TCA AAA GAT ATC GAA CTC ACG 537
Pro Gln Leu Ile Phe Glu Ser Met Asn Ser Lys Asp Ile Glu Leu Thr
150 155 160

CAA ACG GAT TTG ATC AGA AAT TAT ATC GTA ATG GAA ACG GAG GTT GAA 585
Gln Thr Asp Leu Ile Arg Asn Tyr Ile Val Met Glu Thr Glu Val Glu
165 170 175

AAA CAG GAA GAC TTT TAT AAT CAA TAT TGG AGG GCT ATG GAG GAG AGA 633
Lys Gln Glu Asp Phe Tyr Asn Gln Tyr Trp Arg Ala Met Glu Glu Arg
180 185 190 195

TTT GAA CAA AAT GAA ACA TTG TTT AAT CGG TTT GTC CGG CAT TAT CTC 681
Phe Glu Gln Asn Glu Thr Leu Phe Asn Arg Phe Val Arg His Tyr Leu
200 205 210

ACG ATC AAA ATA GGA AAG ATT CCC AAT GAG AAA AGA GTT TAT GAA GCT 729
Thr Ile Lys Ile Gly Lys Ile Pro Asn Glu Lys Arg Val Tyr Glu Ala
215 220 225

TTC AAG GAT TAC CGG CAA AAA AAG GGG ATA GAA ATA GAG GAT TTA TTA 777
Phe Lys Asp Tyr Arg Gln Lys Lys Gly Ile Glu Ile Glu Asp Leu Leu
230 235 240

AAA GAT TTA CAA AAA TAC TGC GGG TAT TTT TGC CAG ATT GCA TTC AAA 825
Lys Asp Leu Gln Lys Tyr Cys Gly Tyr Phe Cys Gln Ile Ala Phe Lys
245 250 255

AAA GAA GAC GAT AAA GAT TTA AAC AAG GCT TTA AGT TTT TTG GTG AAT 873
Lys Glu Asp Asp Lys Asp Leu Asn Lys Ala Leu Ser Phe Leu Val Asn
260 265 270 275

TTA GAG ATG GAT GTG ATC TAT CCG CTA CTA CTA GAG CTT TAT AGC GAT 921
Leu Glu Met Asp Val Ile Tyr Pro Leu Leu Leu Glu Leu Tyr Ser Asp
280 285 290

TAT AAG GAT GGC GTT TTA TCC AAG CAG GAT TTT ATC CCT ATT ATC TAT 969
Tyr Lys Asp Gly Val Leu Ser Lys Gln Asp Phe Ile Pro Ile Ile Tyr
295 300 305

TTA ATA GAG AGC TAT ATT TGC AGA AGG GCG GTG TGT GGG CTT GGC ACA 1017
Leu Ile Glu Ser Tyr Ile Cys Arg Arg Ala Val Cys Gly Leu Gly Thr
310 315 320

AAT AGT CTC AAT AAA GTT TTT CCC TCT TTT ACA AAG CAC ATC CAA AAA 1065
Asn Ser Leu Asn Lys Val Phe Pro Ser Phe Thr Lys His Ile Gln Lys
325 330 335

GAT GAA TAT TTT AAA AGC CTA AAG GCG CAT TTT GTC TGT CTG ACA GAA 1113
Asp Glu Tyr Phe Lys Ser Leu Lys Ala His Phe Val Cys Leu Thr Glu
340 345 350 355

AAA CAA AGA TTT CCA AAC AAT GAC GAG TTT AAA AAG CTT TTT ATT ACG 1161
Lys Gln Arg Phe Pro Asn Asn Asp Glu Phe Lys Lys Leu Phe Ile Thr
360 365 370

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 49...2022
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1345: 



TGCGAACAAT TATGGGATGA TATTATAAAA ATTGGTGGGA ATGATAAG ATG AAC GGA 57
Met Asn Gly
1

CAT TTT ATC GGT TCT ATT TTG TAT GTG CTA GAT AGT AAT ACG CAC TCT 105
His Phe Ile Gly Ser Ile Leu Tyr Val Leu Asp Ser Asn Thr His Ser
5 10 15

AAC AAT ACA TTA CTC ATC ATT GAC GGC CAA CAA AGG CTC ACC ACT ATC 153
Asn Asn Thr Leu Leu Ile Ile Asp Gly Gln Gln Arg Leu Thr Thr Ile
20 25 30 35

ACG CTT TTA CTC ATC GCT TTA AGG AAT CAT CTA AGC GAA GAA GTT GAA 201
Thr Leu Leu Leu Ile Ala Leu Arg Asn His Leu Ser Glu Glu Val Glu
40 45 50

ATT TTG GAG AAA TTT TCG CGT AAA GAA ATA GAG AGC TAT CTT ATC AAC 249
Ile Leu Glu Lys Phe Ser Arg Lys Glu Ile Glu Ser Tyr Leu Ile Asn
55 60 65

AGC AAT AAG GAC GGC GAT AAG AAA TTC AGG CTC ATT CTT TCA GAG TCC 297
Ser Asn Lys Asp Gly Asp Lys Lys Phe Arg Leu Ile Leu Ser Glu Ser
70 75 80

GAT AAA GAC ACC TTG CTG TCT TTG ATT GAT AAA AAC AAA AGA AAG CCG 345
Asp Lys Asp Thr Leu Leu Ser Leu Ile Asp Lys Asn Lys Arg Lys Pro
85 90 95

AGC GAG CCT TCG GTA AAA ATA GTG GAA AAT TTT GAA TTG TTT GAA AAA 393
Ser Glu Pro Ser Val Lys Ile Val Glu Asn Phe Glu Leu Phe Glu Lys
100 105 110 115

TGG ATC AGT GAA AAC ACC GAC AAA CTA GAA ACG ATT TTT AAA GGA TTA 441
Trp Ile Ser Glu Asn Thr Asp Lys Leu Glu Thr Ile Phe Lys Gly Leu
120 125 130

AAA AAA CTC ATG ATA GTT TGG ATT TCT TTA GAT AAA GGA AAA GAT GAT 489
Lys Lys Leu Met Ile Val Trp Ile Ser Leu Asp Lys Gly Lys Asp Asp
135 140 145

CTT GAT TTT ATA ATG GAT TTT CAA TAC GCT AAA ACC ACC GGA TTG TCT 630
Leu Asp Phe Ile Met Asp Phe Gln Tyr Ala Lys Thr Thr Gly Leu Ser
185 190 195

TTA GCC ATT TTA TGG CAT TTA AAA AAG TTT AAA AAT GTT TAAAAGGAAT TT 681
Leu Ala Ile Leu Trp His Leu Lys Lys Phe Lys Asn Val
200 205 210

TATGTTAAGG CTTTTG 697

(2) INFORMATION FOR SEQ ID NO: 1344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1344:

Met Ser Tyr Phe Lys Asn Ala Phe Asn Gln Lys Ser Leu Ile Asp Asp
1 5 10 15

Ser Ser Val Tyr Leu Glu Pro Cys Ser Ser Ser Asn Phe Ile Glu Leu
20 25 30

Lys Arg Met His Tyr Asn Glu Glu Asn Thr Lys Lys Thr Trp Asp Ile
35 40 45

Ile Lys Ser Leu Asp Ser Val Ala Val Leu Leu Tyr Glu Lys Glu Ser
50 55 60

Asp Cys Phe Val Ile Val Lys Gln Phe Arg Pro Ala Ile Tyr Ala Arg
65 70 75 80

Arg Phe His Phe Lys Cys Asp Gln Asp Gln Thr Ile Asp Gly Tyr Thr
85 90 95

Tyr Glu Leu Cys Ala Gly Leu Val Asp Lys Ala Asn Lys Ser Leu Glu
100 105 110

Glu Ile Ala Cys Glu Glu Ala Leu Glu Glu Cys Gly Tyr Gln Ile Ser
115 120 125

Pro Lys Asn Leu Glu Thr Ile Gly Gln Phe Tyr Ser Ala Thr Gly Leu
130 135 140

Ser Gly Ser Leu Gln Thr Leu Tyr Tyr Ala Glu Val His Lys Asn Leu
145 150 155 160

Lys Val Ser Lys Gly Gly Gly Ile Asp Thr Glu Arg Ile Glu Val Leu
165 170 175

Phe Leu Glu Arg Ser Lys Ala Leu Asp Phe Ile Met Asp Phe Gln Tyr
180 185 190

Ala Lys Thr Thr Gly Leu Ser Leu Ala Ile Leu Trp His Leu Lys Lys

195 200 205 

Phe Lys Asn Val 
210 

(2) INFORMATION FOR SEQ ID NO: 1345: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2071 base pairs
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 34...669
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1343:

AAAAAATTCA TTTTATCTTT TAGAGGGTTT TTA ATG TCT TAT TTT AAG AAT GCT 54
Met Ser Tyr Phe Lys Asn Ala
1 5

TTC AAT CAA AAA TCT TTA ATA GAT GAT TCT AGT GTG TAT TTA GAG CCT 102
Phe Asn Gln Lys Ser Leu Ile Asp Asp Ser Ser Val Tyr Leu Glu Pro
10 15 20

TGT TCT AGC TCT AAT TTC ATA GAA TTA AAA CGC ATG CAT TAT AAT GAA 150
Cys Ser Ser Ser Asn Phe Ile Glu Leu Lys Arg Met His Tyr Asn Glu
25 30 35

GAG AAT ACT AAG AAA ACA TGG GAT ATT ATT AAG TCT TTA GAC AGC GTG 198
Glu Asn Thr Lys Lys Thr Trp Asp Ile Ile Lys Ser Leu Asp Ser Val
40 45 50 55

GCG GTT TTA CTC TAT GAA AAA GAA TCC GAT TGC TTT GTG ATT GTG AAA 246
Ala Val Leu Leu Tyr Glu Lys Glu Ser Asp Cys Phe Val Ile Val Lys
60 65 70

CAA TTC CGC CCA GCC ATT TAT GCG CGC CGT TTT CAT TTT AAG TGC GAT 294
Gln Phe Arg Pro Ala Ile Tyr Ala Arg Arg Phe His Phe Lys Cys Asp
75 80 85

CAA GAT CAA ACT ATT GAC GGA TAC ACT TAT GAA TTG TGC GCA GGG CTT 342
Gln Asp Gln Thr Ile Asp Gly Tyr Thr Tyr Glu Leu Cys Ala Gly Leu
90 95 100

GTG GAT AAA GCT AAT AAG AGT TTA GAA GAA ATC GCT TGC GAA GAA GCG 390
Val Asp Lys Ala Asn Lys Ser Leu Glu Glu Ile Ala Cys Glu Glu Ala
105 110 115

CTA GAA GAA TGC GGT TAT CAA ATT AGC CCT AAA AAT TTA GAA ACC ATA 438
Leu Glu Glu Cys Gly Tyr Gln Ile Ser Pro Lys Asn Leu Glu Thr Ile
120 125 130 135

GGC CAA TTT TAT AGC GCG ACT GGG TTG AGT GGG AGT TTG CAA ACG CTC 486
Gly Gln Phe Tyr Ser Ala Thr Gly Leu Ser Gly Ser Leu Gln Thr Leu
140 145 150

TAT TAC GCT GAA GTG CAT AAG AAT TTG AAA GTT TCA AAG GGT GGG GGG 534
Tyr Tyr Ala Glu Val His Lys Asn Leu Lys Val Ser Lys Gly Gly Gly
155 160 165

ATT GAT ACC GAA AGG ATT GAA GTG CTG TTT TTA GAG CGA TCA AAA GCT 582
Ile Asp Thr Glu Arg Ile Glu Val Leu Phe Leu Glu Arg Ser Lys Ala
170 175 180

Gly Lys Lys Phe Ser Gln Asp Asn Phe Lys Val Asp Tyr Asn Tyr Tyr
20 25 30

Leu Arg Lys Gln Asp Leu His Ile Ile Lys Thr Gln Asn Asp Leu Ser
35 40 45

Asn Ser Trp Tyr Leu Pro Pro Gln Lys Ala Pro Lys Glu His Ser Trp
50 55 60

Val Asp Phe Ala Lys Lys Tyr Leu Asn Met Met Asp Tyr Leu Gly Thr
65 70 75 80

Tyr Phe Leu Pro Phe Tyr His Ser Phe Thr Pro Ile Phe Gln Trp Tyr
85 90 95

His Pro Asn Ile Asn Pro Tyr Gln Arg Asn Glu Phe Lys Phe Gln Ile
100 105 110

Ser Phe Arg Val Pro Val Phe Arg His Ile Leu Trp Thr Lys Gly Thr
115 120 125

Leu Tyr Leu Ala Tyr Thr Gln Thr Asp Trp Phe Gln Ile Tyr Asn Asp
130 135 140

Pro Gln Ser Ala Pro Met Arg Met Met Asn Phe Met Pro Glu Leu Ile
145 150 155 160

Tyr Val Tyr Pro Ile Asn Phe Lys Pro Phe Gly Gly Lys Ile Gly Asn
165 170 175

Phe Ser Glu Ile Trp Ile Gly Trp Gln His Ile Ser Asn Gly Val Gly
180 185 190

Gly Ala Gln Cys Tyr Gln Pro Phe Asn Lys Glu Gly Asn Pro Glu Asn
195 200 205

Gln Phe Pro Gly Gln Pro Val Ile Val Lys Asp Tyr Asn Gly Gln Lys
210 215 220

Asp Val Arg Trp Gly Gly Cys Arg Ser Val Ser Ala Gly Gln Arg Pro
225 230 235 240

Val Phe Arg Leu Val Trp Glu Lys Gly Gly Leu Lys Ile Met Val Ala
245 250 255

Tyr Trp Pro Tyr Val Pro Tyr Asp Gln Ser Asn Pro Asn Leu Ile Asp
260 265 270

Tyr Met Gly Tyr Gly Asn Ala Lys Ile Asp Tyr Arg Arg Gly Arg His
275 280 285

His Phe Glu Leu Gln Leu Tyr Asp Ile Phe Thr Gln Tyr Trp Arg Tyr
290 295 300

Asp Arg Trp His Gly Ala Phe Arg Leu Gly Tyr Thr Tyr Arg Ile Asn
305 310 315 320

Pro Phe Val Gly Ile Tyr Ala Gln Trp Phe Asn Gly Tyr Gly Asp Gly
325 330 335

Leu Tyr Glu Tyr Asp Val Phe Ser Asn Arg Ile Gly Val Gly Ile Arg
340 345 350 

Leu Asn Pro 
355 

(2) INFORMATION FOR SEQ ID NO: 1343:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 697 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
Asn Pro Glu Asn Gln Phe Pro Gly Gln Pro Val Ile Val Lys Asp Tyr
205 210 215 220

AAT GGG CAA AAA GAT GTG CGC TGG GGG GGG TGT CGT TCG GTG AGC GCG 724
Asn Gly Gln Lys Asp Val Arg Trp Gly Gly Cys Arg Ser Val Ser Ala
225 230 235

GGG CAA CGC CCT GTG TTT CGT TTG GTG TGG GAA AAG GGA GGC CTA AAA 772
Gly Gln Arg Pro Val Phe Arg Leu Val Trp Glu Lys Gly Gly Leu Lys
240 245 250

ATC ATG GTC GCT TAT TGG CCC TAT GTC CCT TAT GAT CAA TCC AAT CCT 820
Ile Met Val Ala Tyr Trp Pro Tyr Val Pro Tyr Asp Gln Ser Asn Pro
255 260 265

AAT TTG ATT GAT TAC ATG GGG TAT GGT AAC GCT AAA ATT GAT TAC AGG 868
Asn Leu Ile Asp Tyr Met Gly Tyr Gly Asn Ala Lys Ile Asp Tyr Arg
270 275 280

AGA GGG CGC CAC CAT TTT GAA TTG CAG CTT TAT GAT ATT TTC ACG CAA 916
Arg Gly Arg His His Phe Glu Leu Gln Leu Tyr Asp Ile Phe Thr Gln
285 290 295 300

TAC TGG CGT TAT GAT CGC TGG CAT GGA GCT TTC CGC TTA GGC TAT ACC 964
Tyr Trp Arg Tyr Asp Arg Trp His Gly Ala Phe Arg Leu Gly Tyr Thr
305 310 315

TAT CGC ATT AAC CCT TTT GTG GGG ATT TAT GCG CAG TGG TTT AAC GGC 1012
Tyr Arg Ile Asn Pro Phe Val Gly Ile Tyr Ala Gln Trp Phe Asn Gly
320 325 330

TAT GGC GAT GGC TTG TAT GAA TAC GAT GTT TTT TCC AAT CGT ATA GGG 1060
Tyr Gly Asp Gly Leu Tyr Glu Tyr Asp Val Phe Ser Asn Arg Ile Gly
335 340 345

GTA GGA ATA CGC TTA AAC CCT TAAAAAAGCG TTCTTTTAYG CTATAATTAA GACC 1115
Val Gly Ile Arg Leu Asn Pro
350 355

AAAAA 1120

(2) INFORMATION FOR SEQ ID NO: 1342:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 355 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1342:

Met Lys Ser Ile Leu Leu Phe Met Ile Phe Val Val Cys Gln Leu Glu
1 5 10 15
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1341:

GAAATAAGGA TGCTTG ATG AAA AGC ATT TTG CTC TTT ATG ATT TTT GTA GTT 52
Met Lys Ser Ile Leu Leu Phe Met Ile Phe Val Val
1 5 10

TGT CAG TTA GAA GGC AAA AAA TTT TCA CAA GAT AAT TTT AAG GTG GAT 100
Cys Gln Leu Glu Gly Lys Lys Phe Ser Gln Asp Asn Phe Lys Val Asp
15 20 25

TAT AAC TAC TAT TTG CGC AAA CAG GAT TTG CAC ATC ATT AAA ACG CAA 148
Tyr Asn Tyr Tyr Leu Arg Lys Gln Asp Leu His Ile Ile Lys Thr Gln
30 35 40

AAC GAT TTG TCC AAT TCT TGG TAT CTC CCT CCA CAA AAA GCC CCC AAA 196
Asn Asp Leu Ser Asn Ser Trp Tyr Leu Pro Pro Gln Lys Ala Pro Lys
45 50 55 60

GAA CAT TCT TGG GTG GAT TTT GCT AAA AAA TAT TTA AAC ATG ATG GAT 244
Glu His Ser Trp Val Asp Phe Ala Lys Lys Tyr Leu Asn Met Met Asp
65 70 75

TAT CTA GGC ACT TAT TTT CTG CCT TTT TAT CAT AGT TTC ACC CCC ATT 292
Tyr Leu Gly Thr Tyr Phe Leu Pro Phe Tyr His Ser Phe Thr Pro Ile
80 85 90

TTT CAA TGG TAC CAC CCC AAT ATC AAC CCG TAT CAA CGC AAT GAG TTT 340
Phe Gln Trp Tyr His Pro Asn Ile Asn Pro Tyr Gln Arg Asn Glu Phe
95 100 105

AAG TTC CAA ATT AGT TTT AGA GTG CCT GTA TTT AGG CAT ATT CTT TGG 388
Lys Phe Gln Ile Ser Phe Arg Val Pro Val Phe Arg His Ile Leu Trp
110 115 120

ACT AAA GGC ACG CTG TAT TTA GCT TAT ACC CAA ACT GAC TGG TTT CAA 436
Thr Lys Gly Thr Leu Tyr Leu Ala Tyr Thr Gln Thr Asp Trp Phe Gln
125 130 135 140

ATT TAC AAT GAC CCC CAA TCC GCT CCC ATG CGA ATG ATG AAT TTC ATG 484
Ile Tyr Asn Asp Pro Gln Ser Ala Pro Met Arg Met Met Asn Phe Met
145 150 155

CCT GAA CTC ATT TAT GTT TAT CCT ATC AAT TTT AAA CCT TTT GGG GGT 532
Pro Glu Leu Ile Tyr Val Tyr Pro Ile Asn Phe Lys Pro Phe Gly Gly
160 165 170

AAA ATA GGG AAT TTT TCT GAA ATT TGG ATA GGT TGG CAG CAC ATT TCT 580
Lys Ile Gly Asn Phe Ser Glu Ile Trp Ile Gly Trp Gln His Ile Ser
175 180 185

AAT GGC GTG GGG GGC GCG CAA TGT TAC CAA CCT TTT AAT AAA GAA GGC 628
Asn Gly Val Gly Gly Ala Gln Cys Tyr Gln Pro Phe Asn Lys Glu Gly
190 195 200

AAT CCT GAA AAC CAG TTT CCA GGA CAA CCT GTA ATC GTT AAA GAT TAT 676

Phe Ser Val Leu Leu Ile Thr Gly Trp Ile Val Ser Arg Gly Ile Lys
145 150 155 160

Glu Gly Ile Glu Lys Leu Asn Leu Val Leu Met Pro Leu Leu Phe Ala
165 170 175

Thr Phe Phe Gly Leu Leu Phe Tyr Ala Met Ser Met Asp Ser Phe Ser
180 185 190

Lys Ala Phe His Phe Met Phe Asp Phe Lys Pro Lys Asp Leu Thr Ser
195 200 205

Gln Val Phe Thr Tyr Ser Leu Gly Gln Val Phe Phe Ser Leu Ser Ile
210 215 220

Gly Leu Gly Ile Asn Ile Thr Tyr Ala Ala Val Thr Asp Lys Thr Gln
225 230 235 240

Asn Leu Leu Lys Ser Thr Ile Trp Val Val Leu Ser Gly Ile Leu Ile
245 250 255

Ser Leu Val Ala Gly Leu Met Ile Phe Thr Phe Val Phe Glu Tyr Gly
260 265 270

Ala Asn Val Ser Gln Gly Thr Gly Leu Ile Phe Thr Ser Leu Pro Val
275 280 285

Val Phe Gly Gln Met Gly Ala Ile Gly Ile Leu Val Ser Ile Leu Phe
290 295 300

Leu Leu Ala Leu Ala Phe Ala Gly Ile Thr Ser Thr Val Ala Leu Leu
305 310 315 320

Glu Pro Ser Val Met Tyr Leu Thr Glu Arg Tyr Gln Tyr Ser Arg Phe
325 330 335

Lys Val Thr Trp Gly Leu Val Ala Leu Ile Phe Val Val Gly Val Val
340 345 350

Leu Ile Phe Ser Leu His Lys Asp Tyr Lys Asp Tyr Leu Thr Phe Phe
355 360 365

Glu Lys Ser Leu Phe Asp Trp Leu Asp Phe Ala Ser Ser Thr Ile Ile
370 375 380

Met Pro Leu Gly Gly Met Ala Thr Phe Ile Phe Met Gly Trp Val Leu
385 390 395 400

Lys Lys Glu Lys Leu Arg Leu Leu Ser Val His Phe Leu Gly Pro Lys
405 410 415

Leu Phe Ala Thr Trp Tyr Phe Leu Leu Lys Tyr Ile Thr Pro Leu Ile
420 425 430

Val Phe Ser Ile Trp Leu Ser Lys Ile Tyr
435 440 

(2) INFORMATION FOR SEQ ID NO: 1341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1120 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 17...1081
(D) OTHER INFORMATION: 
Gly Val Val Leu Ile Phe Ser Leu His Lys Asp Tyr Lys Asp Tyr Leu
350 355 360 365

ACT TTC TTT GAA AAA AGT CTT TTT GAT TGG TTG GAT TTT GCA TCA AGC 1155
Thr Phe Phe Glu Lys Ser Leu Phe Asp Trp Leu Asp Phe Ala Ser Ser
370 375 380

ACC ATT ATC ATG CCT TTA GGC GGG ATG GCA ACC TTT ATT TTT ATG GGT 1203
Thr Ile Ile Met Pro Leu Gly Gly Met Ala Thr Phe Ile Phe Met Gly
385 390 395

TGG GTT TTG AAA AAA GAA AAA TTG CGT CTT TTG AGC GTG CAC TTT TTA 1251
Trp Val Leu Lys Lys Glu Lys Leu Arg Leu Leu Ser Val His Phe Leu
400 405 410

GGC CCT AAA TTG TTT GCA ACT TGG TAT TTC TTG CTT AAA TAT ATC ACC 1299
Gly Pro Lys Leu Phe Ala Thr Trp Tyr Phe Leu Leu Lys Tyr Ile Thr
415 420 425

CCT TTA ATT GTG TTT TCC ATT TGG TTG AGC AAG ATT TAT TAAAATATTT GG 1350
Pro Leu Ile Val Phe Ser Ile Trp Leu Ser Lys Ile Tyr
430 435 440

CATGGGAAAA TTTTCTAAAT TAGGCT 1376

(2) INFORMATION FOR SEQ ID NO: 1340:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 442 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1340:



Met Gly Asn His Phe Ser Lys Leu Gly Phe Val Leu Ala Ala Leu Gly
1 5 10 15

Ser Ala Ile Gly Leu Gly His Ile Trp Arg Phe Pro Tyr Met Thr Gly
20 25 30

Val Ser Gly Gly Gly Ala Phe Val Leu Leu Phe Leu Phe Leu Ser Leu
35 40 45

Ser Val Gly Ala Ala Met Phe Ile Ala Glu Met Leu Leu Gly Gln Ser
50 55 60

Thr Gln Lys Asn Val Thr Glu Ala Phe Lys Glu Leu Asp Ile Asn Pro
65 70 75 80

Lys Lys Arg Trp Lys Tyr Ala Gly Leu Leu Leu Val Ser Gly Pro Leu
85 90 95

Ile Leu Thr Phe Tyr Gly Thr Ile Leu Gly Trp Val Leu Tyr Tyr Leu
100 105 110

Val Ser Val Ser Phe Asn Leu Pro Asn Asn Ile Gln Glu Ser Glu Gln
115 120 125

Ile Phe Thr Gln Thr Leu Gln Ser Ile Gly Leu Gln Ser Ile Gly Leu
130 135 140

130 135 140

ATA GGG CTT TTT AGC GTT TTA TTG ATA ACC GGA TGG ATT GTT TCT AGG 483
Ile Gly Leu Phe Ser Val Leu Leu Ile Thr Gly Trp Ile Val Ser Arg
145 150 155

GGG ATT AAA GAA GGC ATT GAA AAG CTC AAT TTG GTT TTA ATG CCC TTA 531
Gly Ile Lys Glu Gly Ile Glu Lys Leu Asn Leu Val Leu Met Pro Leu
160 165 170

CTC TTT GCT ACT TTT TTT GGT TTG CTT TTC TAT GCG ATG AGC ATG GAT 579
Leu Phe Ala Thr Phe Phe Gly Leu Leu Phe Tyr Ala Met Ser Met Asp
175 180 185

TCT TTT TCT AAA GCT TTT CAT TTC ATG TTT GAT TTC AAA CCA AAA GAT 627
Ser Phe Ser Lys Ala Phe His Phe Met Phe Asp Phe Lys Pro Lys Asp
190 195 200 205

TTG ACC TCT CAA GTG TTC ACT TAT TCC TTG GGG CAG GTT TTC TTT TCC 675
Leu Thr Ser Gln Val Phe Thr Tyr Ser Leu Gly Gln Val Phe Phe Ser
210 215 220

TTA AGC ATC GGT TTA GGG ATC AAT ATC ACT TAC GCT GCG GTT ACG GAT 723
Leu Ser Ile Gly Leu Gly Ile Asn Ile Thr Tyr Ala Ala Val Thr Asp
225 230 235

AAA ACG CAG AAT TTG CTT AAA AGC ACT ATT TGG GTG GTT TTA TCA GGA 771
Lys Thr Gln Asn Leu Leu Lys Ser Thr Ile Trp Val Val Leu Ser Gly
240 245 250

ATT CTA ATT TCT CTT GTG GCA GGA CTT ATG ATT TTC ACT TTT GTG TTT 819
Ile Leu Ile Ser Leu Val Ala Gly Leu Met Ile Phe Thr Phe Val Phe
255 260 265

GAA TAT GGG GCG AAT GTC TCA CAA GGC ACA GGG TTA ATC TTC ACT TCT 867
Glu Tyr Gly Ala Asn Val Ser Gln Gly Thr Gly Leu Ile Phe Thr Ser
270 275 280 285

TTA CCG GTG GTT TTT GGC CAA ATG GGA GCG ATA GGC ATT CTT GTT TCG 915
Leu Pro Val Val Phe Gly Gln Met Gly Ala Ile Gly Ile Leu Val Ser
290 295 300

ATT CTT TTC TTG CTC GCG CTC GCT TTT GCT GGC ATC ACT TCT ACG GTG 963
Ile Leu Phe Leu Leu Ala Leu Ala Phe Ala Gly Ile Thr Ser Thr Val
305 310 315

GCT TTA TTG GAG CCA AGC GTG ATG TAT CTT ACC GAA AGG TAT CAA TAC 1011
Ala Leu Leu Glu Pro Ser Val Met Tyr Leu Thr Glu Arg Tyr Gln Tyr
320 325 330

TCT CGT TTT AAG GTT ACT TGG GGT CTT GTA GCA CTA ATT TTT GTG GTA 1059
Ser Arg Phe Lys Val Thr Trp Gly Leu Val Ala Leu Ile Phe Val Val
335 340 345

GGC GTG GTG TTG ATT TTC TCG CTC CAT AAG GAT TAT AAA GAT TAT CTC 1107

260

(2) INFORMATION FOR SEQ ID NO: 1339:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1376 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 13...1338
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1339:

TGGTAGTTAA GA ATG GGT AAT CAT TTT TCT AAA TTA GGA TTT GTT TTA GCC 51
Met Gly Asn His Phe Ser Lys Leu Gly Phe Val Leu Ala
1 5 10

GCA TTA GGA AGC GCG ATA GGT TTA GGG CAT ATC TGG CGT TTC CCC TAC 99
Ala Leu Gly Ser Ala Ile Gly Leu Gly His Ile Trp Arg Phe Pro Tyr
15 20 25

ATG ACT GGG GTG AGT GGT GGG GGT GCT TTT GTT TTA TTG TTT TTA TTT 147
Met Thr Gly Val Ser Gly Gly Gly Ala Phe Val Leu Leu Phe Leu Phe
30 35 40 45

TTA TCT TTA AGC GTT GGC GCG GCG ATG TTT ATC GCT GAA ATG CTA TTA 195
Leu Ser Leu Ser Val Gly Ala Ala Met Phe Ile Ala Glu Met Leu Leu
50 55 60

GGA CAA AGC ACT CAA AAA AAT GTA ACA GAA GCT TTT AAA GAG CTT GAC 243
Gly Gln Ser Thr Gln Lys Asn Val Thr Glu Ala Phe Lys Glu Leu Asp
65 70 75

ATT AAC CCC AAA AAA CGC TGG AAA TAC GCA GGG CTT TTG CTT GTT TCT 291
Ile Asn Pro Lys Lys Arg Trp Lys Tyr Ala Gly Leu Leu Leu Val Ser
80 85 90

GGG CCA TTA ATA CTG ACT TTT TAC GGC ACG ATT TTA GGT TGG GTG CTT 339
Gly Pro Leu Ile Leu Thr Phe Tyr Gly Thr Ile Leu Gly Trp Val Leu
95 100 105

TAT TAT TTG GTG AGT GTT AGT TTT AAT TTG CCT AAC AAT ATC CAA GAA 387
Tyr Tyr Leu Val Ser Val Ser Phe Asn Leu Pro Asn Asn Ile Gln Glu
110 115 120 125

TCT GAA CAA ATT TTT ACT CAA ACT TTG CAG TCT ATA GGG CTA CAA TCC 435
Ser Glu Gln Ile Phe Thr Gln Thr Leu Gln Ser Ile Gly Leu Gln Ser

AAT GCC GTG CCG CCT TTA TTG AGT GTA GCG CTC GCG CAG GCG GTC TTT 824
Asn Ala Val Pro Pro Leu Leu Ser Val Ala Leu Ala Gln Ala Val Phe
245 250 255

GAC TTT TTA AAG GGG TAAGATGTTT AACAATAATG ACTTTAAGGA TTACAGAAAA T 880
Asp Phe Leu Lys Gly
260



(2) INFORMATION FOR SEQ ID NO: 1338:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 262 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1338:

Met Asp Glu Lys Ala Asn Leu Phe Lys Glu Tyr Leu Arg Leu Leu Asp
1 5 10 15

Leu Val Lys Pro Lys Ile Phe Val Phe Glu Asn Val Val Gly Leu Met
20 25 30

Ser Met Gln Lys Gly Gln Leu Phe Lys Gln Ile Cys Asn Ala Phe Lys
35 40 45

Glu Arg Asp Tyr Ile Leu Glu His Ala Ile Leu Asn Ala Leu Asp Tyr
50 55 60

Gly Val Pro Gln Met Arg Glu Arg Val Ile Leu Val Gly Val Leu Lys
65 70 75 80

Ser Phe Lys Gln Lys Phe Tyr Phe Pro Lys Pro Ile Lys Thr His Phe
85 90 95

Ser Leu Lys Asp Ala Leu Gly Asp Leu Pro Pro Ile Gln Ser Gly Glu
100 105 110

Asn Gly Asp Ala Leu Gly Tyr Leu Lys Asn Ala Asp Asn Val Phe Leu
115 120 125

Glu Phe Val Arg Asn Ser Lys Glu Leu Ser Glu His Ser Ser Pro Lys
130 135 140

Asn Asn Glu Lys Leu Ile Lys Ile Met Gln Thr Leu Lys Asp Gly Gln
145 150 155 160

Ser Lys Asp Asp Leu Pro Glu Ser Leu Arg Pro Lys Ser Gly Tyr Ile
165 170 175

Asn Thr Tyr Ala Lys Met Trp Trp Glu Lys Pro Ala Pro Thr Ile Thr
180 185 190

Arg Asn Phe Ser Thr Pro Ser Ser Ser Arg Cys Ile His Pro Arg Asp
195 200 205

Ser Arg Ala Leu Ser Ile Arg Glu Gly Ala Arg Leu Gln Ser Phe Pro
210 215 220

Asp Asn Tyr Lys Phe Cys Gly Ser Gly Ser Ala Lys Arg Leu Gln Ile
225 230 235 240

Gly Asn Ala Val Pro Pro Leu Leu Ser Val Ala Leu Ala Gln Ala Val

245 250 255 

Phe Asp Phe Leu Lys Gly 



880 
GTA AAA CCA AAA ATA TTT GTT TTT GAA AAT GTG GTG GGT TTA ATG TCT 152
Val Lys Pro Lys Ile Phe Val Phe Glu Asn Val Val Gly Leu Met Ser
20 25 30

ATG CAA AAA GGG CAA TTA TTC AAA CAA ATT TGT AAC GCT TTT AAA GAG 200
Met Gln Lys Gly Gln Leu Phe Lys Gln Ile Cys Asn Ala Phe Lys Glu
35 40 45

AGA GAT TAT ATT TTA GAG CAT GCC ATT TTG AAC GCC CTA GAT TAT GGT 248
Arg Asp Tyr Ile Leu Glu His Ala Ile Leu Asn Ala Leu Asp Tyr Gly
50 55 60 65

GTG CCT CAA ATG AGA GAA CGA GTG ATT TTA GTG GGC GTG CTT AAA AGC 296
Val Pro Gln Met Arg Glu Arg Val Ile Leu Val Gly Val Leu Lys Ser
70 75 80

TTT AAA CAA AAA TTT TAC TTC CCT AAA CCC ATA AAA ACG CAT TTT TCT 344
Phe Lys Gln Lys Phe Tyr Phe Pro Lys Pro Ile Lys Thr His Phe Ser
85 90 95

CTG AAA GAC GCT TTA GGG GAT TTA CCA CCC ATT CAA AGC GGT GAA AAT 392
Leu Lys Asp Ala Leu Gly Asp Leu Pro Pro Ile Gln Ser Gly Glu Asn
100 105 110

GGT GAT GCT TTA GGT TAT CTT AAA AAT GCG GAT AAT GTT TTT TTG GAA 440
Gly Asp Ala Leu Gly Tyr Leu Lys Asn Ala Asp Asn Val Phe Leu Glu
115 120 125

TTT GTG CGA AAT TCT AAA GAA TTA AGC GAA CAT AGC AGT CCT AAA AAC 488
Phe Val Arg Asn Ser Lys Glu Leu Ser Glu His Ser Ser Pro Lys Asn
130 135 140 145

AAT GAA AAA CTG ATA AAA ATC ATG CAA ACG CTA AAA GAC GGA CAG AGT 536
Asn Glu Lys Leu Ile Lys Ile Met Gln Thr Leu Lys Asp Gly Gln Ser
150 155 160

AAA GAT GAT TTG CCA GAA AGT CTG CGT CCC AAA AGT GGT TAT ATT AAT 584
Lys Asp Asp Leu Pro Glu Ser Leu Arg Pro Lys Ser Gly Tyr Ile Asn
165 170 175

ACC TAT GCC AAA ATG TGG TGG GAA AAA CCA GCC CCC ACC ATT ACA AGA 632
Thr Tyr Ala Lys Met Trp Trp Glu Lys Pro Ala Pro Thr Ile Thr Arg
180 185 190

AAT TTT TCT ACC CCA AGC AGT TCT AGG TGT ATC CAT CCA AGA GAC TCT 680
Asn Phe Ser Thr Pro Ser Ser Ser Arg Cys Ile His Pro Arg Asp Ser
195 200 205

AGA GCG TTA AGC ATT AGA GAG GGG GCA AGA TTG CAA AGC TTT CCT GAT 728
Arg Ala Leu Ser Ile Arg Glu Gly Ala Arg Leu Gln Ser Phe Pro Asp
210 215 220 225

AAT TAT AAA TTC TGT GGG AGT GGT AGC GCT AAA AGA TTG CAA ATT GGC 776
Asn Tyr Lys Phe Cys Gly Ser Gly Ser Ala Lys Arg Leu Gln Ile Gly
230 235 240

385 390 395 400

Leu Val Ile Asp Thr Pro Gln Asp Phe Ser Gly Ala Ile Ile Glu Arg
405 410 415

Leu Gly Lys Arg Lys Ala Glu Met Lys Ala Met Asn Pro Met Ser Asp
420 425 430

Gly Tyr Thr Arg Leu Glu Phe Glu Ile Pro Ala Arg Gly Leu Ile Gly
435 440 445

Tyr Arg Ser Glu Phe Leu Thr Asp Thr Lys Gly Glu Gly Val Met Asn
450 455 460

His Ser Phe Leu Glu Phe Arg Pro Phe Ser Gly Ser Val Glu Ser Arg
465 470 475 480

Lys Asn Gly Ala Leu Ile Ser Met Glu Asn Gly Glu Ala Thr Ala Phe
485 490 495

Ser Leu Phe Asn Ile Gln Glu Arg Gly Thr Leu Phe Ile Asn Pro Gln
500 505 510

Thr Lys Val Tyr Val Gly Met Val Ile Gly Glu His Ser Arg Asp Asn
515 520 525

Asp Leu Asp Val Asn Pro Ile Lys Ser Lys His Leu Thr Asn Met Arg
530 535 540

Ala Ser Gly Ser Asp Asp Ala Ile Lys Leu Thr Pro Pro Arg Thr Met
545 550 555 560

Val Leu Glu Arg Ala Leu Glu Trp Ile Glu Glu Asp Glu Ile Leu Glu
565 570 575

Val Thr Pro Leu Asn Leu Arg Ile Arg Lys Lys Ile Leu Asp Pro Asn

580 585 590 

Met Arg Lys Arg Ala Lys Lys 
595 

(2) INFORMATION FOR SEQ ID NO: 1337:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 880 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 54...839
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1337:

TTCTACTAGG CGGACCACCA TGCCAGAGCT ATTCTACCCT TGGCAAAAGA AAA ATG 56

Met 
1 

GAT GAA AAA GCG AAT CTG TTT AAA GAA TAT TTG CGG CTT TTA GAT TTA 104 
Asp Glu Lys Ala Asn Leu Phe Lys Glu Tyr Leu Arg Leu Leu Asp Leu 
5 10 15 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1336: 



Met Lys Asn Ile Arg Asn Ile Ala Val Ile Ala His Val Asp His Gly
1 5 10 15

Lys Thr Thr Leu Val Asp Gly Leu Leu Ser Gln Ser Gly Thr Phe Ser
20 25 30

Glu Arg Glu Lys Val Asp Glu Arg Val Met Asp Ser Asn Asp Leu Glu
35 40 45

Arg Glu Arg Gly Ile Thr Ile Leu Ser Lys Asn Thr Ala Ile Tyr Tyr
50 55 60

Lys Asp Thr Lys Ile Asn Ile Ile Asp Thr Pro Gly His Ala Asp Phe
65 70 75 80

Gly Gly Glu Val Glu Arg Val Leu Lys Met Val Asp Gly Val Leu Leu
85 90 95

Leu Val Asp Ala Gln Glu Gly Val Met Pro Gln Thr Lys Phe Val Val
100 105 110

Lys Lys Ala Leu Ser Phe Gly Ile Cys Pro Ile Val Val Val Asn Lys
115 120 125

Ile Asp Lys Pro Ala Ala Glu Pro Asp Arg Val Val Asp Glu Val Phe
130 135 140

Asp Leu Phe Val Ala Met Gly Ala Ser Asp Lys Gln Leu Asp Phe Pro
145 150 155 160

Val Val Tyr Ala Ala Ala Arg Asp Gly Tyr Ala Met Lys Ser Leu Asp
165 170 175

Asp Glu Lys Lys Asn Leu Glu Pro Leu Phe Glu Thr Ile Leu Glu His
180 185 190

Val Pro Ser Pro Ser Gly Ser Val Asp Glu Pro Leu Gln Met Gln Ile
195 200 205

Phe Thr Leu Asp Tyr Asp Asn Tyr Val Gly Lys Ile Gly Ile Ala Arg
210 215 220

Val Phe Asn Gly Ser Val Lys Lys Asn Glu Ser Val Leu Leu Met Lys
225 230 235 240

Ser Asp Gly Ser Lys Glu Asn Gly Arg Ile Thr Lys Leu Ile Gly Phe
245 250 255

Leu Gly Leu Ala Arg Thr Glu Ile Glu Asn Ala Tyr Ala Gly Asp Ile
260 265 270

Val Ala Ile Ala Gly Phe Asn Ala Met Asp Val Gly Asp Ser Val Val
275 280 285

Asp Pro Ala Asn Pro Met Pro Leu Asp Pro Met His Leu Glu Glu Pro
290 295 300

Thr Met Ser Val Tyr Phe Ala Val Asn Asp Ser Pro Leu Ala Gly Leu
305 310 315 320

Glu Gly Lys His Val Thr Ala Asn Lys Leu Lys Asp Arg Leu Leu Lys
325 330 335

Glu Met Gln Thr Asn Ile Ala Met Lys Cys Glu Glu Met Gly Glu Gly
340 345 350

Lys Phe Lys Val Ser Gly Arg Gly Glu Leu Gln Ile Thr Ile Leu Ala
355 360 365

Glu Asn Leu Arg Arg Glu Gly Phe Glu Phe Ser Ile Ser Arg Pro Glu
370 375 380

Val Ile Ile Lys Glu Glu Asn Gly Val Lys Cys Glu Pro Phe Glu His
AAA GCT GAG ATG AAA GCG ATG AAT CCC ATG AGT GAT GGC TAT ACA AGA 1353
Lys Ala Glu Met Lys Ala Met Asn Pro Met Ser Asp Gly Tyr Thr Arg
425 430 435

TTA GAA TTT GAA ATT CCT GCA AGA GGG CTT ATC GGT TAT AGG AGC GAG 1401
Leu Glu Phe Glu Ile Pro Ala Arg Gly Leu Ile Gly Tyr Arg Ser Glu
440 445 450

TTT TTA ACC GAC ACC AAG GGC GAA GGC GTG ATG AAT CAT AGC TTT TTA 1449
Phe Leu Thr Asp Thr Lys Gly Glu Gly Val Met Asn His Ser Phe Leu
455 460 465

GAA TTC CGC CCT TTC AGC GGG AGC GTG GAA TCG CGC AAA AAT GGG GCG 1497
Glu Phe Arg Pro Phe Ser Gly Ser Val Glu Ser Arg Lys Asn Gly Ala
470 475 480

CTA ATC AGC ATG GAA AAT GGC GAA GCG ACC GCT TTT TCC CTT TTC AAT 1545
Leu Ile Ser Met Glu Asn Gly Glu Ala Thr Ala Phe Ser Leu Phe Asn
485 490 495 500

ATC CAA GAA AGA GGC ACG CTT TTT ATC AAC CCC CAA ACG AAG GTT TAT 1593
Ile Gln Glu Arg Gly Thr Leu Phe Ile Asn Pro Gln Thr Lys Val Tyr
505 510 515

GTG GGC ATG GTC ATT GGC GAG CAC AGC CGG GAT AAT GAT TTA GAT GTC 1641
Val Gly Met Val Ile Gly Glu His Ser Arg Asp Asn Asp Leu Asp Val
520 525 530

AAT CCT ATT AAA TCC AAG CAT TTA ACC AAC ATG AGA GCG AGC GGG AGC 1689
Asn Pro Ile Lys Ser Lys His Leu Thr Asn Met Arg Ala Ser Gly Ser
535 540 545

GAT GAT GCG ATC AAA CTC ACC CCG CCT AGG ACT ATG GTG TTA GAA AGA 1737
Asp Asp Ala Ile Lys Leu Thr Pro Pro Arg Thr Met Val Leu Glu Arg
550 555 560

GCG TTA GAA TGG ATT GAA GAA GAT GAG ATT TTG GAA GTT ACC CCC TTG 1785
Ala Leu Glu Trp Ile Glu Glu Asp Glu Ile Leu Glu Val Thr Pro Leu
565 570 575 580

AAT TTA AGG ATC AGG AAA AAG ATT TTA GAC CCT AAC ATG AGG AAA AGG 1833
Asn Leu Arg Ile Arg Lys Lys Ile Leu Asp Pro Asn Met Arg Lys Arg
585 590 595

GCG AAA AAA TAAATAGAAT TTTTTGGAAT GCATGCCAAT TTATTCAACC AA 1884
Ala Lys Lys



(2) INFORMATION FOR SEQ ID NO: 1336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
Ser Gly Ser Val Asp Glu Pro Leu Gln Met Gln Ile Phe Thr Leu Asp
200 205 210

TAT GAC AAT TAT GTG GGC AAA ATC GGT ATC GCT AGG GTG TTT AAT GGC 729
Tyr Asp Asn Tyr Val Gly Lys Ile Gly Ile Ala Arg Val Phe Asn Gly
215 220 225

TCG GTT AAA AAG AAT GAA AGC GTG CTG TTG ATG AAA AGC GAT GGG AGT 777
Ser Val Lys Lys Asn Glu Ser Val Leu Leu Met Lys Ser Asp Gly Ser
230 235 240

AAA GAA AAT GGC CGT ATC ACT AAG CTT ATA GGT TTT TTA GGG CTG GCT 825
Lys Glu Asn Gly Arg Ile Thr Lys Leu Ile Gly Phe Leu Gly Leu Ala
245 250 255 260

AGG ACT GAG ATT GAA AAC GCT TAT GCG GGC GAT ATT GTA GCG ATT GCC 873
Arg Thr Glu Ile Glu Asn Ala Tyr Ala Gly Asp Ile Val Ala Ile Ala
265 270 275

GGG TTT AAT GCA ATG GAT GTG GGC GAT AGC GTC GTT GAT CCT GCT AAC 921
Gly Phe Asn Ala Met Asp Val Gly Asp Ser Val Val Asp Pro Ala Asn
280 285 290

CCC ATG CCT TTA GAT CCC ATG CAT TTA GAA GAG CCT ACG ATG AGC GTG 969
Pro Met Pro Leu Asp Pro Met His Leu Glu Glu Pro Thr Met Ser Val
295 300 305

TAT TTT GCT GTC AAT GAT TCA CCC TTA GCC GGG TTA GAA GGA AAG CAT 1017
Tyr Phe Ala Val Asn Asp Ser Pro Leu Ala Gly Leu Glu Gly Lys His
310 315 320

GTT ACT GCT AAT AAA TTG AAA GAC AGG CTC TTA AAA GAA ATG CAA ACC 1065
Val Thr Ala Asn Lys Leu Lys Asp Arg Leu Leu Lys Glu Met Gln Thr
325 330 335 340

AAT ATC GCT ATG AAA TGC GAA GAA ATG GGC GAG GGC AAG TTT AAA GTG 1113
Asn Ile Ala Met Lys Cys Glu Glu Met Gly Glu Gly Lys Phe Lys Val
345 350 355

AGT GGG CGT GGG GAA TTG CAA ATC ACT ATT TTA GCT GAA AAC TTG CGC 1161
Ser Gly Arg Gly Glu Leu Gln Ile Thr Ile Leu Ala Glu Asn Leu Arg
360 365 370

CGT GAA GGG TTT GAA TTT AGC ATT TCA CGC CCT GAA GTC ATC ATT AAA 1209
Arg Glu Gly Phe Glu Phe Ser Ile Ser Arg Pro Glu Val Ile Ile Lys
375 380 385

GAA GAA AAT GGC GTT AAA TGC GAG CCT TTT GAG CAT TTA GTG ATT GAC 1257
Glu Glu Asn Gly Val Lys Cys Glu Pro Phe Glu His Leu Val Ile Asp
390 395 400

ACG CCC CAA GAT TTT AGT GGG GCT ATC ATT GAG AGA TTG GGC AAA AGA 1305
Thr Pro Gln Asp Phe Ser Gly Ala Ile Ile Glu Arg Leu Gly Lys Arg
405 410 415 420
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1335:

TATTGTGTTA TACTTCTAAT TTCAATTTTG CTTGTTAGGA CATTT ATG AAA AAT ATT 57
Met Lys Asn Ile
1

AGA AAT ATC GCT GTA ATC GCG CAT GTT GAT CAT GGG AAA ACC ACT CTA 105
Arg Asn Ile Ala Val Ile Ala His Val Asp His Gly Lys Thr Thr Leu
5 10 15 20

GTA GAT GGC TTA CTT TCT CAA TCT GGC ACA TTT AGT GAG AGG GAA AAA 153
Val Asp Gly Leu Leu Ser Gln Ser Gly Thr Phe Ser Glu Arg Glu Lys
25 30 35

GTG GAT GAA AGG GTG ATG GAT AGC AAT GAT TTG GAA AGA GAA AGA GGG 201
Val Asp Glu Arg Val Met Asp Ser Asn Asp Leu Glu Arg Glu Arg Gly
40 45 50

ATT ACT ATC CTG TCT AAA AAC ACC GCT ATT TAT TAC AAA GAC ACT AAA 249
Ile Thr Ile Leu Ser Lys Asn Thr Ala Ile Tyr Tyr Lys Asp Thr Lys
55 60 65

ATC AAT ATC ATT GAC ACT CCC GGG CAT GCT GAT TTT GGG GGC GAA GTG 297
Ile Asn Ile Ile Asp Thr Pro Gly His Ala Asp Phe Gly Gly Glu Val
70 75 80

GAG CGC GTT TTA AAA ATG GTG GAT GGG GTG TTG CTT TTA GTG GAC GCT 345
Glu Arg Val Leu Lys Met Val Asp Gly Val Leu Leu Leu Val Asp Ala
85 90 95 100

CAA GAA GGG GTC ATG CCT CAA ACT AAA TTC GTG GTT AAA AAG GCT TTG 393
Gln Glu Gly Val Met Pro Gln Thr Lys Phe Val Val Lys Lys Ala Leu
105 110 115

AGT TTT GGG ATT TGC CCT ATT GTG GTG GTG AAT AAA ATT GAT AAG CCT 441
Ser Phe Gly Ile Cys Pro Ile Val Val Val Asn Lys Ile Asp Lys Pro
120 125 130

GCC GCT GAA CCG GAC AGA GTG GTG GAT GAA GTT TTT GAC TTG TTC GTA 489
Ala Ala Glu Pro Asp Arg Val Val Asp Glu Val Phe Asp Leu Phe Val
135 140 145

GCC ATG GGG GCT AGC GAT AAG CAA TTG GAT TTC CCT GTG GTG TAT GCC 537
Ala Met Gly Ala Ser Asp Lys Gln Leu Asp Phe Pro Val Val Tyr Ala
150 155 160

GCC GCA CGA GAT GGC TAT GCG ATG AAA AGT TTA GAC GAT GAA AAG AAA 585
Ala Ala Arg Asp Gly Tyr Ala Met Lys Ser Leu Asp Asp Glu Lys Lys
165 170 175 180

AAT TTA GAG CCT TTG TTT GAA ACG ATT TTA GAG CAT GTG CCA AGC CCT 633
Asn Leu Glu Pro Leu Phe Glu Thr Ile Leu Glu His Val Pro Ser Pro
185 190 195

AGC GGG AGC GTT GAT GAG CCT TTG CAA ATG CAA ATT TTC ACG CTT GAT 681
260 265 270

Glu Met Ala Leu Lys Phe His Ile Arg Ser Leu Ile Asp Phe Asp Lys
275 280 285

Pro Phe Lys Asn Leu Met Thr Lys Ala Val Gly Leu Ala Leu Lys Lys
290 295 300

Thr Pro Asn Lys Asp Gln Lys Ile Ser Cys Phe Tyr Gln Asn Ser Lys
305 310 315 320

Phe Lys Arg Ser Pro Ser Ser Phe Phe Asn Asn Pro Lys Lys Ile Phe
325 330 335

Asn Ile His Cys Ser Ser Lys Glu Asn Lys Ile Leu Asp His Leu Phe
340 345 350

Ser Leu Pro His Met Thr Leu Lys Asn Asn Ala His Phe Ala Leu Gly
355 360 365

Ile Val Thr Gly Asn Asn Lys Glu Lys Leu His Pro Lys Gln Glu Lys
370 375 380

Asn Thr Ile Pro Ile Phe Arg Gly Ser Asp Ile Leu Lys Asp Gly Leu
385 390 395 400

Lys Ala Pro Ser Gln Phe Ile Asn Ala Gly Leu Lys Asp Cys Gln Gln
405 410 415

Val Ala Pro Leu Ser Leu Tyr Gln Ala Arg Glu Lys Ile Val Tyr Lys
420 425 430

Phe Ile Ser Ser Lys Leu Val Phe Phe Tyr Asp Asn Lys Gln Arg Leu
435 440 445

Phe Leu Asn Ser Ala Asn Met Phe Val Leu Lys Glu Asn Phe Pro Ile
450 455 460

Asn Ala His Ala Leu Lys Glu Leu Leu Asn Ser Asp Leu Met Gln Phe
465 470 475 480

Ile Phe Glu Ser Leu Phe Lys Thr His Lys Ile Leu Arg Lys Asp Leu
485 490 495

Glu Cys Leu Pro Leu Phe Val Gln Phe Ile Asn Asp Asn Phe Asp Glu
500 505 510

Lys Phe Tyr Leu Lys Asn Leu Gly Ile Glu Lys Lys Asp Pro Lys His
515 520 525

Phe Thr Ile Arg Lys Asn His Ala Cys Cys Leu Ser Phe Gly Phe Arg
530 535 540

Gly
545



(2) INFORMATION FOR SEQ ID NO: 1335:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1884 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 46...1842
(D) OTHER INFORMATION: 
GAC CCT AAA CAT TTC ACC ATC AGG AAA AAT CAT GCA TGT TGC TTG TCT 1633
Asp Pro Lys His Phe Thr Ile Arg Lys Asn His Ala Cys Cys Leu Ser
525 530 535 540

TTT GGC TTT AGG GGA TAATCTCATC ACGCTTAGCC TTTTAAAAGA AATCGCTTTC A 1689
Phe Gly Phe Arg Gly
545

AACAGCAA 1697

(2) INFORMATION FOR SEQ ID NO: 1334:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 545 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1334: 

Met Pro Ser Asn Ala Leu Leu Ile Glu Glu Ile Thr His Leu Ile Asn
1 5 10 15

Val Ser His Ser Ser Val His Asn Trp Ile Lys Thr Asn Leu Leu Glu
20 25 30

Lys Leu Glu Ile Asp His Lys Ile Tyr Val Lys Thr Ser Ser Phe Leu
35 40 45

Asp Phe Cys Arg Asn His Leu Gly Lys Asn Lys Leu Asn Lys Tyr Ala
50 55 60

Asn Lys Ser Leu Lys Gly Val His Asn His Gln Glu Leu Ile Leu Lys
65 70 75 80

Tyr Leu Glu Ile Leu Glu Asn Ser Ser Asp Leu Glu Lys Leu Gly Ser
85 90 95

Tyr Tyr Glu Glu Glu Leu Ser Asn Ala Thr Arg Asn Leu Glu Gly Ile
100 105 110

Tyr Tyr Thr Pro Asn Arg Ile Val Glu Gln Leu Phe Thr Leu Pro Lys
115 120 125

Asp Phe Asp Val Ser Gln Ala Ile Phe Cys Asp Pro Ala Val Gly Ser
130 135 140

Gly Asn Phe Ile Met His Ala Leu Lys Leu Gly Phe Lys Val Glu Asn
145 150 155 160

Ile Tyr Gly Tyr Asp Thr Asp Ala Phe Ala Val Ala Leu Thr Lys Lys
165 170 175

Arg Ile Lys Glu Arg Tyr His Leu Asp Cys Leu Asn Ile Val Gln Lys
180 185 190

Asp Phe Leu Asn Leu Lys His Thr Pro Gln Phe Asp Cys Ile Phe Thr
195 200 205

Asn Pro Pro Trp Gly Lys Lys Tyr Asn Gln Asn Gln Lys Glu Asn Phe
210 215 220

Lys Gln Gln Phe Asn Leu Ser Gln Ser Leu Asp Ser Ala Ser Leu Phe
225 230 235 240

Phe Ile Ala Ser Leu Asn Cys Leu Lys Glu Asn Ala His Leu Gly Leu
245 250 255

Leu Leu Pro Glu Ser Cys Leu Asn Ile Asp Ala Phe Lys Lys Met Arg
Ala Leu Lys Lys Thr Pro Asn Lys Asp Gln Lys Ile Ser Cys Phe Tyr
305 310 315

CAA AAT AGC AAG TTC AAA CGC TCG CCC TCT TCT TTT TTT AAC AAC CCT 1009
Gln Asn Ser Lys Phe Lys Arg Ser Pro Ser Ser Phe Phe Asn Asn Pro
320 325 330

AAA AAG ATT TTT AAT ATC CAT TGC TCT AGC AAA GAA AAT AAA ATT TTA 1057
Lys Lys Ile Phe Asn Ile His Cys Ser Ser Lys Glu Asn Lys Ile Leu
335 340 345

GAC CAC CTT TTT TCC CTC CCT CAT ATG ACT TTA AAA AAT AAC GCT CAT 1105
Asp His Leu Phe Ser Leu Pro His Met Thr Leu Lys Asn Asn Ala His
350 355 360

TTT GCT TTA GGG ATT GTT ACA GGC AAC AAT AAA GAA AAA TTA CAC CCC 1153
Phe Ala Leu Gly Ile Val Thr Gly Asn Asn Lys Glu Lys Leu His Pro
365 370 375 380

AAA CAA GAA AAA AAT ACC ATT CCC ATT TTT AGG GGT TCA GAT ATT TTA 1201
Lys Gln Glu Lys Asn Thr Ile Pro Ile Phe Arg Gly Ser Asp Ile Leu
385 390 395

AAA GAC GGA TTA AAA GCC CCT AGC CAA TTC ATT AAC GCT GGT TTA AAA 1249
Lys Asp Gly Leu Lys Ala Pro Ser Gln Phe Ile Asn Ala Gly Leu Lys
400 405 410

GAC TGC CAG CAA GTC GCC CCC TTA AGC CTT TAT CAA GCT AGA GAA AAA 1297
Asp Cys Gln Gln Val Ala Pro Leu Ser Leu Tyr Gln Ala Arg Glu Lys
415 420 425

ATC GTG TAT AAA TTC ATT TCT TCA AAG CTT GTC TTT TTT TAT GAC AAT 1345
Ile Val Tyr Lys Phe Ile Ser Ser Lys Leu Val Phe Phe Tyr Asp Asn
430 435 440

AAG CAA CGC CTT TTT TTA AAC AGC GCG AAC ATG TTT GTT TTA AAA GAA 1393
Lys Gln Arg Leu Phe Leu Asn Ser Ala Asn Met Phe Val Leu Lys Glu
445 450 455 460

AAT TTC CCT ATC AAC GCT CAT GCA TTA AAA GAA TTG TTA AAC AGC GAT 1441
Asn Phe Pro Ile Asn Ala His Ala Leu Lys Glu Leu Leu Asn Ser Asp
465 470 475

TTA ATG CAA TTC ATT TTT GAA TCG CTT TTT AAA ACG CAT AAA ATC TTA 1489
Leu Met Gln Phe Ile Phe Glu Ser Leu Phe Lys Thr His Lys Ile Leu
480 485 490

AGA AAA GAT TTG GAA TGT TTG CCC CTA TTT GTG CAA TTT ATT AAC GAT 1537
Arg Lys Asp Leu Glu Cys Leu Pro Leu Phe Val Gln Phe Ile Asn Asp
495 500 505

AAT TTT GAT GAA AAA TTT TAT TTA AAA AAT TTA GGG ATA GAA AAA AAA 1585
Asn Phe Asp Glu Lys Phe Tyr Leu Lys Asn Leu Gly Ile Glu Lys Lys
510 515 520
80 85 90

AAG TTG GGT TCT TAT TAT GAA GAA GAG CTT TCT AAC GCC ACC AGA AAT 337
Lys Leu Gly Ser Tyr Tyr Glu Glu Glu Leu Ser Asn Ala Thr Arg Asn
95 100 105

TTA GAA GGC ATT TAC TAC ACT CCT AAC AGG ATA GTA GAA CAA CTT TTC 385
Leu Glu Gly Ile Tyr Tyr Thr Pro Asn Arg Ile Val Glu Gln Leu Phe
110 115 120

ACC CTC CCT AAA GAT TTT GAT GTC TCT CAA GCG ATT TTT TGC GAT CCG 433
Thr Leu Pro Lys Asp Phe Asp Val Ser Gln Ala Ile Phe Cys Asp Pro
125 130 135 140

GCT GTG GGG AGT GGG AAT TTT ATC ATG CAT GCT TTA AAA CTG GGT TTT 481
Ala Val Gly Ser Gly Asn Phe Ile Met His Ala Leu Lys Leu Gly Phe
145 150 155

AAG GTT GAA AAC ATT TAT GGC TAT GAT ACG GAC GCT TTT GCT GTC GCT 529
Lys Val Glu Asn Ile Tyr Gly Tyr Asp Thr Asp Ala Phe Ala Val Ala
160 165 170

TTG ACT AAA AAG CGT ATT AAA GAG CGT TAT CAT TTA GAT TGC CTT AAT 577
Leu Thr Lys Lys Arg Ile Lys Glu Arg Tyr His Leu Asp Cys Leu Asn
175 180 185

ATT GTG CAA AAA GAT TTT TTA AAT TTA AAA CAC ACC CCG CAA TTT GAT 625
Ile Val Gln Lys Asp Phe Leu Asn Leu Lys His Thr Pro Gln Phe Asp
190 195 200

TGC ATT TTC ACT AAC CCG CCA TGG GGC AAG AAA TAC AAT CAA AAC CAA 673
Cys Ile Phe Thr Asn Pro Pro Trp Gly Lys Lys Tyr Asn Gln Asn Gln
205 210 215 220

AAA GAA AAT TTT AAA CAG CAA TTC AAC CTC TCT CAA AGC CTA GAT AGC 721
Lys Glu Asn Phe Lys Gln Gln Phe Asn Leu Ser Gln Ser Leu Asp Ser
225 230 235

GCG TCG CTC TTT TTT ATA GCG AGT TTG AAT TGT TTA AAA GAA AAC GCT 769
Ala Ser Leu Phe Phe Ile Ala Ser Leu Asn Cys Leu Lys Glu Asn Ala
240 245 250

CAT TTG GGG TTA TTA TTA CCC GAA AGT TGT TTG AAT ATT GAT GCG TTT 817
His Leu Gly Leu Leu Leu Pro Glu Ser Cys Leu Asn Ile Asp Ala Phe
255 260 265

AAA AAA ATG CGA GAA ATG GCT TTA AAG TTT CAC ATT AGA AGC CTG ATT 865
Lys Lys Met Arg Glu Met Ala Leu Lys Phe His Ile Arg Ser Leu Ile
270 275 280

GAT TTT GAC AAA CCT TTT AAA AAT CTA ATG ACT AAG GCT GTG GGT TTG 913
Asp Phe Asp Lys Pro Phe Lys Asn Leu Met Thr Lys Ala Val Gly Leu
285 290 295 300

GCG CTT AAA AAA ACC CCT AAT AAG GAT CAA AAA ATC TCA TGC TTT TAT 961
65 70 75 80

Pro Asn Asn Ile Gly Ser Val Val Gln Val Lys Leu Pro Gly Ile Ile
85 90 95

Gln Ile Leu Ile Lys Glu Lys Asn Glu Asn Ala Val Leu Phe Asn Arg
100 105 110

Cys Asp Ser Leu Glu Leu Phe Gln Lys Glu Asp Ser Ile Ala Gln Glu
115 120 125

Pro Lys Lys Asp Glu Arg Glu Ser Lys Glu Trp Leu Asp Ser Lys Glu
130 135 140

Ala Leu Phe Ser Asn Ser Lys Asn Arg Ala Ile Leu Glu Asn Leu His
145 150 155 160

Lys Ser



(2) INFORMATION FOR SEQ ID NO: 1333:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1697 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 14...1648
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1333:



TTTAGGGGGG TAA ATG CCT TCA AAC GCT CTT TTG ATT GAA GAA ATC ACT 49
Met Pro Ser Asn Ala Leu Leu Ile Glu Glu Ile Thr
1 5 10

CAT TTA ATC AAT GTT TCT CAT AGT AGC GTG CAT AAT TGG ATC AAA ACC 97
His Leu Ile Asn Val Ser His Ser Ser Val His Asn Trp Ile Lys Thr
15 20 25

AAT CTT TTA GAG AAA CTA GAA ATT GAT CAT AAA ATT TAT GTG AAA ACG 145
Asn Leu Leu Glu Lys Leu Glu Ile Asp His Lys Ile Tyr Val Lys Thr
30 35 40

AGT TCT TTT TTA GAT TTT TGC CGC AAC CAT TTA GGG AAA AAC AAG CTT 193
Ser Ser Phe Leu Asp Phe Cys Arg Asn His Leu Gly Lys Asn Lys Leu
45 50 55 60

AAC AAA TAC GCT AAC AAA TCC TTA AAA GGC GTG CAT AAC CAT CAA GAA 241
Asn Lys Tyr Ala Asn Lys Ser Leu Lys Gly Val His Asn His Gln Glu
65 70 75

TTG ATT TTA AAA TAC CTA GAA ATA TTA GAA AAT AGC TCT GAT CTA GAA 289
Leu Ile Leu Lys Tyr Leu Glu Ile Leu Glu Asn Ser Ser Asp Leu Glu
CTC CCT AAA GAA ATT CAT GAA AAA TTC GGC GCG TTA GTG CTA TTT GTT 196
Leu Pro Lys Glu Ile His Glu Lys Phe Gly Ala Leu Val Leu Phe Val
45 50 55

TTA GCC GGA TAC ACC TTA GAA AGC TTG ATA ATT GAT ACA AAA AGC GTG 244
Leu Ala Gly Tyr Thr Leu Glu Ser Leu Ile Ile Asp Thr Lys Ser Val
60 65 70

CAA TTT GAA GCC GGG TTT GGC CCT AAT AAC ATT GGC AGT GTG GTT CAA 292
Gln Phe Glu Ala Gly Phe Gly Pro Asn Asn Ile Gly Ser Val Val Gln
75 80 85

GTA AAA CTT CCT GGC ATC ATT CAA ATC CTT ATC AAA GAA AAA AAT GAA 340
Val Lys Leu Pro Gly Ile Ile Gln Ile Leu Ile Lys Glu Lys Asn Glu
90 95 100 105

AAT GCC GTT TTA TTC AAT CGT TGC GAT TCG CTT GAA TTG TTT CAA AAA 388
Asn Ala Val Leu Phe Asn Arg Cys Asp Ser Leu Glu Leu Phe Gln Lys
110 115 120

GAA GAT TCA ATC GCG CAA GAG CCA AAA AAA GAC GAG CGG GAG TCT AAA 436
Glu Asp Ser Ile Ala Gln Glu Pro Lys Lys Asp Glu Arg Glu Ser Lys
125 130 135

GAA TGG CTG GAT TCT AAA GAG GCT CTT TTT TCC AAT TCC AAA AAC CGC 484
Glu Trp Leu Asp Ser Lys Glu Ala Leu Phe Ser Asn Ser Lys Asn Arg
140 145 150

GCG ATT TTA GAA AAT CTG CAC AAA AGC TAAAGGAATC ATTGATGAGC GTTTTGA 538
Ala Ile Leu Glu Asn Leu His Lys Ser
155 160

AATTGCATGT AAAAGTCTTT CGTTTTGAAA CCAATA 574

(2) INFORMATION FOR SEQ ID NO: 1332:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 162 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1332: 



Met Gln Arg Glu Leu Arg Leu Leu Asn Asn Lys His Cys Met Glu Tyr
1 5 10 15

Leu Gln Phe Leu Ser Lys Asn His Leu Ser Phe Asn Leu Leu Cys Glu
20 25 30

Arg Asp Ala Ile Asp Phe Ser Pro Lys Leu Pro Lys Glu Ile His Glu
35 40 45

Lys Phe Gly Ala Leu Val Leu Phe Val Leu Ala Gly Tyr Thr Leu Glu
50 55 60

Ser Leu Ile Ile Asp Thr Lys Ser Val Gln Phe Glu Ala Gly Phe Gly
Lys Ile Glu Phe Lys Glu Arg Lys Glu Leu Arg Phe Lys Glu Gln Glu
865 870 875 880

Glu Ile Thr Thr Lys Gln Lys Gln Ala Lys Glu Ile Leu Glu Lys Ile
885 890 895

Pro Asp Gln Gln Asp Lys Glu Ile Gln Lys Phe Tyr Lys Asp Phe Ser
900 905 910

Lys Leu Leu Gln Thr Pro Thr Thr Ser Gln Asn Phe Glu Glu Ile Ser
915 920 925

His Ser Tyr Asp Ala Ile Ile Ser Gln Leu Lys Gln His Lys Glu Gln
930 935 940

Thr Thr His Leu Leu Asn Lys Tyr Asp Asn Asp Leu Ser Tyr Ala Ile
945 950 955 960

Thr Asn Lys Arg Leu His Lys His Leu Met Glu Gln Asn Ile Ser Asn
965 970 975

Ser Ala Gly Ile Phe Thr Leu Leu Ser Ala Leu Lys Lys Ala Ile Asp
980 985 990

Ala Arg Ile Phe Lys Arg Gln Glu Ile Leu Asn Glu Glu Tyr Tyr Leu
995 1000 1005

Lys Asn Ala Ile Lys Ala Glu Leu Asn Asn Ala Phe Lys Lys Asp Pro
1010 1015 1020

Ser Leu Lys Asp Leu Glu Lys Glu Lys Glu Leu Ile Ile Gln Thr Leu
1025 1030 1035 1040

Phe Asn Glu Leu Thr Gln Asn His His Gln Gly Asn Pro His Ala
1045 1050 1055

(2) INFORMATION FOR SEQ ID NO: 1331:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 26...511
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1331:

TAGAAGAATT TGAAAGGTTG CTCGC ATG CAA AGA GAA TTA AGG CTT TTA AAT 52
Met Gln Arg Glu Leu Arg Leu Leu Asn
1 5

AAC AAG CAT TGC ATG GAA TAC TTG CAA TTT CTG TCC AAA AAC CAT TTG 100
Asn Lys His Cys Met Glu Tyr Leu Gln Phe Leu Ser Lys Asn His Leu
10 15 20 25

AGT TTT AAC CTT TTG TGC GAA AGA GAT GCG ATT GAT TTT TCC CCC AAG 148
Ser Phe Asn Leu Leu Cys Glu Arg Asp Ala Ile Asp Phe Ser Pro Lys
30 35 40 
420 425 430

Lys Glu Ile Ile Ser Lys Thr Glu Leu Gln Glu Ser Ile Gln Asn Ser
435 440 445

Arg Asn Leu Gln Arg Val Phe Ile Ile Asp Glu Ala His Arg Ser Tyr
450 455 460

Asp Pro Lys Gly Cys Phe Tyr Ala Asn Leu Ile Glu Cys Asp Lys Thr
465 470 475 480

Ala Ile Lys Ile Ala Leu Thr Gly Thr Pro Leu Leu Glu Asp Asn Ala
485 490 495

Gln Asp Lys Ala Thr Lys Asn Thr Phe Gly Asn Tyr Leu His Thr Tyr
500 505 510

Ser Tyr Thr Glu Ser Ile Lys Asp Arg His Thr Leu Lys Leu Gln Leu
515 520 525

Glu Ser Ile Glu Thr Ser Tyr Lys Glu Lys Leu Gln Glu Ile Tyr Arg
530 535 540

Leu Leu Gln Glu Ser Ile Thr Ile Glu Asp Thr Glu Val Lys Lys Glu
545 550 555 560

Thr Ile Phe Asn Asp Glu Lys Tyr Ile Asn Ala Met Leu Tyr Tyr Ile
565 570 575

Ile Arg Asp Leu Leu Asp Phe Arg Arg Leu Asn Asp Asn Glu Arg Leu
580 585 590

Lys Ala Met Val Val Cys Phe Ser Ser Lys Gln Ala Arg Leu Ala Asp
595 600 605

Cys Leu Phe Asn Glu Val Gln Glu Lys Val Leu Gln Glu Asn Pro Asn
610 615 620

Leu Arg Ile Leu Asn Lys Leu Lys Ser Ser Leu Ile Leu His Asp Glu
625 630 635 640

Gln Glu Val Lys Glu Lys Val His Ser Phe Lys His Glu Asp Thr Asp
645 650 655

Ile Val Phe Val Phe Asn Met Leu Leu Thr Gly Phe Asp Leu Pro Ser
660 665 670

Leu Lys Arg Leu Tyr Ile His Arg Glu Leu Lys Asp His Asn Leu Leu
675 680 685

Gln Ala Leu Ala Arg Val Asn Arg Ser Tyr Lys Asn Met Ser Phe Gly
690 695 700

Tyr Leu Ile Asp Phe Val Gly Ile Gln Glu Asn Phe Asp Lys Thr Thr
705 710 715 720

Asp Asp Tyr Leu Lys Glu Leu Asn Arg Phe Asn Gln Ser Gly Ala Asn
725 730 735

Ser Asp Ser His Ile Lys Asp Met Phe Ala Asp Arg Lys Thr Leu Glu
740 745 750

Glu Asp Ile Lys Asn Ala Tyr Asp Asp Leu Phe Asp Tyr Pro Ile Asp
755 760 765

Asp Ile Glu Gly Met Thr Ser Ala Ile Val Ser Met Ser Ala Met Asn
770 775 780

Glu Leu Val Lys Val Ser Arg Ala Ile Asn Thr Leu Lys Glu Arg Tyr
785 790 795 800

Asn Leu Ile Arg Thr Ser Asn Asp Lys Lys Ile Leu Ser Leu Lys Glu
805 810 815

Lys Ile Asp Ile Glu Lys Ile His Lys Ile Ser Ser Met Leu His Gln
820 825 830

Lys Ala Lys His Leu His Ala Leu Lys Asn Ile Asn Glu Pro Lys Asn
835 840 845

Pro Asn Asp Leu Met Ile Leu Glu Asp Leu Ile Ala Leu Leu Asp Phe
850 855 860
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1330:

Met Pro Tyr Asn Glu Ile Thr Arg Val Gln Ile Pro Ala Leu Met His
1 5 10 15

Leu Ala Lys Leu Gly Tyr Asp Phe Ile Pro Thr Asn Ser Lys Glu Asn
20 25 30

Lys Pro Asn Leu Asp Thr Ala Thr Asn Ile Leu Thr Asn Ser Phe Thr
35 40 45

Lys Ser Phe Glu Arg Leu Asn Pro Thr Lys Asn Ala Gln Glu Thr Leu
50 55 60

Ala Glu Met Lys Lys Arg Leu Asn Cys Asp Asp Leu Gly Lys Ser Phe
65 70 75 80

Tyr Glu Tyr Leu Leu Lys Ser Glu Asn Gln Ile Ile Asp Phe Asp Asn
85 90 95

Pro Asn Asn Asn Leu Tyr Glu Met Met Thr Glu Leu Pro Tyr Lys Ser
100 105 110

Phe Arg Pro Asp Thr Thr Leu Phe Ile Asn Gly Leu Pro Leu Val Asn
115 120 125

Ile Glu Val Lys Gln Pro Tyr Ala Lys Lys Gly Ile Lys Glu Glu Arg
130 135 140

Asp Arg His Ile Lys Arg Tyr Glu Asn Pro Glu Asn Lys Val Phe Tyr
145 150 155 160

Asn Leu Ala Gln Ile Trp Leu Phe Ser Asp Asn Leu Pro Tyr Asp Glu
165 170 175

Asn Lys Pro Asp Gln Gly Ala Phe Tyr Ser Ala Ser Tyr Ser Pro Ile
180 185 190

Phe Gln Arg Phe Val Glu Ala His Arg Leu Asp Ile Xaa Pro Xaa Pro
195 200 205

Xaa Gln Lys Asn Asp Gln Asn His Gln Asn Asp Gln Asn His Arg Ser
210 215 220

Leu Glu Glu Ile Gln Lys Ser Val Leu Asn Glu Phe Asn Leu Lys Asp
225 230 235 240

Thr Asp Thr Pro Lys Ser Pro Lys Asp Thr Pro Thr Asn Ser Leu Leu
245 250 255

Thr Ser Phe Cys Ser Pro Lys Arg Leu Cys Phe Ile Leu Lys Tyr Gly
260 265 270

Ile Ser Phe Leu Lys Glu Lys Ser Glu Phe Lys Lys His Val Trp Arg
275 280 285

Tyr Ala Gln Met Phe Ala Ser Leu Asn Val Leu Lys Glu Leu Gln Lys
290 295 300

His Tyr Gly Thr Asn Gln Asn Leu Lys Asp Pro Leu Lys Gly Ile Ile
305 310 315 320

Trp His Thr Gln Gly Ser Gly Lys Thr Ala Leu Thr Tyr His Leu Thr
325 330 335

Lys Leu Ile Arg Asp Phe Phe Ser Arg Ser Asn Leu Asn Lys Lys Thr
340 345 350

Lys Phe Tyr Phe Ile Val Asp Arg Leu Asp Leu Leu Glu Gln Ala Lys
355 360 365

Asn Glu Phe Leu Lys Arg Gly Leu Cys Val His Glu Ala Glu Asn Lys
370 375 380

Glu Asp Leu Ser Gln Lys Leu Lys Ser Ser Ser Val Phe Glu Gly Ser
385 390 395 400

Gln Gly Asn Asp Glu Ile Ile Val Val Asn Ile Gln Lys Phe Lys Ala
405 410 415

Pro Asn Glu Glu Lys Ser Pro Asn Glu Asp Pro Ser Asn Ser Ala Pro
GAG ATT TTA GAA AAA ATC CCG GAT CAA CAA GAT AAA GAA ATC CAA AAG 2739
Glu Ile Leu Glu Lys Ile Pro Asp Gln Gln Asp Lys Glu Ile Gln Lys
895 900 905

TTT TAC AAA GAC TTT TCA AAA TTA CTC CAA ACG CCC ACA ACA AGC CAG 2787
Phe Tyr Lys Asp Phe Ser Lys Leu Leu Gln Thr Pro Thr Thr Ser Gln
910 915 920

AAT TTT GAG GAA ATT TCT CAT TCC TAT GAT GCG ATC ATT TCA CAA CTC 2835
Asn Phe Glu Glu Ile Ser His Ser Tyr Asp Ala Ile Ile Ser Gln Leu
925 930 935

AAA CAA CAC AAA GAA CAA ACC ACC CAC TTA TTA AAC AAA TAC GAT AAT 2883
Lys Gln His Lys Glu Gln Thr Thr His Leu Leu Asn Lys Tyr Asp Asn
940 945 950

GAT TTG TCT TAT GCG ATC ACG AAC AAA CGC CTT CAT AAG CAC CTT ATG 2931
Asp Leu Ser Tyr Ala Ile Thr Asn Lys Arg Leu His Lys His Leu Met
955 960 965 970

GAA CAA AAC ATT TCT AAC TCA GCG GGA ATT TTC ACG CTT TTA AGC GCC 2979
Glu Gln Asn Ile Ser Asn Ser Ala Gly Ile Phe Thr Leu Leu Ser Ala
975 980 985

TTA AAA AAA GCT ATT GAT GCG CGT ATT TTT AAG CGT CAA GAA ATC TTA 3027
Leu Lys Lys Ala Ile Asp Ala Arg Ile Phe Lys Arg Gln Glu Ile Leu
990 995 1000

AAC GAA GAG TAT TAC CTA AAA AAT GCC ATA AAA GCA GAA TTA AAT AAC 3075
Asn Glu Glu Tyr Tyr Leu Lys Asn Ala Ile Lys Ala Glu Leu Asn Asn
1005 1010 1015

GCT TTC AAA AAA GAC CCC TCC TTA AAA GAT TTA GAA AAA GAA AAA GAA 3123
Ala Phe Lys Lys Asp Pro Ser Leu Lys Asp Leu Glu Lys Glu Lys Glu
1020 1025 1030

CTT ATC ATT CAA ACC CTT TTT AAC GAA CTC ACA CAA AAC CAC CAT CAA 3171
Leu Ile Ile Gln Thr Leu Phe Asn Glu Leu Thr Gln Asn His His Gln
1035 1040 1045 1050

GGA AAT CCG CAT GCC TAATAACGCT TTATTGCAAA TCAAACAAGA CACCCT 3222
Gly Asn Pro His Ala
1055

(2) INFORMATION FOR SEQ ID NO: 1330:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1055 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
GGC TTT GAT TTA CCC AGT CTC AAA CGC CTT TAT ATC CAC AGA GAA TTA 2067
Gly Phe Asp Leu Pro Ser Leu Lys Arg Leu Tyr Ile His Arg Glu Leu
670 675 680

AAA GAT CAC AAT TTG CTC CAA GCC CTA GCC AGA GTG AAT CGC TCC TAT 2115
Lys Asp His Asn Leu Leu Gln Ala Leu Ala Arg Val Asn Arg Ser Tyr
685 690 695

AAA AAC ATG TCT TTT GGC TAC CTT ATA GAT TTT GTA GGC ATT CAA GAA 2163
Lys Asn Met Ser Phe Gly Tyr Leu Ile Asp Phe Val Gly Ile Gln Glu
700 705 710

AAT TTT GAC AAA ACG ACT GAT GAT TAC TTG AAA GAA TTA AAC CGA TTC 2211
Asn Phe Asp Lys Thr Thr Asp Asp Tyr Leu Lys Glu Leu Asn Arg Phe
715 720 725 730

AAT CAA AGC GGT GCC AAT AGC GAT TCT CAT ATC AAA GAC ATG TTT GCG 2259
Asn Gln Ser Gly Ala Asn Ser Asp Ser His Ile Lys Asp Met Phe Ala
735 740 745

GAT CGT AAG ACT TTA GAA GAA GAC ATT AAA AAC GCC TAT GAT GAT CTT 2307
Asp Arg Lys Thr Leu Glu Glu Asp Ile Lys Asn Ala Tyr Asp Asp Leu
750 755 760

TTT GAT TAC CCC ATT GAC GAT ATA GAG GGC ATG ACT AGC GCC ATT GTC 2355
Phe Asp Tyr Pro Ile Asp Asp Ile Glu Gly Met Thr Ser Ala Ile Val
765 770 775

AGC ATG AGC GCA ATG AAC GAG CTT GTA AAA GTC TCA CGC GCC ATT AAC 2403
Ser Met Ser Ala Met Asn Glu Leu Val Lys Val Ser Arg Ala Ile Asn
780 785 790

ACG CTC AAA GAG CGC TAC AAT TTA ATC CGC ACT TCT AAT GAT AAA AAA 2451
Thr Leu Lys Glu Arg Tyr Asn Leu Ile Arg Thr Ser Asn Asp Lys Lys
795 800 805 810

ATC CTT TCA CTA AAA GAA AAA ATT GAT ATT GAA AAG ATC CAT AAA ATC 2499
Ile Leu Ser Leu Lys Glu Lys Ile Asp Ile Glu Lys Ile His Lys Ile
815 820 825

TCT TCA ATG CTT CAT CAA AAA GCC AAA CAC CTC CAT GCG TTA AAG AAT 2547
Ser Ser Met Leu His Gln Lys Ala Lys His Leu His Ala Leu Lys Asn
830 835 840

ATC AAT GAG CCT AAA AAC CCA AAC GAT TTA ATG ATT TTA GAA GAC CTC 2595
Ile Asn Glu Pro Lys Asn Pro Asn Asp Leu Met Ile Leu Glu Asp Leu
845 850 855

ATC GCT CTT TTA GAC TTT AAA ATA GAG TTT AAA GAA CGC AAA GAA TTA 2643
Ile Ala Leu Leu Asp Phe Lys Ile Glu Phe Lys Glu Arg Lys Glu Leu
860 865 870

CGC TTT AAA GAA CAA GAA GAG ATT ACC ACC AAA CAA AAG CAA GCT AAA 2691
Arg Phe Lys Glu Gln Glu Glu Ile Thr Thr Lys Gln Lys Gln Ala Lys
875 880 885 890
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GAA TCC ATT CAA AAC AGC CGC AAT TTA CAA AGO GTG TTT ATC ATA GAT 13 95 

Glu Ser lie Gin Asn Ser Arg Asn Leu Gin Arg Val Phe lie lie Asp 
445 450 455 

GAA GCC CAC AGG AGC TAG GAT CCT AAA GGT TGC TTT TAG GGT AAT TTG 144 3 

Glu Ala His Arg Ser Tyr Asp Pro Lys Gly Cys Phe Tyr Ala Asn Leu 
460 465 470 

ATA GAA TGC GAC AAG ACA GGA ATT AAA ATC GCC CTC ACA GGC ACG GCC 14 91 

lie Glu Cys Asp Lys Thr Ala lie Lys lie Ala Leu Thr Gly Thr Pro 
475 480 485 490 

CTA TTA GAA GAC AAC GCG CAA GAT AAA GCC ACT AAA AAC ACT TTT GGC 153 9 

Leu Leu Glu Asp Asn Ala Gin Asp Lys Ala Thr Lys Asn Thr Phe Gly 
495 500 505 

AAC TAG TTG CAC ACC TAT TGT TAT ACA GAA TCC ATT AAA GAC AGA GAC 15 87 

Asn Tyr Leu His Thr Tyr Ser Tyr Thr Glu Ser lie Lys Asp Arg His 
510 515 520 

ACC CTA AAA CTC CAG TTA GAA AGC ATT GAA ACG AGC TAT AAA GAA AAA 1635 
Thr Leu Lys Leu Gin Leu Glu Ser lie Glu Thr Ser Tyr Lys Glu Lys 
525 530 535 

TTA CAA GAA ATC TAT CGC GTT TTA CAA GAA AGC ATC ACT ATT GAA GAC 16 8 3 

Leu Gin Glu lie Tyr Arg Leu Leu Gin Glu Ser lie Thr lie Glu Asp 
540 545 550 

ACA GAA GTT AAA AAA GAA ACG ATT TTT AAC GAT GAA AAA TAG ATT AAC 17 31 

Thr Glu Val Lys Lys Glu Thr lie Phe Asn Asp Glu Lys Tyr lie Asn 
555 560 565 570 

GCC ATG CTC TAT TAT ATC ATT AGA GAT TTA TTG GAT TTT AGG GGT TTG 17 7 9 

Ala Met Leu Tyr Tyr Ile Ile Arg Asp Leu Leu Asp Phe Arg Arg Leu
575 580 585 

AAT GAT AAT GAA CGC TTA AAG GGT ATG GTG GTT TGT TTT TGT AGC AAG 182 7 

Asn Asp Asn Glu Arg Leu Lys Ala Met Val Val Cys Phe Ser Ser Lys 
590 595 600 

CAA GCC AGA TTA GGT GAT TGT CTT TTT AAT GAA GTG GAA G7^ AAA GTC 187 5 

Gin Ala Arg Leu Ala Asp Cys Leu Phe Asn Glu Val Gin Glu Lys Val 
605 610 615 

TTA GAA GAA AAC GCC AAG CTA AGG ATT TTA AAC AAA CTC AAA TCC AGC 192 3 

Leu Gin Glu Asn Pro Asn Leu Arg lie Leu Asn Lys Leu Lys Ser Ser 
620 625 630 

CTG ATT TTG CAT GAT GAA CAA GAA GTC AAA GAA AAG GTT CAT TGT TTG 1971 
Leu lie Leu His Asp Glu Gin Glu Val Lys Glu Lys Val His Ser Phe 
635 640 645 650 

AAA CAT GAA GAT ACC GAT ATA GTG TTT GTG TTT AAC ATG CTT TTA AGG 2 019 

Lys His Glu Asp Thr Asp lie Val Phe Val Phe Asn Met Leu Leu Thr 
655 660 665 
GAT CAA AAT CAT CGA TCG CTT GAA GAA ATT CAA AAA AGC GTC TTA AAC 723

Asp Gln Asn His Arg Ser Leu Glu Glu Ile Gln Lys Ser Val Leu Asn
220 225 230

GAA TTT AAC CTT AAA GAC ACC GAC ACC CCA AAA AGC CCT AAA GAC ACC 771

Glu Phe Asn Leu Lys Asp Thr Asp Thr Pro Lys Ser Pro Lys Asp Thr
235 240 245 250

CCC ACA AAC TCC CTT TTA ACT TCG TTT TGC TCT CCA AAA AGG CTT TGC 819

Pro Thr Asn Ser Leu Leu Thr Ser Phe Cys Ser Pro Lys Arg Leu Cys
255 260 265

TTT ATC CTA AAA TAC GGC ATC AGT TTC TTA AAA GAA AAA TCA GAG TTT 867

Phe Ile Leu Lys Tyr Gly Ile Ser Phe Leu Lys Glu Lys Ser Glu Phe
270 275 280

AAA AAA CAC GTT TGG CGT TAT GCG CAG ATG TTT GCG AGC TTG AAC GTT 915

Lys Lys His Val Trp Arg Tyr Ala Gln Met Phe Ala Ser Leu Asn Val
285 290 295

TTA AAA GAA TTG CAA AAG CAT TAT GGA ACA AAC CAA AAC CTA AAA GAT 963

Leu Lys Glu Leu Gln Lys His Tyr Gly Thr Asn Gln Asn Leu Lys Asp
300 305 310

CCC CTA AAA GGC ATC ATC TGG CAC ACG CAA GGC AGC GGT AAA ACC GCC 1011

Pro Leu Lys Gly Ile Ile Trp His Thr Gln Gly Ser Gly Lys Thr Ala
315 320 325 330

TTA ACC TAC CAC TTA ACC AAA CTC ATC AGA GAC TTT TTT AGC CGA TCC 1059

Leu Thr Tyr His Leu Thr Lys Leu Ile Arg Asp Phe Phe Ser Arg Ser
335 340 345

AAC CTA AAC AAA AAG ACT AAA TTT TAT TTT ATT GTG GAC AGG TTG GAT 1107

Asn Leu Asn Lys Lys Thr Lys Phe Tyr Phe Ile Val Asp Arg Leu Asp
350 355 360

TTA TTG GAG CAA GCC AAA AAC GAG TTT TTA AAA AGA GGC CTT TGT GTG 1155

Leu Leu Glu Gln Ala Lys Asn Glu Phe Leu Lys Arg Gly Leu Cys Val
365 370 375

CAT GAG GCA GAA AAT AAA GAG GAT TTG AGC CAA AAA TTA AAA AGC TCT 1203

His Glu Ala Glu Asn Lys Glu Asp Leu Ser Gln Lys Leu Lys Ser Ser
380 385 390

AGC GTT TTT GAA GGC TCT CAA GGG AAT GAT GAA ATC ATC GTT GTG AAT 1251

Ser Val Phe Glu Gly Ser Gln Gly Asn Asp Glu Ile Ile Val Val Asn
395 400 405 410

ATC CAA AAA TTC AAA GCC CCC AAT GAA GAA AAA TCC CCC AAT GAA GAC 1299

Ile Gln Lys Phe Lys Ala Pro Asn Glu Glu Lys Ser Pro Asn Glu Asp
415 420 425

CCC TCT AAT AGC GCT CCT AAA GAA ATC ATT TCT AAA ACA GAA TTA CAA 1347

Pro Ser Asn Ser Ala Pro Lys Glu Ile Ile Ser Lys Thr Glu Leu Gln
430 435 440
CCTTAAATCT AAGGGGTGTG C ATG CCA TAG AAT GAA ATC ACA AGG GTT CAA 51

Met Pro Tyr Asn Glu Ile Thr Arg Val Gln
1 5 10

ATC CCT GCC TTA ATG CAT TTA GCC AAG TTG GGC TAT GAT TTT ATC CCC 9 9 

lie Pro Ala Leu Met His Leu Ala Lys Leu Gly Tyr Asp Phe lie Pro 
15 20 25 

ACT AAT TCT AAA GAA AAT AAG CCC AAC CTA GAC ACC GCC ACC AAC ATT 14 7 

Thr Asn Ser Lys Glu Asn Lys Pro Asn Leu Asp Thr Ala Thr Asn lie 
30 35 40 

TTA ACC AAT AGT TTC ACT AAA TCC TTT GAG CGG TTA AAC CCC ACT AAA 19 5 

Leu Thr Asn Ser Phe Thr Lys Ser Phe Glu Arg Leu Asn Pro Thr Lys 
45 50 55 

AAC GCA CAA GAA ACG CTT GCT GAA ATG AAA AAA CGC TTG AAT TGC GAT 24 3 

Asn Ala Gin Glu Thr Leu Ala Glu Met Lys Lys Arg Leu Asn Cys Asp 
60 65 70 

GAT TTG GGC AAA AGC TTT TAT GAA TAC TTG CTC AAA AGC GAG AAT CAA 2 91 

Asp Leu Gly Lys Ser Phe Tyr Glu Tyr Leu Leu Lys Ser Glu Asn Gin 
75 80 85 90 

ATC ATA GAC TTT GAT AAC CCT AAC AAC AAT CTT TAT GAA ATG ATG ACT 33 9 

lie lie Asp Phe Asp Asn Pro Asn Asn Asn Leu Tyr Glu Met Met Thr 
95 100 105 

GAA TTA CCC TAC AAA TCT TTT AGG CCT GAC ACC ACC CTT TTT ATC AAT 3 87 

Glu Leu Pro Tyr Lys Ser Phe Arg Pro Asp Thr Thr Leu Phe lie Asn 
110 115 120 

GGC TTG CCT TTG GTG AAT ATA GAA GTT AAA CAG CCT TAC GCC AAA AAA 43 5 

Gly Leu Pro Leu Val Asn lie Glu Val Lys Gin Pro Tyr Ala Lys Lys 
125 130 135 

GGC ATT AAA GAA GAA AGA GAT CGC CAC ATC AAA CGC TAT GAA AAC CCT 4 83 

Gly He Lys Glu Glu Arg Asp Arg His He Lys Arg Tyr Glu Asn Pro 
140 145 150 

GAA AAC AAA GTT TTT TAT AAT CTC GCG CAA ATC TGG CTT TTT AGC GAT 531 
Glu Asn Lys Val Phe Tyr Asn Leu Ala Gin He Trp Leu Phe Ser Asp 
155 160 165 170 

AAC TTA CCC TAT GAT GAA AAC AAA CCC GAT CAA GGC GCG TTT TAT AGC 579 
Asn Leu Pro Tyr Asp Glu Asn Lys Pro Asp Gin Gly Ala Phe Tyr Ser 
175 180 185 

GCT TCT TAT TCG CCC ATT TTC CAA CGC TTT GTT GAA GCT CAT AGG CTA 62 7 

Ala Ser Tyr Ser Pro He Phe Gin Arg Phe Val Glu Ala His Arg Leu 
190 195 200 

GAT ATT WCC CCC SSN CCC CSC CAA AAA AAT GAT CAA AAT CAT CAA AAC 67 5 

Asp He Xaa Pro Xaa Pro Xaa Gin Lys Asn Asp Gin Asn His Gin Asn 
205 210 215 
AGCCTTTGTG TGTTCTTTTG TAAA 563
(2) INFORMATION FOR SEQ ID NO: 1328:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1328: 

Met Ser Asn Leu Gln Glu Leu Arg Glu His Leu Lys Glu Leu Glu Asn

1 5 10 15

Ser Phe Glu Ile Gly Ser Phe Thr Lys Glu Asn Ile Lys Glu Tyr Ala

20 25 30

Lys Cys Phe Phe Met Ser Leu Ser Met Phe Leu Glu Glu Gln Glu Lys

35 40 45

Asn Gln Gln Glu Glu Phe Leu Glu Gln Asp Thr Lys Glu Asn Gln Glu

50 55 60

Glu Leu Ile Lys Asn Ile Gln Thr Ser Ile Ala Lys Asn Gln Glu Leu

65 70 75 80

Glu Lys Ile Ser Phe Glu Lys Trp Glu Asn Lys Ile Gln Glu Arg Val

85 90 95

Leu Pro Lys Leu Lys Arg Ile Val Thr His Lys Leu Gln Glu Ser Ile

100 105 110

Thr Ser Ser Ile Asn Thr Gln Leu Glu Ser Phe Lys Lys Asp Glu Leu

115 120 125

Asp Leu Ser Ser Val Phe Glu Ile Gln Arg Lys Asn Thr Gln Ile Ala

130 135 140

Tyr Arg Leu Ala Ile Gly Gly Leu Ile Gly Ile Ile Ala Leu Ser Ser

145 150 155 160

Gln Ile



(2) INFORMATION FOR SEQ ID NO: 1329:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22...3186
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1329: 
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 24...509
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1327: 

CAACTCTTTT TTAAGGGGGA CAC ATG TCT AAT TTG CAA GAA CTT AGA GAG CAT 5 3 

Met Ser Asn Leu Gin Glu Leu Arg Glu His 
15 10 

TTA AAA GAA TTA GAA AAT TCC TTT GAA ATA GGC TCT TTT ACT AAA GAA 101 
Leu Lys Glu Leu Glu Asn Ser Phe Glu lie Gly Ser Phe Thr Lys Glu 
15 20 25 

AAT ATT AAA GAA TAC GCT AAA TGC TTT TTT ATG AGT TTA AGC ATG TTT 14 9 

Asn lie Lys Glu Tyr Ala Lys Cys Phe Phe Met Ser Leu Ser Met Phe 
30 35 40 

TTA GAA GAA CAA GAA AAA AAC CAA CAA GAA GAG TTT TTA GAA CAA GAT 197 
Leu Glu Glu Gin Glu Lys Asn Gin Gin Glu Glu Phe Leu Glu Gin Asp 
45 50 55 

ACC AAA GAA AAT CAA GAA GAG CTC ATT AAA AAC ATT CAA ACA AGC ATT 24 5 

Thr Lys Glu Asn Gin Glu Glu Leu lie Lys Asn lie Gin Thr Ser He 
60 65 70 

GCT AAA AAC CAA GAG TTA GAA AAA ATC TCT TTT GAA AAA TGG GAG AAT 2 93 

Ala Lys Asn Gin Glu Leu Glu Lys He Ser Phe Glu Lys Trp Glu Asn 
75 80 85 90 

AAA ATT CAA GAA AGG GTT TTG CCT AAG TTA AAA CGC ATT GTT ACG CAT 341 
Lys He Gin Glu Arg Val Leu Pro Lys Leu Lys Arg He Val Thr His 
95 100 105 

AAG TTG CAA GAA AGT ATC ACA TCT AGC ATA AAC ACG CAA TTA GAG AGT 3 89 

Lys Leu Gin Glu Ser He Thr Ser Ser He Asn Thr Gin Leu Glu Ser 
110 115 120 

TTT AAA AAA GAT GAG TTA GAT TTA TCT AGC GTG TTT GAA ATC CAA AGA 43 7 

Phe Lys Lys Asp Glu Leu Asp Leu Ser Ser Val Phe Glu He Gin Arg 
125 130 135 

AAG AAC ACT CAA ATA GCG TAT AGA TTA GCT ATA GGG GGG CTT ATA GGT 4 85 

Lys Asn Thr Gin He Ala Tyr Arg Leu Ala He Gly Gly Leu He Gly 
140 145 150 

ATC ATT GCT TTA AGC TCG CAA ATT TGATTATTAA CTCTATACTT CACGCTTTTT 53 9 

He He Ala Leu Ser Ser Gin He 
155 160 
Tyr Leu Met Met Ser Ala Leu Lys Tyr Ser Asn Ile Asp Ile Leu Ala

145 150 155 160

Leu Asp Arg Leu Asn Gly Leu Tyr Ser Phe Thr Lys Tyr Phe Asp Gly

165 170 175

Ile Tyr Asn Gln Gly Glu Asn Phe His Ile Asn Pro Phe Ser Leu Glu

180 185 190

Asp Ser Ala Thr Asn Arg Ala Phe Leu Leu His Phe Tyr Ala Gln Met

195 200 205

Ala Lys Val Asp Ser Tyr Asp Asp His Lys Asp Lys Val Glu Asp Arg

210 215 220

Thr Ala Leu Leu Asn Ala Ile Asp Thr Met Tyr Arg Asn Tyr Asn Asp

225 230 235 240

Glu Val Lys Gln Ala Lys Phe Ser Asn Gln Glu Leu Pro Leu Pro Phe

245 250 255

Asp Leu Lys Glu Phe Val Asn Ala Ile Ala Lys Thr Asn Thr Asp Ile

260 265 270

Leu Asp Ser Ser Phe Glu Asp Tyr Leu Lys Ser Ser Leu Phe Ser Ser

275 280 285

Arg Met Asp Ser Leu Asp Phe Lys Thr Arg Ile Ser Thr Ile Asn Thr

290 295 300

Asp Ser Ile Leu His Asn Asp Asp Asp Ala Gly Leu Leu Ala Tyr Tyr

305 310 315 320

Val Phe His Lys Met Ile Asp Arg Ala Leu Lys Ile Asn Arg Gly Phe

325 330 335

Leu Cys Phe Ile Asp Glu Phe Lys Ser Tyr Ala Gln Asn Glu Met Met

340 345 350

Asn Lys Lys Ile Asn Glu Ile Ile Thr Gln Ala Arg Lys Ala Asn Gly

355 360 365

Val Ile Val Leu Ala Leu Gln Asp Ile Asn Gln Leu Ser Glu Val Arg

370 375 380

Asn Ala Gln Ser Phe Ile Lys Asn Met Gly Gln Leu Ile Leu Tyr Pro

385 390 395 400

Gln Arg Asn Ile Asp Thr Lys Asp Leu Asn Asp Lys Phe Gly Ile Arg

405 410 415

Leu Ser Asp Thr Glu Lys His Phe Leu Glu Asn Thr Ala Val Asn Glu

420 425 430

Tyr Lys Val Leu Leu Lys Asn Met Asn Asp Gly Ser Ser Asn Ile Ile

435 440 445

Asp Val Ser Leu Ser Ser Leu Gly Asn Tyr Leu Gln Ile Phe Ser Ser

450 455 460

Asn Ser Ser Met Val Glu His Ile Asp Asn Leu Ile Lys His Tyr Pro

465 470 475 480

Lys Thr Trp Arg Glu Val Phe Val Ser Asn Lys His Glu Asn Phe Asp

485 490 495

Asp Lys Lys His Leu Glu Lys Val Leu Lys

500 505



(2) INFORMATION FOR SEQ ID NO: 1327:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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CAT TTT TTA GAA AAC ACC GCC GTT AAT GAA TAG AAA GTC TTA CTC AAA 13 5 2 

His Phe Leu Glu Asn Thr Ala Val Asn Glu Tyr Lys Val Leu Leu Lys 
425 430 435 

AAC ATG AAT GAT GGC TCA TCT AAC ATT ATA GAT GTG AGC CTA AGT TCT 14 0 0 

Asn Met Asn Asp Gly Ser Ser Asn lie lie Asp Val Ser Leu Ser Ser 
440 445 450 

TTG GGT AAT TAC CTA CAA ATC TTT AGC TCT AAT TCT AGC ATG GTA GAA 14 4 8 

Leu Gly Asn Tyr Leu Gin lie Phe Ser Ser Asn Ser Ser Met Val Glu 
455 460 465 470 

CAC ATT GAT AAT CTC ATT AAG CAT TAC CCT AAA ACT TGG CGA GAA GTC 14 96 

His lie Asp Asn Leu lie Lys His Tyr Pro Lys Thr Trp Arg Glu Val 

475 480 485 

TTT GTG AGT AAC AAA CAC GAA AAT TTT GAT GAC AAA AAA CAC TTA GAA 1544 

Phe Val Ser Asn Lys His Glu Asn Phe Asp Asp Lys Lys His Leu Glu 

490 495 500 

AAG GTG CTT AAA TGAAAAACAT CATGCGTTTA GTTTTTGTGA TAGTGGCTAT GT 15 98 

Lys Val Leu Lys 
505 



(2) INFORMATION FOR SEQ ID NO: 1326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1326: 



Met Gln Glu Cys Ala Leu Asn Leu Val Ile Arg Ala Lys Ser Lys Ala

1 5 10 15

Lys Leu Asp Lys Ser Leu Lys Glu Ile Leu Ser Leu Leu Asn Asn Ala

20 25 30

Gly Leu Gly Ser Val Thr Glu Thr Ile Gly Leu Lys Pro Ser Tyr Phe

35 40 45

Ser Phe Phe Pro Asn Asn Ala Asn Ile Asn Pro Arg Met Arg His Gln

50 55 60

Thr Ser Gln Val Ile Ala Ser Leu Ile Leu Phe Glu Lys Asn Asn Thr

65 70 75 80

Gly Phe Arg Ala Asn Ser Trp Gly Asp Met Pro Leu Ser Val Phe Lys

85 90 95

Asn Leu Asp His Ser Pro Tyr Leu Phe Asn Phe His Asn Gln Glu Val

100 105 110

Lys His Lys Gly Val Leu Ala His Asn Val Ala Arg Val Val Gly His

115 120 125

Thr Met Ile Ile Gly Ala Thr Gly Ala Gly Lys Thr Thr Leu Ile Ser

130 135 140
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Ala Phe Leu Leu His Phe Tyr Ala Gin Met Ala Lys Val Asp Ser Tyr 
200 205 210 

GAT GAG CAT AAG GAT AAA GTA GAA GAT AGA ACA GCC CTT TTA AAT GCT 72 8 

Asp Asp His Lys Asp Lys Val Glu Asp Arg Thr Ala Leu Leu Asn Ala 
215 220 225 230 

ATT GAT ACG ATG TAT AGA AAT TAT AAC GAT GAA GTC AAA CAA GCC AAA 776 
lie Asp Thr Met Tyr Arg Asn Tyr Asn Asp Glu Val Lys Gin Ala Lys 
235 240 245 

TTT AGC AAC CAA GAA TTA CCC CTT CCT TTT GAT TTA AAA GAG TTT GTC 82 4 

Phe Ser Asn Gin Glu Leu Pro Leu Pro Phe Asp Leu Lys Glu Phe Val 
250 255 260 

AAT GCC ATT GCT AAA ACC AAT ACA GAC ATT TTA GAT AGT AGT TTT GAA 872 
Asn Ala lie Ala Lys Thr Asn Thr Asp lie Leu Asp Ser Ser Phe Glu 
265 270 275 

GAC TAT TTA AAA TCT TCC TTA TTT TCT AGC CGA ATG GAT AGT CTA GAT 92 0 

Asp Tyr Leu Lys Ser Ser Leu Phe Ser Ser Arg Met Asp Ser Leu Asp 
280 285 290 

TTT AAA ACT CGT ATT AGC ACC ATA AAT ACC GAT AGC ATT TTA CAT AAT 96 8 

Phe Lys Thr Arg lie Ser Thr lie Asn Thr Asp Ser lie Leu His Asn 
295 300 305 310 

GAT GAT GAC GCT GGG CTT TTA GCC TAC TAT GTC TTT CAT AAG ATG ATT 1016 
Asp Asp Asp Ala Gly Leu Leu Ala Tyr Tyr Val Phe His Lys Met lie 
315 320 325 

GAC AGA GCC TTA AAA ATC AAT CGT GGG TTT TTA TGC TTT ATT GAT GAG 1064 
Asp Arg Ala Leu Lys lie Asn Arg Gly Phe Leu Cys Phe lie Asp Glu 
330 335 340 

TTT AAG TCT TAC GCT CAA AAT GAA ATG ATG AAT AAA AAA ATC AAT GAA 1112 
Phe Lys Ser Tyr Ala Gin Asn Glu Met Met Asn Lys Lys lie Asn Glu 
345 350 355 

ATC ATT ACT CAA GCT AGA AAG GCT AAT GGG GTG ATT GTT CTA GCC TTA 116 0 

lie lie Thr Gin Ala Arg Lys Ala Asn Gly Val lie Val Leu Ala Leu 
360 365 370 

CAA GAC ATT AAC CAA CTA AGC GAA GTG AGA AAC GCT CAA AGC TTT ATA 12 0 8 

Gin Asp lie Asn Gin Leu Ser Glu Val Arg Asn Ala Gin Ser Phe lie 
375 380 385 390 

AAA AAT ATG GGG CAA TTG ATT TTG TAT CCC CAA AGA AAT ATT GAT ACC 12 5 6 

Lys Asn Met Gly Gin Leu lie Leu Tyr Pro Gin Arg Asn lie Asp Thr 
395 400 405 

AAA GAT TTA AAC GAT AAA TTT GGC ATT AGA CTA AGC GAT ACA GAA AAA 13 04 

Lys Asp Leu Asn Asp Lys Phe Gly lie Arg Leu Ser Asp Thr Glu Lys 
410 415 420 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1325:

ATAATTATAT AGAATTAGTG CAAGCCAATC GTTTGAGC ATG CAA GAG TGT GCT TTA 5 6 

Met Gin Glu Cys Ala Leu 
1 5 

AAC TTA GTT ATA AGG GCT AAA AGT AAA GCT AAA TTA GAC AAG TCT TTA 104 
Asn Leu Val lie Arg Ala Lys Ser Lys Ala Lys Leu Asp Lys Ser Leu 
10 15 20 

AAA GAG ATT TTA TCC TTG CTT AAT AAT GCT GGA CTA GGC AGT GTT ACA 152 
Lys Glu lie Leu Ser Leu Leu Asn Asn Ala Gly Leu Gly Ser Val Thr 
25 30 35 

GAA ACT ATA GGG CTA AAA CCA TCT TAT TTT TCA TTC TTC CCA AAT AAC 2 00 

Glu Thr lie Gly Leu Lys Pro Ser Tyr Phe Ser Phe Phe Pro Asn Asn 
40 45 50 

GCC AAT ATC AAC CCT AGA ATG AGA CAT CAA ACT TCC CAA GTC ATA GCA 24 8 

Ala Asn lie Asn Pro Arg Met Arg His Gin Thr Ser Gin Val lie Ala 
55 60 65 70 

TCT TTG ATT TTG TTT GAG AAA AAT AAT ACA GGT TTT AGA GCA AAT TCT 2 96 

Ser Leu lie Leu Phe Glu Lys Asn Asn Thr Gly Phe Arg Ala Asn Ser 
75 80 85 

TGG GGG GAT ATG CCC TTA TCT GTG TTT AAG AAC CTA GAC CAT AGC CCT 34 4 

Trp Gly Asp Met Pro Leu Ser Val Phe Lys Asn Leu Asp His Ser Pro 
90 95 100 

TAT TTG TTT AAT TTT CAT AAT CAA GAA GTC AAA CAT AAG GGC GTG TTA 3 92 

Tyr Leu Phe Asn Phe His Asn Gin Glu Val Lys His Lys Gly Val Leu 
105 110 115 

GCC CAC AAT GTC GCA CGA GTA GTG GGA CAT ACC ATG ATT ATA GGA GCA 44 0 

Ala His Asn Val Ala Arg Val Val Gly His Thr Met lie lie Gly Ala 
120 125 130 

ACA GGT GCT GGT AAA ACC ACA CTC ATT AGC TAT TTG ATG ATG AGT GCC 4 88 

Thr Gly Ala Gly Lys Thr Thr Leu lie Ser Tyr Leu Met Met Ser Ala 
135 140 145 150 

TTA AAA TAT TCT AAC ATT GAT ATT TTA GCT CTT GAT AGA CTA AAT GGT 53 6 

Leu Lys Tyr Ser Asn lie Asp He Leu Ala Leu Asp Arg Leu Asn Gly 
155 160 165 

TTG TAT TCC TTT ACC AAG TAT TTT GAT GGG ATT TAT AAT CAA GGC GAA 5 84 

Leu Tyr Ser Phe Thr Lys Tyr Phe Asp Gly He Tyr Asn Gin Gly Glu 
170 175 180 

AAC TTT CAT ATT AAC CCT TTT TCA TTA GAA GAT AGC GCA ACT AAT AGA 63 2 

Asn Phe His He Asn Pro Phe Ser Leu Glu Asp Ser Ala Thr Asn Arg 
185 190 195 

GCC TTT TTA TTG CAT TTT TAT GCC CAA ATG GCA AAA GTG GAT AGT TAT 6 80 
Met Leu Glu Ser Ala Leu Lys Tyr Cys Lys Glu Lys Ala Ile Asp Leu

1 5 10 15

Leu Val Gly Phe Val Pro Lys Thr Tyr Ser Met Ala Gln Glu Cys Asn

20 25 30

Ile Leu Gly Leu Tyr Asp Asp Ala Phe Ile Ile Thr Lys Gln Glu Asn

35 40 45

Leu Val Gly Ile Ile Ser Leu Gln Gly Leu Ser Tyr Ser Asn Leu Met

50 55 60

Gln Lys Asp Leu Glu Gly Tyr Phe Asp Ala Arg Gln Asn Val Leu Asn

65 70 75 80

Thr Ile Ser Lys Asp Ile Gln Leu Arg Ile Val Ala Lys Arg Arg Lys

85 90 95

Glu Phe Ile Asn Gln Ser Pro Asn Ile Asp Asn Ile Tyr Ala Lys Ala

100 105 110

Ile Ile Thr Gln Phe Glu Ser Lys Gly Ile Tyr Lys Thr Glu Tyr Phe

115 120 125

Leu Val Phe Glu Thr Ile Thr Ser Asn Val Lys Ser Phe Phe Glu Lys

130 135 140

Lys Lys Leu Glu Met Thr Thr Ser Ile Asn Glu Glu Leu Glu Glu Ser

145 150 155 160

Ser Lys Glu Asp Lys Gln Glu Asn Glu Asn Xaa Ser Asn Glu Thr His

165 170 175

Ser Asn Thr Ser Ser Lys Lys Asp Lys Lys Asn Lys Phe Lys Lys Lys

180 185 190

Ile Thr Phe Ser Thr Lys Ser Lys Arg Ala Leu Leu Ile Gln Thr Ile

195 200 205

Glu Arg Val Lys Asn Ala Leu Lys Glu Phe Lys Pro Thr Leu Leu Asn

210 215 220

Ser Lys Glu Val Leu Asn Phe Tyr Ala Glu Tyr Ile Asn Gly Lys Tyr

225 230 235 240

Ile Ala Phe Asn Pro Lys Leu Lys Arg Leu Ser Asp Thr Ile Leu His

245 250 255

Leu Met Cys Ile Leu Arg Lys Ile Thr Leu Ser Leu Asn Phe Lys Ile

260 265 270

Lys Thr Pro Phe Val Arg Val Trp Gly Leu Arg Leu Met Arg Ala Lys

275 280 285

Lys Phe Leu Arg Ser Leu Tyr Leu Leu Phe Tyr Thr Pro Lys Leu Asn

290 295 300

(2) INFORMATION FOR SEQ ID NO: 1325:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 39...1556
(D) OTHER INFORMATION: 
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135 140 145 150 



ACT TCA ATT AAT GAA GAG TTA GAA GAA AGC TCT AAA GAA GAT AAA CAA 534 
Thr Ser lie Asn Glu Glu Leu Glu Glu Ser Ser Lys Glu Asp Lys Gin 
155 160 165 



GAG AAT GAA AAT MGC TCC AAT GAA ACT CAT TCA AAC ACA AGC TCT AAA 5 82 

Glu Asn Glu Asn Xaa Ser Asn Glu Thr His Ser Asn Thr Ser Ser Lys 
170 175 180 



AAA GAC AAG AAA AAC AAG TTC AAA AAA AAG ATA ACC TTT AGC ACC AAA 63 0 

Lys Asp Lys Lys Asn Lys Phe Lys Lys Lys lie Thr Phe Ser Thr Lys 
185 190 195 



AGT AAA AGA GCC TTA CTC ATT CAA ACC ATA GAA AGA GTA AAA AAC GCT 67 8 

Ser Lys Arg Ala Leu Leu He Gin Thr He Glu Arg Val Lys Asn Ala 
200 205 210 



CTT AAA GAA TTT AAA CCC ACT TTA CTA AAT TCT AAA GAA GTA TTA AAT 72 6 

Leu Lys Glu Phe Lys Pro Thr Leu Leu Asn Ser Lys Glu Val Leu Asn 
215 220 225 230 



TTC TAC GCA GAA TAC ATC AAT GGC AAA TAC ATC GCC TTT AAT CCT AAA 7 74 

Phe Tyr Ala Glu Tyr He Asn Gly Lys Tyr He Ala Phe Asn Pro Lys 
235 240 245 



TTA AAG CGA TTA AGC GAT ACT ATA TTG CAT CTA ATG TGC ATT TTA AGA 82 2 

Leu Lys Arg Leu Ser Asp Thr He Leu His Leu Met Cys He Leu Arg 
250 255 260 



AAG ATT ACT TTG TCA TTG AAT TTC AAA ATC AAA ACA CCT TTT GTG CGT 87 0 

Lys He Thr Leu Ser Leu Asn Phe Lys He Lys Thr Pro Phe Val Arg 
265 270 275 



GTG TGG GGA TTA AGG CTT ATG AGA GCG AAG AAA TTT CTT CGC TCC CTA 918 
Val Trp Gly Leu Arg Leu Met Arg Ala Lys Lys Phe Leu Arg Ser Leu
280 285 290 



TAT CTA CTC TTT TAC ACA CCC AAA TTG AAC TAGATTTAAT CTTTCATATC CGC 971 
Tyr Leu Leu Phe Tyr Thr Pro Lys Leu Asn 
295 300 

TCTTTAGGGC AATTTGAAAG CCTG 995 



(2) INFORMATION FOR SEQ ID NO: 1324:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1324: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37...948
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1323:

TAAAAACACC CCTAAAAGAA AAAGAAAGTC TTCTTA ATG TTA GAA AGC GCC CTT 5 4 

Met Leu Glu Ser Ala Leu 
1 5 

AAA TAT TGC AAG GAA AAA GCC ATA GAC CTT TTA GTA GGG TTT GTG CCA 102 
Lys Tyr Cys Lys Glu Lys Ala lie Asp Leu Leu Val Gly Phe Val Pro 
10 15 20 

AAA ACC TAT TCT ATG GCA CAA GAG TGC AAT ATT TTA GGC TTG TAT GAT 15 0 

Lys Thr Tyr Ser Met Ala Gin Glu Cys Asn lie Leu Gly Leu Tyr Asp 
25 30 35 

GAT GCT TTC ATT ATT ACC AAA CAA GAA AAT CTA GTA GGC ATT ATA TCC 198 
Asp Ala Phe He He Thr Lys Gin Glu Asn Leu Val Gly He He Ser 
40 45 50 

TTA CAA GGA CTA AGC TAT TCT AAT TTA ATG CAA AAA GAC TTA GAG GGC 24 6 

Leu Gin Gly Leu Ser Tyr Ser Asn Leu Met Gin Lys Asp Leu Glu Gly 
55 60 65 70 

TAT TTT GAT GCT AGA CAA AAT GTT CTC AAC ACC ATT AGT AAA GAC ATT 2 94 

Tyr Phe Asp Ala Arg Gin Asn Val Leu Asn Thr He Ser Lys Asp He 
75 80 85 

CAA TTA AGA ATT GTG GCT AAA AGO CGT AAG GAA TTT ATC AAT CAA AGT 3 42 

Gin Leu Arg He Val Ala Lys Arg Arg Lys Glu Phe He Asn Gin Ser 
90 95 100 

CCA AAT ATT GAC AAT ATT TAT GCC AAA GCT ATT ATC ACA CAA TTT GAA 3 90 

Pro Asn He Asp Asn He Tyr Ala Lys Ala He He Thr Gin Phe Glu 
105 110 115 

AGC AAG GGA ATC TAT AAA ACA GAG TAT TTT TTA GTG TTT GAA ACT ATC 438 
Ser Lys Gly He Tyr Lys Thr Glu Tyr Phe Leu Val Phe Glu Thr He 
120 125 130 

ACT TCT AAT GTC AAG TCT TTC TTT GAA AAA AAG AAA TTG GAA ATG ACT 4 86 

Thr Ser Asn Val Lys Ser Phe Phe Glu Lys Lys Lys Leu Glu Met Thr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1321:

CTGGTTTATG AGTATTTTTT AAAAGAAGTC CCC ATG CAA TTA GTT GGT ATT TCA 54 

Met Gin Leu Val Gly lie Ser 
1 5 

GTT TCT AAT CTC AAA GAA ATC AGC TCC AAA GAA AAA TTT CTT TGG CTC 102 
Val Ser Asn Leu Lys Glu lie Ser Ser Lys Glu Lys Phe Leu Trp Leu 
10 15 20 

AAT GCT AAG AGT TTT TTA CTC TCA GGA TTT GTG CCT TTT ATT ATG ATA 150 
Asn Ala Lys Ser Phe Leu Leu Ser Gly Phe Val Pro Phe lie Met lie 
25 30 35 

CCT TGG CTA GAT ATA TTG AAC TCT TTT GTG CTT TAT GTG TGC TTT CTC 19 8 

Pro Trp Leu Asp Ile Leu Asn Ser Phe Val Leu Tyr Val Cys Phe Leu
40 45 50 55 

TTA ATT TTT AGC ATA GCG GAG TTC TTT GAT GAA GAT ATA AGT GAC ATT 246 
Leu lie Phe Ser lie Ala Glu Phe Phe Asp Glu Asp lie Ser Asp lie 
60 65 70 

TTA ATC GCT CAT TCC AAA ATT AAA ACC AAA GCT AAT TCA TTT TAC GCT T 2 95 
Leu lie Ala His Ser Lys lie Lys Thr Lys Ala Asn Ser Phe Tyr Ala 
75 80 85 

AAAAGGAAAA AATATGCAAA AAGAAGTCTT AGTAGAAAA 3 34 

(2) INFORMATION FOR SEQ ID NO: 1322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1322: 

Met Gln Leu Val Gly Ile Ser Val Ser Asn Leu Lys Glu Ile Ser Ser

1 5 10 15

Lys Glu Lys Phe Leu Trp Leu Asn Ala Lys Ser Phe Leu Leu Ser Gly

20 25 30

Phe Val Pro Phe Ile Met Ile Pro Trp Leu Asp Ile Leu Asn Ser Phe

35 40 45

Val Leu Tyr Val Cys Phe Leu Leu Ile Phe Ser Ile Ala Glu Phe Phe

50 55 60

Asp Glu Asp Ile Ser Asp Ile Leu Ile Ala His Ser Lys Ile Lys Thr

65 70 75 80

Lys Ala Asn Ser Phe Tyr Ala

85

(2) INFORMATION FOR SEQ ID NO:1323: 
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AAC AAT CTT ACA AAA TAT TAT CCA AGT TCT GTG TGATTAAAAA AGAAAGAAAT 48 8 

Asn Asn Leu Thr Lys Tyr Tyr Pro Ser Ser Val 
140 145 150 

ATCGCATGAA AAAATTAAGT CATTTTAGAA AGCTTATCGC CTT 531 
(2) INFORMATION FOR SEQ ID NO: 1320:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1320: 

Met Ile Lys Leu Ile Leu His Lys Lys Ser Ile Gln Ile Asp Glu Thr

1 5 10 15

Leu Leu Asn Val Lys Glu His Leu Glu Lys Phe Tyr Ser Asn Lys Glu

20 25 30

Gln Glu Thr Ile Ala Gln Thr Leu Glu Asn Glu Thr Glu Ile Ser Cys

35 40 45

Ser Tyr Phe Trp Asp Lys Asp Phe Leu Leu Leu Glu Gln Leu Leu Glu

50 55 60

Asn Xaa Leu Gly His Phe Thr Phe Glu Ser Glu Phe Ala Leu Leu Lys

65 70 75 80

Asp Lys Glu Thr Leu Asn Leu Ser Gln Ile Lys Gln Ile Gly Val Leu

85 90 95

Lys Val Leu Thr Tyr Xaa Met Ile Gln Thr Leu Lys Asn Gln Ile Ile

100 105 110

His Leu Ala Gln Val Val Asn Glu Glu Asn Leu Glu Lys Asp Glu Glu

115 120 125

Leu Val Val Tyr His Leu Asn Phe Thr Ser Arg Asn Asn Leu Thr Lys

130 135 140

Tyr Tyr Pro Ser Ser Val

145 150

(2) INFORMATION FOR SEQ ID NO: 1321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34... 294 
(D) OTHER INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO: 1319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19...468
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1319: 

ATTTACTAAA GGAAAACA ATG ATT AAA CTA ATC TTA CAC AAG AAG TCC ATA 51

Met Ile Lys Leu Ile Leu His Lys Lys Ser Ile
1 5 10

CAA ATT GAT GAA ACA TTG CTG AAT GTA AAA GAG CAT TTA GAA AAG TTT 99

Gln Ile Asp Glu Thr Leu Leu Asn Val Lys Glu His Leu Glu Lys Phe
15 20 25

TAT TCA AAT AAA GAA CAA GAG ACA ATC GCT CAA ACT TTA GAG AAT GAA 147

Tyr Ser Asn Lys Glu Gln Glu Thr Ile Ala Gln Thr Leu Glu Asn Glu
30 35 40

ACA GAA ATT TCT TGT AGC TAT TTT TGG GAC AAA GAC TTC TTG TTG TTA 19 5 

Thr Glu lie Ser Cys Ser Tyr Phe Trp Asp Lys Asp Phe Leu Leu Leu 
45 50 55 

GAG CAA CTT TTA GAA AAT RAT TTA GGT CAT TTT ACC TTT GAG AGC GAG 24 3 

Glu Gin Leu Leu Glu Asn Xaa Leu Gly His Phe Thr Phe Glu Ser Glu 
60 65 70 75 

TTT GCC CTA CTA AAA GAT AAA GAG ACT TTA AAC CTA TCT CAA ATC AAA 2 91 

Phe Ala Leu Leu Lys Asp Lys Glu Thr Leu Asn Leu Ser Gin He Lys 
80 85 90 

CAA ATC GGT GTC TTA AAG GTT CTT ACC TAT GAR ATG ATA CAA ACC TTA 33 9 

Gin He Gly Val Leu Lys Val Leu Thr Tyr Xaa Met He Gin Thr Leu 
95 100 105 

AAA AAT CAA ATC ATT CAT TTA GCA CAA GTT GTC AAT GAA GAA AAT TTA 3 87 

Lys Asn Gin He He His Leu Ala Gin Val Val Asn Glu Glu Asn Leu 
110 115 120 

GAA AAA GAT GAA GAA CTT GTT GTC TAG CAC CTA AAT TTC ACG TCA CGC 435 
Glu Lys Asp Glu Glu Leu Val Val Tyr His Leu Asn Phe Thr Ser Arg 
125 130 135 
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Lys Tyr Asn Phe Asp Phe Asp Val lie Phe Tyr Ser Phe Gly Glu Gin 
185 190 195 

TTA ATG CCT ATT CTT ACT AGA ATG TTA GTT TCT GTC TCT AAG TCT CAT 67 8 

Leu Met Pro lie Leu Thr Arg Met Leu Val Ser Val Ser Lys Ser His 
200 205 210 

AGA AAG AGA CTT GAA AAC TAT GGC AAA GAG ATT AAA ACC TAATTTAGAT AA 72 9 
Arg Lys Arg Leu Glu Asn Tyr Gly Lys Asp lie Lys Thr 
215 220 225 

AGATGAGTTA AACACA 74 5 

(2) INFORMATION FOR SEQ ID NO: 1318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1318:

Met Gly Gln Lys Arg Met Asn Lys Ser Asn Lys Leu Val Ile Ile Asn

1 5 10 15

Arg Ala Ile Pro Gly Gly Gly Lys Thr Ser Leu Ile Lys Gln Ile Glu

20 25 30

Glu Leu Ala Lys Ser Leu Gly His Ser Ile Ser Val His Ser Thr Asp

35 40 45

Glu Tyr Phe Ile Gln Thr Asp Glu Glu Gly Ile Arg His Tyr Val Val

50 55 60

Asp Lys Lys Lys Leu Asn Glu Tyr His Gln Asn Asn Gln Glu Ala Phe

65 70 75 80

Lys Gln Ala Leu Glu Asn Arg Ile Asp Ile Val Val Cys Asp Asn Thr

85 90 95

Asn Phe Glu Ser Trp Gln Ser Lys Pro Tyr Thr Asp Met Ala Arg Glu

100 105 110

Phe Gly Tyr Lys Ile Leu Leu Ile Asp Phe Lys Asn Arg His Leu Glu

115 120 125

Thr Pro Met Asp Tyr Gly Trp Asp Val Ala Gln Cys Ile Lys Lys Pro

130 135 140

Arg Gly Ile Ala Lys His Tyr Asp Tyr Asp Phe Tyr Leu Glu Arg Val

145 150 155 160

Leu Val Glu Pro Gln Asp Tyr Glu Lys Gln Asn Arg Glu Leu Ser Leu

165 170 175

Lys Ala Leu Glu Phe Leu Lys Tyr Asn Phe Asp Phe Asp Val Ile Phe

180 185 190

Tyr Ser Phe Gly Glu Gln Leu Met Pro Ile Leu Thr Arg Met Leu Val

195 200 205

Ser Val Ser Lys Ser His Arg Lys Arg Leu Glu Asn Tyr Gly Lys Asp

210 215 220

Ile Lys Thr

225
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1317:

TAAGGGGGGA TTTGGAAGGT GTTGGAATTG AAATTA ATG GGA CAG AAG AGG ATG 54
Met Gly Gln Lys Arg Met
1 5

AAT AAA TCA AAC AAA TTA GTC ATT ATC AAT CGC GCC ATT CCA GGT GGG 102
Asn Lys Ser Asn Lys Leu Val Ile Ile Asn Arg Ala Ile Pro Gly Gly
10 15 20

GGC AAG ACC TCT TTG ATC AAA CAG ATT GAA GAG TTG GCA AAA AGC TTG 150
Gly Lys Thr Ser Leu Ile Lys Gln Ile Glu Glu Leu Ala Lys Ser Leu
25 30 35

GGG CAT TCT ATT AGC GTT CAT TCT ACC GAT GAA TAT TTC ATC CAA ACA 198
Gly His Ser Ile Ser Val His Ser Thr Asp Glu Tyr Phe Ile Gln Thr
40 45 50

GAT GAA GAG GGT ATC AGG CAT TAT GTT GTT GAT AAA AAG AAA CTC AAT 246
Asp Glu Glu Gly Ile Arg His Tyr Val Val Asp Lys Lys Lys Leu Asn
55 60 65 70

GAA TAC CAC CAA AAC AAT CAA GAA GCC TTC AAA CAA GCT TTA GAA AAT 294
Glu Tyr His Gln Asn Asn Gln Glu Ala Phe Lys Gln Ala Leu Glu Asn
75 80 85

CGT ATA GAT ATT GTA GTG TGC GAT AAC ACC AAT TTT GAA TCG TGG CAA 342
Arg Ile Asp Ile Val Val Cys Asp Asn Thr Asn Phe Glu Ser Trp Gln
90 95 100

AGC AAA CCA TAT ACA GAT ATG GCT AGA GAA TTT GGC TAT AAA ATT TTG 390
Ser Lys Pro Tyr Thr Asp Met Ala Arg Glu Phe Gly Tyr Lys Ile Leu
105 110 115

TTG ATT GAT TTT AAG AAT AGA CAC TTA GAA ACC CCC ATG GAT TAT GGA 438
Leu Ile Asp Phe Lys Asn Arg His Leu Glu Thr Pro Met Asp Tyr Gly
120 125 130

TGG GAT GTT GCG CAA TGC ATC AAG AAG CCA CGA GGT ATT GCA AAG CAT 486
Trp Asp Val Ala Gln Cys Ile Lys Lys Pro Arg Gly Ile Ala Lys His
135 140 145 150

TAT GAC TAT GAT TTT TAT TTG GAG AGG GTT TTG GTT GAG CCA CAG GAT 534
Tyr Asp Tyr Asp Phe Tyr Leu Glu Arg Val Leu Val Glu Pro Gln Asp
155 160 165

TAT GAG AAA CAA AAT AGA GAG TTG AGC TTA AAA GCC TTA GAA TTT TTG 582
Tyr Glu Lys Gln Asn Arg Glu Leu Ser Leu Lys Ala Leu Glu Phe Leu
170 175 180

AAA TAC AAT TTT GAT TTT GAT GTG ATT TTT TAT TCT TTT GGG GAG CAA 630
35 40 45

Leu Lys Gln Leu Glu Asn Lys Glu Asn Leu Lys Ala Asp Asn Asn Thr
50 55 60

Glu Phe Lys Glu Glu Asn Gln Asp Thr Lys Glu Asn Gln Pro Asn Asp
65 70 75 80

Leu Phe Ser Leu Pro Leu Pro Thr Gln Thr Thr Ile Asn Gly Ile Lys
85 90 95

Glu Phe Val Glu Glu Pro Val Ile Glu Thr Glu Lys Lys Glu Thr Ser
100 105 110

Gln Asn Glu Pro Ile Gln Glu Lys Lys Glu Arg Ile Phe Lys Asn Phe
115 120 125

Phe Ser Arg Ile Gly Phe Asp Lys Ser Ile Ala Pro Thr Met Leu Phe
130 135 140

Glu Glu Val Arg Asp Ala Ser Val Ile Tyr His Leu Glu Lys Lys Leu
145 150 155 160

Gly Asp Tyr Ile Phe Tyr Val Ala Cys Phe Phe Phe Gly Thr Thr Ala
165 170 175

Leu Leu Ile Ile Leu Leu Thr Ile Leu Leu Pro Leu Lys Gln Lys Glu
180 185 190

Pro Tyr Leu Val Gln Phe Ser Asn Asn Lys Glu Asn Phe Ala Leu Val
195 200 205

Gln Lys Ala Asp Ser Ser Ile Thr Ala Asn Lys Ala Leu Ile Arg Ser
210 215 220

Leu Val Gly Ala Tyr Val Leu Asn Arg Glu Ser Ile Thr His Ile Glu
225 230 235 240

Gln His Glu Lys Met Arg Gln Asn Thr Ile Lys Glu Gln Ser Ser Asn
245 250 255

Glu Val Trp Tyr Glu Phe Glu Lys Leu Ile Ala His Tyr Asp Ser Ile
260 265 270

Tyr Thr Asn Pro Leu Leu Thr Arg Lys Val Lys Ile Ala Asn Ile Tyr
275 280 285

Leu Asp Lys Asp Leu Ala Tyr Ile Asp Ile Glu Val Ser Leu Tyr His
290 295 300

Ser Gly Glu Leu Glu Ser Leu Lys Arg Tyr Lys Val Val Met Ser Phe
305 310 315 320

Glu Phe Lys Lys Gln Glu Ile Asn Phe Asp Ser Met Ser Leu Asn Pro
325 330 335

Thr Gly Phe Met Val Thr Ser Tyr Asp Val Thr Glu Ile Ala Ile Val
340 345 350

Asn Tyr Pro Thr Ala Lys Ala Ile Gly Leu Phe Leu Ala Ser
355 360 365



(2) INFORMATION FOR SEQ ID NO: 1317: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 37...717
ACT CAT ATT GAG CAA CAT GAA AAA ATG CGT CAA AAC ACC ATT AAA GAG 771
Thr His Ile Glu Gln His Glu Lys Met Arg Gln Asn Thr Ile Lys Glu
240 245 250

CAA AGT TCC AAT GAA GTA TGG TAT GAA TTT GAA AAA CTC ATC GCT CAT 819
Gln Ser Ser Asn Glu Val Trp Tyr Glu Phe Glu Lys Leu Ile Ala His
255 260 265

TAT GAC AGC ATT TAC ACT AAT CCT TTA CTC ACA AGA AAA GTA AAG ATT 867
Tyr Asp Ser Ile Tyr Thr Asn Pro Leu Leu Thr Arg Lys Val Lys Ile
270 275 280

GCA AAT ATT TAC TTA GAT AAA GAT TTA GCC TAT ATT GAC ATT GAA GTG 915
Ala Asn Ile Tyr Leu Asp Lys Asp Leu Ala Tyr Ile Asp Ile Glu Val
285 290 295 300

AGC TTG TAT CAT AGT GGA GAA TTA GAG AGC TTG AAG CGC TAT AAA GTG 963
Ser Leu Tyr His Ser Gly Glu Leu Glu Ser Leu Lys Arg Tyr Lys Val
305 310 315

GTG ATG AGT TTT GAA TTT AAA AAA CAA GAA ATC AAT TTT GAC TCC ATG 1011
Val Met Ser Phe Glu Phe Lys Lys Gln Glu Ile Asn Phe Asp Ser Met
320 325 330

TCT TTA AAT CCT ACA GGC TTT ATG GTT ACA AGT TAT GAT GTA ACT GAA 1059
Ser Leu Asn Pro Thr Gly Phe Met Val Thr Ser Tyr Asp Val Thr Glu
335 340 345

ATT GCG ATT GTG AAT TAC CCA ACC GCT AAA GCG ATT GGG CTT TTT CTT 1107
Ile Ala Ile Val Asn Tyr Pro Thr Ala Lys Ala Ile Gly Leu Phe Leu
350 355 360

GCT TCA TAGCTCCATA ACTAGCTAGA TCCAATATGT TTCCATATTT AGAACTAACC CC 1165
Ala Ser
365

GTTAGAGGAA GCTCCACAAG 1185
(2) INFORMATION FOR SEQ ID NO: 1316:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 366 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1316:

Met Ser Lys Arg Ser Glu Val Leu Glu Gln Phe His Gly Gly Leu Lys
1 5 10 15

Asn Leu Glu Leu Gln Thr Lys Arg Arg Met Gly Leu Trp Gly Asp Pro
20 25 30

Lys Glu Asn Glu Glu Gln Thr Leu Phe Leu Glu Glu Ile Glu Asn Glu
GGC GGT TTA AAA AAT TTA GAA TTA CAA ACT AAA AGA CGC ATG GGT TTG 99
Gly Gly Leu Lys Asn Leu Glu Leu Gln Thr Lys Arg Arg Met Gly Leu
15 20 25

TGG GGC GAT CCA AAA GAG AAT GAA GAA CAA ACT TTG TTT TTA GAA GAA 147
Trp Gly Asp Pro Lys Glu Asn Glu Glu Gln Thr Leu Phe Leu Glu Glu
30 35 40

ATT GAA AAT GAA TTA AAG CAA TTA GAA AAC AAA GAA AAT CTT AAA GCA 195
Ile Glu Asn Glu Leu Lys Gln Leu Glu Asn Lys Glu Asn Leu Lys Ala
45 50 55 60

GAC AAC AAC ACA GAA TTT AAA GAA GAA AAT CAA GAC ACT AAA GAA AAC 243
Asp Asn Asn Thr Glu Phe Lys Glu Glu Asn Gln Asp Thr Lys Glu Asn
65 70 75

CAG CCT AAC GAT TTG TTT TCT TTG CCA TTG CCC ACT CAA ACC ACC ATC 291
Gln Pro Asn Asp Leu Phe Ser Leu Pro Leu Pro Thr Gln Thr Thr Ile
80 85 90

AAT GGA ATT AAA GAA TTT GTA GAA GAG CCT GTG ATA GAA ACA GAG AAA 339
Asn Gly Ile Lys Glu Phe Val Glu Glu Pro Val Ile Glu Thr Glu Lys
95 100 105

AAA GAA ACA TCC CAA AAT GAG CCA ATC CAA GAA AAA AAA GAA AGA ATT 387
Lys Glu Thr Ser Gln Asn Glu Pro Ile Gln Glu Lys Lys Glu Arg Ile
110 115 120

TTT AAA AAC TTT TTC TCC AGA ATA GGC TTT GAT AAA AGT ATT GCC CCT 435
Phe Lys Asn Phe Phe Ser Arg Ile Gly Phe Asp Lys Ser Ile Ala Pro
125 130 135 140

ACA ATG CTT TTT GAA GAA GTG AGA GAT GCA AGC GTT ATC TAT CAT TTA 483
Thr Met Leu Phe Glu Glu Val Arg Asp Ala Ser Val Ile Tyr His Leu
145 150 155

GAG AAA AAA TTA GGC GAT TAT ATC TTT TAT GTA GCG TGT TTC TTC TTT 531
Glu Lys Lys Leu Gly Asp Tyr Ile Phe Tyr Val Ala Cys Phe Phe Phe
160 165 170

GGC ACA ACG GCA TTG CTT ATT ATC TTA CTG ACT ATT CTG TTG CCC TTA 579
Gly Thr Thr Ala Leu Leu Ile Ile Leu Leu Thr Ile Leu Leu Pro Leu
175 180 185

AAA CAA AAA GAG CCG TAT TTA GTG CAA TTT TCT AAC AAT AAA GAA AAT 627
Lys Gln Lys Glu Pro Tyr Leu Val Gln Phe Ser Asn Asn Lys Glu Asn
190 195 200

TTT GCT TTA GTT CAA AAG GCA GAT AGC AGC ATT ACA GCC AAT AAA GCT 675
Phe Ala Leu Val Gln Lys Ala Asp Ser Ser Ile Thr Ala Asn Lys Ala
205 210 215 220

CTT ATT CGT TCA TTA GTG GGA GCG TAT GTG CTA AAC AGG GAA AGC ATT 723
Leu Ile Arg Ser Leu Val Gly Ala Tyr Val Leu Asn Arg Glu Ser Ile
225 230 235
Cys Val Leu Pro Tyr Glu Tyr Glu Asn Lys Glu Asp Lys Glu Asn Ser
145 150 155 160

Glu Glu Ile Leu Lys Glu Asp Phe Arg Leu Leu Asn Thr Lys Gly Gly
165 170 175

Leu Ser Val Lys Arg Ala Leu Ile Asn Asn Arg His Ser Tyr Glu Ala
180 185 190

Ile Lys Leu Arg Pro Ile Lys Gln Glu Leu Val Pro Gly Leu Cys Leu
195 200 205

Phe Phe Gln Gly Ser Leu Glu Phe Asn Asp Lys Thr Thr Lys Thr Met
210 215 220

Arg Thr Ser Leu Leu Asp Gln Ile Gln Gln Asp Asp Lys Ser Tyr Leu
225 230 235 240

Lys Ile Trp Glu Lys Tyr Leu Ile Lys Ser Ala Gln Lys Ser Phe Asn
245 250 255

Glu Ala Lys Glu Val Gly Val Leu Glu Ile Glu Ser Val Ser Lys Glu
260 265 270

Gly Gly Asn Leu Arg Ile Arg Phe Lys Pro Ala Leu Gly Lys Asn Lys
275 280 285

Met Glu Ile Leu Lys Lys Ser Gln Phe Lys Lys Gly Ser Asp Leu Gly
290 295 300

Val Leu Glu Asp Leu Asp Pro Gln Asn Glu Glu Asn Leu Ile Asn Leu
305 310 315 320

Ile Ser Glu Gln Lys Lys Gln Ile Ser Lys Asn Asn Ser Gln Ser Ile
325 330 335

Met Ile Glu Asp Ile Ser Gly Asp Asp Phe Ile Ile Asp Tyr Asp Leu
340 345 350

Ser Ile Lys Glu Gly Asp Ala Phe His Leu Asn Tyr Met Gly Asp Leu
355 360 365

Asn Thr Leu Lys Lys Gln Tyr Ser Ala Leu Asp Lys Thr Lys Lys Gly
370 375 380

Leu Lys Arg Gln Ser
385

(2) INFORMATION FOR SEQ ID NO: 1315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...1113
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1315:

TCCATAGGCT AGTTG ATG TCA AAA AGA AGC GAA GTT TTA GAA CAA TTT CAT 51
Met Ser Lys Arg Ser Glu Val Leu Glu Gln Phe His
1 5 10
Lys Lys Gly Ser Asp Leu Gly Val Leu Glu Asp Leu Asp Pro Gln Asn
300 305 310

GAA GAA AAT TTA ATC AAT CTT ATT TCT GAA CAA AAG AAA CAA ATT TCT 1013
Glu Glu Asn Leu Ile Asn Leu Ile Ser Glu Gln Lys Lys Gln Ile Ser
315 320 325

AAA AAC AAC AGC CAA TCA ATA ATG ATT GAA GAC ATT AGT GGG GAT GAT 1061
Lys Asn Asn Ser Gln Ser Ile Met Ile Glu Asp Ile Ser Gly Asp Asp
330 335 340 345

TTT ATT ATA GAT TAC GAT CTT TCC ATA AAA GAG GGC GAT GCT TTT CAT 1109
Phe Ile Ile Asp Tyr Asp Leu Ser Ile Lys Glu Gly Asp Ala Phe His
350 355 360

TTA AAT TAT ATG GGG GAT CTA AAT ACG CTT AAA AAA CAA TAT AGC GCA 1157
Leu Asn Tyr Met Gly Asp Leu Asn Thr Leu Lys Lys Gln Tyr Ser Ala
365 370 375

TTA GAT AAG ACA AAG AAA GGT TTG AAG CGC CAA TCC TAATTTAGGA TTAATT 1209
Leu Asp Lys Thr Lys Lys Gly Leu Lys Arg Gln Ser
380 385

TTAAACATTA AAGAGGATAA AGAGAATAGT GATAGCGATA ATGATACTGC A 1260

(2) INFORMATION FOR SEQ ID NO: 1314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1314:

Met Leu Ser Phe Ile Lys Glu Asp Ser Ile Ile Lys Ala Tyr Asn Leu
1 5 10 15

Asn Thr Ala Lys Leu Glu Pro Lys Asp Arg Glu Lys Leu Gly Leu Leu
20 25 30

Lys Ile Glu Lys Asn Lys Ile Tyr Phe His Leu Asp Glu Lys Arg Tyr
35 40 45

Leu Lys Leu Glu Ile Ile Gly Lys Thr Lys Glu Lys Glu Ile Lys Asn
50 55 60

Ala Phe Cys Ser Asn Ala Phe Leu Ala Ala Gln Val Leu Asn Leu Asn
65 70 75 80

Gln Glu Arg Gln Val Leu Glu Leu Lys Cys His Phe Phe Lys His Pro
85 90 95

Ile Lys Ile Leu Pro Glu Pro Leu Asn Ile Asn Phe Lys Asp Thr Ile
100 105 110

Ile Lys Lys Leu Leu Lys Asp Met Gly Lys Asp Lys Lys Ile Glu Asp
115 120 125

Phe Lys Glu Thr Cys Ile Leu Lys Ile Ala Gly Phe Thr Tyr Phe Val
130 135 140
75 80 85

TGC CAT TTC TTC AAG CAC CCT ATA AAA ATT CTT CCT GAA CCA TTA AAC 341
Cys His Phe Phe Lys His Pro Ile Lys Ile Leu Pro Glu Pro Leu Asn
90 95 100 105

ATT AAT TTC AAA GAC ACA ATC ATA AAA AAG TTA CTA AAA GAT ATG GGC 389
Ile Asn Phe Lys Asp Thr Ile Ile Lys Lys Leu Leu Lys Asp Met Gly
110 115 120

AAA GAT AAA AAA ATA GAA GAT TTT AAA GAA ACT TGT ATT TTA AAA ATA 437
Lys Asp Lys Lys Ile Glu Asp Phe Lys Glu Thr Cys Ile Leu Lys Ile
125 130 135

GCT GGT TTT ACT TAT TTT GTG TGC GTA TTG CCT TAT GAA TAT GAG AAT 485
Ala Gly Phe Thr Tyr Phe Val Cys Val Leu Pro Tyr Glu Tyr Glu Asn
140 145 150

AAA GAG GAT AAA GAG AAT AGT GAA GAG ATT TTA AAA GAA GAT TTC AGG 533
Lys Glu Asp Lys Glu Asn Ser Glu Glu Ile Leu Lys Glu Asp Phe Arg
155 160 165

CTG TTA AAT ACC AAG GGG GGA TTA AGC GTT AAG CGT GCT TTG ATA AAT 581
Leu Leu Asn Thr Lys Gly Gly Leu Ser Val Lys Arg Ala Leu Ile Asn
170 175 180 185

AAC AGG CAT TCT TAT GAA GCG ATA AAA TTA AGA CCC ATT AAA CAA GAG 629
Asn Arg His Ser Tyr Glu Ala Ile Lys Leu Arg Pro Ile Lys Gln Glu
190 195 200

TTA GTG CCT GGT TTG TGT TTG TTT TTT CAA GGT TCA TTA GAA TTT AAT 677
Leu Val Pro Gly Leu Cys Leu Phe Phe Gln Gly Ser Leu Glu Phe Asn
205 210 215

GAT AAA ACC ACA AAA ACC ATG CGA ACG AGC CTT TTA GAC CAG ATC CAG 725
Asp Lys Thr Thr Lys Thr Met Arg Thr Ser Leu Leu Asp Gln Ile Gln
220 225 230

CAA GAT GAC AAA TCT TAT TTA AAA ATT TGG GAA AAA TAT CTC ATC AAA 773
Gln Asp Asp Lys Ser Tyr Leu Lys Ile Trp Glu Lys Tyr Leu Ile Lys
235 240 245

AGC GCT CAA AAA AGT TTT AAT GAG GCA AAA GAA GTG GGG GTT TTA GAG 821
Ser Ala Gln Lys Ser Phe Asn Glu Ala Lys Glu Val Gly Val Leu Glu
250 255 260 265

ATT GAA AGC GTG AGT AAA GAA GGA GGG AAT TTA AGA ATT CGT TTT AAG 869
Ile Glu Ser Val Ser Lys Glu Gly Gly Asn Leu Arg Ile Arg Phe Lys
270 275 280

CCA GCT TTA GGC AAG AAT AAA ATG GAA ATC TTA AAG AAA TCA CAA TTT 917
Pro Ala Leu Gly Lys Asn Lys Met Glu Ile Leu Lys Lys Ser Gln Phe
285 290 295

AAA AAG GGG AGT GAT TTA GGG GTT TTA GAG GAT TTA GAC CCA CAA AAT 965
35 40 45

Val Ala Thr Ser Tyr Tyr Gln Lys Ser Glu Val Phe Tyr Phe Val Ser
50 55 60

Asn Phe Thr Asp Pro Ile Phe Leu Pro Thr Thr Leu Phe Lys Gly Ile
65 70 75 80

Glu Val Val Lys Ser Glu Trp Glu Phe Ala Glu Ile Ala Asn Asn Ile
85 90 95

Leu Ile Phe His His Asp Ile Gln Gln Glu Lys Ile Leu Ile Asn Asn
100 105 110

Lys Arg Leu Asp His Tyr Arg Ile Glu Ile Asp Leu Glu Lys Glu Leu
115 120 125

Thr Ile Ser Tyr Asn Gly Phe Leu Ile Lys Val Gln Lys Cys
130 135 140

(2) INFORMATION FOR SEQ ID NO: 1313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 27...1193
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1313:

AATGGTTTTT TAATTAAGGT TCAAAA ATG CTG AGT TTT ATC AAA GAA GAT AGC 53
Met Leu Ser Phe Ile Lys Glu Asp Ser
1 5

ATC ATC AAG GCT TAT AAC CTC AAT ACC GCA AAA CTA GAG CCA AAA GAT 101
Ile Ile Lys Ala Tyr Asn Leu Asn Thr Ala Lys Leu Glu Pro Lys Asp
10 15 20 25

AGA GAA AAA TTG GGA TTA TTA AAG ATT GAA AAA AAT AAA ATA TAT TTT 149
Arg Glu Lys Leu Gly Leu Leu Lys Ile Glu Lys Asn Lys Ile Tyr Phe
30 35 40

CAT CTA GAT GAA AAG CGT TAT TTG AAA TTA GAG ATC ATA GGC AAA ACC 197
His Leu Asp Glu Lys Arg Tyr Leu Lys Leu Glu Ile Ile Gly Lys Thr
45 50 55

AAA GAA AAA GAA ATT AAA AAC GCT TTT TGC AGT AAT GCT TTT CTT GCA 245
Lys Glu Lys Glu Ile Lys Asn Ala Phe Cys Ser Asn Ala Phe Leu Ala
60 65 70

GCT CAA GTC CTA AAT TTA AAC CAA GAA AGA CAA GTT TTA GAA TTG AAG 293
Ala Gln Val Leu Asn Leu Asn Gln Glu Arg Gln Val Leu Glu Leu Lys
ATA GAG AGC TTA GAA AAA GCT AGC TTG CAA GTG TTA GAA TGT GAA AAT 103
Ile Glu Ser Leu Glu Lys Ala Ser Leu Gln Val Leu Glu Cys Glu Asn
5 10 15 20

TGT TCA ATG ACT TAT TAT GAT AGA GAT TAT AAT AGA GAA TGT GAG ATT 151
Cys Ser Met Thr Tyr Tyr Asp Arg Asp Tyr Asn Arg Glu Cys Glu Ile
25 30 35

TGC CCT TAT TGC GAT GCT AAA AAA CCT GTC AGA CTT GTA GCA ACA AGT 199
Cys Pro Tyr Cys Asp Ala Lys Lys Pro Val Arg Leu Val Ala Thr Ser
40 45 50

TAT TAC CAA AAG AGC GAA GTT TTT TAT TTT GTC TCG AAT TTT ACA GAC 247
Tyr Tyr Gln Lys Ser Glu Val Phe Tyr Phe Val Ser Asn Phe Thr Asp
55 60 65

CCT ATT TTT TTA CCG ACA ACC TTA TTT AAG GGG ATT GAA GTG GTT AAA 295
Pro Ile Phe Leu Pro Thr Thr Leu Phe Lys Gly Ile Glu Val Val Lys
70 75 80

AGC GAA TGG GAG TTT GCA GAG ATT GCT AAT AAT ATA TTG ATT TTT CAT 343
Ser Glu Trp Glu Phe Ala Glu Ile Ala Asn Asn Ile Leu Ile Phe His
85 90 95 100

CAT GAC ATA CAA CAA GAA AAG ATT CTC ATT AAT AAT AAA AGA TTG GAT 391
His Asp Ile Gln Gln Glu Lys Ile Leu Ile Asn Asn Lys Arg Leu Asp
105 110 115

CAC TAT AGG ATA GAA ATA GAT TTA GAA AAA GAA TTG ACT ATT TCA TAC 439
His Tyr Arg Ile Glu Ile Asp Leu Glu Lys Glu Leu Thr Ile Ser Tyr
120 125 130

AAT GGT TTT TTA ATT AAG GTT CAA AAA TGC TGAGTTTTAT CAAAGAAGAT AGC 492
Asn Gly Phe Leu Ile Lys Val Gln Lys Cys
135 140

ATCATCAAGG CTTATAA 509

(2) INFORMATION FOR SEQ ID NO: 1312:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 142 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1312:

Met Pro Leu Phe Ile Glu Ser Leu Glu Lys Ala Ser Leu Gln Val Leu
1 5 10 15

Glu Cys Glu Asn Cys Ser Met Thr Tyr Tyr Asp Arg Asp Tyr Asn Arg
20 25 30

Glu Cys Glu Ile Cys Pro Tyr Cys Asp Ala Lys Lys Pro Val Arg Leu
50 55 60

Ser Phe Lys Pro Ile Glu Ser His Phe Pro Met Ser Ile Pro Leu Val
65 70 75 80

Thr Leu Lys Glu Lys Gln Gly Tyr Val Met Lys Met Ala Glu Gly Tyr
85 90 95

Glu Pro Leu Lys Thr Phe Leu Lys Lys Pro Ser Ile Leu Glu Asn Glu
100 105 110

Glu Lys Asp Gly Ile Phe Arg Ile Asn Asn Ala Ile Gln Glu Leu Cys
115 120 125

Lys Asp Asn His Tyr Met Thr Leu Ser Leu Ser Tyr Tyr Ser Gln Thr
130 135 140

Gln Gly Leu Arg Ser Arg Leu Lys Ile Leu Thr His Leu Ala Lys Leu
145 150 155 160

Leu Phe Arg Leu Gln Ser Lys Gly Leu Val Tyr Gly Asp Leu Asn Leu
165 170 175

Asn Asn Val Phe Tyr Lys Asp Asn Ser Ala Phe Leu Ile Asp Ala Asp
180 185 190

Asn Val Arg Tyr Glu Ser Glu Lys Ala Leu Cys Val Ile Phe Thr Pro
195 200 205

Asn Tyr Gly Ala Leu Glu Ile Ser Gln Thr Ser Lys Asn Ser Asp Thr
210 215 220

Thr Asn Tyr Asn Thr Met Leu Ser Asp Thr Phe Ser Phe Ala Ile Ile
225 230 235 240

Thr Tyr Glu Leu Leu Asn Met Val His Pro Phe Asp Gly Asn Lys Ala
245 250 255

Asp Asp Ser Val Glu Asn Phe Ile Glu Leu Pro Trp Ile Glu Asp Arg
260 265 270

Lys Asp Asp Ser Asn Arg Ser Cys Gly Leu Leu Pro Phe Phe Leu Thr
275 280 285

Arg Asp Leu Lys Asn Leu Leu Ala
290 295

(2) INFORMATION FOR SEQ ID NO: 1311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 44...469
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1311:

ATGCTTTGAA GAAGGCAAAA AAGATCCTTT GAAACGCCCT ACT ATG CCC TTA TTT 55
Met Pro Leu Phe
1
160 165 170

GAC TTG AAT TTA AAC AAT GTT TTT TAT AAA GAC AAT TCA GCG TTT TTA 579
Asp Leu Asn Leu Asn Asn Val Phe Tyr Lys Asp Asn Ser Ala Phe Leu
175 180 185

ATT GAT GCG GAT AAT GTG CGT TAT GAG AGC GAA AAA GCC CTG TGT GTT 627
Ile Asp Ala Asp Asn Val Arg Tyr Glu Ser Glu Lys Ala Leu Cys Val
190 195 200

ATT TTT ACG CCT AAC TAT GGG GCT TTA GAG ATT AGC CAA ACC TCT AAA 675
Ile Phe Thr Pro Asn Tyr Gly Ala Leu Glu Ile Ser Gln Thr Ser Lys
205 210 215 220

AAT AGC GAT ACA ACC AAT TAC AAC ACC ATG CTT AGC GAT ACC TTT TCT 723
Asn Ser Asp Thr Thr Asn Tyr Asn Thr Met Leu Ser Asp Thr Phe Ser
225 230 235

TTT GCT ATC ATA ACT TAT GAA CTT TTA AAT ATG GTT CAT CCT TTT GAT 771
Phe Ala Ile Ile Thr Tyr Glu Leu Leu Asn Met Val His Pro Phe Asp
240 245 250

GGG AAT AAG GCA GAT GAT AGT GTA GAA AAT TTT ATA GAA TTG CCT TGG 819
Gly Asn Lys Ala Asp Asp Ser Val Glu Asn Phe Ile Glu Leu Pro Trp
255 260 265

ATT GAA GAT AGA AAG GAT GAT AGC AAT CGT TCT TGT GGC TTA CTG CCT 867
Ile Glu Asp Arg Lys Asp Asp Ser Asn Arg Ser Cys Gly Leu Leu Pro
270 275 280

TTT TTC TTA ACA AGG GAT TTA AAA AAT TTA TTA GCG TAATGCTTTG AAGAAG 919
Phe Phe Leu Thr Arg Asp Leu Lys Asn Leu Leu Ala
285 290 295

GCAAAAAAGA TCCTTTGAAA CGCCCTAC 947

(2) INFORMATION FOR SEQ ID NO: 1310:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 296 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1310:

Met Glu Leu Glu Glu Ile Val Asp Ser Glu Arg Asn Ile His Lys Thr
1 5 10 15

Ile Glu Val Leu Gly Lys Gly Gly Gln Gly Ile Val Tyr Arg Cys Leu
20 25 30

Asp Lys Asp Val Ala Ile Lys Val Val Leu Arg Asp Gly Asp Phe Ile
35 40 45

Lys Asp Lys Glu Ser Leu Lys Gln Tyr Glu Lys Ser Val Leu Asn Leu
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...903
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1309:

TAGAAAAGGT AGTTT ATG GAG TTA GAA GAA ATT GTT GAT AGT GAG AGG AAT 51
Met Glu Leu Glu Glu Ile Val Asp Ser Glu Arg Asn
1 5 10

ATC CAT AAG ACT ATA GAA GTT TTA GGA AAA GGC GGA CAG GGT ATA GTG 99
Ile His Lys Thr Ile Glu Val Leu Gly Lys Gly Gly Gln Gly Ile Val
15 20 25

TAT CGC TGT TTG GAT AAG GAT GTG GCT ATT AAG GTA GTA TTG AGG GAT 147
Tyr Arg Cys Leu Asp Lys Asp Val Ala Ile Lys Val Val Leu Arg Asp
30 35 40

GGA GAT TTT ATT AAA GAC AAA GAA TCC CTC AAA CAA TAT GAA AAA AGC 195
Gly Asp Phe Ile Lys Asp Lys Glu Ser Leu Lys Gln Tyr Glu Lys Ser
45 50 55 60

GTT CTA AAC TTA TCT TTT AAG CCG ATA GAG AGT CAT TTC CCT ATG TCA 243
Val Leu Asn Leu Ser Phe Lys Pro Ile Glu Ser His Phe Pro Met Ser
65 70 75

ATT CCA CTG GTA ACT TTG AAA GAA AAA CAA GGC TAT GTG ATG AAA ATG 291
Ile Pro Leu Val Thr Leu Lys Glu Lys Gln Gly Tyr Val Met Lys Met
80 85 90

GCT GAG GGC TAT GAA CCA CTA AAA ACT TTT TTA AAG AAG CCC AGC ATT 339
Ala Glu Gly Tyr Glu Pro Leu Lys Thr Phe Leu Lys Lys Pro Ser Ile
95 100 105

TTA GAA AAC GAA GAA AAA GAT GGG ATT TTT AGG ATC AAT AAT GCC ATT 387
Leu Glu Asn Glu Glu Lys Asp Gly Ile Phe Arg Ile Asn Asn Ala Ile
110 115 120

CAA GAA CTT TGC AAA GAT AAC CAT TAT ATG ACT TTA AGT TTA AGT TAT 435
Gln Glu Leu Cys Lys Asp Asn His Tyr Met Thr Leu Ser Leu Ser Tyr
125 130 135 140

TAC TCA CAA ACA CAA GGA TTG AGA TCA CGA TTA AAA ATA CTC ACC CAT 483
Tyr Ser Gln Thr Gln Gly Leu Arg Ser Arg Leu Lys Ile Leu Thr His
145 150 155

TTA GCA AAA CTT CTA TTC AGA TTG CAA AGT AAG GGT TTG GTG TAT GGG 531
Leu Ala Lys Leu Leu Phe Arg Leu Gln Ser Lys Gly Leu Val Tyr Gly
Thr Ser Met Thr Ala Phe Arg Phe Phe Glu Ala Leu Lys Lys Arg Gly
20 25 30

His Val Met Arg Val Val Ala Pro His Val Asp Asn Leu Gly Ser Glu
35 40 45

Glu Glu Gly Tyr Tyr Asn Leu Lys Glu Arg Tyr Ile Pro Leu Val Thr
50 55 60

Glu Ile Ser His Lys Gln His Ile Leu Phe Ala Lys Pro Asp Glu Lys
65 70 75 80

Ile Leu Arg Lys Ala Phe Lys Gly Ala Asp Met Ile His Thr Tyr Leu
85 90 95

Pro Phe Leu Leu Glu Lys Thr Ala Val Lys Ile Ala Arg Glu Met Gln
100 105 110

Val Pro Tyr Ile Gly Ser Phe His Leu Gln Pro Glu His Ile Ser Tyr
115 120 125

Asn Met Lys Leu Gly Trp Phe Ser Trp Phe Asn Met Met Leu Phe Ser
130 135 140

Trp Phe Lys Ser Ser His Tyr Arg Tyr Ile His His Ile His Cys Pro
145 150 155 160

Ser Lys Phe Ile Val Glu Glu Leu Glu Lys Tyr Asn Tyr Gly Gly Lys
165 170 175

Lys Tyr Ala Ile Ser Asn Gly Phe Asp Pro Met Phe Arg Phe Glu His
180 185 190

Pro Gln Lys Ser Leu Phe Asp Thr Thr Pro Phe Lys Ile Ala Met Val
195 200 205

Gly Arg Tyr Ser Asn Glu Lys Asn Gln Ser Val Leu Ile Lys Ala Val
210 215 220

Ala Leu Ser Lys Tyr Lys Gln Asp Ile Val Leu Leu Leu Lys Gly Lys
225 230 235 240

Gly Pro Asp Glu Lys Lys Ile Lys Leu Leu Ala Gln Lys Leu Gly Val
245 250 255

Lys Ala Glu Phe Gly Phe Val Asn Ser Asn Glu Leu Leu Glu Ile Leu
260 265 270

Lys Thr Cys Thr Leu Tyr Val His Ala Ala Asn Val Glu Ser Glu Ala
275 280 285

Ile Ala Cys Leu Glu Ala Ile Ser Val Gly Ile Val Pro Val Ile Ala
290 295 300

Asn Ser Pro Leu Ser Ala Thr Arg Gln Phe Ala Leu Asp Glu Arg Ser
305 310 315 320

Leu Phe Glu Pro Asn Asn Ala Lys Asp Leu Ser Ala Lys Ile Asp Trp
325 330 335

Trp Leu Glu Asn Lys Leu Glu Arg Glu Arg Met Gln Asn Glu Tyr Ala
340 345 350

Lys Ser Ala Leu Asn Tyr Thr Leu Glu Asn Ser Val Ile Gln Ile Glu
355 360 365

Lys Val Tyr Glu Glu Ala Ile Arg Asp Phe Lys Asn Asn Pro His Leu
370 375 380

Phe Lys Thr Leu Ser
385

(2) INFORMATION FOR SEQ ID NO: 1309:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 947 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
Ile Val Leu Leu Leu Lys Gly Lys Gly Pro Asp Glu Lys Lys Ile Lys
235 240 245

CTT TTA GCC CAA AAA CTA GGC GTA AAA GCG GAG TTT GGG TTT GTC AAT 822
Leu Leu Ala Gln Lys Leu Gly Val Lys Ala Glu Phe Gly Phe Val Asn
250 255 260

TCC AAT GAA TTG TTA GAG ATC TTA AAA ACT TGC ACC CTT TAT GTG CAT 870
Ser Asn Glu Leu Leu Glu Ile Leu Lys Thr Cys Thr Leu Tyr Val His
265 270 275 280

GCA GCC AAT GTG GAA AGC GAA GCG ATT GCG TGC TTA GAG GCC ATT AGC 918
Ala Ala Asn Val Glu Ser Glu Ala Ile Ala Cys Leu Glu Ala Ile Ser
285 290 295

GTG GGG ATT GTG CCT GTT ATC GCT AAT AGC CCT TTA AGC GCG ACC AGG 966
Val Gly Ile Val Pro Val Ile Ala Asn Ser Pro Leu Ser Ala Thr Arg
300 305 310

CAA TTT GCG CTA GAT GAA CGA TCG CTA TTT GAA CCT AAT AAC GCT AAA 1014
Gln Phe Ala Leu Asp Glu Arg Ser Leu Phe Glu Pro Asn Asn Ala Lys
315 320 325

GAT TTG AGC GCT AAA ATA GAT TGG TGG TTA GAA AAC AAG CTT GAA AGA 1062
Asp Leu Ser Ala Lys Ile Asp Trp Trp Leu Glu Asn Lys Leu Glu Arg
330 335 340

GAA AGG ATG CAA AAC GAA TAC GCT AAA AGC GCT TTA AAT TAC ACT TTA 1110
Glu Arg Met Gln Asn Glu Tyr Ala Lys Ser Ala Leu Asn Tyr Thr Leu
345 350 355 360

GAA AAT TCA GTC ATT CAA ATT GAA AAA GTT TAT GAA GAA GCG ATC AGA 1158
Glu Asn Ser Val Ile Gln Ile Glu Lys Val Tyr Glu Glu Ala Ile Arg
365 370 375

GAT TTT AAA AAT AAC CCC CAT CTC TTT AAA ACC TTA TCA TAATGAAAGG AT 1209
Asp Phe Lys Asn Asn Pro His Leu Phe Lys Thr Leu Ser
380 385

AAAAAATGCA AGAAG 1224

(2) INFORMATION FOR SEQ ID NO: 1308:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 389 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1308:

Met Val Ile Val Leu Val Val Asp Ser Phe Lys Asp Thr Ser Asn Gly
1 5 10 15
10 15 20

TTT GAA GCG CTG AAA AAA AGA GGG CAT GTG ATG AGA GTG GTC GCC CCT 150
Phe Glu Ala Leu Lys Lys Arg Gly His Val Met Arg Val Val Ala Pro
25 30 35 40

CAT GTG GAT AAT TTA GGG AGT GAA GAA GAG GGG TAT TAC AAC CTT AAA 198
His Val Asp Asn Leu Gly Ser Glu Glu Glu Gly Tyr Tyr Asn Leu Lys
45 50 55

GAG CGC TAC ATC CCC CTA GTT ACA GAA ATT TCA CAC AAA CAA CAC ATC 246
Glu Arg Tyr Ile Pro Leu Val Thr Glu Ile Ser His Lys Gln His Ile
60 65 70

CTT TTT GCT AAA CCC GAT GAA AAA ATC TTA AGA AAG GCT TTT AAG GGA 294
Leu Phe Ala Lys Pro Asp Glu Lys Ile Leu Arg Lys Ala Phe Lys Gly
75 80 85

GCG GAT ATG ATC CAT ACT TAT TTG CCT TTT TTG CTA GAA AAA ACA GCC 342
Ala Asp Met Ile His Thr Tyr Leu Pro Phe Leu Leu Glu Lys Thr Ala
90 95 100

GTA AAA ATC GCG CGA GAA ATG CAA GTG CCT TAT ATT GGC TCT TTC CAT 390
Val Lys Ile Ala Arg Glu Met Gln Val Pro Tyr Ile Gly Ser Phe His
105 110 115 120

TTA CAG CCA GAG CAT ATT TCT TAT AAC ATG AAA TTG GGG TGG TTT TCT 438
Leu Gln Pro Glu His Ile Ser Tyr Asn Met Lys Leu Gly Trp Phe Ser
125 130 135

TGG TTC AAC ATG ATG CTT TTT TCG TGG TTT AAA TCT TCG CAT TAC CGC 486
Trp Phe Asn Met Met Leu Phe Ser Trp Phe Lys Ser Ser His Tyr Arg
140 145 150

TAT ATC CAC CAT ATC CAT TGC CCG TCA AAA TTC ATT GTA GAA GAA TTA 534
Tyr Ile His His Ile His Cys Pro Ser Lys Phe Ile Val Glu Glu Leu
155 160 165

GAA AAA TAC AAC TAT GGA GGG AAA AAA TAC GCT ATT TCT AAC GGC TTT 582
Glu Lys Tyr Asn Tyr Gly Gly Lys Lys Tyr Ala Ile Ser Asn Gly Phe
170 175 180

GAT CCC ATG TTT AGA TTT GAA CAC CCG CAA AAA AGC CTT TTT GAC ACC 630
Asp Pro Met Phe Arg Phe Glu His Pro Gln Lys Ser Leu Phe Asp Thr
185 190 195 200

ACA CCC TTT AAA ATC GCT ATG GTA GGA CGC TAT TCT AAT GAA AAA AAT 678
Thr Pro Phe Lys Ile Ala Met Val Gly Arg Tyr Ser Asn Glu Lys Asn
205 210 215

CAA AGC GTT TTA ATC AAA GCG GTT GCT TTA AGC AAA TAC AAA CAA GAT 726
Gln Ser Val Leu Ile Lys Ala Val Ala Leu Ser Lys Tyr Lys Gln Asp
220 225 230

ATT GTA TTA TTG CTC AAA GGC AAA GGG CCT GAT GAG AAA AAA ATC AAA 774
35 40 45

Lys Asn Ala Thr Pro Leu Lys Asp Lys Val Val Ala Asp Val Gly Cys
50 55 60

Asn Asn Gly Tyr Tyr Leu Phe Lys Met Leu Glu His Gly Pro Lys Ser
65 70 75 80

Leu Val Gly Phe Asp Pro Gly Val Leu Val Lys Lys Gln Phe Glu Phe
85 90 95

Leu Ala Pro Phe Phe Asp Lys Glu Lys Lys Ile Ile Tyr Glu Ser Leu
100 105 110

Gly Val Glu Asp Leu His Glu Lys Tyr Pro Asn Ala Phe Asp Val Ile
115 120 125

Phe Cys Leu Gly Val Leu Tyr His Arg Lys Ser Pro Leu Glu Ala Leu
130 135 140

Lys Ala Leu Tyr His Ala Leu Lys Ile Lys Gly Glu Leu Val Leu Asp
145 150 155 160

Thr Leu Ile Ile Asp Ser Pro Leu Asp Ile Ala Leu Cys Pro Lys Lys
165 170 175

Thr Tyr Ala Lys Met Lys Asn Val Tyr Phe Ile Pro Ser Val Ser Ala
180 185 190

Leu Lys Gly Trp Cys Glu Arg Val Gly Phe Glu Asn Phe Glu Ile Leu
195 200 205

Ser Val Leu Lys Thr Thr Pro Lys Glu Gln Arg Lys Thr Asp Phe Ile
210 215 220

Leu Gly Gln Ser Leu Glu Asp Phe Leu Asp Lys Thr Asp Pro Ser Lys
225 230 235 240

Thr Leu Glu Gly Tyr Asp Ala Pro Leu Arg Gly Tyr Phe Lys Met Leu
245 250 255

Lys Pro Ser Lys Arg
260



(2) INFORMATION FOR SEQ ID NO:1307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...1197
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1307:

TTAAATTCAA TTTTAAAGAA GAGTAGTTAA ATG GTT ATT GTT TTA GTC GTG GAT 54
Met Val Ile Val Leu Val Val Asp
1 5

AGT TTT AAA GAC ACC AGT AAT GGC ACT TCT ATG ACA GCG TTT CGT TTT 102
Ser Phe Lys Asp Thr Ser Asn Gly Thr Ser Met Thr Ala Phe Arg Phe
CCT AAC GCT TTT GAT GTC ATT TTT TGC TTA GGG GTG CTA TAC CAC AGA 436
Pro Asn Ala Phe Asp Val Ile Phe Cys Leu Gly Val Leu Tyr His Arg
125 130 135

AAA AGC CCG CTA GAG GCT TTA AAA GCC TTG TAT CAC GCT TTG AAA ATA 484
Lys Ser Pro Leu Glu Ala Leu Lys Ala Leu Tyr His Ala Leu Lys Ile
140 145 150

AAA GGG GAG CTG GTG TTG GAT ACC TTA ATC ATT GAT TCG CCC TTA GAC 532
Lys Gly Glu Leu Val Leu Asp Thr Leu Ile Ile Asp Ser Pro Leu Asp
155 160 165

ATC GCC CTT TGC CCT AAA AAA ACT TAT GCT AAA ATG AAA AAT GTT TAT 580
Ile Ala Leu Cys Pro Lys Lys Thr Tyr Ala Lys Met Lys Asn Val Tyr
170 175 180 185

TTT ATC CCC AGT GTT AGC GCG TTA AAA GGG TGG TGC GAA AGG GTA GGG 628
Phe Ile Pro Ser Val Ser Ala Leu Lys Gly Trp Cys Glu Arg Val Gly
190 195 200

TTT GAA AAT TTT GAG ATT CTT AGC GTT TTA AAG ACC ACG CCT AAA GAA 676
Phe Glu Asn Phe Glu Ile Leu Ser Val Leu Lys Thr Thr Pro Lys Glu
205 210 215

CAG CGT AAA ACG GAT TTT ATT TTG GGG CAG AGT TTG GAA GAT TTT TTG 724
Gln Arg Lys Thr Asp Phe Ile Leu Gly Gln Ser Leu Glu Asp Phe Leu
220 225 230

GAT AAA ACA GAT CCC TCT AAA ACT TTA GAG GGG TAT GAC GCC CCT TTA 772
Asp Lys Thr Asp Pro Ser Lys Thr Leu Glu Gly Tyr Asp Ala Pro Leu
235 240 245

AGG GGG TAT TTT AAA ATG CTT AAA CCA AGC AAG CGT TAAATAAAGG ATTAAG 824
Arg Gly Tyr Phe Lys Met Leu Lys Pro Ser Lys Arg
250 255 260

ATAGTGCAAG 834

(2) INFORMATION FOR SEQ ID NO:1306:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 261 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1306:

Met Leu Ile Cys Asn Asp Lys Ser Asn Pro Lys Thr Leu Leu Glu Glu
1 5 10 15

Ile Met Ala Leu Arg Pro Trp Arg Lys Gly Pro Phe Glu Ile Ser Gln
20 25 30

Ile Lys Ile Asp Ser Glu Trp Asp Ser Ser Ile Lys Trp Asp Leu Val

485 490 495

Ile Thr Ser Lys Pro Pro Gly Thr Ile Glu Trp Glu
500 505

(2) INFORMATION FOR SEQ ID NO:1305:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 834 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 26...808
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1305:

AAAAACCCCC ACGCTATGGT AAATC ATG CTC ATT TGT AAC GAT AAA TCC AAT 52
Met Leu Ile Cys Asn Asp Lys Ser Asn
1 5

CCA AAA ACC CTT TTA GAA GAA ATC ATG GCG TTA AGG CCA TGG CGT AAA 100
Pro Lys Thr Leu Leu Glu Glu Ile Met Ala Leu Arg Pro Trp Arg Lys
10 15 20 25

GGC CCT TTT GAA ATT TCT CAA ATC AAG ATT GAT AGC GAA TGG GAT AGC 148
Gly Pro Phe Glu Ile Ser Gln Ile Lys Ile Asp Ser Glu Trp Asp Ser
30 35 40

TCC ATT AAA TGG GAT CTA GTT AAA AAC GCC ACT CCT TTA AAA GAT AAG 196
Ser Ile Lys Trp Asp Leu Val Lys Asn Ala Thr Pro Leu Lys Asp Lys
45 50 55

GTT GTG GCT GAT GTG GGT TGC AAT AAC GGC TAT TAC TTG TTT AAA ATG 244
Val Val Ala Asp Val Gly Cys Asn Asn Gly Tyr Tyr Leu Phe Lys Met
60 65 70

CTA GAA CAT GGG CCT AAA AGT TTG GTG GGG TTT GAT CCG GGC GTT TTA 292
Leu Glu His Gly Pro Lys Ser Leu Val Gly Phe Asp Pro Gly Val Leu
75 80 85

GTC AAA AAA CAA TTT GAA TTT TTA GCC CCC TTT TTT GAT AAA GAA AAA 340
Val Lys Lys Gln Phe Glu Phe Leu Ala Pro Phe Phe Asp Lys Glu Lys
90 95 100 105

AAA ATC ATT TAT GAG TCT TTG GGG GTA GAG GAT TTG CAT GAA AAA TAC 388
Lys Ile Ile Tyr Glu Ser Leu Gly Val Glu Asp Leu His Glu Lys Tyr
110 115 120






WO 98/43478



PCT/US98/06371 



Ser Gly Gly Pro Ala Ser Val Tyr Ala Lys Asp Ala Tyr Lys Pro Ser
50 55 60

Gly Lys Ile Phe Asp Leu Asn Val Pro Ile Leu Gly Ile Cys Tyr Gly
65 70 75 80

Met Gln Tyr Leu Val Asp Phe Phe Gly Gly Val Val Val Gly Ala Asn
85 90 95

Glu Gln Glu Phe Gly Lys Ala Val Leu Glu Ile Thr Gln Asn Ser Val
100 105 110

Ile Phe Glu Gly Val Lys Ile Lys Ser Leu Val Trp Met Ser His Met
115 120 125

Asp Lys Val Ile Glu Leu Pro Lys Gly Phe Thr Thr Leu Ala Lys Ser
130 135 140

Pro Asn Ser Pro His Cys Ala Ile Glu Asn Gly Lys Ile Phe Gly Leu
145 150 155 160

Gln Phe His Pro Glu Val Val Gln Ser Glu Glu Gly Gly Lys Ile Leu
165 170 175

Glu Asn Phe Ala Leu Leu Val Cys Gly Cys Glu Lys Thr Trp Gly Met
180 185 190

Gln His Phe Ala Gln Arg Glu Ile Ala Arg Leu Lys Glu Lys Ile Ala
195 200 205

Asn Ala Lys Val Leu Cys Ala Val Ser Gly Gly Val Asp Ser Thr Val
210 215 220

Val Ala Thr Leu Leu His Arg Ala Ile Lys Asp Asn Leu Ile Ala Val
225 230 235 240

Phe Val Asp His Gly Leu Leu Arg Lys Asn Glu Lys Glu Arg Val Gln
245 250 255

Ala Met Phe Lys Asp Leu Lys Ile Pro Leu Asn Thr Ile Asp Ala Lys
260 265 270

Glu Val Phe Leu Ser Lys Leu Lys Gly Val Ser Glu Pro Glu Leu Lys
275 280 285

Arg Lys Ile Ile Gly Glu Thr Phe Ile Glu Val Phe Glu Lys Glu Ala
290 295 300

Lys Lys His His Leu Lys Gly Lys Ile Glu Phe Leu Ala Gln Gly Thr
305 310 315 320

Leu Tyr Pro Asp Val Ile Glu Ser Val Ser Val Lys Gly Pro Ser Lys
325 330 335

Val Ile Lys Thr His His Asn Val Gly Gly Leu Pro Glu Trp Met Asp
340 345 350

Phe Lys Leu Ile Glu Pro Leu Arg Glu Leu Phe Lys Asp Glu Val Arg
355 360 365

Leu Leu Gly Lys Glu Leu Gly Val Ser Gln Asp Phe Leu Met Arg His
370 375 380

Pro Phe Pro Gly Pro Gly Leu Ala Val Arg Ile Leu Gly Glu Ile Ser
385 390 395 400

Glu Ser Lys Ile Lys Arg Leu Gln Glu Ala Asp Phe Ile Phe Ile Glu
405 410 415

Glu Leu Lys Lys Ala Asn Leu Tyr Asp Lys Val Trp Gln Ala Phe Cys
420 425 430

Val Leu Leu Asn Val Asn Ser Val Gly Val Met Gly Asp Asn Arg Thr
435 440 445

Tyr Glu Asn Ala Ile Cys Leu Arg Ala Val Asn Ala Ser Asp Gly Met
450 455 460

Thr Ala Ser Phe Ser Phe Leu Glu His Ser Phe Leu Glu Lys Val Ser
465 470 475 480

Asn Arg Ile Thr Asn Glu Val Ser Gly Ile Asn Arg Val Val Tyr Asp

CAG GAT TTT TTA ATG CGC CAC CCT TTT CCA GGG CCT GGG CTT GCT GTA 1203
Gln Asp Phe Leu Met Arg His Pro Phe Pro Gly Pro Gly Leu Ala Val
380 385 390

AGG ATT TTA GGC GAA ATC AGT GAG AGT AAG ATC AAA CGC TTG CAA GAA 1251
Arg Ile Leu Gly Glu Ile Ser Glu Ser Lys Ile Lys Arg Leu Gln Glu
395 400 405

GCG GAT TTT ATT TTT ATA GAG GAA CTT AAA AAA GCC AAT TTG TAT GAC 1299
Ala Asp Phe Ile Phe Ile Glu Glu Leu Lys Lys Ala Asn Leu Tyr Asp
410 415 420 425

AAG GTT TGG CAA GCT TTT TGC GTG CTG TTG AAT GTC AAT TCT GTG GGG 1347
Lys Val Trp Gln Ala Phe Cys Val Leu Leu Asn Val Asn Ser Val Gly
430 435 440

GTT ATG GGG GAT AAC CGC ACT TAT GAA AAC GCT ATT TGC TTA AGA GCG 1395
Val Met Gly Asp Asn Arg Thr Tyr Glu Asn Ala Ile Cys Leu Arg Ala
445 450 455

GTA AAT GCG AGC GAT GGC ATG ACG GCG AGC TTT TCA TTT TTA GAG CAT 1443
Val Asn Ala Ser Asp Gly Met Thr Ala Ser Phe Ser Phe Leu Glu His
460 465 470

TCT TTT TTA GAA AAG GTT TCT AAC CGT ATC ACT AAT GAA GTG AGC GGT 1491
Ser Phe Leu Glu Lys Val Ser Asn Arg Ile Thr Asn Glu Val Ser Gly
475 480 485

ATC AAT AGG GTG GTG TAT GAC ATT ACC TCT AAA CCA CCA GGA ACG ATT 1539
Ile Asn Arg Val Val Tyr Asp Ile Thr Ser Lys Pro Pro Gly Thr Ile
490 495 500 505

GAA TGG GAA TGATTATCTT AAAAAATAGC ACTA 1572
Glu Trp Glu



(2) INFORMATION FOR SEQ ID NO:1304:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 508 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1304:

Met Ile Leu Val Leu Asp Phe Gly Ser Gln Tyr Thr Gln Leu Ile Ala
1 5 10 15

Arg Arg Leu Arg Glu Arg Gly Ile Tyr Thr Glu Ile Val Pro Phe Phe
20 25 30

Glu Ser Ile Glu Asn Ile Gln Lys Lys Ala Pro Lys Gly Leu Ile Leu
35 40 45

AAC GGC AAG ATT TTT GGC TTG CAA TTC CAC CCA GAA GTC GTT CAA AGC 531
Asn Gly Lys Ile Phe Gly Leu Gln Phe His Pro Glu Val Val Gln Ser
155 160 165

GAA GAA GGG GGT AAG ATT TTA GAA AAT TTT GCC CTT TTA GTT TGC GGC 579
Glu Glu Gly Gly Lys Ile Leu Glu Asn Phe Ala Leu Leu Val Cys Gly
170 175 180 185

TGT GAA AAA ACT TGG GGG ATG CAG CAT TTC GCT CAA AGA GAA ATC GCA 627
Cys Glu Lys Thr Trp Gly Met Gln His Phe Ala Gln Arg Glu Ile Ala
190 195 200

CGA TTG AAA GAA AAA ATC GCT AAC GCT AAG GTT TTG TGC GCG GTG AGT 675
Arg Leu Lys Glu Lys Ile Ala Asn Ala Lys Val Leu Cys Ala Val Ser
205 210 215

GGG GGC GTG GAT TCT ACG GTG GTC GCT ACG CTG TTG CAC AGA GCC ATT 723
Gly Gly Val Asp Ser Thr Val Val Ala Thr Leu Leu His Arg Ala Ile
220 225 230

AAG GAT AAT TTG ATC GCT GTT TTT GTG GAT CAT GGC TTG TTG CGT AAA 771
Lys Asp Asn Leu Ile Ala Val Phe Val Asp His Gly Leu Leu Arg Lys
235 240 245

AAT GAA AAA GAA AGG GTG CAA GCG ATG TTT AAG GAC TTG AAA ATC CCT 819
Asn Glu Lys Glu Arg Val Gln Ala Met Phe Lys Asp Leu Lys Ile Pro
250 255 260 265

TTA AAC ACG ATA GAC GCT AAA GAA GTC TTT TTG TCT AAA TTA AAG GGC 867
Leu Asn Thr Ile Asp Ala Lys Glu Val Phe Leu Ser Lys Leu Lys Gly
270 275 280

GTG AGC GAG CCT GAA TTG AAG CGA AAA ATC ATC GGC GAG ACC TTT ATT 915
Val Ser Glu Pro Glu Leu Lys Arg Lys Ile Ile Gly Glu Thr Phe Ile
285 290 295

GAA GTG TTT GAA AAA GAA GCC AAA AAG CAC CAT TTA AAA GGC AAA ATT 963
Glu Val Phe Glu Lys Glu Ala Lys Lys His His Leu Lys Gly Lys Ile
300 305 310

GAA TTT TTA GCC CAA GGC ACT TTA TAC CCT GAT GTG ATT GAA TCC GTG 1011
Glu Phe Leu Ala Gln Gly Thr Leu Tyr Pro Asp Val Ile Glu Ser Val
315 320 325

AGC GTT AAA GGG CCT TCA AAA GTG ATC AAA ACC CAT CAT AAT GTG GGC 1059
Ser Val Lys Gly Pro Ser Lys Val Ile Lys Thr His His Asn Val Gly
330 335 340 345

GGA CTG CCT GAA TGG ATG GAT TTT AAA CTC ATA GAG CCT TTA AGG GAG 1107
Gly Leu Pro Glu Trp Met Asp Phe Lys Leu Ile Glu Pro Leu Arg Glu
350 355 360

TTG TTT AAA GAT GAG GTG CGC TTA CTG GGT AAA GAA TTG GGC GTT AGT 1155
Leu Phe Lys Asp Glu Val Arg Leu Leu Gly Lys Glu Leu Gly Val Ser
365 370 375

(A) LENGTH: 1572 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25...1548
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1303:

TCTCACAAAA ATAAAGAATG AGCG ATG ATT TTA GTA TTA GAT TTT GGG AGT 51
Met Ile Leu Val Leu Asp Phe Gly Ser
1 5

CAA TAC ACA CAG CTG ATT GCT AGA AGA TTG AGA GAG AGA GGG ATT TAT 99
Gln Tyr Thr Gln Leu Ile Ala Arg Arg Leu Arg Glu Arg Gly Ile Tyr
10 15 20 25

ACA GAA ATA GTC CCT TTT TTT GAA AGC ATA GAA AAC ATT CAA AAA AAA 147
Thr Glu Ile Val Pro Phe Phe Glu Ser Ile Glu Asn Ile Gln Lys Lys
30 35 40

GCC CCC AAA GGT TTG ATT TTG AGT GGG GGG CCA GCG AGC GTG TAT GCT 195
Ala Pro Lys Gly Leu Ile Leu Ser Gly Gly Pro Ala Ser Val Tyr Ala
45 50 55

AAA GAC GCT TAC AAG CCT AGT GGG AAA ATC TTT GAT TTG AAT GTG CCG 243
Lys Asp Ala Tyr Lys Pro Ser Gly Lys Ile Phe Asp Leu Asn Val Pro
60 65 70

ATT TTA GGG ATT TGC TAC GGC ATG CAG TAT TTG GTG GAT TTT TTT GGG 291
Ile Leu Gly Ile Cys Tyr Gly Met Gln Tyr Leu Val Asp Phe Phe Gly
75 80 85

GGG GTA GTG GTT GGT GCG AAT GAG CAA GAA TTT GGT AAG GCT GTT TTA 339
Gly Val Val Val Gly Ala Asn Glu Gln Glu Phe Gly Lys Ala Val Leu
90 95 100 105

GAA ATC ACT CAA AAT TCT GTG ATT TTT GAA GGC GTG AAG ATT AAA AGC 387
Glu Ile Thr Gln Asn Ser Val Ile Phe Glu Gly Val Lys Ile Lys Ser
110 115 120

CTT GTG TGG ATG AGC CAT ATG GAT AAA GTC ATA GAA CTG CCT AAA GGC 435
Leu Val Trp Met Ser His Met Asp Lys Val Ile Glu Leu Pro Lys Gly
125 130 135

TTT ACT ACC CTT GCA AAA AGC CCT AAT TCC CCC CAT TGC GCG ATT GAA 483
Phe Thr Thr Leu Ala Lys Ser Pro Asn Ser Pro His Cys Ala Ile Glu
140 145 150

Leu Tyr Ala Phe Ser Leu Ile Asp Ser Gln Tyr Cys Ser Lys Gly Gly
110 115 120

AAT TAC AGA TAC GAA CTA GAA AAA TTA GAA CGC TGG TTT GTG CAA AAA 437
Asn Tyr Arg Tyr Glu Leu Glu Lys Leu Glu Arg Trp Phe Val Gln Lys
125 130 135

GCA CCT GAG TTG GCT GAA AGC TAT AGG GTG AAT TAC AAA AAT CAA TAC 485
Ala Pro Glu Leu Ala Glu Ser Tyr Arg Val Asn Tyr Lys Asn Gln Tyr
140 145 150

AAT AAA ACA CAG ATC TCA CAA AAA TAAAGAATGA GCGATGATTT TAGTATTAGA 539
Asn Lys Thr Gln Ile Ser Gln Lys
155 160

T 540

(2) INFORMATION FOR SEQ ID NO:1302:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 162 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1302:



Met Asn Ile Phe Gln Thr Ser Leu Lys Cys Cys Val Gly Leu Val Leu
1 5 10 15

Ser Val Gly Val Leu Leu Gly Asp Ser Lys Ala Phe Lys Val Arg Val
20 25 30

Asp Lys Ser Leu Thr Pro Pro Phe Leu Asn Val Leu Ser Leu Ala Phe
35 40 45

Lys Gln Asp Met Lys Lys Glu Val Ile Phe Val Ile Thr Lys Ser Asn
50 55 60

Lys Leu Ser Lys Lys Val Leu Cys Asp Phe Asp Ala Phe Leu Leu Pro
65 70 75 80

Glu Thr Leu Met Ser Gly Met Pro Lys Lys Ala Leu Phe His Lys Glu
85 90 95

Phe Leu Phe Gln Ser Lys Glu Asn Lys Thr Leu Tyr Ala Phe Ser Leu
100 105 110

Ile Asp Ser Gln Tyr Cys Ser Lys Gly Gly Asn Tyr Arg Tyr Glu Leu
115 120 125

Glu Lys Leu Glu Arg Trp Phe Val Gln Lys Ala Pro Glu Leu Ala Glu
130 135 140

Ser Tyr Arg Val Asn Tyr Lys Asn Gln Tyr Asn Lys Thr Gln Ile Ser
145 150 155 160

Gln Lys



(2) INFORMATION FOR SEQ ID NO:1303:
(i) SEQUENCE CHARACTERISTICS:

Leu Leu Ile Leu Ser Leu Leu Pro Leu Ala Leu Trp Lys Glu Met Ile
485 490 495

Leu Leu Leu Glu Gly Val Ile Leu Arg Met Trp Phe Met Pro Cys Val
500 505 510

Arg Trp

(2) INFORMATION FOR SEQ ID NO:1301:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 540 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 24...509
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1301:

ATTGACGCTT ATAAGGATAA AAG ATG AAT ATT TTT CAA ACG AGT TTG AAA TGT 53
Met Asn Ile Phe Gln Thr Ser Leu Lys Cys
1 5 10

TGC GTG GGG TTG GTT TTG TCT GTG GGG GTC TTA TTA GGG GAT TCT AAA 101
Cys Val Gly Leu Val Leu Ser Val Gly Val Leu Leu Gly Asp Ser Lys
15 20 25

GCT TTT AAG GTT AGG GTG GAT AAA AGT TTA ACC CCG CCT TTT TTG AAT 149
Ala Phe Lys Val Arg Val Asp Lys Ser Leu Thr Pro Pro Phe Leu Asn
30 35 40

GTG CTT TCA TTA GCT TTT AAA CAA GAC ATG AAA AAA GAG GTC ATT TTT 197
Val Leu Ser Leu Ala Phe Lys Gln Asp Met Lys Lys Glu Val Ile Phe
45 50 55

GTG ATT ACC AAA AGC AAT AAG TTG AGT AAA AAA GTG CTT TGT GAT TTT 245
Val Ile Thr Lys Ser Asn Lys Leu Ser Lys Lys Val Leu Cys Asp Phe
60 65 70

GAC GCT TTT TTA TTG CCT GAG ACT CTG ATG AGC GGC ATG CCT AAA AAA 293
Asp Ala Phe Leu Leu Pro Glu Thr Leu Met Ser Gly Met Pro Lys Lys
75 80 85 90

GCA CTA TTC CAT AAA GAG TTT TTA TTC CAA TCT AAA GAA AAT AAA ACG 341
Ala Leu Phe His Lys Glu Phe Leu Phe Gln Ser Lys Glu Asn Lys Thr
95 100 105

CTC TAT GCG TTT TCG CTG ATT GAT TCT CAA TAT TGC TCA AAA GGT GGA 389

35 40 45



Asn Gly Val Ile Lys Asp Ile Val Pro Gln Lys Ser Asp Tyr Asn Pro
50 55 60

Thr Met Met Leu Lys Ala Met Val Asp Arg Val Tyr Ser Asp Ser Arg
65 70 75 80

Val Lys Tyr Pro Cys Val Arg Lys Ser Phe Leu Glu Asn Lys Lys Asn
85 90 95

His Lys Glu Leu Arg Gly Arg Glu Glu Phe Val Arg Val Ser Trp Asp
100 105 110

Val Ala Leu Asp Leu Ala Ala Lys Lys Leu Lys Glu Ile Pro Lys Glu
115 120 125

Asn Ile Tyr Asn Ala Ser Tyr Gly Gly Trp Gly His Ala Gly Ser Leu
130 135 140

His Arg Cys His His Leu Ala Trp Arg Phe Phe Asn Thr Thr Leu Gly
145 150 155 160

Gly Ala Ile Gly Thr Asp Gly Glu Tyr Ser Asn Gly Ala Ala Ala Arg
165 170 175

Ile Asn Pro Met Ile Val Gly Asp Met Glu Val Tyr Ser Gln Gln Thr
180 185 190

Thr His Glu Glu Met Ile Lys Asn Cys Lys Val Tyr Val Met Trp Gly
195 200 205

Ala Asp Leu Leu Lys Cys Asn Arg Ile Asp Tyr Phe Val Pro Asn His
210 215 220

Val Asn Asp Ser Tyr Tyr Pro Lys Tyr Lys Arg Ala Gly Ile Lys Phe
225 230 235 240

Ile Ser Ile Asp Pro Ile Tyr Thr Glu Thr Ala Gln Ala Phe Ser Ala
245 250 255

Glu Trp Ile Pro Ile Arg Pro Asn Thr Asp Val Ala Leu Met Leu Gly
260 265 270

Met Met His Tyr Leu Tyr Thr Ser Asn Gln Tyr Asp Lys Ala Phe Ile
275 280 285

Ala Lys Tyr Thr Asp Gly Phe Asp Lys Phe Leu Pro Tyr Leu Leu Gly
290 295 300

Glu Ser Asp Asn Ala Pro Lys Thr Leu Glu Trp Ala Ser Gln Ile Thr
305 310 315 320

Gly Val Ser Ala Glu Lys Ile Lys Glu Leu Ala Asp Leu Phe Val Ser
325 330 335

Lys Arg Thr Phe Leu Ala Gly Asn Trp Ala Met Gln Arg Ala Gln Tyr
340 345 350

Gly Glu Gln Pro Asp Trp Ala Leu Ile Val Leu Ala Ser Met Ile Gly
355 360 365

Gln Val Gly Leu Ser Gly Gly Gly Phe Gly Phe Ser Met His Tyr Gly
370 375 380

Gly Asn Ala Gln Ala Ser Ser Gly Ala Arg Ile Val Pro Met Ile Ser
385 390 395 400

Gln Gly His Asn Ser Val Lys Ser Val Ile Pro Ala Ser Arg Val Ser
405 410 415

Glu Ala Ile Leu Asn Pro Asp Lys Glu Ile Asp Phe Met Gly Lys Lys
420 425 430

Leu Lys Leu Pro Lys Ile Lys Met Ile Tyr Asn Cys Gly Ala Asp Leu
435 440 445

Leu Gly His Glu Thr Asp Thr Asn Glu Leu Ile Arg Ala Leu Arg Thr
450 455 460

Leu Asp Cys Val Ile Val His Glu Pro Trp Trp Arg Leu Arg Gln Asn
465 470 475 480

TCG GGT GGG GGC TTT GGC TTT TCT ATG CAT TAT GGA GGG AAC GCT CAA 1206
Ser Gly Gly Gly Phe Gly Phe Ser Met His Tyr Gly Gly Asn Ala Gln
375 380 385

GCA AGC TCA GGG GCA AGA ATT GTT CCT ATG ATT TCA CAA GGG CAT AAT 1254
Ala Ser Ser Gly Ala Arg Ile Val Pro Met Ile Ser Gln Gly His Asn
390 395 400

TCT GTA AAA AGC GTT ATT CCA GCA TCT AGG GTT TCT GAA GCG ATT TTA 1302
Ser Val Lys Ser Val Ile Pro Ala Ser Arg Val Ser Glu Ala Ile Leu
405 410 415 420

AAT CCG GAT AAA GAA ATT GAT TTT ATG GGC AAA AAA CTC AAA TTG CCT 1350
Asn Pro Asp Lys Glu Ile Asp Phe Met Gly Lys Lys Leu Lys Leu Pro
425 430 435

AAA ATC AAA ATG ATT TAT AAT TGT GGG GCG GAT TTA TTA GGG CAT GAA 1398
Lys Ile Lys Met Ile Tyr Asn Cys Gly Ala Asp Leu Leu Gly His Glu
440 445 450

ACT GAT ACA AAC GAG CTG ATT CGC GCT TTA AGG ACC TTA GAT TGC GTG 1446
Thr Asp Thr Asn Glu Leu Ile Arg Ala Leu Arg Thr Leu Asp Cys Val
455 460 465

ATC GTG CAT GAG CCT TGG TGG CGC CTA CGG CAA AAT TTG CTG ATA TTG 1494
Ile Val His Glu Pro Trp Trp Arg Leu Arg Gln Asn Leu Leu Ile Leu
470 475 480

TCT TTG CTT CCA CTA GCA CTG TGG AAA GAG ATG ATA TTG CTT TTG GAG 1542
Ser Leu Leu Pro Leu Ala Leu Trp Lys Glu Met Ile Leu Leu Leu Glu
485 490 495 500

GGA GTT ATT CTA AGA ATG TGG TTT ATG CCA TGC GTA AGG TGG TAGAGCCTG 1593
Gly Val Ile Leu Arg Met Trp Phe Met Pro Cys Val Arg Trp
505 510

TTTATGAATC TAAAGACGAT TATGAGATTT TCAGACAGCT TGC 1636

(2) INFORMATION FOR SEQ ID NO:1300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 514 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1300:

Met Ser Ile Ser Arg Arg Ser Ile Leu Thr Lys Ile Pro Ile Ala Leu
1 5 10 15

Ala Ser Ala Asn Val Leu Lys Ala Val Gly Val Phe Glu Lys Val Glu
20 25 30

Ser Ile Pro His Ala Thr His Phe Gly Pro Phe Ile Ala Lys Val Gln

His Leu Ala Trp Arg Phe Phe Asn Thr Thr Leu Gly Gly Ala Ile Gly
150 155 160

ACT GAT GGG GAA TAT AGT AAT GGC GCG GCC GCA AGA ATA AAC CCT ATG 582
Thr Asp Gly Glu Tyr Ser Asn Gly Ala Ala Ala Arg Ile Asn Pro Met
165 170 175 180

ATT GTA GGG GAT ATG GAA GTT TAT TCG CAA CAA ACC ACG CAT GAA GAG 630
Ile Val Gly Asp Met Glu Val Tyr Ser Gln Gln Thr Thr His Glu Glu
185 190 195

ATG ATT AAA AAT TGT AAG GTG TAT GTC ATG TGG GGG GCG GAT TTA CTC 678
Met Ile Lys Asn Cys Lys Val Tyr Val Met Trp Gly Ala Asp Leu Leu
200 205 210

AAG TGC AAC CGC ATT GAT TAT TTT GTG CCA AAC CAT GTC AAT GAC AGC 726
Lys Cys Asn Arg Ile Asp Tyr Phe Val Pro Asn His Val Asn Asp Ser
215 220 225

TAC TAC CCC AAG TAT AAA AGA GCT GGT ATT AAA TTC ATT AGT ATC GAT 774
Tyr Tyr Pro Lys Tyr Lys Arg Ala Gly Ile Lys Phe Ile Ser Ile Asp
230 235 240

CCC ATT TAT ACC GAA ACC GCT CAA GCC TTT AGT GCT GAA TGG ATA CCC 822
Pro Ile Tyr Thr Glu Thr Ala Gln Ala Phe Ser Ala Glu Trp Ile Pro
245 250 255 260

ATT CGC CCT AAC ACT GAT GTA GCG TTA ATG CTA GGC ATG ATG CAT TAT 870
Ile Arg Pro Asn Thr Asp Val Ala Leu Met Leu Gly Met Met His Tyr
265 270 275

CTT TAT ACG AGC AAT CAA TAT GAT AAA GCG TTT ATC GCT AAA TAC ACT 918
Leu Tyr Thr Ser Asn Gln Tyr Asp Lys Ala Phe Ile Ala Lys Tyr Thr
280 285 290

GAT GGT TTT GAT AAA TTT TTA CCC TAT TTG CTA GGA GAG AGC GAT AAT 966
Asp Gly Phe Asp Lys Phe Leu Pro Tyr Leu Leu Gly Glu Ser Asp Asn
295 300 305

GCG CCT AAG ACT TTA GAA TGG GCG TCT CAA ATC ACT GGA GTG AGC GCA 1014
Ala Pro Lys Thr Leu Glu Trp Ala Ser Gln Ile Thr Gly Val Ser Ala
310 315 320

GAA AAA ATC AAA GAA TTA GCG GAT TTG TTT GTT TCT AAA CGC ACT TTT 1062
Glu Lys Ile Lys Glu Leu Ala Asp Leu Phe Val Ser Lys Arg Thr Phe
325 330 335 340

TTA GCG GGT AAT TGG GCC ATG CAA AGA GCT CAG TAT GGC GAG CAA CCG 1110
Leu Ala Gly Asn Trp Ala Met Gln Arg Ala Gln Tyr Gly Glu Gln Pro
345 350 355

GAT TGG GCG TTA ATT GTT TTA GCT AGC ATG ATT GGT CAA GTG GGC TTA 1158
Asp Trp Ala Leu Ile Val Leu Ala Ser Met Ile Gly Gln Val Gly Leu
360 365 370

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 43...1584
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1299:

AATTTGGGGT GTTTTAACAG AATGCAAGCT TGAAGGAGAA TC ATG TCC ATT TCA 54
Met Ser Ile Ser
1

CGC AGA AGT ATC CTA ACA AAA ATC CCA ATC GCG CTC GCT AGC GCT AAT 102
Arg Arg Ser Ile Leu Thr Lys Ile Pro Ile Ala Leu Ala Ser Ala Asn
5 10 15 20

GTT TTG AAA GCT GTT GGT GTT TTT GAA AAA GTA GAA TCC ATT CCG CAT 150
Val Leu Lys Ala Val Gly Val Phe Glu Lys Val Glu Ser Ile Pro His
25 30 35

GCA ACG CAT TTT GGC CCC TTT ATC GCA AAG GTT CAA AAT GGA GTG ATT 198
Ala Thr His Phe Gly Pro Phe Ile Ala Lys Val Gln Asn Gly Val Ile
40 45 50

AAA GAT ATT GTC CCC CAA AAA AGC GAT TAT AAC CCT ACT ATG ATG TTA 246
Lys Asp Ile Val Pro Gln Lys Ser Asp Tyr Asn Pro Thr Met Met Leu
55 60 65

AAA GCG ATG GTT GAT AGG GTG TAT TCA GAT AGT AGG GTG AAG TAT CCT 294
Lys Ala Met Val Asp Arg Val Tyr Ser Asp Ser Arg Val Lys Tyr Pro
70 75 80

TGC GTG CGC AAG AGC TTC TTA GAA AAC AAA AAA AAC CAC AAA GAA TTG 342
Cys Val Arg Lys Ser Phe Leu Glu Asn Lys Lys Asn His Lys Glu Leu
85 90 95 100

CGC GGG AGA GAA GAG TTT GTG CGT GTG AGT TGG GAT GTG GCG TTG GAT 390
Arg Gly Arg Glu Glu Phe Val Arg Val Ser Trp Asp Val Ala Leu Asp
105 110 115

TTA GCG GCT AAA AAG CTT AAA GAA ATC CCT AAA GAA AAC ATT TAT AAT 438
Leu Ala Ala Lys Lys Leu Lys Glu Ile Pro Lys Glu Asn Ile Tyr Asn
120 125 130

GCC AGT TAT GGT GGC TGG GGG CAT GCG GGC AGC TTG CAT CGT TGC CAT 486
Ala Ser Tyr Gly Gly Trp Gly His Ala Gly Ser Leu His Arg Cys His
135 140 145

CAT TTA GCA TGG CGT TTT TTT AAC ACG ACT TTA GGA GGG GCT ATT GGC 534

165 170 175

Ser Ile Asn Ile Ser Arg Lys Arg Phe Phe Glu Val Asn Asp Lys Arg
180 185 190

Gln Leu Glu Val Ser Lys Glu Leu Leu Glu Ala Thr Glu Pro Val Leu
195 200 205

Gly Val Val Arg Gln Ile Thr Pro Phe Gly Ile Phe Val Glu Ala Lys
210 215 220

Gly Ile Glu Gly Leu Val His Tyr Ser Glu Ile Ser His Lys Gly Pro
225 230 235 240

Val Asn Pro Glu Lys Tyr Tyr Lys Glu Gly Asp Glu Val Tyr Val Lys
245 250 255

Ala Ile Ala Tyr Asp Ala Glu Lys Arg Arg Leu Ser Leu Ser Ile Lys
260 265 270

Ala Thr Ile Glu Asp Pro Trp Glu Glu Ile Gln Asp Lys Leu Lys Pro
275 280 285

Gly Tyr Ala Ile Lys Val Val Val Ser Asn Ile Glu His Tyr Gly Val
290 295 300

Phe Val Asp Ile Gly Asn Asp Ile Glu Gly Phe Leu His Val Ser Glu
305 310 315 320

Ile Ser Trp Asp Lys Asn Val Ser His Pro Asn Asn Tyr Leu Ser Val
325 330 335

Gly Gln Glu Ile Asp Val Lys Ile Ile Asp Ile Asp Pro Lys Asn Arg
340 345 350

Arg Leu Arg Val Ser Leu Lys Gln Leu Thr Asn Arg Pro Phe Asp Val
355 360 365

Phe Glu Ser Lys His Gln Val Gly Asp Val Leu Glu Gly Lys Val Ala
370 375 380

Thr Leu Thr Asp Phe Gly Ala Phe Leu Asn Leu Gly Gly Val Asp Gly
385 390 395 400

Leu Leu His Asn His Asp Ala Phe Trp Asp Lys Asp Lys Lys Cys Lys
405 410 415

Asp His Tyr Lys Ile Gly Asp Val Ile Lys Val Lys Ile Leu Lys Ile
420 425 430

Asn Lys Lys Asp Lys Lys Ile Ser Leu Ser Ala Lys His Leu Val Thr
435 440 445

Ser Pro Thr Glu Glu Phe Ala Gln Lys His Lys Thr Asp Ser Val Ile
450 455 460

Gln Gly Lys Val Val Ser Ile Lys Asp Phe Gly Val Phe Ile Asn Ala
465 470 475 480

Asp Gly Ile Asp Val Leu Ile Lys Asn Glu Asp Leu Asn Pro Leu Lys
485 490 495

Lys Asp Glu Ile Lys Ile Gly Gln Glu Ile Thr Cys Val Val Val Ala
500 505 510

Ile Glu Lys Ser Asn Asn Lys Val Arg Ala Ser Val His Arg Leu Glu
515 520 525

Arg Lys Lys Glu Lys Glu Glu Leu Gln Ala Phe Asn Thr Ser Asp Asp
530 535 540

Lys Met Thr Leu Gly Asp Ile Leu Lys Glu Lys Leu
545 550 555



(2) INFORMATION FOR SEQ ID NO:1299:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1636 base pairs

(B) TYPE: nucleic acid

AAA AAT GAA GAT TTG AAC CCC TTG AAA AAA GAT GAA ATT AAA ATA GGC 1542
Lys Asn Glu Asp Leu Asn Pro Leu Lys Lys Asp Glu Ile Lys Ile Gly
490 495 500

CAA GAA ATC ACA TGC GTG GTG GTT GCG ATT GAA AAA TCT AAC AAC AAG 1590
Gln Glu Ile Thr Cys Val Val Val Ala Ile Glu Lys Ser Asn Asn Lys
505 510 515

GTG CGT GCT TCT GTG CAT AGG TTA GAG CGC AAA AAA GAA AAA GAA GAA 1638
Val Arg Ala Ser Val His Arg Leu Glu Arg Lys Lys Glu Lys Glu Glu
520 525 530 535

TTG CAA GCT TTT AAC ACG AGC GAT GAT AAA ATG ACT TTA GGG GAT ATT 1686
Leu Gln Ala Phe Asn Thr Ser Asp Asp Lys Met Thr Leu Gly Asp Ile
540 545 550

CTT AAA GAA AAA CTC TAAAGAGTGA TTTTAAAAGC ATGAGAATGG CATGAGATTT A 1742
Leu Lys Glu Lys Leu
555

AGGGTG 1748

(2) INFORMATION FOR SEQ ID NO:1298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 556 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1298:

Met Ser Lys lie Ala Asp Asp Gin Asn Phe Asn Asp Glu Glu Glu Asn 

15 10 15 

Phe Ala Lys Leu Phe Lys Lys Glu Leu Glu Lys Glu Glu Thr Leu Glu 

20 25 30 

Lys Gly Thr lie Lys Glu Gly Leu Val Val Ser lie Asn Glu Asn Asp 

35 40 45 

Gly Tyr Ala Met Val Ser Val Gly Gly Lys Thr Glu Gly Arg Leu Ala 

50 55 60 

Leu Asn Glu lie Thr Asp Glu Lys Gly Gin Leu Leu Tyr Gin Lys Asn 
65 70 75 80 

Asp Pro lie He Val His Val Ser Glu Lys Gly Glu His Pro Ser Val 

85 90 95 

Ser Tyr Lys Lys Ala He Ser Gin Gin Lys He Gin Ala Lys He Glu 

100 105 110 

Glu Leu Gly Glu Asn Tyr Glu Asn Ala He He Glu Gly Lys He Val 

115 120 125 

Gly Lys Asn Lys Gly Gly Tyr He Val Glu Ser Gin Gly Val Glu Tyr 

130 135 140 

Phe Leu Ser Arg Ser His Ser Ser Leu Lys Asn Asp Ala Asn His He 
145 150 155 160 

Gly Lys Arg Val Lys Ala Cys He He Arg Val Asp Lys Glu Asn His 
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AAA AGA CGC 
Lys Arg Arg 
265 

GAA GAG ATT 
Glu Glu lie 
280 

GTG AGC AAC 
Val Ser Asn 



ATT GAA GGC 
lie Glu Gly 



AGC CAC CCT 
Ser His Pro 
330 

ATC ATT GAG 
lie lie Asp 
345 

CAA CTC ACT 
Gin Leu Thr 
360 

GGG GAT GTT 
Gly Asp Val 



TTT TTA AAT 
Phe Leu Asn 



TTT TGG GAT 
Phe Trp Asp 
410 

GTG ATC AAA 
Val lie Lys 
425 

TCT TTG AGC 
Ser Leu Ser 
440 

CAA AAG CAT 
Gin Lys His 

AAG GAT TTT 
Lys Asp Phe 



CTT TCA CTC 
Leu Ser Leu 



CAA GAC AAG 
Gin Asp Lys 
285 

ATT GAA CAT 
lie Glu His 
300 

TTT TTG CAT 
Phe Leu His 
315 

AAC AAT TAC 
Asn Asn Tyr 



ATT GAT CCA 
lie Asp Pro 



AAC AGG CCT 
Asn Arg Pro 
365 

TTA GAA GGC 
Leu Glu Gly 
380 

CTG GGT GGG 
Leu Gly Gly 
395 

AAA GAT AAA 
Lys Asp Lys 

GTG AAA ATC 
Val Lys lie 



GCG AAG CAC 
Ala Lys His 
445 

AAA ACA GAC 
Lys Thr Asp 
460 

GGC GTT TTC 
Gly Val Phe 
475 



TCC ATA AAA 
Ser lie Lys 
270 

CTA AAA CCC 
Leu Lys Pro 

TAT GGG GTG 
Tyr Gly Val 



GTT TCT GAA 
Val Ser Glu 
320 

TTG AGC GTG 
Leu Ser Val 
335 

AAA AAT CGC 
Lys Asn Arg 
350 

TTT GAT GTT 
Phe Asp Val 



AAA GTG GCG 
Lys Val Ala 



GTG GAT GGT 
Val Asp Gly 
400 

AAA TGC AAA 
Lys Cys Lys 
415 

CTT AAA ATC 
Leu Lys lie 
430 

TTG GTG ACT 
Leu Val Thr 



AGC GTG ATT 
Ser Val lie 



ATT AAT GCT 
lie Asn Ala 
480 



GCG ACT ATA 
Ala Thr lie 
275 

GGA TAC GCC 
Gly Tyr Ala 
290 

TTT GTG GAT 

Phe Val Asp 
305 

ATC TCT TGG 

lie Ser Trp 



GGG CAA GAG 
Gly Gin Glu 



CGC TTA AGG 
Arg Leu Arg 
355 

TTT GAA TCT 
Phe Glu Ser 
370 

ACT TTA ACG 
Thr Leu Thr 
385 

TTG CTC CAC 
Leu Leu His 



GAC CAC TAT 
Asp His Tyr 

AAC AAA AAA 
Asn Lys Lys 
435 

TCC CCT ACA 
Ser Pro Thr 
450 

CAA GGC AAA 

Gin Gly Lys 
465 

GAT GGC ATT 

Asp Gly lie 



GAA GAC CCA 
Glu Asp Pro 



ATT AAG GTA 
lie Lys Val 



ATT GGT AAT 
lie Gly Asn 
310 

GAT AAA AAT 
Asp Lys Asn 
325 

ATT GAT GTG 
lie Asp Val 
340 

GTT TCT TTA 
Val Ser Leu 



AAA CAC CAA 
Lys His Gin 



GAT TTT GGG 
Asp Phe Gly 
390 

AAT CAC GAC 
Asn His Asp 
405 

AAA ATT GGC 
Lys lie Gly 
420 

GAT AAA AAG 
Asp Lys Lys 

GAA GAA TTC 
Glu Glu Phe 



GTG GTG AGT 
Val Val Ser 
470 

GAT GTG CTG 
Asp Val Leu 
485 



TGG 870 
Trp 



GTG 918 

Val 

295 

GAT 966 
Asp 



GTC 1014 
Val 



AAA 1062 
Lys 



AAG 1110 
Lys 



GTG 1158 

Val 

375 

GCG 1206 
Ala 



GCT 1254 
Ala 



GAT 13 0 2 

Asp 



ATT 1350 
lie 



GCT 1398 

Ala 

455 

ATT 144 6 

He 



ATC 14 94 

He 
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CTA GTC GTT TCC ATC AAT GAG AAT GAT GGT TAT GCC ATG GTG AGC GTG 198 
Leu Val Val Ser lie Asn Glu Asn Asp Gly Tyr Ala Met Val Ser Val 
40 45 50 55 

GGC GGT AAG ACA GAA GGC CGT TTG GCT TTG AAT GAG ATC ACC GAT GAA 24 6 

Gly Gly Lys Thr Glu Gly Arg Leu Ala Leu Asn Glu lie Thr Asp Glu 
60 65 70 

AAG GGG CAG TTG CTG TAT CAA AAA AAT GAG CCC ATT ATC GTG CAT GTG 2 94 

Lys Gly Gin Leu Leu Tyr Gin Lys Asn Asp Pro lie lie Val His Val 
75 80 85 

TCC GAA AAA GGT GAA CAC CCT AGC GTT TCC TAG AAA AAG GCC ATT TCC 34 2 

Ser Glu Lys Gly Glu His Pro Ser Val Ser Tyr Lys Lys Ala lie Ser 
90 95 " 100 

CAA CAA AAG ATT CAA GCT AAA ATT GAA GAA TTA GGC GAA AAC TAT GAA 3 90 

Gin Gin Lys lie Gin Ala Lys lie Glu Glu Leu Gly Glu Asn Tyr Glu 
105 110 115 

AAC GCC ATT ATT GAA GGC AAG ATT GTA GGC AAG AAT AAA GGG GGT TAT 43 8 

Asn Ala lie lie Glu Gly Lys lie Val Gly Lys Asn Lys Gly Gly Tyr 
120 125 130 135 

ATC GTG GAG TCT CAA GGC GTG GAG TAT TTC CTC TCC CGC TCG CAC TCT 4 86 

lie Val Glu Ser Gin Gly Val Glu Tyr Phe Leu Ser Arg Ser His Ser 
140 145 150 

TCT TTA AAG AAT GAC GCA AAC CAT ATC GGC AAA CGC GTT AAA GCG TGC 534 
Ser Leu Lys Asn Asp Ala Asn His lie Gly Lys Arg Val Lys Ala Cys 
155 160 165 

ATC ATT CGT GTG GAT AAG GAA AAC CAT TCT ATC AAT ATT TCT CGC AAA 5 82 

lie lie Arg Val Asp Lys Glu Asn His Ser lie Asn lie Ser Arg Lys 
170 175 180 

CGA TTC TTT GAA GTC AAT GAC AAA CGA CAA CTT GAG GTT TCT AAG GAA 630
Arg Phe Phe Glu Val Asn Asp Lys Arg Gln Leu Glu Val Ser Lys Glu
185 190 195 

TTG TTA GAA GCC ACA GAG CCG GTG TTA GGG GTT GTG CGC CAG ATC ACC 678
Leu Leu Glu Ala Thr Glu Pro Val Leu Gly Val Val Arg Gln Ile Thr
200 205 210 215 

CCT TTT GGC ATT TTT GTA GAA GCT AAG GGG ATT GAG GGC TTG GTC CAT 726
Pro Phe Gly Ile Phe Val Glu Ala Lys Gly Ile Glu Gly Leu Val His
220 225 230 

TAT TCT GAA ATC AGC CAT AAG GGA CCA GTC AAT CCT GAA AAA TAC TAC 774
Tyr Ser Glu Ile Ser His Lys Gly Pro Val Asn Pro Glu Lys Tyr Tyr
235 240 245 

AAA GAG GGC GAT GAA GTC TAT GTC AAA GCC ATC GCT TAT GAT GCA GAA 822 
Lys Glu Gly Asp Glu Val Tyr Val Lys Ala Ile Ala Tyr Asp Ala Glu
250 255 260 
Phe Ser Ser Gln Ile His Lys Gly Val Cys Gln Lys Ile Glu Leu Ser

340 345 350 

Leu Cys Gly Glu Ile Asn Gln Phe Lys Asp Ala Leu Val Ala Phe Met
355 360 365

Leu Val Gly Val Leu Lys Pro Val Val Gly Asp Lys Ile Asn Tyr Ile
370 375 380

Asn Ala Pro Phe Val Ala Lys Glu Arg Gly Ile Glu Ile Lys Val Ser
385 390 395 400

Leu Lys Glu Ser Ala Ser Pro Tyr Lys Asn Met Leu Ser Leu Thr Leu
405 410 415

Asn Ala Ala Asn Gly Thr Ile Ser Val Ser Gly Thr Val Phe Glu Glu
420 425 430

Asp Ile Leu Lys Leu Thr Glu Ile Asp Gly Phe His Ile Asp Ile Glu
435 440 445

Pro Lys Gly Lys Met Leu Leu Phe Arg Asn Thr Asp Ile Pro Gly Val
450 455 460

Ile Gly Ser Val Gly Asn Ala Phe Ala Arg His Gly Ile Asn Ile Ala
465 470 475 480

Asp Phe Arg Leu Gly Arg Asn Thr Gln Lys Glu Ala Leu Ala Leu Ile
485 490 495

Ile Val Asp Glu Glu Val Ser Leu Glu Val Leu Glu Glu Leu Lys Asn
500 505 510

Ile Pro Ala Cys Leu Ser Val His Tyr Val Val Ile
515 520

(2) INFORMATION FOR SEQ ID NO: 1297:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1748 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34...1701
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1297:

GGATTTTATA TTTATTTTAT AGTAAGGCAG TCA ATG AGC AAG ATA GCA GAT GAT 54 

Met Ser Lys Ile Ala Asp Asp
1 5 

CAG AAC TTT AAT GAC GAG GAG GAA AAC TTC GCA AAA CTC TTT AAA AAA 102 
Gln Asn Phe Asn Asp Glu Glu Glu Asn Phe Ala Lys Leu Phe Lys Lys
10 15 20 

GAA TTA GAA AAA GAA GAA ACC CTA GAA AAA GGC ACT ATC AAA GAA GGG 150 
Glu Leu Glu Lys Glu Glu Thr Leu Glu Lys Gly Thr Ile Lys Glu Gly
25 30 35 
(2) INFORMATION FOR SEQ ID NO: 1296:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1296:



Met Tyr Gln Val Ala Ile Cys Asp Pro Ile His Ala Lys Gly Ile Gln
1 5 10 15

Ile Leu Glu Ala Gln Lys Asp Ile Val Leu His Asp Tyr Ser Lys Cys
20 25 30

Pro Lys Lys Glu Leu Leu Glu Lys Leu Thr Pro Met Asp Ala Leu Ile
35 40 45

Thr Arg Ser Met Thr Pro Ile Thr Ser Asp Phe Leu Lys Pro Leu Thr
50 55 60

His Leu Lys Ser Ile Val Arg Ala Gly Val Gly Val Asp Asn Ile Asp
65 70 75 80

Leu Glu Ser Cys Ser Gln Lys Gly Ile Val Val Met Asn Ile Pro Thr
85 90 95

Ala Asn Thr Ile Ala Ala Val Glu Leu Thr Met Ala His Leu Ile Asn
100 105 110

Ala Val Arg Ser Phe Pro Cys Ala Asn Asp Gln Ile Lys His Gln Arg
115 120 125

Leu Trp Lys Arg Glu Asp Trp Tyr Gly Thr Glu Leu Lys Asn Lys Lys
130 135 140

Leu Gly Ile Ile Gly Phe Gly Asn Ile Gly Ser Arg Val Gly Ile Arg
145 150 155 160

Ala Lys Ala Phe Glu Met Glu Val Leu Ala Tyr Asp Pro Tyr Ile Pro
165 170 175

Ser Ser Lys Ala Thr Asp Leu Gly Val Ile Tyr Thr Lys Asn Phe Glu
180 185 190

Asp Ile Leu Gln Cys Asp Met Ile Thr Ile His Thr Pro Lys Asn Lys
195 200 205

Glu Thr Ile Asn Met Ile Gly Ala Lys Glu Ile Glu Arg Met Lys Lys
210 215 220

Gly Val Ile Leu Leu Asn Cys Ala Arg Gly Gly Leu Tyr Asn Glu Asp
225 230 235 240

Ala Leu Tyr Glu Ala Leu Glu Thr Lys Lys Val Arg Trp Leu Gly Ile
245 250 255

Asp Val Phe Ser Lys Glu Pro Gly Ile His Asn Lys Leu Leu Asp Leu
260 265 270

Pro Asn Val Tyr Ala Thr Pro His Ile Gly Ala Asn Thr Leu Glu Ser
275 280 285

Gln Glu Glu Ile Ser Lys Gln Ala Ala Gln Gly Val Met Glu Ser Leu
290 295 300

Arg Gly Ser Ser His Pro His Ala Leu Asn Leu Pro Met Gln Ala Phe
305 310 315 320

Asp Ala Ser Ala Lys Ala Tyr Leu Asn Leu Ala Gln Lys Leu Gly Tyr
325 330 335



wo 98/43478 



PCT/US98/0637I 



TTA CCC ATG CAA GCT TTT GAC GCG AGC GCA AAA GCC TAC TTG AAT TTA 1011
Leu Pro Met Gln Ala Phe Asp Ala Ser Ala Lys Ala Tyr Leu Asn Leu
315 320 325 330 

GCG CAA AAA TTG GGT TAT TTT TCC AGT CAA ATC CAT AAG GGC GTG TGC 1059
Ala Gln Lys Leu Gly Tyr Phe Ser Ser Gln Ile His Lys Gly Val Cys
335 340 345 

CAA AAA ATT GAG CTC AGT CTT TGT GGG GAG ATC AAC CAA TTT AAA GAC 1107
Gln Lys Ile Glu Leu Ser Leu Cys Gly Glu Ile Asn Gln Phe Lys Asp
350 355 360 

GCT CTT GTA GCC TTT ATG TTA GTG GGG GTG TTA AAA CCT GTT GTA GGG 1155
Ala Leu Val Ala Phe Met Leu Val Gly Val Leu Lys Pro Val Val Gly 
365 370 375 

GAT AAA ATC AAT TAC ATT AAC GCC CCC TTT GTG GCC AAA GAA AGA GGT 1203
Asp Lys Ile Asn Tyr Ile Asn Ala Pro Phe Val Ala Lys Glu Arg Gly
380 385 390 

ATT GAG ATT AAG GTT AGC CTT AAA GAA AGC GCT TCG CCC TAT AAG AAC 1251
Ile Glu Ile Lys Val Ser Leu Lys Glu Ser Ala Ser Pro Tyr Lys Asn
395 400 405 410

ATG CTC TCT TTA ACC CTC AAT GCG GCT AAT GGC ACA ATC AGC GTG AGC 1299
Met Leu Ser Leu Thr Leu Asn Ala Ala Asn Gly Thr Ile Ser Val Ser
415 420 425

GGC ACG GTG TTT GAA GAA GAT ATT TTA AAA CTC ACT GAG ATT GAT GGG 1347
Gly Thr Val Phe Glu Glu Asp Ile Leu Lys Leu Thr Glu Ile Asp Gly
430 435 440

TTT CAT ATT GAT ATA GAG CCA AAG GGT AAA ATG CTT TTA TTC AGG AAT 1395
Phe His Ile Asp Ile Glu Pro Lys Gly Lys Met Leu Leu Phe Arg Asn
445 450 455

ACG GAT ATT CCA GGC GTT ATT GGG AGT GTG GGG AAT GCG TTC GCT AGG 1443
Thr Asp Ile Pro Gly Val Ile Gly Ser Val Gly Asn Ala Phe Ala Arg
460 465 470

CAT GGC ATT AAC ATC GCT GAT TTT CGT TTG GGG CGT AAC ACG CAA AAA 1491
His Gly Ile Asn Ile Ala Asp Phe Arg Leu Gly Arg Asn Thr Gln Lys
475 480 485 490

GAA GCC CTA GCA CTC ATT ATT GTA GAT GAA GAA GTT TCT TTG GAA GTT 1539
Glu Ala Leu Ala Leu Ile Ile Val Asp Glu Glu Val Ser Leu Glu Val
495 500 505

TTA GAA GAG CTT AAA AAC ATT CCT GCG TGC TTA AGC GTT CAT TAT GTG 1587
Leu Glu Glu Leu Lys Asn Ile Pro Ala Cys Leu Ser Val His Tyr Val
510 515 520

GTT ATT TAAGGTAGTT GGATGCGAGA TTTTTTAAAA CTTTTAAAAA 1633
Val Ile
GTG ATG AAT ATC CCT ACC GCT AAC ACG ATT GCC GCT GTG GAA TTG ACC 339
Val Met Asn Ile Pro Thr Ala Asn Thr Ile Ala Ala Val Glu Leu Thr
95 100 105 

ATG GCG CAT TTG ATC AAT GCA GTG CGT TCG TTC CCT TGT GCA AAC GAT 387
Met Ala His Leu Ile Asn Ala Val Arg Ser Phe Pro Cys Ala Asn Asp
110 115 120 

CAA ATC AAA CAC CAA AGG TTA TGG AAA AGA GAA GAT TGG TAT GGC ACG 435
Gln Ile Lys His Gln Arg Leu Trp Lys Arg Glu Asp Trp Tyr Gly Thr
125 130 135 

GAA TTG AAA AAT AAA AAG CTG GGC ATC ATT GGT TTT GGG AAT ATT GGC 483
Glu Leu Lys Asn Lys Lys Leu Gly Ile Ile Gly Phe Gly Asn Ile Gly
140 145 150 

TCT AGG GTG GGC ATT AGA GCA AAA GCC TTT GAA ATG GAA GTT CTA GCC 531 
Ser Arg Val Gly Ile Arg Ala Lys Ala Phe Glu Met Glu Val Leu Ala
155 160 165 170 

TAT GAT CCT TAT ATC CCT TCT TCA AAA GCC ACT GAT TTA GGA GTC ATT 579
Tyr Asp Pro Tyr Ile Pro Ser Ser Lys Ala Thr Asp Leu Gly Val Ile
175 180 185 

TAC ACG AAA AAT TTT GAA GAC ATT TTG CAA TGC GAT ATG ATC ACT ATC 627
Tyr Thr Lys Asn Phe Glu Asp Ile Leu Gln Cys Asp Met Ile Thr Ile
190 195 200 

CAC ACC CCT AAA AAT AAA GAA ACG ATT AAC ATG ATA GGT GCT AAA GAG 675 
His Thr Pro Lys Asn Lys Glu Thr Ile Asn Met Ile Gly Ala Lys Glu
205 210 215 

ATT GAG CGC ATG AAA AAA GGG GTT ATT TTG CTC AAT TGC GCT AGG GGT 723 
He Glu Arg Met Lys Lys Gly Val He Leu Leu Asn Cys Ala Arg Gly 
220 225 230 

GGG CTT TAT AAT GAA GAC GCT CTT TAT GAG GCT TTA GAA ACC AAA AAA 771 
Gly Leu Tyr Asn Glu Asp Ala Leu Tyr Glu Ala Leu Glu Thr Lys Lys 
235 240 245 250 

GTG CGT TGG CTT GGC ATT GAT GTC TTT TCT AAA GAG CCT GGC ATT CAC 819 
Val Arg Trp Leu Gly He Asp Val Phe Ser Lys Glu Pro Gly He His 
255 260 265 

AAC AAG CTT TTA GAC TTG CCC AAT GTT TAT GCG ACC CCC CAT ATT GGC 867 
Asn Lys Leu Leu Asp Leu Pro Asn Val Tyr Ala Thr Pro His He Gly 
270 275 280 

GCA AAC ACT TTA GAA TCC CAA GAA GAA ATT TCC AAA CAA GCC GCT CAA 915 
Ala Asn Thr Leu Glu Ser Gin Glu Glu He Ser Lys Gin Ala Ala Gin 
285 290 295 

GGG GTT ATG GAA TCT TTA AGG GGT TCA AGC CAC CCG CAT GCT TTG AAT 963 
Gly Val Met Glu Ser Leu Arg Gly Ser Ser His Pro His Ala Leu Asn 
300 305 310 
130 135 140

Ile Tyr Ser Gln Ile Ser Glu Thr Cys Lys His Leu Lys Leu Lys Gly
145 150 155 160 

Leu Met Cys lie Gly Ala His Thr Asp Asp Glu Lys Glu lie Glu Lys 

165 170 175 

Ser Phe lie Thr Thr Lys Lys Leu Phe Asp Gin lie Lys Asn Ala Ser 

180 185 190 

Val Leu Ser Met Gly Met Ser Asp Asp Phe Glu Leu Ala lie Ala Cys 

195 200 205 

Gly Ala Asn Leu Leu Arg lie Gly Ser Phe Leu Phe Lys Glu 
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 1295:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22...1593
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1295: 



TTTTAAATTC AAAGGATAAA A ATG TAT CAA GTA GCC ATT TGC GAC CCC ATC 51 

Met Tyr Gln Val Ala Ile Cys Asp Pro Ile
1 5 10



CAT GCT AAA GGC ATT CAA ATT TTA GAA GCT CAA AAA GAC ATT GTC TTG 99
His Ala Lys Gly Ile Gln Ile Leu Glu Ala Gln Lys Asp Ile Val Leu
15 20 25 



CAT GAT TAT TCC AAA TGC CCT AAA AAG GAG CTT TTA GAA AAA CTC ACT 14 7 

His Asp Tyr Ser Lys Cys Pro Lys Lys Glu Leu Leu Glu Lys Leu Thr 
30 35 40 



CCC ATG GAT GCG CTC ATC ACT CGC AGC ATG ACC CCT ATC ACA AGC GAT 19 5 

Pro Met Asp Ala Leu lie Thr Arg Ser Met Thr Pro lie Thr Ser Asp 
45 50 55 



TTT TTA AAG CCC TTA ACC CAC TTA AAA TCC ATC GTG AGA GCG GGC GTG 24 3 

Phe Leu Lys Pro Leu Thr His Leu Lys Ser He Val Arg Ala Gly Val 
60 65 70 



GGA GTG GAT AAT ATT GAT TTA GAA AGC TGC TCT CAA AAA GGG ATT GTA 2 91 

Gly Val Asp Asn He Asp Leu Glu Ser Cys Ser Gin Lys Gly He Val 
75 80 85 90 
GGG GTG GTG CCT GAA GAA GCG CTA GAA ATT TAT TCT CAA ATC AGT GAA 484
Gly Val Val Pro Glu Glu Ala Leu Glu Ile Tyr Ser Gln Ile Ser Glu
140 145 150 

ACT TGC AAG CAC CTC AAG CTT AAG GGG CTT ATG TGT ATA GGG GCA CAC 532 
Thr Cys Lys His Leu Lys Leu Lys Gly Leu Met Cys lie Gly Ala His 
155 160 165 

ACA GAT GAT GAA AAG GAA ATT GAA AAA TCC TTT ATC ACC ACC AAA AAG 580 
Thr Asp Asp Glu Lys Glu lie Glu Lys Ser Phe lie Thr Thr Lys Lys 
170 175 180 

CTT TTT GAC CAA ATA AAG AAT GCG AGC GTT CTT TCA ATG GGC ATG AGT 62 8 

Leu Phe Asp Gin lie Lys Asn Ala Ser Val Leu Ser Met Gly Met Ser 
185 190 195 

GAT GAT TTT GAA TTA GCG ATT GCT TGC GGG GCG AAT CTT TTA AGG ATT 67 6 

Asp Asp Phe Glu Leu Ala lie Ala Cys Gly Ala Asn Leu Leu Arg lie 
200 205 210 215 

GGC TCT TTT TTG TTC AAA GAG TAAGATGCTA GAAACTTATG CACTTAAAAA TGGG 731 
Gly Ser Phe Leu Phe Lys Glu 
220 

GCTGTTTTTA TCTCTGATGC GC 753 
(2) INFORMATION FOR SEQ ID NO: 1294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1294:

Met Leu Asp Tyr Arg Gln Lys Ile Asp Ala Leu Ile Thr Lys Ile Glu
1 5 10 15

Lys Ala Arg Thr Ala Tyr Ser Arg His His He Val Lys He Val Ala 

20 25 30 

Val Ser Lys Asn Ala Ser Pro Glu Ala He Gin His Tyr Tyr Asn Cys 

35 40 45 

Ser Gin Arg Ala Phe Gly Glu Asn Lys Val Gin Asp Leu Lys Thr Lys 

50 55 60 

Met His Ser Leu Glu His Leu Pro Leu Glu Trp His Met He Gly Ser 
65 70 75 80 

Leu Gin Glu Asn Lys He Asn Ala Leu Leu Ser Leu Lys Pro Ala Leu 

85 90 95 

Leu His Ser Leu Asp Ser Leu Lys Leu Ala Leu Lys He Glu Lys Arg 

100 105 110 

Cys Glu He Leu Gly Val Asn Leu Asn Ala Leu Leu Gin Val Asn Ser 

115 120 125 

Ala Tyr Glu Glu Ser Lys Ser Gly Val Val Pro Glu Glu Ala Leu Glu 
(2) INFORMATION FOR SEQ ID NO:1293:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 32...697
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1293:

GATCTATAAA GAATAGCCAT AAAGAAGAAT T ATG TTA GAT TAT CGC CAA AAA 52 

Met Leu Asp Tyr Arg Gln Lys
1 5 

ATT GAT GCT CTC ATC ACC AAA ATA GAA AAG GCT CGC ACC GCC TAT TCA 10 0 

lie Asp Ala Leu lie Thr Lys lie Glu Lys Ala Arg Thr Ala Tyr Ser 
10 15 20 

AGG CAC CAC ATT GTC AAA ATC GTG GCT GTT TCA AAA AAC GCT TCC CCA 14 8 

Arg His His He Val Lys He Val Ala Val Ser Lys Asn Ala Ser Pro 
25 30 35 

GAA GCT ATC CAA CAT TAT TAT AAC TGC TCT CAA AGG GCT TTT GGA GAA 196 
Glu Ala He Gin His Tyr Tyr Asn Cys Ser Gin Arg Ala Phe Gly Glu 
40 45 50 55 

AAT AAA GTT CAA GAT TTA AAA ACT AAA ATG CAT TCT TTA GAG CAT TTG 244 
Asn Lys Val Gin Asp Leu Lys Thr Lys Met His Ser Leu Glu His Leu 
60 65 70 

CCC CTT GAA TGG CAC ATG ATA GGC TCT TTA CAA GAA AAT AAA ATC AAT 2 92 

Pro Leu Glu Trp His Met He Gly Ser Leu Gin Glu Asn Lys He Asn 
75 80 85 

GCG CTT TTG AGT TTA AAG CCC GCT CTT TTG CAT TCT TTA GAC TCT TTA 34 0 

Ala Leu Leu Ser Leu Lys Pro Ala Leu Leu His Ser Leu Asp Ser Leu 
90 95 100 

AAA CTC GCT TTG AAA ATA GAA AAG CGT TGC GAA ATA TTG GGC GTC AAT 3 88 

Lys Leu Ala Leu Lys He Glu Lys Arg Cys Glu He Leu Gly Val Asn 
105 110 115 

TTA AAC GCT CTT TTA CAG GTT AAT AGC GCG TAT GAG GAA AGT AAA AGC 436 
Leu Asn Ala Leu Leu Gin Val Asn Ser Ala Tyr Glu Glu Ser Lys Ser 
120 125 130 135 
TTATAAAATT GTAAAATACT CTTATCTCAA ACGCTAAAAA GGGGTTTTAA ATGGATG 1026

(2) INFORMATION FOR SEQ ID NO: 1292:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1292: 

Met Ala Glu Lys Thr Ala Asn Asp Leu Lys Leu Ser Glu Ile Glu Leu
1 5 10 15

Val Asp Phe Arg lie Tyr Gly Met Gin Glu Gly Val Pro Tyr Glu Gly 

20 25 30 

lie Tyr Gly lie Asn Val Ala Lys Val Gin Glu lie lie Pro Met Pro 

35 40 45 

Thr Leu Phe Glu Tyr Pro Thr Asn Leu Asp Tyr lie lie Gly Val Phe 

50 55 60 

Asp Leu Arg Ser lie lie lie Pro Leu lie Asp Leu Ala Lys Trp lie 
65 70 75 80 

Gly lie lie Pro Asp Lys Ser Lys Glu Asn Glu Lys lie Val lie lie 

85 90 95 

Thr Glu Phe Asn Asn Val Lys Met Gly Phe Leu Val His Ser Ala Arg 

100 105 110 

Arg lie Arg Arg lie Ser Trp Lys Asp Val Glu Pro Ala Ser Phe Ser 

115 120 125 

Ala Ser Asn Ser lie Asn Lys Glu Asn lie Thr Gly Thr Thr Arg lie 

130 135 140 

Glu Asn Asp Lys Thr Leu Leu lie Leu Asp Leu Glu Ser lie Leu Asp 
145 150 155 160 

Asp Leu Lys Leu Asn Glu Asp Ala Lys Asn Ala Lys Asp Thr His Lys 

165 170 175 

Glu Arg Phe Glu Gly Glu Val Leu Phe Leu Asp Asp Ser Lys Thr Ala 

180 185 190 

Arg Lys Thr Leu Lys Asn His Leu Ser Lys Leu Gly Phe Ser lie Thr 

195 200 205 

Glu Ala Val Asp Gly Glu Asp Gly Leu Asn Lys Leu Glu Met Leu Phe 

210 215 220 

Lys Lys Tyr Gly Asp Asp Leu Arg Lys His Leu Lys Phe lie lie Ser 
225 230 235 240 

Asp Val Glu Met Pro Lys Met Asp Gly Tyr His Phe Leu Phe Lys Leu 

245 250 255 

Gin Lys Asp Pro Arg Phe Ala Tyr lie Pro Val lie Phe Asn Ser Ser 

260 265 270 

lie Cys Asp Asn Tyr Ser Ala Glu Arg Ala Lys Glu Met Gly Ala Val 

275 280 285 

Ala Tyr Leu Val Lys Phe Asp Ala Glu Lys Phe Thr Glu Glu lie Ser 

290 295 300 

Lys lie Leu Asp Lys Asn Ala 
305 310 
AAC GAA AAA ATC GTC ATT ATC ACT GAA TTT AAC AAC GTT AAA ATG GGT 342
Asn Glu Lys Ile Val Ile Ile Thr Glu Phe Asn Asn Val Lys Met Gly
90 95 100 105 

TTT TTA GTC CAT TCG GCT AGG CGT ATC AGG CGC ATT AGC TGG AAA GAT 3 90 

Phe Leu Val His Ser Ala Arg Arg lie Arg Arg lie Ser Trp Lys Asp 
110 115 120 

GTG GAG CCT GCA TCC TTT AGC GCC TCT AAT AGC ATC AAT AAA GAA AAT 43 8 

Val Glu Pro Ala Ser Phe Ser Ala Ser Asn Ser lie Asn Lys Glu Asn 
125 130 135 

ATT ACC GGC ACG ACA CGC ATT GAA AAC GAC AAA ACC CTG CTC ATT TTG 4 86 

He Thr Gly Thr Thr Arg He Glu Asn Asp Lys Thr Leu Leu He Leu 
140 145 150 

GAT TTA GAA AGC ATT TTA GAC GAT TTA AAG CTT AAT GAA GAC GCT AAA 534 
Asp Leu Glu Ser He Leu Asp Asp Leu Lys Leu Asn Glu Asp Ala Lys 
155 160 165 

AAC GCT AAA GAT ACC CAT AAA GAG CGT TTT GAA GGC GAA GTG TTG TTT 5 82 

Asn Ala Lys Asp Thr His Lys Glu Arg Phe Glu Gly Glu Val Leu Phe 
170 175 180 185 

TTA GAC GAT AGC AAG ACC GCA AGA AAA ACC TTA AAA AAC CAT TTG AGC 63 0 

Leu Asp Asp Ser Lys Thr Ala Arg Lys Thr Leu Lys Asn His Leu Ser 
190 195 200 

AAA TTA GGT TTT AGC ATC ACT GAA GCT GTG GAT GGG GAA GAC GGG TTG 67 8 

Lys Leu Gly Phe Ser He Thr Glu Ala Val Asp Gly Glu Asp Gly Leu 
205 210 215 

AAC AAA TTA GAA ATG TTA TTC AAA AAA TAC GGG GAC GAT TTG AGA AAG 72 6 

Asn Lys Leu Glu Met Leu Phe Lys Lys Tyr Gly Asp Asp Leu Arg Lys 
220 225 230 

CAT TTG AAA TTC ATT ATT TCA GAT GTT GAA ATG CCT AAA ATG GAT GGC 774 
His Leu Lys Phe He He Ser Asp Val Glu Met Pro Lys Met Asp Gly 
235 240 245 

TAT CAT TTC TTA TTC AAG CTC CAA AAA GAC CCT AGG TTT GCT TAT ATT 822 
Tyr His Phe Leu Phe Lys Leu Gin Lys Asp Pro Arg Phe Ala Tyr He 
250 255 260 265 

CCT GTG ATT TTT AAT TCT TCT ATT TGC GAT AAT TAC AGC GCT GAA AGG 87 0 

Pro Val He Phe Asn Ser Ser He Cys Asp Asn Tyr Ser Ala Glu Arg 
270 275 280 

GCT AAA GAA ATG GGG GCT GTA GCG TAT TTA GTC AAG TTT GAC GCA GAA 918 
Ala Lys Glu Met Gly Ala Val Ala Tyr Leu Val Lys Phe Asp Ala Glu 
285 290 295 

AAA TTC ACC GAA GAA ATT TCT AAG ATT TTA GAC AAG AAT GCG TAATTCTTT 96 9 
Lys Phe Thr Glu Glu He Ser Lys He Leu Asp Lys Asn Ala 
300 305 310 
195 200 205

Gly Val Lys Gly Asp Glu lie Leu Lys Glu lie Val Phe Leu Ala Ala 

210 215 220 

Lys Leu Lys lie Pro Phe Leu Val Cys Glu Met Gly Tyr Asp Gin Leu 
225 230 235 240 

Lys Ser Leu Lys Glu Cys Leu Glu Phe Cys Gly Tyr Asp Ala Glu Phe 

245 250 255 

Tyr Lys Asp Leu Ser Gly Phe Asp Arg Gly Phe Val Gly Val Leu Lys 

260 265 270 

Ser Phe Leu Arg 
275 

(2) INFORMATION FOR SEQ ID NO: 1291:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28...960
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1291: 

CCATCTCAAA ATAAGGACGC CTAAATC ATG GCA GAA AAA ACA GCT AAC GAT TTA 54 

Met Ala Glu Lys Thr Ala Asn Asp Leu 
1 5 

AAA CTA AGT GAG ATA GAA CTC GTG GAT TTT CGT ATT TAT GGC ATG CAA 102 
Lys Leu Ser Glu lie Glu Leu Val Asp Phe Arg lie Tyr Gly Met Gin 
10 15 20 25 

GAG GGC GTC CCT TAT GAG GGG ATT TAT GGT ATC AAT GTG GCT AAA GTC 15 0 

Glu Gly Val Pro Tyr Glu Gly He Tyr Gly He Asn Val Ala Lys Val 
30 35 40 

CAA GAA ATC ATC CCC ATG CCC ACC CTT TTT GAA TAC CCC ACG AAT TTG 19 8 

Gin Glu He He Pro Met Pro Thr Leu Phe Glu Tyr Pro Thr Asn Leu 
45 50 55 

GAT TAC ATT ATC GGC GTG TTT GAT TTG CGC TCC ATA ATC ATT CCG CTT 24 6 

Asp Tyr He He Gly Val Phe Asp Leu Arg Ser He He He Pro Leu 
60 65 70 

ATA GAC TTG GCT AAA TGG ATA GGG ATT ATC CCA GAT AAA AGC AAG GAA 2 94 

He Asp Leu Ala Lys Trp He Gly He He Pro Asp Lys Ser Lys Glu 
75 80 85 
TTA GCC GCT AAA TTA AAA ATC CCT TTT TTG GTT TGT GAA ATG GGG TAT 723
Leu Ala Ala Lys Leu Lys Ile Pro Phe Leu Val Cys Glu Met Gly Tyr
225 230 235 

GAC CAG TTG AAA AGC TTG AAA GAA TGC TTG GAA TTT TGC GGT TAT GAT 771
Asp Gln Leu Lys Ser Leu Lys Glu Cys Leu Glu Phe Cys Gly Tyr Asp
240 245 250 

GCA GAG TTT TAC AAG GAT TTG AGC GGC TTT GAT AGA GGG TTT GTG GGC 819
Ala Glu Phe Tyr Lys Asp Leu Ser Gly Phe Asp Arg Gly Phe Val Gly 
255 260 265 

GTT TTA AAA AGT TTT TTA AGA TAAATTAAAA CTTAATTACC CTTTTAGTGT TACA 874 
Val Leu Lys Ser Phe Leu Arg 
270 275 

ATAAAAACAC TTAA 888 
(2) INFORMATION FOR SEQ ID NO: 1290:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1290: 

Met Thr Leu Ser Gln Ala Leu Asn Lys Ala Lys Lys Gly Leu Ser Gln
1 5 10 15

Lys Gly Phe Arg Gly Gly Leu Glu Ser Glu lie Leu Leu Gly Phe Val 

20 25 30 

Leu Gin Lys Glu Arg Val Phe Leu His Thr His Ala Tyr Leu Glu Leu 

35 40 45 

Asn His Glu Glu Glu Val Arg Phe Phe Glu Leu Val Glu Lys Arg Leu 

50 55 60 

Asn Asn Cys Pro lie Glu Tyr Leu Leu Glu Ser Cys Asp Phe Tyr Gly 
65 70 75 80 

Arg Ser Phe Phe Val Asn Glu His Val Leu lie Pro Arg Pro Glu Thr 

85 90 95 

Glu lie Leu Val Gin Lys Ala Leu Asp lie lie Ser Gin Tyr His Leu 

100 105 110 

Lys Glu lie Gly Glu lie Gly lie Gly Ser Gly Cys Val Ser Val Ser 

115 120 125 

Leu Ala Leu Glu Asn Pro Asn Leu Ser lie Tyr Ala Ser Asp lie Ser 

130 135 140 

Pro Asn Ala Leu Glu Val Ala Ser Lys Asn He Glu His Phe Cys Leu 
145 150 155 160 

Lys Glu Arg Val Phe Leu Lys Gin Thr Arg Leu Trp Asp His Met Pro 

165 170 175 

Met He Glu Met Leu Val Ser Asn Pro Pro Tyr He Ala Arg Asn Tyr 

180 185 190 

Pro Leu Glu Lys Ser Val Leu Lys Glu Pro His Glu Ala Leu Phe Gly 
Met Thr Leu Ser Gln Ala Leu Asn Lys Ala Lys Lys Gly
1 5 10

TTA TCG CAA AAA GGT TTT AGG GGG GGC TTA GAA TCT GAA ATT TTA TTA 99
Leu Ser Gln Lys Gly Phe Arg Gly Gly Leu Glu Ser Glu Ile Leu Leu
15 20 25 

GGC TTT GTC TTG CAA AAA GAA AGG GTT TTT TTG CAC ACG CAT GCC TAT 14 7 

Gly Phe Val Leu Gin Lys Glu Arg Val Phe Leu His Thr His Ala Tyr 
30 35 40 45 

TTA GAG TTA AAC CAC GAA GAA GAG GTG CGT TTT TTT GAA TTG GTA GAA 195 
Leu Glu Leu Asn His Glu Glu Glu Val Arg Phe Phe Glu Leu Val Glu 
50 55 60 

AAG CGC TTG AAT AAC TGC CCC ATA GAG TAT TTA TTA GAA AGC TGT GAT 24 3 

Lys Arg Leu Asn Asn Cys Pro lie Glu Tyr Leu Leu Glu Ser Cys Asp 
65 70 75 

TTT TAT GGG CGC TCT TTT TTT GTG AAT GAG CAT GTT TTA ATC CCA CGA 2 91 

Phe Tyr Gly Arg Ser Phe Phe Val Asn Glu His Val Leu lie Pro Arg 
80 85 90 

CCT GAA ACC GAG ATT TTG GTC CAA AAA GCC CTT GAT ATT ATT TCT CAA 33 9 

Pro Glu Thr Glu He Leu Val Gin Lys Ala Leu Asp He He Ser Gin 
95 100 105 

TAC CAT TTA AAA GAG ATA GGC GAA ATC GGC ATA GGG AGC GGA TGC GTG 3 87 

Tyr His Leu Lys Glu He Gly Glu He Gly He Gly Ser Gly Cys Val 
110 115 120 125 

TCT GTG AGT TTG GCT TTA GAA AAC CCT AAT CTC TCT ATT TAT GCG AGC 43 5 

Ser Val Ser Leu Ala Leu Glu Asn Pro Asn Leu Ser He Tyr Ala Ser 
130 135 140 

GAT ATT TCA CCA AAC GCT TTA GAA GTG GCG TCC AAA AAT ATT GAG CAC 4 83 

Asp He Ser Pro Asn Ala Leu Glu Val Ala Ser Lys Asn He Glu His 
145 150 155 

TTT TGT CTA AAA GAG CGT GTT TTT TTA AAA CAA ACA CGC CTT TGG GAT 531 
Phe Cys Leu Lys Glu Arg Val Phe Leu Lys Gin Thr Arg Leu Trp Asp 
160 165 170 

CAT ATG CCC ATG ATA GAA ATG CTT GTC TCT AAC CCG CCC TAT ATC GCT 57 9 

His Met Pro Met He Glu Met Leu Val Ser Asn Pro Pro Tyr He Ala 
175 180 185 

AGA AAT TAT CCT TTG GAA AAA TCC GTC CTC AAA GAA CCG CAT GAA GCC 62 7 

Arg Asn Tyr Pro Leu Glu Lys Ser Val Leu Lys Glu Pro His Glu Ala 
190 195 200 205 

CTT TTT GGG GGG GTT AAA GGC GAT GAG ATC TTA AAA GAA ATC GTT TTT 67 5 

Leu Phe Gly Gly Val Lys Gly Asp Glu He Leu Lys Glu He Val Phe 
210 215 220 
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1288: 

Met Arg Phe Val Tyr His Pro Leu Ala Lys Glu Pro Val Leu Lys Ile
1 5 10 15

Glu Gly Glu Ser Tyr Thr His Leu Tyr Arg Ser Arg Arg Val Lys Ser 

20 25 30 

Ala Ser Arg Leu Asp Leu Arg Asn Leu Lys Asp Gly Phe Leu Tyr Thr 

35 40 45 

Tyr Glu His Ala Glu lie Thr Lys Lys His Ala Leu Leu Lys Leu Val 

50 55 60 

Gly Ala Arg Leu Leu Glu Val Met Ala Ser Lys Lys Thr His Leu lie 
65 70 75 80 

Leu Ser Val He Glu He Lys Asn He Glu Lys He Leu Pro Phe Leu 

85 90 95 

Asn Gin Leu Gly Val Ser Lys Leu Ser Leu Phe Tyr Ala Asp Phe Ser 

100 105 110 

Gin Arg Asn Glu Lys He Asp He Ala Lys Leu Glu Arg Phe Gin Lys 

115 120 125 

He Leu He His Ser Cys Glu Gin Cys Gly Arg Ser Ala Leu Met Glu 

130 135 140 

Leu Glu Val Phe Ser Asn Thr Lys Glu Ala Leu Lys Ala Tyr Pro Lys 
145 150 155 160 

Ala Ser Val Leu Asp Phe Lys Gly Glu Thr Leu Xaa Ala Ser Ala Asp 

165 170 175 

Phe Glu Lys Gly Val He He Gly Pro Glu Gly Gly Phe Ser Glu Pro 

180 185 190 

Glu Arg Gly Tyr Phe Lys Glu Arg Glu He Tyr Arg He Pro Leu Asp 

195 200 205 

Met Val Leu Lys Ser Glu Ser Ala Cys Val Phe Val Ala Ser He Ala 

210 215 220 

Gin Val 
225 

(2) INFORMATION FOR SEQ ID NO: 1289:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 13...840
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1289:
TTATGAAATT GA ATG ACC CTT TCG CAA GCC CTA AAC AAA GCC AAA AAA GGA 51 
Phe Leu Tyr Thr Tyr Glu His Ala Glu Ile Thr Lys Lys His Ala Leu
45 50 55 60 

TTA AAG CTA GTG GGC GCG CGA TTA TTA GAG GTT ATG GCC AGT AAA AAA 242 

Leu Lys Leu Val Gly Ala Arg Leu Leu Glu Val Met Ala Ser Lys Lys 

65 70 75 



ACG CAT TTG ATT TTA AGC GTG ATT GAA ATC AAA AAC ATT GAA AAA ATC 290
Thr His Leu Ile Leu Ser Val Ile Glu Ile Lys Asn Ile Glu Lys Ile
80 85 90 



CTA CCC TTT TTA AAT CAG TTA GGC GTG AGC AAG TTG AGT TTA TTC TAT 33 8 

Leu Pro Phe Leu Asn Gin Leu Gly Val Ser Lys Leu Ser Leu Phe Tyr 
95 100 105 



GCG GAT TTT AGC CAA CGC AAT GAA AAA ATA GAC ATC GCT AAA TTA GAG 3 86 

Ala Asp Phe Ser Gin Arg Asn Glu Lys lie Asp lie Ala Lys Leu Glu 
110 115 120 



CGC TTT CAA AAG ATT TTG ATC CAT TCT TGC GAG CAG TGT GGT AGG AGT 434
Arg Phe Gln Lys Ile Leu Ile His Ser Cys Glu Gln Cys Gly Arg Ser
125 130 135 140

GCT TTA ATG GAA TTG GAA GTG TTT TCA AAC ACT AAA GAG GCG CTA AAA 482
Ala Leu Met Glu Leu Glu Val Phe Ser Asn Thr Lys Glu Ala Leu Lys
145 150 155

GCC TAT CCT AAG GCG AGC GTT TTG GAT TTT AAG GGC GAA ACC TTA NCC 530
Ala Tyr Pro Lys Ala Ser Val Leu Asp Phe Lys Gly Glu Thr Leu Xaa
160 165 170

GCA AGC GCG GAT TTT GAA AAG GGC GTT ATC ATA GGG CCT GAG GGG GGC 578
Ala Ser Ala Asp Phe Glu Lys Gly Val Ile Ile Gly Pro Glu Gly Gly
175 180 185

TTT AGC GAA CCA GAA AGA GGG TAT TTT AAA GAG CGT GAA ATT TAT CGC 626
Phe Ser Glu Pro Glu Arg Gly Tyr Phe Lys Glu Arg Glu Ile Tyr Arg
190 195 200

ATC CCG TTA GAT ATG GTG CTA AAG TCT GAG AGT GCA TGC GTG TTT GTA 674
Ile Pro Leu Asp Met Val Leu Lys Ser Glu Ser Ala Cys Val Phe Val
205 210 215 220



GCG AGT ATC GCA CAA GTT TAGGGGGTTA TTGGGGATTT TAAATCCTAA AAAATC 728
Ala Ser Ile Ala Gln Val
225 



(2) INFORMATION FOR SEQ ID NO: 1288:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
Glu Lys Met Lys Leu Phe Arg Ser His Gly Met Leu Lys Lys Asp Phe

115 120 125 

Phe Glu Gly Glu Val Lys Ser lie Gly His Asn Phe Arg Leu Asn Glu 

130 135 140 

lie Gin Ser Ala Leu Gly Leu Ser Gin Leu Lys Lys Ala Pro Phe Leu 
145 150 155 160 

Met Gin Lys Arg Glu Glu Ala Ala Leu Thr Tyr Asp Arg lie Phe Lys 

165 170 175 

Asp Asn Pro Tyr Phe Thr Pro Leu His Pro Leu Leu Lys Asp Lys Ser 

180 185 190 

Ser Asn His Leu Tyr Pro lie Leu Met His Gin Lys Phe Phe Thr Cys 

195 200 205 

Lys Lys Leu lie Leu Glu Ser Leu His Lys Arg Gly lie Leu Ala Gin 

210 215 220 

Val His Tyr Lys Pro He Tyr Gin Tyr Gin Leu Tyr Gin Gin Leu Phe 
225 230 235 240 

Asn Thr Ala Pro Leu Lys Ser Ala Glu Asp Phe Tyr His Ala Glu He 

245 250 255 

Ser Leu Pro Cys His Ala Asn Leu Asn Leu Glu Ser Val Gin Asn He 

260 265 270 

Ala His Ser Val Leu Lys Thr Phe Glu Ser Phe Lys He Glu 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 1287:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 15...692
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1287:

TAATGGGCCT TTGA ATG CGT TTT GTC TAT CAC CCT TTA GCC AAA GAG CCT 50
Met Arg Phe Val Tyr His Pro Leu Ala Lys Glu Pro
1 5 10

GTT TTA AAA ATA GAA GGC GAG AGT TAT ACG CAT TTA TAC CGA TCA AGG 98
Val Leu Lys Ile Glu Gly Glu Ser Tyr Thr His Leu Tyr Arg Ser Arg
15 20 25 

CGT GTC AAA AGT GCG AGT CGT TTG GAT TTG AGA AAT TTA AAA GAC GGC 146

Arg Val Lys Ser Ala Ser Arg Leu Asp Leu Arg Asn Leu Lys Asp Gly 
30 35 40 

TTT TTA TAC ACC TAT GAG CAT GCA GAA ATC ACT AAA AAA CAC GCC CTT 194
Pro Tyr Phe Thr Pro Leu His Pro Leu Leu Lys Asp Lys Ser Ser Asn
180 185 190 

CAC CTT TAT CCT ATT TTA ATG CAC CAA AAA TTT TTT ACA TGC AAA AAA 6 80 

His Leu Tyr Pro lie Leu Met His Gin Lys Phe Phe Thr Cys Lys Lys 
195 200 205 210 

CTC ATT TTA GAA AGT TTG CAC AAG CGT GGC ATT TTA GCC CAA GTG CAT 72 8 

Leu lie Leu Glu Ser Leu His Lys Arg Gly lie Leu Ala Gin Val His 
215 220 225 

TAC AAG CCC ATT TAT CAA TAC CAA TTG TAC CAA CAG CTC TTC AAT ACA 77 6 

Tyr Lys Pro He Tyr Gin Tyr Gin Leu Tyr Gin Gin Leu Phe Asn Thr 
230 235 240 

GCC CCA TTA AAA AGC GCA GAG GAT TTC TAT CAC GCT GAA ATT TCC TTG 8 24 

Ala Pro Leu Lys Ser Ala Glu Asp Phe Tyr His Ala Glu He Ser Leu 
245 250 255 

CCT TGT CAT GCG AAT TTA AAT TTA GAG AGC GTT CAA AAC ATC GCT CAT 8 72 

Pro Cys His Ala Asn Leu Asn Leu Glu Ser Val Gin Asn He Ala His 
260 265 270 

AGC GTT TTA AAA ACT TTT GAG AGT TTT AAA ATA GAA TGAGTTTCAT TTAGGG 924 
Ser Val Leu Lys Thr Phe Glu Ser Phe Lys He Glu 
275 280 285 

CTTCAAATCT TAATCATTAA GAATGGTGCG GAAGAAA 961 
(2) INFORMATION FOR SEQ ID NO: 1286:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1286:

Met Leu Leu Glu Ser Gly Tyr Thr Pro Val Phe Ala Gly Ile Lys Asn
1 5 10 15

Asp Gly Asn He Asp Glu Leu Ala Leu Glu Lys Leu He Asn Glu Arg 

20 25 30 

Thr Lys Ala He Val Ser Val Asp Tyr Ala Gly Lys Ser Val Glu Val 

35 40 45 

Glu Ser Val Gin Lys Leu Cys Lys Lys His Ser Leu Ser Phe Leu Ser 

50 55 60 

Asp Ser Ser His Ala Leu Gly Ser Glu Tyr Gin Asn Lys Lys Val Gly 
65 70 75 80 

Gly Phe Ala Leu Ala Ser Val Phe Ser Phe His Ala He Lys Pro He 

85 90 95 

Thr Thr Ala Glu Gly Gly Ala Val Val Thr Asn Asp Ser Glu Leu His 
100 105 110 
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1285:

ATCGTAATGA AATAATCACC ACCCCTATAA GCTTTGTAGC GACGGCTAAC ATG CTT 56

Met Leu 
1 

TTA GAA AGC GGT TAT ACA CCC GTA TTT GCT GGA ATT AAA AAC GAT GGC 104 
Leu Glu Ser Gly Tyr Thr Pro Val Phe Ala Gly lie Lys Asn Asp Gly 
5 10 15 

AAT ATA GAT GAA TTA GCC CTA GAA AAG CTC ATT AAC GAA AGA ACC AAA 15 2 

Asn lie Asp Glu Leu Ala Leu Glu Lys Leu lie Asn Glu Arg Thr Lys 
20 25 30 

GCC ATA GTG AGC GTG GAT TAT GCC GGT AAA AGC GTG GAA GTA GAA AGC 200 
Ala lie Val Ser Val Asp Tyr Ala Gly Lys Ser Val Glu Val Glu Ser 
35 40 45 50 

GTT CAA AAG CTT TGC AAA AAG CAT TCT TTG AGC TTT CTT TCT GAC AGC 24 8 

Val Gin Lys Leu Cys Lys Lys His Ser Leu Ser Phe Leu Ser Asp Ser 
55 60 65 

TCG CAT GCT CTA GGA AGC GAG TAT CAA AAC AAA AAA GTA GGA GGC TTT 2 96 

Ser His Ala Leu Gly Ser Glu Tyr Gin Asn Lys Lys Val Gly Gly Phe 
70 75 80 

GCG TTA GCG AGC GTG TTT AGT TTC CAT GCC ATT AAG CCC ATC ACT ACG 344 
Ala Leu Ala Ser Val Phe Ser Phe His Ala lie Lys Pro lie Thr Thr 
85 90 95 

GCT GAA GGG GGA GCG GTC GTT ACT AAC GAT AGC GAA TTG CAT GAA AAA 3 92 

Ala Glu Gly Gly Ala Val Val Thr Asn Asp Ser Glu Leu His Glu Lys 
100 105 110 

ATG AAA TTG TTT CGC TCT CAT GGC ATG CTC AAA AAA GAT TTT TTT GAA 44 0 

Met Lys Leu Phe Arg Ser His Gly Met Leu Lys Lys Asp Phe Phe Glu 
115 120 125 130 

GGC GAA GTC AAA AGC ATA GGG CAT AAC TTC CGC TTG AAT GAA ATC CAA 4 88 

Gly Glu Val Lys Ser lie Gly His Asn Phe Arg Leu Asn Glu He Gin 
135 140 145 

AGC GCT TTG GGT TTG AGC CAG CTT AAA AAA GCC CCC TTT TTA ATG CAA 53 6 

Ser Ala Leu Gly Leu Ser Gin Leu Lys Lys Ala Pro Phe Leu Met Gin 
150 155 160 

AAA AGA GAA GAA GCC GCT CTA ACC TAT GAC AGG ATT TTT AAA GAT AAC 5 84 

Lys Arg Glu Glu Ala Ala Leu Thr Tyr Asp Arg He Phe Lys Asp Asn 
165 170 175 

CCT TAT TTC ACC CCT TTA CAC CCC TTG TTA AAA GAT AAA AGC TCT AAC 63 2 
195 200 205

Arg Phe Leu Val Ser Lys Ala Leu Tyr Phe Leu Gln Lys Glu Ser Leu
210 215 220

Gly Ala Met Gly Phe Leu Arg Gln Arg Glu Val Phe Arg Lys Arg Ser
225 230 235 240

Leu Lys Ala Ser Asp Ile Ser Phe Asn Ile Ala Pro Leu Ile Asn Ser
245 250 255

Ala Gly Arg Met Gln Asp Ala Lys Met Ala Leu Asp Phe Leu Ser Ala
260 265 270

Asn Asn Ser Gln Asp Gly Tyr Ser Leu Tyr Glu Arg Leu Lys Ala Cys
275 280 285

Asn Leu Lys Arg Lys Met Ile Gln Gln Gln Val Phe Glu Glu Ala Phe
290 295 300

Lys His Ala Met Val Gly Glu Lys Ile Ile Val Ala Phe Lys Asp Asn
305 310 315 320

Trp His Glu Gly Val Leu Gly Ile Val Ala Ser Lys Leu Val Glu Ala
325 330 335

Thr Gln Lys Pro Ser Leu Val Phe Thr Phe Lys Glu Gly Val Tyr Lys
340 345 350

Gly Ser Ala Arg Ser Ser Ser Asn Ile Asp Leu Ile Asp Ala Leu Asn
355 360 365

Gly Val Ser Ser Leu Leu Leu Gly Tyr Gly Gly His Arg Gln Ala Cys
370 375 380

Gly Leu Ser Val Glu Lys Asn Asn Ile Ile Ser Leu Phe Glu Thr Leu
385 390 395 400

Glu Asn Phe Asp Phe Lys Val Leu Pro Phe Cys Lys Thr Glu Pro Pro
405 410 415

Leu Thr Leu Lys Leu Lys Asp Ile Asp Arg Glu Leu Leu Glu Ile Ile
420 425 430

Glu Met Gly Glu Pro Tyr Gly Gln Glu Asn Pro Glu Pro Leu Phe Gln
435 440 445

Ala Lys Asn Leu Glu Val Ile Glu Glu Lys Ile Ile Lys Glu Ser His
450 455 460

Gln Val Leu Arg Phe Lys Asp Lys Glu Cys Val Lys Glu Ala Ile Tyr
465 470 475 480

Phe Ser Ala Glu Arg Phe Leu Lys Ala Gly Glu Lys Val Ser Val Leu
485 490 495

Phe Ser Val Glu Leu Asp Glu Cys Ser Asn Glu Pro Lys Met Phe Val
500 505 510

Lys Ser Leu Leu
515

(2) INFORMATION FOR SEQ ID NO: 1285:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 961 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...908
ATC ATT AAA GAA AGC CAC GAG GTT TTG CGT TTT AAG GAT AAA GAA TGC 1442
Ile Ile Lys Glu Ser His Gln Val Leu Arg Phe Lys Asp Lys Glu Cys
460 465 470

GTC AAA GAG GCT ATT TAT TTT AGC GCT GAG CGG TTT TTG AAA GCG GGC 1490
Val Lys Glu Ala Ile Tyr Phe Ser Ala Glu Arg Phe Leu Lys Ala Gly
475 480 485 490

GAA AAG GTG AGC GTG GTT TTT AGC GTG GAA TTA GAT GAG TGT TCT AAT 1538
Glu Lys Val Ser Val Leu Phe Ser Val Glu Leu Asp Glu Cys Ser Asn
495 500 505

GAG CCT AAA ATG TTT GTT AAA AGT TTG TTG TAGTGCCTTT TGTTGAAGAA GAA 1591
Glu Pro Lys Met Phe Val Lys Ser Leu Leu
510 515

TTTGAAATTT TAAAACCCAC CAAAGCCTTG TTTTTT 1627

(2) INFORMATION FOR SEQ ID NO: 1284:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 516 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1284:

Met Lys Gln Lys Leu Lys Ala Gln Ile Lys Glu Arg Met Ala Ser Ile
1 5 10 15

Ala Tyr Asn Glu Lys Gly Phe Pro Ser Pro Phe Leu Phe Lys Asp Leu
20 25 30

Lys Lys Ala Ala Leu Lys Ile Ile Glu Ala Met Arg Thr Asn Thr Glu
35 40 45

Ile Leu Val Val Gly Asp Tyr Asp Ala Asp Gly Val Ile Ser Ser Ala
50 55 60

Ile Met Ala Lys Phe Phe Glu Ser Leu Asn Tyr Lys His Val Arg Ile
65 70 75 80

Ala Ile Pro Asn Arg Phe Met Asp Gly Tyr Gly Ile Ser Lys Lys Phe
85 90 95

Leu Glu Lys His His Ala Pro Leu Ile Ile Thr Val Asp Asn Gly Ile
100 105 110

Asn Ala Phe Glu Ala Ala Arg Phe Cys Lys Glu Lys Asn Tyr Thr Leu
115 120 125

Ile Ile Thr Asp His His Cys Leu His His Asp Glu Val Pro Asp Ala
130 135 140

Tyr Ala Val Ile Asn Pro Lys Gln Pro Asp Cys Asp Phe Ile Gln Lys
145 150 155 160

Glu Val Cys Gly Ala Leu Val Ala Phe Tyr Leu Cys Tyr Gly Ile His
165 170 175

Gln Leu Leu Gly Lys Glu Lys Ser His Ser Ser Glu Leu Leu Cys Leu
180 185 190

Ala Gly Val Ala Thr Ile Ala Asp Met Met Pro Leu Thr Phe Phe Asn
GTT TTT AGA AAA CGC TCT TTA AAA GCG AGT GAT ATT TCT TTT AAT ATC 770
Val Phe Arg Lys Arg Ser Leu Lys Ala Ser Asp Ile Ser Phe Asn Ile
235 240 245 250

GCC CCC TTA ATC AAC TCC GCA GGG CGC ATG CAA GAT GCG AAA ATG GCT 818
Ala Pro Leu Ile Asn Ser Ala Gly Arg Met Gln Asp Ala Lys Met Ala
255 260 265

TTA GAT TTT TTA AGC GCG AAT AAT TCT CAA GAT GGC TAT TCT TTG TAT 866
Leu Asp Phe Leu Ser Ala Asn Asn Ser Gln Asp Gly Tyr Ser Leu Tyr
270 275 280

GAA CGC TTG AAA GCA TGC AAT TTG AAG CGT AAA ATG ATC CAA CAG CAG 914
Glu Arg Leu Lys Ala Cys Asn Leu Lys Arg Lys Met Ile Gln Gln Gln
285 290 295

GTT TTT GAA GAA GCT TTT AAG CAT GCG ATG GTT GGA GAA AAA ATT ATC 962
Val Phe Glu Glu Ala Phe Lys His Ala Met Val Gly Glu Lys Ile Ile
300 305 310

GTC GCT TTT AAG GAC AAT TGG CAT GAG GGA GTG CTG GGG ATT GTG GCT 1010
Val Ala Phe Lys Asp Asn Trp His Glu Gly Val Leu Gly Ile Val Ala
315 320 325 330

TCA AAA TTA GTG GAA GCC ACT CAA AAG CCA AGC CTG GTT TTT ACC TTT 1058
Ser Lys Leu Val Glu Ala Thr Gln Lys Pro Ser Leu Val Phe Thr Phe
335 340 345

AAA GAA GGG GTG TAT AAA GGG AGC GCT CGT AGC TCT TCA AAC ATT GAC 1106
Lys Glu Gly Val Tyr Lys Gly Ser Ala Arg Ser Ser Ser Asn Ile Asp
350 355 360

TTG ATT GAC GCT TTG AAT GGG GTT TCT TCT TTA TTG CTC GGC TAT GGA 1154
Leu Ile Asp Ala Leu Asn Gly Val Ser Ser Leu Leu Leu Gly Tyr Gly
365 370 375

GGG CAT AGG CAA GCT TGC GGG TTG AGC GTT GAA AAA AAC AAT ATC ATC 1202
Gly His Arg Gln Ala Cys Gly Leu Ser Val Glu Lys Asn Asn Ile Ile
380 385 390

TCG CTC TTT GAA ACT TTA GAA AAT TTT GAT TTT AAA GTT TTA CCT TTT 1250
Ser Leu Phe Glu Thr Leu Glu Asn Phe Asp Phe Lys Val Leu Pro Phe
395 400 405 410

TGT AAA ACA GAG CCC CCT TTA ACG CTC AAA TTA AAA GAC ATT GAC AGA 1298
Cys Lys Thr Glu Pro Pro Leu Thr Leu Lys Leu Lys Asp Ile Asp Arg
415 420 425

GAG CTT TTA GAG ATT ATA GAA ATG GGC GAA CCT TAT GGG CAA GAA AAC 1346
Glu Leu Leu Glu Ile Ile Glu Met Gly Glu Pro Tyr Gly Gln Glu Asn
430 435 440

CCT GAA CCC CTA TTC CAA GCA AAA AAT TTA GAA GTC ATA GAA GAA AAA 1394
Pro Glu Pro Leu Phe Gln Ala Lys Asn Leu Glu Val Ile Glu Glu Lys
445 450 455
GAG CGC ATG GCT TCT ATC GCT TAT AAT GAA AAA GGG TTT CCT AGC CCC 98
Glu Arg Met Ala Ser Ile Ala Tyr Asn Glu Lys Gly Phe Pro Ser Pro
15 20 25

TTT TTA TTT AAA GAG TTG AAA AAA GCC GCG CTC AAA ATC ATA GAA GCC 146
Phe Leu Phe Lys Asp Leu Lys Lys Ala Ala Leu Lys Ile Ile Glu Ala
30 35 40

ATG CGC ACA AAC ACA GAA ATT TTA GTG GTG GGC GAT TAT GAC GCT GAC 194
Met Arg Thr Asn Thr Glu Ile Leu Val Val Gly Asp Tyr Asp Ala Asp
45 50 55

GGC GTG ATT AGC TCT GCT ATC ATG GCA AAA TTT TTT GAA AGC CTG AAC 242
Gly Val Ile Ser Ser Ala Ile Met Ala Lys Phe Phe Glu Ser Leu Asn
60 65 70

TAT AAG CAT GTC CGC ATT GCA ATC CCT AAT CGC TTC ATG GAT GGC TAT 290
Tyr Lys His Val Arg Ile Ala Ile Pro Asn Arg Phe Met Asp Gly Tyr
75 80 85 90

GGG ATT TCT AAA AAA TTT TTA GAA AAA CAC CAC GCC CCT TTA ATC ATC 338
Gly Ile Ser Lys Lys Phe Leu Glu Lys His His Ala Pro Leu Ile Ile
95 100 105

ACG GTG GAT AAC GGG ATT AAC GCC TTT GAA GCC GCG CGA TTT TGT AAA 386
Thr Val Asp Asn Gly Ile Asn Ala Phe Glu Ala Ala Arg Phe Cys Lys
110 115 120

GAA AAA AAT TAG ACC CTT ATC ATC ACA GAT CAC CAT TGC TTA CAC CAT 434
Glu Lys Asn Tyr Thr Leu Ile Ile Thr Asp His His Cys Leu His His
125 130 135

GAT GAA GTC CCA GAC GCT TAT GCG GTG ATC AAC CCC AAG CAA CCG GAT 482
Asp Glu Val Pro Asp Ala Tyr Ala Val Ile Asn Pro Lys Gln Pro Asp
140 145 150

TGT GAT TTT ATC CAA AAG GAA GTG TGC GGG GCG TTG GTA GCG TTT TAT 530
Cys Asp Phe Ile Gln Lys Glu Val Cys Gly Ala Leu Val Ala Phe Tyr
155 160 165 170

TTG TGC TAT GGG ATC CAT CAG CTT TTA GGA AAA GAA AAA AGC CAT TCT 578
Leu Cys Tyr Gly Ile His Gln Leu Leu Gly Lys Glu Lys Ser His Ser
175 180 185

AGT GAG TTA TTA TGT TTA GCG GGC GTG GCG ACT ATT GCT GAC ATG ATG 626
Ser Glu Leu Leu Cys Leu Ala Gly Val Ala Thr Ile Ala Asp Met Met
190 195 200

CCT TTG ACT TTT TTT AAC CGC TTT TTA GTT TCT AAA GCC TTG TAT TTT 674
Pro Leu Thr Phe Phe Asn Arg Phe Leu Val Ser Lys Ala Leu Tyr Phe
205 210 215

TTG CAA AAA GAA TCC TTA GGG GCG ATG GGT TTT TTG CGC CAA AGA GAA 722
Leu Gln Lys Glu Ser Leu Gly Ala Met Gly Phe Leu Arg Gln Arg Glu
220 225 230
TTA AGT ATG CTG AAA AAC TTT TAG CTA CCG ATA TTA TAAAAGATAA TCTTAA 395
Leu Ser Met Leu Lys Asn Phe Tyr Leu Pro Ile Leu
100 105 110

TAAC 399

(2) INFORMATION FOR SEQ ID NO:1282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1282: 

Met Glu Asn Asp Val Lys Glu Asp Leu Glu Gln Ala Arg Pro Lys Leu
1 5 10 15

Glu Pro Glu Lys Gln Lys Gln Glu Pro Glu Glu Gln Lys Gln Glu Lys
20 25 30

Gln Asp Lys Gln Glu Gln Lys Pro Lys Gln Glu Lys Glu Glu Ser Lys
35 40 45

Ser Lys Glu Gln Glu Glu Asn Lys Lys Gln Lys Arg Ser Ser Tyr Ile
50 55 60

Phe Trp Gly Cys Ile Ile Gly Leu Cys Ile Val Val Ile Ile Ala Lys
65 70 75 80

Ile Ile Ala Phe Gly Gly Ser Ser Glu Glu Ala Lys Ala Asp Lys Pro
85 90 95

Lys Asn Ser Leu Ser Met Leu Lys Asn Phe Tyr Leu Pro Ile Leu
100 105 110

(2) INFORMATION FOR SEQ ID NO: 1283:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 21...1568
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1283: 

TCTAAATCCT AATCTAACAA ATG AAA CAA AAG CTT AAA GCT CAA ATC AAA 50
Met Lys Gln Lys Leu Lys Ala Gln Ile Lys
1 5 10
Lys Tyr Gly Val Ser Leu Ile Tyr Leu Ile Glu Ser Thr Ile Leu Tyr
50 55 60

Phe Val Ser Lys Asp Leu Ser Trp Ile Val Ala Leu Thr Ile Phe Ser
65 70 75 80

Leu Ser Leu Ile Leu Val Ala Phe Lys Ile Phe Leu Leu Lys Asp Asn
85 90 95

Pro Asn Lys Arg Phe Lys Asn Asn Lys Arg Asp Lys Lys
100 105

(2) INFORMATION FOR SEQ ID NO: 1281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 47...379
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1281:

AGCGCTCAAA TCATTTATTG GTTATCAAAA TATTTTAGGA GTGAGT ATG GAA AAT 55
Met Glu Asn
1

GAT GTT AAA GAA GAT CTA GAG CAA GCA AGA CCA AAG TTA GAG CCA GAA 103
Asp Val Lys Glu Asp Leu Glu Gln Ala Arg Pro Lys Leu Glu Pro Glu
5 10 15

AAG CAA AAG CAA GAG CCA GAG GAA CAG AAA CAA GAA AAA CAA GAC AAA 151
Lys Gln Lys Gln Glu Pro Glu Glu Gln Lys Gln Glu Lys Gln Asp Lys
20 25 30 35

CAA GAG CAG AAG CCA AAG CAA GAA AAA GAA GAG TCA AAG AGC AAG GAA 199
Gln Glu Gln Lys Pro Lys Gln Glu Lys Glu Glu Ser Lys Ser Lys Glu
40 45 50

CAA GAA GAA AAC AAA AAA CAA AAG AGA TCT AGC TAT ATT TTT TGG GGA 247
Gln Glu Glu Asn Lys Lys Gln Lys Arg Ser Ser Tyr Ile Phe Trp Gly
55 60 65

TGT ATT ATT GGT TTG TGT ATA GTT GTT ATT ATT GCC AAA ATT ATT GCG 295
Cys Ile Ile Gly Leu Cys Ile Val Val Ile Ile Ala Lys Ile Ile Ala
70 75 80

TTT GGC GGA TCT AGT GAG GAG GCA AAA GCA GAC AAA CCA AAA AAC TCT 343
Phe Gly Gly Ser Ser Glu Glu Ala Lys Ala Asp Lys Pro Lys Asn Ser
85 90 95
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1279:

TAAAACGCTC TTGAAAGGGT GAGCGTGGAA TTTGCTTCTA TCGTTTGGTT GCTCATAGTC 60

AATATTCTGA TTTTTATTCT C ATG CTG GTG GAT AAA AAT TCG GCT GAT CAA 111
Met Leu Val Asp Lys Asn Ser Ala Asp Gln
1 5 10

AAA ATG TGG CGT ATT CCT GAA AAA GCT TTG TGG GTT TTA TCG CTC CTT 159
Lys Met Trp Arg Ile Pro Glu Lys Ala Leu Trp Val Leu Ser Leu Leu
15 20 25

GGC GGG TCT GTC GGG TTT TTG GTC GCT ATG GTT GTG TCC CAC CAT AAG 207
Gly Gly Ser Val Gly Phe Leu Val Ala Met Val Val Ser His His Lys
30 35 40

ATC TTA AAG CCT GAG TTT AAA TAG GGC GTT TCG CTC ATT TAC TTG ATA 255
Ile Leu Lys Pro Glu Phe Lys Tyr Gly Val Ser Leu Ile Tyr Leu Ile
45 50 55

GAG AGC ACA ATC CTT TAC TTT GTC AGC AAA GAT CTT TCT TGG ATA GTA 303
Glu Ser Thr Ile Leu Tyr Phe Val Ser Lys Asp Leu Ser Trp Ile Val
60 65 70

GCG CTA ACG ATA TTC TCA CTA TCT TTG ATA CTG GTA GCG TTT AAG ATC 351
Ala Leu Thr Ile Phe Ser Leu Ser Leu Ile Leu Val Ala Phe Lys Ile
75 80 85 90

TTC CTC CTT AAA GAC AAC CCT AAC AAA CGC TTC AAA AAC AAC AAG AGG 399
Phe Leu Leu Lys Asp Asn Pro Asn Lys Arg Phe Lys Asn Asn Lys Arg
95 100 105

GAT AAA AAA TAATGTCTTA TTTTTTTAAA ATCATTCTGG GCACAAGCGT GATCGTGGG 457
Asp Lys Lys

GG 459
(2) INFORMATION FOR SEQ ID NO: 1280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 109 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1280:

Met Leu Val Asp Lys Asn Ser Ala Asp Gln Lys Met Trp Arg Ile Pro
1 5 10 15

Glu Lys Ala Leu Trp Val Leu Ser Leu Leu Gly Gly Ser Val Gly Phe
20 25 30

Leu Val Ala Met Val Val Ser His His Lys Ile Leu Lys Pro Glu Phe
35 40 45
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1278:

Met Lys Ser His Phe Gln Tyr Ser Thr Leu Glu Asn Ile Pro Lys Ala
1 5 10 15

Phe Asp Ile Leu Lys Asp Pro Pro Lys Lys Leu Tyr Cys Val Gly Asp
20 25 30

Thr Lys Leu Leu Asp Thr Pro Leu Lys Val Ala Ile Ile Gly Thr Arg
35 40 45

Arg Pro Thr Pro Tyr Ser Lys Gln His Thr Ile Thr Leu Ala Arg Glu
50 55 60

Leu Ala Lys Asn Gly Ala Val Ile Val Ser Gly Gly Ala Leu Gly Val
65 70 75 80

Asp Ile Ile Ala Gln Glu Asn Ala Leu Pro Lys Thr Ile Met Leu Ser
85 90 95

Pro Cys Ser Leu Asp Phe Ile Tyr Pro Thr Asn Asn His Lys Val Ile
100 105 110

Gln Glu Ile Ala Gln Asn Gly Leu Ile Leu Ser Glu Tyr Glu Lys Asp
115 120 125

Phe Met Pro Ile Lys Gly Ser Phe Leu Ala Arg Asn Arg Leu Val Ile
130 135 140

Ala Leu Ser Asp Val Val Ile Ile Pro Gln Ala Asp Leu Lys Ser Gly
145 150 155 160

Ser Met Ser Ser Ala Arg Leu Ala Gln Lys Tyr Gln Lys Pro Leu Phe
165 170 175

Val Leu Pro Gln Arg Leu Asn Glu Ser Asp Gly Thr Asn Glu Leu Leu
180 185 190

Glu Lys Gly Gln Ala Gln Gly Ile Phe Asn Ile Gln Asn Phe Ile Asn
195 200 205

Thr Leu Leu Lys Asp Tyr His Leu Lys Glu Met Pro Glu Met Glu Asp
210 215 220

Glu Phe Leu Glu Tyr Cys Ala Lys Asn Pro Ser Tyr Glu Glu Ala Tyr
225 230 235 240

Leu Lys Phe Gly Asp Lys Leu Leu Glu Tyr Glu Leu Leu Gly Lys Ile
245 250 255

Lys Arg Ile Asn His Ile Val Val Leu Ala
260 265

(2) INFORMATION FOR SEQ ID NO: 1279:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 459 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 82...408
(D) OTHER INFORMATION: 
ATC TAT CCT ACG AAC AAT CAT AAA GTG ATC CAA GAA ATC GCG CAA AAC 391
Ile Tyr Pro Thr Asn Asn His Lys Val Ile Gln Glu Ile Ala Gln Asn
105 110 115

GGC TTG ATT TTA AGC GAA TAT GAA AAG GAT TTC ATG CCC ATT AAA GGC 439
Gly Leu Ile Leu Ser Glu Tyr Glu Lys Asp Phe Met Pro Ile Lys Gly
120 125 130

TCT TTT TTG GCG AGA AAC CGC CTG GTG ATC GCT TTA AGC GAT GTG GTG 487
Ser Phe Leu Ala Arg Asn Arg Leu Val Ile Ala Leu Ser Asp Val Val
135 140 145 150

ATT ATC CCC CAA GCG GAT TTA AAA AGC GGC TCT ATG AGC AGC GCG AGA 535
Ile Ile Pro Gln Ala Asp Leu Lys Ser Gly Ser Met Ser Ser Ala Arg
155 160 165

TTA GCC CAG AAA TAC CAA AAG CCT TTA TTT GTT TTA CCC CAA CGC CTG 583
Leu Ala Gln Lys Tyr Gln Lys Pro Leu Phe Val Leu Pro Gln Arg Leu
170 175 180

AAT GAG AGC GAT GGC ACT AAT GAG CTT TTA GAA AAA GGG CAG GCT CAA 631
Asn Glu Ser Asp Gly Thr Asn Glu Leu Leu Glu Lys Gly Gln Ala Gln
185 190 195

GGG ATA TTT AAT ATT CAA AAT TTT ATA AAC ACC CTT TTA AAA GAC TAC 679
Gly Ile Phe Asn Ile Gln Asn Phe Ile Asn Thr Leu Leu Lys Asp Tyr
200 205 210

CAT TTA AAA GAA ATG CCT GAA ATG GAA GAT GAA TTT TTA GAA TAT TGT 727
His Leu Lys Glu Met Pro Glu Met Glu Asp Glu Phe Leu Glu Tyr Cys
215 220 225 230

GCC AAA AAC CCG AGC TAT GAA GAA GCG TAT CTC AAA TTT GGG GAT AAG 775
Ala Lys Asn Pro Ser Tyr Glu Glu Ala Tyr Leu Lys Phe Gly Asp Lys
235 240 245

CTT TTA GAA TAC GAG CTG TTG GGT AAG ATC AAG CGC ATC AAT CAC ATT 823
Leu Leu Glu Tyr Glu Leu Leu Gly Lys Ile Lys Arg Ile Asn His Ile
250 255 260

GTG GTG TTA GCG TGATTTTGGC ATGCGATGTG GGGTTAAAAC GCATTGGCAT CGCTG 880
Val Val Leu Ala
265

CGCTTTTAAA TGGCGT 896

(2) INFORMATION FOR SEQ ID NO: 1278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
Lys Met Cys Glu Tyr Ala Lys Thr Leu Ala Ser Glu Phe His Arg Phe
485 490 495

Tyr Asn Ala Gly Lys Ile Leu Asp Thr Pro Lys Ala Lys Glu Leu Leu
500 505 510

Lys Ile Cys Leu Ile Val Ser Leu Ser Leu Ser Asn Ala Phe Lys Leu
515 520 525

Leu Gly Ile Glu Ile Lys Thr Lys Ile Ser Ala Arg Asp
530 535 540



(2) INFORMATION FOR SEQ ID NO: 1277:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 896 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 38...835
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1277:



AAGTAGAGAT TATATTACCT AGATAGTGAA TCAACGA ATG AAA AGC CAC TTC CAA 55
Met Lys Ser His Phe Gln
1 5

TAG AGC ACG CTA GAA AAT ATC CCT AAA GCC TTT GAC ATT CTC AAA GAC 103
Tyr Ser Thr Leu Glu Asn Ile Pro Lys Ala Phe Asp Ile Leu Lys Asp
10 15 20

CCC CCT AAA AAA CTC TAT TGT GTG GGC GAT ACC AAG CTT TTG GAC ACG 151
Pro Pro Lys Lys Leu Tyr Cys Val Gly Asp Thr Lys Leu Leu Asp Thr
25 30 35

CCT TTA AAA GTG GCG ATC ATA GGC ACA AGA AGA CCC ACC CCT TAG AGC 199
Pro Leu Lys Val Ala Ile Ile Gly Thr Arg Arg Pro Thr Pro Tyr Ser
40 45 50

AAG CAA CAC ACG ATC ACT CTA GCT AGA GAG CTT GCT AAA AAT GGC GCG 247
Lys Gln His Thr Ile Thr Leu Ala Arg Glu Leu Ala Lys Asn Gly Ala
55 60 65 70

GTT ATT GTG AGT GGG GGA GCG TTA GGC GTG GAT ATT ATC GCT CAA GAA 295
Val Ile Val Ser Gly Gly Ala Leu Gly Val Asp Ile Ile Ala Gln Glu
75 80 85

AAC GCC TTG CCA AAA ACG ATC ATG CTT TCG CCT TGC AGT TTG GAT TTC 343
Asn Ala Leu Pro Lys Thr Ile Met Leu Ser Pro Cys Ser Leu Asp Phe
90 95 100
35 40 45

Ile Ala Glu Glu Leu Ala Leu Lys Ile Ser Thr His Glu Lys Thr Gln
50 55 60

Gly Leu Phe Asp Ser Val Val Ala Cys Lys Gly Tyr Ile Asn Phe Thr
65 70 75 80

Leu Ser Leu Asp Phe Leu Glu Arg Phe Thr Gln Lys Ala Leu Glu Leu
85 90 95

Lys Glu Lys Phe Gly Ser Gln Val Lys Ser Glu Arg Ser Gln Lys Ile
100 105 110

Phe Leu Glu Phe Val Ser Ala Asn Pro Thr Gly Pro Leu His Ile Gly
115 120 125

His Ala Arg Gly Ala Val Phe Gly Asp Ser Leu Ala Lys Ile Ala Arg
130 135 140

Phe Leu Gly His Glu Val Leu Cys Glu Tyr Tyr Val Asn Asp Met Gly
145 150 155 160

Ser Gln Ile Arg Leu Leu Gly Leu Ser Val Trp Leu Ala Tyr Arg Glu
165 170 175

His Val Leu Lys Glu Ser Val Thr Tyr Pro Glu Val Phe Tyr Lys Gly
180 185 190

Glu Tyr Ile Ile Glu Ile Ala Lys Lys Ala Asn Asn Asp Leu Glu Pro
195 200 205

Ser Leu Leu Lys Glu Asn Glu Glu Thr Ile Ile Glu Val Leu Ser Gly
210 215 220

Tyr Ala Arg Asp Leu Met Leu Leu Glu Ile Lys Asp Asn Leu Asp Ala
225 230 235 240

Leu Gly Ile His Phe Asp Ser Tyr Ala Ser Glu Lys Glu Val Phe Lys
245 250 255

His Lys Asp Ala Val Phe Glu Gln Leu Glu Lys Ala Asn Ala Leu Tyr
260 265 270

Glu Lys Asp Ser Lys Ile Trp Leu Lys Ser Ser Leu Tyr Gln Asp Glu
275 280 285

Ser Asp Arg Val Leu Ile Lys Glu Asp Lys Ser Tyr Thr Tyr Leu Ala
290 295 300

Gly Asp Ile Val Tyr His Asp Glu Lys Phe Lys Gln Asp Tyr Thr Lys
305 310 315 320

Tyr Ile Asn Ile Trp Gly Ala Asp His His Gly Tyr Ile Ala Arg Val
325 330 335

Lys Ala Ser Leu Glu Phe Leu Gly Tyr Asp Ser Asn Lys Leu Glu Val
340 345 350

Leu Leu Ala Gln Met Val Arg Leu Leu Lys Asp Asn Glu Pro Tyr Lys
355 360 365

Met Ser Lys Arg Ala Gly Asn Phe Ile Leu Ile Lys Asp Val Val Asp
370 375 380

Asp Val Gly Lys Asp Ala Leu Arg Phe Ile Phe Leu Ser Lys Arg Leu
385 390 395 400

Asp Thr His Leu Glu Phe Asp Val Asn Thr Leu Lys Lys Gln Asp Ser
405 410 415

Ser Asn Pro Ile Tyr Tyr Ile His Tyr Ala Asn Ser Arg Ile His Thr
420 425 430

Met Leu Glu Lys Ser Pro Phe Ser Lys Glu Glu Val Leu Gln Thr Pro
435 440 445

Leu Thr Asn Leu Asn Ala Glu Glu Lys Tyr Leu Leu Phe Ser Ala Leu
450 455 460

Ser Leu Pro Lys Ala Ile Glu Ser Ser Phe Glu Glu Tyr Gly Leu Gln
465 470 475 480
ACT TTA AAA AAG CAA GAC AGC TCA AAC CCC ATT TAG TAT ATC CAT TAG 1300
Thr Leu Lys Lys Gln Asp Ser Ser Asn Pro Ile Tyr Tyr Ile His Tyr
410 415 420 425

GCT AAT TCG CGC ATC CAC ACC ATG CTA GAA AAA TCG CCC TTT TCT AAA 1348
Ala Asn Ser Arg Ile His Thr Met Leu Glu Lys Ser Pro Phe Ser Lys
430 435 440

GAA GAG GTT TTG CAA ACC CCT TTA ACC AAT TTA AAC GCT GAA GAA AAA 1396
Glu Glu Val Leu Gln Thr Pro Leu Thr Asn Leu Asn Ala Glu Glu Lys
445 450 455

TAC TTG CTT TTT AGC GCT TTA AGC TTG CCT AAA GCA ATT GAA TCC TCT 1444
Tyr Leu Leu Phe Ser Ala Leu Ser Leu Pro Lys Ala Ile Glu Ser Ser
460 465 470

TTT GAA GAA TAC GGC TTG CAA AAA ATG TGC GAA TAC GCA AAA ACC CTC 1492
Phe Glu Glu Tyr Gly Leu Gln Lys Met Cys Glu Tyr Ala Lys Thr Leu
475 480 485

GCA TCA GAA TTC CAC CGC TTC TAT AAC GCT GGC AAA ATC TTA GAC ACC 1540
Ala Ser Glu Phe His Arg Phe Tyr Asn Ala Gly Lys Ile Leu Asp Thr
490 495 500 505

CCT AAA GCT AAA GAG CTT TTA AAA ATT TGT TTA ATA GTA AGC TTG AGC 1588
Pro Lys Ala Lys Glu Leu Leu Lys Ile Cys Leu Ile Val Ser Leu Ser
510 515 520

TTA AGC AAC GCT TTT AAA CTT TTA GGC ATA GAG ATA AAG ACC AAA ATT 1636
Leu Ser Asn Ala Phe Lys Leu Leu Gly Ile Glu Ile Lys Thr Lys Ile
525 530 535

TCC GCT AGA GAT TAAGCCAATA TTTAATTTTT TGTTATAACA TTCCCCTTAT TTTTT 1693
Ser Ala Arg Asp
540

GAAACTAAGG AGAATATTAT 1713
(2) INFORMATION FOR SEQ ID NO: 1276: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1276:

Met His Thr Leu Ile Lys Gly Ile Leu Glu Glu Ile Leu Glu Glu Glu
1 5 10 15

Val Ile Val Glu Tyr Pro Lys Asp Arg Glu His Gly His Tyr Ala Thr
20 25 30

Pro Ile Ala Phe Asn Leu Ala Lys Val Phe Lys Lys Ser Pro Leu Ala
Pro Glu Val Phe Tyr Lys Gly Glu Tyr Ile Ile Glu Ile Ala Lys Lys
190 195 200

GCG AAC AAC GAT TTA GAA CCA AGC CTT TTA AAA GAA AAC GAA GAA ACG 676
Ala Asn Asn Asp Leu Glu Pro Ser Leu Leu Lys Glu Asn Glu Glu Thr
205 210 215

ATT ATT GAA GTT TTA AGC GGC TAT GCT AGG GAT CTA ATG CTT TTA GAA 724
Ile Ile Glu Val Leu Ser Gly Tyr Ala Arg Asp Leu Met Leu Leu Glu
220 225 230

ATT AAA GAT AAT TTA GAC GCT TTA GGC ATT CAT TTT GAT TCC TAT GCG 772
Ile Lys Asp Asn Leu Asp Ala Leu Gly Ile His Phe Asp Ser Tyr Ala
235 240 245

AGC GAA AAA GAA GTT TTT AAA CAT AAA GAT GCG GTG TTT GAA CAA TTA 820
Ser Glu Lys Glu Val Phe Lys His Lys Asp Ala Val Phe Glu Gln Leu
250 255 260 265

GAA AAA GCG AAC GCC CTT TAT GAA AAG GAT TCT AAA ATC TGG CTC AAA 868
Glu Lys Ala Asn Ala Leu Tyr Glu Lys Asp Ser Lys Ile Trp Leu Lys
270 275 280

TCT TCA CTC TAG CAG GAT GAA AGC GAT CGG GTG CTC ATT AAA GAA GAT 916
Ser Ser Leu Tyr Gln Asp Glu Ser Asp Arg Val Leu Ile Lys Glu Asp
285 290 295

AAA AGC TAC ACT TAT TTA GCG GGC GAT ATT GTC TAT CAT GAT GAA AAA 964
Lys Ser Tyr Thr Tyr Leu Ala Gly Asp Ile Val Tyr His Asp Glu Lys
300 305 310

TTC AAG CAA GAT TAT ACC AAA TAC ATC AAC ATT TGG GGG GCA GAC CAC 1012
Phe Lys Gln Asp Tyr Thr Lys Tyr Ile Asn Ile Trp Gly Ala Asp His
315 320 325

CAC GGC TAT ATC GCT AGA GTG AAA GCC AGC CTT GAG TTT TTG GGC TAT 1060
His Gly Tyr Ile Ala Arg Val Lys Ala Ser Leu Glu Phe Leu Gly Tyr
330 335 340 345

GAT TCC AAC AAG CTT GAA GTC TTG CTC GCT CAA ATG GTG CGC TTG CTC 1108
Asp Ser Asn Lys Leu Glu Val Leu Leu Ala Gln Met Val Arg Leu Leu
350 355 360

AAA GAT AAC GAG CCT TAC AAG ATG AGT AAA AGA GCG GGT AAT TTT ATT 1156
Lys Asp Asn Glu Pro Tyr Lys Met Ser Lys Arg Ala Gly Asn Phe Ile
365 370 375

TTG ATT AAA GAT GTG GTT GAT GAT GTG GGT AAG GAC GCT TTG AGG TTT 1204
Leu Ile Lys Asp Val Val Asp Asp Val Gly Lys Asp Ala Leu Arg Phe
380 385 390

ATT TTT TTG AGC AAA CGG CTT GAC ACT CAT TTA GAA TTT GAT GTC AAT 1252
Ile Phe Leu Ser Lys Arg Leu Asp Thr His Leu Glu Phe Asp Val Asn
395 400 405
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1275:

TTAAAAGCAA ACAAGAAAGT TAAGC ATG CAC ACT CTC ATT AAG GGC ATT TTA 52
Met His Thr Leu Ile Lys Gly Ile Leu
1 5

GAA GAG ATT TTA GAA GAA GAA GTC ATT GTT GAA TAG CCT AAA GAC AGA 100
Glu Glu Ile Leu Glu Glu Glu Val Ile Val Glu Tyr Pro Lys Asp Arg
10 15 20 25

GAG CAT GGG CAT TAG GCT ACG CCC ATT GCT TTC AAT CTC GCC AAA GTT 148
Glu His Gly His Tyr Ala Thr Pro Ile Ala Phe Asn Leu Ala Lys Val
30 35 40

TTT AAA AAA TCG CCC TTA GCC ATC GCT GAA GAG TTA GCC CTT AAA ATC 196
Phe Lys Lys Ser Pro Leu Ala Ile Ala Glu Glu Leu Ala Leu Lys Ile
45 50 55

AGC ACG CAT GAA AAA ACT CAA GGG CTT TTT GAC AGC GTA GTG GCT TGT 244
Ser Thr His Glu Lys Thr Gln Gly Leu Phe Asp Ser Val Val Ala Cys
60 65 70

AAG GGC TAT ATC AAT TTC ACG CTT TCT TTA GAT TTT TTG GAG CGT TTC 292
Lys Gly Tyr Ile Asn Phe Thr Leu Ser Leu Asp Phe Leu Glu Arg Phe
75 80 85

ACC CAA AAA GCT TTG GAA TTG AAA GAA AAA TTT GGC TCT CAA GTT AAA 340
Thr Gln Lys Ala Leu Glu Leu Lys Glu Lys Phe Gly Ser Gln Val Lys
90 95 100 105

AGC GAA CGT TCT CAA AAA ATC TTT TTA GAA TTT GTG AGC GCT AAC CCC 388
Ser Glu Arg Ser Gln Lys Ile Phe Leu Glu Phe Val Ser Ala Asn Pro
110 115 120

ACA GGG CCT TTA CAC ATA GGG CAT GCT AGA GGG GCG GTG TTT GGC GAT 436
Thr Gly Pro Leu His Ile Gly His Ala Arg Gly Ala Val Phe Gly Asp
125 130 135

AGT TTG GCT AAA ATC GCT CGC TTT TTA GGG CAT GAA GTT TTA TGC GAG 484
Ser Leu Ala Lys Ile Ala Arg Phe Leu Gly His Glu Val Leu Cys Glu
140 145 150

TAT TAT GTC AAT GAC ATG GGA TCT CAA ATC CGC TTG TTA GGG CTT TCT 532
Tyr Tyr Val Asn Asp Met Gly Ser Gln Ile Arg Leu Leu Gly Leu Ser
155 160 165

GTA TGG CTC GCT TAG AGA GAA CAT GTT TTA AAA GAA AGC GTA ACT TAG 580
Val Trp Leu Ala Tyr Arg Glu His Val Leu Lys Glu Ser Val Thr Tyr
170 175 180 185

CCA GAA GTC TTT TAC AAA GGC GAA TAG ATC ATT GAA ATC GCT AAA AAG 628
1 5 10 15

Lys Thr Ser Phe Leu Ser Glu Tyr Leu Asn Gln Thr Asp His Gln Gly
20 25 30

Val Ala Leu Ile Ile Asn Glu Ile Gly Gln Ala Ala Leu Asp Gln Arg
35 40 45

Ile Leu Ser Val Gln Tyr Cys Gly Glu Lys Met Leu Tyr Leu Asn Ala
50 55 60

Gly Cys Val Cys Cys Asn Lys Arg Leu Asp Leu Val Glu Ser Leu Lys
65 70 75 80

Ala Thr Leu Asn Asn Tyr Glu Trp His Gly Glu Ile Leu Arg Arg Ile
85 90 95

Ile Ile Glu Thr Thr Gly Leu Ala Asn Pro Ala Pro Ile Leu Trp Thr
100 105 110

Ile Leu Ser Asp Thr Phe Leu Gly Val His Phe Glu Ile Gln Ser Val
115 120 125

Val Ala Cys Val Asp Ala Leu Asn Ala Arg Glu His Leu Thr Asn Asn
130 135 140

Glu Ala Lys Glu Gln Ile Val Phe Ala Asp Ser Val Leu Leu Thr Lys
145 150 155 160

Thr Asp Leu Gln Asn Asp Ser Ala Ala Leu Thr Lys Leu Lys Glu Arg
165 170 175

Ile Gln Ala Leu Asn Pro Ser Ala Glu Ile Phe Asp Lys Arg Ala Ile
180 185 190

Asp Tyr Glu Ser Leu Phe Ser Arg Lys Asn Arg Ala Arg Asn Phe Met
195 200 205

Pro Arg Met Pro Lys Asp Ser His Ser Gln Gly Phe Glu Thr Leu Ser
210 215 220

Ile Asn Phe Glu Gly Thr Met Glu Trp Ser Ala Phe Gly Ile Trp Leu
225 230 235 240

Ser Leu Leu Leu His Gln Tyr Gly Thr Gln Ile Leu Arg Ile Lys Gly
245 250 255

Ile Ile Asp Ile Gly Ser Gly Phe Leu Val Ser Ile Asn Gly Val Met
260 265 270

His Val Ile Tyr Pro Pro Lys His Ile Leu Lys Asp Gln Asn Gly Ser
275 280 285

Asn Leu Val Phe Ile Met Arg His Leu Glu Arg Glu Lys Ile Leu Asn
290 295 300

Ser Leu Lys Gly Phe Lys Asp Phe Leu Gly Ile Lys Gly Phe Glu Thr
305 310 315 320

Gln



(2) INFORMATION FOR SEQ ID NO: 1275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 26...1648
GCT TTA ACA AAA CTA AAA GAG AGG ATA CAA GCC CTT AAC CCT AGT GCA 641
Ala Leu Thr Lys Leu Lys Glu Arg Ile Gln Ala Leu Asn Pro Ser Ala
170 175 180

GAA ATT TTT GAC AAG AGG GCG ATA GAG TAT GAG AGC CTC TTT TCA CGC 689
Glu Ile Phe Asp Lys Arg Ala Ile Asp Tyr Glu Ser Leu Phe Ser Arg
185 190 195 200

AAA AAT AGG GCG CGA AAT TTT ATG CCA AGA ATG CCA AAA GAT TCG CAC 737
Lys Asn Arg Ala Arg Asn Phe Met Pro Arg Met Pro Lys Asp Ser His
205 210 215

TCG CAA GGC TTT GAG ACT TTA AGC ATT AAT TTT GAA GGC ACG ATG GAG 785
Ser Gln Gly Phe Glu Thr Leu Ser Ile Asn Phe Glu Gly Thr Met Glu
220 225 230

TGG AGC GCG TTT GGG ATT TGG CTG AGT TTG TTA TTG CAT CAA TAC GGC 833
Trp Ser Ala Phe Gly Ile Trp Leu Ser Leu Leu Leu His Gln Tyr Gly
235 240 245

ACA CAG ATT TTA CGC ATC AAG GGG ATT ATT GAC ATT GGA AGC GGC TTT 881
Thr Gln Ile Leu Arg Ile Lys Gly Ile Ile Asp Ile Gly Ser Gly Phe
250 255 260

TTG GTG AGT ATT AAC GGC GTG ATG CAT GTC ATT TAC CCG CCT AAG CAT 929
Leu Val Ser Ile Asn Gly Val Met His Val Ile Tyr Pro Pro Lys His
265 270 275 280

ATT TTA AAG GAT CAA AAC GGC TCT AAC CTC GTT TTT ATC ATG CGC CAT 977
Ile Leu Lys Asp Gln Asn Gly Ser Asn Leu Val Phe Ile Met Arg His
285 290 295

TTA GAG CGT GAA AAA ATC TTA AAT TCC TTA AAG GGT TTT AAG GAT TTT 1025
Leu Glu Arg Glu Lys Ile Leu Asn Ser Leu Lys Gly Phe Lys Asp Phe
300 305 310

CTC GGC ATC AAG GGT TTT GAA ACC CAA TAATTTTTCT ATTTATGGAT AGCTGTT 1079
Leu Gly Ile Lys Gly Phe Glu Thr Gln
315 320

TGCATTTTGA TGGGGAAAAG A 1100

(2) INFORMATION FOR SEQ ID NO:1274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1274: 
Met Pro Lys Ile Pro Ile Thr Leu Ile Thr Gly Phe Leu Gly Ser Gly

WO 98/43478

PCT/US98/06371

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 90...1052
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1273:

AAGGCGCAAA ACTTAGCCAA AAAAGAGATG GACGCACTAG ATTCTCATCT GTTAGCGTTT 60

TTAAATCAAA ATGCAAATGC CATTCACTG ATG CCC AAA ATC CCT ATC ACG CTC 113
Met Pro Lys Ile Pro Ile Thr Leu
1 5

ATC ACC GGT TTT TTA GGC AGC GGT AAA ACG AGT TTT TTG AGC GAA TAT 161
Ile Thr Gly Phe Leu Gly Ser Gly Lys Thr Ser Phe Leu Ser Glu Tyr
10 15 20

TTA AAC CAA ACA GAT CAC CAA GGC GTC GCT CTT ATC ATC AAT GAA ATC 209
Leu Asn Gln Thr Asp His Gln Gly Val Ala Leu Ile Ile Asn Glu Ile
25 30 35 40

GGT CAA GCC GCT TTG GAT CAG CGC ATC TTA AGC GTT CAA TAT TGC GGT 257
Gly Gln Ala Ala Leu Asp Gln Arg Ile Leu Ser Val Gln Tyr Cys Gly
45 50 55

GAA AAA ATG CTC TAT CTT AAC GCA GGG TGC GTG TGT TGC AAC AAA CGC 305
Glu Lys Met Leu Tyr Leu Asn Ala Gly Cys Val Cys Cys Asn Lys Arg
60 65 70

TTG GAT TTA GTG GAG TCT CTA AAA GCC ACG CTC AAC AAC TAT GAA TGG 353
Leu Asp Leu Val Glu Ser Leu Lys Ala Thr Leu Asn Asn Tyr Glu Trp
75 80 85

CAC GGC GAA ATT CTA AGG CGC ATC ATC ATT GAA ACT ACC GGT TTA GCC 401
His Gly Glu Ile Leu Arg Arg Ile Ile Ile Glu Thr Thr Gly Leu Ala
90 95 100

AAC CCG GCA CCG ATT TTA TGG ACG ATT TTG AGC GAC ACT TTT TTA GGA 449
Asn Pro Ala Pro Ile Leu Trp Thr Ile Leu Ser Asp Thr Phe Leu Gly
105 110 115 120

GTG CAT TTT GAG ATT CAA AGC GTG GTG GCT TGC GTG GAT GCA TTG AAT 497
Val His Phe Glu Ile Gln Ser Val Val Ala Cys Val Asp Ala Leu Asn
125 130 135

GCT AGA GAG CAT TTA ACC AAC AAT GAA GCT AAA GAG CAA ATC GTT TTT 545
Ala Arg Glu His Leu Thr Asn Asn Glu Ala Lys Glu Gln Ile Val Phe
140 145 150

GCT GAT AGC GTT TTA TTG ACC AAA ACG GAT TTA CAA AAC GAT AGC GCG 593
Ala Asp Ser Val Leu Leu Thr Lys Thr Asp Leu Gln Asn Asp Ser Ala
155 160 165
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65 70 75 80 

Leu Lys Lys Gly Thr Ser Phe Gly Ala Pro Thr Glu Leu Glu Thr Thr
85 90 95

Leu Ala Lys Glu Ile Ile Ser Cys Tyr Glu Gly Leu Asp Lys Val Arg
100 105 110

Leu Val Ser Ser Gly Thr Glu Ala Thr Met Ser Ala Ile Arg Leu Ala
115 120 125

Arg Ala Tyr Ser Gln Lys Asp Asp Leu Ile Lys Phe Glu Gly Cys Tyr
130 135 140

His Gly His Ser Asp Ser Leu Leu Val Lys Ala Gly Ser Gly Cys Ala
145 150 155 160

Thr Phe Gly Ser Pro Ser Ser Leu Gly Val Pro Asn Asp Phe Ser Lys
165 170 175

His Thr Leu Val Ala Arg Tyr Asn Asp Leu Asn Ser Thr Glu Glu Cys
180 185 190

Phe Lys Lys Gly Asn Val Gly Cys Val Ile Ile Glu Pro Ile Ala Gly
195 200 205

Asn Met Gly Leu Val Pro Ala Gln Lys Glu Phe Leu Leu Gly Leu Lys
210 215 220

Ala Leu Cys Glu Lys Tyr Gln Ala Val Leu Ile Leu Asp Glu Val Met
225 230 235 240

Ser Gly Phe Arg Ala Ser Leu Ser Gly Ser Gln Glu Phe Tyr Gly Val
245 250 255

Val Pro Asp Leu Val Thr Phe Gly Lys Val Ile Gly Ala Gly Leu Pro
260 265 270

Leu Ala Cys Phe Gly Gly Arg Ala Glu Ile Met Asp Leu Leu Ser Pro
275 280 285

Ile Gly Ser Val Tyr Gln Ala Gly Thr Leu Ser Gly Asn Pro Leu Ala
290 295 300

Val Cys Ala Gly Leu Ser Ala Leu Tyr Lys Ile Lys Arg Asp Lys Thr
305 310 315 320

Leu Tyr Thr Arg Leu Asp Ala Leu Ala Ile Arg Leu Thr Gln Gly Leu
325 330 335

Gln Lys Ser Ala Gln Asn Tyr Asn Ile Ala Leu Glu Thr Leu Asn Met
340 345 350

Gly Ser Met Phe Gly Phe Phe Phe Asn Glu Asn Ala Val His Asp Phe
355 360 365

Asp Asp Ala Leu Lys Ser Asp Thr Glu Met Phe Ala Lys Phe His Gln
370 375 380

Lys Met Leu Phe Lys Gly Val Tyr Leu Ala Cys Ser Ser Phe Glu Thr
385 390 395 400

Gly Phe Ile Cys Glu Pro Met Thr Glu Glu Met Ile Asp Leu Thr Ile
405 410 415

Ala Lys Ala Asp Glu Ser Phe Asp Glu Ile Ile Lys Gly Val
420 425 430



(2) INFORMATION FOR SEQ ID NO: 1273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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GGG TTG AGT GCG CTT TAT AAA ATC AAA AGA GAC AAA ACC CTT TAT ACT 1017 
Gly Leu Ser Ala Leu Tyr Lys Ile Lys Arg Asp Lys Thr Leu Tyr Thr
310 315 320

CGC TTA GAC GCT TTA GCT ATT CGT TTG ACT CAA GGC TTA CAA AAG AGC 1065
Arg Leu Asp Ala Leu Ala Ile Arg Leu Thr Gln Gly Leu Gln Lys Ser
325 330 335

GCT CAA AAC TAT AAC ATC GCT TTA GAG ACG CTT AAC ATG GGG AGC ATG 1113
Ala Gln Asn Tyr Asn Ile Ala Leu Glu Thr Leu Asn Met Gly Ser Met
340 345 350 355

TTT GGC TTT TTC TTT AAC GAA AAT GCG GTG CAC GAT TTT GAT GAC GCT 1161
Phe Gly Phe Phe Phe Asn Glu Asn Ala Val His Asp Phe Asp Asp Ala
360 365 370

TTA AAA AGC GAT ACG GAG ATG TTT GCA AAA TTC CAC CAA AAA ATG CTC 1209
Leu Lys Ser Asp Thr Glu Met Phe Ala Lys Phe His Gln Lys Met Leu
375 380 385

TTT AAG GGC GTG TAT TTG GCG TGC TCA AGC TTT GAA ACC GGC TTT ATT 1257
Phe Lys Gly Val Tyr Leu Ala Cys Ser Ser Phe Glu Thr Gly Phe Ile
390 395 400

TGT GAG CCT ATG ACT GAA GAG ATG ATT GAT TTA ACG ATC GCA AAA GCC 1305
Cys Glu Pro Met Thr Glu Glu Met Ile Asp Leu Thr Ile Ala Lys Ala
405 410 415

GAT GAA AGT TTT GAT GAA ATC ATA AAA GGT GTG TGAATTTTTT GAAAAAGCCA 1358
Asp Glu Ser Phe Asp Glu Ile Ile Lys Gly Val
420 425 430

AAGTATTATA AATTCATAGA GGGGGCGAAT TATTTGAGCT TGGGGCTT 1406

(2) INFORMATION FOR SEQ ID NO:1272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1272: 

Met Glu Leu Leu His Ser Ile Asn Asp Phe Asn Glu Ala Lys Gln Val
1 5 10 15

Ile Ala Gly Gly Val Asn Ser Pro Val Arg Ala Phe Lys Ser Val Lys
20 25 30

Gly Thr Pro Pro Phe Ile Leu Lys Gly Lys Gly Ala Tyr Leu Tyr Asp
35 40 45

Val Asp Asn Asn His Tyr Ile Asp Phe Val Gln Ser Trp Gly Pro Leu
50 55 60

Ile Phe Gly His Ala Asp Glu Glu Ile Glu Glu Asn Ile Ile Asn Ala




GGC ACT TCT TTT GGC GCT CCC ACA GAA TTA GAA ACC ACT TTA GCT AAG 345
Gly Thr Ser Phe Gly Ala Pro Thr Glu Leu Glu Thr Thr Leu Ala Lys
85 90 95

GAA ATC ATT TCT TGT TAT GAA GGC TTA GAT AAG GTG CGT TTA GTC AGT 393
Glu Ile Ile Ser Cys Tyr Glu Gly Leu Asp Lys Val Arg Leu Val Ser
100 105 110 115

AGC GGC ACA GAA GCG ACC ATG AGC GCG ATA CGA CTC GCT AGA GCT TAT 441
Ser Gly Thr Glu Ala Thr Met Ser Ala Ile Arg Leu Ala Arg Ala Tyr
120 125 130

AGC CAA AAA GAT GAT TTG ATC AAG TTT GAA GGG TGC TAC CAT GGG CAT 489
Ser Gln Lys Asp Asp Leu Ile Lys Phe Glu Gly Cys Tyr His Gly His
135 140 145

AGC GAC TCC TTA TTG GTG AAA GCG GGT AGC GGG TGT GCT ACT TTT GGA 537
Ser Asp Ser Leu Leu Val Lys Ala Gly Ser Gly Cys Ala Thr Phe Gly
150 155 160

TCG CCT TCT TCT TTA GGC GTG CCG AAC GAT TTT AGC AAA CAC ACT CTA 585
Ser Pro Ser Ser Leu Gly Val Pro Asn Asp Phe Ser Lys His Thr Leu
165 170 175

GTG GCT CGT TAT AAC GAT TTA AAC TCC ACA GAA GAA TGC TTT AAA AAA 633
Val Ala Arg Tyr Asn Asp Leu Asn Ser Thr Glu Glu Cys Phe Lys Lys
180 185 190 195

GGC AAT GTG GGT TGC GTC ATC ATT GAA CCC ATT GCC GGG AAT ATG GGG 681
Gly Asn Val Gly Cys Val Ile Ile Glu Pro Ile Ala Gly Asn Met Gly
200 205 210

TTA GTG CCG GCT CAA AAA GAG TTT TTA TTG GGC TTA AAG GCC TTG TGT 729
Leu Val Pro Ala Gln Lys Glu Phe Leu Leu Gly Leu Lys Ala Leu Cys
215 220 225

GAA AAA TAC CAA GCG GTG CTG ATT TTA GAT GAA GTG ATG AGC GGT TTT 777
Glu Lys Tyr Gln Ala Val Leu Ile Leu Asp Glu Val Met Ser Gly Phe
230 235 240

AGA GCG AGC TTG AGC GGT TCG CAA GAA TTT TAT GGC GTG GTG CCG GAT 825
Arg Ala Ser Leu Ser Gly Ser Gln Glu Phe Tyr Gly Val Val Pro Asp
245 250 255

TTG GTA ACC TTT GGT AAG GTG ATA GGT GCT GGG CTT CCT TTG GCG TGT 873
Leu Val Thr Phe Gly Lys Val Ile Gly Ala Gly Leu Pro Leu Ala Cys
260 265 270 275

TTT GGA GGG CGT GCA GAA ATT ATG GAC TTG CTT TCG CCC ATT GGA AGC 921
Phe Gly Gly Arg Ala Glu Ile Met Asp Leu Leu Ser Pro Ile Gly Ser
280 285 290

GTG TAT CAA GCA GGC ACT TTG AGC GGT AAC CCC CTA GCG GTG TGC GCG 969
Val Tyr Gln Ala Gly Thr Leu Ser Gly Asn Pro Leu Ala Val Cys Ala
295 300 305
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100 105 110 

Ser Leu Thr Ala Tyr Val Arg Met Asn Glu Lys Leu Val Lys Val Pro
115 120 125

Met Gln Tyr Thr Ile Ala Glu Asp Lys Ile Val Val Lys Gly Val Leu
130 135 140

Asp Leu Leu Asn Phe Gly Leu Lys Asn Glu Leu Ala Ser Leu Ala Lys
145 150 155 160

Arg Cys Glu Ser Phe His Glu Gly Leu Thr Trp Ser Gln Val Glu Ile
165 170 175

Gln Phe Glu Ser Met Ile Lys Gly
180

(2) INFORMATION FOR SEQ ID NO: 1271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1406 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...1338
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1271: 

CAAGTGGAAA TCCAATTTGA AAGCATGATC AAGGGATAAT GTAAAATC ATG GAG TTG 57
Met Glu Leu
1

TTG CAC AGC ATT AAT GAT TTC AAT GAA GCT AAG CAG GTG ATC GCT GGG 105
Leu His Ser Ile Asn Asp Phe Asn Glu Ala Lys Gln Val Ile Ala Gly
5 10 15

GGG GTC AAT TCA CCT GTT AGG GCG TTT AAG AGC GTT AAA GGC ACT CCC 153
Gly Val Asn Ser Pro Val Arg Ala Phe Lys Ser Val Lys Gly Thr Pro
20 25 30 35

CCC TTT ATT TTA AAA GGC AAG GGG GCG TAT CTT TAT GAT GTG GAT AAC 201
Pro Phe Ile Leu Lys Gly Lys Gly Ala Tyr Leu Tyr Asp Val Asp Asn
40 45 50

AAC CAT TAT ATA GAT TTT GTG CAA AGC TGG GGG CCT TTG ATT TTT GGG 249
Asn His Tyr Ile Asp Phe Val Gln Ser Trp Gly Pro Leu Ile Phe Gly
55 60 65

CAT GCT GAT GAA GAG ATT GAA GAA AAT ATT ATT AAT GCA TTA AAA AAA 297
His Ala Asp Glu Glu Ile Glu Glu Asn Ile Ile Asn Ala Leu Lys Lys
70 75 80
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GGC GAT GAA TTG AAA AAC AAA AAT GTG AAA GAA GCT TTT TTC GCT CTT 2 91 

Gly Asp Glu Leu Lys Asn Lys Asn Val Lys Glu Ala Phe Phe Ala Leu
80 85 90

TTT AAA AAC ACT AAC ATC AAA GTA ACT TTC AGG AAT GTG ATA GAA GGC 339
Phe Lys Asn Thr Asn Ile Lys Val Thr Phe Arg Asn Val Ile Glu Gly
95 100 105

GAT CAT GCA GGT TCT CTT ACG GCT TAT GTG AGA ATG AAT GAA AAG CTG 387
Asp His Ala Gly Ser Leu Thr Ala Tyr Val Arg Met Asn Glu Lys Leu
110 115 120

GTG AAA GTG CCT ATG CAA TAC ACG ATT GCT GAG GAT AAG ATC GTG GTT 435
Val Lys Val Pro Met Gln Tyr Thr Ile Ala Glu Asp Lys Ile Val Val
125 130 135 140

AAA GGG GTT TTG GAT TTA TTG AAT TTT GGC TTG AAA AAC GAA TTA GCG 483
Lys Gly Val Leu Asp Leu Leu Asn Phe Gly Leu Lys Asn Glu Leu Ala
145 150 155

AGC TTG GCC AAA CGA TGC GAA AGC TTT CAT GAG GGC TTG ACT TGG TCG 531
Ser Leu Ala Lys Arg Cys Glu Ser Phe His Glu Gly Leu Thr Trp Ser
160 165 170

CAA GTG GAA ATC CAA TTT GAA AGC ATG ATC AAG GGA TAATGTAAAA TCATGG 583
Gln Val Glu Ile Gln Phe Glu Ser Met Ile Lys Gly
175 180

AGTTGTTGCA CAGCATTAAT GATTTCAATG AAGCTAAG 621

(2) INFORMATION FOR SEQ ID NO: 1270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1270:

Met Lys Lys Met Val Leu Val Ser Val Leu Leu Ala Gly Phe Leu Gln
1 5 10 15

Ala Val Asn Leu Asp Leu Ser Ser Ala Lys Leu Thr Trp Thr Ala Phe
20 25 30

Lys Thr Lys Ala Lys Thr Pro Val Asn Gly Ser Phe Glu Ser Ile Thr
35 40 45

Tyr Lys Leu Gly Lys Ser Gln Asp Ser Leu Lys Thr Leu Leu Glu Gly
50 55 60

Ala Ser Ala Ser Met Asp Ser Leu Lys Val Asn Leu Gly Asp Glu Leu
65 70 75 80

Lys Asn Lys Asn Val Lys Glu Ala Phe Phe Ala Leu Phe Lys Asn Thr
85 90 95

Asn Ile Lys Val Thr Phe Arg Asn Val Ile Glu Gly Asp His Ala Gly
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225 230 235 240 

Arg Leu Glu Leu Glu Lys Phe Ser Ser Ala Leu Ala Asn Lys Pro Phe
245 250 255

Gly Val Leu Leu Asn Lys Cys Asp Val Val Glu Asn Ile Asp Glu Met
260 265 270

Thr Lys Asp Phe Cys Ala Phe Leu Asn Leu Gly Ala Gln Lys Leu Asn
275 280 285

Glu Phe Gly Leu Glu Pro Tyr Leu Gly Phe Leu His Pro His Leu Thr
290 295 300

Asn Asp Phe Glu Asn Asn Pro Asn Glu Gln Ser Ala Leu Phe Val Leu
305 310 315 320

Pro Leu Ser Ala Val Ser Ala Leu Asn Val His Ala Leu Lys Phe Val
325 330 335

Leu Leu Glu Ala Leu Pro
340

(2) INFORMATION FOR SEQ ID NO: 1269:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16...567
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1269: 

ATTAAAGGAT AATGA ATG AAA AAA ATG GTT TTG GTA TCG GTT TTA CTA GCA 51
Met Lys Lys Met Val Leu Val Ser Val Leu Leu Ala
1 5 10

GGG TTT TTG CAA GCG GTG AAT TTG GAT TTA TCT TCG GCT AAG CTA ACA 99
Gly Phe Leu Gln Ala Val Asn Leu Asp Leu Ser Ser Ala Lys Leu Thr
15 20 25

TGG ACA GCC TTT AAA ACT AAG GCT AAA ACA CCA GTA AAT GGG ACT TTT 147
Trp Thr Ala Phe Lys Thr Lys Ala Lys Thr Pro Val Asn Gly Ser Phe
30 35 40

GAA AGC ATC ACC TAT AAA TTG GGT AAA TCT CAA GAT AGT TTA AAA ACC 195
Glu Ser Ile Thr Tyr Lys Leu Gly Lys Ser Gln Asp Ser Leu Lys Thr
45 50 55 60

CTT TTA GAG GGA GCG AGC GCG AGC ATG GAT AGC TTG AAA GTC AAT TTA 243
Leu Leu Glu Gly Ala Ser Ala Ser Met Asp Ser Leu Lys Val Asn Leu
65 70 75
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AAT GAT TTT GAA AAT AAC CCT AAT GAG CAA TCA GCG CTC TTT GTC TTG 10 21 

Asn Asp Phe Glu Asn Asn Pro Asn Glu Gln Ser Ala Leu Phe Val Leu
305 310 315 320

CCC CTT TCA GCG GTT AGC GCT CTT AAT GTG CAT GCA CTC AAA TTT GTG 1069
Pro Leu Ser Ala Val Ser Ala Leu Asn Val His Ala Leu Lys Phe Val
325 330 335

TTG TTG GAA GCG TTA CCC TAAAACGCTA TTTTTAAAAT AATCCATTAA AATAAAGG 1125
Leu Leu Glu Ala Leu Pro
340

CGAGGAATGA AAAGAT 1141
(2) INFORMATION FOR SEQ ID NO: 1268:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 342 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1268:

Met Val Ser Phe Arg Arg Glu Lys Phe Val Ile Lys Gly Gly Pro Asp
1 5 10 15

Gly Gly Asp Gly Gly Asp Gly Gly Asp Val Tyr Phe Glu Val Asp Asn
20 25 30

Asn Thr Asp Thr Leu Ala Ser Phe Arg Gly Thr Lys His His Lys Ala
35 40 45

Lys Asn Gly Ala Pro Gly Gly Thr Arg Asn Cys Ala Gly Lys Lys Gly
50 55 60

Glu Asp Lys Ile Ile Val Val Pro Pro Gly Thr Gln Val Phe Val Gly
65 70 75 80

Asp Glu Leu Trp Leu Asp Leu Val Glu Pro Lys Glu Arg Val Leu Ala
85 90 95

Leu Lys Gly Gly Lys Gly Gly Leu Gly Asn Ala His Phe Lys Ser Ala
100 105 110

Thr Lys Gln Gln Pro Thr Tyr Ala Gln Lys Gly Leu Glu Gly Val Glu
115 120 125

Lys Cys Val Arg Leu Glu Leu Lys Leu Ile Ala Asp Ile Gly Leu Val
130 135 140

Gly Phe Pro Asn Ala Gly Lys Ser Thr Leu Ile Ser Thr Ile Ser Asn
145 150 155 160

Ala Lys Pro Lys Ile Ala Asn Tyr Glu Phe Thr Thr Leu Val Pro Asn
165 170 175

Leu Gly Val Val Ser Val Asp Glu Lys Ser Gly Phe Leu Met Ala Asp
180 185 190

Ile Pro Gly Ile Ile Glu Gly Ala Ser Glu Gly Lys Gly Leu Gly Ile
195 200 205

Ser Phe Leu Lys His Ile Glu Arg Thr Lys Val Leu Ala Phe Val Leu
210 215 220

Asp Ala Ser Arg Leu Asp Leu Gly Ile Lys Glu Gln Tyr Gln Arg Leu
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GAT GAG TTG TGG CTT GAT TTA GTG GAA CCT AAA GAA AGG GTG TTA GCC 34 9 

Asp Glu Leu Trp Leu Asp Leu Val Glu Pro Lys Glu Arg Val Leu Ala
85 90 95

TTA AAA GGG GGC AAG GGG GGG TTA GGG AAT GCA CAT TTT AAA AGC GCG 397
Leu Lys Gly Gly Lys Gly Gly Leu Gly Asn Ala His Phe Lys Ser Ala
100 105 110

ACT AAA CAA CAA CCC ACT TAG GCG CAA AAA GGC TTA GAG GGG GTT GAA 445
Thr Lys Gln Gln Pro Thr Tyr Ala Gln Lys Gly Leu Glu Gly Val Glu
115 120 125

AAA TGC GTG CGT TTG GAA TTA AAA CTC ATC GCT GAT ATA GGG TTA GTG 493
Lys Cys Val Arg Leu Glu Leu Lys Leu Ile Ala Asp Ile Gly Leu Val
130 135 140

GGC TTC CCT AAT GCG GGT AAA TCC ACG CTC ATT TCC ACC ATC TCT AAC 541
Gly Phe Pro Asn Ala Gly Lys Ser Thr Leu Ile Ser Thr Ile Ser Asn
145 150 155 160

GCT AAG CCT AAA ATC GCT AAC TAT GAA TTT ACG ACT CTA GTG CCT AAT 589
Ala Lys Pro Lys Ile Ala Asn Tyr Glu Phe Thr Thr Leu Val Pro Asn
165 170 175

TTA GGG GTT GTG AGC GTG GAT GAA AAA AGC GGA TTT CTA ATG GCG GAT 637
Leu Gly Val Val Ser Val Asp Glu Lys Ser Gly Phe Leu Met Ala Asp
180 185 190

ATT CCT GGC ATT ATT GAA GGG GCT AGC GAG GGA AAG GGC TTA GGG ATT 685
Ile Pro Gly Ile Ile Glu Gly Ala Ser Glu Gly Lys Gly Leu Gly Ile
195 200 205

AGC TTT TTA AAG CAT ATT GAA CGC ACC AAA GTT CTA GCT TTT GTT TTA 733
Ser Phe Leu Lys His Ile Glu Arg Thr Lys Val Leu Ala Phe Val Leu
210 215 220

GAG GCT TCC AGG CTG GAT TTA GGC ATT AAA GAG CAA TAG CAA CGC TTG 781
Asp Ala Ser Arg Leu Asp Leu Gly Ile Lys Glu Gln Tyr Gln Arg Leu
225 230 235 240

AGG TTG GAG TTG GAA AAA TTT TCA TCC GCT TTG GCC AAT AAG CCT TTT 829
Arg Leu Glu Leu Glu Lys Phe Ser Ser Ala Leu Ala Asn Lys Pro Phe
245 250 255

GGG GTG TTG CTC AAT AAA TGC GAT GTT GTA GAA AAC ATT GAT GAG ATG 877
Gly Val Leu Leu Asn Lys Cys Asp Val Val Glu Asn Ile Asp Glu Met
260 265 270

ACT AAG GAT TTT TGT GCC TTT TTA AAT TTG GGA GCG GAG AAA TTA AAC 925
Thr Lys Asp Phe Cys Ala Phe Leu Asn Leu Gly Ala Gln Lys Leu Asn
275 280 285

GAG TTT GGT TTA GAG CCG TAT TTA GGG TTT TTG CAC CCC CAT TTA ACC 973
Glu Phe Gly Leu Glu Pro Tyr Leu Gly Phe Leu His Pro His Leu Thr
290 295 300
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145 150 155 160 

Ile Gly Ile Ala Arg Ala Leu Ile Leu His Pro Ser Val Val Ile Cys
165 170 175

Asp Glu Pro Val Ser Ala Leu Asp Val Ser Ile Gln Ala Gln Val Leu
180 185 190

Asn Leu Leu Leu Asp Leu Gln Lys Glu Met Gly Leu Thr Tyr Ile Phe
195 200 205

Ile Ser His Asp Leu Gly Val Val Glu His Ile Ser Asp Lys Ile Ile
210 215 220

Val Met Asn Gln Gly Gln Ile Val Glu Thr Gly Asp Val Asp Ser Val
225 230 235 240

Ile Ser Ala Pro Lys His Pro Tyr Thr Gln Lys Leu Leu Asn Ala Val
245 250 255

Pro His Leu Glu Lys Ser Met Gln Arg Phe Ala Lys
260 265

(2) INFORMATION FOR SEQ ID NO: 1267:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 62...1087
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1267:

TTAAGCTGTG TTTGTAGATA GCGTGGAAAT TATCATCGCT TCGGGTAAGG GGGGGCCTGG 60

A ATG GTG ACT TTT AGG CGA GAA AAA TTT GTC ATC AAA GGA GGC CCT GAT 109
Met Val Ser Phe Arg Arg Glu Lys Phe Val Ile Lys Gly Gly Pro Asp
1 5 10 15

GGG GGC GAT GGA GGC GAT GGA GGC GAT GTG TAT TTT GAA GTG GAT AAC 157
Gly Gly Asp Gly Gly Asp Gly Gly Asp Val Tyr Phe Glu Val Asp Asn
20 25 30

AAT ACC GAC ACT CTA GCG AGT TTT AGA GGC ACC AAA CAC CAT AAG GCT 205
Asn Thr Asp Thr Leu Ala Ser Phe Arg Gly Thr Lys His His Lys Ala
35 40 45

AAA AAT GGG GCT CCA GGA GGT ACA CGA AAT TGC GCG GGC AAA AAG GGC 253
Lys Asn Gly Ala Pro Gly Gly Thr Arg Asn Cys Ala Gly Lys Lys Gly
50 55 60

GAA GAC AAG ATC ATT GTC GTG CCA CCA GGA ACG CAG GTT TTT GTA GGT 301
Glu Asp Lys Ile Ile Val Val Pro Pro Gly Thr Gln Val Phe Val Gly
65 70 75 80
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185 190 

CAA AAA GAA ATG GGG CTG ACT TAT
Gln Lys Glu Met Gly Leu Thr Tyr
200 205

GTG GTG GAG CAT ATA AGC GAT AAA
Val Val Glu His Ile Ser Asp Lys
215 220

ATC GTA GAA ACG GGG GAT GTG GAT
Ile Val Glu Thr Gly Asp Val Asp
235

CCT TAT ACG CAG AAA TTA CTC AAT
Pro Tyr Thr Gln Lys Leu Leu Asn
250

ATG CAA AGA TTT GCC AAA TAAAAGAAAG GATTTTTAAG CTGTGTTTGT AGATAGCG
Met Gln Arg Phe Ala Lys
265

TGGAAATTAT CATCGC 

(2) INFORMATION FOR SEQ ID NO: 1266:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 268 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1266:



Met Lys Leu Leu Glu Ile Lys Glu Leu Lys Lys Ser Tyr Ala Ile Asp
1 5 10 15

Arg Gly Leu Phe Lys Pro Lys Arg Val Ile His Ala Leu Asn Gly Ile
20 25 30

Ser Phe Glu Val Glu Gln Asn Glu Val Leu Ser Ile Val Gly Glu Ser
35 40 45

Gly Cys Gly Lys Ser Thr Thr Ala Lys Ile Leu Ala Gly Ile Glu Arg
50 55 60

Gln Asp Ser Gly Ala Ile Tyr Phe Asn Gly Lys Arg His Leu His Phe
65 70 75 80

Ser Lys Gln Asp Trp Phe Asp Tyr Arg Lys Lys Val Gln Met Ile Phe
85 90 95

Gln Asp Pro Tyr Ser Ser Leu Asn Pro Arg Trp Lys Val Gly Glu Ile
100 105 110

Ile Ala Glu Pro Leu Leu Leu Asn Ser His Phe Ser Lys Lys Glu Ile
115 120 125

Lys Thr Lys Val Leu Glu Ile Met Gln Lys Val Gly Leu Lys Leu Glu
130 135 140

Trp Ile Asp Arg Tyr Pro His Gln Phe Ser Gly Gly Gln Arg Gln Arg
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195 



ATT TTT ATC AGC CAT GAT TTA GGG 678
Ile Phe Ile Ser His Asp Leu Gly
210

ATC ATC GTA ATG AAT CAG GGG CAA 726
Ile Ile Val Met Asn Gln Gly Gln
225 230

AGC GTG ATA AGC GCT CCA AAG CAC 774
Ser Val Ile Ser Ala Pro Lys His
240 245

GCG GTG CCG CAT TTG GAA AAA TCC 822
Ala Val Pro His Leu Glu Lys Ser
255 260
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 65: 

AAATGTGGAT TATTTGAGTT TTCAAAAGGA GTTGCG ATG AAG CTC TTA GAA ATT 54
Met Lys Leu Leu Glu Ile
1 5

AAA GAA TTG AAA AAA TCC TAT GCG ATA GAC AGG GGG TTA TTC AAG CCT 102
Lys Glu Leu Lys Lys Ser Tyr Ala Ile Asp Arg Gly Leu Phe Lys Pro
10 15 20

AAA AGA GTG ATC CAT GCG CTC AAT GGG ATC AGT TTT GAA GTG GAA CAA 150
Lys Arg Val Ile His Ala Leu Asn Gly Ile Ser Phe Glu Val Glu Gln
25 30 35

AAT GAA GTT TTG AGC ATT GTG GGG GAG AGC GGT TGC GGG AAA AGC ACG 198
Asn Glu Val Leu Ser Ile Val Gly Glu Ser Gly Cys Gly Lys Ser Thr
40 45 50

ACA GCC AAA ATT TTA GCC GGG ATT GAA AGG CAA GAT AGC GGG GCG ATT 246
Thr Ala Lys Ile Leu Ala Gly Ile Glu Arg Gln Asp Ser Gly Ala Ile
55 60 65 70

TAT TTC AAT GGT AAG CGC CAT TTG CAT TTT AGC AAA CAG GAT TGG TTT 294
Tyr Phe Asn Gly Lys Arg His Leu His Phe Ser Lys Gln Asp Trp Phe
75 80 85

GAT TAG CGC AAA AAG GTG CAA ATG ATT TTT CAA GAC CCT TAT TCT AGC 342
Asp Tyr Arg Lys Lys Val Gln Met Ile Phe Gln Asp Pro Tyr Ser Ser
90 95 100

CTA AAC CCT CGG TGG AAA GTG GGC GAG ATC ATC GCT GAA CCC TTG CTT 390
Leu Asn Pro Arg Trp Lys Val Gly Glu Ile Ile Ala Glu Pro Leu Leu
105 110 115

TTA AAT TCT CAT TTT TCA AAA AAA GAA ATC AAA ACA AAA GTG CTA GAG 438
Leu Asn Ser His Phe Ser Lys Lys Glu Ile Lys Thr Lys Val Leu Glu
120 125 130

ATC ATG CAA AAA GTG GGC TTG AAA TTA GAA TGG ATC GAT CGT TAG CCC 486
Ile Met Gln Lys Val Gly Leu Lys Leu Glu Trp Ile Asp Arg Tyr Pro
135 140 145 150

CAC CAA TTT TCA GGC GGT CAA AGG CAA CGA ATC GGC ATT GCT AGG GCG 534
His Gln Phe Ser Gly Gly Gln Arg Gln Arg Ile Gly Ile Ala Arg Ala
155 160 165

CTC ATT TTG CAT CCT AGC GTG GTG ATT TGC GAT GAG CCT GTG TCT GCG 582
Leu Ile Leu His Pro Ser Val Val Ile Cys Asp Glu Pro Val Ser Ala
170 175 180

CTA GAC GTG TCC ATT CAA GCG CAA GTG TTG AAT TTG CTC TTG GAT TTG 630
Leu Asp Val Ser Ile Gln Ala Gln Val Leu Asn Leu Leu Leu Asp Leu
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1264: 

Met Ile Leu Glu Val Lys Asp Leu Lys Thr Tyr Phe Phe Thr Asp Lys
1 5 10 15

Gly Val Asn Lys Ala Val Asp Gly Val Ser Phe Gly Leu Lys Lys Ser
20 25 30

Gln Thr Leu Cys Ile Val Gly Glu Ser Gly Ser Gly Lys Ser Ile Thr
35 40 45

Ser Leu Ser Ile Leu Gly Leu Ile Glu Lys Pro Gly Gln Ile Val Gly
50 55 60

Gly Ser Ile Gln Phe Leu Gly Gln Asp Leu Leu Gln Leu Lys Glu Lys
65 70 75 80

Gln Met Gln Lys Glu Ile Arg Gly Lys Lys Ile Gly Met Ile Phe Gln
85 90 95

Glu Pro Met Thr Ser Leu Asn Pro Ser Tyr Thr Val Gly Phe Gln Ile
100 105 110

Asn Glu Val Leu Lys Ile His His Pro Asn Leu Asn Lys Lys Glu Arg
115 120 125

Leu Glu Arg Val Val Tyr Glu Leu Glu Arg Val Gly Ile Pro His Ala
130 135 140

Gly Asp Lys Tyr His Glu Tyr Pro Phe Asn Leu Ser Gly Gly Gln Arg
145 150 155 160

Gln Arg Val Met Ile Ala Met Ala Met Val Cys Glu Pro Glu Ile Leu
165 170 175

Ile Ala Asp Glu Pro Thr Thr Ala Leu Asp Val Thr Ile Gln Ala Gln
180 185 190

Ile Leu Glu Leu Met Lys Glu Leu Gln Gln Lys Lys Gly Thr Ser Ile
195 200 205

Leu Phe Ile Thr His Asp Leu Gly Val Val Ala Gln Ile Ala Asp Glu
210 215 220

Val Val Val Met Tyr Lys Gly His Val Val Glu Gln Ala Ser Ala Lys
225 230 235 240

Glu Leu Phe Ala Asp Pro Arg His Pro Tyr Thr Lys Ala Leu Leu Ser
245 250 255

Ala Ile Pro Lys Pro Gly Lys Glu Tyr Arg Lys Lys Arg Leu Glu Thr
260 265 270

Val Asp Glu Asn Val Asp Tyr Leu Ser Phe Gln Lys Glu Leu Arg
275 280 285

(2) INFORMATION FOR SEQ ID NO: 1265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37...840
(D) OTHER INFORMATION: 
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105 110 115 

CAC CCT AAC CTC AAT AAA AAA GAA CGC TTA GAA AGG GTG GTT TAT GAA 436
His Pro Asn Leu Asn Lys Lys Glu Arg Leu Glu Arg Val Val Tyr Glu
120 125 130 135

TTA GAG CGT GTG GGC ATT CCC CAT GCA GGG GAT AAA TAG CAC GAA TAC 484
Leu Glu Arg Val Gly Ile Pro His Ala Gly Asp Lys Tyr His Glu Tyr
140 145 150

CCT TTC AAT CTC AGC GGG GGG CAG CGC CAA AGG GTG ATG ATC GCT ATG 532
Pro Phe Asn Leu Ser Gly Gly Gln Arg Gln Arg Val Met Ile Ala Met
155 160 165

GCT ATG GTG TGT GAG CCT GAA ATC TTG ATC GCT GAT GAG CCT ACG ACA 580
Ala Met Val Cys Glu Pro Glu Ile Leu Ile Ala Asp Glu Pro Thr Thr
170 175 180

GCC TTA GAT GTA ACC ATT CAA GCG CAA ATT TTA GAA TTG ATG AAA GAA 628
Ala Leu Asp Val Thr Ile Gln Ala Gln Ile Leu Glu Leu Met Lys Glu
185 190 195

TTG CAA CAA AAA AAA GGC ACT TCT ATT TTG TTT ATC ACC CAT GAT TTA 676
Leu Gln Gln Lys Lys Gly Thr Ser Ile Leu Phe Ile Thr His Asp Leu
200 205 210 215

GGC GTG GTG GCG CAA ATC GCT GAT GAA GTG GTG GTG ATG TAT AAA GGG 724
Gly Val Val Ala Gln Ile Ala Asp Glu Val Val Val Met Tyr Lys Gly
220 225 230

CAT GTG GTG GAG CAA GCG AGT GCG AAA GAG CTT TTT GCT GAT CCA AGA 772
His Val Val Glu Gln Ala Ser Ala Lys Glu Leu Phe Ala Asp Pro Arg
235 240 245

CAC CCT TAT ACG AAA GCT CTT TTA AGC GCG ATC CCT AAA CCG GGC AAA 820
His Pro Tyr Thr Lys Ala Leu Leu Ser Ala Ile Pro Lys Pro Gly Lys
250 255 260

GAA TAC CGC AAA AAA CGC TTA GAG ACC GTG GAT GAA AAT GTG GAT TAT 868
Glu Tyr Arg Lys Lys Arg Leu Glu Thr Val Asp Glu Asn Val Asp Tyr
265 270 275

TTG AGT TTT CAA AAG GAG TTG CGA TGAAGCTCTT AGAAATTAAA GAATTGAAAA 922
Leu Ser Phe Gln Lys Glu Leu Arg
280 285

AATCCTATGC GATAGACAGG GGGTTATTCA AGC 955
(2) INFORMATION FOR SEQ ID NO:1264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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290 295 300 

Met Ser Leu Ile Ile Ala Met Met Tyr Ile Gly Ala Asn Leu Leu Val
305 310 315 320

Asp Ile Leu Tyr Ala Phe Ile Asp Pro Arg Ile Arg Leu Ser
325 330



(2) INFORMATION FOR SEQ ID NO: 1263:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 32... 892 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1263: 

ATCCTAAACG CACCTCTTAA AAGGAGCTTG C ATG ATT TTA GAA GTT AAA GAT 52
Met Ile Leu Glu Val Lys Asp
1 5

TTA AAA ACT TAT TTT TTC ACC GAT AAG GGC GTG AAT AAA GCA GTG GAT 100
Leu Lys Thr Tyr Phe Phe Thr Asp Lys Gly Val Asn Lys Ala Val Asp
10 15 20

GGT GTG AGT TTT GGT TTG AAA AAG TCT CAA ACG CTC TGC ATT GTA GGG 148
Gly Val Ser Phe Gly Leu Lys Lys Ser Gln Thr Leu Cys Ile Val Gly
25 30 35

GAG AGC GGG AGC GGG AAA AGC ATC ACT TCG CTC TCT ATT TTA GGG TTG 196
Glu Ser Gly Ser Gly Lys Ser Ile Thr Ser Leu Ser Ile Leu Gly Leu
40 45 50 55

ATT GAA AAA CCG GGT CAA ATT GTG GGA GGG AGC ATT CAA TTT TTA GGG 244
Ile Glu Lys Pro Gly Gln Ile Val Gly Gly Ser Ile Gln Phe Leu Gly
60 65 70

CAG GAT TTG TTG CAA CTC AAA GAA AAG CAG ATG CAA AAA GAA ATT AGG 292
Gln Asp Leu Leu Gln Leu Lys Glu Lys Gln Met Gln Lys Glu Ile Arg
75 80 85

GGT AAA AAA ATT GGC ATG ATC TTT CAA GAG CCT ATG ACA AGC CTA AAC 340
Gly Lys Lys Ile Gly Met Ile Phe Gln Glu Pro Met Thr Ser Leu Asn
90 95 100

CCT TCC TAC ACG GTG GGG TTT CAA ATC AAT GAA GTG TTG AAA ATC CAC 388
Pro Ser Tyr Thr Val Gly Phe Gln Ile Asn Glu Val Leu Lys Ile His
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TAC GCT TTT ATT GAT CCT AGA ATA AGG TTG TCA TAATGGAGTC TTTTAGAGAG 107 0 
Tyr Ala Phe Ile Asp Pro Arg Ile Arg Leu Ser



(2) INFORMATION FOR SEQ ID NO:1262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1262: 



Met Leu Ser Phe Ile Ile Lys Arg Ile Leu Trp Ala Ile Pro Thr Leu
1 5 10 15

Phe Gly Val Ser Ile Ile Val Phe Met Met Val His Leu Val Pro Gly
20 25 30

Asp Pro Ala Leu Val Ile Leu Gly Glu Lys Ala Asn Gln Ala Ala Ile
35 40 45

Asp Ala Leu Arg Glu Gln Phe Gly Leu Asn Lys Pro Leu Ile Glu Gln
50 55 60

Tyr Phe Phe Phe Ile Asn Asn Val Leu His Gly Asn Phe Gly Thr Ser
65 70 75 80

Ile Met Thr Gly Glu Pro Val Met His Glu Phe Trp Gln Arg Phe Pro
85 90 95

Ala Thr Val Glu Leu Ala Leu Ile Ala Leu Phe Met Ala Leu Val Leu
100 105 110

Gly Ile Ser Val Gly Val Leu Ala Ala Ile Lys Arg Tyr Ser Val Phe
115 120 125

Asp Tyr Ser Ser Met Thr Phe Ala Leu Ala Gly Ile Ser Met Pro Val
130 135 140

Phe Trp Leu Gly Leu Met Leu Ile Tyr Ile Phe Ser Val Gln Leu Gly
145 150 155 160

Trp Leu Pro Val Phe Gly Arg Leu Ser Asp Val Tyr Tyr Leu Asp Gly
165 170 175

Pro Thr Gly Leu Tyr Leu Ile Asp Ser Leu Ile Ala Arg Asp Tyr Gly
180 185 190

Ala Phe Met Asp Thr Ile Lys His Leu Ile Leu Pro Ser Ile Val Leu
195 200 205

Ala Thr Val Ser Thr Ala Val Ile Ala Arg Met Thr Arg Ala Ser Met
210 215 220

Ala Glu Val Ser Lys Glu Asp Tyr Val Arg Thr Ala Lys Ala Lys Gly
225 230 235 240

Cys Ser Ser Phe Arg Val Ile Phe Val His Thr Leu Arg Asn Ala Leu
245 250 255

Ile Pro Val Thr Thr Ile Ala Gly Leu Met Leu Ala Gly Leu Leu Gly
260 265 270

Gly Ser Met Ile Thr Glu Thr Val Phe Ser Trp Pro Gly Ile Gly Lys
275 280 285

Trp Ile Val Asn Ala Leu Asn Gln Arg Asp Phe Pro Ile Ile Gln Ser



325 330

TTTATCCAAC 1080
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Glu Leu Ala Leu lie Ala Leu Phe Met Ala Leu Val Leu Gly lie Ser 
100 105 110 115 

GTT GGC GTG TTA GCT GCG ATC AAA CGC TAT AGC GTG TTT GAT TAT TCC 441
Val Gly Val Leu Ala Ala Ile Lys Arg Tyr Ser Val Phe Asp Tyr Ser
120 125 130

AGC ATG ACT TTT GCT TTA GCC GGG ATT TCT ATG CCG GTG TTT TGG CTA 489
Ser Met Thr Phe Ala Leu Ala Gly Ile Ser Met Pro Val Phe Trp Leu
135 140 145

GGG CTC ATG GTG ATT TAT ATC TTT AGC GTG CAA TTG GGG TGG TTG CCT 537
Gly Leu Met Leu Ile Tyr Ile Phe Ser Val Gln Leu Gly Trp Leu Pro
150 155 160

GTT TTT GGG CGT TTG AGC GAT GTG TAT TAT TTA GAT GGC CCC ACA GGT 585
Val Phe Gly Arg Leu Ser Asp Val Tyr Tyr Leu Asp Gly Pro Thr Gly
165 170 175

CTT TAT TTG ATA GAC AGC CTG ATC GCA AGG GAT TAT GGG GCG TTT ATG 633
Leu Tyr Leu Ile Asp Ser Leu Ile Ala Arg Asp Tyr Gly Ala Phe Met
180 185 190 195

GAT ACG ATC AAG CAC TTG ATT TTG CCT AGC ATT GTG TTA GCC ACG GTT 681
Asp Thr Ile Lys His Leu Ile Leu Pro Ser Ile Val Leu Ala Thr Val
200 205 210

TCT ACC GCT GTT ATT GCC AGA ATG ACT CGC GCG AGC ATG GCA GAA GTG 729
Ser Thr Ala Val Ile Ala Arg Met Thr Arg Ala Ser Met Ala Glu Val
215 220 225

TCT AAA GAA GAT TAT GTG CGT ACC GCT AAA GCT AAG GGG TGT AGC TCC 777
Ser Lys Glu Asp Tyr Val Arg Thr Ala Lys Ala Lys Gly Cys Ser Ser
230 235 240

TTT AGG GTG ATT TTT GTG CAC ACT TTG CGT AAT GCT TTA ATC CCT GTA 825
Phe Arg Val Ile Phe Val His Thr Leu Arg Asn Ala Leu Ile Pro Val
245 250 255

ACG ACT ATC GCA GGC TTG ATG TTG GCC GGG CTT TTA GGG GGG AGC ATG 873
Thr Thr Ile Ala Gly Leu Met Leu Ala Gly Leu Leu Gly Gly Ser Met
260 265 270 275

ATA ACT GAA ACG GTT TTC TCA TGG CCT GGG ATT GGT AAG TGG ATT GTT 921
Ile Thr Glu Thr Val Phe Ser Trp Pro Gly Ile Gly Lys Trp Ile Val
280 285 290

AAT GCG CTC AAC CAG CGC GAT TTC CCG ATT ATC CAG TCC ATG TCT TTG 969
Asn Ala Leu Asn Gln Arg Asp Phe Pro Ile Ile Gln Ser Met Ser Leu
295 300 305

ATT ATT GCC ATG ATG TAT ATT GGG GCT AAT CTC TTA GTG GAT ATT TTA 1017
Ile Ile Ala Met Met Tyr Ile Gly Ala Asn Leu Leu Val Asp Ile Leu
310 315 320
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Tyr Val Pro Leu Ala Tyr Pro Tyr Ser Val Val Pro His Leu Ser Lys 

515 520 525 

Val Lys Gly Tyr Lys Thr Thr Gly Val Ser Val Asn Arg Phe Phe Lys 

530 535 540 

Val Tyr Leu Glu Lys 
545 

(2) INFORMATION FOR SEQ ID NO: 1261:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1080 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 49...1050
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1261:

GCGTGAATCG CTTCTTTAAG GTGTATTTAG AAAAATAAAA GGGGTTGC ATG CTG AGT 57

Met Leu Ser
1

TTT ATC ATT AAG CGT ATT TTG TGG GCG ATC CCC ACG CTG TTT GGA GTG 105
Phe Ile Ile Lys Arg Ile Leu Trp Ala Ile Pro Thr Leu Phe Gly Val
5 10 15

AGT ATC ATT GTG TTT ATG ATG GTG CAT TTA GTG CCA GGA GAT CCG GCA 153
Ser Ile Ile Val Phe Met Met Val His Leu Val Pro Gly Asp Pro Ala
20 25 30 35

TTA GTG ATT TTA GGT GAA AAG GCC AAT CAA GCC GCT ATT GAT GCT TTA 201

Leu Val Ile Leu Gly Glu Lys Ala Asn Gln Ala Ala Ile Asp Ala Leu
40 45 50

AGA GAG CAA TTT GGA TTG AAT AAG CCC TTG ATA GAG CAG TAT TTT TTC 249

Arg Glu Gln Phe Gly Leu Asn Lys Pro Leu Ile Glu Gln Tyr Phe Phe
55 60 65

TTT ATC AAT AAT GTG TTG CAT GGC AAT TTT GGC ACT TCT ATC ATG ACC 297

Phe Ile Asn Asn Val Leu His Gly Asn Phe Gly Thr Ser Ile Met Thr
70 75 80

GGT GAG CCT GTG ATG CAT GAG TTT TGG CAA CGC TTC CCG GCC ACG GTG 345

Gly Glu Pro Val Met His Glu Phe Trp Gln Arg Phe Pro Ala Thr Val
85 90 95

GAA TTA GCT TTG ATC GCT CTG TTT ATG GCT CTT GTT TTG GGT ATT AGC 393
65 70 75 80

Leu Val Gln Phe Arg Tyr Gly Thr Thr Glu Val Glu Pro Ala Leu Ala
85 90 95

Thr Ser Trp Asp Ile Ser Pro Asp Gly Leu Val Tyr Thr Phe His Leu
100 105 110

Arg Lys Gly Val Tyr Phe His Gln Thr Lys Tyr Trp Asn Lys Lys Val
115 120 125

Glu Phe Ser Ala Lys Asp Val Leu Phe Ser Phe Glu Arg Gln Met Asp
130 135 140

Lys Ala Lys Arg Tyr Tyr Ser Pro Gly Ala Lys Ser Tyr Lys Tyr Trp
145 150 155 160

Glu Gly Met Gly Met Ser His Ile Ile Lys Ser Ile Glu Ala Leu Asp
165 170 175

Asp Tyr Thr Ile Arg Phe Thr Leu Asn Gly Pro Glu Ala Pro Phe Leu
180 185 190

Ala Asn Leu Gly Met Asp Phe Leu Ser Ile Leu Ser Lys Asp Tyr Ala
195 200 205

Asp Tyr Leu Ala Gln Asn Asn Lys Lys Asp Glu Leu Ala Lys Lys Pro
210 215 220

Ile Gly Thr Gly Pro Phe Lys Phe Phe Leu Trp Asn Lys Asp Glu Lys
225 230 235 240

Ile Ile Leu Leu Lys Asn Gln Asp Tyr Trp Gly Pro Lys Ala Tyr Leu
245 250 255

Asp Lys Val Val Val Arg Thr Ile Pro Asn Ser Ser Thr Arg Ala Leu
260 265 270

Ala Leu Arg Thr Gly Glu Ile Met Leu Met Thr Gly Pro Asn Leu Asn
275 280 285

Glu Val Glu Gln Leu Glu Lys Val Pro Asn Ile Val Val Asp Lys Ser
290 295 300

Ala Gly Leu Leu Ala Ser Trp Leu Ser Leu Asn Thr Gln Lys Lys Tyr
305 310 315 320

Phe Asp Asn Pro Leu Val Arg Leu Ala Ile Asn His Ala Ile Asn Ala
325 330 335

Asp Asp Tyr Ile Lys Val Leu Tyr Glu Gly Phe Ala Gln Lys Met Val
340 345 350

Asn Pro Phe Pro Pro Thr Ile Trp Gly Tyr Asn Tyr Asn Ile Lys Pro
355 360 365

Tyr Glu Tyr Asp Leu Lys Lys Ala Lys Glu Leu Leu Lys Gln Ala Gly
370 375 380

Tyr Pro Asn Gly Phe Lys Thr Thr Ile Phe Thr Thr Ala Thr Arg Asn
385 390 395 400

Pro Lys Gly Ala Val Phe Ile Gln Ala Ser Leu Ala Lys Ile Gly Ile
405 410 415

Asp Val Lys Ile Glu Val Tyr Glu Trp Gly Ala Tyr Leu Lys Arg Thr
420 425 430

Gly Leu Gly Glu His Glu Met Ala Phe Ser Gly Trp Met Ala Asp Ile
435 440 445

Ala Asp Pro Asp Asn Phe Leu Tyr Thr Leu Trp Ser Glu Gln Ala Ala
450 455 460

Ser Ala Ile Pro Thr Gln Asn His Ser Phe Tyr Lys Asn Lys Glu Phe
465 470 475 480

Ser Asn Leu Leu Ile Lys Ala Lys Arg Val Ser Asp Gln Lys Glu Arg
485 490 495

Glu Ala Leu Tyr Leu Lys Ala Gln Glu Ile Ile His Lys Asp Ala Pro
500 505 510
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GTG TAT GAG TGG GGG GCT TAT TTG AAA AGA ACG GGT CTG GGC GAA CAT 13 5 0 

Val Tyr Glu Trp Gly Ala Tyr Leu Lys Arg Thr Gly Leu Gly Glu His 
425 430 435 

GAA ATG GCG TTT TCA GGC TGG ATG GCA GAC ATT GCG GAT CCG GAT AAT 13 98 

Glu Met Ala Phe Ser Gly Trp Met Ala Asp lie Ala Asp Pro Asp Asn 
440 445 450 

TTC TTA TAC ACC TTA TGG AGC GAG CAA GCC GCC TCA GCT ATA CCC ACT 1446

Phe Leu Tyr Thr Leu Trp Ser Glu Gln Ala Ala Ser Ala Ile Pro Thr
455 460 465 

CAA AAC CAT TCC TTT TAT AAA AAT AAG GAG TTT TCC AAT CTG CTC ATA 14 94 

Gin Asn His Ser Phe Tyr Lys Asn Lys Glu Phe Ser Asn Leu Leu lie 
470 475 480 485 

AAG GCT AAA CGC GTT TCG GAT CAA AAA GAG AGG GAA GCC CTT TAT TTA 1542 
Lys Ala Lys Arg Val Ser Asp Gin Lys Glu Arg Glu Ala Leu Tyr Leu 
490 495 500 

AAG GCA CAA GAA ATT ATC CAT AAA GAC GCG CCT TAT GTG CCT TTA GCC 15 90 

Lys Ala Gin Glu lie lie His Lys Asp Ala Pro Tyr Val Pro Leu Ala 
505 510 515 

TAT CCT TAT TCG GTG GTG CCG CAT TTG TCT AAA GTT AAG GGT TAT AAA 163 8 

Tyr Pro Tyr Ser Val Val Pro His Leu Ser Lys Val Lys Gly Tyr Lys 
520 525 530 

ACG ACC GGA GTG AGC GTG AAT CGC TTC TTT AAG GTG TAT TTA GAA AAA T 1687
Thr Thr Gly Val Ser Val Asn Arg Phe Phe Lys Val Tyr Leu Glu Lys
535 540 545

AAAAGGGGTT GCATGCTGAG TTTTATCATT AAGCG 1722

(2) INFORMATION FOR SEQ ID NO: 1260:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 549 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1260: 

Met Asn Asn Val Phe Val Lys Gly Leu Phe Phe Phe Leu Leu Leu Phe 

1 5 10 15

Gly Phe Phe Leu Lys Ala Ser Glu Ser Pro Asn Ala Thr Leu Asn Pro

20 25 30

Ser Lys Glu Asn Val Ser Val Glu Glu Gln Lys Arg Phe Gly Gly Val

35 40 45

Leu Val Phe Ala Arg Gly Ala Asp Gly Ser Ser Met Asp Pro Ala Leu

50 55 60

Val Thr Asp Gly Glu Ser Tyr Val Ala Thr Gly Asn Ile Tyr Asp Thr
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Asp Phe Leu Ser lie Leu Ser Lys Asp Tyr Ala Asp Tyr Leu Ala Gin 
200 205 210 

AAT AAT AAA AAA GAC GAG TTG GCT AAA AAA CCT ATT GGG ACA GGG CCT 726 
Asn Asn Lys Lys Asp Glu Leu Ala Lys Lys Pro lie Gly Thr Gly Pro 
215 220 225 

TTC AAA TTC TTT TTG TGG AAT AAA GAT GAA AAA ATC ATT CTT TTA AAA 774

Phe Lys Phe Phe Leu Trp Asn Lys Asp Glu Lys Ile Ile Leu Leu Lys
230 235 240 245 

AAT CAA GAT TAT TGG GGG CCT AAA GCG TAT TTG GAT AAG GTG GTG GTG 822 
Asn Gin Asp Tyr Trp Gly Pro Lys Ala Tyr Leu Asp Lys Val Val Val 
250 255 260 

CGC AGO ATT CCT AAT TCT TCC ACT CGC GCT TTA GCG TTG CGC ACC GGC 87 0 

Arg Thr lie Pro Asn Ser Ser Thr Arg Ala Leu Ala Leu Arg Thr Gly 
265 270 275 

GAA ATC ATG CTC ATG ACT GGG CCT AAT CTC AAT GAA GTG GAG CAA TTA 918 
Glu lie Met Leu Met Thr Gly Pro Asn Leu Asn Glu Val Glu Gin Leu 
280 285 290 

GAA AAA GTC CCT AAT ATC GTG GTG GAC AAA AGT GCT GGG TTG TTG GCG 96 6 

Glu Lys Val Pro Asn lie Val Val Asp Lys Ser Ala Gly Leu Leu Ala 
295 300 305 

AGT TGG CTT TCG TTG AAC ACG CAA AAA AAG TAT TTT GAC AAC CCT TTG 1014 
Ser Trp Leu Ser Leu Asn Thr Gin Lys Lys Tyr Phe Asp Asn Pro Leu 
310 315 320 325 

GTG CGT TTG GCT ATC AAT CAT GCG ATC AAT GCA GAT GAT TAC ATC AAA 10 62 

Val Arg Leu Ala lie Asn His Ala lie Asn Ala Asp Asp Tyr lie Lys 
330 335 340 

GTG CTT TAT GAA GGC TTT GCT CAA AAA ATG GTC AAT CCT TTC CCG CCC 1110 
Val Leu Tyr Glu Gly Phe Ala Gin Lys Met Val Asn Pro Phe Pro Pro 
345 350 355 

ACC ATA TGG GGT TAT AAC TAC AAT ATC AAA CCC TAT GAA TAC GAT TTG 115 8 

Thr lie Trp Gly Tyr Asn Tyr Asn lie Lys Pro Tyr Glu Tyr Asp Leu 
360 365 370 

AAA AAG GCT AAG GAG TTG TTG AAA CAA GCG GGC TAT CCT AAC GGC TTT 12 06 

Lys Lys Ala Lys Glu Leu Leu Lys Gin Ala Gly Tyr Pro Asn Gly Phe 
375 380 385 

AAA ACC ACT ATT TTT ACC ACT GCC ACT CGT AAC CCA AAA GGA GCG GTG 12 54 

Lys Thr Thr lie Phe Thr Thr Ala Thr Arg Asn Pro Lys Gly Ala Val 
390 395 400 405 

TTC ATA CAG GCG AGC CTG GCT AAA ATT GGC ATT GAT GTG AAA ATT GAA 13 02 

Phe lie Gin Ala Ser Leu Ala Lys lie Gly lie Asp Val Lys lie Glu 
410 415 420 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1259: 

ATCTCTATCC TTTATAGAAT TTGTTGTGGA GACTGGCTT ATG AAT AAT GTT TTT 54 

Met Asn Asn Val Phe 
1 5 

GTT AAG GGT TTG TTT TTT TTT CTT TTA TTG TTT GGG TTT TTT TTG AAA 102 
Val Lys Gly Leu Phe Phe Phe Leu Leu Leu Phe Gly Phe Phe Leu Lys 
10 15 20 

GCT TCA GAA AGC CCA AAC GCT ACT CTT AAT CCA TCT AAA GAA AAT GTT 15 0 

Ala Ser Glu Ser Pro Asn Ala Thr Leu Asn Pro Ser Lys Glu Asn Val 
25 30 35 

TCT GTT GAA GAG CAA AAG CGT TTT GGA GGC GTT TTA GTT TTT GCA AGA 198 
Ser Val Glu Glu Gin Lys Arg Phe Gly Gly Val Leu Val Phe Ala Arg 
40 45 50 

GGC GCT GAT GGC TCG AGC ATG GAT CCT GCT TTA GTG ACT GAT GGC GAA 24 6 

Gly Ala Asp Gly Ser Ser Met Asp Pro Ala Leu Val Thr Asp Gly Glu 
55 60 65 

AGC TAT GTA GCA ACG GGC AAT ATT TAT GAC ACG CTC GTG CAA TTC AGA 2 94 

Ser Tyr Val Ala Thr Gly Asn lie Tyr Asp Thr Leu Val Gin Phe Arg 
70 75 80 85 

TAC GGC ACC ACA GAA GTT GAA CCC GCC TTA GCG ACA AGC TGG GAC ATA 34 2 

Tyr Gly Thr Thr Glu Val Glu Pro Ala Leu Ala Thr Ser Trp Asp lie 
90 95 100 

TCC CCA GAT GGT CTT GTA TAT ACC TTT CAT TTA CGC AAA GGG GTT TAT 3 90 

Ser Pro Asp Gly Leu Val Tyr Thr Phe His Leu Arg Lys Gly Val Tyr 
105 110 115 

TTC CAC CAA ACG AAG TAT TGG AAT AAA AAA GTA GAG TTT AGC GCT AAA 43 8 

Phe His Gin Thr Lys Tyr Trp Asn Lys Lys Val Glu Phe Ser Ala Lys 
120 125 130 

GAT GTG CTG TTT TCG TTT GAA CGC CAG ATG GAT AAA GCT AAA CGA TAT 4 86 

Asp Val Leu Phe Ser Phe Glu Arg Gin Met Asp Lys Ala Lys Arg Tyr 
135 140 145 

TAT AGC CCG GGG GCT AAA AGC TAT AAG TAT TGG GAA GGC ATG GGC ATG 5 34 

Tyr Ser Pro Gly Ala Lys Ser Tyr Lys Tyr Trp Glu Gly Met Gly Met 
150 155 160 165 

TCT CAT ATT ATT AAG AGC ATT GAA GCT TTA GAT GAC TAT ACC ATT AGA 5 82 

Ser His lie lie Lys Ser lie Glu Ala Leu Asp Asp Tyr Thr lie Arg 
170 175 180 

TTC ACA CTT AAT GGG CCA GAA GCC CCG TTT TTA GCG AAT TTG GGC ATG 63 0 

Phe Thr Leu Asn Gly Pro Glu Ala Pro Phe Leu Ala Asn Leu Gly Met 
185 190 195 

GAC TTT TTG AGC ATT TTG AGT AAG GAT TAC GCT GAT TAC CTG GCT CAA 67 8 
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Leu Pro Gly Leu Asp Leu lie lie Phe Pro Glu Tyr Ser Thr His Gly 

50 55 60 

lie Met Tyr Asp Arg Gin Glu Met Phe Asp Thr Ala Ala Ser Val Pro 
65 70 75 80 

Gly Glu Glu Thr Ala He Phe Ala Glu Ala Cys Lys Lys Asn Lys Val 

85 90 95 

Trp Gly Val Phe Ser Leu Thr Gly Glu Lys His Glu Gin Ala Lys Lys 

100 105 110 

Asn Pro Tyr Asn Thr Leu He Leu Val Asn Asp Lys Gly Glu He Val 

115 120 125 

Gin Lys Tyr Arg Lys He Leu Pro Trp Cys Pro He Glu Cys Trp Tyr 

130 135 140 

Pro Gly Asp Lys Thr Tyr Val Val Asp Gly Pro Lys Gly Leu Lys Val 
145 150 155 160 

Ser Leu He He Cys Asp Asp Gly Asn Tyr Pro Glu He Trp Arg Asp 

165 170 175 

Cys Ala Met Arg Gly Ala Glu Leu He Val Arg Cys Gin Gly Tyr Met 

180 185 190 

Tyr Pro Ala Lys Glu Gin Gin He Ala He Val Lys Ala Met Ala Trp 

195 200 205 

Ala Asn Gin Cys Tyr Val Ala Val Ala Asn Ala Thr Gly Phe Asp Gly 

210 215 220 

Val Tyr Ser Tyr Phe Gly His Ser Ser He He Gly Phe Asp Gly His 
225 230 235 240 

Thr Leu Gly Glu Cys Gly Glu Glu Glu Asn Gly Leu Gin Tyr Ala Gin 

245 250 255 

Leu Ser Val Gin Gin He Arg Asp Ala Arg Lys Tyr Asp Gin Ser Gin 

260 265 270 

Asn Gin Leu Phe Lys Leu Leu His Arg Gly Tyr Ser Gly Val Phe Ala 

275 280 285 

Ser Gly Asp Gly Asp Lys Gly Val Ala Glu Cys Pro Phe Glu Phe Tyr 

290 295 300 

Lys Thr Trp Val Asn Asp Pro Lys Lys Ala Gin Glu Asn Val Glu Lys 
305 310 315 320 

He Thr Arg Pro Ser Val Gly Val Ala Ala Cys Pro Val Gly Asp Leu 
325 330 335 

Pro Thr Lys 



(2) INFORMATION FOR SEQ ID NO: 1259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 40...1686
(D) OTHER INFORMATION: 
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lie Ala lie Val Lys Ala Met Ala Trp Ala Asn Gin Cys Tyr Val Ala 
200 205 210 215 

GTA GCG AAT GCG ACC GGT TTT GAT GGG GTG TAT TCC TAT TTT GGG CAT 72 4 

Val Ala Asn Ala Thr Gly Phe Asp Gly Val Tyr Ser Tyr Phe Gly His 
220 225 230 

TCT AGC ATT ATT GGT TTT GAG GGG CAT ACT TTG GGC GAA TGC GGG GAA 7 72 

Ser Ser He He Gly Phe Asp Gly His Thr Leu Gly Glu Cys Gly Glu 
235 240 245 

GAA GAA AAT GGT CTT CAA TAC GCT CAA CTT TCT GTG CAA CAA ATC CGT 82 0 

Glu Glu Asn Gly Leu Gin Tyr Ala Gin Leu Ser Val Gin Gin He Arg 
250 255 260 

GAT GCG AGA AAA TAC GAC CAA AGC CAA AAC CAA CTC TTC AAA CTC TTG 86 8 

Asp Ala Arg Lys Tyr Asp Gin Ser Gin Asn Gin Leu Phe Lys Leu Leu 
265 270 275 

CAC AGA GGT TAT AGT GGG GTT TTT GCT AGT GGC GAT GGG GAT AAG GGT 916 
His Arg Gly Tyr Ser Gly Val Phe Ala Ser Gly Asp Gly Asp Lys Gly 
280 285 290 295 

GTG GCG GAA TGC CCT TTT GAG TTC TAT AAA ACT TGG GTG AAT GAC CCC 964

Val Ala Glu Cys Pro Phe Glu Phe Tyr Lys Thr Trp Val Asn Asp Pro
300 305 310

AAA AAA GCT CAA GAA AAT GTA GAA AAA ATC ACC CGC CCA AGC GTG GGT 1012
Lys Lys Ala Gln Glu Asn Val Glu Lys Ile Thr Arg Pro Ser Val Gly
315 320 325 

GTG GCC GCT TGT CCT GTG GGC GAT TTG CCC ACG AAA TAAAGGGCAA AAGGAG 1064 
Val Ala Ala Cys Pro Val Gly Asp Leu Pro Thr Lys 
330 335 

GAGGGGGGGG G 1075

(2) INFORMATION FOR SEQ ID NO: 1258:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 339 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1258: 

Met Arg His Gly Asp Ile Ser Ser Ser Pro Asp Thr Val Gly Val Ala

1 5 10 15

Val Val Asn Tyr Lys Met Pro Arg Leu His Thr Lys Asn Glu Val Leu

20 25 30

Glu Asn Cys Arg Asn Ile Ala Lys Val Ile Gly Gly Val Lys Gln Gly
35 40 45
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1257:

GGAACTCTCA TCAAAAAACA AGGAACATAA T ATG AGA CAT GGA GAT ATT AGT 52

Met Arg His Gly Asp Ile Ser
1 5

AGC AGC CCA GAT ACT GTG GGT GTA GCG GTA GTT AAT TAT AAG ATG CCT 100

Ser Ser Pro Asp Thr Val Gly Val Ala Val Val Asn Tyr Lys Met Pro
10 15 20 

AGA CTC CAC ACT AAG AAT GAG GTG TTG GAA AAT TGT CGC AAT ATC GCT 14 8 

Arg Leu His Thr Lys Asn Glu Val Leu Glu Asn Cys Arg Asn lie Ala 
25 30 35 

AAG GTG ATT GGT GGG GTC AAA CAG GGT TTG CCT GGG TTG GAT CTG ATT 196 
Lys Val lie Gly Gly Val Lys Gin Gly Leu Pro Gly Leu Asp Leu lie 
40 45 50 55 

ATT TTC CCT GAA TAC AGC ACG CAT GGG ATT ATG TAT GAC AGA CAA GAA 244

Ile Phe Pro Glu Tyr Ser Thr His Gly Ile Met Tyr Asp Arg Gln Glu
60 65 70 

ATG TTT GAT ACA GCC GCA AGC GTT CCT GGA GAA GAA ACC GCG ATC TTT 2 92 

Met Phe Asp Thr Ala Ala Ser Val Pro Gly Glu Glu Thr Ala lie Phe 
75 80 85 

GCT GAA GCT TGT AAG AAA AAC AAG GTT TGG GGA GTG TTC TCT TTG ACA 34 0 

Ala Glu Ala Cys Lys Lys Asn Lys Val Trp Gly Val Phe Ser Leu Thr 
90 95 100 

GGG GAA AAA CAC GAG CAA GCC AAA AAG AAT CCC TAT AAC ACT TTG ATT 38 8 

Gly Glu Lys His Glu Gin Ala Lys Lys Asn Pro Tyr Asn Thr Leu lie 
105 110 115 

CTT GTC AAT GAT AAG GGT GAG ATC GTG CAA AAA TAC CGC AAA ATC TTG 43 6 

Leu Val Asn Asp Lys Gly Glu lie Val Gin Lys Tyr Arg Lys lie Leu 
120 125 130 135 

CCT TGG TGC CCT ATT GAA TGT TGG TAT CCT GGG GAT AAA ACT TAT GTG 4 84 

Pro Trp Cys Pro lie Glu Cys Trp Tyr Pro Gly Asp Lys Thr Tyr Val 
140 145 150 

GTT GAT GGG CCT AAG GGC TTG AAA GTT TCT TTG ATT ATT TGC GAT GAT 53 2 

Val Asp Gly Pro Lys Gly Leu Lys Val Ser Leu lie lie Cys Asp Asp 
155 160 165 

GGA AAC TAC CCT GAA ATT TGG CGC GAT TGC GCG ATG CGT GGG GCA GAA 58 0 

Gly Asn Tyr Pro Glu He Trp Arg Asp Cys Ala Met Arg Gly Ala Glu 
170 175 180 

CTC ATT GTG CGC TGT CAA GGT TAC ATG TAT CCG GCT AAG GAG CAA CAA 62 8 

Leu He Val Arg Cys Gin Gly Tyr Met Tyr Pro Ala Lys Glu Gin Gin 
185 190 195 

ATT GCA ATA GTA AAA GCT ATG GCG TGG GCC AAT CAA TGT TAT GTA GCG 67 6 
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Glu Ser Asp Phe Ser Leu Leu Glu Leu Lys Asn Lys Arg Phe Ser Asp 

2595 2600 2605 

Pro Asn Pro Glu Val Phe Val Lys Tyr Ser Gin Leu Ser Lys His Pro 

2610 2615 2620 

Asn Asn Leu Trp Val Gln Gly Val Gly Gly Ala Ser Phe Ile Ser Gly
2625 2630 2635 2640

Gly Asn Gly Thr Leu Tyr Gly Leu Asn Ala Gly Tyr Asp Arg Leu Val 

2645 2650 2655 

Lys Asn Val He Leu Gly Gly Tyr Val Ala Tyr Gly Tyr Ser Asp Phe 

2660 2665 2670 

Asn Gly Asn He Met His Ser Leu Gly Asn Asn Val Asp Val Gly Met 

2675 2680 2685 

Tyr Ala Arg Ala Phe Leu Lys Arg Asn Glu Phe Thr Leu Ser Ala Asn 

2690 2695 2700 

Glu Thr Tyr Gly Gly Asn Ala Thr Ser Ile Asn Ser Ser Asn Ser Leu
2705 2710 2715 2720

Leu Ser Val Leu Asn Gin Arg Tyr Asn Tyr Asn Thr Trp Thr Thr Ser 

2725 2730 2735 

Val Asn Gly Asn Tyr Gly Tyr Asp Phe Met Phe Lys Gin Lys Ser Val 

2740 2745 2750 

Val Leu Lys Pro Gin Val Gly Leu Ser Tyr His Phe He Gly Leu Ser 

2755 2760 2765 

Gly Met Lys Gly Asn Asp Ala Ala Tyr Lys Gin Phe Leu Met His Ser 

2770 2775 2780 

Asn Pro Ser Asn Glu Ser Val Leu Thr Leu Asn Met Gly Leu Glu Ser 
2785 2790 2795 2800

Arg Lys Tyr Phe Gly Lys Asn Ser Tyr Tyr Phe Val Thr Ala Arg Leu 

2805 2810 2815 

Gly Arg Asp Leu Leu He Lys Ser Lys Gly Ser Asn Thr Val Arg Phe 

2820 2825 2830 

Val Gly Glu Asn Thr Leu Leu Tyr Arg Lys Gly Glu Val Phe Asn Thr 

2835 2840 2845 

Phe Ala Ser Val He Thr Gly Gly Glu Met His Leu Trp Arg Leu Val 

2850 2855 2860 

Tyr Val Asn Ala Gly Val Gly Leu Lys Met Gly Leu Gln Tyr Gln Asp
2865 2870 2875 2880

He Asn He Thr Gly Asn Val Gly Met Arg Val Ala Phe 
2885 2890 

(2) INFORMATION FOR SEQ ID NO: 1257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 32...1048
(D) OTHER INFORMATION: 
2145 2150 2155 2160

Thr Thr Leu Asn Ala Ser Asn Gly Leu Lys lie Asn Ala Ala Asn Asn 

2165 2170 2175 

Asn Val Ser Val Ser Gin Gly Asn Leu Phe Val Ser Ala Ser Cys Ala 

2180 2185 2190 

Gin Gin Ser Asp Pro Thr Thr Ala Asn lie Ala Asn Pro Cys Ala Leu 

2195 2200 2205 

Ser Ala Gin Ser Thr Asn Gly Ala Ser Ser Asn Asn Ala Ser Asn Asn 

2210 2215 2220 

Ala Pro Ile Ala Leu Ser Asn Asn Asp Glu Ser Leu Met Val Ala Ala
2225 2230 2235 2240

Asn Asp Phe Asn Phe Ser Gly Asn lie Tyr Ala Asn Gly Val Val Asp 

2245 2250 2255 

Phe Ser Lys lie Lys Gly Ser Ala Asn lie Lys Asn Leu Tyr Leu Tyr 

2260 2265 2270 

Asn Asn Ala Gin Phe Gin Ala Asn Asn Leu Thr lie Ser Asn Gin Ala 

2275 2280 2285 

Val Leu Glu Lys Asn Ala Ser Phe Val Thr Asn Asn Leu Asn lie Gin 

2290 2295 2300 

Gly Ala Phe Asn Asn Asn Ala Thr Gln Lys Ile Glu Val Leu Gln Asn
2305 2310 2315 2320

Leu Val lie Ala Ser Asn Ala Ser Leu Ser Thr Gly lie Tyr Gly Leu 

2325 2330 2335 

Glu Val Gly Gly Ala Leu Asn Asn Ser Gly Ala lie His Phe Asn Leu 

2340 2345 2350 

Glu Asn Thr Gin Thr Pro Thr Pro Leu lie Gin Ala Glu Gly He He 

2355 2360 2365 

Asn Leu Asn Thr Thr Gin Thr Pro Phe Met Asn Val Asn Asn Ser Met 

2370 2375 2380 

Ala Asn Asn Thr Thr Tyr Thr Leu Leu Lys Ser Ser Arg Tyr Ile Asp
2385 2390 2395 2400

Tyr Asn He Asn Pro Asn Ser Leu Gin Ser Tyr Leu Asn Leu Tyr Thr 

2405 2410 2415 

Leu He Asn He Asn Gly Asn His He Glu Glu Lys Asn Gly Ala Leu 

2420 2425 2430 

Thr Tyr Leu Gly Gin Arg Val Leu Leu Gin Asp Lys Gly Leu Leu Leu 

2435 2440 2445 

Ser Val Ala Leu Pro Asn Ser Asn Asn Ala Ser Gin Asn Asn He Leu 

2450 2455 2460 

Ser Leu Ser Val Leu Tyr Asn Gln Val Lys Met Ser Cys Gly Asp Lys
2465 2470 2475 2480

Ala Met Asp Phe Thr Pro Pro Thr Leu Gin Asp Tyr He Val Gly He 

2485 2490 2495 

Gin Gly Gin Ser Ala Leu Asn Gin He Glu Ala Val Gly Gly Asn Ala 

2500 2505 2510 

He Lys Trp Leu Ser Thr Leu Met Met Glu Thr Lys Glu Asn Pro Phe 

2515 2520 2525 

Phe Ala Pro He Tyr Leu Lys Asn His Ser Leu Asn Glu He Leu Gly 

2530 2535 2540 

Val Thr Lys Asp Leu Gln Asn Thr Ala Ser Leu Ile Ser Asn Pro Asn
2545 2550 2555 2560

Phe Arg Asp Asn Ala Thr Asn Leu Leu Glu Leu Ala Ser Tyr Thr Gin 

2565 2570 2575 

Gin Thr Ser Arg Leu Thr Lys Leu Ser Asp Phe Arg Ser Arg Glu Gly 
2580 2585 2590 
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Leu Asn Gin Leu Thr Lys Leu lie Thr Pro Asn Asp Trp Lys Asn lie 

1715 1720 1725 

Asn Glu Leu lie Asp Asn Ala Asn Asn Ser Val Val Gin Asn Phe Asn 

1730 1735 1740 

Asn Gly Thr Leu Ile Val Gly Ala Thr Gln Ile Gly Gln Thr Asp Thr
1745 1750 1755 1760

Asn Ser Ala Val Val Phe Gly Gly Leu Gly Tyr Gin Thr Pro Cys Asp 

1765 1770 1775 

Tyr Thr Asp lie Val Cys Gin Lys Phe Arg Gly Thr Tyr Leu Gly Gin 

1780 1785 1790 

Leu Leu Glu Ser Ser Ser Ala Asp Leu Gly Tyr lie Asp Thr Thr Phe 

1795 1800 1805 

Asn Ala Lys Glu lie Tyr Leu Thr Gly Thr Leu Gly Ser Gly Asn Ala 

1810 1815 1820 

Trp Gly Thr Gly Gly Ser Ala Ser Val Thr Phe Asn Ser Gln Thr Ser
1825 1830 1835 1840

Leu lie Leu Asn Gin Ala Asn lie Val Ser Ser Gin Thr Asp Gly lie 

1845 1850 1855 

Phe Ser Met Leu Gly Gin Glu Gly lie Asn Lys Val Phe Asn Gin Ala 

1860 1865 1870 

Gly Leu Ala Asn lie Leu Gly Glu Val Ala Val Gin Ser lie Asn Lys 

1875 1880 1885 

Ala Gly Gly Leu Gly Asn Leu lie Val Asn Thr Leu Gly Ser Asn Ser 

1890 1895 1900 

Val Ile Gly Gly Tyr Leu Thr Pro Glu Gln Lys Asn Gln Thr Leu Ser
1905 1910 1915 1920

Gin Leu Leu Gly Gin Asn Asn Phe Asp Asn Leu Met Asn Asp Ser Gly 

1925 1930 1935 

Leu Asn Thr Ala lie Lys Asp Leu lie Arg Gin Lys Leu Gly Phe Trp 

1940 1945 1950 

Thr Gly Leu Val Gly Gly Leu Ala Gly Leu Gly Gly lie Asp Leu Gin 

1955 1960 1965 

Asn Pro Glu Lys Leu lie Gly Ser Met Ser lie Asn Asp Leu Leu Ser 

1970 1975 1980 

Lys Lys Gly Leu Phe Asn Gln Ile Thr Gly Phe Ile Ser Ala Asn Asp
1985 1990 1995 2000

lie Gly Gin Val lie Ser Val Met Leu Gin Asp lie Val Lys Pro Ser 

2005 2010 2015 

Asn Ala Leu Lys Asn Asp Val Ala Ala Leu Gly Lys Gin Met lie Gly 

2020 2025 2030 

Glu Phe Leu Gly Gin Asp Thr Leu Asn Ser Leu Glu Ser Leu Leu Gin 

2035 2040 2045 

Asn Gin Gin lie Lys Ser Val Leu Asp Lys Val Leu Ala Ala Lys Gly 

2050 2055 2060 

Leu Gly Pro Ile Tyr Glu Gln Gly Leu Gly Asp Leu Ile Pro Asn Leu
2065 2070 2075 2080

Gly Lys Lys Gly Leu Phe Ala Pro Tyr Gly Leu Ser Gin Val Trp Gin 

2085 2090 2095 

Lys Gly Asp Phe Ser Phe Asn Ala Gin Gly Asn Val Phe Val Gin Asn 

2100 2105 2110 

Ser Thr Phe Ser Asn Ala Asn Gly Gly Thr Leu Ser Phe Asn Ala Gly 

2115 2120 2125 

Asn Ser Leu He Phe Ala Gly Asn Asn His He Ala Phe Thr Asn His 

2130 2135 2140 

Ala Gly Thr Leu Gin Leu Leu Ser Asp Gin Val Ser Asn He Asn He 
1265 1270 1275 1280

Asn Ala Lys Asn Ala lie Ser Phe Thr Asn Ser Thr Asn Leu Ser Ser 

1285 1290 1295 

Gly Leu Tyr Gin Met Gin Ala Lys Ser Val Leu Phe Asp Asn Ser Asn 

1300 1305 1310 

Leu Ser Val Ser Val Gly Thr Ser Ser lie Lys Ala Asn Ala lie Asn 

1315 1320 1325 

Leu Ser Gin Asn Ala Ser lie Asn Ala Ser Asn His Ser Thr Leu Glu 

1330 1335 1340 

Leu Gln Gly Asp Leu Asn Val Asn Asp Thr Ser Ser Leu Asn Leu Asn
1345 1350 1355 1360

Gin Ser Thr lie Asn Val Ser Asn Asn Ala Thr lie Asn Asp Tyr Ala 

1365 1370 1375 

Ser Leu lie Ala Ser Asn Gly Ser His Leu Asn Phe Asn Gly Ala Val 

1380 1385 1390 

Asn Phe Asn Ser Ala Asn lie Thr Thr Ser Leu Asn Asn Ser Ser lie 

1395 1400 1405 

Val Phe Lys Gly Ala Val Ser Leu Gly Gly Gin Phe Asn Leu Ser Asn 

1410 1415 1420 

Asn Ser Ser Leu Asp Phe Gln Gly Ser Ser Ala Ile Thr Ser Asn Thr
1425 1430 1435 1440

Ala Phe Asn Phe Tyr Asp Asn Ala Phe Ser Gin Ser Pro lie Thr Phe 

1445 1450 1455 

His Gin Ala Leu Asp lie Lys Ala Pro Leu Ser Leu Gly Gly Asn Leu 

1460 1465 1470 

Leu Asn Pro Asn Asn Ser Ser Val Leu Asp Leu Lys Asn Ser Gin Leu 

1475 1480 1485 

Val Phe Gly Asp Gin Gly Ser Leu Asn lie Ala Asn lie Asp Leu Leu 

1490 1495 1500 

Ser Asp Leu Asn Asp Asn Lys Asn Arg Val Tyr Asn Ile Ile Gln Ala
1505 1510 1515 1520

Asp Met Asn Ser Asn Trp Tyr Glu Arg lie Ser Phe Phe Gly Met His 

1525 1530 1535 

lie Asn Asp Gly lie Tyr Asp Ala Lys Asn Gin Thr Tyr Ser Phe Thr 

1540 1545 1550 

Asn Pro Leu Asn Asn Ala Leu Lys lie Thr Glu Ser Phe Lys Asp Asn 

1555 1560 1565 

Gin Leu Ser Val Thr Leu Ser Gin lie Pro Gly lie Lys Asn Thr Leu 

1570 1575 1580 

Tyr Asn Ile Gly Ser Glu Ile Phe Asn Tyr Gln Lys Val Tyr Asn Asn
1585 1590 1595 1600

Ala Asn Gly Val Tyr Ser Tyr Ser Asp Asp Ala Gin Gly Val Phe Tyr 

1605 1610 1615 

Leu Thr Ser Asn Val Lys Gly Tyr Tyr Asn Pro Asn Gin Ser Tyr Gin 

1620 1625 1630 

Ala Ser Gly Ser Asn Asn Thr Thr Lys Asn Asn Asn Leu Thr Ser Glu 

1635 1640 1645 

Ser Ser lie lie Ser Gin Thr Tyr Asn Ala Gin Gly Asn Pro lie Ser 

1650 1655 1660 

Ala Leu His Ile Tyr Asn Lys Gly Tyr Asn Phe Asn Asn Ile Lys Ala
1665 1670 1675 1680

Leu Gly Gin Met Ala Leu Lys Leu Tyr Pro Glu lie Lys Lys Val Leu 

1685 1690 1695 

Gly Asn Asp Phe Ser Pro Ser Ser Leu Asn Ala Leu Asn Ser Asn Ala 
1700 1705 1710 
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Phe Asn Ser Gly Ser Phe Glu lie Ser Ala Lys Asn Ala Ser Phe Asn 

835 840 845 

Asn Ala Asn Phe Asn Asn Ser Ala Ser Phe Asn Phe Asn Asn Ser Asn 

850 855 860 

Ala Thr Thr Ser Phe Val Gly Asp Phe Thr Asn Ala Asn Ser Asn Leu 
865 870 875 880 

Gin lie Ala Gly Asn Ala Val Phe Gly Asn Ser Thr Asn Gly Ser Gin 

885 890 895 

Asn Thr Ala Asn Phe Asn Asn Thr Gly Ser Val Asn lie Ser Gly Asn 

900 905 910 

Ala Thr Phe Asp Asn Val Val Phe Asn Gly Pro Thr Asn Thr Ser Val 

915 920 925 

Lys Gly Gin Val Thr Leu Asn Asn lie Thr Leu Lys Asn Leu Asn Ala 

930 935 940 

Pro Leu Ser Phe Gly Asp Gly Thr lie Thr Phe Asn Ala His Ser Val 
945 950 955 960 

lie Asn lie Ala Glu Ser lie Thr Asn Gly Asn Pro lie Thr Leu Val 

965 970 975 

Ser Ser Ser Lys Glu lie Glu Tyr Asn Asn Ala Phe Ser Lys Asn Leu 

980 985 990 

Trp Gin Leu lie Asn Tyr Gin Gly His Gly Ala Ser Ser Glu Lys Leu 

995 1000 1005 

Val Ser Ser Ala Gly Asn Gly Val Tyr Asp Val Val Tyr Ser Phe Asn 

1010 1015 1020 

Asn Gln Thr Tyr Asn Phe Gln Glu Val Phe Ser Gln Asn Ser Ile Ser
1025 1030 1035 1040

lie Arg Arg Leu Gly Val Asn Met Val Phe Asp Tyr Val Asp Met Glu 

1045 1050 1055 

Lys Ser Asp His Leu Tyr Tyr Gin Asn Ala Leu Gly Phe Met Thr Tyr 

1060 1065 1070 

Met Pro Asn Ser Tyr Asn Asn Asn Leu Gly Asn Ala Asn Asn Thr lie 

1075 1080 1085 

Tyr Tyr Tyr Asp Lys Ser lie Asp Phe Tyr Ala Ser Gly Lys Thr Leu 

1090 1095 1100 

Phe Thr Lys Ala Glu Phe Ser Gln Thr Phe Thr Gly Gln Asn Ser Ala
1105 1110 1115 1120

lie Val Phe Gly Ala Lys Ser lie Trp Thr Ser Leu Ser Asp Ala Pro 

1125 1130 1135 

Gin Ser Asn Thr lie lie Arg Phe Gly Asp Asn Lys Gly Ala Gly Ser 

1140 1145 1150 

Asn Asp Ala Ser Gly His Cys Trp Asn Leu Gin Cys lie Gly Phe lie 

1155 1160 1165 

Thr Gly His Tyr Glu Ala Gin Lys lie Tyr lie Thr Gly Ser lie Glu 

1170 1175 1180 

Ser Gly Asn Arg Ile Ser Ser Gly Gly Gly Ala Ser Leu Asn Phe Asn
1185 1190 1195 1200

Gly Leu Gin Gly lie Leu Leu Thr Asn Ala Thr Leu Tyr Asn Arg Ala 

1205 1210 1215 

Ala Gly Thr Gin Ser Ser Ser Met Asn Phe lie Ser Asn Ser Ala Asn 

1220 1225 1230 

lie Gin Ala Gin Asn Ser Tyr Phe lie Asp Asp Thr Ala Gin Asn Gly 

1235 1240 1245 

Gly Asn Pro Asn Phe Ser Phe Asn Ala Leu Asn Leu Asp Phe Ser Asn 

1250 1255 1260 

Ser Ser Phe Arg Gly Tyr Val Gly Lys Thr Gin Ser Val Phe Lys Phe 
385 390 395 400

Ala Thr Phe Asn Asn Thr Gly Lys Ile Thr Ile Glu Lys Asp Ala Ser
405 410 415

Phe Asn Asn Thr Thr Phe Asn Thr Ser Val Asp Thr Asn Asn Met Ser
420 425 430

Val Thr Gly Gly Val Thr Leu Ser Gly Lys Asn Asp Leu Lys Asn Gly
435 440 445

Ser Thr Leu Asp Phe Gly Ser Ser Lys Ile Thr Leu Ala Gln Gly Thr
450 455 460

Thr Phe Asn Leu Thr Ser Leu Gly Ser Glu Lys Ser Val Thr Ile Leu
465 470 475 480

Asn Ser Ser Gly Gly Ile Thr Tyr Ser Asn Leu Leu Asn His Ala Ile
485 490 495

Asn Gly Leu Thr Ser Ala Leu Lys Thr Asn Glu Ser Leu Ser Asn Pro
500 505 510

Gln Ser Phe Ala Gln Gly Leu Trp Asp Ile Ile Thr Tyr Asn Gly Val
515 520 525

Thr Gly Gln Leu Leu Asn Glu Asn Ala Ala Thr Ser Lys Pro Thr Asp
530 535 540

Ser Ser Pro Ser Lys Ser Ser Thr Asn Ser Thr Gln Val Tyr Gln Val
545 550 555 560

Gly Tyr Lys Ile Gly Asp Thr Ile Tyr Lys Leu Gln Glu Thr Phe Ser
565 570 575

His Asn Ser Ile Ile Ile Gln Ala Leu Glu Ser Gly Thr Tyr Thr Pro
580 585 590

Pro Pro Val Ile Asn Gly Ser Lys Phe Asp Leu Ser Ala Ser Asn Tyr
595 600 605

Ile Asn Ala Asp Met Pro Trp Tyr Asp His Lys Tyr Tyr Ile Pro Lys
610 615 620

Ser Gln Asn Phe Thr Glu Ser Gly Thr Tyr Tyr Leu Pro Ser Val Gln
625 630 635 640

Ile Trp Gly Ser Tyr Thr Asn Ser Phe Lys Gln Thr Phe Ser Ala Asn
645 650 655

Gly Ser Asn Leu Val Ile Gly Tyr Asn Ser Thr Trp Thr Asp His Asn
660 665 670

Val Ser Ser Ser Gly Thr Val Ser Phe Gly Asp Thr Ser Gly Ser Ala
675 680 685

Leu Asn Gly His Cys Gly Pro Trp Pro Tyr Tyr Gln Cys Thr Gly Thr
690 695 700

Thr Asn Gly Thr Tyr Ser Ala Tyr His Val Tyr Ile Thr Ala Asn Leu
705 710 715 720

Arg Ser Gly Asn Arg Ile Gly Thr Gly Gly Ala Ala Asn Leu Ile Phe
725 730 735

Asn Gly Val Asp Ser Ile Asn Ile Ala Asn Ala Thr Ile Thr Gln His
740 745 750

Asn Ala Gly Ile Tyr Ser Ser Ser Met Thr Phe Ser Thr Gln Ser Met
755 760 765

Asp Asn Ser Gln Asn Leu Asn Gly Leu Asn Ser Asn Gly Lys Leu Ser
770 775 780

Val Tyr Gly Thr Thr Phe Thr Asn Glu Ala Lys Asp Gly Lys Phe Ile
785 790 795 800

Phe Asn Ala Gly Gln Ala Val Phe Glu Asn Thr Asn Phe Asn Gly Gly
805 810 815

Ser Tyr Gln Phe Ser Gly Asp Ser Leu Asn Phe Ser Asn Asn Asn Gln
820 825 830
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1256: 

Met Lys Lys Phe Lys Lys Lys Pro Lys Ser Ile Lys Arg Ser His Gln

1 5 10 15

Asn Gln Lys Thr Ile Leu Lys Arg Pro Leu Trp Leu Met Pro Leu Leu

20 25 30

Ile Ser Gly Phe Ala Ser Gly Val Tyr Ala Asn Asn Leu Trp Asp Leu

35 40 45 

Leu Asn Pro Lys Val Gly Gly Glu Tyr Val His Trp Val Lys Gly Ser 

50 55 60 

Gin Tyr Cys Ala Trp Trp Glu Phe Ala Gly Cys Leu Lys Asn Val Trp 
65 70 75 80 

Gly Ala Asn His Lys Gly Tyr Asp Ala Gly Asn Ala Ala Asn Tyr Leu 

85 90 95 

Ser Ser Gin Asn Tyr Gin Ala lie Ser Val Gly Ser Gly Asn Glu Thr 

100 105 110 

Gly Thr Tyr Ser Leu Ser Gly Phe Thr Asn Tyr Val Gly Gly Asn Leu 

115 120 125 

Thr lie Asn Leu Gly Asn Ser Val Val Leu Asp Leu Ser Gly Ser Asn 

130 135 140 

Ser Phe Thr Ser Tyr Gin Gly Tyr Asn Gin Gly Lys Asp Asp Val Thr 
145 150 155 160 

Phe Thr Val Gly Ala lie Asn Leu Asn Gly Thr Leu Glu Val Gly Asn 

165 170 175 

Arg Val Gly Ser Gly Ala Gly Thr His Thr Gly Thr Ala Thr Leu Asn 

180 185 190 

Leu Asn Ala Asn Lys Val Asn lie Asn Ser Asn lie Asn Ala Tyr Lys 

195 200 205 

Thr Ser Gin Val Asn lie Gly Asn Ala Asn Ser Val lie Thr lie Gly 

210 215 220 

Ser Val Ser Leu Ser Gly Asp Val Cys Ser Ser Leu Ala Ser Val Gly 
225 230 235 240 

lie Gly Ala Asn Cys Ser Thr Ser Gly Pro Ser Tyr Ser Phe Lys Gly 

245 250 255 

Thr Thr Asn Ala Thr Asn Thr Ala Phe Ser Asn Ala Ser Gly Ser Phe 

260 265 270 

Thr Phe Glu Glu Asn Ala Thr Phe Ser Gly Ala Lys Trp Asn Gly Gly 

275 280 285 

Thr Tyr Thr Phe Asn Lys Glu Phe Ser Ala Thr Asn Asn Thr Ala Phe 

290 295 300 

Ser Ser Gly Ser Phe Asn Phe Lys Gly Val Ser Ser Phe Asn Gly Thr 
305 310 315 320 

Ser Phe Ser Asn Ala Ser Tyr Thr Phe Asp Asn Gin Ala Thr Phe Gin 

325 330 335 

Asn Ser Ser Phe Asn Gly Gly Thr Phe Thr Phe Asn Asn Gin Thr Asn 

340 345 350 

Pro Thr Asn Asn Ala Gin His Pro Gin lie Gin Asn Ser Ser Phe Ser 

355 360 365 

Gly Asn Ala Thr Thr Leu Lys Gly Phe Val Asn Phe Gin Gin Ala Phe 

370 375 380 

Asn Asn. .Ser Asn His Gin Leu Thr lie Gin Asn Ala Ser Phe Asn Asn 
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TCT AAT TCT TTG CTC TCT GTG TTG AAC CAA CGC TAG AAC TAG AAC ACC 8211 
Ser Asn Ser Leu Leu Ser Val Leu Asn Gln Arg Tyr Asn Tyr Asn Thr
2720 2725 2730

TGG ACA ACG AGC GTG AAC GGG AAT TAC GGC TAT GAT TTC ATG TTC AAA 8259
Trp Thr Thr Ser Val Asn Gly Asn Tyr Gly Tyr Asp Phe Met Phe Lys
2735 2740 2745

CAA AAA AGC GTG GTG CTA AAA CCT CAA GTG GGT TTG AGC TAT CAT TTC 8307
Gln Lys Ser Val Val Leu Lys Pro Gln Val Gly Leu Ser Tyr His Phe
2750 2755 2760

ATA GGT CTA AGT GGG ATG AAA GGC AAT GAT GCC GCT TAC AAA CAA TTC 8355
Ile Gly Leu Ser Gly Met Lys Gly Asn Asp Ala Ala Tyr Lys Gln Phe
2765 2770 2775 2780

CTC ATG CAT TCA AAC CCC TCT AAC GAA TCG GTT TTA ACG CTC AAC ATG 8403
Leu Met His Ser Asn Pro Ser Asn Glu Ser Val Leu Thr Leu Asn Met
2785 2790 2795

GGG TTG GAG AGC CGT AAA TAT TTT GGT AAA AAT TCC TAT TAT TTT GTA 8451
Gly Leu Glu Ser Arg Lys Tyr Phe Gly Lys Asn Ser Tyr Tyr Phe Val
2800 2805 2810

ACG GCG AGA CTA GGT AGG GAT CTT TTG ATC AAA TCT AAA GGC AGC AAT 8499
Thr Ala Arg Leu Gly Arg Asp Leu Leu Ile Lys Ser Lys Gly Ser Asn
2815 2820 2825

ACG GTG CGT TTT GTG GGC GAA AAC ACT TTA TTG TAT CGC AAG GGG GAA 8547
Thr Val Arg Phe Val Gly Glu Asn Thr Leu Leu Tyr Arg Lys Gly Glu
2830 2835 2840

GTT TTT AAC ACT TTT GCG AGC GTG ATT ACA GGG GGC GAA ATG CAT TTG 8595
Val Phe Asn Thr Phe Ala Ser Val Ile Thr Gly Gly Glu Met His Leu
2845 2850 2855 2860

TGG CGT TTG GTG TAT GTG AAT GCG GGG GTG GGG CTT AAG ATG GGC TTG 8643
Trp Arg Leu Val Tyr Val Asn Ala Gly Val Gly Leu Lys Met Gly Leu
2865 2870 2875

CAA TAC CAA GAT ATT AAT ATA ACC GGG AAT GTG GGC ATG CGA GTG GCG 8691
Gln Tyr Gln Asp Ile Asn Ile Thr Gly Asn Val Gly Met Arg Val Ala
2880 2885 2890

TTT TAGCTTTTTT GCTATAATGC TTCGTTCAAA TTTTATGGTT AGGTTTTTCT ATGT 8748
Phe



(2) INFORMATION FOR SEQ ID NO: 1256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2893 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
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ATT GTG GGC ATT CAA GGG CAA AGC GCG CTC AAT CAA ATT GAA GOT GTT 7 53 9 

Ile Val Gly Ile Gln Gly Gln Ser Ala Leu Asn Gln Ile Glu Ala Val
2495 2500 2505

GGG GGG AAC GCT ATC AAG TGG CTT TCA ACA TTG ATG ATG GAG ACT AAA 7587
Gly Gly Asn Ala Ile Lys Trp Leu Ser Thr Leu Met Met Glu Thr Lys
2510 2515 2520

GAA AAC CCG TTT TTT GCG CCG ATT TAT TTA AAA AAC CAC TCT TTG AAT 7635
Glu Asn Pro Phe Phe Ala Pro Ile Tyr Leu Lys Asn His Ser Leu Asn
2525 2530 2535 2540

GAA ATC TTA GGC GTA ACA AAA GAT CTT CAA AAC ACC GCA AGC TTG ATT 7683
Glu Ile Leu Gly Val Thr Lys Asp Leu Gln Asn Thr Ala Ser Leu Ile
2545 2550 2555

TCT AAC CCT AAT TTT AGA GAT AAC GCT ACC AAT CTT TTA GAA TTG GCG 7731
Ser Asn Pro Asn Phe Arg Asp Asn Ala Thr Asn Leu Leu Glu Leu Ala
2560 2565 2570

AGT TAC ACC CAA CAA ACC AGC CGT TTA ACA AAA CTC TCT GAT TTT AGA 7779
Ser Tyr Thr Gln Gln Thr Ser Arg Leu Thr Lys Leu Ser Asp Phe Arg
2575 2580 2585

TCT AGA GAG GGA GAG TCT GAT TTT TCT TTG TTA GAG CTT AAA AAC AAG 7827
Ser Arg Glu Gly Glu Ser Asp Phe Ser Leu Leu Glu Leu Lys Asn Lys
2590 2595 2600

CGT TTT AGC GAT CCT AAT CCA GAG GTT TTT GTC AAA TAC TCT CAA CTT 7875
Arg Phe Ser Asp Pro Asn Pro Glu Val Phe Val Lys Tyr Ser Gln Leu
2605 2610 2615 2620

AGC AAA CAC CCA AAT AAC CTT TGG GTT CAA GGG GTG GGA GGA GCG AGC 7923
Ser Lys His Pro Asn Asn Leu Trp Val Gln Gly Val Gly Gly Ala Ser
2625 2630 2635

TTT ATT TCT GGG GGC AAT GGC ACG CTT TAT GGC TTG AAT GCG GGC TAT 7971
Phe Ile Ser Gly Gly Asn Gly Thr Leu Tyr Gly Leu Asn Ala Gly Tyr
2640 2645 2650

GAC AGG TTG GTT AAA AAT GTG ATC CTT GGG GGT TAT GTG GCT TAT GGC 8019
Asp Arg Leu Val Lys Asn Val Ile Leu Gly Gly Tyr Val Ala Tyr Gly
2655 2660 2665

TAT AGC GAC TTT AAT GGG AAC ATC ATG CAT TCT TTG GGT AAT AAT GTG 8067
Tyr Ser Asp Phe Asn Gly Asn Ile Met His Ser Leu Gly Asn Asn Val
2670 2675 2680

GAT GTG GGG ATG TAT GCG AGG GCT TTT TTA AAA AGG AAC GAA TTC ACT 8115
Asp Val Gly Met Tyr Ala Arg Ala Phe Leu Lys Arg Asn Glu Phe Thr
2685 2690 2695 2700

TTG AGC GCG AAT GAA ACT TAT GGA GGC AAT GCA ACT AGT ATC AAT TCT 8163
Leu Ser Ala Asn Glu Thr Tyr Gly Gly Asn Ala Thr Ser Ile Asn Ser
2705 2710 2715
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CTG TAT CTT TAC AAT AAC GCT CAA TTC CAA GCC AAC AAT CTC ACT ATT 6 8 67 

Leu Tyr Leu Tyr Asn Asn Ala Gln Phe Gln Ala Asn Asn Leu Thr Ile
2270 2275 2280

TCC AAT CAA GCG GTG TTA GAA AAA AAC GCC AGC TTT GTA ACG AAT AAT 6915
Ser Asn Gln Ala Val Leu Glu Lys Asn Ala Ser Phe Val Thr Asn Asn
2285 2290 2295 2300

TTA AAC ATT CAA GGA GCG TTT AAC AAC AAC GCC ACG CAA AAA ATA GAG 6963
Leu Asn Ile Gln Gly Ala Phe Asn Asn Asn Ala Thr Gln Lys Ile Glu
2305 2310 2315

GTG CTT CAA AAT TTA GTG ATC GCT TCA AAC GCT TCT TTA AGC ACC GGG 7011
Val Leu Gln Asn Leu Val Ile Ala Ser Asn Ala Ser Leu Ser Thr Gly
2320 2325 2330

ATT TAT GGG TTA GAA GTA GGG GGG GCT TTG AAT AAT TCT GGA GCG ATC 7059
Ile Tyr Gly Leu Glu Val Gly Gly Ala Leu Asn Asn Ser Gly Ala Ile
2335 2340 2345

CAT TTT AAT TTA GAA AAT ACC CAA ACG CCA ACG CCG CTC ATT CAA GCA 7107
His Phe Asn Leu Glu Asn Thr Gln Thr Pro Thr Pro Leu Ile Gln Ala
2350 2355 2360

GAG GGG ATC ATT AAC CTC AAC ACC ACC CAA ACG CCT TTT ATG AAT GTC 7155
Glu Gly Ile Ile Asn Leu Asn Thr Thr Gln Thr Pro Phe Met Asn Val
2365 2370 2375 2380

AAT AAC AGC ATG GCC AAT AAT ACG ACT TAC ACT TTA TTA AAA AGC AGC 7203
Asn Asn Ser Met Ala Asn Asn Thr Thr Tyr Thr Leu Leu Lys Ser Ser
2385 2390 2395

CGT TAC ATT GAT TAC AAT ATC AAC CCC AAC AGC TTG CAA TCG TAT TTG 7251
Arg Tyr Ile Asp Tyr Asn Ile Asn Pro Asn Ser Leu Gln Ser Tyr Leu
2400 2405 2410

AAT CTC TAC ACT TTA ATC AAT ATC AAC GGG AAC CAC ATA GAG GAA AAA 7299
Asn Leu Tyr Thr Leu Ile Asn Ile Asn Gly Asn His Ile Glu Glu Lys
2415 2420 2425

AAC GGC GCA TTG ACT TAT TTG GGC CAA CGG GTT TTG TTG CAA GAT AAG 7347
Asn Gly Ala Leu Thr Tyr Leu Gly Gln Arg Val Leu Leu Gln Asp Lys
2430 2435 2440

GGG TTA TTG TTA AGC GTA GCG CTG CCC AAC TCA AAC AAC GCT TCT CAA 7395
Gly Leu Leu Leu Ser Val Ala Leu Pro Asn Ser Asn Asn Ala Ser Gln
2445 2450 2455 2460

AAC AAC ATT TTA AGC CTT TCT GTC CTT TAT AAC CAA GTT AAA ATG TCT 7443
Asn Asn Ile Leu Ser Leu Ser Val Leu Tyr Asn Gln Val Lys Met Ser
2465 2470 2475

TGC GGC GAT AAA GCG ATG GAT TTT ACC CCC CCT ACC TTA CAA GAT TAC 7491
Cys Gly Asp Lys Ala Met Asp Phe Thr Pro Pro Thr Leu Gln Asp Tyr
2480 2485 2490
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AGC TTG TTG CAA AAC CAG GAG ATT AAA AGC GTT TTA GAG AAA GTC CTA 6195 
Ser Leu Leu Gln Asn Gln Gln Ile Lys Ser Val Leu Asp Lys Val Leu
2045 2050 2055 2060

GCG GCT AAA GGT TTA GGG CCT ATT TAT GAA CAA GGC TTG GGG GAT TTG 6243
Ala Ala Lys Gly Leu Gly Pro Ile Tyr Glu Gln Gly Leu Gly Asp Leu
2065 2070 2075

ATA CCT AAT CTT GGT AAA AAA GGG CTT TTC GCT CCT TAT GGC TTG AGT 6291
Ile Pro Asn Leu Gly Lys Lys Gly Leu Phe Ala Pro Tyr Gly Leu Ser
2080 2085 2090

CAA GTG TGG CAA AAA GGG GAT TTT AGT TTC AAC GCA CAA GGC AAT GTT 6339
Gln Val Trp Gln Lys Gly Asp Phe Ser Phe Asn Ala Gln Gly Asn Val
2095 2100 2105

TTT GTG CAA AAT TCC ACT TTC TCT AAC GCC AAT GGA GGC ACG CTC TCT 6387
Phe Val Gln Asn Ser Thr Phe Ser Asn Ala Asn Gly Gly Thr Leu Ser
2110 2115 2120

TTT AAC GCA GGA AAT TCG CTC ATT TTT GCC GGA AAC AAT CAT ATT GCA 6435
Phe Asn Ala Gly Asn Ser Leu Ile Phe Ala Gly Asn Asn His Ile Ala
2125 2130 2135 2140

TTC ACT AAC CAC GCT GGA ACT CTT CAA TTA TTG TCC GAT CAA GTT TCT 6483
Phe Thr Asn His Ala Gly Thr Leu Gln Leu Leu Ser Asp Gln Val Ser
2145 2150 2155

AAC ATT AAC ATC ACC ACG CTT AAC GCT AGC AAC GGC CTT AAG ATT AAC 6531
Asn Ile Asn Ile Thr Thr Leu Asn Ala Ser Asn Gly Leu Lys Ile Asn
2160 2165 2170

GCC GCT AAT AAC AAT GTT TCT GTG TCT CAA GGC AAT CTG TTT GTC AGC 6579
Ala Ala Asn Asn Asn Val Ser Val Ser Gln Gly Asn Leu Phe Val Ser
2175 2180 2185

GCT AGC TGC GCG CAA CAA AGC GAT CCA ACT ACA GCT AAT ATT GCA AAC 6627
Ala Ser Cys Ala Gln Gln Ser Asp Pro Thr Thr Ala Asn Ile Ala Asn
2190 2195 2200

CCT TGC GCG CTT AGC GCC CAA AGC ACG AAT GGC GCT TCT TCT AAT AAT 6675
Pro Cys Ala Leu Ser Ala Gln Ser Thr Asn Gly Ala Ser Ser Asn Asn
2205 2210 2215 2220

GCG TCA AAT AAC GCG CCA ATC GCC TTG AGT AAT AAC GAT GAA AGC TTG 6723
Ala Ser Asn Asn Ala Pro Ile Ala Leu Ser Asn Asn Asp Glu Ser Leu
2225 2230 2235

ATG GTT GCG GCG AAT GAT TTC AAT TTT TCA GGC AAT ATT TAC GCT AAT 6771
Met Val Ala Ala Asn Asp Phe Asn Phe Ser Gly Asn Ile Tyr Ala Asn
2240 2245 2250

GGG GTG GTT GAT TTT TCA AAG ATT AAA GGC TCT GCA AAC ATT AAA AAC 6819
Gly Val Val Asp Phe Ser Lys Ile Lys Gly Ser Ala Asn Ile Lys Asn
2255 2260 2265
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AGC GGG AAC GCA TGG GGG ACT GGG GGG AGC GCG AGC GTA ACT TTT AAC 552 3 

Ser Gly Asn Ala Trp Gly Thr Gly Gly Ser Ala Ser Val Thr Phe Asn
1825 1830 1835

AGC CAA ACT TCG CTC ATT CTC AAT CAG GCT AAT ATC GTA AGC TCG CAA 5571
Ser Gln Thr Ser Leu Ile Leu Asn Gln Ala Asn Ile Val Ser Ser Gln
1840 1845 1850

ACC GAT GGG ATC TTT AGC ATG CTG GGT CAA GAG GGT ATT AAT AAG GTT 5619
Thr Asp Gly Ile Phe Ser Met Leu Gly Gln Glu Gly Ile Asn Lys Val
1855 1860 1865

TTC AAT CAA GCC GGG CTC GCT AAT ATT TTG GGC GAA GTG GCG GTG CAA 5667
Phe Asn Gln Ala Gly Leu Ala Asn Ile Leu Gly Glu Val Ala Val Gln
1870 1875 1880

TCC ATC AAC AAA GCC GGG GGA TTA GGG AAT TTG ATA GTA AAT ACG CTA 5715
Ser Ile Asn Lys Ala Gly Gly Leu Gly Asn Leu Ile Val Asn Thr Leu
1885 1890 1895 1900

GGG AGT AAT AGC GTG ATT GGG GGG TAT TTA ACG CCT GAA CAA AAA AAT 5763
Gly Ser Asn Ser Val Ile Gly Gly Tyr Leu Thr Pro Glu Gln Lys Asn
1905 1910 1915

CAA ACC CTA AGC CAG CTT TTA GGG CAG AAT AAC TTT GAT AAT CTC ATG 5811
Gln Thr Leu Ser Gln Leu Leu Gly Gln Asn Asn Phe Asp Asn Leu Met
1920 1925 1930

AAC GAT AGC GGT TTG AAT ACG GCG ATT AAG GAT TTG ATC AGA CAA AAA 5859
Asn Asp Ser Gly Leu Asn Thr Ala Ile Lys Asp Leu Ile Arg Gln Lys
1935 1940 1945

TTA GGC TTT TGG ACC GGG CTA GTG GGG GGA TTA GCC GGA CTA GGG GGC 5907
Leu Gly Phe Trp Thr Gly Leu Val Gly Gly Leu Ala Gly Leu Gly Gly
1950 1955 1960

ATT GAT TTG CAA AAC CCT GAA AAG CTT ATA GGC AGC ATG TCA ATC AAT 5955
Ile Asp Leu Gln Asn Pro Glu Lys Leu Ile Gly Ser Met Ser Ile Asn
1965 1970 1975 1980

GAT TTA TTG AGT AAA AAA GGG TTG TTC AAT CAG ATC ACC GGC TTT ATT 6003
Asp Leu Leu Ser Lys Lys Gly Leu Phe Asn Gln Ile Thr Gly Phe Ile
1985 1990 1995

TCC GCT AAC GAT ATA GGG CAA GTC ATA AGC GTA ATG TTG CAA GAT ATT 6051
Ser Ala Asn Asp Ile Gly Gln Val Ile Ser Val Met Leu Gln Asp Ile
2000 2005 2010

GTC AAA CCG AGC AAC GCT TTA AAA AAC GAT GTA GCG GCT TTA GGC AAG 6099
Val Lys Pro Ser Asn Ala Leu Lys Asn Asp Val Ala Ala Leu Gly Lys
2015 2020 2025

CAA ATG ATT GGC GAA TTT TTA GGC CAA GAC ACG CTC AAT TCT TTA GAA 6147
Gln Met Ile Gly Glu Phe Leu Gly Gln Asp Thr Leu Asn Ser Leu Glu
2030 2035 2040
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GTT TAT AAC AAC GCT AAT GGC GTG TAT TCT TAT AGO GAT GAT GCA CAA 4 851 

Val Tyr Asn Asn Ala Asn Gly Val Tyr Ser Tyr Ser Asp Asp Ala Gln
1600 1605 1610

GGC GTG TTT TAT CTC ACA AGC AAC GTG AAA GGC TAT TAC AAC CCT AAC 4899
Gly Val Phe Tyr Leu Thr Ser Asn Val Lys Gly Tyr Tyr Asn Pro Asn
1615 1620 1625

CAA TCC TAT CAA GCC AGC GGC AGT AAC AAC ACC ACG AAA AAT AAT AAT 4947
Gln Ser Tyr Gln Ala Ser Gly Ser Asn Asn Thr Thr Lys Asn Asn Asn
1630 1635 1640

CTA ACC TCT GAA TCT TCT ATC ATC TCG CAA ACC TAT AAC GCG CAA GGC 4995
Leu Thr Ser Glu Ser Ser Ile Ile Ser Gln Thr Tyr Asn Ala Gln Gly
1645 1650 1655 1660

AAC CCT ATT AGC GCG TTG CAC ATC TAT AAC AAG GGC TAT AAT TTC AAC 5043
Asn Pro Ile Ser Ala Leu His Ile Tyr Asn Lys Gly Tyr Asn Phe Asn
1665 1670 1675

AAT ATC AAA GCG TTA GGG CAA ATG GCT CTC AAA CTC TAC CCT GAA ATC 5091
Asn Ile Lys Ala Leu Gly Gln Met Ala Leu Lys Leu Tyr Pro Glu Ile
1680 1685 1690

AAA AAG GTA TTA GGG AAT GAT TTT TCG CCC TCA AGT TTG AAC GCT TTA 5139
Lys Lys Val Leu Gly Asn Asp Phe Ser Pro Ser Ser Leu Asn Ala Leu
1695 1700 1705

AAC TCT AAT GCG CTA AAC CAA CTT ACC AAA CTC ATC ACG CCT AAC GAC 5187
Asn Ser Asn Ala Leu Asn Gln Leu Thr Lys Leu Ile Thr Pro Asn Asp
1710 1715 1720

TGG AAA AAC ATT AAC GAG TTG ATT GAT AAC GCA AAC AAT TCG GTG GTG 5235
Trp Lys Asn Ile Asn Glu Leu Ile Asp Asn Ala Asn Asn Ser Val Val
1725 1730 1735 1740

CAA AAT TTC AAT AAC GGC ACT TTG ATT GTG GGA GCG ACT CAA ATA GGG 5283
Gln Asn Phe Asn Asn Gly Thr Leu Ile Val Gly Ala Thr Gln Ile Gly
1745 1750 1755

CAA ACA GAC ACC AAT AGC GCG GTT GTT TTT GGG GGC TTG GGC TAT CAA 5331
Gln Thr Asp Thr Asn Ser Ala Val Val Phe Gly Gly Leu Gly Tyr Gln
1760 1765 1770

ACA CCT TGT GAT TAT ACT GAT ATT GTG TGC CAA AAA TTT AGA GGC ACT 5379
Thr Pro Cys Asp Tyr Thr Asp Ile Val Cys Gln Lys Phe Arg Gly Thr
1775 1780 1785

TAT TTA GGA GAG CTT TTA GAG TCC AGC TCG GCT GAT TTG GGC TAT ATT 5427
Tyr Leu Gly Gln Leu Leu Glu Ser Ser Ser Ala Asp Leu Gly Tyr Ile
1790 1795 1800

GAC ACG ACT TTT AAC GCT AAA GAA ATT TAT CTT ACC GGC ACT TTA GGG 5475
Asp Thr Thr Phe Asn Ala Lys Glu Ile Tyr Leu Thr Gly Thr Leu Gly
1805 1810 1815 1820
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AAC GAT TAT GCG AGC TTG ATT GCG ACT AAT GGC TCT CAC CTT AAT TTT 417 9 

Asn Asp Tyr Ala Ser Leu Ile Ala Ser Asn Gly Ser His Leu Asn Phe
1375 1380 1385

AAC GGG GCG GTT AAT TTC AAT TCA GCG AAT ATT ACT ACG AGT TTG AAT 4227
Asn Gly Ala Val Asn Phe Asn Ser Ala Asn Ile Thr Thr Ser Leu Asn
1390 1395 1400

AAT TCC TCT ATC GTG TTT AAG GGG GCG GTC TCT TTA GGA GGG CAG TTT 4275
Asn Ser Ser Ile Val Phe Lys Gly Ala Val Ser Leu Gly Gly Gln Phe
1405 1410 1415 1420

AAT TTA AGC AAT AAC TCT TCT TTA GAT TTC CAA GGC TCT AGC GCT ATC 4323
Asn Leu Ser Asn Asn Ser Ser Leu Asp Phe Gln Gly Ser Ser Ala Ile
1425 1430 1435

ACC TCT AAC ACG GCG TTT AAT TTC TAT GAT AAC GCT TTT TCT CAA AGC 4371
Thr Ser Asn Thr Ala Phe Asn Phe Tyr Asp Asn Ala Phe Ser Gln Ser
1440 1445 1450

CCC ATC ACT TTC CAT CAA GCC CTT GAC ATT AAA GCG CCC TTA AGT TTG 4419
Pro Ile Thr Phe His Gln Ala Leu Asp Ile Lys Ala Pro Leu Ser Leu
1455 1460 1465

GGA GGC AAC CTT TTA AAC CCT AAC AAC AGC AGC GTG CTG GAT TTA AAA 4467
Gly Gly Asn Leu Leu Asn Pro Asn Asn Ser Ser Val Leu Asp Leu Lys
1470 1475 1480

AAC AGC CAG CTT GTT TTT GGC GAT CAA GGG AGT TTG AAT ATC GCT AAC 4515
Asn Ser Gln Leu Val Phe Gly Asp Gln Gly Ser Leu Asn Ile Ala Asn
1485 1490 1495 1500

ATT GAT TTA CTA AGC GAT CTA AAT GAT AAT AAA AAT CGT GTG TAT AAC 4563
Ile Asp Leu Leu Ser Asp Leu Asn Asp Asn Lys Asn Arg Val Tyr Asn
1505 1510 1515

ATC ATT CAA GCG GAC ATG AAT AGT AAT TGG TAT GAG CGT ATC AGC TTC 4611
Ile Ile Gln Ala Asp Met Asn Ser Asn Trp Tyr Glu Arg Ile Ser Phe
1520 1525 1530

TTT GGC ATG CAC ATC AAT GAC GGG ATT TAT GAT GCT AAA AAC CAA ACT 4659
Phe Gly Met His Ile Asn Asp Gly Ile Tyr Asp Ala Lys Asn Gln Thr
1535 1540 1545

TAT AGT TTC ACT AAC CCC CTT AAT AAC GCC CTA AAA ATC ACC GAG AGC 4707
Tyr Ser Phe Thr Asn Pro Leu Asn Asn Ala Leu Lys Ile Thr Glu Ser
1550 1555 1560

TTT AAA GAC AAC CAA CTA AGC GTT ACG CTC TCT CAA ATC CCG GGT ATT 4755
Phe Lys Asp Asn Gln Leu Ser Val Thr Leu Ser Gln Ile Pro Gly Ile
1565 1570 1575 1580

AAA AAC ACG CTC TAT AAC ATT GGC TCT GAA ATT TTT AAC TAC CAA AAA 4803
Lys Asn Thr Leu Tyr Asn Ile Gly Ser Glu Ile Phe Asn Tyr Gln Lys
1585 1590 1595
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GGA GCA GGG AGT AAT GAT GCG AGC GGG CAT TGC TGG 7VAT TTG CAA TGC 3 5 07 

Gly Ala Gly Ser Asn Asp Ala Ser Gly His Cys Trp Asn Leu Gln Cys
1150 1155 1160

ATA GGC TTT ATT ACA GGG CAT TAT GAA GCG CAA AAG ATT TAC ATC ACC 3555
Ile Gly Phe Ile Thr Gly His Tyr Glu Ala Gln Lys Ile Tyr Ile Thr
1165 1170 1175 1180

GGT AGC ATT GAA AGC GGG AAT CGC ATT TCT AGC GGT GGG GGC GCG AGC 3603
Gly Ser Ile Glu Ser Gly Asn Arg Ile Ser Ser Gly Gly Gly Ala Ser
1185 1190 1195

CTT AAT TTT AAC GGG CTT CAA GGC ATT CTT TTA ACG AAC GCG ACT TTG 3651
Leu Asn Phe Asn Gly Leu Gln Gly Ile Leu Leu Thr Asn Ala Thr Leu
1200 1205 1210

TAT AAC CGC GCC GCT GGC ACG CAA AGC TCG TCT ATG AAT TTT ATC TCT 3699
Tyr Asn Arg Ala Ala Gly Thr Gln Ser Ser Ser Met Asn Phe Ile Ser
1215 1220 1225

AAC AGC GCG AAC ATT CAG GCT CAA AAC TCC TAT TTT ATA GAC GAT ACC 3747
Asn Ser Ala Asn Ile Gln Ala Gln Asn Ser Tyr Phe Ile Asp Asp Thr
1230 1235 1240

GCA CAA AAT GGC GGT AAC CCT AAT TTC AGT TTC AAC GCT TTG AAT CTG 3795
Ala Gln Asn Gly Gly Asn Pro Asn Phe Ser Phe Asn Ala Leu Asn Leu
1245 1250 1255 1260

GAT TTT TCT AAC AGC TCT TTT AGA GGC TAT GTG GGG AAA ACG CAA TCT 3843
Asp Phe Ser Asn Ser Ser Phe Arg Gly Tyr Val Gly Lys Thr Gln Ser
1265 1270 1275

GTT TTT AAA TTC AAT GCC AAG AAT GCG ATC AGT TTC ACC AAC AGC ACG 3891
Val Phe Lys Phe Asn Ala Lys Asn Ala Ile Ser Phe Thr Asn Ser Thr
1280 1285 1290

AAT TTA AGC TCT GGT TTG TAT CAA ATG CAA GCT AAA AGC GTG TTG TTT 3939
Asn Leu Ser Ser Gly Leu Tyr Gln Met Gln Ala Lys Ser Val Leu Phe
1295 1300 1305

GAC AAT TCC AAT TTA AGC GTT TCA GTG GGG ACA AGC AGT ATT AAA GCC 3987
Asp Asn Ser Asn Leu Ser Val Ser Val Gly Thr Ser Ser Ile Lys Ala
1310 1315 1320

AAT GCG ATC AAT CTT TCT CAA AAT GCC TCT ATT AAT GCG AGC AAC CAT 4035
Asn Ala Ile Asn Leu Ser Gln Asn Ala Ser Ile Asn Ala Ser Asn His
1325 1330 1335 1340

TCA ACC TTA GAA CTT CAA GGC GAT TTG AAT GTG AAC GAC ACC AGC TCG 4083
Ser Thr Leu Glu Leu Gln Gly Asp Leu Asn Val Asn Asp Thr Ser Ser
1345 1350 1355

CTC AAC CTC AAC CAA AGC ACG ATT AAT GTT TCC AAT AAC GCC ACG ATC 4131
Leu Asn Leu Asn Gln Ser Thr Ile Asn Val Ser Asn Asn Ala Thr Ile
1360 1365 1370
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AAC ACG AGC GTG AAA GGG CAG GTT ACT TTA AAT AAC ATC ACT TTA AAA 2 8 35 

Asn Thr Ser Val Lys Gly Gln Val Thr Leu Asn Asn Ile Thr Leu Lys
925 930 935 940

AAC CTG AAC GCC CCT TTG TCT TTT GGC GAT GGG ACG ATT ACT TTT AAC 2883
Asn Leu Asn Ala Pro Leu Ser Phe Gly Asp Gly Thr Ile Thr Phe Asn
945 950 955

GCT CAT TCG GTG ATT AAT ATT GCT GAA TCT ATC ACT AAT GGC AAC CCT 2931
Ala His Ser Val Ile Asn Ile Ala Glu Ser Ile Thr Asn Gly Asn Pro
960 965 970

ATC ACT CTT GTA AGC TCT TCT AAA GAA ATT GAA TAC AAC AAC GCT TTC 2979
Ile Thr Leu Val Ser Ser Ser Lys Glu Ile Glu Tyr Asn Asn Ala Phe
975 980 985

AGT AAA AAT CTA TGG CAG CTC ATC AAC TAC CAA GGG CAT GGG GCA AGC 3027
Ser Lys Asn Leu Trp Gln Leu Ile Asn Tyr Gln Gly His Gly Ala Ser
990 995 1000

AGT GAA AAG CTC GTC TCT AGC GCG GGT AAT GGC GTT TAT GAT GTG GTG 3075
Ser Glu Lys Leu Val Ser Ser Ala Gly Asn Gly Val Tyr Asp Val Val
1005 1010 1015 1020

TAT TCT TTC AAT AAC CAA ACC TAC AAT TTC CAA GAG GTT TTT TCA CAA 3123
Tyr Ser Phe Asn Asn Gln Thr Tyr Asn Phe Gln Glu Val Phe Ser Gln
1025 1030 1035

AAC AGC ATT TCT ATC CGG CGT TTG GGC GTT AAC ATG GTG TTT GAT TAT 3171
Asn Ser Ile Ser Ile Arg Arg Leu Gly Val Asn Met Val Phe Asp Tyr
1040 1045 1050

GTG GAT ATG GAA AAA TCG GAT CAT TTA TAT TAT CAA AAC GCT CTC GGT 3219
Val Asp Met Glu Lys Ser Asp His Leu Tyr Tyr Gln Asn Ala Leu Gly
1055 1060 1065

TTT ATG ACC TAC ATG CCT AAT AGC TAT AAC AAT AAT TTA GGG AAT GCA 3267
Phe Met Thr Tyr Met Pro Asn Ser Tyr Asn Asn Asn Leu Gly Asn Ala
1070 1075 1080

AAC AAC ACC ATT TAC TAT TAC GAC AAG AGC ATT GAT TTT TAT GCG AGC 3315
Asn Asn Thr Ile Tyr Tyr Tyr Asp Lys Ser Ile Asp Phe Tyr Ala Ser
1085 1090 1095 1100

GGG AAA ACT CTA TTC ACT AAA GCG GAA TTT TCT CAA ACA TTC ACC GGG 3363
Gly Lys Thr Leu Phe Thr Lys Ala Glu Phe Ser Gln Thr Phe Thr Gly
1105 1110 1115

CAA AAC AGC GCG ATC GTT TTT GGG GCT AAA AGC ATA TGG ACG AGC TTA 3411
Gln Asn Ser Ala Ile Val Phe Gly Ala Lys Ser Ile Trp Thr Ser Leu
1120 1125 1130

AGC GAT GCA CCG CAG TCT AAC ACC ATC ATT CGC TTT GGG GAC AAT AAG 3459
Ser Asp Ala Pro Gln Ser Asn Thr Ile Ile Arg Phe Gly Asp Asn Lys
1135 1140 1145
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TGC ACA GGC ACG ACT AAC GGC ACT TAT AGO GCC TAT CAT GTG TAT ATC 2163 
Cys Thr Gly Thr Thr Asn Gly Thr Tyr Ser Ala Tyr His Val Tyr Ile
705 710 715

ACA GCG AAT CTG CGT TCT GGC AAT CGT ATA GGC ACC GGT GGG GCA GCT 2211
Thr Ala Asn Leu Arg Ser Gly Asn Arg Ile Gly Thr Gly Gly Ala Ala
720 725 730

AAT CTA ATC TTT AAT GGG GTA GAT AGT ATC AAT ATC GCT AAC GCT ACC 2259
Asn Leu Ile Phe Asn Gly Val Asp Ser Ile Asn Ile Ala Asn Ala Thr
735 740 745

ATC ACG CAA CAT AAC GCC GGA ATC TAT TCA AGC TCT ATG ACT TTT TCC 2307
Ile Thr Gln His Asn Ala Gly Ile Tyr Ser Ser Ser Met Thr Phe Ser
750 755 760

ACG CAA AGC ATG GAT AAT TCG CAG AAT TTG AAT GGT CTA AAT TCT AAC 2355
Thr Gln Ser Met Asp Asn Ser Gln Asn Leu Asn Gly Leu Asn Ser Asn
765 770 775 780

GGC AAA CTT TCG GTG TAT GGC ACC ACT TTC ACT AAC GAA GCT AAA GAT 2403
Gly Lys Leu Ser Val Tyr Gly Thr Thr Phe Thr Asn Glu Ala Lys Asp
785 790 795

GGG AAA TTC ATT TTC AAT GCA GGG CAA GCG GTT TTT GAA AAC ACC AAC 2451
Gly Lys Phe Ile Phe Asn Ala Gly Gln Ala Val Phe Glu Asn Thr Asn
800 805 810

TTT AAT GGA GGG AGT TAC CAA TTC AGC GGC GAT AGC TTG AAT TTT TCA 2499
Phe Asn Gly Gly Ser Tyr Gln Phe Ser Gly Asp Ser Leu Asn Phe Ser
815 820 825

AAC AAC AAC CAG TTC AAT AGC GGT TCG TTT GAA ATT AGC GCA AAA AAC 2547
Asn Asn Asn Gln Phe Asn Ser Gly Ser Phe Glu Ile Ser Ala Lys Asn
830 835 840

GCT TCG TTC AAT AAC GCT AAC TTT AAC AAC AGC GCT TCT TTT AAT TTC 2595
Ala Ser Phe Asn Asn Ala Asn Phe Asn Asn Ser Ala Ser Phe Asn Phe
845 850 855 860

AAT AAT TCT AAC GCG ACC ACT TCG TTT GTG GGG GAT TTC ACT AAC GCT 2643
Asn Asn Ser Asn Ala Thr Thr Ser Phe Val Gly Asp Phe Thr Asn Ala
865 870 875

AAT TCA AAT TTG CAA ATC GCC GGG AAC GCT GTT TTT GGG AAC TCT ACT 2691
Asn Ser Asn Leu Gln Ile Ala Gly Asn Ala Val Phe Gly Asn Ser Thr
880 885 890

AAT GGC TCT CAA AAT ACC GCT AAT TTT AAT AAT ACC GGC TCT GTT AAT 2739
Asn Gly Ser Gln Asn Thr Ala Asn Phe Asn Asn Thr Gly Ser Val Asn
895 900 905

ATT TCA GGG AAT GCA ACC TTT GAT AAT GTG GTG TTT AAT GGC CCT ACG 2787
Ile Ser Gly Asn Ala Thr Phe Asp Asn Val Val Phe Asn Gly Pro Thr
910 915 920
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GTA ACG ATT TTA AAT TCT AGC GOT GGG ATC ACT TAT AGT AAC CTT TTA 14 91 

Val Thr Ile Leu Asn Ser Ser Gly Gly Ile Thr Tyr Ser Asn Leu Leu
480 485 490

AAC CAT GCA ATC AAC GGC TTG ACA AGT GCC TTA AAA ACG AAC GAA AGC 1539
Asn His Ala Ile Asn Gly Leu Thr Ser Ala Leu Lys Thr Asn Glu Ser
495 500 505

CTT TCA AAT CCG CAA AGT TTC GCT CAA GGT TTG TGG GAT ATA ATC ACT 1587
Leu Ser Asn Pro Gln Ser Phe Ala Gln Gly Leu Trp Asp Ile Ile Thr
510 515 520

TAC AAT GGG GTT ACC GGG CAG CTT TTG AAT GAA AAC GCT GCA ACA TCT 1635
Tyr Asn Gly Val Thr Gly Gln Leu Leu Asn Glu Asn Ala Ala Thr Ser
525 530 535 540

AAA CCC ACT GAC TCT TCG CCC TCT AAA TCC TCT ACA AAC TCT ACG CAA 1683
Lys Pro Thr Asp Ser Ser Pro Ser Lys Ser Ser Thr Asn Ser Thr Gln
545 550 555

GTC TAT CAA GTG GGT TAC AAA ATA GGG GAT ACT ATC TAC AAA CTG CAA 1731
Val Tyr Gln Val Gly Tyr Lys Ile Gly Asp Thr Ile Tyr Lys Leu Gln
560 565 570

GAA ACT TTC AGC CAC AAT TCC ATT ATT ATT CAG GCT TTA GAG AGC GGG 1779
Glu Thr Phe Ser His Asn Ser Ile Ile Ile Gln Ala Leu Glu Ser Gly
575 580 585

ACT TAC ACG CCA CCC CCT GTC ATT AAC GGC TCC AAA TTT GAC TTA TCC 1827
Thr Tyr Thr Pro Pro Pro Val Ile Asn Gly Ser Lys Phe Asp Leu Ser
590 595 600

GCT TCA AAT TAT ATC AAT GCT GAC ATG CCT TGG TAT GAC CAT AAA TAT 1875
Ala Ser Asn Tyr Ile Asn Ala Asp Met Pro Trp Tyr Asp His Lys Tyr
605 610 615 620

TAC ATC CCT AAA TCC CAA AAT TTT ACA GAG AGC GGG ACT TAT TAC TTG 1923
Tyr Ile Pro Lys Ser Gln Asn Phe Thr Glu Ser Gly Thr Tyr Tyr Leu
625 630 635

CCG AGC GTC CAA ATA TGG GGG AGC TAC ACT AAC TCG TTT AAA CAA ACT 1971
Pro Ser Val Gln Ile Trp Gly Ser Tyr Thr Asn Ser Phe Lys Gln Thr
640 645 650

TTT AGC GCA AAT GGT AGT AAT CTG GTG ATT GGG TAT AAC TCA ACA TGG 2019
Phe Ser Ala Asn Gly Ser Asn Leu Val Ile Gly Tyr Asn Ser Thr Trp
655 660 665

ACT GAT CAT AAT GTC TCT TCT AGC GGC ACG GTG TCT TTT GGG GAC ACT 2067
Thr Asp His Asn Val Ser Ser Ser Gly Thr Val Ser Phe Gly Asp Thr
670 675 680

TCA GGG AGC GCT CTT AAT GGG CAT TGC GGA CCT TGG CCG TAT TAC CAA 2115
Ser Gly Ser Ala Leu Asn Gly His Cys Gly Pro Trp Pro Tyr Tyr Gln
685 690 695 700
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TCT TTT AAA GGG ACQ ACT AAC OCT ACT AAC ACQ GCG TTT AGT AAT GCA 819 
Ser Phe Lys Gly Thr Thr Asn Ala Thr Asn Thr Ala Phe Ser Asn Ala
255 260 265

AGC GGC AGT TTC ACT TTT GAA GAG AAC GCC ACT TTT AGC GGG GCG AAA 867
Ser Gly Ser Phe Thr Phe Glu Glu Asn Ala Thr Phe Ser Gly Ala Lys
270 275 280

TGG AAT GGG GGG ACT TAT ACC TTT AAT AAA GAG TTT AGC GCT ACC AAT 915
Trp Asn Gly Gly Thr Tyr Thr Phe Asn Lys Glu Phe Ser Ala Thr Asn
285 290 295 300

AAC ACC GCC TTT AGT AGC GGT AGT TTT AAT TTT AAA GGT GTA AGC TCT 963
Asn Thr Ala Phe Ser Ser Gly Ser Phe Asn Phe Lys Gly Val Ser Ser
305 310 315

TTT AAT GGT ACT TCG TTT AGT AAC GCT TCT TAT ACT TTT GAC AAT CAA 1011
Phe Asn Gly Thr Ser Phe Ser Asn Ala Ser Tyr Thr Phe Asp Asn Gln
320 325 330

GCC ACT TTC CAA AAC AGC TCC TTT AAT GGG GGG ACT TTT ACT TTT AAT 1059
Ala Thr Phe Gln Asn Ser Ser Phe Asn Gly Gly Thr Phe Thr Phe Asn
335 340 345

AAC CAA ACT AAT CCA ACT AAC AAC GCT CAG CAC CCC CAA ATT CAA AAC 1107
Asn Gln Thr Asn Pro Thr Asn Asn Ala Gln His Pro Gln Ile Gln Asn
350 355 360

AGC TCT TTT AGT GGT AAC GCT ACC ACT CTT AAG GGC TTT GTG AAT TTC 1155
Ser Ser Phe Ser Gly Asn Ala Thr Thr Leu Lys Gly Phe Val Asn Phe
365 370 375 380

CAG CAA GCC TTT AAC AAT TCA AAC CAC CAA CTA ACG ATC CAA AAC GCT 1203
Gln Gln Ala Phe Asn Asn Ser Asn His Gln Leu Thr Ile Gln Asn Ala
385 390 395

TCC TTT AAT AAC GCC ACT TTT AAC AAT ACC GGT AAA ATC ACT ATA GAA 1251
Ser Phe Asn Asn Ala Thr Phe Asn Asn Thr Gly Lys Ile Thr Ile Glu
400 405 410

AAA GAT GCG AGT TTT AAT AAC ACG ACA TTC AAC ACT TCT GTT GAT ACA 1299
Lys Asp Ala Ser Phe Asn Asn Thr Thr Phe Asn Thr Ser Val Asp Thr
415 420 425

AAC AAC ATG AGT GTT ACC GGT GGC GTT ACT TTA AGC GGT AAA AAT GAC 1347
Asn Asn Met Ser Val Thr Gly Gly Val Thr Leu Ser Gly Lys Asn Asp
430 435 440

TTG AAA AAT GGC TCA ACC CTT GAT TTT GGG AGT TCT AAA ATC ACT CTC 1395
Leu Lys Asn Gly Ser Thr Leu Asp Phe Gly Ser Ser Lys Ile Thr Leu
445 450 455 460

GCT CAA GGG ACG ACT TTC AAC CTC ACA AGT TTA GGC AGT GAG AAG AGC 1443
Ala Gln Gly Thr Thr Phe Asn Leu Thr Ser Leu Gly Ser Glu Lys Ser
465 470 475
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ATG CCT TTA CTC ATC AGC GGG TTT GCT AGT GGG GTG TAT GCG AAT AAT 14 7 

Met Pro Leu Leu Ile Ser Gly Phe Ala Ser Gly Val Tyr Ala Asn Asn
30 35 40

CTG TGG GAT TTG TTA AAC CCA AAA GTG GGG GGT GAG TAT GTG CAT TGG 195
Leu Trp Asp Leu Leu Asn Pro Lys Val Gly Gly Glu Tyr Val His Trp
45 50 55 60

GTT AAG GGC AGT CAG TAT TGT GCA TGG TGG GAA TTT GCT GGG TGT TTA 243
Val Lys Gly Ser Gln Tyr Cys Ala Trp Trp Glu Phe Ala Gly Cys Leu
65 70 75

AAG AAT GTA TGG GGG GCA AAT CAT AAA GGC TAT GAT GCT GGA AAC GCC 291
Lys Asn Val Trp Gly Ala Asn His Lys Gly Tyr Asp Ala Gly Asn Ala
80 85 90

GCT AAC TAT TTG TCT TCT CAA AAC TAT CAA GCT ATT TCG GTG GGT AGT 339
Ala Asn Tyr Leu Ser Ser Gln Asn Tyr Gln Ala Ile Ser Val Gly Ser
95 100 105

GGG AAT GAA ACG GGG ACT TAT AGT TTA AGC GGT TTT ACC AAT TAT GTT 387
Gly Asn Glu Thr Gly Thr Tyr Ser Leu Ser Gly Phe Thr Asn Tyr Val
110 115 120

GGG GGC AAT CTC ACG ATC AAT CTA GGC AAT AGC GTT GTT TTA GAT TTA 435
Gly Gly Asn Leu Thr Ile Asn Leu Gly Asn Ser Val Val Leu Asp Leu
125 130 135 140

AGC GGT TCT AAT AGT TTC ACT TCG TAT CAA GGT TAT AAT CAA GGC AAA 483
Ser Gly Ser Asn Ser Phe Thr Ser Tyr Gln Gly Tyr Asn Gln Gly Lys
145 150 155

GAT GAT GTA ACA TTT ACG GTT GGC GCA ATC AAT TTA AAC GGC ACT TTA 531
Asp Asp Val Thr Phe Thr Val Gly Ala Ile Asn Leu Asn Gly Thr Leu
160 165 170

GAA GTG GGT AAT CGT GTG GGA TCG GGA GCT GGC ACG CAC ACC GGC ACA 579
Glu Val Gly Asn Arg Val Gly Ser Gly Ala Gly Thr His Thr Gly Thr
175 180 185

GCC ACT TTA AAC TTG AAC GCT AAT AAG GTC AAT ATC AAT TCC AAT ATC 627
Ala Thr Leu Asn Leu Asn Ala Asn Lys Val Asn Ile Asn Ser Asn Ile
190 195 200

AAC GCG TAT AAA ACT TCG CAA GTG AAT ATA GGC AAC GCT AAC AGC GTT 675
Asn Ala Tyr Lys Thr Ser Gln Val Asn Ile Gly Asn Ala Asn Ser Val
205 210 215 220

ATT ACC ATT GGT TCG GTT TCT TTG AGT GGG GAT GTT TGC AGT TCT TTA 723
Ile Thr Ile Gly Ser Val Ser Leu Ser Gly Asp Val Cys Ser Ser Leu
225 230 235

GCT AGC GTT GGG ATA GGG GCT AAT TGC TCC ACT TCT GGG CCT AGC TAT 771
Ala Ser Val Gly Ile Gly Ala Asn Cys Ser Thr Ser Gly Pro Ser Tyr
240 245 250
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130 

135 140

Leu Asn Lys Glu Gly Gln Ala Ile Thr Leu Cys Met His Phe Gly Tyr
145 150 155 160

Trp Glu Ala Val Gly Thr Thr Leu Ala Gln Tyr Tyr Glu Asn Tyr Gly
165 170 175

Arg Gly Cys Leu Gly Arg Leu Thr Lys Phe Ala Pro Ile Asn His Met
180 185 190

Ile Met Ser Arg Arg Glu Ala Phe Gly Val Arg Phe Val Asn Lys Ile
195 200 205

Gly Ala Met Lys Glu Leu Ile Lys Met Tyr Asn Gln Gly Asn Gly Leu
210 215 220

Val Gly Ile Leu Val Asp Gln Asn Val Val Pro Lys Asp Gly Val Val
225 230 235 240

Val Lys Phe Phe Asp Arg Asp Ala Thr His Thr Thr Ile Ala Ser Ile
245 250 255

Leu Ser Arg Arg Tyr Asn Ile Asp Ile Gln Pro Val Phe Ile Asp Phe
260 265 270

Asn Asp Asp Tyr Ser His Tyr Thr Ala Thr Tyr Tyr Pro Ser Ile Arg
275 280 285

Ser Gln Ile Thr Asp Asn Ala Gln Asn Asp Ile Leu Glu Cys Thr Gln
290 295 300

Ala Gln Ala Ser Leu Cys Glu Glu Val Ile Arg Asn His Pro Glu Ser
305 310 315 320

Tyr Phe Trp Phe His Arg Arg Phe Lys Ser Thr His Pro Glu Ile Tyr
325 330 335

Gln Arg



(2) INFORMATION FOR SEQ ID NO: 1255:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8748 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...8694
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1255: 

AGAGGGTAGC ATTTA ATG AAA AAG TTT AAA AAG AAA CCA AAA AGT ATC AAA 51
Met Lys Lys Phe Lys Lys Lys Pro Lys Ser Ile Lys
1 5 10

CGA TCG CAT CAA AAT CAA AAA ACA ATC TTA AAG CGT CCT TTA TGG CTT 99
Arg Ser His Gln Asn Gln Lys Thr Ile Leu Lys Arg Pro Leu Trp Leu
15 20 25
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TTG TCG CGC CGT TAT AAT ATA GAT ATT CAG CCG GTA TTC ATT GAT TTT 875 
Leu Ser Arg Arg Tyr Asn Ile Asp Ile Gln Pro Val Phe Ile Asp Phe
260 265 270

AAT GAG GAT TAT TCG CAT TAT ACA GCG ACC TAT TAT CCG AGT ATC CGC 923
Asn Asp Asp Tyr Ser His Tyr Thr Ala Thr Tyr Tyr Pro Ser Ile Arg
275 280 285

TCT CAA ATC ACC GAT AAC GCG CAA AAC GAT ATT TTA GAA TGC ACG CAA 971
Ser Gln Ile Thr Asp Asn Ala Gln Asn Asp Ile Leu Glu Cys Thr Gln
290 295 300

GCC CAA GCG AGT TTG TGC GAA GAG GTG ATT AGA AAC CAC CCG GAA AGT 1019
Ala Gln Ala Ser Leu Cys Glu Glu Val Ile Arg Asn His Pro Glu Ser
305 310 315 320

TAT TTT TGG TTC CAT AGG CGT TTT AAA AGC ACC CAC CCT GAG ATT TAT 1067
Tyr Phe Trp Phe His Arg Arg Phe Lys Ser Thr His Pro Glu Ile Tyr
325 330 335

CAA AGA TAGGGTTTTG TTTTAATCAA AAATTAAAAA CTAAAGCCTT ATTTTTAAGA AAA 1126
Gln Arg

CTTT 1130

(2) INFORMATION FOR SEQ ID NO: 1254:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1254: 



Met Arg Phe Lys His Leu Lys Gly Lys Arg Met Thr Tyr Lys Glu Arg
1 5 10 15

Leu Ile His Glu Lys Ile Leu Lys Gln Asp Asp Lys Gly Phe Lys Thr
20 25 30

Glu Leu Arg Ile Leu Ser Ile Phe Ile Val Glu Ser Leu Val Asn Ile
35 40 45

Leu Gly Phe Ile Leu Ala Lys Met Pro His Ser Trp Phe Leu Arg Cys
50 55 60

Ile Lys Ala Val Ala Trp Leu Met Lys Thr Phe Asp Lys Cys Arg Tyr
65 70 75 80

Phe Asp Ala Lys Ala Asn Leu Asp Phe Val Phe Gly Asp Ser Lys Ser
85 90 95

Glu Glu Glu Lys Lys Arg Ile Ile Lys Lys Gly Tyr Glu Asn Phe Ala
100 105 110

Phe Ile Ile Leu Glu Thr Ile Arg Val Ile Phe Ile Pro Lys Asp Glu
115 120 125

Tyr Asp Ala Arg Phe Thr Leu Ile Asn Glu Glu Asn Val Trp Lys Ser
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GAA CTG CGC ATT TTG AGT ATT TTT ATC GTG GAA TCT TTA GTG AAT ATT 203 
Glu Leu Arg Ile Leu Ser Ile Phe Ile Val Glu Ser Leu Val Asn Ile
35 40 45

TTG GGG TTT ATT TTA GCT AAA ATG CCC CAT TCG TGG TTT TTA AGG TGC 251
Leu Gly Phe Ile Leu Ala Lys Met Pro His Ser Trp Phe Leu Arg Cys
50 55 60

ATT AAA GCG GTG GCG TGG CTC ATG AAA ACT TTT GAT AAG TGC CGT TAT 299
Ile Lys Ala Val Ala Trp Leu Met Lys Thr Phe Asp Lys Cys Arg Tyr
65 70 75 80

TTT GAC GCT AAG GCC AAT TTG GAT TTT GTG TTT GGG GAT TCT AAA AGC 347
Phe Asp Ala Lys Ala Asn Leu Asp Phe Val Phe Gly Asp Ser Lys Ser
85 90 95

GAA GAA GAG AAA AAA AGG ATC ATT AAA AAG GGT TAT GAA AAT TTT GCT 395
Glu Glu Glu Lys Lys Arg Ile Ile Lys Lys Gly Tyr Glu Asn Phe Ala
100 105 110

TTC ATT ATT TTA GAA ACT ATT AGA GTG ATC TTT ATC CCT AAA GAT GAA 443
Phe Ile Ile Leu Glu Thr Ile Arg Val Ile Phe Ile Pro Lys Asp Glu
115 120 125

TAC GAC GCT CGT TTC ACG CTC ATC AAT GAA GAA AAT GTG TGG AAA TCT 491
Tyr Asp Ala Arg Phe Thr Leu Ile Asn Glu Glu Asn Val Trp Lys Ser
130 135 140

TTA AAC AAG GAA GGC CAA GCG ATC ACT TTA TGC ATG CAT TTT GGC TAT 539
Leu Asn Lys Glu Gly Gln Ala Ile Thr Leu Cys Met His Phe Gly Tyr
145 150 155 160

TGG GAA GCG GTA GGC ACG ACT TTA GCG CAA TAT TAT GAA AAT TAT GGT 587
Trp Glu Ala Val Gly Thr Thr Leu Ala Gln Tyr Tyr Glu Asn Tyr Gly
165 170 175

AGG GGG TGT TTG GGG CGT TTG ACT AAA TTT GCC CCT ATC AAT CAC ATG 635
Arg Gly Cys Leu Gly Arg Leu Thr Lys Phe Ala Pro Ile Asn His Met
180 185 190

ATT ATG AGT AGG CGA GAG GCG TTT GGG GTG CGT TTT GTC AAT AAA ATA 683
Ile Met Ser Arg Arg Glu Ala Phe Gly Val Arg Phe Val Asn Lys Ile
195 200 205

GGG GCG ATG AAA GAA CTC ATT AAA ATG TAT AAT CAA GGC AAT GGT CTG 731
Gly Ala Met Lys Glu Leu Ile Lys Met Tyr Asn Gln Gly Asn Gly Leu
210 215 220

GTG GGG ATT TTA GTG GAT CAA AAT GTC GTG CCT AAA GAT GGG GTG GTG 779
Val Gly Ile Leu Val Asp Gln Asn Val Val Pro Lys Asp Gly Val Val
225 230 235 240

GTG AAA TTC TTT GAT AGA GAC GCT ACG CAC ACC ACG ATC GCT TCT ATT 827
Val Lys Phe Phe Asp Arg Asp Ala Thr His Thr Thr Ile Ala Ser Ile
245 250 255
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Pro Asn lie Asn 
Pro Ser Leu Ile Ser Leu Lys Asp Arg Phe Leu Pro
305 310 315 320

His Glu Lys His Ser Gln Lys Val Lys Lys Glu Cys Val Lys Leu Phe
325 330 335

Glu Ala Leu Ser Pro Leu His Lys Ile Asp Glu Lys Tyr Leu Phe His
340 345 350

Leu Lys Ile Ala Gly Glu Leu Ala Ser Met Gly Lys Ile Leu Ser Val
355 360 365

Tyr Leu Ala His Lys His Ser Ala Tyr Phe Ile Leu Asn Ala Leu Ser
370 375 380

Tyr Gly Phe Ser His Gln Asp Arg Ala Ile Ile Cys Leu Leu Ala Gln
385 390 395 400

Phe Ser His Lys Lys Ile Pro Lys Asp Asn Ala Ile Ala His Met Ser
405 410 415

Ala Met Met Pro Ser Leu Leu Thr Leu Gln Trp Leu Ser Phe Ile Leu
420 425 430

Ser Leu Ala Glu Asn Leu Cys Leu Thr Asp Ser His His Leu Lys Tyr
435 440 445

Thr Leu Glu Lys Asn Lys Leu Val Ile His Ser Asn Asp Thr Leu Tyr
450 455 460

Leu Ala Lys Glu Met Leu Pro Lys Leu Val Lys Pro Ile Pro Leu Thr
465 470 475 480

Ile Glu Phe Ala



(2) INFORMATION FOR SEQ ID NO:1253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1130 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 60... 1073 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1253: 
GCCAACTACC ATAAAAAGGA TTTTTCTATC CAAAATATAG AGCCTAAAAA AATTAAAGA 59

ATG CGT TTT AAA CAT CTT AAA GGA AAA AGA ATG ACT TAC AAA GAA CGA 107
Met Arg Phe Lys His Leu Lys Gly Lys Arg Met Thr Tyr Lys Glu Arg
1 5 10 15

CTC ATA CAC GAA AAA ATA TTG AAA CAA GAC GAC AAG GGT TTT AAA ACA 155
Leu Ile His Glu Lys Ile Leu Lys Gln Asp Asp Lys Gly Phe Lys Thr
20 25 30
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GCT TGAAAATAGC GATTGTCAGG CTTTCAGCG 153 0 

Ala 



(2) INFORMATION FOR SEQ ID NO: 1252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252:



Met Ala Lys Ile Thr Thr Val Ile Asp Ile Gly Ser Asn Ser Val Arg
1 5 10 15

Leu Ala Val Phe Lys Lys Thr Ser Gln Phe Gly Phe Tyr Leu Leu Phe
20 25 30

Glu Thr Lys Ser Lys Val Arg Ile Ser Glu Gly Cys Tyr Ala Phe Asn
35 40 45

Gly Ile Leu Gln Glu Ile Pro Met Gln Arg Ala Val Lys Ala Leu Ser
50 55 60

Glu Phe Lys Glu Ile Ala Leu Lys Tyr Lys Ser Lys Lys Ile Leu Cys
65 70 75 80

Val Ala Thr Ser Ala Val Arg Asp Ala Pro Asn Arg Leu Glu Phe Val
85 90 95

Ala Arg Val Lys Lys Ala Cys Gly Leu Gln Ile Lys Ile Ile Asp Gly
100 105 110

Gln Lys Glu Ala Leu Tyr Gly Gly Ile Ala Cys Ala Asn Leu Leu His
115 120 125

Lys Asn Ser Gly Ile Thr Ile Asp Ile Gly Gly Gly Ser Thr Glu Cys
130 135 140

Ala Leu Ile Glu Lys Gly Lys Ile Lys Asp Leu Ile Ser Leu Asp Val
145 150 155 160

Gly Thr Ile Arg Ile Lys Glu Met Phe Leu Asp Lys Asp Leu Glu Val
165 170 175

Lys Leu Ala Lys Ala Phe Ile Gln Lys Glu Val Ser Lys Leu Pro Phe
180 185 190

Lys His Lys Asn Ala Phe Gly Val Gly Gly Thr Ile Arg Ala Leu Ser
195 200 205

Lys Val Leu Met Lys Arg Phe Cys Tyr Pro Ile Asp Ser Leu His Gly
210 215 220

Tyr Glu Ile Asp Ala His Lys Asn Leu Ala Phe Ile Glu Lys Ile Val
225 230 235 240

Met Leu Lys Glu Asp Gln Leu Arg Leu Leu Gly Val Asn Glu Glu Arg
245 250 255

Leu Asp Ser Ile Arg Ser Gly Ala Leu Ile Leu Ser Val Val Leu Glu
260 265 270

His Leu Lys Thr Ser Leu Met Ile Thr Ser Gly Val Gly Val Arg Glu
275 280 285

Gly Val Phe Leu Ser Asp Leu Leu Arg His His Tyr His Lys Phe Pro
290 295 300
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ATC AGG AGC GGG GCG TTG ATT TTA TCA GTC GTT TTG GAG CAT TTA AAA 8 74 

Ile Arg Ser Gly Ala Leu Ile Leu Ser Val Val Leu Glu His Leu Lys
260 265 270 275

ACT TCT TTA ATG ATC ACT AGT GGG GTG GGG GTG AGA GAA GGC GTG TTT 922
Thr Ser Leu Met Ile Thr Ser Gly Val Gly Val Arg Glu Gly Val Phe
280 285 290

TTG AGC GAT TTA TTG CGC CAT CAT TAC CAT AAA TTC CCC CCC AAT ATC 970
Leu Ser Asp Leu Leu Arg His His Tyr His Lys Phe Pro Pro Asn Ile
295 300 305

AAC CCC TCT CTC ATC TCT TTA AAA GAT CGC TTT TTG CCC CAT GAA AAG 1018
Asn Pro Ser Leu Ile Ser Leu Lys Asp Arg Phe Leu Pro His Glu Lys
310 315 320

CAC AGC CAA AAG GTC AAA AAA GAA TGC GTG AAA TTG TTT GAA GCC TTA 1066
His Ser Gln Lys Val Lys Lys Glu Cys Val Lys Leu Phe Glu Ala Leu
325 330 335

TCG CCT TTG CAT AAA ATA GAT GAA AAA TAC CTT TTC CAT TTA AAG ATT 1114
Ser Pro Leu His Lys Ile Asp Glu Lys Tyr Leu Phe His Leu Lys Ile
340 345 350 355

GCG GGG GAA TTA GCG AGC ATG GGT AAG ATT TTA AGC GTC TAT TTA GCC 1162
Ala Gly Glu Leu Ala Ser Met Gly Lys Ile Leu Ser Val Tyr Leu Ala
360 365 370

CAC AAG CAC AGC GCG TAT TTT ATT TTA AAC GCT TTG AGT TAT GGC TTT 1210
His Lys His Ser Ala Tyr Phe Ile Leu Asn Ala Leu Ser Tyr Gly Phe
375 380 385

AGC CAC CAG GAT AGA GCG ATC ATT TGC TTA TTA GCC CAA TTC AGC CAT 1258
Ser His Gln Asp Arg Ala Ile Ile Cys Leu Leu Ala Gln Phe Ser His
390 395 400

AAA AAA ATC CCT AAA GAC AAC GCT ATC GCC CAC ATG AGC GCG ATG ATG 1306
Lys Lys Ile Pro Lys Asp Asn Ala Ile Ala His Met Ser Ala Met Met
405 410 415

CCA AGC CTT TTA ACC TTA CAA TGG CTG AGT TTT ATC CTT TCT TTA GCC 1354
Pro Ser Leu Leu Thr Leu Gln Trp Leu Ser Phe Ile Leu Ser Leu Ala
420 425 430 435

GAA AAT TTG TGC CTA ACA GAC AGC CAT CAT TTA AAA TAC ACG CTA GAA 1402
Glu Asn Leu Cys Leu Thr Asp Ser His His Leu Lys Tyr Thr Leu Glu
440 445 450

AAA AAC AAG CTT GTG ATC CAT TCT AAT GAC ACG CTT TAC TTG GCT AAA 1450
Lys Asn Lys Leu Val Ile His Ser Asn Asp Thr Leu Tyr Leu Ala Lys
455 460 465

GAA ATG CTC CCC AAA CTC GTT AAG CCC ATT CCT TTG ACG ATA GAG TTT 1498
Glu Met Leu Pro Lys Leu Val Lys Pro Ile Pro Leu Thr Ile Glu Phe
470 475 480
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TCT AAG GTT AGG ATT TCA GAG GGC TGT TAT GCG TTT AAT GGA ATC TTG 2 02 

Ser Lys Val Arg Ile Ser Glu Gly Cys Tyr Ala Phe Asn Gly Ile Leu
40 45 50

CAA GAA ATC CCC ATG CAA CGA GCC GTT AAA GCC TTG AGC GAA TTT AAA 250
Gln Glu Ile Pro Met Gln Arg Ala Val Lys Ala Leu Ser Glu Phe Lys
55 60 65

GAA ATC GCT CTC AAA TAC AAA AGC AAA AAA ATC CTG TGC GTG GCG ACC 298
Glu Ile Ala Leu Lys Tyr Lys Ser Lys Lys Ile Leu Cys Val Ala Thr
70 75 80

TCA GCG GTG CGC GAT GCC CCT AAT CGG CTG GAG TTT GTA GCG AGG GTG 346
Ser Ala Val Arg Asp Ala Pro Asn Arg Leu Glu Phe Val Ala Arg Val
85 90 95

AAA AAG GCT TGC GGT TTG CAA ATC AAA ATC ATT GAT GGG CAA AAA GAA 394
Lys Lys Ala Cys Gly Leu Gln Ile Lys Ile Ile Asp Gly Gln Lys Glu
100 105 110 115

GCG CTC TAT GGC GGG ATT GCG TGC GCG AAT TTG TTG CAT AAA AAT TCA 442
Ala Leu Tyr Gly Gly Ile Ala Cys Ala Asn Leu Leu His Lys Asn Ser
120 125 130

GGG ATC ACG ATA GAT ATT GGA GGG GGT AGC ACC GAG TGC GCG TTG ATT 490
Gly Ile Thr Ile Asp Ile Gly Gly Gly Ser Thr Glu Cys Ala Leu Ile
135 140 145

GAA AAA GGC AAG ATT AAG GAC TTA ATC TCG CTT GAT GTT GGG ACG ATT 538
Glu Lys Gly Lys Ile Lys Asp Leu Ile Ser Leu Asp Val Gly Thr Ile
150 155 160

CGC ATT AAA GAA ATG TTT TTA GAC AAA GAC TTA GAG GTC AAA TTG GCT 586
Arg Ile Lys Glu Met Phe Leu Asp Lys Asp Leu Glu Val Lys Leu Ala
165 170 175

AAA GCC TTT ATC CAA AAA GAA GTC TCT AAA CTG CCC TTT AAA CAC AAA 634
Lys Ala Phe Ile Gln Lys Glu Val Ser Lys Leu Pro Phe Lys His Lys
180 185 190 195

AAC GCC TTT GGG GTG GGG GGG ACG ATC AGA GCG TTG AGT AAG GTA TTG 682
Asn Ala Phe Gly Val Gly Gly Thr Ile Arg Ala Leu Ser Lys Val Leu
200 205 210

ATG AAA CGC TTT TGT TAC CCT ATT GAT TCT TTG CAT GGC TAT GAA ATA 730
Met Lys Arg Phe Cys Tyr Pro Ile Asp Ser Leu His Gly Tyr Glu Ile
215 220 225

GAT GCA CAT AAA AAT TTA GCG TTC ATT GAA AAA ATC GTC ATG CTC AAA 778
Asp Ala His Lys Asn Leu Ala Phe Ile Glu Lys Ile Val Met Leu Lys
230 235 240

GAA GAT CAA TTA CGG CTT TTA GGG GTG AAT GAA GAG CGT TTG GAT AGC 826
Glu Asp Gln Leu Arg Leu Leu Gly Val Asn Glu Glu Arg Leu Asp Ser
245 250 255
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Pro 


Lys Val Cys Lys Ser Ile His Met Pro Leu Gln Ser Gly Ser Ser
245 250 255

Ala Val Leu Lys Met Met Arg Arg Gly Tyr Ser Lys Glu Trp Phe Leu
260 265 270

Asn Arg Val Glu Arg Leu Lys Ala Leu Val Pro Glu Val Gly Ile Ser
275 280 285

Thr Asp Ile Ile Val Gly Phe Pro Asn Glu Ser Asp Lys Asp Phe Glu
290 295 300

Asp Thr Met Glu Val Leu Glu Lys Val Arg Phe Asp Thr Leu Tyr Ser
305 310 315 320

Phe Ile Tyr Ser Pro Arg Pro Phe Thr Glu Ala Gly Ala Trp Lys Glu
325 330 335

Arg Val Pro Leu Glu Val Ser Ser Ser Arg Leu Glu Arg Leu Gln Asn
340 345 350

Arg His Lys Glu Ile Leu Glu Glu Lys Ala Lys Leu Glu Val Gly Lys
355 360 365

Thr His Val Val Leu Val Glu Asn Arg Arg Glu Met Asp Asn Gln Ile
370 375 380

Val Gly Phe Glu Gly Arg Ser Asp Thr Gly Lys Phe Ile Glu Val Thr
385 390 395 400

Cys Lys Glu Lys Arg Asn Pro Gly Glu Leu Val Lys Val Glu Ile Ile
405 410 415

Ser His Ser Lys Gly Arg Leu Met Ala Ala Thr Lys Gly Asn
420 425 430

(2) INFORMATION FOR SEQ ID NO:1251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1530 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 50...1501
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1251:

ATTGAAATAC AAATACGAAA GCTTAAAAGA GCAAGATTAA AGGCTAGCA ATG GCT AAA 58
Met Ala Lys
1

ATC ACA ACC GTG ATT GAT ATA GGC TCT AAT TCA GTG CGT TTG GCT GTC 106
Ile Thr Thr Val Ile Asp Ile Gly Ser Asn Ser Val Arg Leu Ala Val
5 10 15

TTT AAA AAG ACG AGC CAG TTT GGG TTT TAC TTG CTT TTT GAG ACT AAG 154
Phe Lys Lys Thr Ser Gln Phe Gly Phe Tyr Leu Leu Phe Glu Thr Lys
20 25 30 35
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ACG GGG AAA TTC ATT GAA GTA ACT TGT AAG GAA AAA AGA AAC CCG GGC 12 54 

Thr Gly Lys Phe Ile Glu Val Thr Cys Lys Glu Lys Arg Asn Pro Gly
395 400 405

GAG CTT GTA AAA GTG GAG ATT ATT TCT CAT TCC AAA GGG CGC TTG ATG 1302
Glu Leu Val Lys Val Glu Ile Ile Ser His Ser Lys Gly Arg Leu Met
410 415 420

GCG GCC ACT AAA GGC AAC TAATAAAAAT AACCAATGAA AAAGCGGGTT TAAAGG 1356
Ala Ala Thr Lys Gly Asn
425 430



(2) INFORMATION FOR SEQ ID NO: 1250:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 430 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250:



Met Gly Cys Ala Met Asn Ser Arg Asp Ser Glu His Leu Leu Ser Glu
1 5 10 15

Leu Ser Lys Leu Asp Tyr Lys Glu Thr Asn Asp Pro Lys Thr Ala Asp
20 25 30

Leu Ile Leu Ile Asn Thr Cys Ser Val Arg Glu Lys Pro Glu Arg Lys
35 40 45

Leu Phe Ser Glu Ile Gly Gln Phe Ala Lys Ile Lys Lys Pro Asn Ala
50 55 60

Lys Ile Gly Val Cys Gly Cys Thr Ala Ser His Met Gly Ala Asp Ile
65 70 75 80

Leu Lys Lys Ala Pro Ser Val Ser Phe Val Leu Gly Ala Arg Asn Val
85 90 95

Ser Lys Ile Ser Gln Val Ile His Lys Glu Lys Ala Val Glu Val Ala
100 105 110

Ile Asp Tyr Asp Glu Ser Ala Tyr Ala Phe Glu Phe Phe Glu Lys Lys
115 120 125

Ala Gln Ile Arg Ser Leu Leu Asn Ile Ser Ile Gly Cys Asp Lys Lys
130 135 140

Cys Ala Tyr Cys Ile Val Pro His Thr Arg Gly Lys Glu Ile Ser Ile
145 150 155 160

Pro Met Asp Leu Ile Leu Lys Glu Ala Glu Lys Leu Ala Asn Asn Gly
165 170 175

Thr Lys Glu Leu Met Leu Leu Gly Gln Asn Val Asn Asn Tyr Gly Ala
180 185 190

Arg Phe Ser Ser Glu His Ala Lys Val Asp Phe Ser Asp Leu Leu Asp
195 200 205

Lys Leu Ser Glu Ile Gln Gly Ile Glu Arg Ile Arg Phe Thr Ser Pro
210 215 220

His Pro Leu His Met Asn Asp Gly Phe Leu Glu Arg Phe Ala Lys Asn
225 230 235 240
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GCT GAG A7^ TTA GCG AAT AAC GGC ACC AAA GAG CTT ATG CTT TTA GGG 5 82 

Ala Glu Lys Leu Ala Asn Asn Gly Thr Lys Glu Leu Met Leu Leu Gly
170 175 180

CAG AAT GTG AAT AAT TAC GGC GCG CGT TTC AGC AGC GAG CAT GCG AAA 630
Gln Asn Val Asn Asn Tyr Gly Ala Arg Phe Ser Ser Glu His Ala Lys
185 190 195 200

GTG GAT TTT AGC GAT TTG TTG GAT AAA TTG AGC GAA ATC CAG GGG ATT 678
Val Asp Phe Ser Asp Leu Leu Asp Lys Leu Ser Glu Ile Gln Gly Ile
205 210 215

GAA AGG ATA CGA TTC ACT TCG CCT CAC CCC TTG CAC ATG AAT GAT GGA 726
Glu Arg Ile Arg Phe Thr Ser Pro His Pro Leu His Met Asn Asp Gly
220 225 230

TTT TTA GAG CGT TTT GCC AAA AAC CCT AAA GTG TGC AAG AGT ATC CAC 774
Phe Leu Glu Arg Phe Ala Lys Asn Pro Lys Val Cys Lys Ser Ile His
235 240 245

ATG CCT TTA CAG AGC GGA TCT AGC GCG GTG TTA AAG ATG ATG CGA AGG 822
Met Pro Leu Gln Ser Gly Ser Ser Ala Val Leu Lys Met Met Arg Arg
250 255 260

GGT TAT AGT AAG GAG TGG TTT TTA AAT AGG GTG GAG AGG TTA AAA GCT 870
Gly Tyr Ser Lys Glu Trp Phe Leu Asn Arg Val Glu Arg Leu Lys Ala
265 270 275 280

TTA GTG CCT GAA GTG GGC ATT AGC ACG GAT ATT ATC GTA GGC TTC CCT 918
Leu Val Pro Glu Val Gly Ile Ser Thr Asp Ile Ile Val Gly Phe Pro
285 290 295

AAT GAG AGC GAT AAG GAT TTT GAA GAC ACA ATG GAG GTG CTA GAA AAA 966
Asn Glu Ser Asp Lys Asp Phe Glu Asp Thr Met Glu Val Leu Glu Lys
300 305 310

GTG CGC TTT GAC ACG CTC TAT AGT TTC ATT TAT TCC CCA CGC CCT TTC 1014
Val Arg Phe Asp Thr Leu Tyr Ser Phe Ile Tyr Ser Pro Arg Pro Phe
315 320 325

ACT GAA GCG GGA GCT TGG AAG GAA AGA GTG CCG TTA GAA GTT TCA TCT 1062
Thr Glu Ala Gly Ala Trp Lys Glu Arg Val Pro Leu Glu Val Ser Ser
330 335 340

TCA AGG TTG GAG AGG TTG CAA AAC AGG CAC AAA GAA ATT TTA GAA GAA 1110
Ser Arg Leu Glu Arg Leu Gln Asn Arg His Lys Glu Ile Leu Glu Glu
345 350 355 360

AAA GCC AAG CTA GAA GTG GGC AAA ACG CAT GTG GTG TTG GTG GAA AAC 1158
Lys Ala Lys Leu Glu Val Gly Lys Thr His Val Val Leu Val Glu Asn
365 370 375

AGG CGT GAA ATG GAT AAT CAA ATC GTG GGT TTT GAA GGG CGT AGC GAT 1206
Arg Arg Glu Met Asp Asn Gln Ile Val Gly Phe Glu Gly Arg Ser Asp
380 385 390
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31...1320
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249: 

AGGCTTGTAT TGAAAGTTTA TATTGAAACC ATG GGT TGT GCC ATG AAT TCT AGG 54
Met Gly Cys Ala Met Asn Ser Arg
1 5

GAT AGT GAG CAT TTA TTG AGC GAG CTG TCC AAA CTA GAC TAT AAA GAG 102
Asp Ser Glu His Leu Leu Ser Glu Leu Ser Lys Leu Asp Tyr Lys Glu
10 15 20

ACC AAT GAC CCT AAA ACA GCG GAT TTG ATT TTA ATC AAC ACT TGC AGC 150
Thr Asn Asp Pro Lys Thr Ala Asp Leu Ile Leu Ile Asn Thr Cys Ser
25 30 35 40

GTG CGC GAA AAG CCT GAA CGA AAA TTG TTT TCA GAA ATC GGT CAA TTC 198
Val Arg Glu Lys Pro Glu Arg Lys Leu Phe Ser Glu Ile Gly Gln Phe
45 50 55

GCT AAA ATC AAA AAA CCC AAC GCC AAA ATC GGG GTT TGC GGG TGC ACT 246
Ala Lys Ile Lys Lys Pro Asn Ala Lys Ile Gly Val Cys Gly Cys Thr
60 65 70

GCA AGC CAC ATG GGA GCG GAT ATT TTG AAA AAA GCC CCA AGC GTG AGC 294
Ala Ser His Met Gly Ala Asp Ile Leu Lys Lys Ala Pro Ser Val Ser
75 80 85

TTT GTG TTA GGG GCT AGG AAT GTG TCT AAA ATC TCT CAA GTG ATC CAT 342
Phe Val Leu Gly Ala Arg Asn Val Ser Lys Ile Ser Gln Val Ile His
90 95 100

AAA GAA AAA GCG GTT GAA GTG GCG ATT GAT TAT GAT GAA AGC GCG TAT 390
Lys Glu Lys Ala Val Glu Val Ala Ile Asp Tyr Asp Glu Ser Ala Tyr
105 110 115 120

GCG TTT GAA TTT TTT GAA AAA AAG GCT CAA ATC CGA TCG TTG CTA AAT 438
Ala Phe Glu Phe Phe Glu Lys Lys Ala Gln Ile Arg Ser Leu Leu Asn
125 130 135

ATC TCT ATA GGG TGC GAT AAG AAA TGC GCT TAT TGC ATC GTC CCG CAC 486
Ile Ser Ile Gly Cys Asp Lys Lys Cys Ala Tyr Cys Ile Val Pro His
140 145 150

ACT AGG GGG AAA GAA ATT TCT ATC CCT ATG GAT TTG ATT TTA AAA GAA 534
Thr Arg Gly Lys Glu Ile Ser Ile Pro Met Asp Leu Ile Leu Lys Glu
155 160 165
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Ala Gin Phe Phe 
Glu Asn Ser Val Leu Leu Pro Ala Phe Ile Asn Ala
50 55 60

His Thr His Phe Glu Phe Ser Asn Asn Lys Ala Ser Phe Asp Tyr Gly
65 70 75 80

Ser Phe Ser Gly Trp Leu Gly Ser Val Leu Asn Asn Gly Gly Ala Ile
85 90 95

Leu Glu Asn Cys Gln Gly Ala Ile Gln Asn Ala Ile Ser Thr Gln Leu
100 105 110

Lys Ser Gly Val Gly Ser Val Gly Ala Ile Ser Asn His Leu Ile Glu
115 120 125

Val Asn Leu Leu Lys Glu Ser Pro Leu Asn Ala Val Val Phe Leu Glu
130 135 140

Phe Leu Gly Ser Ser Tyr Ser Leu Glu Lys Leu Lys Ala Phe Glu Ala
145 150 155 160

Lys Phe Lys Glu Leu Lys Asp Leu Glu Asp Lys Lys Leu Lys Ala Ala
165 170 175

Leu Ala Val His Ala Pro Tyr Ser Val Gln Lys Asp Met Ala Leu Ser
180 185 190

Val Ile Gln Leu Ala Lys Asp Ser Gln Ser Leu Leu Ser Thr His Phe
195 200 205

Leu Glu Ser Leu Glu Glu Leu Glu Trp Val Glu Asn Ser Lys Gly Trp
210 215 220

Phe Glu Asn Phe Tyr Gln His Phe Leu Lys Glu Ser His Phe Lys Ser
225 230 235 240

Leu Tyr Lys Gly Ala Asn Asp Tyr Ile Asp Met Phe Lys Asp Thr His
245 250 255

Thr Leu Phe Val His Asn Gln Phe Ala Ser Leu Glu Ala Leu Lys Arg
260 265 270

Ile Lys Ser Gln Val Lys Asn Ala Phe Leu Ile Thr Cys Pro Phe Ser
275 280 285

Asn Arg Leu Leu Ser Gly Gln Ala Leu Asp Leu Glu Arg Thr Lys Glu
290 295 300

Ala Gly Leu Ser Val Ser Val Ala Thr Asp Gly Leu Ser Ser Asn Ile
305 310 315 320

Ser Leu Ser Leu Leu Asp Glu Leu Arg Ala Phe Leu Leu Thr His Asn
325 330 335

Met Pro Leu Leu Glu Leu Ala Lys Ile Ala Leu Leu Gly Ala Thr Arg
340 345 350

His Gly Ala Lys Ala Leu Ala Leu Asn Asn Gly Glu Ile Glu Ala Asn
355 360 365

Lys Arg Ala Asp Leu Ser Val Phe Gly Phe Asn Glu Lys Phe Thr Lys
370 375 380

Glu Gln Ala Ile Leu Gln Phe Leu Leu His Ala Lys Glu Val Glu Cys
385 390 395 400

Leu Phe Leu Gly Gly Lys Arg Val Ile
405



(2) INFORMATION FOR SEQ ID NO: 1249: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1356 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
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Gin Val Lys Asn Ala Phe Leu lie Thr Cys Pro Phe Ser Asn Arg Leu 
280 285 290 

TTG AGC GGG CAA GCG TTG GAT TTA GAA AGA ACT AAA GAA GCC GGT TTG 967
Leu Ser Gly Gln Ala Leu Asp Leu Glu Arg Thr Lys Glu Ala Gly Leu
295 300 305

AGC GTG AGC GTG GCC ACT GAT GGC TTG AGT TCT AAC ATT TCG CTG AGC 1015
Ser Val Ser Val Ala Thr Asp Gly Leu Ser Ser Asn Ile Ser Leu Ser
310 315 320

CTT TTA GAC GAA TTA AGA GCG TTT TTG CTC ACC CAT AAC ATG CCG TTA 1063
Leu Leu Asp Glu Leu Arg Ala Phe Leu Leu Thr His Asn Met Pro Leu
325 330 335

TTA GAA TTA GCT AAA ATA GCC CTT TTA GGG GCG ACT AGG CAT GGG GCT 1111
Leu Glu Leu Ala Lys Ile Ala Leu Leu Gly Ala Thr Arg His Gly Ala
340 345 350 355

AAA GCT TTA GCT TTG AAT AAT GGC GAG ATA GAA GCC AAC AAA AGG GCG 1159
Lys Ala Leu Ala Leu Asn Asn Gly Glu Ile Glu Ala Asn Lys Arg Ala
360 365 370

GAT TTG AGC GTG TTT GGT TTT AAT GAA AAA TTC ACT AAA GAG CAA GCG 1207
Asp Leu Ser Val Phe Gly Phe Asn Glu Lys Phe Thr Lys Glu Gln Ala
375 380 385

ATT TTG CAA TTT TTA TTG CAT GCT AAA GAA GTG GAG TGC TTG TTT TTA 1255
Ile Leu Gln Phe Leu Leu His Ala Lys Glu Val Glu Cys Leu Phe Leu
390 395 400

GGG GGG AAA AGG GTG ATC TAATTTGTTT TAAAGACAGA ATGCGTTAAA ATGAGAAA 1311
Gly Gly Lys Arg Val Ile
405

TCTAAATCAA TTAAGGAAAG AGTCAATGAA ACTAGTTTT 1350

(2) INFORMATION FOR SEQ ID NO:1248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1248:

Met Gln Glu Ile Ile Gly Ala Ser Leu Val Phe Leu Cys Asn Glu Lys
1 5 10 15

Cys Glu Val Leu Glu Asp Tyr Gly Val Val Phe Asp Glu Lys Ile Val
20 25 30

Glu Ile Gly Asp Tyr Gln Ser Leu Thr Leu Lys Tyr Pro His Leu Lys
35 40 45
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6 0 6 5 

TTT GAA TTT TCC AAC AAC AAG GCG AGT TTT GAT TAC GGG AGT TTT TCT 295
Phe Glu Phe Ser Asn Asn Lys Ala Ser Phe Asp Tyr Gly Ser Phe Ser
70 75 80

GGC TGG TTA GGG AGC GTG TTA AAC AAT GGG GGG GCG ATT TTA GAA AAT 343
Gly Trp Leu Gly Ser Val Leu Asn Asn Gly Gly Ala Ile Leu Glu Asn
85 90 95

TGC CAA GGG GCT ATT CAA AAC GCT ATC AGC ACG CAA TTA AAA AGC GGG 391
Cys Gln Gly Ala Ile Gln Asn Ala Ile Ser Thr Gln Leu Lys Ser Gly
100 105 110 115

GTG GGG AGC GTG GGA GCG ATT TCT AAC CAC CTG ATA GAA GTT AAT TTG 439
Val Gly Ser Val Gly Ala Ile Ser Asn His Leu Ile Glu Val Asn Leu
120 125 130

TTA AAA GAA AGC CCT TTG AAT GCT GTC GTG TTT TTA GAG TTT TTA GGG 487
Leu Lys Glu Ser Pro Leu Asn Ala Val Val Phe Leu Glu Phe Leu Gly
135 140 145

AGC AGT TAT TCT TTA GAA AAA TTA AAA GCG TTT GAG GCC AAA TTT AAG 535
Ser Ser Tyr Ser Leu Glu Lys Leu Lys Ala Phe Glu Ala Lys Phe Lys
150 155 160

GAA TTA AAA GAT TTA GAA GAT AAA AAA CTT AAA GCG GCT CTC GCT GTG 583
Glu Leu Lys Asp Leu Glu Asp Lys Lys Leu Lys Ala Ala Leu Ala Val
165 170 175

CAT GCC CCT TAT TCG GTC CAA AAA GAC ATG GCT TTG AGC GTC ATC CAA 631
His Ala Pro Tyr Ser Val Gln Lys Asp Met Ala Leu Ser Val Ile Gln
180 185 190 195

TTA GCC AAA GAT TCA CAA AGC CTG CTT TCT ACG CAT TTT TTA GAA TCG 679
Leu Ala Lys Asp Ser Gln Ser Leu Leu Ser Thr His Phe Leu Glu Ser
200 205 210

CTT GAA GAA TTA GAA TGG GTA GAA AAC TCT AAA GGG TGG TTT GAA AAT 727
Leu Glu Glu Leu Glu Trp Val Glu Asn Ser Lys Gly Trp Phe Glu Asn
215 220 225

TTT TAC CAG CAT TTT TTA AAG GAG TCT CAT TTC AAA TCG CTC TAT AAG 775
Phe Tyr Gln His Phe Leu Lys Glu Ser His Phe Lys Ser Leu Tyr Lys
230 235 240

GGC GCG AAC GAT TAC ATT GAC ATG TTT AAA GAC ACG CAC ACT TTA TTC 823
Gly Ala Asn Asp Tyr Ile Asp Met Phe Lys Asp Thr His Thr Leu Phe
245 250 255

GTG CAT AAC CAG TTC GCT TCT TTA GAA GCG TTA AAA AGG ATT AAA TCT 871
Val His Asn Gln Phe Ala Ser Leu Glu Ala Leu Lys Arg Ile Lys Ser
260 265 270 275

CAA GTC AAA AAC GCT TTT TTA ATC ACA TGC CCC TTT TCT AAC CGC CTA 919 
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50 

55 60

Ser Met Pro Phe Val Leu Asn Ile Leu Leu Thr Gln Ala Ile Tyr Pro
65 70 75 80

Ile Glu Thr Arg Ile Leu His Ala Asn Pro Leu Ser Tyr Ser Asn Ala
85 90 95

Phe Ser Leu Gln Val Gly Val Lys Asn His Ser Lys Phe Thr Leu Asn
100 105 110

Lys Cys Val Leu Arg Leu Glu Val Leu Lys Asn Pro His Asn Phe Val
115 120 125

Glu Glu His Ala Phe Lys Trp Phe Val Lys Lys Ser Tyr Glu Lys Ile
130 135 140

Phe Lys Glu Lys Ile Leu Pro Lys Glu Ser Lys Val Phe Ser Phe Phe
145 150 155 160

Ile Asp Asn Tyr Pro Tyr Ser Lys Thr Ala Pro Tyr Gln Val Ser Leu
165 170 175

Phe Cys Leu



(2) INFORMATION FOR SEQ ID NO:1247:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1350 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 47...1273
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1247: 

AAAGAATGAT CTTAAAAGGG CAAACCACAT TTATTAAGGA GAATGC ATG CAA GAA 55
Met Gln Glu
1

ATC ATA GGA GCG TCT TTA GTT TTT TTG TGC AAT GAA AAG TGC GAA GTG 103
Ile Ile Gly Ala Ser Leu Val Phe Leu Cys Asn Glu Lys Cys Glu Val
5 10 15

TTA GAA GAT TAT GGC GTA GTC TTT GAT GAA AAG ATT GTT GAA ATA GGC 151
Leu Glu Asp Tyr Gly Val Val Phe Asp Glu Lys Ile Val Glu Ile Gly
20 25 30 35

GAT TAT CAA AGT TTA ACG CTT AAA TAC CCT CAC TTA AAG GCG CAG TTT 199
Asp Tyr Gln Ser Leu Thr Leu Lys Tyr Pro His Leu Lys Ala Gln Phe
40 45 50

TTT GAA AAT TCC GTT CTG TTG CCC GCT TTT ATC AAC GCG CAC ACC CAT 247
Phe Glu Asn Ser Val Leu Leu Pro Ala Phe Ile Asn Ala His Thr His
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45 50 55 

TTT TTG GGT GCG TTT TTA AGC ATG CCT TTT GTT TTG AAC ATT TTA CTC 244
Phe Leu Gly Ala Phe Leu Ser Met Pro Phe Val Leu Asn Ile Leu Leu
60 65 70

ACT CAA GCG ATT TAC CCC ATA GAA ACA CGC ATC TTA CAC GCT AAC CCT 292
Thr Gln Ala Ile Tyr Pro Ile Glu Thr Arg Ile Leu His Ala Asn Pro
75 80 85 90

TTA AGT TAC AGC AAC GCC TTT TCT TTG CAA GTG GGA GTC AAA AAC CAT 340
Leu Ser Tyr Ser Asn Ala Phe Ser Leu Gln Val Gly Val Lys Asn His
95 100 105

TCC AAA TTT ACT CTA AAC AAA TGC GTT TTA CGC CTA GAA GTG CTT AAA 388
Ser Lys Phe Thr Leu Asn Lys Cys Val Leu Arg Leu Glu Val Leu Lys
110 115 120

AAC CCT CAC AAT TTT GTA GAA GAG CAT GCT TTT AAA TGG TTT GTC AAA 436
Asn Pro His Asn Phe Val Glu Glu His Ala Phe Lys Trp Phe Val Lys
125 130 135

AAA AGC TAT GAA AAA ATT TTT AAA GAA AAG ATT TTG CCC AAA GAA TCT 484
Lys Ser Tyr Glu Lys Ile Phe Lys Glu Lys Ile Leu Pro Lys Glu Ser
140 145 150

AAG GTC TTT TCA TTC TTT ATT GAC AAC TAC CCT TAT TCA AAA ACG GCC 532
Lys Val Phe Ser Phe Phe Ile Asp Asn Tyr Pro Tyr Ser Lys Thr Ala
155 160 165 170

CCT TAT CAA GTT TCT TTG TTT TGT TTA TAAAAAACTA AAAGATAACG CCCAAGA 586
Pro Tyr Gln Val Ser Leu Phe Cys Leu
175

TAACATTCAT TAAAAAGCGA TTAAAAACGC TTAAAGGCAT AGAT 630

(2) INFORMATION FOR SEQ ID NO: 1246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1246: 

Met Ala Ser Leu Ala Phe Ile Gln Ala Phe Leu Glu Ser Phe Lys Gly
1 5 10 15

Phe Leu Ser Gln Ala Thr Leu Ile Ser Val Leu Ile Ala Ser Val Leu
20 25 30

Ile Leu Phe Cys Ala Ile Leu Leu Leu Leu Ala Leu Leu Leu Arg Asn
35 40 45

Arg Leu Ala Ser Tyr Ile Ala Thr Ala Ala Phe Leu Gly Ala Phe Leu
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195 200 205 

Glu Val Ser Lys Leu Pro Asn Thr Pro Val Glu Leu Ser Ser Ile Gln
210 215 220

Leu Arg Glu Ile Val Leu Pro Ala Lys Ile Lys Glu Gln Ile Glu Lys
225 230 235 240

Val Gln Ile Ala Arg Gln Glu Ser Glu Arg Val Lys Tyr Glu Val Glu
245 250 255

Arg Ser Lys Gln Glu Ala Gln Lys Gln Ala Ala Leu Ala Lys Gly Glu
260 265 270

Ala Asp Ala Asn Arg Ile Lys Ala Gln Gly Val Ala Asp Ala Ile Val
275 280 285

Ile Glu Ala Lys Ala Lys Ser Gln Ala Asn Leu Ser Ile Ser Gln Ser
290 295 300

Leu Ser Asp Lys Leu Leu Arg Leu Arg Gln Ile Glu Val Gln Gly Gln
305 310 315 320

Phe Asn Glu Ala Leu Lys Thr Asn Asn Asn Ala Gln Ile Met Leu Thr
325 330 335

Pro Gly Gly Ala Val Pro Asn Ile Trp Ile Asp Thr Lys Ser Lys Val
340 345 350

Lys Ser Ser Ile Ala Glu Thr Lys Glu Pro
355 360



(2) INFORMATION FOR SEQ ID NO: 1245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 23... 559 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1245: 

GACTAAAGAG CCTTAAAAAC GC ATG GCA TCT CTT GCC TTT ATC CAA GCT TTT 52
Met Ala Ser Leu Ala Phe Ile Gln Ala Phe
1 5 10

TTG GAG TCT TTT AAG GGA TTT TTA AGT CAA GCG ACT CTA ATC AGC GTT 100
Leu Glu Ser Phe Lys Gly Phe Leu Ser Gln Ala Thr Leu Ile Ser Val
15 20 25

TTA ATA GCG AGC GTT TTA ATC CTT TTT TGC GCG ATT TTG CTC CTT TTG 148
Leu Ile Ala Ser Val Leu Ile Leu Phe Cys Ala Ile Leu Leu Leu Leu
30 35 40

GCT CTG CTT TTG AGA AAC CGC TTA GCT AGC TAT ATA GCA ACA GCA GCT 196
Ala Leu Leu Leu Arg Asn Arg Leu Ala Ser Tyr Ile Ala Thr Ala Ala
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TTA AG A CTG CGC CAA ATT GAA GTT CAA GGC CAG TTT AAT GAA GCG TTA 1014 
Leu Arg Leu Arg Gln Ile Glu Val Gln Gly Gln Phe Asn Glu Ala Leu
310 315 320 325

AAA ACG AAC AAT AAC GCT CAA ATC ATG CTC ACT CCA GGT GGG GCT GTG 1062
Lys Thr Asn Asn Asn Ala Gln Ile Met Leu Thr Pro Gly Gly Ala Val
330 335 340

CCT AAT ATT TGG ATT GAC ACT AAA AGC AAG GTT AAA TCT AGT ATT GCC 1110
Pro Asn Ile Trp Ile Asp Thr Lys Ser Lys Val Lys Ser Ser Ile Ala
345 350 355

GAG ACT AAA GAG CCT TAAAAACGCA TGGCATCTCT TGCCTTTATC CAAGCTTTTT T 1166
Glu Thr Lys Glu Pro
360

GGAGTCTTTT AAGGGATTTT TAAGTCAAGC GACTCT 1202

(2) INFORMATION FOR SEQ ID NO: 1244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244: 

Met Pro Ile Asp Leu Asn Glu His Leu Lys Lys Lys Asn Ser Gln Arg
1 5 10 15

Glu Thr Pro Thr Pro Asn Thr Pro Asn Asn Gly Gly Arg Phe Ile Pro
20 25 30

Pro Ser Asn Ser Phe Asn Ser Lys Lys Leu Ser Val Leu Ile Val Ile
35 40 45

Val Leu Leu Gly Val Ile Ala Phe Leu Ala Lys Pro Phe Glu Val Ile
50 55 60

Ser Ser Gly Glu Ile Gly Ile Lys Ile Thr Ala Gly Lys Tyr Glu Pro
65 70 75 80

Thr Pro Leu Gln Pro Gly Ile His Phe Phe Val Pro Ile Ile Gln Asp
85 90 95

Ile Leu Ile Val Asp Thr Arg Ile Arg Asn Ile Asn Phe Ser Arg Thr
100 105 110

Glu Asp Met Gly Val Ala Gly Lys Asn Gln Gly Ile Phe Arg Asn Asp
115 120 125

Ala Ile Asn Val Met Asp Ser Arg Gly Leu Thr Val Ser Ile Glu Leu
130 135 140

Thr Val Gln Tyr Arg Leu Asn Pro Gln Thr Thr Pro Gln Thr Ile Ala
145 150 155 160

Thr Tyr Gly Leu Ser Trp Glu Gln Lys Ile Ile Asn Pro Val Val Arg
165 170 175

Asp Val Val Arg Ser Val Val Gly Arg Tyr Pro Ala Glu Asp Leu Pro
180 185 190

Ile Lys Arg Asn Glu Ile Ala Ala Leu Ile Asn Ser Gly Ile Asn Lys
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Gly Ile His Phe Phe Val Pro Ile Ile Gln Asp Ile Leu Ile Val Asp
90 95 100

ACA AGG ATT AGG AAT ATC AAT TTT TCA CGC ACC GAA GAG ATG GGC GTG 390
Thr Arg Ile Arg Asn Ile Asn Phe Ser Arg Thr Glu Asp Met Gly Val
105 110 115

GCG GGT AAA AAC CAA GGG ATT TTT AGA AAC GAG GCT ATT AAT GTG ATG 438
Ala Gly Lys Asn Gln Gly Ile Phe Arg Asn Asp Ala Ile Asn Val Met
120 125 130

GAT AGT AGG GGT TTG ACC GTT TCT ATT GAA CTC ACC GTG CAA TAC CGC 486
Asp Ser Arg Gly Leu Thr Val Ser Ile Glu Leu Thr Val Gln Tyr Arg
135 140 145

TTA AAC CCC CAA ACC ACC CCC CAA ACG ATC GCT ACT TAT GGC TTG TCT 534
Leu Asn Pro Gln Thr Thr Pro Gln Thr Ile Ala Thr Tyr Gly Leu Ser
150 155 160 165

TGG GAG CAA AAA ATC ATC AAC CCT GTG GTG CGC GAT GTG GTG CGC TCT 582
Trp Glu Gln Lys Ile Ile Asn Pro Val Val Arg Asp Val Val Arg Ser
170 175 180

GTC GTG GGG CGC TAT CCG GCT GAA GAT TTA CCC ATT AAG CGC AAT GAA 630
Val Val Gly Arg Tyr Pro Ala Glu Asp Leu Pro Ile Lys Arg Asn Glu
185 190 195

ATC GCC GCT CTT ATT AAT AGC GGT ATC AAT AAA GAA GTT TCT AAG CTC 678
Ile Ala Ala Leu Ile Asn Ser Gly Ile Asn Lys Glu Val Ser Lys Leu
200 205 210

CCT AAC ACC CCT GTG GAA TTA AGC TCT ATC CAA TTG AGA GAA ATC GTC 726
Pro Asn Thr Pro Val Glu Leu Ser Ser Ile Gln Leu Arg Glu Ile Val
215 220 225

TTG CCC GCT AAG ATT AAA GAG CAA ATA GAA AAA GTC CAA ATC GCG CGC 774
Leu Pro Ala Lys Ile Lys Glu Gln Ile Glu Lys Val Gln Ile Ala Arg
230 235 240 245

CAA GAA TCA GAA AGG GTG AAA TAC GAG GTG GAG CGC TCC AAG CAA GAA 822
Gln Glu Ser Glu Arg Val Lys Tyr Glu Val Glu Arg Ser Lys Gln Glu
250 255 260

GCT CAA AAA CAA GCC GCT CTG GCT AAA GGG GAA GCG GAC GCT AAC AGG 870
Ala Gln Lys Gln Ala Ala Leu Ala Lys Gly Glu Ala Asp Ala Asn Arg
265 270 275

ATT AAG GCT CAG GGC GTG GCT GAT GCG ATT GTG ATT GAG GCT AAG GCA 918
Ile Lys Ala Gln Gly Val Ala Asp Ala Ile Val Ile Glu Ala Lys Ala
280 285 290

AAA TCT CAA GCT AAT TTA AGC ATT TCG CAA AGC TTG AGC GAC AAG CTT 966
Lys Ser Gln Ala Asn Leu Ser Ile Ser Gln Ser Leu Ser Asp Lys Leu
295 300 305
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Asn Asn Ile Glu Lys Ile Leu His Asn Ala Phe Asn Thr Phe Leu His
385 390 395 400

Asn Pro Thr Ile Ala Leu Lys Lys Asn Ala Gln Lys Glu Glu Ser Asp
405 410 415

Val Leu Val Gly Ala Ile Lys Asn Leu Phe Asn Leu Asp Lys Ser Asn
420 425 430

Ala Asn His Ala Gln Asn Leu Asn Leu Tyr Lys Cys Glu Tyr Tyr Glu
435 440 445

Glu



(2) INFORMATION FOR SEQ ID NO: 1243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1202 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 40...1125
(D) OTHER INFORMATION: 
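
Each coding-sequence LOCATION in this listing can be cross-checked against the LENGTH of the corresponding protein entry: a 1-based, inclusive range of N bases should hold N/3 codons. The sketch below is an illustrative consistency check, not part of the filed listing. The function and variable names are hypothetical, and the pairing of each nucleotide entry with the next-numbered protein entry is an assumption based on their matching translations.

```python
# Coding-sequence locations and protein lengths as read from this listing.
# Assumption: each nucleotide entry N is translated in protein entry N + 1.
ENTRIES = [
    # (nucleotide SEQ ID, CDS start, CDS end, protein SEQ ID, protein length)
    (1233, 12, 1115, 1234, 368),
    (1235, 22, 999, 1236, 326),
    (1237, 19, 375, 1238, 119),
    (1239, 16, 660, 1240, 215),
    (1241, 16, 1362, 1242, 449),
    (1243, 40, 1125, 1244, 362),
]


def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons in a 1-based, inclusive CDS range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


def check_entries(entries=ENTRIES):
    """Return (nucleotide id, protein id, consistent?) for every entry."""
    return [
        (nuc, prot, cds_codon_count(start, end) == n_residues)
        for nuc, start, end, prot, n_residues in entries
    ]


if __name__ == "__main__":
    for nuc, prot, ok in check_entries():
        status = "consistent" if ok else "MISMATCH"
        print(f"SEQ ID NO: {nuc} -> SEQ ID NO: {prot}: {status}")
```

For SEQ ID NO: 1243 the range 40...1125 spans 1086 bases, or 362 codons, which agrees with the 362 amino acids stated for SEQ ID NO: 1244. The stop codon lies outside the stated range.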



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1243: 

AAAACCCAAA CGCCGTTAAA ATATTTAAAA AGGAAATTC ATG CCC ATT GAT TTG 54
Met Pro Ile Asp Leu
1 5

AAC GAA CAT TTA AAA AAG AAA AAT TCT CAA AGA GAA ACC CCC ACG CCT 102
Asn Glu His Leu Lys Lys Lys Asn Ser Gln Arg Glu Thr Pro Thr Pro
10 15 20

AAT ACG CCT AAT AAT GGG GGG CGT TTC ATC CCG CCG TCT AAT TCT TTT 150
Asn Thr Pro Asn Asn Gly Gly Arg Phe Ile Pro Pro Ser Asn Ser Phe
25 30 35

AAT TCT AAA AAA CTA TCG GTT TTA ATT GTC ATT GTC CTT TTA GGC GTT 198
Asn Ser Lys Lys Leu Ser Val Leu Ile Val Ile Val Leu Leu Gly Val
40 45 50

ATC GCT TTT TTG GCC AAG CCT TTT GAA GTG ATT AGC TCA GGA GAA ATT 246
Ile Ala Phe Leu Ala Lys Pro Phe Glu Val Ile Ser Ser Gly Glu Ile
55 60 65

GGC ATT AAA ATC ACC GCC GGG AAA TAC GAA CCC ACC CCC TTA CAG CCA 294
Gly Ile Lys Ile Thr Ala Gly Lys Tyr Glu Pro Thr Pro Leu Gln Pro
70 75 80 85

GGG ATC CAC TTC TTT GTG CCT ATC ATT CAA GAC ATT CTC ATT GTG GAT 342
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(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1242: 

Met Glu Leu Glu Thr His Leu Ser Lys Tyr Phe Thr Leu Ala Phe Thr
1 5 10 15

His Lys Ser Met Ser Leu Glu Met Arg Glu Lys Leu Ala Ile Asn Ser
20 25 30

Asn Ala Thr Leu Lys Glu Phe Leu Gln Thr Ile Lys Asn His Cys Pro
35 40 45

Asn Ile Lys Glu Cys Met Val Leu Ser Thr Cys Asn Arg Phe Glu Ile
50 55 60

Tyr Ala Ser Leu Lys His Gly Ala Asn Thr Asn Glu Gln Lys Asn Ala
65 70 75 80

Leu Leu Lys Ile Leu Ala Gln Asn Lys Lys Met Ser Val Ser Asp Leu
85 90 95

Glu Lys Cys Val Leu Met Asn Thr Asp Glu Ser Ala Val His His Val
100 105 110

Phe Ser Val Cys Ser Ser Leu Asp Ser Leu Val Val Gly Glu Thr Gln
115 120 125

Ile Thr Gly Gln Met Lys Asn Ala Tyr Lys Phe Ala Phe Glu Glu Lys
130 135 140

Phe Cys Ser Lys Asp Leu Thr Arg Leu Leu His Phe Ala Phe Lys Cys
145 150 155 160

Ala Ala Lys Val Arg Asn Leu Thr Gly Ile Ser Lys Gln Gly Val Ser
165 170 175

Ile Ser Ser Val Ala Val Lys Glu Ala Leu Asn Ile Phe Glu Lys Glu
180 185 190

Arg Ile Lys Asp Lys Lys Ala Leu Val Ile Gly Leu Gly Glu Met Ala
195 200 205

Gln Leu Val Ile Lys His Leu Leu Asn Lys Gln Phe Glu Ala Leu Ile
210 215 220

Leu Gly Arg Asn Ala Ala Lys Phe Glu Asp Phe Ile Lys Glu Leu Glu
225 230 235 240

Glu Pro Lys Lys Val Ser Phe Gln Asn Ile Glu Asn Leu Asn Ala Tyr
245 250 255

Ile Asn Glu Tyr Glu Leu Leu Phe Cys Ala Thr Ser Ser Pro His Phe
260 265 270

Ile Val Gln Asn Arg Met Leu Lys Glu Thr Ile Phe Arg Arg Phe Trp
275 280 285

Phe Asp Leu Ala Val Pro Arg Asn Ile Glu Lys Pro Val Leu Asp Asn
290 295 300

Ile Phe Leu Tyr Ser Val Asp Asp Leu Glu Pro Met Val Arg Glu Asn
305 310 315 320

Val Glu Asn Arg Gln Glu Ser Arg Met Arg Ala Tyr Glu Ile Val Gly
325 330 335

Leu Ala Thr Met Glu Phe Tyr Gln Trp Ile Gln Ser Leu Glu Val Glu
340 345 350

Pro Val Ile Lys Asp Leu Arg Glu Leu Ala Arg Ile Ser Ala Gln Lys
355 360 365

Glu Leu Gln Lys Ala Leu Lys Lys Arg Tyr Val Pro Lys Glu Tyr Glu
370 375 380
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TCG CCG CAT TTT ATC GTG CAA AAT CGC ATG TTA AAA GAA ACG ATT TTC 867
Ser Pro His Phe Ile Val Gln Asn Arg Met Leu Lys Glu Thr Ile Phe
270 275 280

AGG CGT TTT TGG TTT GAT TTA GCC GTG CCA CGG AAT ATT GAA AAG CCG 915
Arg Arg Phe Trp Phe Asp Leu Ala Val Pro Arg Asn Ile Glu Lys Pro
285 290 295 300

GTA TTG GAT AAT ATT TTC TTA TAG AGC GTT GAT GAT TTA GAG CCT ATG 963
Val Leu Asp Asn Ile Phe Leu Tyr Ser Val Asp Asp Leu Glu Pro Met
305 310 315

GTG AGA GAA AAT GTG GAA AAC AGG CAA GAG AGC AGA ATG AGA GCT TAT 1011
Val Arg Glu Asn Val Glu Asn Arg Gln Glu Ser Arg Met Arg Ala Tyr
320 325 330

GAG ATT GTA GGG CTT GCC ACA ATG GAG TTT TAC CAA TGG ATT CAA AGT 1059
Glu Ile Val Gly Leu Ala Thr Met Glu Phe Tyr Gln Trp Ile Gln Ser
335 340 345

TTA GAA GTA GAG CCT GTG ATT AAG GAT TTA AGG GAA TTG GCT AGG ATT 1107
Leu Glu Val Glu Pro Val Ile Lys Asp Leu Arg Glu Leu Ala Arg Ile
350 355 360

TCA GCC CAA AAA GAA TTG CAA AAA GCG CTT AAA AAA CGC TAT GTG CCT 1155
Ser Ala Gln Lys Glu Leu Gln Lys Ala Leu Lys Lys Arg Tyr Val Pro
365 370 375 380

AAA GAA TAC GAA AAC AAC ATT GAA AAG ATC TTG CAC AAC GCT TTC AAC 1203
Lys Glu Tyr Glu Asn Asn Ile Glu Lys Ile Leu His Asn Ala Phe Asn
385 390 395

ACT TTT TTG CAT AAC CCT ACC ATC GCC TTA AAA AAG AAC GCT CAA AAA 1251
Thr Phe Leu His Asn Pro Thr Ile Ala Leu Lys Lys Asn Ala Gln Lys
400 405 410

GAA GAA TCC GAT GTG CTT GTG GGT GCG ATT AAA AAC TTG TTT AAT TTA 1299
Glu Glu Ser Asp Val Leu Val Gly Ala Ile Lys Asn Leu Phe Asn Leu
415 420 425

GAC AAA TCT AAC GCT AAC CAT GCC CAG AAT TTG AAT CTC TAT AAA TGC 1347
Asp Lys Ser Asn Ala Asn His Ala Gln Asn Leu Asn Leu Tyr Lys Cys
430 435 440

GAA TAT TAC GAG GAA TAATGCATGC TATTTTCAAA ACTCTTTGCC CCCACTCTCA A 1403
Glu Tyr Tyr Glu Glu
445

AGAA 1407

(2) INFORMATION FOR SEQ ID NO:1242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 amino acids

(B) TYPE: amino acid 
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AAC CAT TGC CCT AAC ATC AAA GAG TGC ATG GTG TTA TCC ACA TGC AAT 195
Asn His Cys Pro Asn Ile Lys Glu Cys Met Val Leu Ser Thr Cys Asn
45 50 55 60

CGC TTT GAA ATC TAT GCG AGC CTA AAA CAC GGC GCT AAT ACT AAT GAA 243
Arg Phe Glu Ile Tyr Ala Ser Leu Lys His Gly Ala Asn Thr Asn Glu
65 70 75

CAA AAA AAC GCA CTA TTA AAG ATT TTG GCT CAA AAT AAA AAA ATG AGC 291
Gln Lys Asn Ala Leu Leu Lys Ile Leu Ala Gln Asn Lys Lys Met Ser
80 85 90

GTG TCT GAT TTA GAA AAA TGC GTT TTA ATG AAC ACT GAT GAA AGC GCA 339
Val Ser Asp Leu Glu Lys Cys Val Leu Met Asn Thr Asp Glu Ser Ala
95 100 105

GTC CAT CAT GTC TTT AGC GTG TGC AGC AGT TTG GAT AGC TTG GTG GTT 387
Val His His Val Phe Ser Val Cys Ser Ser Leu Asp Ser Leu Val Val
110 115 120

GGG GAA ACT CAA ATC ACA GGG CAG ATG AAA AAC GCT TAT AAA TTC GCT 435
Gly Glu Thr Gln Ile Thr Gly Gln Met Lys Asn Ala Tyr Lys Phe Ala
125 130 135 140

TTT GAA GAG AAA TTT TGC TCT AAA GAT TTA ACC CGA TTG CTC CAT TTT 483
Phe Glu Glu Lys Phe Cys Ser Lys Asp Leu Thr Arg Leu Leu His Phe
145 150 155

GCT TTC AAA TGC GCC GCT AAA GTG CGC AAT TTA ACC GGC ATT TCC AAG 531
Ala Phe Lys Cys Ala Ala Lys Val Arg Asn Leu Thr Gly Ile Ser Lys
160 165 170

CAA GGG GTT TCC ATC TCT TCA GTG GCG GTC AAA GAA GCG CTT AAT ATT 579
Gln Gly Val Ser Ile Ser Ser Val Ala Val Lys Glu Ala Leu Asn Ile
175 180 185

TTT GAA AAA GAA AGG ATT AAG GAT AAA AAA GCC CTT GTG ATA GGG CTT 627
Phe Glu Lys Glu Arg Ile Lys Asp Lys Lys Ala Leu Val Ile Gly Leu
190 195 200

GGC GAG ATG GCT CAA TTA GTC ATC AAG CAC CTT TTA AAC AAG CAA TTT 675
Gly Glu Met Ala Gln Leu Val Ile Lys His Leu Leu Asn Lys Gln Phe
205 210 215 220

GAA GCG CTT ATC TTA GGG CGT AAT GCG GCT AAA TTT GAA GAT TTC ATC 723
Glu Ala Leu Ile Leu Gly Arg Asn Ala Ala Lys Phe Glu Asp Phe Ile
225 230 235

AAA GAA TTA GAA GAA CCT AAA AAA GTA AGC TTT CAA AAT ATA GAA AAT 771
Lys Glu Leu Glu Glu Pro Lys Lys Val Ser Phe Gln Asn Ile Glu Asn
240 245 250

TTA AAC GCT TAT ATC AAT GAA TAC GAA CTG CTT TTT TGC GCC ACT TCT 819
Leu Asn Ala Tyr Ile Asn Glu Tyr Glu Leu Leu Phe Cys Ala Thr Ser
255 260 265
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35 40 45 

Gly Ala Phe Val Ile Leu Glu Glu Gln Lys Asp Lys Ser Tyr Lys Val
50 55 60

Val Glu Glu Tyr Pro Ser Ser Arg Thr His Ile Ile Val Arg Asp Leu
65 70 75 80

Gln Gly Asn Glu Arg Val Leu Ser Asn Glu Glu Ile Gln Lys Leu Ile
85 90 95

Lys Glu Glu Glu Ala Lys Ile Asp Asn Gly Thr Ser Lys Leu Val Gln
100 105 110

Pro Asn Asn Gly Gly Ser Asn Glu Gly Ser Gly Phe Gly Leu Gly Ser
115 120 125

Ala Ile Leu Gly Ser Ala Ala Gly Ala Ile Leu Gly Ser Tyr Ile Gly
130 135 140

Asn Lys Leu Phe Asn Asn Pro Asn Tyr Gln Gln Asn Ala Gln Arg Thr
145 150 155 160

Tyr Lys Ser Pro Gln Ala Tyr Gln Arg Ser Gln Asn Ser Phe Ser Lys
165 170 175

Ser Ala Pro Ser Ala Ser Ser Met Gly Gly Ala Ser Lys Gly Gln Ser
180 185 190

Gly Phe Phe Gly Ser Ser Arg Pro Thr Ser Ser Pro Ala Val Ser Ser
195 200 205

Gly Thr Arg Gly Phe Asn Ser
210 215

(2) INFORMATION FOR SEQ ID NO: 1241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1407 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16...1362
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241: 

TTATAGGACT TTTTA ATG GAG TTA GAA ACT CAT TTG TCA AAA TAT TTC ACC 51
Met Glu Leu Glu Thr His Leu Ser Lys Tyr Phe Thr
1 5 10

CTA GCC TTT ACG CAT AAA AGC ATG AGC TTA GAA ATG CGA GAA AAA CTC 99
Leu Ala Phe Thr His Lys Ser Met Ser Leu Glu Met Arg Glu Lys Leu
15 20 25

GCT ATT AAT TCG AAT GCA ACG CTT AAA GAA TTT TTA CAA ACC ATT AAA 147
Ala Ile Asn Ser Asn Ala Thr Leu Lys Glu Phe Leu Gln Thr Ile Lys
30 35 40
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GTG CGC GAT TTG CAA GGC AAT GAA CGC GTG TTA AGC AAT GAA GAG ATT 291
Val Arg Asp Leu Gln Gly Asn Glu Arg Val Leu Ser Asn Glu Glu Ile
80 85 90

CAA AAG CTC ATC AAA GAA GAA GAA GCT AAA ATT GAT AAC GGC ACG AGC 339
Gln Lys Leu Ile Lys Glu Glu Glu Ala Lys Ile Asp Asn Gly Thr Ser
95 100 105

AAG CTT GTC CAG CCT AAT AAT GGA GGG AGT AAT GAA GGC TCA GGC TTT 387
Lys Leu Val Gln Pro Asn Asn Gly Gly Ser Asn Glu Gly Ser Gly Phe
110 115 120

GGC TTG GGG AGC GCG ATT TTA GGG AGC GCG GCG GGG GCG ATT TTA GGG 435
Gly Leu Gly Ser Ala Ile Leu Gly Ser Ala Ala Gly Ala Ile Leu Gly
125 130 135 140

AGT TAT ATT GGT AAT AAG CTT TTC AAT AAC CCT AAT TAG CAG CAA AAC 483
Ser Tyr Ile Gly Asn Lys Leu Phe Asn Asn Pro Asn Tyr Gln Gln Asn
145 150 155

GCC CAA CGG ACC TAC AAA TCC CCA CAA GCT TAC CAA CGC TCT CAA AAT 531
Ala Gln Arg Thr Tyr Lys Ser Pro Gln Ala Tyr Gln Arg Ser Gln Asn
160 165 170

TCC TTT TCT AAA AGT GCG CCC AGT GCT TCA AGC ATG GGC GGA GCG AGT 579
Ser Phe Ser Lys Ser Ala Pro Ser Ala Ser Ser Met Gly Gly Ala Ser
175 180 185

AAG GGA CAG AGC GGG TTT TTT GGC TCT AGT AGG CCT ACT AGT TCA CCG 627
Lys Gly Gln Ser Gly Phe Phe Gly Ser Ser Arg Pro Thr Ser Ser Pro
190 195 200

GCG GTA AGC TCT GGG ACA AGG GGC TTT AAC TCA TAATTTAATT GATTCAAGGC 680 
Ala Val Ser Ser Gly Thr Arg Gly Phe Asn Ser 
205 210 215 

TAAAAA 686 
(2) INFORMATION FOR SEQ ID NO: 1240:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1240:

Met Lys Lys Pro Tyr Arg Lys Ile Ser Asp Tyr Ala Ile Val Gly Gly
1 5 10 15

Leu Ser Ala Leu Val Met Val Ser Ile Val Gly Cys Lys Ser Asn Ala
20 25 30

Asp Asp Lys Pro Lys Glu Gln Ser Ser Leu Ser Gln Ser Val Gln Lys
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Met Leu Ile Met Asp Trp Lys Leu Lys Val Val Lys Glu Ile Ile Thr
1 5 10 15

Ile Thr Ala Thr Thr Ala Thr Met Gly Ile Leu Thr Thr Tyr Ser Leu
20 25 30

Asn Thr Asn Met Ser Thr Ile Lys Glu Lys Pro Ala Lys Lys Val Glu
35 40 45

Ser Leu Val Leu Ala Pro Asp Phe Ala Leu Met Ile Asp Cys Pro Thr
50 55 60

Lys Ala His Pro Val Ile Pro Pro Lys Ser Pro Glu Met Ile Phe Pro
65 70 75 80

Lys Pro Cys Ala Leu Val Ser Leu Phe Leu Ser Leu Thr Pro Ser Phe
85 90 95

Lys Ile Thr Ser Lys Val Leu Ala Val Asn Lys Leu Ser Ile Val Pro
100 105 110

Thr Ser Ala Lys Glu Arg Ala
115

(2) INFORMATION FOR SEQ ID NO: 1239:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 686 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...660
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1239:

TATAAGGTTG CTCTC ATG AAA AAA CCC TAT AGG AAG ATT TCT GAT TAT GCG 51
Met Lys Lys Pro Tyr Arg Lys Ile Ser Asp Tyr Ala
1 5 10

ATC GTG GGT GGT TTG AGC GCG TTA GTG ATG GTG AGC ATT GTG GGG TGT 99
Ile Val Gly Gly Leu Ser Ala Leu Val Met Val Ser Ile Val Gly Cys
15 20 25

AAG AGC AAT GCT GAT GAC AAA CCA AAA GAG CAA AGC TCT TTA AGT CAA 147
Lys Ser Asn Ala Asp Asp Lys Pro Lys Glu Gln Ser Ser Leu Ser Gln
30 35 40

AGC GTT CAA AAA GGC GCG TTT GTG ATT TTA GAA GAG CAA AAG GAT AAA 195
Ser Val Gln Lys Gly Ala Phe Val Ile Leu Glu Glu Gln Lys Asp Lys
45 50 55 60

TCT TAC AAG GTT GTT GAA GAA TAC CCC AGC TCA AGA ACC CAC ATT ATA 243
Ser Tyr Lys Val Val Glu Glu Tyr Pro Ser Ser Arg Thr His Ile Ile
65 70 75
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19...375
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237:

TAGAGCTTGA TTTTTTTA ATG TTA ATA ATG GAT TGG AAA TTA AAA GTG GTG 51
Met Leu Ile Met Asp Trp Lys Leu Lys Val Val
1 5 10

AAA GAA ATC ATC ACC ATT ACC GCC ACA ACC GCC ACA ATG GGG ATC TTA 99
Lys Glu Ile Ile Thr Ile Thr Ala Thr Thr Ala Thr Met Gly Ile Leu
15 20 25

ACC ACA TAT TCA TTA AAC ACT AAT ATG AGC ACC ATT AAA GAA AAG CCG 147
Thr Thr Tyr Ser Leu Asn Thr Asn Met Ser Thr Ile Lys Glu Lys Pro
30 35 40

GCA AAA AAA GTA GAA AGC CTT GTT TTA GCC CCG GAT TTT GCG TTA ATG 195
Ala Lys Lys Val Glu Ser Leu Val Leu Ala Pro Asp Phe Ala Leu Met
45 50 55

ATA GAC TGC CCC ACT AAA GCG CAC CCT GTC ATT CCC CCC AAA AGC CCT 243
Ile Asp Cys Pro Thr Lys Ala His Pro Val Ile Pro Pro Lys Ser Pro
60 65 70 75

GAG ATG ATA TTC CCC AAG CCT TGC GCT TTA GTT TCT TTA TTT TTA TCG 291
Glu Met Ile Phe Pro Lys Pro Cys Ala Leu Val Ser Leu Phe Leu Ser
80 85 90

CTC ACG CCG TCT TTT AAA ATC ACA TCT AAA GTT TTA GCC GTC AAT AAG 339
Leu Thr Pro Ser Phe Lys Ile Thr Ser Lys Val Leu Ala Val Asn Lys
95 100 105

CTT TCT ATC GTT CCC ACT AGT GCT AAA GAA AGA GCG TAAGGCAACA ACTCTA 391
Leu Ser Ile Val Pro Thr Ser Ala Lys Glu Arg Ala
110 115

TCATTATTTT AAAATCCAAA TTT 414 
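
The three-letter translation printed under each coding line can be re-derived from the bases, which is a practical way to catch transcription errors in either line. The sketch below is illustrative only: it assumes the standard genetic code applies to these reading frames, and the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical helpers, not part of the listing.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon, with codons enumerated in
# TCAG order (third base varying fastest). "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> list:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so a line can be pasted as printed in the listing.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


if __name__ == "__main__":
    # First eleven codons of the coding sequence of SEQ ID NO: 1237.
    first_line = "ATG TTA ATA ATG GAT TGG AAA TTA AAA GTG GTG"
    print(" ".join(translate(first_line)))
```

Run on the first coding line of SEQ ID NO: 1237, this reproduces the residues Met Leu Ile Met Asp Trp Lys Leu Lys Val Val given beneath that line.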

(2) INFORMATION FOR SEQ ID NO: 1238:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238:
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1236:

Met Ala Lys His Asp Leu Val Gly Ser Val Leu Trp Asp Ala Tyr Ser
1 5 10 15

Lys Glu Val Gln Arg Arg Met Asp Asn Pro Thr His Leu Gly Val Ile
20 25 30

Thr Glu Glu Gln Ala Lys Ala Lys Asn Ala Lys Leu Ile Val Ala Asp
35 40 45

Tyr Gly Ala Glu Ala Cys Gly Asp Ala Val Arg Leu Tyr Trp Leu Val
50 55 60

Asp Glu Ser Thr Asp Arg Ile Val Asp Ala Lys Phe Lys Ser Phe Gly
65 70 75 80

Cys Gly Thr Ala Ile Ala Ser Ser Asp Met Met Val Glu Leu Cys Leu
85 90 95

Asn Lys Arg Val Gln Asp Ala Val Lys Ile Thr Asn Leu Asp Val Glu
100 105 110

Arg Gly Leu Arg Asp Asp Pro Asp Thr Pro Ala Val Pro Gly Gln Lys
115 120 125

Met His Cys Ser Val Met Ala Tyr Asp Val Ile Lys Lys Ala Ala Gly
130 135 140

Met Tyr Leu Gly Lys Asn Ala Glu Asp Phe Glu Glu Glu Ile Ile Val
145 150 155 160

Cys Glu Cys Ala Arg Val Ser Leu Gly Thr Ile Lys Glu Val Ile Lys
165 170 175

Leu Asn Asp Leu Lys Ser Val Glu Glu Ile Thr Asn Tyr Thr Lys Ala
180 185 190

Gly Ala Phe Cys Lys Ser Cys Val Arg Pro Gly Gly His Glu Lys Arg
195 200 205

Asp Tyr Tyr Leu Val Asp Ile Leu Lys Glu Val Arg Glu Glu Met Glu
210 215 220

Ala Glu Lys Leu Lys Ala Thr Ala Asn Lys Ser Gln Ser Gly Glu Leu
225 230 235 240

Ala Phe Arg Glu Met Thr Met Val Gln Lys Ile Lys Ala Val Asp Lys
245 250 255

Val Ile Asp Glu Asn Ile Arg Pro Met Leu Met Met Asp Gly Gly Asp
260 265 270

Leu Glu Ile Leu Asp Ile Lys Glu Ser Asp Asp Tyr Ile Asp Val Tyr
275 280 285

Ile Arg Tyr Met Gly Ala Cys Asp Gly Cys Met Ser Ala Thr Thr Gly
290 295 300

Thr Leu Phe Ala Ile Glu Asn Ala Leu Gln Glu Leu Leu Asp Arg Ser
305 310 315 320

Ile Arg Val Leu Pro Ile
325



(2) INFORMATION FOR SEQ ID NO: 1237:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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GAA GAA GAA ATC ATC GTG TGC GAG TGC GCT AGG GTG AGT TTA GGT ACG 531
Glu Glu Glu Ile Ile Val Cys Glu Cys Ala Arg Val Ser Leu Gly Thr
155 160 165 170

ATT AAA GAA GTG ATT AAG CTC AAT GAT TTA AAA AGC GTT GAA GAA ATC 579
Ile Lys Glu Val Ile Lys Leu Asn Asp Leu Lys Ser Val Glu Glu Ile
175 180 185

ACT AAC TAC ACC AAA GCC GGT GCT TTT TGT AAA AGC TGT GTG AGG CCT 627
Thr Asn Tyr Thr Lys Ala Gly Ala Phe Cys Lys Ser Cys Val Arg Pro
190 195 200

GGA GGG CAT GAA AAA AGG GAT TAT TAC TTG GTG GAT ATT CTT AAA GAA 675
Gly Gly His Glu Lys Arg Asp Tyr Tyr Leu Val Asp Ile Leu Lys Glu
205 210 215

GTG CGC GAA GAA ATG GAA GCT GAA AAA CTT AAA GCG ACC GCT AAT AAA 723
Val Arg Glu Glu Met Glu Ala Glu Lys Leu Lys Ala Thr Ala Asn Lys
220 225 230

TCC C7^ AGC GGA GAA TTG GCT TTC AGG GAA ATG ACT ATG GTT CAA AAG 771
Ser Gln Ser Gly Glu Leu Ala Phe Arg Glu Met Thr Met Val Gln Lys
235 240 245 250

ATT AAA GCG GTG GAT AAA GTC ATT GAT GAA AAT ATC CGC CCG ATG CTT 819
Ile Lys Ala Val Asp Lys Val Ile Asp Glu Asn Ile Arg Pro Met Leu
255 260 265

ATG ATG GAT GGA GGG GAT TTA GAG ATT TTA GAG ATT AAA GAA AGC GAT 867
Met Met Asp Gly Gly Asp Leu Glu Ile Leu Asp Ile Lys Glu Ser Asp
270 275 280

GAT TAC ATT GAT GTG TAT ATC CGC TAC ATG GGG GCA TGT GAT GGG TGC 915
Asp Tyr Ile Asp Val Tyr Ile Arg Tyr Met Gly Ala Cys Asp Gly Cys
285 290 295

ATG AGC GCG ACT ACC GGG ACT TTA TTT GCC ATT GAA AAC GCT TTG CAG 963
Met Ser Ala Thr Thr Gly Thr Leu Phe Ala Ile Glu Asn Ala Leu Gln
300 305 310

GAA TTA TTG GAT CGC AGT ATC AGG GTG TTA CCG ATT TGAACTTTTT AGGGGG 1015
Glu Leu Leu Asp Arg Ser Ile Arg Val Leu Pro Ile
315 320 325

TGGAGGCCTT 1025

(2) INFORMATION FOR SEQ ID NO: 1236:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22...999
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1235: 

AAAGAATATA AAGGAATCAA A ATG GCA AAA CAT GAT TTA GTG GGT TCG GTT 51
Met Ala Lys His Asp Leu Val Gly Ser Val
1 5 10

CTC TGG GAC GCA TAT TCT AAA GAA GTT CAA AGO CGC ATG GAC AAC CCC 99
Leu Trp Asp Ala Tyr Ser Lys Glu Val Gln Arg Arg Met Asp Asn Pro
15 20 25

ACG CAT TTA GGG GTC ATC ACC GAA GAG CAG GCT AAA GCC AAA AAC GCT 147
Thr His Leu Gly Val Ile Thr Glu Glu Gln Ala Lys Ala Lys Asn Ala
30 35 40

AAG CTC ATT GTG GCG GAT TAT GGC GCA GAG GCA TGC GGT GAT GCG GTG 195
Lys Leu Ile Val Ala Asp Tyr Gly Ala Glu Ala Cys Gly Asp Ala Val
45 50 55

AGG TTG TAT TGG CTT GTA GAT GAA AGC ACG GAT AGA ATT GTT GAC GCG 243
Arg Leu Tyr Trp Leu Val Asp Glu Ser Thr Asp Arg Ile Val Asp Ala
60 65 70

AAG TTT AAA AGC TTT GGT TGC GGA ACA GCG ATC GCA AGC TCA GAC ATG 291
Lys Phe Lys Ser Phe Gly Cys Gly Thr Ala Ile Ala Ser Ser Asp Met
75 80 85 90

ATG GTA GAG TTG TGC TTG AAT AAA AGA GTC CAA GAT GCG GTA AAA ATC 339
Met Val Glu Leu Cys Leu Asn Lys Arg Val Gln Asp Ala Val Lys Ile
95 100 105

ACG AAT TTA GAT GTG GAA AGA GGC TTG AGA GAC GAT CCG GAC ACG CCG 387
Thr Asn Leu Asp Val Glu Arg Gly Leu Arg Asp Asp Pro Asp Thr Pro
110 115 120

GCG GTG CCT GGG CAA AAA ATG CAC TGC TCG GTG ATG GCG TAT GAT GTG 435
Ala Val Pro Gly Gln Lys Met His Cys Ser Val Met Ala Tyr Asp Val
125 130 135

ATC AAA AAA GCT GCC GGC ATG TAT TTG GGG AAA AAC GCT GAA GAT TTT 483
Ile Lys Lys Ala Ala Gly Met Tyr Leu Gly Lys Asn Ala Glu Asp Phe
140 145 150
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1234: 

Met Val Val Leu Gly Ser Thr Gly Ser Ile Gly Lys Asn Ala Leu Lys
1 5 10 15

Ile Ala Lys Lys Phe Gly Ile Glu Ile Glu Ala Leu Ser Cys Gly Lys
20 25 30

Asn Ile Ala Leu Ile Asn Glu Gln Ile Gln Val Phe Lys Pro Lys Lys
35 40 45

Val Ala Ile Leu Asp Pro Ser Asp Leu Asn Asp Leu Glu Pro Leu Gly
50 55 60

Ala Glu Val Phe Val Gly Leu Glu Gly Ile Asp Ala Met Ile Glu Glu
65 70 75 80

Cys Thr Ser Asn Leu Val Leu Asn Ala Ile Val Gly Val Ala Gly Leu
85 90 95

Lys Ala Ser Phe Lys Ser Leu Gln Arg Asn Lys Lys Leu Ala Leu Ala
100 105 110

Asn Lys Glu Ser Leu Val Ser Ala Gly His Leu Leu Asp Ile Ser Gln
115 120 125

Ile Thr Pro Ile Asp Ser Glu His Phe Gly Leu Trp Ala Leu Leu Gln
130 135 140

Asn Lys Thr Leu Lys Pro Lys Ser Leu Ile Ile Ser Ala Ser Gly Gly
145 150 155 160

Ala Phe Arg Asp Thr Pro Leu Glu Phe Ile Pro Ile Gln Asn Ala Gln
165 170 175

Asn Ala Leu Lys His Pro Asn Trp Ser Met Gly Ser Lys Ile Thr Ile
180 185 190

Asp Ser Ala Ser Met Val Asn Lys Leu Phe Glu Ile Leu Glu Thr Tyr
195 200 205

Trp Leu Phe Gly Ala Ser Leu Lys Ile Asp Ala Leu Ile Glu Arg Ser
210 215 220

Ser Ile Val His Ala Leu Val Glu Phe Glu Asp Asn Ser Ile Ile Ala
225 230 235 240

His Leu Ala Ser Ala Asp Met Gln Leu Pro Ile Ser Tyr Ala Ile Asp
245 250 255

Pro Lys Leu Ala Ser Leu Ser Ala Ser Ile Lys Pro Leu Asp Leu Tyr
260 265 270

Ala Leu Ser Ala Ile Lys Phe Glu Pro Ile Ser Met Glu Arg Tyr Thr
275 280 285

Leu Trp Cys Tyr Lys Asp Leu Leu Leu Glu Asn Pro Lys Leu Gly Val
290 295 300

Val Leu Asn Ala Ser Asn Glu Val Ala Met Glu Lys Phe Leu Asn Lys
305 310 315 320

Glu Ile Ala Phe Gly Gly Leu Ile Gln Thr Ile Ser Gln Ala Leu Glu
325 330 335

Ser Tyr Asp Lys Met Pro Phe Lys Leu Ser Ser Leu Glu Glu Val Leu
340 345 350

Glu Leu Asp Lys Glu Val Arg Glu Arg Phe Lys Asn Val Ala Gly Val
355 360 365

(2) INFORMATION FOR SEQ ID NO: 1235: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1025 base pairs 
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Ile Thr Ile Asp Ser Ala Ser Met Val Asn Lys Leu Phe Glu Ile Leu
190 195 200 205

GAA ACT TAT TGG CTT TTT GGC GCG TCT TTA AAG ATT GAT GCG CTG ATT 674
Glu Thr Tyr Trp Leu Phe Gly Ala Ser Leu Lys Ile Asp Ala Leu Ile
210 215 220

GAA AGG AGT TCT ATC GTG CAT GCT TTG GTG GAG TTT GAA GAC AAC TCT 722
Glu Arg Ser Ser Ile Val His Ala Leu Val Glu Phe Glu Asp Asn Ser
225 230 235

ATC ATC GCG CAT TTA GCG AGC GCA GAT ATG CAA TTA CCC ATA AGC TAT 770
Ile Ile Ala His Leu Ala Ser Ala Asp Met Gln Leu Pro Ile Ser Tyr
240 245 250

GCG ATC GAT CCG AAG TTG GCC TCT TTG AGC GCG TCT ATC AAG CCC TTA 818
Ala Ile Asp Pro Lys Leu Ala Ser Leu Ser Ala Ser Ile Lys Pro Leu
255 260 265

GAT CTA TAC GCT TTA AGC GCG ATT AAA TTT GAA CCC ATT AGC ATG GAG 866
Asp Leu Tyr Ala Leu Ser Ala Ile Lys Phe Glu Pro Ile Ser Met Glu
270 275 280 285

CGC TAC ACT TTG TGG TGT TAT AAA GAC TTA CTG CTA GAA AAC CCT AAG 914
Arg Tyr Thr Leu Trp Cys Tyr Lys Asp Leu Leu Leu Glu Asn Pro Lys
290 295 300

CTT GGC GTG GTG CTG AAT GCG AGC AAT GAA GTG GCG ATG GAG AAG TTT 962
Leu Gly Val Val Leu Asn Ala Ser Asn Glu Val Ala Met Glu Lys Phe
305 310 315

TTA AAC AAA GAG ATC GCT TTT GGT GGC CTT ATC CAA ACC ATT TCT CAA 1010
Leu Asn Lys Glu Ile Ala Phe Gly Gly Leu Ile Gln Thr Ile Ser Gln
320 325 330

GCC TTA GAA TCA TAC GAT AAA ATG CCT TTC AAG CTC TCT AGT TTA GAA 1058
Ala Leu Glu Ser Tyr Asp Lys Met Pro Phe Lys Leu Ser Ser Leu Glu
335 340 345

GAA GTG CTG GAA TTA GAC AAA GAA GTT AGG GAG CGT TTT AAA AAT GTA 1106
Glu Val Leu Glu Leu Asp Lys Glu Val Arg Glu Arg Phe Lys Asn Val
350 355 360 365

GCG GGA GTG TAGTATAATA AGATTTTGCT TCTAATAGCG TTTTATTTCA AT 1157
Ala Gly Val



(2) INFORMATION FOR SEQ ID NO: 1234: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 368 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
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(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1233:

TAGGGGATTG A ATG GTT GTT TTA GGA AGC ACC GGC TCT ATT GGG AAA AAC 50
Met Val Val Leu Gly Ser Thr Gly Ser Ile Gly Lys Asn
1 5 10

GCC CTA AAA ATC GCA AAA AAA TTT GGC ATA GAA ATA GAG GCC TTA AGC 98
Ala Leu Lys Ile Ala Lys Lys Phe Gly Ile Glu Ile Glu Ala Leu Ser
15 20 25

TGT GGG AAA AAT ATC GCT TTA ATC AAT GAA CAA ATC CAA GTT TTC AAA 146
Cys Gly Lys Asn Ile Ala Leu Ile Asn Glu Gln Ile Gln Val Phe Lys
30 35 40 45

CCC AAG AAA GTG GCG ATT TTA GAT CCT AGC GAT TTG AAT GAT TTA GAG 194
Pro Lys Lys Val Ala Ile Leu Asp Pro Ser Asp Leu Asn Asp Leu Glu
50 55 60

CCT TTG GGT GCG GAA GTG TTT GTG GGG TTA GAG GGC ATT GAT GCG ATG 242
Pro Leu Gly Ala Glu Val Phe Val Gly Leu Glu Gly Ile Asp Ala Met
65 70 75

ATA GAA GAG TGC ACC TCA AAT TTA GTC CTT AAC GCC ATT GTG GGC GTG 290
Ile Glu Glu Cys Thr Ser Asn Leu Val Leu Asn Ala Ile Val Gly Val
80 85 90

GCA GGA TTG AAA GCG AGC TTT AAA AGC TTA CAA AGG AAT AAA AAA CTG 338
Ala Gly Leu Lys Ala Ser Phe Lys Ser Leu Gln Arg Asn Lys Lys Leu
95 100 105

GCC CTA GCG AAT AAA GAA AGC TTA GTG AGC GCG GGG CAT TTA TTA GAC 386
Ala Leu Ala Asn Lys Glu Ser Leu Val Ser Ala Gly His Leu Leu Asp
110 115 120 125

ATT TCA CAA ATC ACG CCC ATT GAT AGC GAG CAT TTT GGT TTG TGG GCG 434
Ile Ser Gln Ile Thr Pro Ile Asp Ser Glu His Phe Gly Leu Trp Ala
130 135 140

TTG TTG CAA AAC AAG ACT TTA AAG CCT AAA TCC TTA ATC ATT AGC GCG 482
Leu Leu Gln Asn Lys Thr Leu Lys Pro Lys Ser Leu Ile Ile Ser Ala
145 150 155

AGT GGG GGG GCT TTC AGG GAC ACG CCT TTA GAA TTT ATT CCT ATT CAA 530
Ser Gly Gly Ala Phe Arg Asp Thr Pro Leu Glu Phe Ile Pro Ile Gln
160 165 170

AAC GCG CAA AAT GCG CTC AAG CAC CCT AAT TGG AGC ATG GGA TCT AAA 578
Asn Ala Gln Asn Ala Leu Lys His Pro Asn Trp Ser Met Gly Ser Lys
175 180 185

ATC ACC ATT GAT TCA GCG AGC ATG GTC AAT AAG CTT TTT GAA ATC CTA 626
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Gly Asp 
180 

TCGCAAAAAA ATTTGGC 661
(2) INFORMATION FOR SEQ ID NO: 1232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232: 

Met Val Met Ala Ser Val Ile Ala Tyr Gln Lys Ala His His Ser Glu
1 5 10 15

Ala Ile Leu Pro Phe Leu Tyr Pro Gly Val Gly Phe Phe Ala Leu Phe
20 25 30

Gly Val Tyr Lys Asp Phe Gly Ala Val Ala Ile Ile Trp Leu Leu Val
35 40 45

Val Val Val Ala Ser Asp Val Gly Ala Phe Phe Gly Gly Lys Leu Leu
50 55 60

Gly Lys Thr Pro Phe Thr Pro Thr Ser Pro Asn Lys Thr Leu Glu Gly
65 70 75 80

Ala Leu Ile Gly Val Val Leu Ala Ser Val Leu Gly Ser Phe Val Gly
85 90 95

Met Gly Lys Leu Ser Gly Gly Phe Phe Met Ala Leu Phe Phe Ser Phe
100 105 110

Leu Ile Ala Leu Val Ala Val Phe Gly Asp Leu Tyr Glu Ser Tyr Leu
115 120 125

Lys Arg Lys Val Gly Ile Lys Asp Ser Gly Lys Ile Leu Pro Gly His
130 135 140

Gly Gly Val Leu Asp Arg Leu Asp Ser Met Leu Phe Gly Ala Leu Gly
145 150 155 160

Leu His Ala Leu Leu Tyr Phe Leu Glu Ile Trp Lys Glu Thr Ala Val
165 170 175

Phe Leu Gly Asp
180

(2) INFORMATION FOR SEQ ID NO: 1233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 12...1115
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(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231: 

GGTGGCGGCG TATTTTAACG GGCGTCCTAT AGAATGCGCT CTTATTAGCG CC ATG GTC 58
Met Val
1

ATG GCT AGT GTT ATC GCT TAT CAA AAA GCG CAC CAT AGC GAA GCC ATT 106
Met Ala Ser Val Ile Ala Tyr Gln Lys Ala His His Ser Glu Ala Ile
5 10 15

TTA CCC TTT TTG TAT CCG GGC GTT GGG TTT TTT GCG CTT TTT GGG GTT 154
Leu Pro Phe Leu Tyr Pro Gly Val Gly Phe Phe Ala Leu Phe Gly Val
20 25 30

TAT AAG GAT TTT GGT GCA GTA GCG ATC ATT TGG CTT TTA GTC GTG GTG 202
Tyr Lys Asp Phe Gly Ala Val Ala Ile Ile Trp Leu Leu Val Val Val
35 40 45 50

GTT GCA AGC GAT GTG GGG GCG TTT TTT GGA GGC AAG CTT TTA GGC AAA 250
Val Ala Ser Asp Val Gly Ala Phe Phe Gly Gly Lys Leu Leu Gly Lys
55 60 65

ACC CCT TTC ACG CCC ACT TCG CCG AAT AAA ACC TTA GAG GGC GCG TTG 298
Thr Pro Phe Thr Pro Thr Ser Pro Asn Lys Thr Leu Glu Gly Ala Leu
70 75 80

ATT GGC GTG GTT TTG GCG AGC GTT TTA GGA TCG TTT GTG GGC ATG GGG 346
Ile Gly Val Val Leu Ala Ser Val Leu Gly Ser Phe Val Gly Met Gly
85 90 95

AAA TTG AGC GGA GGC TTT TTT ATG GCG CTC TTT TTT AGT TTT TTA ATC 394
Lys Leu Ser Gly Gly Phe Phe Met Ala Leu Phe Phe Ser Phe Leu Ile
100 105 110

GCT CTT GTG GCG GTG TTT GGG GAT TTG TAT GAA AGC TAT TTG AAA AGA 442
Ala Leu Val Ala Val Phe Gly Asp Leu Tyr Glu Ser Tyr Leu Lys Arg
115 120 125 130

AAG GTC GGT ATC AAA GAT AGC GGT AAG ATT TTA CCC GGG CAT GGG GGC 490
Lys Val Gly Ile Lys Asp Ser Gly Lys Ile Leu Pro Gly His Gly Gly
135 140 145

GTT TTA GAC CGG TTG GAT TCC ATG CTT TTT GGG GCT TTA GGC TTG CAT 538
Val Leu Asp Arg Leu Asp Ser Met Leu Phe Gly Ala Leu Gly Leu His
150 155 160

GCG CTG TTG TAT TTT TTA GAA ATT TGG AAA GAA ACG GCG GTG TTT TTA 586
Ala Leu Leu Tyr Phe Leu Glu Ile Trp Lys Glu Thr Ala Val Phe Leu
165 170 175 

GGG GAT TGAATGGTTG TTTTAGGAAG CACCGGCTCT ATTGGGAAAA ACGCCCTAAA AA 644 
225 230 235 240

Ser Ile Cys Leu Met Leu Gly Ile Ala Tyr Ala Ala Ser Ile Gly Ser
245 250 255

Leu Gly Thr Leu Ile Gly Thr Pro Pro Asn Ala Leu Leu Ala Gly Tyr
260 265 270

Met Lys Thr Ala Phe Asn Ile Glu Ile Asp Phe Ala Gln Trp Met Val
275 280 285

Phe Gly Thr Pro Leu Ala Phe Ile Met Leu Ile Leu Ala Trp Leu Leu
290 295 300

Leu Thr Tyr Val Ile Phe Pro Leu Lys Ile Lys Glu Ile Pro Gly Gly
305 310 315 320

Lys Glu Val Ile Arg Val Glu Leu Lys Lys Leu Gly Arg Leu Ser Gln
325 330 335

Ala Glu Ile Ser Val Gly Ile Ile Phe Ile Leu Ala Ser Leu Gly Trp
340 345 350

Ile Phe Leu Gly Val Met Leu Lys Ser Trp Gly Val Lys Ile Asp Lys
355 360 365

Ile Asp Ser Val Ile Ala Met Gly Val Ser Ala Leu Leu Phe Ile Leu
370 375 380

Pro Ala Asn His Gln Gly Asp Arg Leu Ile Asp Trp Gly Val Ala Lys
385 390 395 400

Lys Leu Pro Trp Asp Val Leu Leu Leu Phe Gly Gly Gly Leu Ala Leu
405 410 415

Ser Ala Gln Phe Ser Lys Thr Gly Leu Ser Leu Trp Ile Gly His Leu
420 425 430

Val Ser Gly Phe Ser His Leu Pro Ile Leu Phe Ile Ile Val Met Val
435 440 445

Thr Leu Met Val Ile Phe Leu Thr Glu Ile Thr Ser Asn Thr Ala Thr
450 455 460

Ala Ala Ala Phe Leu Pro Val Ile Gly Gly Val Ala Met Gly Met Gly
465 470 475 480

Tyr Glu Asn His Gln Ser Leu Leu Leu Thr Ile Pro Val Ala Leu Ser
485 490 495

Ala Thr Cys Ala Phe Met Leu Pro Val Val Thr Pro Pro Asn Ala Ile
500 505 510

Ala Tyr Gly Ser Gly Tyr Val Lys Ile Thr Asp Met Ile Lys Ala Gly
515 520 525

Leu Trp Leu Asn Leu Val Gly Val Val Leu Ile Ser Thr Phe Ser Tyr
530 535 540

Phe Leu Val Ser Leu Ile Phe Asn
545 550

(2) INFORMATION FOR SEQ ID NO: 1231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 53... 592 
AAA ATA ACG GAC ATG ATT AAA GCC GGT TTG TGG CTT AAT CTG GTA GGA 1637
Lys Ile Thr Asp Met Ile Lys Ala Gly Leu Trp Leu Asn Leu Val Gly
520 525 530 535

GTT GTT TTG ATT AGC ACG TTT AGC TAT TTT TTG GTT TCG TTA ATA TTT 1685
Val Val Leu Ile Ser Thr Phe Ser Tyr Phe Leu Val Ser Leu Ile Phe
540 545 550

AAT TGATTAAGGA AAAAAGTGAA AGAAGAGTTA TTTAAAGAAA AATCTCGTTA CATTAC 1744
Asn

AGGGTTTGTT TTAATC 1760

(2) INFORMATION FOR SEQ ID NO: 1230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1230:

Met Glu Asn His Ser His Ala Asn Thr His Thr Asp Thr Arg Thr Asp
1 5 10 15

Asp Lys Ser Thr Lys Ile Val Arg Leu Leu Gly Leu Ile Gly Gly Ala
20 25 30

Leu Ile Ala Leu Val Ile Tyr Tyr Ala Leu Asn Ser Gln Met Pro His
35 40 45

Ile Val Glu Glu Ile Pro Lys Leu Ser Ser Leu Asn Tyr Lys Ala Met
50 55 60

Pro Val Val Ala Gly Val Ala Val Leu Met Gly Ile Trp Trp Met Thr
65 70 75 80

Glu Ala Ile Asp Leu Pro Ala Thr Ala Leu Leu Pro Leu Val Leu Phe
85 90 95

Ser Val Phe Ser Val Asp Gln Phe Ala Ser Val Ser Ser Ser Tyr Ala
100 105 110

Ser Pro Ile Ile Phe Leu Phe Met Gly Gly Phe Ile Leu Ala Leu Ser
115 120 125

Met Gln Lys Trp Asn Leu His Thr Arg Ile Ala Leu Ser Ile Ile Leu
130 135 140

Leu Val Gly Thr Ser Pro Arg Arg Leu Ile Leu Gly Phe Met Met Ala
145 150 155 160

Thr Gly Phe Leu Ser Met Trp Val Ser Asn Thr Ala Thr Ala Val Met
165 170 175

Met Leu Pro Val Gly Met Ser Val Leu Gln Leu Val Ala Lys Leu Val
180 185 190

Gly Lys Glu Asp Ala Ser Asn Ser Trp His Gln Lys Glu Glu Ile Thr
195 200 205

Lys Ala His Gly Gly Ile Met Ser Asn Ile Val His Lys Gly Lys Asp
210 215 220

Ile Thr Gln Val Ile Gln Glu Lys Thr Thr Ile Tyr Arg Thr Asn Phe
ATC ATG CTC ATT TTA GCG TGG CTC TTG CTC ACT TAT GTG ATT TTC CCT 965
Ile Met Leu Ile Leu Ala Trp Leu Leu Leu Thr Tyr Val Ile Phe Pro
300 305 310

TTA AAG ATT AAA GAA ATC CCA GGG GGT AAG GAA GTC ATT AGG GTA GAG 1013
Leu Lys Ile Lys Glu Ile Pro Gly Gly Lys Glu Val Ile Arg Val Glu
315 320 325

TTA AAA AAA TTA GGC CGT TTG AGT CAG GCG GAA ATC TCT GTG GGG ATT 1061
Leu Lys Lys Leu Gly Arg Leu Ser Gln Ala Glu Ile Ser Val Gly Ile
330 335 340

ATT TTT ATT TTA GCG TCT TTA GGG TGG ATT TTT TTA GGC GTA ATG TTA 1109
Ile Phe Ile Leu Ala Ser Leu Gly Trp Ile Phe Leu Gly Val Met Leu
345 350 355

AAA TCT TGG GGC GTT AAG ATA GAT AAA ATT GAT TCA GTG ATC GCT ATG 1157
Lys Ser Trp Gly Val Lys Ile Asp Lys Ile Asp Ser Val Ile Ala Met
360 365 370 375

GGG GTT TCT GCG CTT TTA TTC ATT TTG CCC GCT AAC CAT CAG GGC GAT 1205
Gly Val Ser Ala Leu Leu Phe Ile Leu Pro Ala Asn His Gln Gly Asp
380 385 390

AGG CTC ATT GAT TGG GGT GTT GCT AAA AAA CTC CCT TGG GAT GTG TTG 1253
Arg Leu Ile Asp Trp Gly Val Ala Lys Lys Leu Pro Trp Asp Val Leu
395 400 405

CTT TTA TTT GGC GGC GGG TTA GCC TTG AGC GCG CAA TTT TCT AAA ACC 1301
Leu Leu Phe Gly Gly Gly Leu Ala Leu Ser Ala Gln Phe Ser Lys Thr
410 415 420

GGG TTG AGT TTG TGG ATC GGG CAT TTA GTC TCT GGC TTT TCG CAT TTA 1349
Gly Leu Ser Leu Trp Ile Gly His Leu Val Ser Gly Phe Ser His Leu
425 430 435

CCG ATT TTA TTC ATC ATT GTC ATG GTT ACT TTA ATG GTC ATT TTC TTA 1397
Pro Ile Leu Phe Ile Ile Val Met Val Thr Leu Met Val Ile Phe Leu
440 445 450 455

ACC GAA ATC ACT TCT AAC ACC GCC ACC GCT GCC GCA TTT TTA CCG GTG 1445
Thr Glu Ile Thr Ser Asn Thr Ala Thr Ala Ala Ala Phe Leu Pro Val
460 465 470

ATT GGA GGG GTT GCG ATG GGC ATG GGT TAT GAA AAC CAT CAG AGC TTG 1493
Ile Gly Gly Val Ala Met Gly Met Gly Tyr Glu Asn His Gln Ser Leu
475 480 485

TTA TTG ACC ATT CCT GTA GCC TTG AGT GCG ACT TGC GCG TTC ATG CTC 1541
Leu Leu Thr Ile Pro Val Ala Leu Ser Ala Thr Cys Ala Phe Met Leu
490 495 500

CCT GTG GTC ACC CCA CCG AAT GCA ATA GCT TAT GGC TCT GGG TAT GTT 1589
Pro Val Val Thr Pro Pro Asn Ala Ile Ala Tyr Gly Ser Gly Tyr Val
505 510 515
Val Leu Met Gly Ile Trp Trp Met Thr Glu Ala Ile Asp Leu Pro Ala
75 80 85

ACC GCG CTT TTA CCT TTG GTG CTT TTT AGC GTC TTT AGC GTG GAT CAA 341
Thr Ala Leu Leu Pro Leu Val Leu Phe Ser Val Phe Ser Val Asp Gln
90 95 100

TTC GCT AGC GTC AGC TCT TCT TAC GCA TCG CCG ATC ATC TTT CTT TTT 389
Phe Ala Ser Val Ser Ser Ser Tyr Ala Ser Pro Ile Ile Phe Leu Phe
105 110 115

ATG GGA GGG TTT ATT TTA GCC CTA AGC ATG CAA AAA TGG AAT TTG CAC 437
Met Gly Gly Phe Ile Leu Ala Leu Ser Met Gln Lys Trp Asn Leu His
120 125 130 135

ACG CGC ATC GCT TTA AGC ATT ATT TTA TTA GTA GGC ACA AGC CCT AGG 485
Thr Arg Ile Ala Leu Ser Ile Ile Leu Leu Val Gly Thr Ser Pro Arg
140 145 150

AGG TTG ATT TTA GGT TTC ATG ATG GCT ACA GGC TTT CTG TCT ATG TGG 533
Arg Leu Ile Leu Gly Phe Met Met Ala Thr Gly Phe Leu Ser Met Trp
155 160 165

GTG AGC AAT ACC GCA ACG GCG GTG ATG ATG CTC CCT GTT GGC ATG AGC 581
Val Ser Asn Thr Ala Thr Ala Val Met Met Leu Pro Val Gly Met Ser
170 175 180

GTT TTG CAA TTA GTC GCT AAA CTG GTG GGC AAA GAA GAC GCC TCT AAT 629
Val Leu Gln Leu Val Ala Lys Leu Val Gly Lys Glu Asp Ala Ser Asn
185 190 195

TCA TGG CAT CAA AAA GAA GAA ATC ACC AAA GCG CAT GGG GGT ATT ATG 677
Ser Trp His Gln Lys Glu Glu Ile Thr Lys Ala His Gly Gly Ile Met
200 205 210 215

AGT AAT ATC GTG CAT AAG GGT AAA GAT ATT ACT CAA GTC ATT CAA GAA 725
Ser Asn Ile Val His Lys Gly Lys Asp Ile Thr Gln Val Ile Gln Glu
220 225 230

AAG ACT ACT ATC TAT CGC ACG AAT TTC AGT ATT TGC TTG ATG CTT GGC 773
Lys Thr Thr Ile Tyr Arg Thr Asn Phe Ser Ile Cys Leu Met Leu Gly
235 240 245

ATC GCT TAT GCG GCT TCT ATT GGC TCT TTA GGC ACT TTG ATT GGC ACG 821
Ile Ala Tyr Ala Ala Ser Ile Gly Ser Leu Gly Thr Leu Ile Gly Thr
250 255 260

CCG CCT AAC GCT TTA TTG GCC GGC TAT ATG AAA ACC GCT TTC AAT ATT 869
Pro Pro Asn Ala Leu Leu Ala Gly Tyr Met Lys Thr Ala Phe Asn Ile
265 270 275

GAA ATT GAT TTC GCT CAG TGG ATG GTG TTT GGG ACG CCG TTA GCC TTT 917
Glu Ile Asp Phe Ala Gln Trp Met Val Phe Gly Thr Pro Leu Ala Phe
280 285 290 295
Gly Lys His His Ile Arg Val Asn Ala Leu Ser Ala Gly Pro Ile Arg
180 185 190

Thr Leu Ala Ser Ser Gly Ile Ala Asp Phe Arg Met Ile Leu Lys Trp
195 200 205

Asn Glu Ile Asn Ala Pro Leu Arg Lys Asn Val Ser Leu Glu Glu Val
210 215 220

Gly Asn Ala Gly Met Tyr Leu Leu Ser Ser Leu Ser Ser Gly Val Ser
225 230 235 240

Gly Glu Val His Phe Val Asp Ala Gly Tyr His Val Met Gly Met Gly
245 250 255

Ala Val Glu Glu Lys Asp Asn Lys Ala Thr Leu Leu Trp Asp Leu His
260 265 270

Lys Glu Gln
275

(2) INFORMATION FOR SEQ ID NO: 1229:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1760 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 33...1688
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1229:

TGGAGGTTAG TAATTTTAAA GGGTAAAATA AA ATG GAA AAT CAT TCG CAT GCC 53
Met Glu Asn His Ser His Ala
1 5

AAT ACG CAT ACC GAT ACG CGC ACC GAT GAT AAA AGC ACT AAG ATC GTG 101
Asn Thr His Thr Asp Thr Arg Thr Asp Asp Lys Ser Thr Lys Ile Val
10 15 20

CGC TTG TTG GGG TTA ATA GGG GGA GCG TTA ATC GCG CTT GTT ATC TAC 149
Arg Leu Leu Gly Leu Ile Gly Gly Ala Leu Ile Ala Leu Val Ile Tyr
25 30 35

TAT GCG CTC AAT TCT CAA ATG CCT CAT ATT GTA GAA GAA ATC CCC AAG 197
Tyr Ala Leu Asn Ser Gln Met Pro His Ile Val Glu Glu Ile Pro Lys
40 45 50 55

CTC AGT TCT TTG AAT TAT AAG GCG ATG CCT GTT GTG GCA GGG GTG GCT 245
Leu Ser Ser Leu Asn Tyr Lys Ala Met Pro Val Val Ala Gly Val Ala
60 65 70

GTT TTA ATG GGG ATA TGG TGG ATG ACT GAA GCC ATT GAC TTG CCC GCA 293
Ile Leu Lys Trp Asn Glu Ile Asn Ala Pro Leu Arg Lys Asn Val Ser
205 210 215 220

TTA GAA GAA GTG GGC AAT GCC GGG ATG TAT TTG CTC TCT AGT TTG TCT 723
Leu Glu Glu Val Gly Asn Ala Gly Met Tyr Leu Leu Ser Ser Leu Ser
225 230 235

AGC GGG GTG AGT GGG GAA GTG CAT TTT GTG GAT GCT GGC TAT CAT GTT 771
Ser Gly Val Ser Gly Glu Val His Phe Val Asp Ala Gly Tyr His Val
240 245 250

ATG GGC ATG GGG GCT GTG GAA GAA AAA GAT AAT AAA GCT ACG CTA CTG 819
Met Gly Met Gly Ala Val Glu Glu Lys Asp Asn Lys Ala Thr Leu Leu
255 260 265

TGG GAT TTG CAT AAA GAA CAA TAAGGGGTAT TGATGAAATT AAGCGAATTG TTAA 874
Trp Asp Leu His Lys Glu Gln
270 275

ACGCCTATTC TATTG 889



(2) INFORMATION FOR SEQ ID NO:1228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228:



Met Gly Phe Leu Lys Gly Lys Lys Gly Leu Ile Val Gly Val Ala Asn
1 5 10 15

Asn Lys Ser Ile Ala Tyr Gly Ile Ala Gln Ser Cys Phe Asn Gln Gly
20 25 30

Ala Thr Leu Ala Phe Thr Tyr Leu Asn Glu Ser Leu Glu Lys Arg Val
35 40 45

Arg Pro Ile Ala Gln Glu Leu Asn Ser Pro Tyr Val Tyr Glu Leu Asp
50 55 60

Val Ser Lys Glu Glu His Phe Lys Ser Leu Tyr Asn Ser Val Lys Lys
65 70 75 80

Asp Leu Gly Ser Leu Asp Phe Ile Val His Ser Val Ala Phe Ala Pro
85 90 95

Lys Glu Ala Leu Glu Gly Ser Leu Leu Glu Thr Ser Lys Ser Ala Phe
100 105 110

Asn Thr Ala Met Glu Ile Ser Val Tyr Ser Leu Ile Glu Leu Thr Asn
115 120 125

Thr Leu Lys Pro Leu Leu Asn Asn Gly Ala Ser Val Leu Thr Leu Ser
130 135 140

Tyr Leu Gly Ser Thr Lys Tyr Met Ala His Tyr Asn Val Met Gly Leu
145 150 155 160

Ala Lys Ala Ala Leu Glu Ser Ala Val Arg Tyr Leu Ala Val Asp Leu
165 170 175
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227:

TTATAAAGGA AAATC ATG GGA TTT TTA AAA GGT AAA AAA GGG CTT ATT GTA 51
Met Gly Phe Leu Lys Gly Lys Lys Gly Leu Ile Val
1 5 10

GGG GTG GCG AAC AAT AAA TCC ATC GCT TAT GGG ATC GCT CAA TCT TGT 99
Gly Val Ala Asn Asn Lys Ser Ile Ala Tyr Gly Ile Ala Gln Ser Cys
15 20 25

TTC AAT CAA GGG GCT ACT TTG GCT TTC ACT TAT TTG AAT GAG AGT TTA 147
Phe Asn Gln Gly Ala Thr Leu Ala Phe Thr Tyr Leu Asn Glu Ser Leu
30 35 40

GAA AAG CGC GTA AGG CCT ATC GCG CAG GAA TTG AAT AGC CCC TAT GTG 195
Glu Lys Arg Val Arg Pro Ile Ala Gln Glu Leu Asn Ser Pro Tyr Val
45 50 55 60

TAT GAA TTG GAT GTG AGC AAA GAA GAG CAT TTC AAG TCG CTA TAC AAT 243
Tyr Glu Leu Asp Val Ser Lys Glu Glu His Phe Lys Ser Leu Tyr Asn
65 70 75

AGC GTT AAA AAG GAT TTA GGC TCA TTG GAT TTT ATT GTT CAT AGC GTG 291
Ser Val Lys Lys Asp Leu Gly Ser Leu Asp Phe Ile Val His Ser Val
80 85 90

GCC TTT GCC CCT AAA GAG GCT TTA GAG GGG AGC TTG TTG GAA ACT TCT 339
Ala Phe Ala Pro Lys Glu Ala Leu Glu Gly Ser Leu Leu Glu Thr Ser
95 100 105

AAA AGC GCG TTT AAC ACC GCT ATG GAA ATT TCT GTT TAT TCT TTA ATA 387
Lys Ser Ala Phe Asn Thr Ala Met Glu Ile Ser Val Tyr Ser Leu Ile
110 115 120

GAG CTG ACA AAC ACC CTA AAA CCT TTA TTG AAT AAC GGA GCG TCT GTT 435
Glu Leu Thr Asn Thr Leu Lys Pro Leu Leu Asn Asn Gly Ala Ser Val
125 130 135 140

TTG ACT CTA AGC TAT TTG GGT AGC ACC AAA TAC ATG GCG CAT TAC AAT 483
Leu Thr Leu Ser Tyr Leu Gly Ser Thr Lys Tyr Met Ala His Tyr Asn
145 150 155

GTG ATG GGG TTG GCT AAA GCG GCC CTA GAG AGT GCG GTG CGT TAT TTA 531
Val Met Gly Leu Ala Lys Ala Ala Leu Glu Ser Ala Val Arg Tyr Leu
160 165 170

GCG GTG GAT TTA GGC AAA CAC CAT ATA AGA GTG AAT GCC CTA TCG GCC 579
Ala Val Asp Leu Gly Lys His His Ile Arg Val Asn Ala Leu Ser Ala
175 180 185

GGG CCT ATT AGG ACG CTC GCT TCT AGC GGG ATC GCT GAT TTT AGA ATG 627
Gly Pro Ile Arg Thr Leu Ala Ser Ser Gly Ile Ala Asp Phe Arg Met
190 195 200

ATT TTA AAA TGG AAT GAA ATC AAC GCC CCT TTA AGA AAA AAT GTG AGT 675
100 105 110

Thr Tyr Glu Lys Leu Lys Asp Glu Leu Lys Lys Ile Lys Lys Ser Lys
115 120 125

Leu Lys Ser Phe Ile Gln Ser Tyr Val Ser Asp Leu Lys Asn Phe Tyr
130 135 140

Gln Ala Phe Leu Asp Leu Leu Ser Glu Ile Asp Thr Asn Pro Thr Thr
145 150 155 160

Phe Lys Val Met Leu Ile Asn Lys Ile Asp Ser Ser Phe Phe Asn Ser
165 170 175

Leu Ile Arg Leu Lys Ile Asn Asn Glu Leu Asp Asp Glu Thr Leu Lys
180 185 190

Leu Phe Ala Lys Thr Asp Ile Val Leu Phe Lys Ala Thr Arg Asp Arg
195 200 205

Pro Gly Thr Asp Asn Leu Ile Asn Ala Tyr Leu Lys Lys Gly Lys Glu
210 215 220

Gly Leu Lys Ser Glu Met Ile Ala Gln Cys Arg Asn Asp Ile Gly Leu
225 230 235 240

Ala Phe Trp Gln Ser Val Asn Asn Ala Ser Asn Ser Ser Cys Phe His
245 250 255

Tyr Ile Phe Phe Glu Lys Asn Cys Gln Glu Met Gly Leu Ala Asp Leu
260 265 270

Lys Lys Leu Ile Pro Arg Lys Gln Phe Ser Gln Glu Lys Glu His Ile
275 280 285

Ile Pro Ile Asn Leu Leu Lys Gln Glu Ser Asn Asn Lys Ile Arg Asp
290 295 300

Leu Gly Phe Glu Asp Lys Lys Asp Leu Glu Asp Tyr Ile Asp Thr Tyr
305 310 315 320

Gly Asn Leu Ile Ser Leu Glu Lys Ser Leu Asn Arg Lys Ala Ser Asp
325 330 335

Lys Asp Leu Tyr Gly Lys Asp Glu Ile Tyr Lys Ser Ser Glu Ile Pro
340 345 350

Phe Asn Arg Arg Phe Asp Thr Lys Asn Phe Asn Lys Lys Ala Leu Val
355 360 365

Lys Arg Asn Glu Glu Met Arg Glu Trp Leu Ile Asp Thr Phe Phe Lys
370 375 380

Asp Phe Ala Ala His
385

(2) INFORMATION FOR SEQ ID NO: 1227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16... 840 
(D) OTHER INFORMATION: 
TTC TCC CAA GAA AAA GAA CAC ATC ATC CCC ATC AAT TTA TTA AAA CAG 918
Phe Ser Gln Glu Lys Glu His Ile Ile Pro Ile Asn Leu Leu Lys Gln
285 290 295

GAA TCC AAC AAT AAG ATC AGA GAT CTT GGT TTT GAA GAC AAA AAA GAT 966
Glu Ser Asn Asn Lys Ile Arg Asp Leu Gly Phe Glu Asp Lys Lys Asp
300 305 310

CTT GAA GAC TAC ATT GAC ACA TAC GGC AAC CTC ATC TCC CTG GAA AAA 1014
Leu Glu Asp Tyr Ile Asp Thr Tyr Gly Asn Leu Ile Ser Leu Glu Lys
315 320 325

TCG CTC AAT CGT AAG GCA AGC GAT AAG GAT CTG TAT GGA AAA GAT GAA 1062
Ser Leu Asn Arg Lys Ala Ser Asp Lys Asp Leu Tyr Gly Lys Asp Glu
330 335 340

ATC TAT AAA AGT AGT GAG ATC CCT TTC AAC AGG CGC TTT GAT ACA AAA 1110
Ile Tyr Lys Ser Ser Glu Ile Pro Phe Asn Arg Arg Phe Asp Thr Lys
345 350 355 360

AAC TTC AAT AAG AAG GCA TTG GTA AAA AGA AAT GAA GAA ATG CGA GAA 1158
Asn Phe Asn Lys Lys Ala Leu Val Lys Arg Asn Glu Glu Met Arg Glu
365 370 375

TGG CTG ATC GAC ACC TTT TTT AAG GAT TTC GCC GCC CAC TAAAGAGAGT GA 1209
Trp Leu Ile Asp Thr Phe Phe Lys Asp Phe Ala Ala His
380 385

GATTAAAAGA GAGTGATCGC ACTCA 1234

(2) INFORMATION FOR SEQ ID NO: 1226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1226: 

Met Val Leu Met Arg Phe Glu Glu Pro Asp Pro Gly Arg Ala Ile Arg
1 5 10 15

Thr Phe Gln Ser Val Asn Asp Arg Gly Val Pro Leu Leu Leu Leu Asp
20 25 30

Lys Leu Lys Ser Leu Leu Ile Tyr Tyr Ser Asn Ile Phe Cys Asp Gly
35 40 45

Lys Arg Gly Leu Asp Gln Phe Ile Ile Asp His Phe Gly Glu Ile Phe
50 55 60

Lys Ile Phe Ala Lys Ile Lys Lys Ser Asp His Ile Ser Ser Val Gly
65 70 75 80

Gly Phe Asp Glu Gly Asp Ile Phe Arg Tyr His Ala Gly Ser Gln Lys
85 90 95

Phe Asp Gly Ile Glu Phe Leu Gly His Tyr Glu Ala Ser Thr Asp Lys
ATC GAT CAT TTT GGG GAG ATC TTT AAG ATC TTT GCC AAG ATT AAA AAG 246
Ile Asp His Phe Gly Glu Ile Phe Lys Ile Phe Ala Lys Ile Lys Lys
60 65 70

AGC GAC CAC ATC TCC AGC GTT GGA GGC TTT GAT GAA GGC GAT ATC TTC 294
Ser Asp His Ile Ser Ser Val Gly Gly Phe Asp Glu Gly Asp Ile Phe
75 80 85

CGC TAC CAC GCA GGG AGC CAA AAA TTT GAT GGA ATC GAG TTT TTA GGG 342
Arg Tyr His Ala Gly Ser Gln Lys Phe Asp Gly Ile Glu Phe Leu Gly
90 95 100

CAC TAC GAA GCA AGC ACG GAC AAA ACC TAC GAG AAA CTC AAA GAT GAA 390
His Tyr Glu Ala Ser Thr Asp Lys Thr Tyr Glu Lys Leu Lys Asp Glu
105 110 115 120

CTA AAA AAA ATC AAA AAA AGC AAA TTG AAA AGT TTC ATC CAA TCC TAT 438
Leu Lys Lys Ile Lys Lys Ser Lys Leu Lys Ser Phe Ile Gln Ser Tyr
125 130 135

GTC AGC GAT TTG AAA AAT TTC TAT CAG GCT TTT CTT GAT CTA TTG AGC 486
Val Ser Asp Leu Lys Asn Phe Tyr Gln Ala Phe Leu Asp Leu Leu Ser
140 145 150

GAG ATT GAC ACC AAC CCA ACC ACC TTT AAG GTC ATG CTC ATC AAC AAG 534
Glu Ile Asp Thr Asn Pro Thr Thr Phe Lys Val Met Leu Ile Asn Lys
155 160 165

ATC GAC TCG TCT TTT TTC AAT TCG CTC ATC CGC CTG AAA ATC AAC AAC 582
Ile Asp Ser Ser Phe Phe Asn Ser Leu Ile Arg Leu Lys Ile Asn Asn
170 175 180

GAA CTA GAC GAT GAA ACG CTG AAA CTC TTT GCC AAA ACC GAT ATT GTG 630
Glu Leu Asp Asp Glu Thr Leu Lys Leu Phe Ala Lys Thr Asp Ile Val
185 190 195 200

CTT TTC AAA GCT ACT AGA GAT AGG CCA GGA ACG GAC AAC CTG ATT AAT 678
Leu Phe Lys Ala Thr Arg Asp Arg Pro Gly Thr Asp Asn Leu Ile Asn
205 210 215

GCG TAT CTT AAA AAG GGC AAA GAG GGA TTG AAG AGC GAG ATG ATT GCT 726
Ala Tyr Leu Lys Lys Gly Lys Glu Gly Leu Lys Ser Glu Met Ile Ala
220 225 230

CAA TGC AGA AAT GAT ATA GGG CTG GCT TTT TGG CAG TCT GTA AAC AAC 774
Gln Cys Arg Asn Asp Ile Gly Leu Ala Phe Trp Gln Ser Val Asn Asn
235 240 245

GCA TCC AAC TCA TCA TGC TTC CAC TAT ATC TTC TTT GAA AAG AAC TGC 822
Ala Ser Asn Ser Ser Cys Phe His Tyr Ile Phe Phe Glu Lys Asn Cys
250 255 260

CAG GAG ATG GGT CTT GCC GAT CTC AAA AAA TTG ATC CCT AGG AAG CAA 870
Gln Glu Met Gly Leu Ala Asp Leu Lys Lys Leu Ile Pro Arg Lys Gln
265 270 275 280
195 200 205

Leu Cys Pro Ile Leu Ala Thr Thr Leu Gly Ala Ser Met Ile Glu Lys
210 215 220

His Phe Ile Leu Asn Lys Ser Leu Gln Thr Pro Asp Ser Ala Phe Ser
225 230 235 240

Met Asp Phe Asn Gly Phe Lys Ser Met Val Glu Ala Ile Lys Gln Ser
245 250 255

Val Leu Ala Leu Gly Glu Glu Glu Pro Arg Ile Asn Pro Lys Thr Leu
260 265 270

Glu Lys Arg Arg Phe Phe Ala Arg Ser Leu Phe Val Ile Lys Asp Ile
275 280 285

Gln Lys Gly Glu Ala Leu Thr Glu Asn Asn Ile Lys Ala Leu Arg Pro
290 295 300

Asn Leu Gly Leu His Pro Lys Phe Tyr Lys Glu Ile Leu Gly Gln Lys
305 310 315 320

Ala Ser Lys Phe Leu Lys Ala Asn Thr Pro Leu Ser Ala Asp Asp Ile
325 330 335

Glu Arg Ser Leu
340

(2) INFORMATION FOR SEQ ID NO: 1225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1234 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...1197
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225:

GTGAATGAGC GTTTGGAGGT GCTGTTGGAA ATG GTT TTG ATG CGG TTT GAA GAG 54
Met Val Leu Met Arg Phe Glu Glu
1 5

CCC GAT CCT GGA AGA GCT ATC AGA ACC TTT CAG AGC GTG AAT GAC AGA 102
Pro Asp Pro Gly Arg Ala Ile Arg Thr Phe Gln Ser Val Asn Asp Arg
10 15 20

GGC GTG CCT CTC CTC TTG CTA GAC AAA CTA AAA TCC CTT CTC ATC TAT 150
Gly Val Pro Leu Leu Leu Leu Asp Lys Leu Lys Ser Leu Leu Ile Tyr
25 30 35 40

TAC TCC AAC ATT TTT TGC GAT GGG AAA AGG GGG CTA GAC CAA TTT ATC 198
Tyr Ser Asn Ile Phe Cys Asp Gly Lys Arg Gly Leu Asp Gln Phe Ile
45 50 55
GTT ATT AAG GAT ATT CAA AAA GGC GAA GCA TTG ACT GAA AAC AAT ATC 915
Val Ile Lys Asp Ile Gln Lys Gly Glu Ala Leu Thr Glu Asn Asn Ile
285 290 295

AAA GCC TTA CGC CCC AAC CTT GGC TTA CAC CCT AAA TTT TAT AAA GAA 963
Lys Ala Leu Arg Pro Asn Leu Gly Leu His Pro Lys Phe Tyr Lys Glu
300 305 310 315

ATT TTA GGC CAA AAA GCA TCA AAA TTC TTA AAA GCC AAC ACC CCC TTA 1011
Ile Leu Gly Gln Lys Ala Ser Lys Phe Leu Lys Ala Asn Thr Pro Leu
320 325 330

AGC GCT GAT GAT ATA GAA CGC TCA TTG TAGGTTCGTT TTGATCAAAA AATGGGG 1065
Ser Ala Asp Asp Ile Glu Arg Ser Leu
335 340

TTTTTAATTT TGTTTTATGG TTTTAGATTT GATTTTAAAC TCATTTTCTT TATTTTAA 1123

(2) INFORMATION FOR SEQ ID NO: 1224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224: 

Met Leu Gln Pro Pro Lys Ile Val Ala Glu Leu Ser Ala Asn His Asn
1 5 10 15

Gln Asp Leu Asn Leu Ala Lys Glu Ser Leu His Ala Ile Lys Glu Ser
20 25 30

Gly Ala Asp Phe Val Lys Leu Gln Thr Tyr Thr Pro Ser Cys Met Thr
35 40 45

Leu Asn Ser Lys Glu Asp Pro Phe Ile Ile Gln Gly Thr Leu Trp Asp
50 55 60

Lys Glu Asn Leu Tyr Glu Leu Tyr Gln Lys Ala Ser Thr Pro Leu Glu
65 70 75 80

Trp His Ala Glu Leu Phe Glu Leu Ala Arg Lys Leu Asp Leu Gly Ile
85 90 95

Phe Ser Ser Pro Phe Ser Ser Gln Ala Leu Glu Leu Leu Glu Ser Leu
100 105 110

Asn Cys Pro Met Tyr Lys Ile Ala Ser Phe Glu Ile Val Asp Leu Asp
115 120 125

Leu Ile Glu Lys Ala Ala Arg Thr Gln Lys Pro Ile Ile Leu Ser Ser
130 135 140

Gly Ile Ala Thr His Thr Glu Leu Gln Asp Ala Ile Ser Leu Cys Arg
145 150 155 160

Arg Val Asn Asn Phe Asp Ile Thr Leu Leu Lys Cys Val Ser Ala Tyr
165 170 175

Pro Ser Lys Ile Glu Asp Ala Asn Leu Leu Ser Met Val Lys Leu Gly
180 185 190

Glu Ile Phe Gly Val Lys Phe Gly Leu Ser Asp His Thr Ile Gly Ser
Gly Thr Leu Trp Asp Lys Glu Asn Leu Tyr Glu Leu Tyr Gln Lys Ala
60 65 70 75

TCT ACC CCC CTA GAA TGG CAT GCT GAA TTG TTT GAG TTG GCT AGA AAG 291
Ser Thr Pro Leu Glu Trp His Ala Glu Leu Phe Glu Leu Ala Arg Lys
80 85 90

CTT GAT TTA GGC ATT TTT AGC TCG CCT TTT AGT TCA CAA GCT TTA GAG 339
Leu Asp Leu Gly Ile Phe Ser Ser Pro Phe Ser Ser Gln Ala Leu Glu
95 100 105

CTT TTA GAG AGC CTA AAT TGC CCC ATG TAT AAA ATC GCT AGT TTT GAA 387
Leu Leu Glu Ser Leu Asn Cys Pro Met Tyr Lys Ile Ala Ser Phe Glu
110 115 120

ATC GTT GAT TTG GAC TTG ATT GAA AAG GCC GCT CGC ACA CAA AAG CCC 435
Ile Val Asp Leu Asp Leu Ile Glu Lys Ala Ala Arg Thr Gln Lys Pro
125 130 135

ATT ATC CTT TCT AGC GGT ATC GCT ACA CAC ACC GAA TTG CAA GAC GCT 483
Ile Ile Leu Ser Ser Gly Ile Ala Thr His Thr Glu Leu Gln Asp Ala
140 145 150 155

ATC TCA TTG TGC AGA AGA GTG AAT AAT TTT GAC ATC ACC CTT TTA AAA 531
Ile Ser Leu Cys Arg Arg Val Asn Asn Phe Asp Ile Thr Leu Leu Lys
160 165 170

TGC GTG AGC GCT TAT CCC AGT AAA ATA GAA GAC GCT AAC TTG TTG AGC 579
Cys Val Ser Ala Tyr Pro Ser Lys Ile Glu Asp Ala Asn Leu Leu Ser
175 180 185

ATG GTT AAA TTA GGC GAA ATC TTT GGC GTT AAA TTT GGC TTG AGC GAT 627
Met Val Lys Leu Gly Glu Ile Phe Gly Val Lys Phe Gly Leu Ser Asp
190 195 200

CAC ACG ATT GGC TCT CTT TGC CCC ATT TTA GCC ACC ACT TTA GGA GCG 675
His Thr Ile Gly Ser Leu Cys Pro Ile Leu Ala Thr Thr Leu Gly Ala
205 210 215

AGC ATG ATA GAA AAG CAT TTC ATT TTA AAC AAA TCC TTA CAA ACC CCA 723
Ser Met Ile Glu Lys His Phe Ile Leu Asn Lys Ser Leu Gln Thr Pro
220 225 230 235

GAC AGC GCT TTT AGC ATG GAT TTT AAC GGA TTT AAA AGC ATG GTT GAA 771
Asp Ser Ala Phe Ser Met Asp Phe Asn Gly Phe Lys Ser Met Val Glu
240 245 250

GCC ATC AAG CAA AGC GTT TTA GCC TTA GGC GAA GAA GAG CCA AGA ATC 819
Ala Ile Lys Gln Ser Val Leu Ala Leu Gly Glu Glu Glu Pro Arg Ile
255 260 265

AAT CCA AAG ACT TTA GAA AAG CGA AGA TTT TTT GCA CGC TCT TTA TTT 867
Asn Pro Lys Thr Leu Glu Lys Arg Arg Phe Phe Ala Arg Ser Leu Phe
270 275 280
Leu Lys Asn Arg Gly Phe Thr Asp Gly Tyr Leu Met Arg Arg Pro Phe
210 215 220

Glu Arg Leu Asp Thr Gln Asn His Gln Thr Ala Ile Ser Glu Gly Asp
225 230 235 240

Phe Gln Val Asn Gly Glu Ile Thr Glu Asp Gly Arg Phe Phe Ala Cys
245 250 255

Lys Phe Thr Thr Thr Thr Asn Thr Ala Tyr Glu Ile Ile Ala Pro Lys
260 265 270

Asn Ala Ala Ile Thr Pro Ile Val Asn Glu Ile Gly Lys Ile Tyr Thr
275 280 285

Phe Glu Lys Arg Ser Tyr Leu Val Leu Tyr Lys Ile Leu Leu Glu Asn
290 295 300

Asn Thr Glu Leu Glu Thr Ile His Ser Gly Asn Val Asn Leu Val Arg
305 310 315 320

Leu Pro Ala Pro Leu Pro Ala Phe Ser Phe Leu Arg Thr Gln Val Arg
325 330 335

Val



(2) INFORMATION FOR SEQ ID NO:1223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19...1038
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1223:

ATTAAGGAGA AATAAGAA ATG TTA CAA CCC CCT AAA ATT GTC GCT GAA TTG 51
Met Leu Gln Pro Pro Lys Ile Val Ala Glu Leu
1 5 10

AGC GCT AAT CAT AAC CAG GAT TTA AAC CTA GCC AAA GAA AGC CTT CAT 99
Ser Ala Asn His Asn Gln Asp Leu Asn Leu Ala Lys Glu Ser Leu His
15 20 25

GCC ATT AAG GAA AGC GGT GCG GAT TTT GTC AAG CTC CAA ACC TAC ACG 147
Ala Ile Lys Glu Ser Gly Ala Asp Phe Val Lys Leu Gln Thr Tyr Thr
30 35 40

CCA AGC TGC ATG ACT TTA AAC TCT AAA GAA GAT CCT TTC ATC ATT CAA 195
Pro Ser Cys Met Thr Leu Asn Ser Lys Glu Asp Pro Phe Ile Ile Gln
45 50 55

GGC ACT TTA TGG GAT AAA GAA AAT TTG TAT GAA TTG TAT CAA AAG GCT 243
Ile Tyr Thr Phe Glu Lys Arg Ser Tyr Leu Val Leu Tyr Lys Ile Leu
290 295 300

TTA GAA AAT AAC ACC GAG CTA GAA ACT ATC CAT AGC GGG AAC GTG AAT 1022
Leu Glu Asn Asn Thr Glu Leu Glu Thr Ile His Ser Gly Asn Val Asn
305 310 315

TTA GTG CGA CTG CCC GCA CCC TTA CCG GCT TTT AGT TTT TTA CGC ACC 1070
Leu Val Arg Leu Pro Ala Pro Leu Pro Ala Phe Ser Phe Leu Arg Thr
320 325 330

CAA GTC AGA GTC TAAAAATGGC GTTTAGAGAT TAGGTATTGA AAATGATTAA GAGAA 1127
Gln Val Arg Val
335

ACGCATG 1134
(2) INFORMATION FOR SEQ ID NO: 1222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222:

Met Ala Glu Leu Glu Pro Asp Ala Phe Ile Ile Ala Ala Pro Gly Val
1 5 10 15

Val Lys Leu Ala Leu Lys Ile Ala Pro His Ile Pro Ile His Leu Ser
20 25 30

Thr Gln Ala Asn Val Leu Asn Leu Leu Asp Ala Gln Val Phe Tyr Asp
35 40 45

Leu Gly Val Lys Arg Ile Val Cys Ala Arg Glu Leu Ser Leu Asn Asp
50 55 60

Ala Ile Glu Ile Lys Lys Ala Leu Pro Asn Leu Glu Leu Glu Ile Phe
65 70 75 80

Val His Gly Ser Met Cys Phe Ala Phe Ser Gly Arg Cys Leu Ile Ser
85 90 95

Ala Leu Gln Lys Gly Arg Val Pro Asn Arg Gly Ser Cys Ala Asn Asp
100 105 110

Cys Arg Phe Asp Tyr Glu Tyr Tyr Val Lys Asn Pro Asp Asn Gly Val
115 120 125

Met Met Arg Leu Val Glu Glu Glu Gly Val Gly Thr His Ile Phe Asn
130 135 140

Ala Lys Asp Leu Asn Leu Ser Gly His Ile Ala Glu Ile Leu Ser Ser
145 150 155 160

Asn Ala Ile Ser Ala Leu Lys Ile Glu Gly Arg Thr Lys Ser Ser Tyr
165 170 175

Tyr Ala Ala Gln Thr Thr Arg Ile Tyr Arg Leu Ala Val Asp Asp Phe
180 185 190

Tyr His Asn Thr Leu Lys Pro Ser Phe Tyr Ala Ser Glu Leu Asn Thr
195 200 205
65 70 75

GAA ATC TTT GTG CAT GGG AGC ATG TGC TTT GCC TTT TCA GGG CGC TGC 350
Glu Ile Phe Val His Gly Ser Met Cys Phe Ala Phe Ser Gly Arg Cys
80 85 90

TTG ATT TCG GCC TTA CAA AAG GGG CGC GTG CCT AAT AGA GGG AGT TGC 398
Leu Ile Ser Ala Leu Gln Lys Gly Arg Val Pro Asn Arg Gly Ser Cys
95 100 105

GCG AAT GAT TGC CGG TTT GAT TAT GAA TAT TAC GTG AAA AAC CCT GAT 446
Ala Asn Asp Cys Arg Phe Asp Tyr Glu Tyr Tyr Val Lys Asn Pro Asp
110 115 120 125

AAT GGC GTG ATG ATG AGA CTG GTT GAA GAA GAG GGC GTA GGC ACG CAT 494
Asn Gly Val Met Met Arg Leu Val Glu Glu Glu Gly Val Gly Thr His
130 135 140

ATT TTT AAC GCT AAG GAT TTG AAC CTC TCT GGC CAT ATC GCT GAA ATT 542
Ile Phe Asn Ala Lys Asp Leu Asn Leu Ser Gly His Ile Ala Glu Ile
145 150 155

TTA AGT TCC AAC GCC ATT AGC GCG CTT AAG ATT GAA GGG CGC ACC AAG 590
Leu Ser Ser Asn Ala Ile Ser Ala Leu Lys Ile Glu Gly Arg Thr Lys
160 165 170

TCC AGT TAC TAC GCC GCG CAA ACC ACG CGC ATC TAT CGT TTA GCG GTT 638
Ser Ser Tyr Tyr Ala Ala Gln Thr Thr Arg Ile Tyr Arg Leu Ala Val
175 180 185

GAT GAT TTT TAC CAT AAC ACC TTA AAG CCG AGT TTT TAT GCC AGC GAA 686
Asp Asp Phe Tyr His Asn Thr Leu Lys Pro Ser Phe Tyr Ala Ser Glu
190 195 200 205

TTG AAC ACG CTT AAA AAC AGG GGT TTT ACG GAC GGC TAT TTG ATG CGA 734
Leu Asn Thr Leu Lys Asn Arg Gly Phe Thr Asp Gly Tyr Leu Met Arg
210 215 220

AGG CCT TTT GAA AGG TTG GAT ACT CAA AAC CAC CAA ACA GCC ATT AGC 782
Arg Pro Phe Glu Arg Leu Asp Thr Gln Asn His Gln Thr Ala Ile Ser
225 230 235

GAA GGG GAT TTT CAA GTC AAT GGC GAA ATA ACC GAA GAC GGG CGT TTT 830
Glu Gly Asp Phe Gln Val Asn Gly Glu Ile Thr Glu Asp Gly Arg Phe
240 245 250

TTT GCA TGC AAA TTC ACC ACT ACC ACT AAC ACC GCT TAT GAA ATC ATC 878
Phe Ala Cys Lys Phe Thr Thr Thr Thr Asn Thr Ala Tyr Glu Ile Ile
255 260 265

GCT CCC AAA AAT GCG GCT ATC ACG CCC ATA GTC AAT GAA ATT GGC AAG 926
Ala Pro Lys Asn Ala Ala Ile Thr Pro Ile Val Asn Glu Ile Gly Lys
270 275 280 285

ATT TAC ACC TTT GAA AAA CGC TCT TAT TTA GTG CTG TAT AAA ATC CTT 974 
Gln Ser Leu Leu Arg Arg Ser His Lys Lys Glu Glu Val Ser Glu Pro

115 120 125 

Gly Asp Ala Asn Ile Phe Arg Val Asp Lys Asp Ser Arg Glu Val Tyr
130 135 140

Met His Glu Lys Lys Leu Asp Leu Thr Arg Ala Glu Tyr Glu Ile Leu
145 150 155 160

Ser Leu Leu Ile Ser Lys Lys Gly Tyr Val Phe Ser Arg Glu Ser Ile
165 170 175

Ala Ile Glu Ser Glu Ser Ile Asn Pro Glu Ser Ser Asn Lys Ser Ile
180 185 190

Asp Val Ile Ile Gly Arg Leu Arg Ser Lys Ile Glu Lys Asn Pro Lys
195 200 205

Gln Pro Gln Tyr Ile Ile Ser Val Arg Gly Ile Gly Tyr Lys Leu Glu
210 215 220

Tyr 
225 



(2) INFORMATION FOR SEQ ID NO: 1221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 72...1082
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221: 

AAGTCTATGC CACGATCAAT GGTTTCCCTT TCAATTCACA GCTCAAACTT TTAGAAGAAC 60

ATATTGATAA A ATG GCA GAA TTA GAG CCG GAC GCT TTT ATT ATC GCT GCG 110
Met Ala Glu Leu Glu Pro Asp Ala Phe Ile Ile Ala Ala
1 5 10

CCT GGT GTG GTG AAA CTC GCT TTA AAA ATC GCC CCG CAT ATC CCT ATC 158
Pro Gly Val Val Lys Leu Ala Leu Lys Ile Ala Pro His Ile Pro Ile
15 20 25

CAT TTA TCC ACG CAA GCG AAT GTG TTA AAT TTG CTA GAT GCA CAA GTG 206
His Leu Ser Thr Gln Ala Asn Val Leu Asn Leu Leu Asp Ala Gln Val
30 35 40 45

TTT TAT GAT TTA GGG GTT AAA CGC ATC GTG TGC GCG AGG GAA TTG AGC 254
Phe Tyr Asp Leu Gly Val Lys Arg Ile Val Cys Ala Arg Glu Leu Ser
50 55 60

CTG AAT GAT GCG ATT GAG ATT AAA AAA GCC TTA CCT AAT TTA GAA TTA 302
Leu Asn Asp Ala Ile Glu Ile Lys Lys Ala Leu Pro Asn Leu Glu Leu
GGC GAT GCG AAT ATC TTT AGG GTG GAT AAG GAT AGC CGA GAA GTG TAT 432
Gly Asp Ala Asn Ile Phe Arg Val Asp Lys Asp Ser Arg Glu Val Tyr
130 135 140

ATG CAT GAA AAA AAG CTG GAC TTA ACT AGG GCT GAA TAT GAA ATC CTT 480
Met His Glu Lys Lys Leu Asp Leu Thr Arg Ala Glu Tyr Glu Ile Leu
145 150 155 160

TCG CTT CTC ATT AGC AAA AAA GGT TAT GTG TTT AGC CGT GAA AGC ATT 528
Ser Leu Leu Ile Ser Lys Lys Gly Tyr Val Phe Ser Arg Glu Ser Ile
165 170 175

GCG ATT GAG AGC GAG AGC ATC AAC CCT GAA AGC TCT AAT AAA AGC ATT 576
Ala Ile Glu Ser Glu Ser Ile Asn Pro Glu Ser Ser Asn Lys Ser Ile
180 185 190

GAT GTG ATC ATT GGC CGT TTG CGA TCT AAG ATT GAA AAA AAT CCT AAA 624
Asp Val Ile Ile Gly Arg Leu Arg Ser Lys Ile Glu Lys Asn Pro Lys
195 200 205

CAA CCG CAA TAC ATC ATC TCT GTT AGA GGG ATT GGT TAT AAA TTA GAA 672
Gln Pro Gln Tyr Ile Ile Ser Val Arg Gly Ile Gly Tyr Lys Leu Glu
210 215 220

TAC TGA 678
Tyr
225



(2) INFORMATION FOR SEQ ID NO: 1220:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 225 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220:



Met Ile Glu Val Leu Met Ile Glu Asp Asp Ile Glu Leu Ala Glu Phe
1 5 10 15

Leu Ser Glu Phe Leu Leu Gln His Gly Ile His Val Thr Asn Tyr Asp
20 25 30

Glu Pro Tyr Thr Gly Ile Ser Ala Ala Asn Thr Gln Asn Tyr Asp Leu
35 40 45

Leu Leu Leu Asp Leu Thr Leu Pro Asn Leu Asp Gly Leu Glu Val Cys
50 55 60

Arg Arg Ile Ser Lys Gln Lys His Ile Pro Ile Ile Ile Ser Ser Ala
65 70 75 80

Arg Ser Asp Val Glu Asp Lys Ile Lys Ala Leu Asp Tyr Gly Ala Asp
85 90 95

Asp Tyr Leu Pro Lys Pro Tyr Asp Pro Lys Glu Leu Leu Ala Arg Ile
100 105 110
Gln Glu Ile Thr Glu Leu Gln Ser Lys Asn Val Arg Leu Gln Lys Arg

65 70 75 80 

Leu Phe Glu Leu Lys Glu Leu Arg Pro Arg Asp 
85 90 

(2) INFORMATION FOR SEQ ID NO: 1219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...675 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219: 

ATG ATA GAA GTT TTA ATG ATA GAA GAT GAT ATA GAA TTA GCC GAG TTT 48
Met Ile Glu Val Leu Met Ile Glu Asp Asp Ile Glu Leu Ala Glu Phe
1 5 10 15

TTG AGC GAG TTT TTG CTC CAA CAT GGC ATT CAT GTA ACC AAT TAC GAT 96
Leu Ser Glu Phe Leu Leu Gln His Gly Ile His Val Thr Asn Tyr Asp
20 25 30

GAG CCA TAT ACC GGC ATT AGT GCG GCT AAC ACA CAA AAT TAT GAT TTG 144
Glu Pro Tyr Thr Gly Ile Ser Ala Ala Asn Thr Gln Asn Tyr Asp Leu
35 40 45

TTG TTG TTG GAT TTG ACT TTG CCT AAT TTA GAC GGG CTT GAA GTG TGT 192
Leu Leu Leu Asp Leu Thr Leu Pro Asn Leu Asp Gly Leu Glu Val Cys
50 55 60

AGG CGC ATC TCC AAA CAA AAA CAT ATC CCT ATT ATT ATT TCT TCA GCG 240
Arg Arg Ile Ser Lys Gln Lys His Ile Pro Ile Ile Ile Ser Ser Ala
65 70 75 80

AGA AGT GAT GTG GAA GAT AAG ATT AAA GCA CTA GAT TAT GGG GCT GAT 288
Arg Ser Asp Val Glu Asp Lys Ile Lys Ala Leu Asp Tyr Gly Ala Asp
85 90 95

GAT TAC CTC CCT AAA CCC TAT GAT CCT AAA GAA TTA TTA GCT CGC ATC 336
Asp Tyr Leu Pro Lys Pro Tyr Asp Pro Lys Glu Leu Leu Ala Arg Ile
100 105 110

CAA TCG CTA CTC AGG CGT TCT CAT AAA AAA GAA GAA GTG AGT GAG CCA 384
Gln Ser Leu Leu Arg Arg Ser His Lys Lys Glu Glu Val Ser Glu Pro
115 120 125
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217:

AAGGGGGGAT TTATATCGGT AAAGAGTTGT TTAAGC ATG GCT AGT GGC CTT TTT 54
Met Ala Ser Gly Leu Phe
1 5

GAA AAC GAT GGA ATC AAA GAC AAC AAA GCG CGA GAT TTT TTC TAT AGC 102
Glu Asn Asp Gly Ile Lys Asp Asn Lys Ala Arg Asp Phe Phe Tyr Ser
10 15 20

CAT AGC TCC CTA ATT GTC TTT TTC CTT TTA CTG CTT GGG TTT GGG TAT 150
His Ser Ser Leu Ile Val Phe Phe Leu Leu Leu Leu Gly Phe Gly Tyr
25 30 35

TAT TTA GGG AAG TTG CTT TTT GGG GGC TCT TCT TTA GAA GTT TAT TTG 198
Tyr Leu Gly Lys Leu Leu Phe Gly Gly Ser Ser Leu Glu Val Tyr Leu
40 45 50

GAT TTA AGA GAC AAG CAT GAA CGA TTG CAG CAA GAA ATC ACC GAA TTG 246
Asp Leu Arg Asp Lys His Glu Arg Leu Gln Gln Glu Ile Thr Glu Leu
55 60 65 70

CAA AGC AAG AAT GTG CGC TTG CAA AAG CGT TTG TTT GAG TTG AAG GAA 294
Gln Ser Lys Asn Val Arg Leu Gln Lys Arg Leu Phe Glu Leu Lys Glu
75 80 85

TTA CGG CCT AGA GAT TAGATTTAAG GAAAATGGTA GTGTTAAAAA AGATGATAGG T 350
Leu Arg Pro Arg Asp
90

TTGGTGG 357

(2) INFORMATION FOR SEQ ID NO: 1218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218:

Met Ala Ser Gly Leu Phe Glu Asn Asp Gly Ile Lys Asp Asn Lys Ala
1 5 10 15

Arg Asp Phe Phe Tyr Ser His Ser Ser Leu Ile Val Phe Phe Leu Leu
20 25 30

Leu Leu Gly Phe Gly Tyr Tyr Leu Gly Lys Leu Leu Phe Gly Gly Ser
35 40 45

Ser Leu Glu Val Tyr Leu Asp Leu Arg Asp Lys His Glu Arg Leu Gln
50 55 60
(2) INFORMATION FOR SEQ ID NO: 1216:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216: 



Met Gly Arg Phe Ser Leu Lys Glu Ile Leu Met Leu Ser Leu Thr Leu
1 5 10 15

Leu Ala Leu Leu Gly Trp Ile Phe Gly Lys Pro Leu Gly Leu His Ala
20 25 30

Ser Ala Thr Ala Leu Ile Val Met Val Leu Met Ala Phe Cys Lys Ile
35 40 45

Val Ser Tyr Glu Asp Ile Ile Lys Asn Lys Ser Ala Phe Asn Ile Phe
50 55 60

Leu Leu Leu Gly Ser Leu Leu Thr Met Ala Gly Gly Leu Lys Asn Val
65 70 75 80

Gly Phe Leu Asn Phe Ile Gly Asn Ala Ala Gln Asn Phe Leu Glu His
85 90 95

Ala His Leu Asp Pro Leu Ile Ala Val Leu Phe Ile Val Ala Leu Phe
100 105 110

Tyr Leu Ser His Tyr Phe Phe Ala Ser Ile Thr Ala His Val Ser Ala
115 120 125

Leu Phe Ala Leu Phe Val Gly Ile Gly Ser His Ile Gln Gly Val Asn
130 135 140

Leu Gln Glu Leu Ser Leu Phe Leu Met Phe Ser Leu Gly Ile Met Gly
145 150 155 160

Ile Leu Thr Pro Tyr Gly Thr Gly Pro Ser Thr Ile Tyr Tyr Gly Ser
165 170 175

Gly Tyr Ile Gln Ser Lys Asp Phe Trp Lys Trp Gly Phe Ile Phe Gly
180 185 190

Phe Leu Tyr Leu Ile Val Phe Leu Ser Val Cys Thr Pro Trp Val Lys
195 200 205

Phe Ile Ala Tyr Arg Trp Leu
210 215



(2) INFORMATION FOR SEQ ID NO: 1217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 357 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 37...309
(D) OTHER INFORMATION: 
1 5 10

ACC TTA TTG GCT TTA CTG GGT TGG ATT TTT GGC AAA CCT TTA GGC TTG 159
Thr Leu Leu Ala Leu Leu Gly Trp Ile Phe Gly Lys Pro Leu Gly Leu
15 20 25 30

CAT GCG AGT GCG ACG GCT TTG ATT GTC ATG GTT TTA ATG GCG TTT TGT 207
His Ala Ser Ala Thr Ala Leu Ile Val Met Val Leu Met Ala Phe Cys
35 40 45

AAG ATT GTA AGC TAT GAA GAC ATC ATT AAA AAC AAG AGC GCG TTC AAT 255
Lys Ile Val Ser Tyr Glu Asp Ile Ile Lys Asn Lys Ser Ala Phe Asn
50 55 60

ATT TTT TTA TTG CTT GGA TCG CTG CTC ACG ATG GCT GGC GGG CTT AAA 303
Ile Phe Leu Leu Leu Gly Ser Leu Leu Thr Met Ala Gly Gly Leu Lys
65 70 75

AAT GTA GGG TTT TTA AAT TTT ATC GGC AAT GCG GCT CAA AAT TTT TTA 351
Asn Val Gly Phe Leu Asn Phe Ile Gly Asn Ala Ala Gln Asn Phe Leu
80 85 90 

GAG CAT GCT CAC TTG GAT CCG TTA ATA GCG GTC TTG TTT ATT GTA GCC 399
Glu His Ala His Leu Asp Pro Leu Ile Ala Val Leu Phe Ile Val Ala
95 100 105 110

CTC TTT TAT CTG TCG CAT TAT TTT TTC GCA AGC ATC ACC GCT CAT GTG 447
Leu Phe Tyr Leu Ser His Tyr Phe Phe Ala Ser Ile Thr Ala His Val
115 120 125

AGC GCG TTA TTC GCG CTT TTT GTA GGG ATT GGT TCG CAC ATT CAA GGG 495
Ser Ala Leu Phe Ala Leu Phe Val Gly Ile Gly Ser His Ile Gln Gly
130 135 140

GTC AAT TTG CAA GAA TTG AGC TTG TTT TTA ATG TTT TCT TTA GGG ATT 543
Val Asn Leu Gln Glu Leu Ser Leu Phe Leu Met Phe Ser Leu Gly Ile
145 150 155

ATG GGG ATT TTA ACG CCC TAT GGC ACA GGC CCA TCC ACC ATT TAT TAC 591
Met Gly Ile Leu Thr Pro Tyr Gly Thr Gly Pro Ser Thr Ile Tyr Tyr
160 165 170

GGG AGC GGG TAT ATT CAA AGC AAG GAT TTT TGG AAA TGG GGG TTT ATT 639
Gly Ser Gly Tyr Ile Gln Ser Lys Asp Phe Trp Lys Trp Gly Phe Ile
175 180 185 190

TTT GGC TTT TTG TAT TTA ATC GTG TTT TTA AGC GTG TGC ACA CCT TGG 687
Phe Gly Phe Leu Tyr Leu Ile Val Phe Leu Ser Val Cys Thr Pro Trp
195 200 205

GTC AAA TTC ATC GCT TAT AGG TGG TTG TAGCTGGAAA CTTTACACAA CGCCCTT 741
Val Lys Phe Ile Ala Tyr Arg Trp Leu
210 215

TTAAAATGGT ATGAAGAA 759
(2) INFORMATION FOR SEQ ID NO: 1214:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214: 



Met Met Arg His Asp Tyr Leu Glu Leu Phe Glu Gly His Ala Glu Phe
1 5 10 15

Asn Met Val Lys Asp Phe Cys Ser Arg Val Tyr Glu Leu Ser Glu Phe
20 25 30

Leu Asp Lys Lys Leu Gln Val Lys Tyr Glu Asp Lys Gly Glu Pro Leu
35 40 45

Lys Ile Thr Trp His Ser Asn Cys His Ala Leu Arg Val Ala Lys Val
50 55 60

Ile Asp Ser Ala Lys Asn Leu Ile Arg Gln Leu Lys Asn Val Glu Leu
65 70 75 80

Ile Glu Leu Glu Lys Glu Glu Glu Cys Cys Gly Phe Gly Gly Thr Phe
85 90 95

Ser Val Lys Glu Pro Glu Ile Ser Ala Val Met Val Lys Glu Lys Ile
100 105 110

Lys Asn Ile Glu Ser Arg Gln Val Asp Val Ile Val Ser Ala Asp Ala
115 120 125

Gly Cys Leu Met Asn Ile Ser Thr Ala Met Gln Lys Met Gly Ser Leu
130 135 140

Thr Lys Pro Met His Phe Tyr Asp Phe Leu Ala Ser Arg Leu Gly Leu 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 1215: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 70...714
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1215: 



AACCTGCAAA CCCACCTTAA AAGGCTCAAA AGAAGTGAGT TTGTGGGCCA AAAAAAGGAA 60

TTAGAGGGC ATG GGG AGG TTT TCT TTA AAA GAA ATT TTA ATG CTC AGC CTT 111
Met Gly Arg Phe Ser Leu Lys Glu Ile Leu Met Leu Ser Leu
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52...531
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213: 

TATTCCAATA ACGACTACCC TATTATTTTG CCTAGCGGTT CATGCACAGG G ATG ATG 57
Met Met
1

CGG CAT GAT TAT TTG GAA TTG TTT GAA GGG CAT GCG GAA TTC AAC ATG 105
Arg His Asp Tyr Leu Glu Leu Phe Glu Gly His Ala Glu Phe Asn Met
5 10 15

GTT AAA GAT TTT TGC TCT AGG GTG TAT GAA TTG AGC GAA TTT TTG GAT 153
Val Lys Asp Phe Cys Ser Arg Val Tyr Glu Leu Ser Glu Phe Leu Asp
20 25 30

AAA AAA TTG CAA GTC AAA TAT GAA GAT AAG GGC GAA CCC CTT AAA ATC 201
Lys Lys Leu Gln Val Lys Tyr Glu Asp Lys Gly Glu Pro Leu Lys Ile
35 40 45 50

ACA TGG CAT TCT AAT TGC CAT GCC TTA AGG GTG GCT AAA GTG ATT GAC 249
Thr Trp His Ser Asn Cys His Ala Leu Arg Val Ala Lys Val Ile Asp
55 60 65

TCG GCG AAA AAC CTC ATC AGA CAG CTT AAA AAT GTG GAA CTC ATT GAA 297
Ser Ala Lys Asn Leu Ile Arg Gln Leu Lys Asn Val Glu Leu Ile Glu
70 75 80

TTG GAA AAA GAA GAA GAA TGC TGC GGG TTT GGG GGG ACT TTT TCG GTT 345
Leu Glu Lys Glu Glu Glu Cys Cys Gly Phe Gly Gly Thr Phe Ser Val
85 90 95

AAA GAG CCT GAA ATT TCA GCG GTT ATG GTT AAA GAA AAG ATT AAA AAC 393
Lys Glu Pro Glu Ile Ser Ala Val Met Val Lys Glu Lys Ile Lys Asn
100 105 110

ATA GAA AGC CGT CAA GTG GAT GTG ATT GTT TCA GCG GAT GCT GGG TGT 441
Ile Glu Ser Arg Gln Val Asp Val Ile Val Ser Ala Asp Ala Gly Cys
115 120 125 130

TTG ATG AAT ATC AGC ACC GCT ATG CAA AAA ATG GGC TCT TTG ACA AAA 489
Leu Met Asn Ile Ser Thr Ala Met Gln Lys Met Gly Ser Leu Thr Lys
135 140 145

CCC ATG CAT TTT TAT GAC TTT TTA GCC TCA AGA CTT GGG CTT TAACATTAA 540
Pro Met His Phe Tyr Asp Phe Leu Ala Ser Arg Leu Gly Leu
150 155 160

AGAATTATTT TAAGGAATGA TCATGGAAAA A 571 
Gln Lys Val Ala Ile Ile Leu Tyr
205 210 

GGGCAATTAG CCCCTGATT 700

(2) INFORMATION FOR SEQ ID NO: 1212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1212: 

Met Ser Lys Glu Leu Ile Leu Lys Arg Ile Lys Glu Ala Arg Ala Lys
1 5 10 15

His Ala Ile Gln Gly Ala Asn Pro Ile Tyr Arg Asn Ile Ile Lys Val
20 25 30

Glu Phe Glu Asp Leu Val Glu Glu Tyr Lys His Phe Gln Val Leu Asn
35 40 45

Lys Ala Glu Val Ile Glu Ser Ala Lys Glu Asn Leu Glu Gln Ala Ile
50 55 60

Leu Lys Ala Leu Glu Asn Phe Lys Ser Lys Lys Ile Leu His Ser Thr
65 70 75 80

Asp Leu Asn Leu Asn Phe Glu Ala Phe Lys Asp Phe Thr Leu Gln Pro
85 90 95

Tyr Asp Lys Glu Ile Glu Ala Met Arg Glu Glu Leu Phe Glu Ile Asp
100 105 110

Thr Ala Leu Leu His Gly Val Cys Gly Ile Ser Ser Leu Gly Met Ile
115 120 125

Gly Ala Val Ser Ser His Ala Ser Pro Arg Leu Leu Ser Leu Ile Thr
130 135 140

Leu Asn Cys Ile Ile Leu Leu Lys Lys Glu Ser Ile Val Arg Asn Leu
145 150 155 160

Ser Glu Gly Met Gln Ala Leu Lys Asn Gln Ser Gln Asn Gly Ala Leu
165 170 175

Pro Thr Asn Met Leu Leu Ile Gly Gly Pro Ser Arg Thr Ala Asp Ile
180 185 190

Glu Leu Lys Thr Val Phe Gly Val His Gly Pro Gln Lys Val Ala Ile
195 200 205

Ile Leu Tyr
210

(2) INFORMATION FOR SEQ ID NO: 1213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211:

ACTTAGAAGG GGTGATTT ATG AGT AAA GAG CTT ATT TTA AAG CGC ATT AAA 51
Met Ser Lys Glu Leu Ile Leu Lys Arg Ile Lys
1 5 10

GAA GCC AGA GCC AAG CAT GCC ATT CAG GGA GCG AAC CCT ATT TAT AGG 99
Glu Ala Arg Ala Lys His Ala Ile Gln Gly Ala Asn Pro Ile Tyr Arg
15 20 25

AAT ATC ATT AAA GTG GAG TTT GAG GAC TTG GTG GAA GAA TAC AAG CAT 147
Asn Ile Ile Lys Val Glu Phe Glu Asp Leu Val Glu Glu Tyr Lys His
30 35 40

TTC CAA GTG TTG AAT AAA GCT GAA GTC ATT GAA AGC GCT AAA GAA AAT 195
Phe Gln Val Leu Asn Lys Ala Glu Val Ile Glu Ser Ala Lys Glu Asn
45 50 55

TTA GAG CAA GCC ATT TTA AAG GCT TTA GAA AAT TTT AAA AGC AAA AAA 243
Leu Glu Gln Ala Ile Leu Lys Ala Leu Glu Asn Phe Lys Ser Lys Lys
60 65 70 75

ATC TTA CAC TCC ACA GAT TTG AAT TTG AAT TTT GAA GCG TTT AAG GAT 291
Ile Leu His Ser Thr Asp Leu Asn Leu Asn Phe Glu Ala Phe Lys Asp
80 85 90

TTT ACT TTA CAG CCT TAT GAT AAA GAA ATT GAA GCG ATG CGT GAA GAG 339
Phe Thr Leu Gln Pro Tyr Asp Lys Glu Ile Glu Ala Met Arg Glu Glu
95 100 105

TTG TTT GAG ATT GAT ACG GCT TTA TTG CAT GGG GTT TGT GGG ATT TCA 387
Leu Phe Glu Ile Asp Thr Ala Leu Leu His Gly Val Cys Gly Ile Ser
110 115 120

AGC TTG GGC ATG ATT GGG GCG GTC TCT TCG CAT GCA AGC CCG CGA TTG 435
Ser Leu Gly Met Ile Gly Ala Val Ser Ser His Ala Ser Pro Arg Leu
125 130 135

CTT TCG CTC ATC ACC CTT AAT TGC ATC ATC TTA TTG AAA AAA GAA TCC 483
Leu Ser Leu Ile Thr Leu Asn Cys Ile Ile Leu Leu Lys Lys Glu Ser
140 145 150 155

ATT GTG CGC AAT TTG AGT GAA GGC ATG CAA GCT TTA AAA AAC CAA AGC 531
Ile Val Arg Asn Leu Ser Glu Gly Met Gln Ala Leu Lys Asn Gln Ser
160 165 170

CAA AAC GGT GCA TTA CCC ACA AAC ATG CTC CTT ATT GGC GGG CCT AGC 579
Gln Asn Gly Ala Leu Pro Thr Asn Met Leu Leu Ile Gly Gly Pro Ser
175 180 185

CGG ACA GCT GAC ATT GAA TTA AAA ACC GTT TTT GGG GTG CAT GGG CCT 627
Arg Thr Ala Asp Ile Glu Leu Lys Thr Val Phe Gly Val His Gly Pro
190 195 200

CAA AAA GTC GCT ATC ATT CTC TAC TAAAGGATAA GAATGGAAAA ATTAGAAGTA 681 
AAG GTT TTA GAG AGT TTG TAGTTTAACT TTCTAACTTT CGCCCATTTT AATTTGAG 544
Lys Val Leu Glu Ser Leu 
150 

ATTTTTTAGC CATT 558

(2) INFORMATION FOR SEQ ID NO: 1210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210: 



Met Glu Lys Leu Glu Val Gly Gln Leu Ala Pro Asp Phe Arg Leu Lys
1 5 10 15

Asn Ser Asp Gly Val Glu Ile Ser Leu Lys Asp Leu Leu His Lys Lys
20 25 30

Val Val Leu Tyr Phe Tyr Pro Lys Asp Asn Thr Pro Gly Cys Thr Leu
35 40 45

Glu Ala Lys Asp Phe Ser Ala Leu Phe Ser Glu Phe Glu Lys Lys Asn
50 55 60

Ala Val Val Val Gly Ile Ser Pro Asp Asn Ala Gln Ser His Gln Lys
65 70 75 80

Phe Ile Ser Gln Cys Ser Leu Asn Val Ile Leu Leu Cys Asp Glu Asp
85 90 95

Lys Lys Ala Ala Asn Leu Tyr Lys Ala Tyr Gly Lys Arg Met Leu Tyr
100 105 110

Gly Lys Glu His Leu Gly Ile Ile Arg Ser Thr Phe Ile Ile Asn Thr
115 120 125

Gln Gly Val Leu Glu Lys Cys Phe Tyr Asn Val Lys Ala Lys Gly His
130 135 140

Ala Gln Lys Val Leu Glu Ser Leu
145 150



(2) INFORMATION FOR SEQ ID NO: 1211:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 700 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19...651
(D) OTHER INFORMATION: 
(A) LENGTH: 558 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...506
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209: 

GGGTGCATGG GCCTCAAAAA GTCGCTATCA TTCTCTACTA AAGGATAAGA ATG GAA 56
Met Glu
1

AAA TTA GAA GTA GGG CAA TTA GCC CCT GAT TTT AGA TTG AAA AAC AGC 104
Lys Leu Glu Val Gly Gln Leu Ala Pro Asp Phe Arg Leu Lys Asn Ser
5 10 15

GAT GGC GTG GAA ATT TCT TTA AAA GAT TTG CTC CAT AAA AAA GTG GTG 152
Asp Gly Val Glu Ile Ser Leu Lys Asp Leu Leu His Lys Lys Val Val
20 25 30

CTG TAT TTC TAC CCT AAA GAC AAC ACC CCC GGA TGC ACT CTA GAA GCC 200
Leu Tyr Phe Tyr Pro Lys Asp Asn Thr Pro Gly Cys Thr Leu Glu Ala
35 40 45 50

AAA GAC TTT AGC GCT CTA TTT AGT GAA TTT GAA AAG AAA AAC GCT GTT 248
Lys Asp Phe Ser Ala Leu Phe Ser Glu Phe Glu Lys Lys Asn Ala Val
55 60 65

GTC GTA GGC ATA AGC CCT GAT AAC GCG CAA TCG CAT CAA AAA TTT ATC 296
Val Val Gly Ile Ser Pro Asp Asn Ala Gln Ser His Gln Lys Phe Ile
70 75 80

AGC CAA TGC TCT TTG AAT GTG ATT TTG CTC TGC GAT GAA GAT AAA AAA 344
Ser Gln Cys Ser Leu Asn Val Ile Leu Leu Cys Asp Glu Asp Lys Lys
85 90 95

GCC GCC AAT CTT TAC AAA GCT TAT GGC AAA CGC ATG CTT TAT GGG AAG 392
Ala Ala Asn Leu Tyr Lys Ala Tyr Gly Lys Arg Met Leu Tyr Gly Lys
100 105 110

GAG CAT TTG GGG ATT ATC CGC TCC ACC TTC ATT ATC AAC ACG CAA GGC 440
Glu His Leu Gly Ile Ile Arg Ser Thr Phe Ile Ile Asn Thr Gln Gly
115 120 125 130

GTT TTA GAA AAA TGT TTC TAC AAT GTC AAA GCG AAA GGC CAT GCT CAA 488
Val Leu Glu Lys Cys Phe Tyr Asn Val Lys Ala Lys Gly His Ala Gln
135 140 145
65 70 75 80

TTA AGC CAA GAA GAT CGC AAC AAG CGT AAA AAA GAA ATT CTT GAA GCG 348
Leu Ser Gln Glu Asp Arg Asn Lys Arg Lys Lys Glu Ile Leu Glu Ala
85 90 95

ATT GCT AAC AAA AAG AAA ACA ATG ACC ATG AAA GAA TAT CGT GAA GAA 396
Ile Ala Asn Lys Lys Lys Thr Met Thr Met Lys Glu Tyr Arg Glu Glu
100 105 110

GGG TTG GAT TTG CAT GAT TGC GCA TGC GAA GGC CGT TTT CAT GAT CAT 444
Gly Leu Asp Leu His Asp Cys Ala Cys Glu Gly Pro Phe His Asp His
115 120 125

GAG AGA AAA AAA GGG AAA AAA CCA AGC CAT CAT AAG CAT TAGCGCTTAG GG 495
Glu Arg Lys Lys Gly Lys Lys Pro Ser His His Lys His 
130 135 140 

TGTGCTAACT TTTTTTGATT TTTGTGAAAC CACGCCGTAA GTCCCTAGCT TTTGGCTGTG 555 
GGGATTAAGG 565 

(2) INFORMATION FOR SEQ ID NO: 1208:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 141 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1208: 

Met Lys Lys Leu Ala Val Ser Leu Leu Phe Thr Gly Thr Phe Leu Gly
1 5 10 15

Leu Phe Leu Asn Ala Ser Asp Phe Lys Ser Met Asp Asp Lys Gln Leu
20 25 30

Leu Glu Gln Ala Gly Lys Val Ala Pro Ser Glu Val Pro Glu Phe Arg
35 40 45

Ala Glu Val Asn Lys Arg Leu Ala Val Met Lys Glu Glu Asp Arg Lys
50 55 60

Asn Tyr Lys Ala Asp Phe Lys Lys Ala Met Asp Lys Asn Leu Ala Ser
65 70 75 80

Leu Ser Gln Glu Asp Arg Asn Lys Arg Lys Lys Glu Ile Leu Glu Ala
85 90 95

Ile Ala Asn Lys Lys Lys Thr Met Thr Met Lys Glu Tyr Arg Glu Glu
100 105 110

Gly Leu Asp Leu His Asp Cys Ala Cys Glu Gly Pro Phe His Asp His 

115 120 125 

Glu Arg Lys Lys Gly Lys Lys Pro Ser His His Lys His 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 1209:

(i) SEQUENCE CHARACTERISTICS: 
Asp Phe Leu Ser Leu Phe Asp Gly Phe Ser Ile Gly Ser Asn Asp Leu
675 680 685

Thr Gln Leu Thr Leu Gly Val Asp Arg Asp Ser Glu Leu Val Ser His
690 695 700

Val Phe Asp Glu Arg Asn Glu Ala Met Leu Lys Met Phe Lys Lys Ala
705 710 715 720

Ile Glu Ala Cys Lys Arg His Asn Lys Tyr Cys Gly Ile Cys Gly Gln
725 730 735

Ala Pro Ser Asp Tyr Pro Glu Val Thr Glu Phe Leu Val Lys Glu Gly
740 745 750

Ile Thr Ser Ile Ser Leu Asn Pro Asp Ser Val Ile Pro Thr Trp Asn
755 760 765

Ala Val Ala Lys Leu Glu Lys Glu Leu Lys Glu His Gly Leu Thr Glu
770 775 780

His
785



(2) INFORMATION FOR SEQ ID NO: 1207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 61... 483 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207:

GTTATCTTTA ATCAATCAGA TGATAGAATT TATCTTTTAT TTTTGAATTG GGAGCATTTG 60

ATG AAA AAA TTA GCG GTT TCT TTA TTA TTT ACA GGG ACT TTT TTG GGG 108
Met Lys Lys Leu Ala Val Ser Leu Leu Phe Thr Gly Thr Phe Leu Gly
1 5 10 15

CTT TTT TTG AAT GCG AGC GAT TTT AAG AGC ATG GAT GAC AAG CAA CTA 156
Leu Phe Leu Asn Ala Ser Asp Phe Lys Ser Met Asp Asp Lys Gln Leu
20 25 30

TTA GAG CAA GCA GGG AAA GTT GCT CCT AGC GAA GTC CCT GAG TTT CGC 204
Leu Glu Gln Ala Gly Lys Val Ala Pro Ser Glu Val Pro Glu Phe Arg
35 40 45

GCG GAA GTC AAT AAG CGA TTA GCA GTG ATG AAA GAA GAA GAT CGT AAA 252
Ala Glu Val Asn Lys Arg Leu Ala Val Met Lys Glu Glu Asp Arg Lys
50 55 60

AAT TAT AAA GCG GAT TTT AAG AAA GCG ATG GAT AAG AAT TTA GCT TCT 300

Asn Tyr Lys Ala Asp Phe Lys Lys Ala Met Asp Lys Asn Leu Ala Ser 
225 230 235 240

Arg Gly Ser Glu His Pro Thr Arg Asn Ile Lys Thr Thr Lys Lys Glu
245 250 255

Trp Gln Ser Phe Ser Leu Ser Asp Glu Asp Val Leu Ile Leu Ala Lys
260 265 270

Tyr Ala Ile Glu Ile Glu Lys His Tyr Ser Lys Glu Ala Lys Gln Tyr
275 280 285

Arg Pro Met Asp Ile Glu Trp Ala Lys Asp Gly Glu Ser Gly Glu Ile
290 295 300

Phe Ile Val Gln Ala Arg Pro Glu Thr Val Gln Ser Gln Lys Ser Lys
305 310 315 320

Glu Glu Ser Gln Val Phe Glu Lys Phe Lys Phe Lys Asn Pro Asn Glu
325 330 335

Lys Lys Glu Ile Ile Leu Gln Gly Arg Ala Ile Gly Ser Lys Ile Gly
340 345 350

Ser Gly Lys Val Arg Ile Ile Asn Asp Leu Glu His Met Asn Ser Phe
355 360 365

Lys Glu Gly Glu Ile Leu Val Thr Asp Asn Thr Asp Pro Asp Trp Glu
370 375 380

Pro Cys Met Lys Lys Ala Ser Ala Val Ile Thr Asn Arg Gly Gly Arg
385 390 395 400

Thr Cys His Ala Ala Ile Val Ala Arg Glu Ile Gly Val Pro Ala Ile
405 410 415

Val Gly Val Ser Gly Ala Thr Asp Ser Leu Tyr Thr Gly Met Glu Ile
420 425 430

Thr Val Ser Cys Ala Glu Gly Glu Glu Gly Tyr Val Tyr Ala Gly Ile
435 440 445

Tyr Glu His Glu Ile Glu Arg Val Glu Leu Ser Asn Met Gln Glu Thr
450 455 460

Gln Thr Lys Ile Tyr Ile Asn Ile Gly Asn Pro Glu Lys Ala Phe Gly
465 470 475 480

Phe Ser Gln Leu Pro Asn His Gly Val Gly Leu Ala Arg Met Glu Met
485 490 495

Ile Ile Leu Asn Gln Ile Lys Ala His Pro Leu Ala Leu Val Asp Leu
500 505 510

His His Lys Lys Ser Val Lys Glu Lys Asn Glu Ile Glu Asn Leu Met
515 520 525

Ala Gly Tyr Ala Asn Pro Lys Asp Phe Phe Val Lys Lys Ile Ala Glu
530 535 540

Gly Ile Gly Met Ile Ser Ala Ala Phe Tyr Pro Lys Pro Val Ile Val
545 550 555 560

Arg Thr Ser Asp Phe Lys Ser Asn Glu Tyr Met Arg Met Leu Gly Gly
565 570 575

Ser Ser Tyr Glu Pro Asn Glu Glu Asn Pro Met Leu Gly Tyr Arg Gly
580 585 590

Ala Ser Arg Tyr Tyr Ser Glu Ser Tyr Asn Glu Ala Phe Ser Trp Glu
595 600 605

Cys Glu Ala Leu Ala Leu Val Arg Glu Glu Met Gly Leu Thr Asn Met
610 615 620

Lys Val Met Ile Pro Phe Leu Arg Thr Ile Glu Glu Gly Lys Lys Val
625 630 635 640

Leu Glu Ile Leu Arg Lys Asn Asn Leu Glu Ser Gly Lys Asn Gly Leu
645 650 655

Glu Ile Tyr Ile Met Cys Glu Leu Pro Val Asn Val Ile Leu Ala Asp
660 665 670
670 
ACA GAG TTT TTA GTC AAA GAG GGC ATC ACT TCC ATT TCT TTA AAC CCT 2370

Thr Glu Phe Leu Val Lys Glu Gly Ile Thr Ser Ile Ser Leu Asn Pro
745 750 755 760

GAT AGC GTG ATC CCC ACT TGG AAC GCT GTA GCC AAG TTA GAA AAA GAA 2418
Asp Ser Val Ile Pro Thr Trp Asn Ala Val Ala Lys Leu Glu Lys Glu
765 770 775

CTA AAA GAA CAT GGC TTA ACT GAA CAT TGATAATAAA TAAATCAATC TAACTTG 2472
Leu Lys Glu His Gly Leu Thr Glu His
780 785

AGTGGATTTT TCGTATTAGT TTCCAT 2498

(2) INFORMATION FOR SEQ ID NO: 1206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1206:

Met Phe Gln Glu Leu Val Pro Ile Gly Ile Lys Val Pro Asp Gly Phe
1 5 10 15

Ala Ile Thr Ser Glu Ala Tyr Trp Tyr Leu Leu Glu Gln Gly Gly Ala
20 25 30

Lys Gln Lys Ile Ile Glu Leu Leu Glu Asn Val Asp Ala Thr Glu Ile
35 40 45

Asp Val Leu Lys Ile Arg Ser Lys Gln Ile Arg Glu Leu Ile Phe Gly
50 55 60

Thr Pro Phe Pro Ser Asp Leu Arg Asp Glu Ile Phe Gln Ala Tyr Glu
65 70 75 80

Ile Leu Ser Gln Gln Tyr His Met Lys Glu Ala Asp Val Ala Val Arg
85 90 95

Ser Ser Ala Thr Ala Glu Asp Leu Pro Asp Ala Ser Phe Ala Gly Gln
100 105 110

Gln Asp Thr Tyr Leu Asn Ile Lys Gly Lys Thr Glu Leu Ile His Tyr
115 120 125

Ile Lys Ser Cys Leu Ala Ser Leu Phe Thr Asp Arg Ala Ile Ser Tyr
130 135 140

Arg Ala Ser Arg Gly Phe Asp His Leu Lys Val Ala Leu Ser Val Gly
145 150 155 160

Val Gln Lys Met Val Arg Ala Asp Lys Gly Ser Ala Gly Val Met Phe
165 170 175

Ser Ile Asp Thr Glu Thr Gly Phe Lys Asp Ala Val Phe Ile Thr Ser
180 185 190

Ala Trp Gly Leu Gly Glu Asn Val Val Gly Gly Thr Ile Asn Pro Asp
195 200 205

Glu Phe Tyr Val Phe Lys Pro Thr Leu Glu Gln Asn Lys Arg Pro Ile
210 215 220

Ile Lys Arg Gln Leu Gly Asn Lys Thr Gln Lys Met Val Tyr Ala Pro
AAA AAT GAA ATT GAA AAC CTC ATG GCA GGC TAT GCT AAC CCT AAA GAT 1698

Lys Asn Glu Ile Glu Asn Leu Met Ala Gly Tyr Ala Asn Pro Lys Asp
525 530 535

TTT TTT GTG AAA AAA ATC GCT GAA GGC ATT GGC ATG ATC AGT GCA GCG 1746
Phe Phe Val Lys Lys Ile Ala Glu Gly Ile Gly Met Ile Ser Ala Ala
540 545 550

TTT TAC CCT AAA CCT GTC ATT GTG AGA ACG AGC GAT TTC AAA TCC AAT 1794
Phe Tyr Pro Lys Pro Val Ile Val Arg Thr Ser Asp Phe Lys Ser Asn
555 560 565

GAA TAC ATG CGC ATG CTT GGC GGC TCT AGC TAT GAG CCT AAT GAA GAA 1842
Glu Tyr Met Arg Met Leu Gly Gly Ser Ser Tyr Glu Pro Asn Glu Glu
570 575 580

AAC CCC ATG CTT GGC TAT AGG GGG GCT AGT CGG TAT TAT TCA GAG AGC 1890
Asn Pro Met Leu Gly Tyr Arg Gly Ala Ser Arg Tyr Tyr Ser Glu Ser
585 590 595 600

TAT AAT GAA GCG TTT TCG TGG GAG TGT GAA GCC TTA GCG TTA GTG AGG 1938
Tyr Asn Glu Ala Phe Ser Trp Glu Cys Glu Ala Leu Ala Leu Val Arg
605 610 615

GAA GAA ATG GGA TTA ACC AAC ATG AAA GTG ATG ATC CCT TTT TTG CGA 1986
Glu Glu Met Gly Leu Thr Asn Met Lys Val Met Ile Pro Phe Leu Arg
620 625 630

ACC ATT GAA GAG GGT AAA AAA GTC CTA GAA ATC TTA AGA AAA AAC AAT 2034
Thr Ile Glu Glu Gly Lys Lys Val Leu Glu Ile Leu Arg Lys Asn Asn
635 640 645

TTA GAA TCC GGT AAA AAC GGG CTT GAA ATT TAT ATC ATG TGC GAA TTG 2082
Leu Glu Ser Gly Lys Asn Gly Leu Glu Ile Tyr Ile Met Cys Glu Leu
650 655 660

CCG GTG AAT GTC ATT TTG GCT GAT GAT TTC TTG AGC TTG TTT GAT GGC 2130
Pro Val Asn Val Ile Leu Ala Asp Asp Phe Leu Ser Leu Phe Asp Gly
665 670 675 680

TTT TCT ATT GGA TCA AAC GAT TTA ACC CAG CTC ACT TTA GGC GTG GAT 2178
Phe Ser Ile Gly Ser Asn Asp Leu Thr Gln Leu Thr Leu Gly Val Asp
685 690 695

AGA GAC AGC GAA TTG GTC AGC CAT GTC TTT GAT GAA AGG AAT GAA GCG 2226
Arg Asp Ser Glu Leu Val Ser His Val Phe Asp Glu Arg Asn Glu Ala
700 705 710

ATG CTA AAG ATG TTT AAA AAA GCG ATT GAA GCT TGC AAA AGG CAC AAC 2274
Met Leu Lys Met Phe Lys Lys Ala Ile Glu Ala Cys Lys Arg His Asn
715 720 725

AAA TAT TGC GGG ATT TGC GGG CAA GCC CCA AGC GAT TAC CCT GAA GTA 2322
Lys Tyr Cys Gly Ile Cys Gly Gln Ala Pro Ser Asp Tyr Pro Glu Val
730 735 740 
AAA GAT GGC GAG AGC GGG GAA ATC TTT ATC GTT CAA GCG CGC CCA GAA 1026
Lys Asp Gly Glu Ser Gly Glu Ile Phe Ile Val Gln Ala Arg Pro Glu
300 305 310

ACC GTT CAA AGC CAA AAA AGT AAA GAA GAA AGT CAA GTC TTT GAA AAA 1074
Thr Val Gln Ser Gln Lys Ser Lys Glu Glu Ser Gln Val Phe Glu Lys
315 320 325

TTC AAA TTC AAA AAC CCT AAC GAA AAG AAA GAG ATT ATC TTA CAA GGC 1122
Phe Lys Phe Lys Asn Pro Asn Glu Lys Lys Glu Ile Ile Leu Gln Gly
330 335 340

AGA GCG ATT GGG AGT AAA ATT GGC TCA GGA AAA GTG CGC ATC ATC AAT 1170
Arg Ala Ile Gly Ser Lys Ile Gly Ser Gly Lys Val Arg Ile Ile Asn
345 350 355 360

GAT TTG GAG CAC ATG AAT TCT TTT AAA GAG GGC GAA ATT TTA GTT ACG 1218
Asp Leu Glu His Met Asn Ser Phe Lys Glu Gly Glu Ile Leu Val Thr
365 370 375

GAT AAC ACC GAT CCG GAC TGG GAG CCT TGC ATG AAA AAA GCG AGC GCG 1266
Asp Asn Thr Asp Pro Asp Trp Glu Pro Cys Met Lys Lys Ala Ser Ala
380 385 390

GTT ATC ACT AAT CGT GGA GGG CGC ACT TGC CAT GCC GCT ATT GTG GCG 1314
Val Ile Thr Asn Arg Gly Gly Arg Thr Cys His Ala Ala Ile Val Ala
395 400 405

AGA GAA ATT GGC GTG CCA GCT ATC GTT GGG GTG AGC GGG GCG ACT GAT 1362
Arg Glu Ile Gly Val Pro Ala Ile Val Gly Val Ser Gly Ala Thr Asp
410 415 420

AGC CTT TAT ACC GGC ATG GAA ATC ACG GTT TCT TGC GCT GAG GGC GAA 1410
Ser Leu Tyr Thr Gly Met Glu Ile Thr Val Ser Cys Ala Glu Gly Glu
425 430 435 440

GAG GGC TAT GTG TAT GCG GGC ATT TAT GAG CAT GAA ATT GAA AGG GTG 1458
Glu Gly Tyr Val Tyr Ala Gly Ile Tyr Glu His Glu Ile Glu Arg Val
445 450 455

GAG CTT TCT AAC ATG CAA GAA ACT CAA ACA AAA ATT TAC ATC AAT ATT 1506
Glu Leu Ser Asn Met Gln Glu Thr Gln Thr Lys Ile Tyr Ile Asn Ile
460 465 470

GGA AAC CCT GAA AAA GCC TTT GGC TTT TCT CAA CTC CCT AAT CAC GGC 1554
Gly Asn Pro Glu Lys Ala Phe Gly Phe Ser Gln Leu Pro Asn His Gly
475 480 485

GTA GGG CTA GCC AGG ATG GAA ATG ATT ATT TTA AAT CAA ATC AAA GCC 1602
Val Gly Leu Ala Arg Met Glu Met Ile Ile Leu Asn Gln Ile Lys Ala
490 495 500

CAC CCT TTA GCT TTA GTG GAT TTG CAC CAC AAA AAA AGC GTG AAA GAA 1650
His Pro Leu Ala Leu Val Asp Leu His His Lys Lys Ser Val Lys Glu
505 510 515 520






WO 98/43478



PCT/US98/06371 



GAT GAG ATT TTT CAA GCT TAT GAG ATT TTA AGC CAG CAA TAC CAC ATG 354
Asp Glu Ile Phe Gln Ala Tyr Glu Ile Leu Ser Gln Gln Tyr His Met
75 80 85

AAA GAA GCC GAT GTG GCT GTA AGG AGT TCC GCT ACT GCA GAA GAT TTG 402
Lys Glu Ala Asp Val Ala Val Arg Ser Ser Ala Thr Ala Glu Asp Leu
90 95 100

CCG GAC GCT TCT TTT GCC GGG CAG CAA GAC ACT TAT TTA AAC ATT AAG 450
Pro Asp Ala Ser Phe Ala Gly Gln Gln Asp Thr Tyr Leu Asn Ile Lys
105 110 115 120

GGT AAA ACC GAA TTG ATC CAC TAT ATC AAA TCC TGT TTA GCG TCG CTT 498
Gly Lys Thr Glu Leu Ile His Tyr Ile Lys Ser Cys Leu Ala Ser Leu
125 130 135

TTT ACC GAT AGA GCG ATT AGC TAT AGG GCG AGT CGT GGG TTT GAT CAT 546
Phe Thr Asp Arg Ala Ile Ser Tyr Arg Ala Ser Arg Gly Phe Asp His
140 145 150

TTA AAA GTC GCG CTC AGC GTG GGG GTG CAA AAA ATG GTG CGA GCG GAT 594
Leu Lys Val Ala Leu Ser Val Gly Val Gln Lys Met Val Arg Ala Asp
155 160 165

AAA GGC AGC GCG GGC GTG ATG TTT TCT ATT GAC ACC GAA ACC GGT TTT 642
Lys Gly Ser Ala Gly Val Met Phe Ser Ile Asp Thr Glu Thr Gly Phe
170 175 180

AAA GAC GCG GTG TTT ATC ACT TCA GCG TGG GGG TTA GGC GAA AAT GTG 690
Lys Asp Ala Val Phe Ile Thr Ser Ala Trp Gly Leu Gly Glu Asn Val
185 190 195 200

GTG GGT GGC ACG ATA AAC CCT GAT GAA TTT TAT GTG TTT AAG CCC ACT 738
Val Gly Gly Thr Ile Asn Pro Asp Glu Phe Tyr Val Phe Lys Pro Thr
205 210 215

TTA GAG CAA AAC AAA CGC CCC ATT ATC AAA CGC CAA CTC GGC AAT AAA 786
Leu Glu Gln Asn Lys Arg Pro Ile Ile Lys Arg Gln Leu Gly Asn Lys
220 225 230

ACG CAA AAA ATG GTC TAT GCC CCA AGG GGT AGC GAA CAC CCC ACC AGA 834
Thr Gln Lys Met Val Tyr Ala Pro Arg Gly Ser Glu His Pro Thr Arg
235 240 245

AAC ATT AAA ACC ACC AAA AAA GAA TGG CAA TCC TTT TCA TTG AGC GAT 882
Asn Ile Lys Thr Thr Lys Lys Glu Trp Gln Ser Phe Ser Leu Ser Asp
250 255 260

GAA GAC GTG CTG ATT TTA GCC AAA TAC GCC ATT GAA ATT GAA AAA CAC 930
Glu Asp Val Leu Ile Leu Ala Lys Tyr Ala Ile Glu Ile Glu Lys His
265 270 275 280

TAC TCT AAA GAA GCC AAA CAA TAC CGC CCT ATG GAT ATA GAA TGG GCT 978
Tyr Ser Lys Glu Ala Lys Gln Tyr Arg Pro Met Asp Ile Glu Trp Ala
285 290 295
Arg His Asn Arg Leu Leu Pro Arg Asp Tyr Leu Gly Tyr Arg Ser Leu
115 120 125

Gly Glu Glu Ile Ser Ile Phe Asn Pro Lys Asp Tyr Asp Ser Trp Gln
130 135 140

Glu Arg Ala Asp Thr Glu Ile Leu Arg Gln Leu Gln Glu Ser Lys Lys
145 150 155 160

Tyr Phe Val Phe Ile Lys Gly Cys Gly Ile Phe Ala Tyr His Arg Glu
165 170 175

Leu Ser Lys Leu Met Glu Val Phe Asp Leu Ile Glu Asn Ser Cys Lys
180 185 190

Val Leu Arg Leu Gly Asp Leu Met Asp Tyr Cys Tyr Asn Asp Asp Pro
195 200 205

Arg Leu Ser Val
210



(2) INFORMATION FOR SEQ ID NO: 1205:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2498 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 91...2445
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205:

GGGAGTTTTG TGCGATATAT CAAGTTTTTC AAAGAGTTGA ACAATAAAAA TGTGAATCTG 60

GTTGGGGGCA AGAACGCTAG TATTGGTGAA ATG TTT CAA GAA TTA GTG CCT ATT 114
Met Phe Gln Glu Leu Val Pro Ile
1 5

GGT ATT AAA GTG CCT GAT GGC TTT GCG ATC ACC AGC GAA GCG TAT TGG 162
Gly Ile Lys Val Pro Asp Gly Phe Ala Ile Thr Ser Glu Ala Tyr Trp
10 15 20

TAT CTT TTA GAG CAA GGA GGG GCT AAA CAA AAA ATC ATA GAG CTT TTA 210
Tyr Leu Leu Glu Gln Gly Gly Ala Lys Gln Lys Ile Ile Glu Leu Leu
25 30 35 40

GAA AAT GTT GAT GCC ACC GAA ATT GAT GTG TTA AAA ATC CGC TCC AAA 258
Glu Asn Val Asp Ala Thr Glu Ile Asp Val Leu Lys Ile Arg Ser Lys
45 50 55

CAA ATC AGA GAG CTT ATT TTT GGC ACG CCT TTT CCT AGC GAT TTG AGA 306
Gln Ile Arg Glu Leu Ile Phe Gly Thr Pro Phe Pro Ser Asp Leu Arg
60 65 70
CGA TTG CTC CCT AGA GAT TAT TTA GGG TAT CGT TCT TTG GGC GAA GAA 442
Arg Leu Leu Pro Arg Asp Tyr Leu Gly Tyr Arg Ser Leu Gly Glu Glu
120 125 130

ATT TCC ATT TTT AAC CCC AAA GAC TAT GAC AGC TGG CAA GAA AGA GCG 490
Ile Ser Ile Phe Asn Pro Lys Asp Tyr Asp Ser Trp Gln Glu Arg Ala
135 140 145

GAT ACA GAA ATT TTA CGC CAA CTG CAA GAG AGC AAA AAA TAT TTT GTT 538
Asp Thr Glu Ile Leu Arg Gln Leu Gln Glu Ser Lys Lys Tyr Phe Val
150 155 160

TTC ATT AAG GGG TGT GGG ATT TTT GCC TAC CAC AGA GAG CTT TCT AAA 586
Phe Ile Lys Gly Cys Gly Ile Phe Ala Tyr His Arg Glu Leu Ser Lys
165 170 175

CTC ATG GAA GTT TTT GAT TTG ATT GAA AAC TCA TGC AAG GTT TTA CGA 634
Leu Met Glu Val Phe Asp Leu Ile Glu Asn Ser Cys Lys Val Leu Arg
180 185 190 195

TTG GGC GAT TTA ATG GAT TAT TGC TAT AAT GAT GAT CCA CGA TTG AGC 682
Leu Gly Asp Leu Met Asp Tyr Cys Tyr Asn Asp Asp Pro Arg Leu Ser
200 205 210

GTG TAAAAAGCTA AAAAGGATAA AACATGACCA TCAAC 720
Val



(2) INFORMATION FOR SEQ ID NO: 1204:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 212 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1204:



Met Ile Asn Met Asn Thr His Thr Arg Gly Ile Asp Ser Asn Leu Ile
1 5 10 15

His Ser Leu Gln Ser Ile Ser Leu Ser Met Phe Arg Lys Gly Phe Phe
20 25 30

Gly Leu Tyr Gln Gly Ser Ile Ser Ala Arg Ile Gly Ala Asn Gln Phe
35 40 45

Val Ile Asn Lys Arg Asn Ala Val Phe Asp Gln Leu Asn Glu Asn Thr
50 55 60

Leu Leu Val Leu His Asp Lys Ile Asp Tyr Arg Trp Lys Glu Ala Ser
65 70 75 80

Leu Asp Ser Pro Ile His Ala Ser Val Tyr Arg Glu Phe Leu Asp Ala
85 90 95

Lys Phe Ile Ala Tyr Ala Arg Pro Pro Tyr Ser Leu Ala Tyr Ser Leu
100 105 110
260 265 270

Ser Glu Glu Glu 
275 

(2) INFORMATION FOR SEQ ID NO:1203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 50...685
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1203: 

AGCTCTTTAC TATTTTATTA TCTATCTTTT ATTAAAAAAA CTTGTTATC ATG ATA AAC 58
Met Ile Asn
1

ATG AAC ACA CAC ACA AGA GGC ATT GAC AGC AAT CTG ATT CAT TCG CTC 106
Met Asn Thr His Thr Arg Gly Ile Asp Ser Asn Leu Ile His Ser Leu
5 10 15

CAA AGC ATT TCA TTA TCC ATG TTT AGA AAG GGT TTT TTT GGG CTT TAT 154
Gln Ser Ile Ser Leu Ser Met Phe Arg Lys Gly Phe Phe Gly Leu Tyr
20 25 30 35

CAA GGC TCT ATT TCA GCA CGC ATT GGC GCA AAT CAA TTT GTG ATC AAC 202
Gln Gly Ser Ile Ser Ala Arg Ile Gly Ala Asn Gln Phe Val Ile Asn
40 45 50

AAA AGA AAC GCT GTT TTT GAT CAA TTG AAT GAA AAC ACC TTA CTG GTT 250
Lys Arg Asn Ala Val Phe Asp Gln Leu Asn Glu Asn Thr Leu Leu Val
55 60 65

TTG CAT GAC AAA ATA GAT TAC CGC TGG AAA GAA GCG AGC TTG GAT TCG 298
Leu His Asp Lys Ile Asp Tyr Arg Trp Lys Glu Ala Ser Leu Asp Ser
70 75 80

CCC ATT CAT GCG AGC GTG TAT AGG GAG TTT TTG GAC GCT AAA TTC ATC 346
Pro Ile His Ala Ser Val Tyr Arg Glu Phe Leu Asp Ala Lys Phe Ile
85 90 95

GCT TAC GCG CGC CCT CCT TAT AGT TTG GCG TAT TCC TTG CGC CAC AAC 394
Ala Tyr Ala Arg Pro Pro Tyr Ser Leu Ala Tyr Ser Leu Arg His Asn
100 105 110 115
CCT GTA GAA TTT CAA AAC AAG CGG ATG CAA CTG CTT TGC GTG GGG AAA 825
Pro Val Glu Phe Gln Asn Lys Arg Met Gln Leu Leu Cys Val Gly Lys
245 250 255 260

CTA GAA TGC GAT TAT GAA GGG TTT TTT CAA ACG ATT TCT GAG GAG GAA T 874
Leu Glu Cys Asp Tyr Glu Gly Phe Phe Gln Thr Ile Ser Glu Glu Glu
265 270 275

AATGAAAGAT GAACACAACC AAGAACACGA TCATTTAA 912 

(2) INFORMATION FOR SEQ ID NO: 1202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202: 

Met Val Ile Asp Glu Ile Phe Gln Ile Met Met Leu Arg Arg Ile Lys
1 5 10 15

Val Gly Ser Asn Leu Asn Lys Lys Glu Ser Leu Leu Asp Ala Phe Val
20 25 30

Lys Thr Tyr Leu Gln Ile Leu Glu Pro Ile Ser Ser Lys Arg Leu Lys
35 40 45

Glu Leu Ala Asp Leu Lys Ile Ser Cys Ala Thr Ile Arg Asn Tyr Phe
50 55 60

Gln Ile Leu Ser Lys Glu Gly Met Leu Tyr Gln Ala His Ser Ser Gly
65 70 75 80

Ala Arg Leu Pro Thr Phe Lys Ala Phe Glu Asn Tyr Trp Gln Lys Ser
85 90 95

Leu Arg Phe Glu Thr Leu Lys Val Asn Glu Lys Arg Leu Lys Ser Ala
100 105 110

Ser Glu Asn Phe Gly Leu Phe Thr Leu Leu Lys Lys Pro Ser Leu Glu
115 120 125

Arg Leu Glu Arg Val Ile Glu Cys Glu Lys Arg Phe Leu Ile Leu Asp
130 135 140

Phe Leu Ala Phe Ser Cys Ala Leu Gly Tyr Ser Val Lys Met Glu Lys
145 150 155 160

Phe Leu Leu Glu Leu Val Gly Arg Ser Val Lys Glu Val Arg Ser Ile
165 170 175

Ala Ala Ser Phe Asn Ala Leu Ser Leu Ala Arg Gln Leu Glu Arg Leu
180 185 190

Glu Tyr Ser Asn Thr Gln Ile Thr Arg Phe Asn Leu Met Gly Leu Lys
195 200 205

Thr Leu Leu Asn Ser Pro Leu Phe Phe Asp Ile Leu Gly Gly Lys Val
210 215 220

Leu Glu Arg Leu Ser Lys Gly Leu His Phe Ile Glu Pro Asp Cys Met
225 230 235 240

Leu Val Thr Arg Pro Val Glu Phe Gln Asn Lys Arg Met Gln Leu Leu
245 250 255

Cys Val Gly Lys Leu Glu Cys Asp Tyr Glu Gly Phe Phe Gln Thr Ile
Leu Asn Lys Lys Glu Ser Leu Leu Asp Ala Phe Val Lys Thr Tyr Leu
25 30 35

CAG ATT TTA GAA CCC ATT AGT TCT AAA CGC TTA AAA GAG TTG GCG GAC 201
Gln Ile Leu Glu Pro Ile Ser Ser Lys Arg Leu Lys Glu Leu Ala Asp
40 45 50

TTG AAA ATA TCT TGC GCG ACG ATC AGG AAT TAT TTT CAA ATC CTT TCT 249
Leu Lys Ile Ser Cys Ala Thr Ile Arg Asn Tyr Phe Gln Ile Leu Ser
55 60 65

AAA GAG GGC ATG CTT TAT CAA GCC CAT TCT AGT GGC GCT AGA TTG CCC 297
Lys Glu Gly Met Leu Tyr Gln Ala His Ser Ser Gly Ala Arg Leu Pro
70 75 80

ACT TTT AAG GCG TTT GAA AAC TAT TGG CAA AAG TCG TTG CGC TTT GAA 345
Thr Phe Lys Ala Phe Glu Asn Tyr Trp Gln Lys Ser Leu Arg Phe Glu
85 90 95 100

ACT TTA AAG GTG AAT GAA AAA CGC CTA AAA AGC GCG AGT GAA AAT TTT 393
Thr Leu Lys Val Asn Glu Lys Arg Leu Lys Ser Ala Ser Glu Asn Phe
105 110 115

GGG CTT TTC ACG CTG TTA AAA AAA CCC AGT TTG GAG CGT TTA GAA AGA 441
Gly Leu Phe Thr Leu Leu Lys Lys Pro Ser Leu Glu Arg Leu Glu Arg
120 125 130

GTC ATT GAG TGC GAA AAA CGC TTT TTG ATT TTG GAC TTT TTG GCG TTT 489
Val Ile Glu Cys Glu Lys Arg Phe Leu Ile Leu Asp Phe Leu Ala Phe
135 140 145

TCT TGC GCA CTG GGT TAC AGC GTT AAA ATG GAA AAG TTT TTA TTA GAG 537
Ser Cys Ala Leu Gly Tyr Ser Val Lys Met Glu Lys Phe Leu Leu Glu
150 155 160

CTT GTG GGC AGA AGC GTT AAA GAA GTG CGC TCA ATC GCT GCT TCT TTC 585
Leu Val Gly Arg Ser Val Lys Glu Val Arg Ser Ile Ala Ala Ser Phe
165 170 175 180

AAT GCG TTG AGT TTG GCC AGG CAA TTA GAG CGT TTG GAG TAT TCC AAC 633
Asn Ala Leu Ser Leu Ala Arg Gln Leu Glu Arg Leu Glu Tyr Ser Asn
185 190 195

ACA CAA ATC ACA CGC TTT AAT CTG ATG GGG TTA AAA ACG CTT TTA AAC 681
Thr Gln Ile Thr Arg Phe Asn Leu Met Gly Leu Lys Thr Leu Leu Asn
200 205 210

AGC CCT TTA TTT TTT GAC ATT TTA GGG GGT AAG GTT TTA GAG CGT TTG 729
Ser Pro Leu Phe Phe Asp Ile Leu Gly Gly Lys Val Leu Glu Arg Leu
215 220 225

AGT AAG GGT TTG CAT TTT ATA GAG CCT GAT TGC ATG CTA GTA ACA CGC 777
Ser Lys Gly Leu His Phe Ile Glu Pro Asp Cys Met Leu Val Thr Arg
230 235 240
Tyr Tyr Gln Asn Pro Leu Leu Leu Gly Ala Asp Ile Val Ala His Ser
180 185 190

Gly Thr Lys Tyr Leu Gly Gly His Ser Asp Val Val Ala Gly Leu Val
195 200 205

Thr Thr Asn Asn Glu Ala Leu Ala Gln Glu Ile Ala Phe Phe Gln Asn
210 215 220

Ala Ile Gly Gly Val Leu Gly Pro Gln Asp Ser Trp Leu Leu Gln Arg
225 230 235 240

Gly Ile Lys Thr Leu Gly Leu Arg Met Glu Ala His Gln Lys Asn Ala
245 250 255

Leu Cys Val Ala Glu Phe Leu Glu Lys His Pro Lys Val Glu Arg Val
260 265 270

Tyr Tyr Pro Gly Leu Pro Thr His Pro Asn Tyr Glu Leu Ala Lys Lys
275 280 285

Gln Met Arg Gly Phe Ser Gly Met Leu Ser Phe Thr Leu Lys Asn Asp
290 295 300

Ser Glu Ala Val Ala Phe Val Glu Ser Leu Lys Leu Phe Ile Leu Gly
305 310 315 320

Glu Ser Leu Gly Gly Val Glu Ser Leu Val Gly Ile Pro Ala Phe Met
325 330 335

Thr His Ala Cys Ile Pro Lys Thr Gln Arg Glu Ala Ala Gly Ile Arg
340 345 350

Asp Gly Leu Val Arg Leu Ser Val Gly Ile Glu His Glu Gln Asp Leu
355 360 365

Leu Glu Asp Leu Glu Gln Ala Phe Ala Lys Ile Gly
370 375 380











(2) INFORMATION FOR SEQ ID NO: 1201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 912 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 46...873
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201:

ATTATTAACT TTTTATGCTA TAATGCGAGG GTTCTTTCAT CAAGA ATG GTG ATT GAC 57
Met Val Ile Asp
1

GAG ATT TTT CAA ATA ATG ATG TTA AGA AGA ATT AAA GTA GGT TCT AAT 105
Glu Ile Phe Gln Ile Met Met Leu Arg Arg Ile Lys Val Gly Ser Asn
5 10 15 20

TTG AAT AAA AAA GAG AGT TTG TTA GAT GCG TTT GTT AAA ACC TAT CTG 153
CTT AAA CTA TTC ATT TTA GGC GAG AGT TTG GGC GGG GTG GAA AGT TTG 1011
Leu Lys Leu Phe Ile Leu Gly Glu Ser Leu Gly Gly Val Glu Ser Leu
315 320 325

GTG GGG ATT CCG GCA TTT ATG ACC CAT GCG TGC ATC CCT AAA ACG CAA 1059
Val Gly Ile Pro Ala Phe Met Thr His Ala Cys Ile Pro Lys Thr Gln
330 335 340 345

CGA GAA GCT GCT GGG ATT AGA GAT GGC CTG GTG CGC TTG TCT GTA GGG 1107
Arg Glu Ala Ala Gly Ile Arg Asp Gly Leu Val Arg Leu Ser Val Gly
350 355 360

ATT GAG CAT GAA CAG GAT TTG TTA GAA GAT TTA GAG CAA GCG TTC GCT 1155
Ile Glu His Glu Gln Asp Leu Leu Glu Asp Leu Glu Gln Ala Phe Ala
365 370 375

AAA ATA GGC TAAAGTTTCA TTACAATTTA TGAATAAAGG AGTTAAAAAC ATGAA 1209
Lys Ile Gly
380



(2) INFORMATION FOR SEQ ID NO: 1200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1200: 

Met Arg Met Gln Thr Lys Leu Ile His Gly Gly Ile Ser Glu Asp Ala
1 5 10 15

Thr Thr Gly Ala Val Ser Val Pro Ile Tyr Gln Thr Ser Thr Tyr Arg
20 25 30

Gln Asp Ala Ile Gly Arg His Lys Gly Tyr Glu Tyr Ser Arg Ser Gly
35 40 45

Asn Pro Thr Arg Phe Ala Leu Glu Glu Leu Ile Ala Asp Leu Glu Gly
50 55 60

Gly Val Lys Gly Phe Ala Phe Ala Ser Gly Leu Ala Gly Ile His Ala
65 70 75 80

Val Phe Ser Leu Leu Gln Ser Gly Asp His Val Leu Leu Gly Asp Asp
85 90 95

Val Tyr Gly Gly Thr Phe Arg Leu Phe Asn Gln Val Leu Val Lys Asn
100 105 110

Gly Leu Ser Cys Thr Ile Ile Asp Thr Ser Asp Ile Ser Gln Ile Lys
115 120 125

Lys Ala Ile Lys Pro Asn Thr Lys Ala Leu Tyr Leu Glu Thr Pro Ser
130 135 140

Asn Pro Leu Leu Lys Ile Thr Asp Leu Ala Gln Cys Ala Ser Val Ala
145 150 155 160

Lys Asp His Gly Leu Leu Thr Ile Val Asp Asn Thr Phe Ala Thr Pro
165 170 175
CAT GTG TTA TTG GGC GAT GAT GTT TAT GGG GGG ACT TTC CGC TTG TTT 339
His Val Leu Leu Gly Asp Asp Val Tyr Gly Gly Thr Phe Arg Leu Phe
90 95 100 105

AAT CAA GTG CTT GTC AAA AAC GGG CTT TCT TGC ACC ATT ATA GAC ACT 387
Asn Gln Val Leu Val Lys Asn Gly Leu Ser Cys Thr Ile Ile Asp Thr
110 115 120

AGC GAT ATA TCC CAA ATT AAA AAG GCT ATC AAG CCC AAC ACC AAA GCC 435
Ser Asp Ile Ser Gln Ile Lys Lys Ala Ile Lys Pro Asn Thr Lys Ala
125 130 135

CTT TAT TTA GAA ACC CCT AGT AAC CCC TTG CTT AAA ATC ACG GAT TTA 483
Leu Tyr Leu Glu Thr Pro Ser Asn Pro Leu Leu Lys Ile Thr Asp Leu
140 145 150

GCG CAA TGC GCT AGT GTC GCT AAA GAT CAT GGT TTG CTC ACT ATC GTG 531
Ala Gln Cys Ala Ser Val Ala Lys Asp His Gly Leu Leu Thr Ile Val
155 160 165

GAT AAC ACC TTT GCC ACC CCC TAT TAT CAA AAC CCG CTT CTT TTG GGA 579
Asp Asn Thr Phe Ala Thr Pro Tyr Tyr Gln Asn Pro Leu Leu Leu Gly
170 175 180 185

GCG GAC ATT GTG GCA CAT AGC GGC ACC AAA TAC TTA GGC GGG CAT AGC 627
Ala Asp Ile Val Ala His Ser Gly Thr Lys Tyr Leu Gly Gly His Ser
190 195 200

GAT GTG GTC GCC GGG CTT GTA ACC ACT AAT AAT GAA GCG CTA GCC CAA 675
Asp Val Val Ala Gly Leu Val Thr Thr Asn Asn Glu Ala Leu Ala Gln
205 210 215

GAG ATC GCT TTT TTC CAA AAC GCT ATC GGT GGG GTT TTA GGC CCT CAA 723
Glu Ile Ala Phe Phe Gln Asn Ala Ile Gly Gly Val Leu Gly Pro Gln
220 225 230

GAC AGC TGG CTG TTG CAA AGA GGG ATT AAA ACG CTG GGA TTG CGC ATG 771
Asp Ser Trp Leu Leu Gln Arg Gly Ile Lys Thr Leu Gly Leu Arg Met
235 240 245

GAA GCC CAT CAA AAA AAC GCT CTT TGT GTG GCT GAG TTT TTA GAA AAA 819
Glu Ala His Gln Lys Asn Ala Leu Cys Val Ala Glu Phe Leu Glu Lys
250 255 260 265

CAC CCT AAA GTG GAA AGG GTT TAT TAC CCG GGC CTT CCC ACT CAC CCT 867
His Pro Lys Val Glu Arg Val Tyr Tyr Pro Gly Leu Pro Thr His Pro
270 275 280

AAT TAC GAA CTA GCT AAA AAA CAG ATG CGT GGC TTT AGC GGG ATG CTC 915
Asn Tyr Glu Leu Ala Lys Lys Gln Met Arg Gly Phe Ser Gly Met Leu
285 290 295

TCT TTC ACT CTC AAA AAT GAT AGC GAG GCG GTT GCT TTT GTA GAA AGC 963
Ser Phe Thr Leu Lys Asn Asp Ser Glu Ala Val Ala Phe Val Glu Ser
300 305 310
65 70 75 80

Ile Leu Ala Leu Lys Ser Leu Ser Pro Arg Leu Lys Ala Leu Ile Leu
85 90 95

Arg Thr Ser Ser Ser Met Arg Phe Thr Ile Leu Asn Leu Ala Tyr Met
100 105 110

Ser Val Lys Thr Thr Ser Ala Ile Val Ala Val Ser Ile Gln Arg Asn
115 120 125

Asn Phe Val Val Phe Ser Ser Lys Thr Leu Phe Thr Pro Ser Leu Ile
130 135 140

Ala Leu Phe Ser Val Leu Met Ser Val
145 150



(2) INFORMATION FOR SEQ ID NO: 1199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25...1164
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199:



AAATTCAATA AAAAAGGAAA AACC ATG CGC ATG CAA ACC AAA TTA ATC CAT 51
Met Arg Met Gln Thr Lys Leu Ile His
1 5

GGG GGC ATT AGT GAG GAC GCA ACA ACG GGG GCG GTG AGC GTG CCT ATT 99
Gly Gly Ile Ser Glu Asp Ala Thr Thr Gly Ala Val Ser Val Pro Ile
10 15 20 25

TAT CAA ACT TCC ACC TAC CGC CAA GAC GCC ATA GGC CGC CAT AAG GGC 147
Tyr Gln Thr Ser Thr Tyr Arg Gln Asp Ala Ile Gly Arg His Lys Gly
30 35 40

TAT GAA TAC TCT CGC TCA GGC AAC CCC ACG CGC TTT GCT TTA GAA GAA 195
Tyr Glu Tyr Ser Arg Ser Gly Asn Pro Thr Arg Phe Ala Leu Glu Glu
45 50 55

CTC ATC GCT GAT TTA GAA GGG GGG GTT AAG GGG TTT GCT TTT GCC TCT 243
Leu Ile Ala Asp Leu Glu Gly Gly Val Lys Gly Phe Ala Phe Ala Ser
60 65 70

GGA TTA GCT GGA ATC CAC GCC GTT TTT TCC CTC TTG CAA TCA GGC GAT 291
Gly Leu Ala Gly Ile His Ala Val Phe Ser Leu Leu Gln Ser Gly Asp
75 80 85
TGG GTT TCT TGC GTG ATT TGG ACT AAT TTA TCC ATT GAA GTT TTA TTG 147
Trp Val Ser Cys Val Ile Trp Thr Asn Leu Ser Ile Glu Val Leu Leu
30 35 40

AGG GTG GAA ATC CCT TTA ATC TCT TCC ATA ATC AAG CGG GCG TTT TCC 195
Arg Val Glu Ile Pro Leu Ile Ser Ser Ile Ile Lys Arg Ala Phe Ser
45 50 55 60

ACA AAC AAA TTG ATC CCA CGG CCC ACT TGC GAG ATT TCA TCG TTG CGA 243
Thr Asn Lys Leu Ile Pro Arg Pro Thr Cys Glu Ile Ser Ser Leu Arg
65 70 75

TCA CCC ACA TCA ATT TTG GCT CTC AAA TCC TTA TCC CCA CGG CTA AAA 291
Ser Pro Thr Ser Ile Leu Ala Leu Lys Ser Leu Ser Pro Arg Leu Lys
80 85 90

GCG TTG ATT TTA AGG ACC AGT TCA TCA ATG CGT TTC ACG ATC CTT AAT 339
Ala Leu Ile Leu Arg Thr Ser Ser Ser Met Arg Phe Thr Ile Leu Asn
95 100 105

TTA GCG TAT ATG AGC GTC AAA ACC ACT AGC GCT ATC GTC GCT GTC AGT 387
Leu Ala Tyr Met Ser Val Lys Thr Thr Ser Ala Ile Val Ala Val Ser
110 115 120

ATC CAA AGG AAT AAT TTC GTG GTG TTT TCA TCA AAA ACC TTA TTC ACG 435
Ile Gln Arg Asn Asn Phe Val Val Phe Ser Ser Lys Thr Leu Phe Thr
125 130 135 140

CCT TCT TTG ATC GCT TTA TTT TCT GTG TTA ATG TCA GTG TAATAGGAAG TC 486
Pro Ser Leu Ile Ala Leu Phe Ser Val Leu Met Ser Val
145 150

GTTGCGATCA CCATTTGAGA AACTTCATCA TAATGCGAG 525

(2) INFORMATION FOR SEQ ID NO: 1198: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 153 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198: 

Met Glu Ala Phe Ile Met Leu Ala Ile Ser Val Ala Leu Phe Phe Thr
1 5 10 15

Glu Phe Arg Val Val Glu Glu Ser Phe Met Leu Phe Trp Val Ser Cys
20 25 30

Val Ile Trp Thr Asn Leu Ser Ile Glu Val Leu Leu Arg Val Glu Ile
35 40 45

Pro Leu Ile Ser Ser Ile Ile Lys Arg Ala Phe Ser Thr Asn Lys Leu
50 55 60

Ile Pro Arg Pro Thr Cys Glu Ile Ser Ser Leu Arg Ser Pro Thr Ser
355 360 365

Ser Gln Ile Thr Glu Ser Ala His Thr Glu Glu Glu Leu Ser Ser Lys
370 375 380

Val Glu Gln Leu Ser Arg Asn Ala Asp Asp Val Lys Ser Ile Leu Asp
385 390 395 400

Ile Ile Asn Asp Ile Ala Asp Gln Thr Asn Leu Leu Ala Leu Asn Ala
405 410 415

Ala Ile Glu Ala Ala Arg Ala Gly Glu His Gly Arg Gly Phe Ala Val
420 425 430

Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gln Lys Ser Leu
435 440 445

Ala Glu Ile Asn Ser Thr Ile Met Val Ile Val Gln Glu Ile Asn Ala
450 455 460

Val Ser Ser Gln Met Asn Leu Asn Ser Gln Lys Met Glu Arg Leu Ser
465 470 475 480

Asp Met Ser Lys Ser Val Gln Glu Thr Tyr Glu Lys Met Ser Ser Asn
485 490 495

Leu Ser Ser Val Val Ser Asp Ser Asn Gln Ser Met Asp Asp Tyr Ala
500 505 510

Lys Ser Gly His Gln Ile Glu Val Met Val Ser Asp Phe Ala Glu Val
515 520 525

Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp Ile Leu Asn
530 535 540

Ile Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Asp Lys Gln Val
545 550 555 560

Asn Leu Phe Lys Thr
565

(2) INFORMATION FOR SEQ ID NO: 1197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...474
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197: 

CCTTGAGATT GCTCT ATG GAA GCA TTC ATC ATG CTC GCT ATA TCA GTG GCT 51
Met Glu Ala Phe Ile Met Leu Ala Ile Ser Val Ala
1 5 10

TTA TTT TTC ACG GAA TTT AGG GTG GTT GAG GAA TCT TTC ATG CTC TTT 99
Leu Phe Phe Thr Glu Phe Arg Val Val Glu Glu Ser Phe Met Leu Phe
15 20 25
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 565 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196: 



Met Met Phe Ser Ser Met Phe Ala Ser Leu Gly Thr Arg Ile Met Leu
1 5 10 15

Val Val Leu Ala Ala Leu Leu Gly Leu Gly Gly Leu Phe Ile Gly Phe
20 25 30

Val Lys Val Met Gln Lys Asp Val Leu Ala Gln Leu Met Glu His Leu
35 40 45

Glu Thr Gly Gln Tyr Lys Lys Arg Glu Lys Thr Leu Ala Tyr Met Thr
50 55 60

Lys Ile Ile Glu Gln Gly Ile His Glu Tyr Tyr Lys Asn Phe Asp Asn
65 70 75 80

Ala Thr Ala Arg Lys Met Ala Leu Asp Tyr Phe Lys Arg Ile Asn Asp
85 90 95

Asp Lys Gly Met Ile Tyr Met Val Val Val Asp Lys Asn Gly Val Val
100 105 110

Leu Phe Asp Pro Val Asn Pro Lys Thr Val Xaa Gln Ser Gly Leu Asp
115 120 125

Ala Gln Ser Val Asp Gly Val Tyr Tyr Val Arg Gly Tyr Leu Glu Ala
130 135 140

Ala Lys Lys Gly Gly Gly Tyr Thr Tyr Tyr Lys Met Pro Lys Tyr Asp
145 150 155 160

Gly Gly Val Pro Glu Lys Lys Phe Ala Tyr Ser His Tyr Asp Glu Val
165 170 175

Ser Gln Met Val Ile Ala Thr Thr Ser Tyr Tyr Thr Asp Ile Asn Thr
180 185 190

Glu Asn Lys Ala Ile Lys Glu Gly Val Asn Lys Val Phe Asp Glu Asn
195 200 205

Thr Thr Lys Leu Phe Leu Trp Ile Leu Thr Ala Thr Ile Ala Leu Val
210 215 220

Val Leu Thr Leu Ile Tyr Ala Lys Leu Arg Ile Val Lys Arg Ile Asp
225 230 235 240

Glu Leu Val Leu Lys Ile Asn Ala Phe Ser Arg Gly Asp Lys Asp Leu
245 250 255

Arg Ala Lys Ile Asp Val Gly Asp Arg Asn Asp Glu Ile Ser Gln Val
260 265 270

Gly Arg Gly Ile Asn Leu Phe Val Glu Asn Ala Arg Leu Ile Met Glu
275 280 285

Glu Ile Lys Gly Ile Ser Thr Leu Asn Lys Thr Ser Met Asp Lys Leu
290 295 300

Val Gln Ile Thr Gln Glu Thr Gln Lys Ser Met Lys Asp Ser Ser Thr
305 310 315 320

Thr Leu Asn Ser Val Lys Asn Lys Ala Thr Asp Ile Ala Ser Met Met
325 330 335

Asn Ala Ser Ile Glu Gln Ser Gln Gly Leu Arg Lys Arg Leu Ile Glu
340 345 350

Thr Gln Gly Leu Val Lys Glu Ser Lys Asp Ala Ile Gly Asp Leu Phe
TTA TTT TCT CAA ATC ACA GAG AGC GCG CAC ACT GAA GAG GAA CTC TCT 1153
Leu Phe Ser Gln Ile Thr Glu Ser Ala His Thr Glu Glu Glu Leu Ser
370 375 380

AGC AAA GTG GAG CAG CTA AGC CGT AAC GCT GAT GAT GTC AAA TCC ATT 1201
Ser Lys Val Glu Gln Leu Ser Arg Asn Ala Asp Asp Val Lys Ser Ile
385 390 395

CTG GAT ATT ATC AAT GAT ATT GCC GAT CAA ACG AAT TTA TTA GCC CTA 1249
Leu Asp Ile Ile Asn Asp Ile Ala Asp Gln Thr Asn Leu Leu Ala Leu
400 405 410

AAC GCT GCT ATT GAA GCC GCA AGG GCT GGC GAG CAT GGC AGA GGC TTT 1297
Asn Ala Ala Ile Glu Ala Ala Arg Ala Gly Glu His Gly Arg Gly Phe
415 420 425 430

GCG GTG GTG GCT GAT GAA GTT AGG AAT TTA GCC GGG CGC ACT CAA AAG 1345
Ala Val Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gln Lys
435 440 445

TCT TTA GCC GAA ATC AAT TCC ACT ATC ATG GTG ATT GTC CAA GAA ATC 1393
Ser Leu Ala Glu Ile Asn Ser Thr Ile Met Val Ile Val Gln Glu Ile
450 455 460

AAT GCC GTG AGT TCG CAA ATG AAT CTC AAT TCG CAA AAA ATG GAG CGT 1441
Asn Ala Val Ser Ser Gln Met Asn Leu Asn Ser Gln Lys Met Glu Arg
465 470 475

TTG AGC GAT ATG AGT AAA AGC GTG CAA GAA ACT TAC GAA AAA ATG AGT 1489
Leu Ser Asp Met Ser Lys Ser Val Gln Glu Thr Tyr Glu Lys Met Ser
480 485 490

TCT AAT TTA AGC TCA GTC GTG TCA GAC AGC AAT CAA AGC ATG GAC GAT 1537
Ser Asn Leu Ser Ser Val Val Ser Asp Ser Asn Gln Ser Met Asp Asp
495 500 505 510

TAC GCC AAA TCC GGA CAC CAA ATT GAA GTT ATG GTA AGC GAT TTT GCA 1585
Tyr Ala Lys Ser Gly His Gln Ile Glu Val Met Val Ser Asp Phe Ala
515 520 525

GAG GTG GAA AAA GTG GCT TCT AAG ACT TTA GCG GAT TCT TCA GAT ATT 1633
Glu Val Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp Ile
530 535 540

TTA AAC ATC GCT ACG CAT GTG AGT GGA ACG ACC ATG AAT TTA GAC AAA 1681
Leu Asn Ile Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Asp Lys
545 550 555

CAA GTG AAT TTG TTT AAA ACT TAATCAGGGG GAGTTTATTA AAAAAGGGTT GGAT 1736
Gln Val Asn Leu Phe Lys Thr
560 565

TGTTAAAAGT TTCTGTGATC AC 1758

(2) INFORMATION FOR SEQ ID NO: 1196: 
GAG GCG GCC AAA AAA GGG GGA GGC TAC ACT TAT TAT AAA ATG CCT AAA 481
Glu Ala Ala Lys Lys Gly Gly Gly Tyr Thr Tyr Tyr Lys Met Pro Lys
145 150 155

TAC GAT GGA GGC GTA CCG GAG AAA AAA TTC GCC TAC TCG CAT TAT GAT 529
Tyr Asp Gly Gly Val Pro Glu Lys Lys Phe Ala Tyr Ser His Tyr Asp
160 165 170

GAA GTT TCT CAA ATG GTG ATC GCA ACG ACT TCC TAT TAC ACT GAC ATT 577
Glu Val Ser Gln Met Val Ile Ala Thr Thr Ser Tyr Tyr Thr Asp Ile
175 180 185 190

AAC ACA GAA AAT AAA GCG ATC AAA GAA GGC GTG AAT AAG GTT TTT GAT 625
Asn Thr Glu Asn Lys Ala Ile Lys Glu Gly Val Asn Lys Val Phe Asp
195 200 205

GAA AAC ACC ACG AAA TTA TTC CTT TGG ATA CTG ACA GCG ACG ATA GCG 673
Glu Asn Thr Thr Lys Leu Phe Leu Trp Ile Leu Thr Ala Thr Ile Ala
210 215 220

CTA GTG GTT TTG ACG CTC ATA TAC GCT AAA TTA AGG ATC GTG AAA CGC 721
Leu Val Val Leu Thr Leu Ile Tyr Ala Lys Leu Arg Ile Val Lys Arg
225 230 235

ATT GAT GAA CTG GTC CTT AAA ATC AAC GCT TTT AGC CGT GGG GAT AAG 769
Ile Asp Glu Leu Val Leu Lys Ile Asn Ala Phe Ser Arg Gly Asp Lys
240 245 250

GAT TTG AGA GCC AAA ATT GAT GTG GGT GAT CGC AAC GAT GAA ATC TCG 817
Asp Leu Arg Ala Lys Ile Asp Val Gly Asp Arg Asn Asp Glu Ile Ser
255 260 265 270

CAA GTG GGC CGT GGG ATC AAT TTG TTT GTG GAA AAC GCC CGC TTG ATT 865
Gln Val Gly Arg Gly Ile Asn Leu Phe Val Glu Asn Ala Arg Leu Ile
275 280 285

ATG GAA GAG ATT AAA GGG ATT TCC ACC CTC AAT AAA ACT TCA ATG GAT 913
Met Glu Glu Ile Lys Gly Ile Ser Thr Leu Asn Lys Thr Ser Met Asp
290 295 300

AAA TTA GTC CAA ATC ACG CAA GAA ACC CAA AAG AGC ATG AAA GAT TCC 961
Lys Leu Val Gln Ile Thr Gln Glu Thr Gln Lys Ser Met Lys Asp Ser
305 310 315

TCA ACC ACC CTA AAT TCC GTG AAA AAT AAA GCC ACT GAT ATA GCG AGC 1009
Ser Thr Thr Leu Asn Ser Val Lys Asn Lys Ala Thr Asp Ile Ala Ser
320 325 330

ATG ATG AAT GCT TCC ATA GAG CAA TCT CAA GGG TTA AGG AAG CGT TTG 1057
Met Met Asn Ala Ser Ile Glu Gln Ser Gln Gly Leu Arg Lys Arg Leu
335 340 345 350

ATT GAA ACG CAA GGG CTG GTC AAA GAG AGC AAG GAT GCG ATC GGG GAT 1105
Ile Glu Thr Gln Gly Leu Val Lys Glu Ser Lys Asp Ala Ile Gly Asp
355 360 365
(2) INFORMATION FOR SEQ ID NO: 1195:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1758 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 8...1702
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1195:

GAGATAA ATG ATG TTT TCT TCA ATG TTT GCT TCG TTG GGG ACT CGT ATC 49
Met Met Phe Ser Ser Met Phe Ala Ser Leu Gly Thr Arg Ile
1 5 10

ATG CTG GTC GTG TTA GCC GCT CTT TTA GGT TTA GGG GGG CTT TTT ATT 97
Met Leu Val Val Leu Ala Ala Leu Leu Gly Leu Gly Gly Leu Phe Ile
15 20 25 30

GGT TTT GTA AAG GTT ATG CAA AAA GAT GTG TTA GCG CAA CTC ATG GAG 145
Gly Phe Val Lys Val Met Gln Lys Asp Val Leu Ala Gln Leu Met Glu
35 40 45

CAT TTA GAA ACC GGG CAA TAC AAA AAG CGT GAA AAA ACG CTC GCT TAC 193
His Leu Glu Thr Gly Gln Tyr Lys Lys Arg Glu Lys Thr Leu Ala Tyr
50 55 60

ATG ACA AAA ATT ATT GAA CAG GGC ATT CAT GAG TAT TAC AAA AAT TTT 241
Met Thr Lys Ile Ile Glu Gln Gly Ile His Glu Tyr Tyr Lys Asn Phe
65 70 75

GAC AAT GCT ACT GCA AGA AAA ATG GCG TTA GAT TAT TTC AAA CGC ATC 289
Asp Asn Ala Thr Ala Arg Lys Met Ala Leu Asp Tyr Phe Lys Arg Ile
80 85 90

AAC GAC GAT AAG GGC ATG ATT TAT ATG GTG GTG GTG GAT AAA AAC GGG 337
Asn Asp Asp Lys Gly Met Ile Tyr Met Val Val Val Asp Lys Asn Gly
95 100 105 110

GTG GTA TTG TTT GAT CCG GTC AAT CCT AAA ACC GTA GNC CAA TCA GGG 385
Val Val Leu Phe Asp Pro Val Asn Pro Lys Thr Val Xaa Gln Ser Gly
115 120 125

CTT GAC GCT CAG AGC GTT GAT GGG GTG TAT TAT GTT AGG GGG TAT TTG 433
Leu Asp Ala Gln Ser Val Asp Gly Val Tyr Tyr Val Arg Gly Tyr Leu
130 135 140
195 200 205

Asp Pro Phe Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe Lys
210 215 220

Met Leu Gly Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met Ile Ala
225 230 235 240

Leu Ile Ile Glu Lys Asp Lys Val Tyr Glu Gln Val Gly Ser Val Arg
245 250 255

Phe Val Val Ile Ile Ala Ser Ala Ile Met Val Leu Ala Leu Ile Ile
260 265 270

Ala Ile Thr Leu Leu Met Arg Ala Ile Val Ser Ser Arg Leu Glu Ala
275 280 285

Val Ser Ser Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gln Ala
290 295 300

Asn Ser Ser Gly Ile Lys Leu Ile Glu Ala Lys Ser Asn Asp Glu Leu
305 310 315 320

Gly Arg Met Gln Thr Ala Ile Asn Lys Asn Ile Leu Gln Thr Gln Lys
325 330 335

Ile Met Gln Glu Asp Arg Gln Ala Val Gln Asp Thr Ile Lys Val Val
340 345 350

Ser Asp Val Lys Ala Gly Asn Phe Ala Val Arg Ile Thr Ala Glu Pro
355 360 365

Ala Ser Pro Asp Leu Lys Glu Leu Arg Asp Ala Leu Asn Gly Ile Met
370 375 380

Asp Tyr Leu Gln Glu Ser Val Gly Thr His Met Pro Ser Ile Phe Lys
385 390 395 400

Ile Phe Glu Ser Tyr Ser Gly Leu Asp Phe Arg Gly Arg Ile Gln Asn
405 410 415

Ala Ser Gly Arg Val Glu Leu Val Thr Asn Ala Leu Gly Gln Glu Ile
420 425 430

Gln Lys Met Leu Glu Thr Ser Ser Asn Phe Ala Lys Asp Leu Ala Asn
435 440 445

Asp Ser Ala Asn Leu Lys Glu Cys Val Gln Asn Leu Glu Lys Ala Ser
450 455 460

Asn Ser Gln His Lys Ser Leu Met Glu Thr Ser Lys Thr Ile Glu Asn
465 470 475 480

Ile Thr Thr Ser Ile Gln Gly Val Ser Ser Gln Ser Glu Ala Met Ile
485 490 495

Glu Gln Gly Gln Asp Ile Lys Ser Ile Val Glu Ile Ile Arg Asp Ile
500 505 510

Ala Asp Gln Thr Asn Leu Leu Ala Leu Asn Ala Ala Ile Glu Ala Ala
515 520 525

Arg Ala Gly Glu His Gly Arg Gly Phe Ala Val Val Ala Asp Glu Val
530 535 540

Arg Lys Leu Ala Glu Arg Thr Gln Lys Ser Leu Ser Glu Ile Glu Ala
545 550 555 560

Asn Ile Asn Ile Leu Val Gln Ser Ile Ser Asp Thr Ser Glu Ser Ile
565 570 575

Lys Asn Gln Val Lys Glu Val Glu Glu Ile Asn Ala Ser Ile Glu Ala
580 585 590

Leu Arg Ser Val Thr Glu Gly Asn Leu Lys Ile Ala Ser Asp Ser Leu
595 600 605

Glu Ile Ser Gln Glu Ile Asp Lys Val Ser Asn Asp Ile Leu Glu Asp
610 615 620

Val Asn Lys Lys Gln Phe
625 630
AAC CAG GTT AAA GAA GTG GAA GAA ATC AAC GCT TCT ATT GAA GCC TTA 1834
Asn Gln Val Lys Glu Val Glu Glu Ile Asn Ala Ser Ile Glu Ala Leu
580 585 590

AGA TCG GTT ACT GAG GGC AAT CTA AAA ATC GCT AGC GAT TCT TTA GAA 1882
Arg Ser Val Thr Glu Gly Asn Leu Lys Ile Ala Ser Asp Ser Leu Glu
595 600 605

ATC AGT CAA GAA ATT GAC AAA GTT TCT AAC GAT ATT TTA GAA GAT GTG 1930
Ile Ser Gln Glu Ile Asp Lys Val Ser Asn Asp Ile Leu Glu Asp Val
610 615 620 625

AAT AAA AAG CAG TTT TAATGCTCAT TCATATTTGC TGCTCAGTGG ATAACCTCTA T 1986
Asn Lys Lys Gln Phe
630

(2) INFORMATION FOR SEQ ID NO: 1194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 630 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194:

Met Gln Asp Ser Leu His Phe Lys Val Asn Glu Val Gln Gly Val Leu
1 5 10 15

Glu Asn Thr Tyr Thr Ser Met Gly Ile Val Lys Glu Met Leu Pro Lys
20 25 30

Asp Thr Lys Arg Glu Ile Lys Ile Gly Leu Leu Lys Asn Phe Ile Leu
35 40 45

Ala Asn Ser His Val Ala Gly Val Ser Met Phe Phe Lys Gly Arg Glu
50 55 60

Asp Leu Arg Leu Thr Leu Leu Arg Asp Asn Asn Thr Ile Lys Leu Val
65 70 75 80

Glu Asn Pro Ser Leu Glu Asn Ser Pro Leu Ala Gln Lys Ala Met Lys
85 90 95

Asn Lys Glu Ile Ser Lys Ser Leu Gly Tyr Tyr Arg Lys Met Pro Asn
100 105 110

Gly Ala Glu Val Tyr Gly Val Asp Ile Leu Leu Pro Leu Leu Asn Glu
115 120 125

Asn Ala Gln Glu Val Val Gly Ala Leu Met Ile Phe Ile Ser Ile Asp
130 135 140

Ser Phe Ser Asn Glu Ile Thr Lys Asn Arg Ser Asp Leu Phe Leu Ile
145 150 155 160

Gly Thr Lys Gly Lys Val Leu Leu Ser Ala Asn Lys Ser Leu Gln Asp
165 170 175

Lys Pro Ile Ala Glu Ile Tyr Lys Ser Val Pro Lys Ala Thr Asn Glu
180 185 190

Val Met Ala Ile Leu Glu Asn Gly Ser Lys Ala Thr Leu Glu Tyr Leu
GAT GTG AAA GCA GGG AAT TTT GCG GTG CGC ATC ACG GCT GAG CCC GCA 1162
Asp Val Lys Ala Gly Asn Phe Ala Val Arg Ile Thr Ala Glu Pro Ala
355 360 365

AGC CCT GAT TTG AAA GAA TTG AGG GAC GCG CTA AAT GGG ATC ATG GAT 1210
Ser Pro Asp Leu Lys Glu Leu Arg Asp Ala Leu Asn Gly Ile Met Asp
370 375 380 385

TAT TTG CAA GAA AGC GTA GGG ACT CAC ATG CCA AGC ATT TTC AAA ATC 1258
Tyr Leu Gln Glu Ser Val Gly Thr His Met Pro Ser Ile Phe Lys Ile
390 395 400

TTT GAA AGC TAT TCT GGT TTG GAT TTT AGA GGC CGG ATC CAA AAC GCT 1306
Phe Glu Ser Tyr Ser Gly Leu Asp Phe Arg Gly Arg Ile Gln Asn Ala
405 410 415

TCG GGT AGG GTG GAA CTG GTT ACT AAC GCT TTA GGG CAA GAA ATC CAA 1354
Ser Gly Arg Val Glu Leu Val Thr Asn Ala Leu Gly Gln Glu Ile Gln
420 425 430

AAA ATG CTA GAA ACT TCG TCT AAT TTT GCC AAA GAT TTA GCG AAC GAT 1402
Lys Met Leu Glu Thr Ser Ser Asn Phe Ala Lys Asp Leu Ala Asn Asp
435 440 445

AGC GCG AAT TTA AAA GAG TGC GTG CAA AAT TTA GAA AAA GCT TCA AAC 1450
Ser Ala Asn Leu Lys Glu Cys Val Gln Asn Leu Glu Lys Ala Ser Asn
450 455 460 465

TCC CAA CAC AAA AGC TTG ATG GAA ACT TCC AAA ACG ATA GAA AAT ATC 1498
Ser Gln His Lys Ser Leu Met Glu Thr Ser Lys Thr Ile Glu Asn Ile
470 475 480

ACC ACT TCC ATT CAA GGC GTG AGC TCT CAA AGT GAA GCC ATG ATT GAA 1546
Thr Thr Ser Ile Gln Gly Val Ser Ser Gln Ser Glu Ala Met Ile Glu
485 490 495

CAA GGG CAA GAC ATT AAA AGC ATT GTA GAA ATC ATT AGA GAT ATT GCT 1594
Gln Gly Gln Asp Ile Lys Ser Ile Val Glu Ile Ile Arg Asp Ile Ala
500 505 510

GAT CAA ACC AAT CTT TTA GCC TTA AAC GCC GCT ATT GAA GCC GCA AGG 1642
Asp Gln Thr Asn Leu Leu Ala Leu Asn Ala Ala Ile Glu Ala Ala Arg
515 520 525

GCC GGC GAG CAT GGC AGA GGC TTT GCG GTG GTG GCT GAT GAG GTA AGA 1690
Ala Gly Glu His Gly Arg Gly Phe Ala Val Val Ala Asp Glu Val Arg
530 535 540 545

AAG CTC GCT GAA AGG ACG CAA AAA TCG CTC AGC GAG ATT GAA GCC AAT 1738
Lys Leu Ala Glu Arg Thr Gln Lys Ser Leu Ser Glu Ile Glu Ala Asn
550 555 560

ATC AAT ATT TTA GTG CAA AGC ATT TCA GAC ACG AGC GAA AGC ATT AAA 1786
Ile Asn Ile Leu Val Gln Ser Ile Ser Asp Thr Ser Glu Ser Ile Lys
565 570 575
GCT CAA GAG GTT GTA GGG GCT TTG ATG ATT TTT ATT TCC ATT GAC AGC 490
Ala Gln Glu Val Val Gly Ala Leu Met Ile Phe Ile Ser Ile Asp Ser
130 135 140 145

TTC AGC AAT GAA ATC ACT AAA AAC AGG AGC GAT TTA TTT TTA ATT GGC 538
Phe Ser Asn Glu Ile Thr Lys Asn Arg Ser Asp Leu Phe Leu Ile Gly
150 155 160

ACT AAA GGT AAA GTG CTT TTG AGC GCG AAT AAG AGT TTG CAA GAC AAA 586
Thr Lys Gly Lys Val Leu Leu Ser Ala Asn Lys Ser Leu Gln Asp Lys
165 170 175

CCT ATC GCA GAA ATT TAT AAG AGC GTG CCT AAA GCC ACC AAC GAA GTG 634
Pro Ile Ala Glu Ile Tyr Lys Ser Val Pro Lys Ala Thr Asn Glu Val
180 185 190

ATG GCT ATT TTA GAA AAC GGC TCT AAA GCG ACT TTA GAA TAC TTA GAT 682
Met Ala Ile Leu Glu Asn Gly Ser Lys Ala Thr Leu Glu Tyr Leu Asp
195 200 205

CCC TTT AGC CAT AAG GAA AAT TTT TTA GCC GTT GAA ACC TTT AAA ATG 730
Pro Phe Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe Lys Met
210 215 220 225

CTA GGC AAA ACA GAA AGT AAA GAC AAT CTT AAT TGG ATG ATC GCT TTA 778
Leu Gly Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met Ile Ala Leu
230 235 240

ATC ATT GAA AAA GAC AAG GTC TAT GAG CAA GTA GGC TCG GTG CGT TTT 826
Ile Ile Glu Lys Asp Lys Val Tyr Glu Gln Val Gly Ser Val Arg Phe
245 250 255

GTG GTG ATC ATA GCG AGC GCA ATC ATG GTG TTA GCC TTG ATT ATA GCG 874
Val Val Ile Ile Ala Ser Ala Ile Met Val Leu Ala Leu Ile Ile Ala
260 265 270

ATC ACT CTC TTA ATG CGA GCG ATC GTG AGC AGT CGT TTG GAA GCC GTT 922
Ile Thr Leu Leu Met Arg Ala Ile Val Ser Ser Arg Leu Glu Ala Val
275 280 285

TCT AGC ACC TTG TCT CAT TTC TTT AAA TTA TTG AAC AAT CAA GCC AAT 970
Ser Ser Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gln Ala Asn
290 295 300 305

TCT AGC GGT ATT AAA TTG ATT GAA GCG AAA TCC AAT GAC GAG TTA GGC 1018
Ser Ser Gly Ile Lys Leu Ile Glu Ala Lys Ser Asn Asp Glu Leu Gly
310 315 320

CGC ATG CAA ACA GCG ATC AAT AAA AAT ATC TTG CAA ACC CAA AAA ATC 1066
Arg Met Gln Thr Ala Ile Asn Lys Asn Ile Leu Gln Thr Gln Lys Ile
325 330 335

ATG CAA GAA GAC AGG CAA GCC GTC CAA GAC ACC ATT AAA GTG GTT TCA 1114
Met Gln Glu Asp Arg Gln Ala Val Gln Asp Thr Ile Lys Val Val Ser
340 345 350
(2) INFORMATION FOR SEQ ID NO: 1193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1986 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 56...1945
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193:

GGTGTCCTTA AACAGCAGGG TGAAAGAGAT TTTAAAAGAA AGCGCTCTGC ATTCT ATG 58
Met
1

CAA GAT AGT TTG CAT TTT AAG GTT AAT GAA GTG CAA GGG GTT TTA GAA 106
Gln Asp Ser Leu His Phe Lys Val Asn Glu Val Gln Gly Val Leu Glu
5 10 15

AAC ACT TAT ACG AGC ATG GGC ATT GTT AAA GAA ATG CTC CCT AAA GAC 154
Asn Thr Tyr Thr Ser Met Gly Ile Val Lys Glu Met Leu Pro Lys Asp
20 25 30

ACC AAA AGA GAA ATC AAA ATC GGC TTG TTA AAA AAC TTC ATT TTA GCC 202
Thr Lys Arg Glu Ile Lys Ile Gly Leu Leu Lys Asn Phe Ile Leu Ala
35 40 45

AAT TCG CAT GTC GCT GGG GTG AGC ATG TTT TTT AAA GGC AGA GAA GAT 250
Asn Ser His Val Ala Gly Val Ser Met Phe Phe Lys Gly Arg Glu Asp
50 55 60 65

TTA AGA TTA ACG CTT TTA AGG GAT AAC AAT ACG ATT AAG CTA GTG GAA 298
Leu Arg Leu Thr Leu Leu Arg Asp Asn Asn Thr Ile Lys Leu Val Glu
70 75 80

AAT CCG TCA TTA GAG AAT AGC CCT TTA GCG CAA AAA GCG ATG AAA AAT 346
Asn Pro Ser Leu Glu Asn Ser Pro Leu Ala Gln Lys Ala Met Lys Asn
85 90 95

AAA GAA ATT TCT AAA AGT TTG GGT TAT TAT AGG AAA ATG CCT AAT GGG 394
Lys Glu Ile Ser Lys Ser Leu Gly Tyr Tyr Arg Lys Met Pro Asn Gly
100 105 110

GCG GAA GTT TAT GGG GTG GAT ATT CTT TTA CCT TTA TTG AAT GAG AAC 442
Ala Glu Val Tyr Gly Val Asp Ile Leu Leu Pro Leu Leu Asn Glu Asn
115 120 125
Lys Phe Gly Val Pro Cys Cys Val Leu Arg Ser Ile Ser Asp Asn Ala
190 195 200

GAT GAG GAA GCT AAC ATG AGC TTT GAT GCG TTT TTA GAA AAA AGC GCT 675
Asp Glu Glu Ala Asn Met Ser Phe Asp Ala Phe Leu Glu Lys Ser Ala
205 210 215

CAA ACT TCA GCG AAA TTC TTA AAA AGC ATG GTG GAT GAG CTT TAGGGTTTG 726
Gln Thr Ser Ala Lys Phe Leu Lys Ser Met Val Asp Glu Leu
220 225 230

TTTTTATAGA GGGGTGGAA 745

(2) INFORMATION FOR SEQ ID NO: 1192:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 231 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192:

Met Val Gln Lys Ile Gly Ile Leu Gly Ala Met Arg Glu Glu Ile Thr
1 5 10 15

Pro Ile Leu Glu Leu Phe Gly Val Asp Phe Glu Glu Ile Pro Leu Gly
20 25 30

Gly Asn Val Phe His Lys Gly Val Tyr His Asn Lys Glu Ile Ile Val
35 40 45

Ala Tyr Ser Lys Ile Gly Lys Val His Ser Thr Leu Thr Thr Thr Ser
50 55 60

Met Ile Leu Ala Phe Gly Val Gln Lys Val Leu Phe Ser Gly Val Ala
65 70 75 80

Gly Ser Leu Val Lys Asp Leu Lys Ile Asn Asp Leu Leu Val Ala Ile
85 90 95

Gln Leu Val Gln His Asp Val Asp Leu Ser Ala Phe Asp His Pro Leu
100 105 110

Gly Phe Ile Pro Glu Ser Ala Ile Phe Ile Glu Thr Ser Glu Ser Leu
115 120 125

Asn Ala Leu Ala Lys Glu Val Ala Asn Glu Gln His Ile Val Leu Lys
130 135 140

Glu Gly Val Ile Ala Ser Gly Asp Gln Phe Val His Ser Lys Glu Arg
145 150 155 160

Lys Glu Phe Leu Val Ser Glu Phe Lys Ala Ser Ala Val Glu Met Glu
165 170 175

Gly Ala Ser Val Ala Phe Val Cys Gln Lys Phe Gly Val Pro Cys Cys
180 185 190

Val Leu Arg Ser Ile Ser Asp Asn Ala Asp Glu Glu Ala Asn Met Ser
195 200 205

Phe Asp Ala Phe Leu Glu Lys Ser Ala Gln Thr Ser Ala Lys Phe Leu
210 215 220

Lys Ser Met Val Asp Glu Leu
225 230
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1191:

AAGAATACGT GTGATTGGGA GAAA ATG GTG CAA AAA ATT GGC ATT TTA GGG 51
Met Val Gln Lys Ile Gly Ile Leu Gly
1 5

GCG ATG AGA GAA GAA ATA ACC CCT ATA CTA GAA TTG TTT GGC GTG GAT 99
Ala Met Arg Glu Glu Ile Thr Pro Ile Leu Glu Leu Phe Gly Val Asp
10 15 20 25

TTT GAA GAG ATC CCT TTA GGG GGG AAT GTC TTC CAT AAA GGC GTT TAT 147
Phe Glu Glu Ile Pro Leu Gly Gly Asn Val Phe His Lys Gly Val Tyr
30 35 40

CAC AAC AAG GAA ATC ATT GTC GCT TAT AGC AAG ATT GGC AAG GTG CAT 195
His Asn Lys Glu Ile Ile Val Ala Tyr Ser Lys Ile Gly Lys Val His
45 50 55

TCC ACT TTA ACC ACA ACG AGC ATG ATT TTA GCG TTT GGC GTT CAA AAG 243
Ser Thr Leu Thr Thr Thr Ser Met Ile Leu Ala Phe Gly Val Gln Lys
60 65 70

GTG CTT TTT AGC GGG GTG GCT GGA AGC TTA GTT AAA GAT TTA AAA ATC 291
Val Leu Phe Ser Gly Val Ala Gly Ser Leu Val Lys Asp Leu Lys Ile
75 80 85

AAT GAT TTA CTA GTG GCT ATT CAA TTA GTC CAG CAT GAT GTG GAT TTG 339
Asn Asp Leu Leu Val Ala Ile Gln Leu Val Gln His Asp Val Asp Leu
90 95 100 105

AGC GCG TTT GAT CAC CCT TTA GGG TTC ATC CCA GAA AGC GCG ATT TTT 387
Ser Ala Phe Asp His Pro Leu Gly Phe Ile Pro Glu Ser Ala Ile Phe
110 115 120

ATT GAA ACG AGC GAA AGT TTG AAC GCT TTG GCT AAA GAA GTC GCT AAT 435
Ile Glu Thr Ser Glu Ser Leu Asn Ala Leu Ala Lys Glu Val Ala Asn
125 130 135

GAA CAG CAT ATC GTG CTC AAA GAA GGC GTC ATC GCA TCA GGC GAT CAG 483
Glu Gln His Ile Val Leu Lys Glu Gly Val Ile Ala Ser Gly Asp Gln
140 145 150

TTT GTG CAT AGC AAA GAA AGG AAA GAG TTT TTA GTT AGC GAG TTT AAA 531
Phe Val His Ser Lys Glu Arg Lys Glu Phe Leu Val Ser Glu Phe Lys
155 160 165

GCG AGC GCG GTG GAA ATG GAG GGG GCG AGC GTG GCG TTT GTG TGC CAA 579
Ala Ser Ala Val Glu Met Glu Gly Ala Ser Val Ala Phe Val Cys Gln
170 175 180 185

AAA TTT GGC GTG CCA TGC TGT GTG TTA AGG AGC ATT AGC GAT AAC GCT 627
355 360 365

Ile Ala Asn His Lys Lys Ile Leu Ala Asn Ile Thr Asn Met Thr Lys
370 375 380

Glu Asp Ile Ile Ala Gln Val Pro Glu Ala Thr Met Val Ser Val Tyr
385 390 395 400

Met Asp Leu Lys Lys Leu Phe Leu Thr Lys Glu Ala Ser Glu Glu Gly
405 410 415

Phe Asp Leu Ala Pro Asn Lys Leu Lys Glu Ile Leu Glu Gln Ile Lys
420 425 430

Arg Gly Lys Leu Ile Ser Asp Arg Ala Lys Asn Lys Met Ala Lys Ser
435 440 445

Asn Leu Arg Leu Val Val Ser Ile Ala Lys Arg Phe Thr Ser Arg Gly
450 455 460

Leu Pro Phe Leu Asp Leu Ile Gln Glu Gly Asn Ile Gly Leu Met Lys
465 470 475 480

Ala Val Asp Lys Phe Glu His Glu Lys Gly Phe Lys Phe Ser Thr Tyr
485 490 495

Ala Thr Trp Trp Ile Lys Gln Ala Ile Ser Arg Ala Ile Ala Asp Gln
500 505 510

Ala Arg Thr Ile Arg Ile Pro Ile His Met Ile Asp Thr Ile Asn Arg
515 520 525

Ile Asn Lys Val Met Arg Lys His Ile Gln Glu Asn Gly Lys Glu Pro
530 535 540

Asp Leu Glu Val Val Ala Glu Glu Val Gly Leu Ser Leu Asp Lys Val
545 550 555 560

Lys Asn Val Ile Lys Val Thr Lys Glu Pro Ile Ser Leu Glu Thr Pro
565 570 575

Val Gly Asn Asp Asp Asp Gly Lys Phe Gly Asp Phe Val Glu Asp Lys
580 585 590

Asn Ile Val Ser Ser Ile Asp His Ile Met Arg Glu Asp Leu Lys Ala
595 600 605

Gln Ile Glu Ser Val Leu Asp Gln Leu Asn Glu Arg Glu Lys Ala Val
610 615 620

Ile Arg Met Arg Phe Gly Leu Leu Asp Asp Glu Ser Asp Arg Thr Leu
625 630 635 640

Glu Glu Ile Gly Lys Glu Leu Asn Val Thr Arg Glu Arg Val Arg Gln
645 650 655

Ile Glu Ser Ser Ala Ile Lys Lys Leu Arg Ser Pro Gln Tyr Gly Arg
660 665 670

Ile Leu Arg Asn Tyr Leu Arg Ile
675 680

(2) INFORMATION FOR SEQ ID NO: 1191:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 745 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25...717
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 680 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190:



Met Gly Ser Tyr Phe Met Glu Cys Pro Met Lys Lys Lys Ala Asn Glu
1 5 10 15

Glu Lys Ala Gln Lys Arg Ala Lys Thr Glu Ala Lys Ala Glu Ala Thr
20 25 30

Gln Glu Asn Lys Thr Lys Glu Asn Asn Lys Ala Lys Glu Ser Lys Ile
35 40 45

Lys Glu Ser Lys Ile Lys Glu Ala Lys Ala Lys Glu Pro Ile Pro Val
50 55 60

Lys Lys Leu Ser Phe Asn Glu Ala Leu Glu Glu Leu Phe Ala Asn Ser
65 70 75 80

Leu Ser Asp Cys Val Ser Tyr Glu Ser Ile Ile Gln Ile Ser Ala Lys
85 90 95

Val Pro Thr Leu Ala Gln Ile Lys Lys Ile Lys Glu Leu Cys Gln Lys
100 105 110

Tyr Gln Lys Lys Leu Val Ser Ser Ser Glu Tyr Ala Lys Lys Leu Asn
115 120 125

Ala Ile Asp Lys Ile Lys Lys Thr Glu Glu Lys Gln Lys Val Leu Asp
130 135 140

Glu Glu Leu Glu Asp Gly Tyr Asp Phe Leu Lys Glu Lys Asp Phe Leu
145 150 155 160

Glu Trp Ser Arg Ser Asp Ser Pro Val Arg Met Tyr Leu Arg Glu Met
165 170 175

Gly Asp Ile Lys Leu Leu Ser Lys Asp Glu Glu Ile Glu Leu Ser Lys
180 185 190

Gln Ile Arg Leu Gly Glu Asp Ile Ile Leu Asp Ala Ile Cys Ser Val
195 200 205

Pro Tyr Leu Ile Asp Phe Ile Tyr Ala Tyr Lys Asp Ala Leu Ile Asn
210 215 220

Arg Glu Arg Arg Val Lys Glu Leu Phe Arg Ser Phe Asp Asp Asp Asp
225 230 235 240

Glu Asn Ser Val Ser Asp Ser Lys Lys Asp Glu Asp Asn Glu Glu Asp
245 250 255

Glu Glu Asn Glu Glu Arg Lys Lys Val Val Ser Glu Lys Asp Lys Lys
260 265 270

Arg Val Glu Lys Val Gln Glu Ser Phe Lys Ala Leu Asp Lys Ala Lys
275 280 285

Lys Glu Trp Leu Lys Ala Leu Glu Ala Pro Ile Asp Glu Arg Glu Asp
290 295 300

Glu Leu Val Arg Ser Leu Thr Leu Ala Tyr Lys Arg Gln Thr Leu Lys
305 310 315 320

Asp Arg Leu Tyr Asp Leu Glu Pro Thr Ser Lys Leu Ile Asn Glu Leu
325 330 335

Val Lys Thr Met Glu Thr Thr Leu Lys Ser Gly Asp Gly Phe Glu Lys
340 345 350

Glu Leu Lys Arg Leu Glu Tyr Lys Leu Pro Leu Phe Asn Asp Thr Leu
ATT GGC TTG ATG AAA GCG GTG GAT AAG TTT GAG CAT GAA AAG GGC TTC 1491
Ile Gly Leu Met Lys Ala Val Asp Lys Phe Glu His Glu Lys Gly Phe
480 485 490

AAG TTT TCT ACC TAT GCG ACC TGG TGG ATC AAA CAA GCT ATC AGC AGA 1539
Lys Phe Ser Thr Tyr Ala Thr Trp Trp Ile Lys Gln Ala Ile Ser Arg
495 500 505

GCC ATA GCC GAT CAG GCC CGC ACT ATC CGC ATC CCC ATT CAC ATG ATT 1587
Ala Ile Ala Asp Gln Ala Arg Thr Ile Arg Ile Pro Ile His Met Ile
510 515 520

GAT ACG ATT AAT CGC ATC AAT AAA GTC ATG CGC AAA CAC ATT CAA GAA 1635
Asp Thr Ile Asn Arg Ile Asn Lys Val Met Arg Lys His Ile Gln Glu
525 530 535

AAC GGC AAA GAG CCT GAT TTA GAA GTG GTG GCT GAA GAA GTG GGG CTT 1683
Asn Gly Lys Glu Pro Asp Leu Glu Val Val Ala Glu Glu Val Gly Leu
540 545 550 555

TCG TTA GAT AAA GTG AAG AAT GTG ATT AAG GTG ACT AAA GAG CCT ATC 1731
Ser Leu Asp Lys Val Lys Asn Val Ile Lys Val Thr Lys Glu Pro Ile
560 565 570

AGT TTG GAA ACC CCA GTC GGC AAT GAT GAT GAT GGC AAG TTT GGG GAT 1779
Ser Leu Glu Thr Pro Val Gly Asn Asp Asp Asp Gly Lys Phe Gly Asp
575 580 585

TTC GTG GAA GAT AAG AAT ATC GTC AGC TCC ATT GAT CAC ATC ATG CGA 1827
Phe Val Glu Asp Lys Asn Ile Val Ser Ser Ile Asp His Ile Met Arg
590 595 600

GAA GAT TTG AAA GCA CAA ATT GAA AGC GTT TTG GAT CAG TTG AAT GAG 1875
Glu Asp Leu Lys Ala Gln Ile Glu Ser Val Leu Asp Gln Leu Asn Glu
605 610 615

CGA GAA AAA GCG GTG ATC CGC ATG CGT TTT GGG CTT TTA GAC GAT GAA 1923
Arg Glu Lys Ala Val Ile Arg Met Arg Phe Gly Leu Leu Asp Asp Glu
620 625 630 635

AGC GAT CGA ACT TTA GAA GAA ATT GGC AAG GAA TTG AAT GTT ACT AGA 1971
Ser Asp Arg Thr Leu Glu Glu Ile Gly Lys Glu Leu Asn Val Thr Arg
640 645 650

GAA AGG GTG CGC CAG ATT GAA AGC TCT GCG ATT AAA AAA TTG AGA AGC 2019
Glu Arg Val Arg Gln Ile Glu Ser Ser Ala Ile Lys Lys Leu Arg Ser
655 660 665

CCG CAG TAC GGG CGC ATT TTA AGA AAC TAT TTG CGC ATT TGATGTTAAG GT 2070
Pro Gln Tyr Gly Arg Ile Leu Arg Asn Tyr Leu Arg Ile
670 675 680

TTCTCTAAAG CATGCGTTAT TTTCTTGTAG TTTTCTT 2107

(2) INFORMATION FOR SEQ ID NO: 1190:
GAC AAC GAA GAA GAT GAA GAA AAC GAA GAA AGG AAA AAA GTC GTT TCT 819
Asp Asn Glu Glu Asp Glu Glu Asn Glu Glu Arg Lys Lys Val Val Ser
255 260 265

GAA AAA GAC AAG AAG CGT GTA GAA AAG GTT CAA GAA AGC TTT AAA GCC 867
Glu Lys Asp Lys Lys Arg Val Glu Lys Val Gln Glu Ser Phe Lys Ala
270 275 280

CTA GAC AAG GCT AAA AAA GAA TGG CTT AAA GCC CTT GAA GCC CCC ATA 915
Leu Asp Lys Ala Lys Lys Glu Trp Leu Lys Ala Leu Glu Ala Pro Ile
285 290 295

GAT GAA AGA GAA GAC GAA TTG GTG CGT TCA TTG ACC CTA GCT TAC AAA 963
Asp Glu Arg Glu Asp Glu Leu Val Arg Ser Leu Thr Leu Ala Tyr Lys
300 305 310 315

CGC CAA ACA CTC AAA GAC AGA CTC TAT GAT TTA GAA CCT ACC AGC AAA 1011
Arg Gln Thr Leu Lys Asp Arg Leu Tyr Asp Leu Glu Pro Thr Ser Lys
320 325 330

CTG ATT AAT GAA TTA GTC AAA ACG ATG GAA ACC ACT TTA AAA AGC GGC 1059
Leu Ile Asn Glu Leu Val Lys Thr Met Glu Thr Thr Leu Lys Ser Gly
335 340 345

GAT GGG TTT GAA AAA GAG TTG AAA CGC TTG GAA TAC AAA CTG CCC TTA 1107
Asp Gly Phe Glu Lys Glu Leu Lys Arg Leu Glu Tyr Lys Leu Pro Leu
350 355 360

TTC AAT GAC ACT CTC ATC GCA AAC CAT AAA AAA ATC CTT GCC AAT ATC 1155
Phe Asn Asp Thr Leu Ile Ala Asn His Lys Lys Ile Leu Ala Asn Ile
365 370 375

ACT AAC ATG ACT AAA GAA GAT ATT ATC GCT CAA GTG CCA GAA GCG ACT 1203
Thr Asn Met Thr Lys Glu Asp Ile Ile Ala Gln Val Pro Glu Ala Thr
380 385 390 395

ATG GTG AGC GTG TAT ATG GAT CTT AAA AAG CTT TTT TTG ACT AAA GAA 1251
Met Val Ser Val Tyr Met Asp Leu Lys Lys Leu Phe Leu Thr Lys Glu
400 405 410

GCG AGC GAA GAA GGC TTT GAT CTA GCC CCC AAC AAG CTA AAA GAA ATT 1299
Ala Ser Glu Glu Gly Phe Asp Leu Ala Pro Asn Lys Leu Lys Glu Ile
415 420 425

TTA GAG CAA ATC AAA AGA GGG AAG TTG ATT TCC GAT CGC GCT AAA AAC 1347
Leu Glu Gln Ile Lys Arg Gly Lys Leu Ile Ser Asp Arg Ala Lys Asn
430 435 440

AAA ATG GCT AAA TCC AAT TTA AGG TTG GTG GTG AGC ATC GCT AAA CGA 1395
Lys Met Ala Lys Ser Asn Leu Arg Leu Val Val Ser Ile Ala Lys Arg
445 450 455

TTC ACG AGC AGA GGC TTA CCA TTC TTG GAT TTG ATT CAA GAG GGC AAT 1443
Phe Thr Ser Arg Gly Leu Pro Phe Leu Asp Leu Ile Gln Glu Gly Asn
460 465 470 475
AAA GCA GAA GCC ACA CAA GAA AAT AAA ACT AAA GAA AAC AAT AAA GCC 147
Lys Ala Glu Ala Thr Gln Glu Asn Lys Thr Lys Glu Asn Asn Lys Ala
30 35 40

AAA GAA AGC AAA ATT AAA GAA AGC AAA ATC AAA GAA GCT AAA GCG AAA 195
Lys Glu Ser Lys Ile Lys Glu Ser Lys Ile Lys Glu Ala Lys Ala Lys
45 50 55

GAA CCT ATT CCT GTT AAA AAG CTT AGT TTT AAT GAA GCG TTA GAA GAA 243
Glu Pro Ile Pro Val Lys Lys Leu Ser Phe Asn Glu Ala Leu Glu Glu
60 65 70 75

TTG TTC GCT AAT TCC TTA AGC GAT TGC GTT TCT TAT GAG TCC ATC ATT 291
Leu Phe Ala Asn Ser Leu Ser Asp Cys Val Ser Tyr Glu Ser Ile Ile
80 85 90

CAA ATC AGC GCG AAA GTC CCC ACT CTA GCC CAA ATC AAA AAA ATC AAA 339
Gln Ile Ser Ala Lys Val Pro Thr Leu Ala Gln Ile Lys Lys Ile Lys
95 100 105

GAA TTG TGC CAA AAA TAC CAA AAG AAA TTA GTC AGC TCT TCA GAA TAC 387
Glu Leu Cys Gln Lys Tyr Gln Lys Lys Leu Val Ser Ser Ser Glu Tyr
110 115 120

GCT AAA AAA CTC AAT GCG ATT GAC AAG ATT AAA AAA ACC GAA GAA AAG 435
Ala Lys Lys Leu Asn Ala Ile Asp Lys Ile Lys Lys Thr Glu Glu Lys
125 130 135

CAA AAA GTT TTA GAT GAA GAA TTA GAA GAT GGC TAT GAC TTT TTG AAA 483
Gln Lys Val Leu Asp Glu Glu Leu Glu Asp Gly Tyr Asp Phe Leu Lys
140 145 150 155

GAA AAG GAT TTT TTA GAG TGG AGC AGA AGC GAT AGC CCA GTG CGC ATG 531
Glu Lys Asp Phe Leu Glu Trp Ser Arg Ser Asp Ser Pro Val Arg Met
160 165 170

TAT TTG CGC GAA ATG GGG GAT ATA AAA CTT TTA AGC AAA GAT GAA GAG 579
Tyr Leu Arg Glu Met Gly Asp Ile Lys Leu Leu Ser Lys Asp Glu Glu
175 180 185

ATT GAA TTG AGC AAG CAA ATC CGC TTG GGT GAA GAC ATT ATT TTA GAC 627
Ile Glu Leu Ser Lys Gln Ile Arg Leu Gly Glu Asp Ile Ile Leu Asp
190 195 200

GCG ATC TGC TCG GTG CCG TAT TTG ATT GAT TTT ATC TAT GCG TAT AAA 675
Ala Ile Cys Ser Val Pro Tyr Leu Ile Asp Phe Ile Tyr Ala Tyr Lys
205 210 215

GAC GCT TTA ATC AAT CGT GAA AGA AGG GTT AAA GAG CTT TTC AGG AGC 723
Asp Ala Leu Ile Asn Arg Glu Arg Arg Val Lys Glu Leu Phe Arg Ser
220 225 230 235

TTT GAT GAT GAC GAT GAA AAT AGC GTG AGC GAT TCT AAA AAA GAT GAA 771
Phe Asp Asp Asp Asp Glu Asn Ser Val Ser Asp Ser Lys Lys Asp Glu
240 245 250
(A) LENGTH: 141 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1188:



Met Thr Lys Thr Ala Lys Val Asn Asp Ile Val Arg Asp Trp Val Val
1 5 10 15

Leu Asp Ala Lys Asp Lys Val Phe Gly Arg Leu Ile Thr Glu Ile Ala
20 25 30

Val Leu Leu Arg Gly Lys His Arg Pro Phe Tyr Thr Pro Asn Val Asp
35 40 45

Cys Gly Asp Phe Val Val Val Ile Asn Ala Asn Lys Val Lys Phe Ser
50 55 60

Gly Met Lys Leu Glu Asp Lys Glu Tyr Phe Thr His Ser Gly Tyr Phe
65 70 75 80

Gly Ser Thr Lys Ser Lys Thr Leu Gln Glu Met Leu Glu Lys Ala Pro
85 90 95

Glu Lys Leu Tyr His Leu Ala Val Arg Gly Met Leu Pro Lys Thr Lys
100 105 110

Leu Gly Lys Ala Met Ile Lys Lys Leu Lys Val Tyr Arg Asp Asp Lys
115 120 125

His Pro His Thr Ala Gln Thr Ser Lys Lys Asp Ala Lys
130 135 140

(2) INFORMATION FOR SEQ ID NO: 1189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2107 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19...2058
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1189:

TAGCTTTTGA TTGAAAGC ATG GGT TCT TAC TTT ATG GAG TGT CCA ATG AAA 51
Met Gly Ser Tyr Phe Met Glu Cys Pro Met Lys
1 5 10

AAG AAA GCT AAC GAA GAA AAA GCC CAA AAA AGA GCT AAA ACA GAA GCC 99
Lys Lys Ala Asn Glu Glu Lys Ala Gln Lys Arg Ala Lys Thr Glu Ala
15 20 25
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 441 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1187: 

TTTTATCAAA GGATTCTT ATG ACA AAG ACC GCT AAA GTC AAT GAC ATC GTT 51
Met Thr Lys Thr Ala Lys Val Asn Asp Ile Val
1 5 10

CGT GAT TGG GTC GTT TTA GAC GCC AAA GAC AAG GTT TTT GGC CGC TTG 99
Arg Asp Trp Val Val Leu Asp Ala Lys Asp Lys Val Phe Gly Arg Leu
15 20 25

ATC ACT GAA ATC GCT GTG CTT TTA AGA GGG AAA CAC CGC CCT TTT TAC 147
Ile Thr Glu Ile Ala Val Leu Leu Arg Gly Lys His Arg Pro Phe Tyr
30 35 40

ACC CCT AAT GTG GAT TGT GGG GAT TTT GTG GTG GTT ATC AAC GCT AAT 195
Thr Pro Asn Val Asp Cys Gly Asp Phe Val Val Val Ile Asn Ala Asn
45 50 55

AAG GTT AAA TTT TCA GGC ATG AAA TTA GAG GAT AAA GAG TAT TTT ACC 243
Lys Val Lys Phe Ser Gly Met Lys Leu Glu Asp Lys Glu Tyr Phe Thr
60 65 70 75

CAT TCA GGC TAT TTT GGC AGC ACT AAG AGC AAG ACT CTC CAA GAA ATG 291
His Ser Gly Tyr Phe Gly Ser Thr Lys Ser Lys Thr Leu Gln Glu Met
80 85 90

CTA GAA AAA GCC CCT GAA AAG CTC TAC CAC TTA GCC GTT AGG GGC ATG 339
Leu Glu Lys Ala Pro Glu Lys Leu Tyr His Leu Ala Val Arg Gly Met
95 100 105

CTC CCT AAA ACG AAA TTA GGG AAA GCG ATG ATT AAA AAA CTC AAA GTT 387
Leu Pro Lys Thr Lys Leu Gly Lys Ala Met Ile Lys Lys Leu Lys Val
110 115 120

TAT CGT GAT GAT AAG CAC CCT CAC ACC GCA CAA ACT AGC AAA AAG GAC 435
Tyr Arg Asp Asp Lys His Pro His Thr Ala Gln Thr Ser Lys Lys Asp
125 130 135

GCT AAA TGAGAAAAAT CTATGCTACC GGTAAAAGAA AAACCGCTAT CG 483
Ala Lys
140



(2) INFORMATION FOR SEQ ID NO: 1188:
(i) SEQUENCE CHARACTERISTICS: 

Ile Ala Phe Leu Thr Glu Asp Met Arg Cys Asp Ala Gly Ile Met Ile
85 90 95

Ser Ala Ser His Asn Pro Phe Glu Asp Asn Gly Ile Lys Phe Phe Asn
100 105 110

Ser Tyr Gly Tyr Lys Leu Lys Glu Glu Glu Glu Lys Ala Ile Glu Glu
115 120 125

Ile Phe His Asp Glu Glu Leu Leu His Ser Ser Tyr Lys Val Gly Glu
130 135 140

Ser Val Gly Ser Ala Lys Arg Ile Asp Asp Val Ile Gly Arg Tyr Ile
145 150 155 160

Ala His Leu Lys His Ser Phe Pro Lys His Leu Asn Leu Gln Ser Leu
165 170 175

Arg Ile Val Leu Asp Thr Ala Asn Gly Ala Ala Tyr Lys Val Ala Pro
180 185 190

Val Val Phe Ser Glu Leu Gly Ala Asp Val Leu Val Ile Asn Asp Glu
195 200 205

Pro Asn Gly Cys Asn Ile Asn Asp Gln Cys Gly Ala Leu His Pro Asn
210 215 220

Gln Leu Ser Gln Glu Val Lys Lys Tyr Arg Ala Asp Leu Gly Phe Ala
225 230 235 240

Phe Asp Gly Asp Ala Asp Arg Leu Val Val Val Asp Asn Leu Gly Asn
245 250 255

Ile Val His Gly Asp Lys Leu Leu Gly Val Leu Gly Val Tyr Gln Lys
260 265 270

Ser Lys Asn Ala Leu Ser Ser Gln Ala Val Val Ala Thr Asn Met Ser
275 280 285

Asn Leu Ala Leu Lys Glu Tyr Leu Lys Ser Gln Asp Leu Glu Leu Lys
290 295 300

His Cys Ala Ile Gly Asp Lys Phe Val Ser Glu Cys Met Gln Leu Asn
305 310 315 320

Lys Ala Asn Phe Gly Gly Glu Gln Ser Gly His Ile Ile Phe Ser Asp
325 330 335

Tyr Ala Lys Thr Gly Asp Gly Leu Val Cys Ala Leu Gln Val Ser Ala
340 345 350

Leu Val Leu Glu Ser Lys Gln Val Ser Ser Val Ala Leu Asn Pro Phe
355 360 365

Glu Leu Tyr Pro Gln Ser Leu Val Asn Leu Asn Val Gln Lys Lys Pro
370 375 380

Pro Leu Glu Ser Leu Lys Gly Tyr Ser Ala Leu Leu Lys Glu Leu Asp
385 390 395 400

Lys Leu Glu Ile Arg His Leu Ile Arg Tyr Ser Gly Thr Glu Asn Lys
405 410 415

Leu Arg Ile Leu Leu Glu Ala Lys Asp Glu Lys Leu Leu Glu Ser Lys
420 425 430

Met Gln Glu Leu Lys Glu Phe Phe Glu Gly His Leu Cys
435 440 445



(2) INFORMATION FOR SEQ ID NO: 1187: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

CAT ATC ATT TTT AGC GAT TAC GCT AAA ACA GGC GAT GGT TTG GTG TGC 1059
His Ile Ile Phe Ser Asp Tyr Ala Lys Thr Gly Asp Gly Leu Val Cys
335 340 345

GCT TTG CAA GTG AGC GCG TTA GTG TTA GAA AGC AAG CAG GTA AGC TCT 1107
Ala Leu Gln Val Ser Ala Leu Val Leu Glu Ser Lys Gln Val Ser Ser
350 355 360

GTT GCG TTA AAC CCC TTT GAA TTA TAC CCC CAA AGC CTA GTG AAT TTG 1155
Val Ala Leu Asn Pro Phe Glu Leu Tyr Pro Gln Ser Leu Val Asn Leu
365 370 375

AAT GTC CAA AAA AAG CCC CCT TTA GAA AGC CTG AAA GGT TAT AGC GCT 1203
Asn Val Gln Lys Lys Pro Pro Leu Glu Ser Leu Lys Gly Tyr Ser Ala
380 385 390

CTT TTA AAA GAA TTA GAC AAG CTA GAA ATC CGC CAT TTG ATC CGT TAT 1251
Leu Leu Lys Glu Leu Asp Lys Leu Glu Ile Arg His Leu Ile Arg Tyr
395 400 405 410

AGC GGC ACT GAA AAC AAA TTG CGA ATC CTT TTA GAA GCT AAA GAT GAA 1299
Ser Gly Thr Glu Asn Lys Leu Arg Ile Leu Leu Glu Ala Lys Asp Glu
415 420 425

AAG CTT TTA GAA TCC AAA ATG CAA GAA TTA AAA GAG TTT TTT GAA GGG 1347
Lys Leu Leu Glu Ser Lys Met Gln Glu Leu Lys Glu Phe Phe Glu Gly
430 435 440

CAT TTG TGC TAAAAACCAC TAAAAAAAGC CTGTTGGTTT TTATGG 1392
His Leu Cys
445



(2) INFORMATION FOR SEQ ID NO: 1186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1186: 



Met Lys Ile Phe Gly Thr Asp Gly Val Arg Gly Lys Ala Gly Val Lys
1 5 10 15

Leu Thr Pro Met Phe Val Met Arg Leu Gly Ile Ala Ala Gly Leu Tyr
20 25 30

Phe Lys Lys His Ser Gln Thr Asn Lys Ile Leu Ile Gly Lys Asp Thr
35 40 45

Arg Lys Ser Gly Tyr Met Val Glu Asn Ala Leu Val Ser Ala Leu Thr
50 55 60

Ser Ile Gly Tyr Asn Val Ile Gln Ile Gly Pro Met Pro Thr Pro Ala
65 70 75 80

GGC ATT AAG TTT TTC AAT TCT TAT GGC TAT AAG CTT AAA GAA GAA GAA 387
Gly Ile Lys Phe Phe Asn Ser Tyr Gly Tyr Lys Leu Lys Glu Glu Glu
110 115 120

GAA AAA GCG ATT GAA GAA ATC TTT CAT GAT GAA GAA TTA CTG CAT TCT 435
Glu Lys Ala Ile Glu Glu Ile Phe His Asp Glu Glu Leu Leu His Ser
125 130 135

AGC TAT AAA GTG GGT GAG AGC GTC GGT AGC GCT AAA AGG ATA GAC GAT 483
Ser Tyr Lys Val Gly Glu Ser Val Gly Ser Ala Lys Arg Ile Asp Asp
140 145 150

GTC ATA GGG CGC TAT ATT GCA CAT TTA AAA CAC TCT TTC CCC AAA CAT 531
Val Ile Gly Arg Tyr Ile Ala His Leu Lys His Ser Phe Pro Lys His
155 160 165 170

TTG AAT TTA CAG AGT TTA AGG ATC GTG CTA GAT ACG GCT AAT GGC GCG 579
Leu Asn Leu Gln Ser Leu Arg Ile Val Leu Asp Thr Ala Asn Gly Ala
175 180 185

GCT TAT AAG GTG GCT CCG GTC GTT TTT AGC GAG CTT GGG GCT GAT GTG 627
Ala Tyr Lys Val Ala Pro Val Val Phe Ser Glu Leu Gly Ala Asp Val
190 195 200

TTA GTG ATT AAT GAT GAG CCT AAC GGG TGT AAC ATT AAT GAT C7^ TGC 675
Leu Val Ile Asn Asp Glu Pro Asn Gly Cys Asn Ile Asn Asp Gln Cys
205 210 215

GGG GCT TTA CAC CCC AAC CAA TTA AGC CAG GAA GTG AAA AAA TAC CGC 723
Gly Ala Leu His Pro Asn Gln Leu Ser Gln Glu Val Lys Lys Tyr Arg
220 225 230

GCA GAT TTA GGC TTT GCT TTT GAT GGC GAT GCT GAC AGG CTA GTG GTG 771
Ala Asp Leu Gly Phe Ala Phe Asp Gly Asp Ala Asp Arg Leu Val Val
235 240 245 250

GTG GAT AAT TTA GGG AAT ATC GTG CAT GGG GAT AAG CTT TTA GGG GTG 819
Val Asp Asn Leu Gly Asn Ile Val His Gly Asp Lys Leu Leu Gly Val
255 260 265

TTA GGG GTT TAT CAA AAA TCT AAA AAC GCC CTT TCT TCT CAA GCG GTT 867
Leu Gly Val Tyr Gln Lys Ser Lys Asn Ala Leu Ser Ser Gln Ala Val
270 275 280

GTC GCC ACA AAC ATG AGC AAT TTA GCC CTT AAA GAA TAT TTA AAA TCC 915
Val Ala Thr Asn Met Ser Asn Leu Ala Leu Lys Glu Tyr Leu Lys Ser
285 290 295

CAA GAT TTG GAA TTG AAG CAT TGC GCG ATT GGG GAT AAG TTT GTG AGC 963
Gln Asp Leu Glu Leu Lys His Cys Ala Ile Gly Asp Lys Phe Val Ser
300 305 310

GAA TGC ATG CAA TTG AAT AAA GCC AAT TTT GGA GGC GAG CAA AGC GGG 1011
Glu Cys Met Gln Leu Asn Lys Ala Asn Phe Gly Gly Glu Gln Ser Gly
315 320 325 330

100 105 110

Ala Ile Phe Asn Phe Ala Asp Val Met Ile Asp Val Gly Val Gly Val
115 120 125

Leu Leu Leu Lys Gln Phe Phe Phe Lys Gln Lys Gln Asn Lys Ile Lys
130 135 140

Ala
145

(2) INFORMATION FOR SEQ ID NO: 1185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1392 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 22... 1356
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1185: 

TTTAAAAGGT ATTTTATAAC G ATG AAA ATT TTT GGG ACT GAT GGC GTG AGG 51
Met Lys Ile Phe Gly Thr Asp Gly Val Arg
1 5 10

GGT AAA GCA GGG GTG AAA CTC ACC CCC ATG TTT GTG ATG CGT TTA GGC 99
Gly Lys Ala Gly Val Lys Leu Thr Pro Met Phe Val Met Arg Leu Gly
15 20 25

ATT GCT GCC GGA TTG TAT TTT AAA AAA CAT TCT CAA ACG AAT AAA ATT 147
Ile Ala Ala Gly Leu Tyr Phe Lys Lys His Ser Gln Thr Asn Lys Ile
30 35 40

CTA ATC GGT AAA GAC ACC AGA AAA AGC GGC TAT ATG GTA GAA AAC GCT 195
Leu Ile Gly Lys Asp Thr Arg Lys Ser Gly Tyr Met Val Glu Asn Ala
45 50 55

TTA GTG AGC GCT CTA ACT TCC ATA GGC TAT AAT GTG ATT CAA ATA GGG 243
Leu Val Ser Ala Leu Thr Ser Ile Gly Tyr Asn Val Ile Gln Ile Gly
60 65 70

CCT ATG CCC ACC CCT GCG ATT GCG TTT TTA ACT GAA GAC ATG CGC TGT 291
Pro Met Pro Thr Pro Ala Ile Ala Phe Leu Thr Glu Asp Met Arg Cys
75 80 85 90

GAT GCG GGT ATT ATG ATA AGC GCG AGC CAC AAC CCT TTT GAA GAT AAT 339
Asp Ala Gly Ile Met Ile Ser Ala Ser His Asn Pro Phe Glu Asp Asn
95 100 105

AAT AAA GGC GTG GCG TTT TCC TTG CTC AGT TTT TTA GAG GGG GGT TTG 200
Asn Lys Gly Val Ala Phe Ser Leu Leu Ser Phe Leu Glu Gly Gly Leu
40 45 50

AAA TAC TTG CAA ATC CTT TTG ATT TTA GGG CTT TTT ATC TTT TTA ATG 248
Lys Tyr Leu Gln Ile Leu Leu Ile Leu Gly Leu Phe Ile Phe Leu Met
55 60 65

CGC CAA AGG GAG CTT TTT AAA AAC CAT GCG ATA GAG TTT GGC ATG GTG 296
Arg Gln Arg Glu Leu Phe Lys Asn His Ala Ile Glu Phe Gly Met Val
70 75 80

TTT GGC GCC GGG GTT TCT AAT GTT TTA GAC CGG TTT GTG CAT GGG GGC 344
Phe Gly Ala Gly Val Ser Asn Val Leu Asp Arg Phe Val His Gly Gly
85 90 95

GTG GTG GAT TAT GTG TAT TAT CAT TAT GGC TTT GAT TTT GCC ATT TTT 392
Val Val Asp Tyr Val Tyr Tyr His Tyr Gly Phe Asp Phe Ala Ile Phe
100 105 110 115

AAT TTC GCT GAT GTC ATG ATA GAT GTG GGC GTG GGC GTT TTA TTG TTG 440
Asn Phe Ala Asp Val Met Ile Asp Val Gly Val Gly Val Leu Leu Leu
120 125 130

AAA CAA TTC TTT TTT AAG CAA AAA CAA AAC AAA ATT AAG GCA TAATCACTC 491
Lys Gln Phe Phe Phe Lys Gln Lys Gln Asn Lys Ile Lys Ala
135 140 145

TTTTTAAAAT GAAAGGTCGC GTAGCTCAGT TGGTA 526

(2) INFORMATION FOR SEQ ID NO: 1184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1184: 

Met Gly Val Phe Phe Leu Ile Phe Gly Val Asp Gln Ala Ile Lys Tyr
1 5 10 15

Ala Ile Leu Glu Gly Phe Arg Tyr Glu Ser Leu Met Ile Asp Ile Val
20 25 30

Leu Val Phe Asn Lys Gly Val Ala Phe Ser Leu Leu Ser Phe Leu Glu
35 40 45

Gly Gly Leu Lys Tyr Leu Gln Ile Leu Leu Ile Leu Gly Leu Phe Ile
50 55 60

Phe Leu Met Arg Gln Arg Glu Leu Phe Lys Asn His Ala Ile Glu Phe
65 70 75 80

Gly Met Val Phe Gly Ala Gly Val Ser Asn Val Leu Asp Arg Phe Val
85 90 95

His Gly Gly Val Val Asp Tyr Val Tyr Tyr His Tyr Gly Phe Asp Phe 



WO 98/43478

PCT/US98/06371



Gly Ile Cys Gly Leu Thr Lys Val Asp Pro Lys Ser Thr Ala Val Met
20 25 30

Asn Phe Phe Val Gly Gly Leu Ser Ile Ile Cys Asn Ile Val Val Ile
35 40 45

Thr Tyr Ser Ala Leu His Pro Thr Ala Pro Val Glu Gly Ala Glu Asp
50 55 60

Ile Ala Gln Val Ser His His Leu Thr Ser Phe Tyr Gly Pro Ala Thr
65 70 75 80

Gly Leu Leu Phe Gly Phe Thr Tyr Leu Tyr Ala Ala Ile Asn His Thr
85 90 95

Phe Gly Leu Asp Trp Arg Pro Tyr Ser Trp Tyr Ser Leu Phe Val Ala
100 105 110

Ile Asn Thr Ile Pro Ala Ala Ile Leu Ser His Tyr Ser Asp Met Leu
115 120 125

Asp Asp His Lys Val Leu Gly Ile Thr Glu Gly Asp Trp Trp Ala Ile
130 135 140

Ile Trp Leu Ala Trp Gly Val Leu Trp Leu Thr Ala Phe Ile Glu Asn
145 150 155 160

Ile Leu Lys Ile Pro Leu Gly Lys Phe Thr Pro Trp Leu Ala Ile Ile
165 170 175

Glu Gly Ile Leu Thr Ala Trp Ile Pro Ala Trp Leu Leu Phe Ile Gln
180 185 190

His Trp Val
195

(2) INFORMATION FOR SEQ ID NO:1183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 48... 482 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1183: 

GAAGGGCATT TGTGCTAAAA ACCACTAAAA AAAGCCTGTT GGTTTTT ATG GGG GTT 56
Met Gly Val
1

TTT TTC CTT ATT TTT GGC GTG GAT CAA GCG ATT AAA TAC GCT ATT TTA 104
Phe Phe Leu Ile Phe Gly Val Asp Gln Ala Ile Lys Tyr Ala Ile Leu
5 10 15

GAA GGG TTT CGC TAT GAA AGT TTG ATG ATA GAT ATT GTT TTA GTG TTC 152
Glu Gly Phe Arg Tyr Glu Ser Leu Met Ile Asp Ile Val Leu Val Phe
20 25 30 35

Gly Gly Leu Ser Ile Ile Cys Asn Ile Val Val Ile Thr Tyr Ser Ala
40 45 50

CTC CAC CCT ACA GCC CCT GTA GAA GGT GCT GAA GAT ATT GCT CAA GTA 246
Leu His Pro Thr Ala Pro Val Glu Gly Ala Glu Asp Ile Ala Gln Val
55 60 65

TCG CAC CAT TTG ACT AGT TTC TAT GGA CCA GCG ACT GGG TTA TTG TTT 294
Ser His His Leu Thr Ser Phe Tyr Gly Pro Ala Thr Gly Leu Leu Phe
70 75 80

GGT TTC ACC TAC TTG TAT GCG GCT ATC AAC CAC ACT TTT GGT TTG GAT 342
Gly Phe Thr Tyr Leu Tyr Ala Ala Ile Asn His Thr Phe Gly Leu Asp
85 90 95 100

TGG AGG CCC TAC TCT TGG TAT AGC TTA TTC GTA GCG ATC AAC ACG ATT 390
Trp Arg Pro Tyr Ser Trp Tyr Ser Leu Phe Val Ala Ile Asn Thr Ile
105 110 115

CCT GCT GCG ATT TTA TCC CAC TAT AGC GAT ATG CTT GAT GAC CAC AAA 438
Pro Ala Ala Ile Leu Ser His Tyr Ser Asp Met Leu Asp Asp His Lys
120 125 130

GTG TTA GGC ATC ACT GAA GGC GAT TGG TGG GCG ATC ATT TGG TTG GCT 486
Val Leu Gly Ile Thr Glu Gly Asp Trp Trp Ala Ile Ile Trp Leu Ala
135 140 145

TGG GGT GTT TTG TGG CTT ACC GCT TTC ATT GAA AAC ATC TTG AAA ATC 534
Trp Gly Val Leu Trp Leu Thr Ala Phe Ile Glu Asn Ile Leu Lys Ile
150 155 160

CCT TTA GGG AAA TTC ACT CCA TGG CTT GCT ATC ATT GAG GGT ATT TTA 582
Pro Leu Gly Lys Phe Thr Pro Trp Leu Ala Ile Ile Glu Gly Ile Leu
165 170 175 180

ACC GCT TGG ATC CCT GCT TGG TTG CTC TTT ATC CAA CAC TGG GTG TGAGA 632
Thr Ala Trp Ile Pro Ala Trp Leu Leu Phe Ile Gln His Trp Val
185 190 195

TGATCATAGA GCGTTTAGT 651
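
Editorial consistency check (not part of the original listing): the coding-sequence LOCATION of a nucleotide entry can be compared with the LENGTH of the protein entry that follows it. The Python sketch below is illustrative only. The helper name cds_codon_count is hypothetical, and the numbers are copied from SEQ ID NO: 1181 (LOCATION 43... 627) and SEQ ID NO: 1182 (LENGTH 195 amino acids). It assumes the LOCATION span is 1-based, inclusive, and excludes the stop codon, which is what the printed sequences suggest.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of whole codons in a 1-based, inclusive span.

    Hypothetical helper for checking this listing; raises ValueError if
    the span length is not a multiple of three.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {start}..{end} is {length} nt, not a multiple of 3")
    return length // 3


# Values copied from the listing (assumed to exclude the stop codon):
# SEQ ID NO: 1181 LOCATION 43... 627, SEQ ID NO: 1182 LENGTH 195 amino acids.
codons = cds_codon_count(43, 627)
declared_protein_length = 195
print(codons, codons == declared_protein_length)
```

The same check can be repeated for any other nucleotide/protein pair in the listing by substituting its LOCATION and LENGTH values.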

(2) INFORMATION FOR SEQ ID NO: 1182: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 195 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182:

Met Leu Gly Leu Val Leu Leu Tyr Val Gly Ile Val Leu Ile Ser Asn
1 5 10 15

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180: 

Met Phe Ser His Glu Val Tyr Leu Glu Gly Cys Thr Leu Glu Leu Arg
1 5 10 15

Lys Ile Cys Asp Asp Phe Glu Lys Asn Ala Met Gln Asp Asp Leu Gly
20 25 30

Gln Lys Leu Arg Ser Asp Val Leu Glu Asp Met Leu Lys Ile Ala His
35 40 45

Asp Leu Glu Asn Leu Glu Asp Asp Thr Gln Tyr Gln Arg Arg Ile Ile
50 55 60

Asp Glu Gln Ile Glu Glu Ala Lys Ser Leu Met Arg Gln Ile Asp Met
65 70 75 80

Asn Phe His Pro Ser Ser Glu Ile Asp Arg Leu Met Arg Glu Ala Lys
85 90 95

Glu His Glu Arg Glu Ala Ser Lys Arg Tyr Asp Glu Tyr Leu Lys Ser
100 105 110

Lys Asp Lys Asn Asp
115

(2) INFORMATION FOR SEQ ID NO: 1181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 43... 627 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1181: 

GGTGCGTTTG TTGTAAAAAT TTTTGTTTGG AAGGAAAAGG CA ATG CTA GGA CTT 54
Met Leu Gly Leu
1

GTA TTG TTA TAT GTT GGG ATT GTT TTA ATC AGC AAT GGG ATT TGC GGG 102
Val Leu Leu Tyr Val Gly Ile Val Leu Ile Ser Asn Gly Ile Cys Gly
5 10 15 20

TTA ACC AAA GTC GAT CCT AAA AGC ACT GCG GTG ATG AAC TTT TTT GTG 150
Leu Thr Lys Val Asp Pro Lys Ser Thr Ala Val Met Asn Phe Phe Val
25 30 35

GGC GGA CTT TCC ATT ATT TGT AAT ATA GTT GTC ATC ACT TAT TCT GCA 198 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25... 375
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179:

AAAAACTGGA ATCAAGGGGT TAAA ATG TTT TCT CAT GAA GTT TAT TTG GAG 51
Met Phe Ser His Glu Val Tyr Leu Glu
1 5

GGT TGC ACC CTT GAA TTA AGA AAG ATT TGC GAT GAT TTT GAA AAA AAT 99
Gly Cys Thr Leu Glu Leu Arg Lys Ile Cys Asp Asp Phe Glu Lys Asn
10 15 20 25

GCC ATG CAA GAT GAT TTA GGG CAG AAA CTC AGG AGT GAT GTG CTA GAG 147
Ala Met Gln Asp Asp Leu Gly Gln Lys Leu Arg Ser Asp Val Leu Glu
30 35 40

GAC ATG CTA AAA ATC GCG CAT GAT TTA GAA AAT TTA GAA GAT GAC ACC 195
Asp Met Leu Lys Ile Ala His Asp Leu Glu Asn Leu Glu Asp Asp Thr
45 50 55

CAA TAC CAA AGA AGA ATA ATT GAC GAG CAA ATT GAA GAA GCC AAA TCT 243
Gln Tyr Gln Arg Arg Ile Ile Asp Glu Gln Ile Glu Glu Ala Lys Ser
60 65 70

TTG ATG AGG CAA ATT GAT ATG AAT TTC CAT CCA TCA AGC GAG ATC GAT 291
Leu Met Arg Gln Ile Asp Met Asn Phe His Pro Ser Ser Glu Ile Asp
75 80 85

AGG CTT ATG CGT GAA GCC AAA GAG CAT GAA AGA GAA GCT AGT AAA AGA 339
Arg Leu Met Arg Glu Ala Lys Glu His Glu Arg Glu Ala Ser Lys Arg
90 95 100 105

TAT GAT GAG TAT CTT AAA TCT AAG GAT AAA AAT GAT TGATGTGAAT GGTTTA 391
Tyr Asp Glu Tyr Leu Lys Ser Lys Asp Lys Asn Asp
110 115

TTAAAAGAAC TGGATGATGC CTTAGATAA 420

(2) INFORMATION FOR SEQ ID NO: 1180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

Ile Pro Thr Thr Leu Lys Tyr Met Ile Ser Leu Gly Lys Ile Arg Glu
30 35 40 45

TTA GAT GTT TTA GCA AAT CTT GCT AAA CTT TGC CCT ACT TGT CAT AGG 255
Leu Asp Val Leu Ala Asn Leu Ala Lys Leu Cys Pro Thr Cys His Arg
50 55 60

GCT TTA AAA AAA GGA TCT AGC GAA GAG GAG TTT CAA AAA CGC TTG ATT 303
Ala Leu Lys Lys Gly Ser Ser Glu Glu Glu Phe Gln Lys Arg Leu Ile
65 70 75

AGA AAC ATT CTC AAT CGC AAT AAA GAC AAT TTA GAG TTT GCG CAA TTG 351
Arg Asn Ile Leu Asn Arg Asn Lys Asp Asn Leu Glu Phe Ala Gln Leu
80 85 90

CGT TTT GAA ACC GAT GAT TTT TCA ACG CTT ATT GAT CGT ATT TGT GAA 399
Arg Phe Glu Thr Asp Asp Phe Ser Thr Leu Ile Asp Arg Ile Cys Glu
95 100 105

AGC TTG AAA TGAATTATAA AATTTTAGAT TTATTTTGTG GGGCTGGGGG T 449
Ser Leu Lys
110



(2) INFORMATION FOR SEQ ID NO: 1178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 112 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1178: 



Met Phe Ala Val His Ala Ala Met Ile Thr Thr Leu Lys Lys Glu Val
1 5 10 15

Phe Phe Leu Tyr Leu Tyr Ile Lys Ser Leu Lys Ile Pro Ile Pro Thr
20 25 30

Thr Leu Lys Tyr Met Ile Ser Leu Gly Lys Ile Arg Glu Leu Asp Val
35 40 45

Leu Ala Asn Leu Ala Lys Leu Cys Pro Thr Cys His Arg Ala Leu Lys
50 55 60

Lys Gly Ser Ser Glu Glu Glu Phe Gln Lys Arg Leu Ile Arg Asn Ile
65 70 75 80

Leu Asn Arg Asn Lys Asp Asn Leu Glu Phe Ala Gln Leu Arg Phe Glu
85 90 95

Thr Asp Asp Phe Ser Thr Leu Ile Asp Arg Ile Cys Glu Ser Leu Lys
100 105 110



(2) INFORMATION FOR SEQ ID NO: 1179: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 420 base pairs

130 135 140

Glu Ser Asp Gly Ala Gly Ser Val Leu Thr Asn Thr Gln Cys Leu Leu
145 150 155 160

Glu Lys Asn Arg Asn Pro His Leu Asn Gln Asn Gly Ile Glu Asn Met
165 170 175

Leu Lys Lys Glu Leu Gly Ala Lys Gln Val Leu Trp Tyr Ser Tyr Gly
180 185 190

Tyr Leu Lys Gly Asp Asp Thr Asp Ser His Thr Asp Thr Leu Ala Arg
195 200 205

Phe Leu Asp Lys Asp Thr Ile Val Tyr Ser Thr Cys Glu Asp Glu Asn
210 215 220

Asp Glu His Tyr Thr Ala Leu Lys Lys Met Gln Glu Glu Leu Lys Thr
225 230 235 240

Phe Lys Lys Leu Asp Gly Thr Pro Tyr Lys Leu Ile Pro Leu Glu Ile
245 250 255

Pro Lys Ala Ile Phe Asp Glu Asn Gln Gln Arg Leu Pro Ala Thr Tyr
260 265 270

Val Asn Phe Leu Leu Cys Asn Asn Ala Leu Ile Val Pro Thr Tyr Asn
275 280 285

Asp Pro Lys Asp Ala Leu Ile Leu Glu Thr Leu Lys Gln His Thr Pro
290 295 300

Leu Glu Val Ile Gly Val Asp Cys Asn Thr Leu Ile Lys Gln His Gly
305 310 315 320

Ser Leu His Cys Val Thr Met Gln Leu Tyr
325 330

(2) INFORMATION FOR SEQ ID NO: 1177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 449 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 73... 408 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177: 

GCTTTATTCA AAAGAGCGAG GAGTTTAGAC GAGATAGCAA AATCATCAAT CTTTATCGCC 60

TTTCAACGCC TA ATG TTT GCA GTG CAT GCT GCG ATG ATT ACG ACA TTA AAG 111
Met Phe Ala Val His Ala Ala Met Ile Thr Thr Leu Lys
1 5 10

AAA GAA GTT TTC TTT CTT TAC CTT TAT ATC AAA TCA CTC AAA ATC CCG 159
Lys Glu Val Phe Phe Leu Tyr Leu Tyr Ile Lys Ser Leu Lys Ile Pro
15 20 25

ATT CCT ACT ACA CTG AAA TAC ATG ATT TCT TTA GGC AAA ATC AGA GAA 207

GAA TTA AAA ACC TTT AAA AAA CTA GAC GGC ACG CCC TAT AAA CTC ATC 771
Glu Leu Lys Thr Phe Lys Lys Leu Asp Gly Thr Pro Tyr Lys Leu Ile
240 245 250

CCC CTA GAA ATC CCT AAA GCC ATT TTT GAT GAA AAC CAA CAA CGC TTG 819
Pro Leu Glu Ile Pro Lys Ala Ile Phe Asp Glu Asn Gln Gln Arg Leu
255 260 265

CCG GCA ACT TAT GTG AAT TTT TTA TTG TGC AAT AAC GCT TTA ATC GTG 867
Pro Ala Thr Tyr Val Asn Phe Leu Leu Cys Asn Asn Ala Leu Ile Val
270 275 280

CCC ACT TAC AAC GAC CCT AAA GAC GCG CTC ATT TTA GAA ACC TTG AAA 915
Pro Thr Tyr Asn Asp Pro Lys Asp Ala Leu Ile Leu Glu Thr Leu Lys
285 290 295 300

CAA CAC ACG CCC TTA GAA GTG ATA GGG GTT GAT TGC AAC ACC TTA ATC 963
Gln His Thr Pro Leu Glu Val Ile Gly Val Asp Cys Asn Thr Leu Ile
305 310 315

AAA CAG CAT GGA AGT TTG CAT TGT GTA ACG ATG CAA CTT TAT TGAACAAAA 1014
Lys Gln His Gly Ser Leu His Cys Val Thr Met Gln Leu Tyr
320 325 330

TCACGCTTTT TGGCGTGGT 1033

(2) INFORMATION FOR SEQ ID NO: 1176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 330 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1176: 

Met Lys Arg Met Leu Ala Glu Phe Glu Lys Ile Gln Ala Ile Leu Met
1 5 10 15

Ala Phe Pro His Glu Phe Ser Asp Trp Ala Tyr Cys Ile Lys Glu Ala
20 25 30

Arg Glu Ser Phe Leu Asn Ile Ile Gln Thr Ile Ala Lys His Ala Lys
35 40 45

Val Leu Val Cys Val His Thr Asn Asp Ile Ile Gly Tyr Glu Thr Leu
50 55 60

Lys Asn Leu Pro Gly Val Glu Ile Ala Arg Ile Asp Thr Asn Asp Thr
65 70 75 80

Trp Ala Arg Asp Phe Gly Ala Ile Ser Val Glu Asn His Gly Val Leu
85 90 95

Glu Cys Leu Asp Phe Gly Phe Asn Gly Trp Gly Leu Lys Tyr Pro Ser
100 105 110

Asn Leu Asp Asn Gln Val Asn Phe Lys Leu Lys Ser Leu Gly Phe Leu
115 120 125

Lys His Pro Leu Lys Thr Met Pro Tyr Ile Leu Glu Gly Gly Ser Ile

GCG ATT CTA ATG GCT TTC CCC CAT GAG TTT AGC GAC TGG GCG TAT TGT 99
Ala Ile Leu Met Ala Phe Pro His Glu Phe Ser Asp Trp Ala Tyr Cys
15 20 25

ATC AAA GAG GCT AGG GAA AGT TTT TTA AAC ATC ATT CAA ACC ATA GCC 147
Ile Lys Glu Ala Arg Glu Ser Phe Leu Asn Ile Ile Gln Thr Ile Ala
30 35 40

AAA CAC GCT AAA GTG CTA GTG TGC GTC CAC ACT AAC GAT ATT ATC GGT 195
Lys His Ala Lys Val Leu Val Cys Val His Thr Asn Asp Ile Ile Gly
45 50 55 60

TAT GAA ACG CTT AAA AAC TTA CCC GGT GTA GAG ATC GCA AGG ATT GAC 243
Tyr Glu Thr Leu Lys Asn Leu Pro Gly Val Glu Ile Ala Arg Ile Asp
65 70 75

ACT AAC GAC ACA TGG GCT AGG GAT TTT GGA GCG ATC AGC GTT GAA AAT 291
Thr Asn Asp Thr Trp Ala Arg Asp Phe Gly Ala Ile Ser Val Glu Asn
80 85 90

CAT GGC GTT TTA GAG TGC TTG GAT TTT GGC TTT AAT GGC TGG GGG TTA 339
His Gly Val Leu Glu Cys Leu Asp Phe Gly Phe Asn Gly Trp Gly Leu
95 100 105

AAA TAC CCG TCC AAT TTA GAC AAT CAA GTG AAT TTC AAA CTC AAA AGT 387
Lys Tyr Pro Ser Asn Leu Asp Asn Gln Val Asn Phe Lys Leu Lys Ser
110 115 120

TTA GGG TTT TTA AAA CAC CCT TTA AAA ACG ATG CCC TAT ATT TTA GAG 435
Leu Gly Phe Leu Lys His Pro Leu Lys Thr Met Pro Tyr Ile Leu Glu
125 130 135 140

GGC GGG AGT ATA GAA AGC GAT GGG GCT GGG AGC GTT TTA ACC AAC ACC 483
Gly Gly Ser Ile Glu Ser Asp Gly Ala Gly Ser Val Leu Thr Asn Thr
145 150 155

CAA TGC CTG TTA GAA AAA AAT CGT AAC CCC CAT TTG AAT CAA AAT GGA 531
Gln Cys Leu Leu Glu Lys Asn Arg Asn Pro His Leu Asn Gln Asn Gly
160 165 170

ATA GAA AAC ATG CTT AAA AAG GAA TTA GGG GCT AAA CAA GTG CTG TGG 579
Ile Glu Asn Met Leu Lys Lys Glu Leu Gly Ala Lys Gln Val Leu Trp
175 180 185

TAT TCT TAT GGC TAT CTC AAA GGC GAT GAT ACC GAT AGC CAT ACC GAC 627
Tyr Ser Tyr Gly Tyr Leu Lys Gly Asp Asp Thr Asp Ser His Thr Asp
190 195 200

ACG CTC GCT CGT TTT TTA GAT AAA GAC ACC ATT GTT TAT AGC ACA TGC 675
Thr Leu Ala Arg Phe Leu Asp Lys Asp Thr Ile Val Tyr Ser Thr Cys
205 210 215 220

GAA GAT GAA AAC GAT GAG CAC TAC ACA GCC TTA AAA AAA ATG CAA GAA 723
Glu Asp Glu Asn Asp Glu His Tyr Thr Ala Leu Lys Lys Met Gln Glu
225 230 235

85 90 95

Gly Leu Glu Ile Ser Leu Leu Lys Gln Ile Leu Gln Ser Ile Gln Lys
100 105 110

Glu Leu Phe Lys Ala Asn Leu Lys Leu Leu Ser Leu Asp Thr Lys Val
115 120 125

Val Pro Lys Gly Ser Val Asp Lys Leu Phe Ile Asn Thr Thr Cys Ile
130 135 140

Gly Lys Ile Ile Lys Pro Gly Ile Ser Ser Tyr His Leu Gln Gln Gly
145 150 155 160

Gln Ala Ile Ile Leu Ser Asp Thr Ile Ala Asn His Gly Ala Ser Leu
165 170 175

Phe Ala Met Arg Asn Glu Ile Lys Leu Lys Thr Asn Leu Glu Ser Asp
180 185 190

Cys Gln Leu Leu Tyr Pro Leu Leu Lys Pro Leu Phe Leu Ser Asp Leu
195 200 205

Lys Ile Asp Ala Leu Arg Asp Ala Thr Arg Gly Gly Leu Ala Ser Val
210 215 220

Leu Asn Glu Trp Ala Asn Ser Ser Arg Val Lys Ile Val Ile Glu Glu
225 230 235 240

Glu Lys Ile Pro Leu Lys Glu Glu Thr Lys Gly Ile Cys Glu Ile Leu
245 250 255

Gly Leu Glu Pro Tyr Ala Leu Ala Asn Glu Gly Val Phe Val Leu Ala
260 265 270

Leu Asn Gln Lys Asp Ala Pro Lys Ala Leu Glu Ile Leu Lys Ser Asn
275 280 285

Glu Lys Ala Lys Asn Ala Cys Val Ile Gly Lys Val Phe Glu Asn Pro
290 295 300

Tyr Pro Ser Val Val Leu Lys Asn Ala Trp Gly Phe Glu Arg Ile Leu
305 310 315 320

Glu Val Pro Glu Gly Glu Leu Leu Pro Arg Ile Cys
325 330

(2) INFORMATION FOR SEQ ID NO: 1175:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16... 1005 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175:

AAAAGGATAT TTTGA ATG AAA AGA ATG TTA GCG GAG TTT GAA AAA ATC CAA 51
Met Lys Arg Met Leu Ala Glu Phe Glu Lys Ile Gln
1 5 10

210 215 220

AGC GTG CTG AAC GAA TGG GCG AAC AGC TCT AGA GTG AAA ATC GTT ATA 783
Ser Val Leu Asn Glu Trp Ala Asn Ser Ser Arg Val Lys Ile Val Ile
225 230 235

GAA GAA GAA AAA ATC CCC TTA AAA GAA GAA ACG AAA GGG ATT TGT GAG 831
Glu Glu Glu Lys Ile Pro Leu Lys Glu Glu Thr Lys Gly Ile Cys Glu
240 245 250

ATT TTA GGG TTA GAA CCC TAC GCG CTA GCC AAT GAG GGG GTG TTT GTT 879
Ile Leu Gly Leu Glu Pro Tyr Ala Leu Ala Asn Glu Gly Val Phe Val
255 260 265 270

TTA GCG CTC AAT CAA AAA GAC GCC CCT AAA GCC TTA GAA ATT TTA AAA 927
Leu Ala Leu Asn Gln Lys Asp Ala Pro Lys Ala Leu Glu Ile Leu Lys
275 280 285

AGT AAC GAA AAA GCT AAA AAC GCT TGC GTG ATT GGC AAA GTG TTT GAA 975
Ser Asn Glu Lys Ala Lys Asn Ala Cys Val Ile Gly Lys Val Phe Glu
290 295 300

AAC CCT TAT CCT AGC GTG GTT TTA AAG AAC GCA TGG GGT TTT GAA AGG 1023
Asn Pro Tyr Pro Ser Val Val Leu Lys Asn Ala Trp Gly Phe Glu Arg
305 310 315

ATT TTA GAG GTG CCA GAG GGC GAA TTA TTG CCT AGG ATT TGT TAACACGCC 1074
Ile Leu Glu Val Pro Glu Gly Glu Leu Leu Pro Arg Ile Cys
320 325 330

GTCATTTTTT AATCGTTTTA AGCCTGCCCT AAAAATGGTT TA 1116

(2) INFORMATION FOR SEQ ID NO: 1174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174:

Met Asp Ser Val Thr Leu Ala Cys Gly Asn Gly Gly Lys Glu Thr Asn
1 5 10 15

Ala Leu Ile Glu Arg Val Phe Met Pro Tyr Leu Lys Glu Trp Ile Val
20 25 30

Ala Phe Asp Glu Asp Ala Pro Lys Phe Glu Ala Ser Gly Glu Tyr Cys
35 40 45

Val Ser Thr Asp Ser Phe Val Ile Thr Pro Leu Ile Phe Asn Gly Gly
50 55 60

Asp Ile Gly Lys Leu Cys Val Cys Gly Ser Ala Asn Asp Val Ser Val
65 70 75 80

Gln Gly Gly Glu Pro Leu Tyr Leu Asn Met Gly Phe Ile Leu Glu Glu

CAGCAGTATC CCTATCGGTC AAGCCTTAAT GGCGTATTTC AACCCTACAA TCATCAAAAA 60
AGGATAAAA ATG GAT AGC GTA ACT CTA GCA TGC GGG AAC GGA GGG AAA GAA 111
Met Asp Ser Val Thr Leu Ala Cys Gly Asn Gly Gly Lys Glu
1 5 10

ACA AAC GCT TTG ATT GAG CGA GTC TTT ATG CCC TAT TTA AAA GAA TGG 159
Thr Asn Ala Leu Ile Glu Arg Val Phe Met Pro Tyr Leu Lys Glu Trp
15 20 25 30

ATT GTT GCA TTT GAT GAA GAC GCC CCT AAA TTT GAA GCT AGT GGG GAA 207
Ile Val Ala Phe Asp Glu Asp Ala Pro Lys Phe Glu Ala Ser Gly Glu
35 40 45

TAT TGC GTG AGC ACG GAT AGT TTT GTC ATC ACG CCC TTA ATT TTT AAT 255
Tyr Cys Val Ser Thr Asp Ser Phe Val Ile Thr Pro Leu Ile Phe Asn
50 55 60

GGG GGC GAT ATA GGC AAG CTT TGC GTT TGC GGG AGT GCG AAT GAT GTG 303
Gly Gly Asp Ile Gly Lys Leu Cys Val Cys Gly Ser Ala Asn Asp Val
65 70 75

AGC GTG CAA GGG GGC GAA CCT TTG TAT TTG AAT ATG GGT TTT ATT TTA 351
Ser Val Gln Gly Gly Glu Pro Leu Tyr Leu Asn Met Gly Phe Ile Leu
80 85 90

GAA GAA GGC TTA GAA ATT TCT CTT TTA AAA CAA ATT TTA CAA TCC ATA 399
Glu Glu Gly Leu Glu Ile Ser Leu Leu Lys Gln Ile Leu Gln Ser Ile
95 100 105 110

CAA AAA GAA TTG TTT AAA GCC AAC CTG AAA CTC CTC TCC CTA GAC ACT 447
Gln Lys Glu Leu Phe Lys Ala Asn Leu Lys Leu Leu Ser Leu Asp Thr
115 120 125

AAA GTC GTG CCA AAG GGG AGC GTG GAT AAG CTT TTT ATC AAC ACA ACC 495
Lys Val Val Pro Lys Gly Ser Val Asp Lys Leu Phe Ile Asn Thr Thr
130 135 140

TGC ATT GGT AAA ATC ATC AAG CCA GGG ATT TCT TCG TAC CAT TTA CAA 543
Cys Ile Gly Lys Ile Ile Lys Pro Gly Ile Ser Ser Tyr His Leu Gln
145 150 155

CAA GGG CAA GCC ATT ATC CTA AGC GAC ACT ATC GCC AAT CAT GGG GCA 591
Gln Gly Gln Ala Ile Ile Leu Ser Asp Thr Ile Ala Asn His Gly Ala
160 165 170

AGC TTA TTT GCG ATG CGT AAT GAA ATC AAG CTT AAA ACG AAT CTA GAA 639
Ser Leu Phe Ala Met Arg Asn Glu Ile Lys Leu Lys Thr Asn Leu Glu
175 180 185 190

AGC GAT TGC CAA CTG CTC TAT CCC TTA TTA AAA CCC CTA TTT TTA AGC 687
Ser Asp Cys Gln Leu Leu Tyr Pro Leu Leu Lys Pro Leu Phe Leu Ser
195 200 205

GAT CTC AAA ATT GAT GCT TTA AGA GAT GCG ACT AGG GGC GGG TTA GCG 735
Asp Leu Lys Ile Asp Ala Leu Arg Asp Ala Thr Arg Gly Gly Leu Ala
100 105 110

Leu Arg Ile Leu Glu Ala Met Arg Ile Leu Gly Leu Glu Lys Lys Thr
115 120 125

Arg Phe Tyr Gln Ala Ser Thr Ser Glu Leu Tyr Gly Glu Val Leu Glu
130 135 140

Thr Pro Gln Asn Glu Asn Thr Pro Phe Asn Pro Arg Ser Pro Tyr Ala
145 150 155 160

Val Ala Lys Met Tyr Ala Phe Tyr Ile Thr Lys Asn Tyr Arg Glu Ala
165 170 175

Tyr Asn Leu Phe Ala Val Asn Gly Ile Leu Phe Asn His Glu Ser Arg
180 185 190

Val Arg Gly Glu Thr Phe Val Thr Arg Lys Ile Thr Arg Ala Ala Ser
195 200 205

Ala Ile Ala Tyr Asn Leu Thr Asp Cys Leu Tyr Leu Gly Asn Leu Asp
210 215 220

Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val Lys Met Met His
225 230 235 240

Leu Met Leu Gln Ala Pro Ile Pro Gln Asp Tyr Val Ile Ala Thr Gly
245 250 255

Lys Thr Thr Ser Val Arg Asp Phe Val Lys Met Ser Phe Glu Phe Ile
260 265 270

Gly Ile Asn Leu Glu Phe Gln Asn Thr Gly Ile Lys Glu Ile Gly Leu
275 280 285

Ile Lys Ser Val Asp Glu Lys Arg Ala Asn Ala Leu Lys Leu Asn Leu
290 295 300

Ser His Leu Lys Lys Gly Gln Ile Val Val Arg Ile Asp Glu Arg Tyr
305 310 315 320

Phe Arg Pro Thr Glu Val Asp Leu Leu Leu Gly Asp Pro Thr Lys Ala
325 330 335

Glu Lys Glu Leu Asp Trp Val Arg Glu Tyr Asp Leu Lys Glu Leu Val
340 345 350

Lys Asp Met Leu Glu Tyr Asp Leu Lys Glu Cys Gln Lys Asn Leu Tyr
355 360 365

Leu Gln Asp Gly Gly Tyr Ile Leu Arg Asn Phe Tyr Glu
370 375 380



(2) INFORMATION FOR SEQ ID NO:1173:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1116 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 70...1065
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1173:
GGG ATT AAA GAA ATC GGT TTG ATT AAA AGC GTT GAT GAA AAA AGA GCG 917
Gly Ile Lys Glu Ile Gly Leu Ile Lys Ser Val Asp Glu Lys Arg Ala
285 290 295

AAC GCT TTA AAA TTG AAC TTA AGC CAT TTA AAA AAA GGC CAA ATC GTG 965
Asn Ala Leu Lys Leu Asn Leu Ser His Leu Lys Lys Gly Gln Ile Val
300 305 310

GTG CGC ATA GAC GAG CGT TAT TTC AGG CCT ACC GAA GTG GAT TTG CTT 1013
Val Arg Ile Asp Glu Arg Tyr Phe Arg Pro Thr Glu Val Asp Leu Leu
315 320 325

TTA GGC GAT CCC ACT AAG GCA GAG AAA GAG CTA GAC TGG GTT AGG GAA 1061
Leu Gly Asp Pro Thr Lys Ala Glu Lys Glu Leu Asp Trp Val Arg Glu
330 335 340 345

TAC GAT TTA AAA GAG TTG GTT AAG GAC ATG TTA GAA TAC GAT TTA AAA 1109
Tyr Asp Leu Lys Glu Leu Val Lys Asp Met Leu Glu Tyr Asp Leu Lys
350 355 360

GAA TGC CAA AAA AAC CTT TAC TTG CAA GAT GGG GGT TAT ATT TTA AGG 1157
Glu Cys Gln Lys Asn Leu Tyr Leu Gln Asp Gly Gly Tyr Ile Leu Arg
365 370 375

AAT TTT TAT GAA TGAGATTATT TTAATCACTG GTGCCTATGG CATGGTGGGG CAGAA 1214
Asn Phe Tyr Glu
380

CACGGCGTTG TATTT 1229

(2) INFORMATION FOR SEQ ID NO:1172:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 381 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1172:

Met Lys Glu Lys Ile Ala Leu Ile Thr Gly Val Thr Gly Gln Asp Gly
1 5 10 15

Ser Tyr Leu Ala Glu Tyr Leu Leu Asn Leu Gly Tyr Glu Val His Gly
20 25 30

Leu Lys Arg Arg Ser Ser Ser Ile Asn Thr Ser Arg Ile Asp His Leu
35 40 45

Tyr Glu Asp Leu His Ser Asp His Lys Arg Arg Phe Phe Leu His Tyr
50 55 60

Gly Asp Met Thr Asp Ser Ser Asn Leu Ile His Leu Ile Ala Thr Thr
65 70 75 80

Lys Pro Thr Glu Ile Tyr Asn Leu Ala Ala Gln Ser His Val Lys Val
85 90 95

Ser Phe Glu Thr Pro Glu Tyr Thr Ala Asn Ala Asp Gly Ile Gly Thr
AGG CGT TTT TTC TTA CAC TAT GGG GAT ATG ACC GAT AGC TCT AAT CTT 245
Arg Arg Phe Phe Leu His Tyr Gly Asp Met Thr Asp Ser Ser Asn Leu
60 65 70

ATC CAT TTA ATC GCT ACC ACT AAG CCT ACA GAG ATT TAT AAT TTA GCC 293
Ile His Leu Ile Ala Thr Thr Lys Pro Thr Glu Ile Tyr Asn Leu Ala
75 80 85

GCT CAA AGC CAT GTA AAA GTC TCT TTT GAA ACC CCC GAA TAC ACC GCT 341
Ala Gln Ser His Val Lys Val Ser Phe Glu Thr Pro Glu Tyr Thr Ala
90 95 100 105

AAC GCT GAT GGT ATT GGC ACG CTA AGG ATT TTA GAA GCC ATG CGG ATT 389
Asn Ala Asp Gly Ile Gly Thr Leu Arg Ile Leu Glu Ala Met Arg Ile
110 115 120

TTA GGA TTA GAA AAG AAA ACG CGC TTT TAT CAA GCC AGC ACG AGC GAA 437
Leu Gly Leu Glu Lys Lys Thr Arg Phe Tyr Gln Ala Ser Thr Ser Glu
125 130 135

TTG TAT GGC GAA GTC TTA GAA ACC CCG CAA AAT GAA AAC ACC CCC TTT 485
Leu Tyr Gly Glu Val Leu Glu Thr Pro Gln Asn Glu Asn Thr Pro Phe
140 145 150

AAC CCA CGA AGC CCC TAT GCG GTC GCT AAA ATG TAT GCC TTT TAC ATC 533
Asn Pro Arg Ser Pro Tyr Ala Val Ala Lys Met Tyr Ala Phe Tyr Ile
155 160 165

ACC AAA AAT TAC AGA GAG GCC TAT AAC TTG TTT GCG GTT AAT GGC ATT 581
Thr Lys Asn Tyr Arg Glu Ala Tyr Asn Leu Phe Ala Val Asn Gly Ile
170 175 180 185

CTT TTT AAC CAT GAG AGC AGG GTA AGG GGC GAA ACT TTT GTA ACC CGT 629
Leu Phe Asn His Glu Ser Arg Val Arg Gly Glu Thr Phe Val Thr Arg
190 195 200

AAA ATC ACA CGA GCC GCT AGC GCG ATA GCG TAT AAC TTA ACG GAT TGC 677
Lys Ile Thr Arg Ala Ala Ser Ala Ile Ala Tyr Asn Leu Thr Asp Cys
205 210 215

TTG TAT TTA GGG AAT TTA GAC GCT AAA AGA GAC TGG GGG CAT GCC AAA 725
Leu Tyr Leu Gly Asn Leu Asp Ala Lys Arg Asp Trp Gly His Ala Lys
220 225 230

GAT TAC GTG AAA ATG ATG CAT TTA ATG CTC CAA GCG CCC ATC CCA CAA 773
Asp Tyr Val Lys Met Met His Leu Met Leu Gln Ala Pro Ile Pro Gln
235 240 245

GAT TAT GTG ATC GCC ACA GGA AAG ACC ACA AGC GTG CGC GAT TTT GTG 821
Asp Tyr Val Ile Ala Thr Gly Lys Thr Thr Ser Val Arg Asp Phe Val
250 255 260 265

AAA ATG AGC TTT GAA TTT ATC GGT ATC AAT TTA GAA TTT CAA AAT ACA 869
Lys Met Ser Phe Glu Phe Ile Gly Ile Asn Leu Glu Phe Gln Asn Thr
270 275 280
Asn Asn Asn Lys Met Gly Glu Tyr Arg Leu Asp Ile Val Trp Ser Arg
85 90 95

Ile Ile Thr Pro His Gly Ile Asn Ile Met Leu Thr Asn Ala Lys Gly
100 105 110

Ala Asp Ile Lys Gly Tyr Asn Gly Leu Val Gly Glu Leu Ile Glu Arg
115 120 125

Asn Phe Gln Arg Tyr Gly Val Pro Leu Leu Leu Ser Thr Leu Thr Asn
130 135 140

Gly Leu Leu Ile Gly Ile Thr Ser Ala Leu Asn Asn Arg Gly Asn Lys
145 150 155 160

Glu Gly Ala Thr Asn Phe Phe Gly Asp Tyr Leu Leu Met Gln Leu Met
165 170 175

Arg Gln Ser Gly Met Gly Ile Asn Gln Val Val Asn Gln Ile Leu Arg
180 185 190

Asp Lys Ser Lys Ile Ala Pro Ile Val Val Ile Arg Glu Gly Ser Arg
195 200 205

Val Phe Ile Ser Pro Asn Thr Asp Ile Phe Phe Pro Ile Pro Arg Glu
210 215 220

Asn Glu Val Ile Ala Glu Phe Leu Lys
225 230



(2) INFORMATION FOR SEQ ID NO:1171:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1229 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 27...1169
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1171:

AAATAAAATT CAATAAAGGA AAAATA ATG AAA GAA AAA ATC GCT TTA ATC ACC 53
Met Lys Glu Lys Ile Ala Leu Ile Thr
1 5

GGG GTT ACC GGG CAA GAC GGG AGC TAT CTG GCT GAA TAC TTG CTG AAT 101
Gly Val Thr Gly Gln Asp Gly Ser Tyr Leu Ala Glu Tyr Leu Leu Asn
10 15 20 25

TTG GGT TAT GAA GTG CAT GGG TTA AAA AGG CGC TCT TCT AGC ATC AAC 149
Leu Gly Tyr Glu Val His Gly Leu Lys Arg Arg Ser Ser Ser Ile Asn
30 35 40

ACT TCT AGG ATC GAT CAT CTG TAT GAA GAT TTG CAT AGC GAT CAT AAA 197
Thr Ser Arg Ile Asp His Leu Tyr Glu Asp Leu His Ser Asp His Lys
45 50 55
AAA GGC TAT AAC GGC TTG GTG GGG GAA TTG ATT GAA AGG AAT TTC CAG 442
Lys Gly Tyr Asn Gly Leu Val Gly Glu Leu Ile Glu Arg Asn Phe Gln
120 125 130

CGC TAT GGC GTG CCG TTA CTG CTT TCT ACT CTC ACT AAC GGC CTA TTG 490
Arg Tyr Gly Val Pro Leu Leu Leu Ser Thr Leu Thr Asn Gly Leu Leu
135 140 145

ATT GGG ATC ACT TCG GCT TTA AAC AAC AGA GGC AAT AAA GAA GGA GCC 538
Ile Gly Ile Thr Ser Ala Leu Asn Asn Arg Gly Asn Lys Glu Gly Ala
150 155 160

ACC AAT TTC TTT GGG GAT TAT CTT TTA ATG CAA TTG ATG AGG CAA AGC 586
Thr Asn Phe Phe Gly Asp Tyr Leu Leu Met Gln Leu Met Arg Gln Ser
165 170 175

GGC ATG GGG ATC AAT CAA GTA GTC AAT CAA ATT TTA AGA GAT AAG AGC 634
Gly Met Gly Ile Asn Gln Val Val Asn Gln Ile Leu Arg Asp Lys Ser
180 185 190 195

AAA ATC GCT CCT ATT GTG GTG ATT AGA GAA GGG AGT AGG GTC TTC ATT 682
Lys Ile Ala Pro Ile Val Val Ile Arg Glu Gly Ser Arg Val Phe Ile
200 205 210

TCG CCC AAT ACT GAC ATC TTT TTC CCT ATA CCC AGA GAG AAT GAA GTC 730
Ser Pro Asn Thr Asp Ile Phe Phe Pro Ile Pro Arg Glu Asn Glu Val
215 220 225

ATC GCT GAG TTT TTG AAG TGACTCAAAA ATCCCCAATT AAAAACGCT 777
Ile Ala Glu Phe Leu Lys
230
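
The amino acid line printed under each row of codons can be re-derived from the codons themselves. The following is a minimal sketch in Python, assuming the standard genetic code applies to these coding sequences; `CODON_TABLE`, `THREE_LETTER`, and `translate` are hypothetical helper names introduced here for illustration and are not part of the listing or of any tool used to produce it.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard code is the appropriate table here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names; "Ter" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(codons: str) -> str:
    """Translate whitespace-separated codons into three-letter residues."""
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codons.upper().split())


# First ten codons of the coding sequence of SEQ ID NO:1169.
print(translate("ATG GAT AAC CCT AAA GGC ATT GAT GGT TTT"))
```

A row whose codons translate to something other than the printed residues, or that contains an in-frame stop (`Ter`) before the end of the coding region, is a candidate transcription error worth checking against the original filing.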



(2) INFORMATION FOR SEQ ID NO:1170:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 233 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1170:



Met Asp Asn Pro Lys Gly Ile Asp Gly Phe Thr Asn Leu Lys Glu Lys
1 5 10 15

Asp Ile Ala Thr Asn Glu Asn Lys Leu Leu Arg Thr Ile Thr Ala Asp
20 25 30

Lys Met Ile Pro Ala Phe Leu Ile Thr Pro Ile Ser Ser Gln Ile Ala
35 40 45

Gly Lys Val Ile Ala Gln Val Glu Ser Asp Ile Phe Ala His Met Gly
50 55 60

Lys Ala Val Leu Ile Pro Lys Gly Ser Lys Val Ile Gly Tyr Tyr Ser
65 70 75 80
325 330 335

Gln Lys Lys Val Lys Lys Asp Thr Ile Asn Thr Lys Asn Glu Ile
340 345 350
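
Each nucleotide entry states its coding region as a LOCATION range, and the paired protein entry states a LENGTH in amino acids, so the two can be cross-checked. The sketch below is in Python and uses a hypothetical helper, `cds_residue_count`, that is not part of the listing. It assumes the LOCATION range is 1-based, inclusive, and excludes the stop codon, which is what the values printed in these entries suggest.

```python
def cds_residue_count(start: int, end: int) -> int:
    """Return the number of residues encoded by a coding region.

    Assumes `start` and `end` are 1-based inclusive nucleotide positions
    and that the range does not include the stop codon.
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {start}...{end} is not a whole number of codons")
    return length // 3


# LOCATION values as printed in the nucleotide entries, keyed by SEQ ID NO.
locations = {1163: (49, 1827), 1165: (13, 1014), 1167: (52, 1104),
             1169: (50, 748), 1171: (27, 1169)}
for seq_id, (start, end) in locations.items():
    print(seq_id, cds_residue_count(start, end))
```

For example, LOCATION 52...1104 in SEQ ID NO:1167 corresponds to 351 residues, which agrees with the LENGTH given for SEQ ID NO:1168.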

(2) INFORMATION FOR SEQ ID NO:1169:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 777 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 50...748
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1169:

CCCTTTCAAA CAAGGCCCTA AAAATTACGA AGAAAACCTG ATTTTCCCC ATG GAT AAC 58
Met Asp Asn
1

CCT AAA GGC ATT GAT GGT TTT ACT AAC CTT AAA GAA AAA GAC ATC GCC 106
Pro Lys Gly Ile Asp Gly Phe Thr Asn Leu Lys Glu Lys Asp Ile Ala
5 10 15

ACT AAT GAA AAT AAG CTT TTA CGC ACC ATT ACA GCG GAT AAA ATG ATA 154
Thr Asn Glu Asn Lys Leu Leu Arg Thr Ile Thr Ala Asp Lys Met Ile
20 25 30 35

CCC GCC TTT CTC ATC ACG CCT ATT TCT AGC CAG ATC GCT GGT AAA GTC 202
Pro Ala Phe Leu Ile Thr Pro Ile Ser Ser Gln Ile Ala Gly Lys Val
40 45 50

ATC GCG CAG GTG GAG AGC GAT ATT TTT GCT CAC ATG GGC AAG GCC GTC 250
Ile Ala Gln Val Glu Ser Asp Ile Phe Ala His Met Gly Lys Ala Val
55 60 65

TTA ATC CCC AAA GGC TCT AAA GTC ATA GGT TAT TAC AGC AAC AAT AAC 298
Leu Ile Pro Lys Gly Ser Lys Val Ile Gly Tyr Tyr Ser Asn Asn Asn
70 75 80

AAA ATG GGC GAA TAC CGC TTG GAT ATT GTA TGG AGC CGC ATC ATC ACT 346
Lys Met Gly Glu Tyr Arg Leu Asp Ile Val Trp Ser Arg Ile Ile Thr
85 90 95

CCC CAT GGC ATC AAT ATC ATG CTC ACT AAC GCT AAA GGG GCG GAC ATT 394
Pro His Gly Ile Asn Ile Met Leu Thr Asn Ala Lys Gly Ala Asp Ile
100 105 110 115
AATTTAATGA GAATTTT 1133

(2) INFORMATION FOR SEQ ID NO:1168:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 351 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1168:

Met Lys Asn Asp Ala Tyr Glu Ile Ile Leu Ser Trp Phe Ile Thr Pro
1 5 10 15

Leu Thr Ala Ile Leu Gly Arg Phe Ala Glu Phe Phe Leu Tyr Thr Leu
20 25 30

His Ala Gln Leu Val Phe Asn Ser Val Val Ala Leu Ala Phe Met Leu
35 40 45

Phe Ala Tyr Arg Ser Leu Lys Glu Gln Asn Phe Phe Ser Ala Ser Ala
50 55 60

Leu Thr Glu Ala Leu Leu Phe Val Gly Phe Phe Ala Leu Phe Asn Tyr
65 70 75 80

Ala Leu Lys Asn Pro Met His Phe Tyr Glu Phe Phe Gln Asn Ala Ile
85 90 95

Phe Ile Ala Pro Asn Met Ile Ala Gln Ser Leu Ser Gln Ser Leu Ser
100 105 110

Asn Phe Ser Asp His Ala Leu Ser Leu Asp Phe Ile Phe Asn His Gly
115 120 125

Phe Tyr Ala Leu Ser Phe Ile Ser Asp Leu Ser His Asn Glu Met Ser
130 135 140

Val Trp Leu Phe Leu Ser Val Leu Gln Gly Leu Phe Leu Ser Val Leu
145 150 155 160

Phe Ala Ile Ile Ile Leu Val Tyr Leu Glu Val His Val Trp Cys Ser
165 170 175

Leu Gly Val Leu Phe Leu Ala Phe Gly Phe Phe Lys Thr Trp Arg Ser
180 185 190

Val Val Val Ile Cys Leu Lys Lys Cys Phe Ala Leu Gly Phe Tyr Lys
195 200 205

Pro Phe Leu Leu Leu Val Gly Phe Leu Asn Val Ser Val Thr Lys Ala
210 215 220

Leu Ile Asp Ala His Met Gln Glu Lys Gln Asp Leu Ser Leu Leu Leu
225 230 235 240

Val Val Ala Leu Phe Leu Cys Cys Val Phe Ile Ile Gly Val Pro Phe
245 250 255

Phe Ile Asn Ala Leu Phe Arg Val Gln Asn Ser Leu Lys Glu Thr Tyr
260 265 270

Lys Leu Ala Thr Asn Leu Ser Ala Asn Leu Ser Gln Asn Ala Leu Asn
275 280 285

Ser Leu Gln Tyr Ile Thr Thr Pro Pro Ala Ser Ser Ser Val Ser Ser
290 295 300

Ser Met Ser Glu Ser Val Ser Lys Glu Lys Glu Thr His Ser Pro Thr
305 310 315 320

Phe Lys Val Glu Thr Thr Gln Leu Asp Val Lys Ile Pro Asn Phe Lys
GCC CTT AGT TTC ATC AGC GAT TTG AGC CAT AAT GAA ATG TCT GTG TGG 489
Ala Leu Ser Phe Ile Ser Asp Leu Ser His Asn Glu Met Ser Val Trp
135 140 145

CTT TTT TTA AGC GTT TTG CAA GGG CTT TTT TTG AGC GTG CTG TTT GCA 537
Leu Phe Leu Ser Val Leu Gln Gly Leu Phe Leu Ser Val Leu Phe Ala
150 155 160

ATC ATC ATT TTA GTG TAT TTA GAA GTG CAT GTG TGG TGC TCT TTA GGG 585
Ile Ile Ile Leu Val Tyr Leu Glu Val His Val Trp Cys Ser Leu Gly
165 170 175

GTG CTG TTT TTA GCG TTT GGG TTT TTT AAA ACC TGG AGG AGC GTT GTG 633
Val Leu Phe Leu Ala Phe Gly Phe Phe Lys Thr Trp Arg Ser Val Val
180 185 190

GTT ATA TGC CTA AAA AAG TGC TTC GCT CTT GGG TTT TAC AAG CCT TTT 681
Val Ile Cys Leu Lys Lys Cys Phe Ala Leu Gly Phe Tyr Lys Pro Phe
195 200 205 210

TTG TTG TTG GTA GGG TTT TTG AAT GTG TCG GTT ACT AAG GCT TTA ATA 729
Leu Leu Leu Val Gly Phe Leu Asn Val Ser Val Thr Lys Ala Leu Ile
215 220 225

GAC GCT CAT ATG CAA GAA AAA CAA GAC TTA AGC CTT TTA TTG GTG GTA 777
Asp Ala His Met Gln Glu Lys Gln Asp Leu Ser Leu Leu Leu Val Val
230 235 240

GCG TTA TTT TTG TGT TGC GTT TTT ATC ATC GGC GTG CCT TTT TTC ATC 825
Ala Leu Phe Leu Cys Cys Val Phe Ile Ile Gly Val Pro Phe Phe Ile
245 250 255

AAC GCT TTG TTT AGG GTG CAA AAC AGC CTT AAA GAA ACT TAC AAA CTC 873
Asn Ala Leu Phe Arg Val Gln Asn Ser Leu Lys Glu Thr Tyr Lys Leu
260 265 270

GCC ACC AAT TTG AGT GCC AAC CTC AGC CAA AAC GCC CTT AAT TCC TTA 921
Ala Thr Asn Leu Ser Ala Asn Leu Ser Gln Asn Ala Leu Asn Ser Leu
275 280 285 290

CAA TAC ATC ACG ACC CCA CCC GCT TCT TCT AGC GTT TCT TCT TCT ATG 969
Gln Tyr Ile Thr Thr Pro Pro Ala Ser Ser Ser Val Ser Ser Ser Met
295 300 305

AGT GAA AGC GTC TCT AAA GAA AAA GAA ACG CAT TCC CCC ACA TTT AAG 1017
Ser Glu Ser Val Ser Lys Glu Lys Glu Thr His Ser Pro Thr Phe Lys
310 315 320

GTA GAA ACC ACT CAA TTA GAT GTA AAA ATC CCA AAT TTC AAG CAA AAA 1065
Val Glu Thr Thr Gln Leu Asp Val Lys Ile Pro Asn Phe Lys Gln Lys
325 330 335

AAG GTT AAA AAG GAT ACA ATA AAT ACA AAA AAT GAA ATT TAAATAAATA GG 1116
Lys Val Lys Lys Asp Thr Ile Asn Thr Lys Asn Glu Ile
340 345 350
(2) INFORMATION FOR SEQ ID NO:1167:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1133 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 52...1104
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1167:

TAGTCTTGCG TAAAGGCTTT GAATTTTACA TTAAAGTCCG TTGAAGTATT G ATG AAA 57
Met Lys
1

AAT GAC GCT TAT GAA ATT ATT CTT TCT TGG TTT ATC ACG CCT CTC ACG 105
Asn Asp Ala Tyr Glu Ile Ile Leu Ser Trp Phe Ile Thr Pro Leu Thr
5 10 15

GCG ATT TTA GGG CGT TTC GCT GAA TTT TTT CTC TAC ACT TTG CAT GCG 153
Ala Ile Leu Gly Arg Phe Ala Glu Phe Phe Leu Tyr Thr Leu His Ala
20 25 30

CAA TTG GTG TTT AAT AGC GTG GTC GCT TTG GCG TTC ATG CTC TTT GCT 201
Gln Leu Val Phe Asn Ser Val Val Ala Leu Ala Phe Met Leu Phe Ala
35 40 45 50

TAT AGG AGT TTG AAA GAA CAG AAT TTC TTC AGC GCT AGC GCG CTA ACA 249
Tyr Arg Ser Leu Lys Glu Gln Asn Phe Phe Ser Ala Ser Ala Leu Thr
55 60 65

GAA GCG TTA TTG TTT GTG GGG TTT TTT GCA CTT TTC AAC TAC GCT TTA 297
Glu Ala Leu Leu Phe Val Gly Phe Phe Ala Leu Phe Asn Tyr Ala Leu
70 75 80

AAA AAT CCC ATG CAT TTT TAT GAA TTT TTC CAA AAC GCT ATT TTT ATT 345
Lys Asn Pro Met His Phe Tyr Glu Phe Phe Gln Asn Ala Ile Phe Ile
85 90 95

GCG CCT AAC ATG ATC GCG CAA AGC CTC TCT CAA AGC TTG AGT AAC TTT 393
Ala Pro Asn Met Ile Ala Gln Ser Leu Ser Gln Ser Leu Ser Asn Phe
100 105 110

TCT GAC CAT GCG CTT TCT TTA GAT TTT ATC TTT AAT CAT GGT TTT TAT 441
Ser Asp His Ala Leu Ser Leu Asp Phe Ile Phe Asn His Gly Phe Tyr
115 120 125 130
(2) INFORMATION FOR SEQ ID NO:1166:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 334 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1166:



Met Phe His Lys Ala Leu Ile Thr Phe Ile Val Leu Trp Phe Phe Leu
1 5 10 15

Asn Gly Leu Gly Ala Tyr Asp Phe Lys His Cys Gln Ala Phe Phe Lys
20 25 30

Lys Ala Ser Leu Gln Lys Gly Gly Val Ala Leu Lys Glu Leu Pro Lys
35 40 45

Gly Val Tyr Leu Tyr Tyr Ser Lys Thr Tyr Pro Lys His Ala Lys Val
50 55 60

Ile Lys Ser Asp Pro Phe Val Gly Leu Tyr Leu Leu Gln Ser Ala Pro
65 70 75 80

Ser Glu Tyr Val Tyr Thr Leu Arg Asp Leu Asp Lys Asp Ala Leu Ile
85 90 95

Arg Pro Met Ala Ser Ile Gly Asp Lys Glu Ala Leu Glu Thr Arg Leu
100 105 110

Leu Val Gly Gln Arg Gly Tyr Glu Arg Tyr Ala Gln Ile Ser Gln Lys
115 120 125

Thr Gln Lys Asn Gly Val Ile Ser Asn Ile Cys Tyr Gln Met Leu Gly
130 135 140

Leu Gly Val Gly Gly Asn Gly Phe Ile Glu Thr Lys Phe Ile Lys Arg
145 150 155 160

Phe Leu Asn Gln Gln Glu Pro Tyr Tyr Gly Asp Ile Gly Val Arg Leu
165 170 175

Glu Glu His His Lys Arg Leu Val Val Val Gln Phe Asp Pro Phe Phe
180 185 190

Pro Lys Asn Pro Phe Leu Lys Asn Asp Glu Ile Leu Ala Ile Asn His
195 200 205

Gln Lys Ile His Ser Leu Ala Glu Phe Glu Trp Val Val Ser Asn Leu
210 215 220

Lys Tyr Gln Ser Leu Ala Lys Val Glu Ile Lys Arg Asn His Lys Val
225 230 235 240

Lys Glu Val Thr Leu Lys Val Asn Lys Arg Tyr Gly Gly Phe Leu Leu
245 250 255

Lys Asp Thr Phe Leu Glu Arg Tyr Gly Ile Ala Leu Asp Glu Arg Phe
260 265 270

Ile Ile Thr Lys Ile Gly Ala His Leu Pro Lys Gly Leu Asp Phe Leu
275 280 285

Lys Leu Gly Asp Arg Ile Leu Trp Val Asn Tyr Lys Ser Val Ala Ser
290 295 300

Asn Pro Lys Ala Leu Arg Glu Ala Leu Ser Ala Pro Lys Ile Glu Leu
305 310 315 320

Leu Val Leu Arg Lys Gly Phe Glu Phe Tyr Ile Lys Val Arg
325 330
TCG CAA AAG ACT CAA AAA AAT GGC GTT ATC AGC AAT ATT TGC TAT CAA 435
Ser Gln Lys Thr Gln Lys Asn Gly Val Ile Ser Asn Ile Cys Tyr Gln
130 135 140

ATG TTA GGG CTA GGG GTA GGG GGG AAT GGC TTT ATA GAA ACG AAA TTT 483
Met Leu Gly Leu Gly Val Gly Gly Asn Gly Phe Ile Glu Thr Lys Phe
145 150 155

ATC AAG CGC TTT TTA AAC CAG CAA GAG CCT TAT TAT GGG GAT ATT GGG 531
Ile Lys Arg Phe Leu Asn Gln Gln Glu Pro Tyr Tyr Gly Asp Ile Gly
160 165 170

GTG CGT TTA GAA GAA CAT CAT AAG CGT TTA GTG GTA GTG CAA TTT GAT 579
Val Arg Leu Glu Glu His His Lys Arg Leu Val Val Val Gln Phe Asp
175 180 185

CCA TTT TTC CCT AAA AAC CCT TTT TTA AAA AAC GAT GAA ATC CTA GCG 627
Pro Phe Phe Pro Lys Asn Pro Phe Leu Lys Asn Asp Glu Ile Leu Ala
190 195 200 205

ATC AAC CAT CAA AAG ATC CAC TCA TTA GCG GAG TTT GAA TGG GTG GTG 675
Ile Asn His Gln Lys Ile His Ser Leu Ala Glu Phe Glu Trp Val Val
210 215 220

AGC AAT CTT AAA TAC CAA AGC CTT GCA AAA GTG GAA ATC AAA CGA AAC 723
Ser Asn Leu Lys Tyr Gln Ser Leu Ala Lys Val Glu Ile Lys Arg Asn
225 230 235

CAT AAA GTC AAA GAA GTA ACG CTC AAA GTC AAT AAG CGT TAT GGG GGG 771
His Lys Val Lys Glu Val Thr Leu Lys Val Asn Lys Arg Tyr Gly Gly
240 245 250

TTT TTA CTC AAA GAC ACT TTT TTA GAG CGC TAT GGC ATC GCT TTA GAT 819
Phe Leu Leu Lys Asp Thr Phe Leu Glu Arg Tyr Gly Ile Ala Leu Asp
255 260 265

GAG CGT TTT ATT ATC ACT AAA ATA GGC GCT CAT TTG CCC AAA GGC TTG 867
Glu Arg Phe Ile Ile Thr Lys Ile Gly Ala His Leu Pro Lys Gly Leu
270 275 280 285

GAT TTT TTA AAG CTT GGG GAT AGG ATT TTA TGG GTG AAT TAT AAA AGC 915
Asp Phe Leu Lys Leu Gly Asp Arg Ile Leu Trp Val Asn Tyr Lys Ser
290 295 300

GTG GCG TCC AAC CCA AAG GCT TTA AGA GAA GCG TTA AGC GCG CCT AAA 963
Val Ala Ser Asn Pro Lys Ala Leu Arg Glu Ala Leu Ser Ala Pro Lys
305 310 315

ATT GAA TTA TTA GTC TTG CGT AAA GGC TTT GAA TTT TAC ATT AAA GTC 1011
Ile Glu Leu Leu Val Leu Arg Lys Gly Phe Glu Phe Tyr Ile Lys Val
320 325 330

CGT TGAAGTATTG ATGAAAAATG ACGCTTATGA AATTATTCTT TCTTGGTTT 1063
Arg
Lys Asn Val Leu Thr Ser Gln Ile Phe Asp Gly Ala Tyr His Phe Lys
580 585 590

Ser



(2) INFORMATION FOR SEQ ID NO:1165:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1063 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...1014
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1165:

TGCTAAATTG TG ATG TTT CAC AAA GCC CTT ATT ACC TTT ATC GTT CTA TGG 51
Met Phe His Lys Ala Leu Ile Thr Phe Ile Val Leu Trp
1 5 10

TTT TTT TTG AAT GGC TTA GGG GCT TAT GAT TTC AAG CAT TGT CAA GCG 99
Phe Phe Leu Asn Gly Leu Gly Ala Tyr Asp Phe Lys His Cys Gln Ala
15 20 25

TTT TTT AAA AAA GCG AGC CTT CAA AAA GGA GGC GTG GCT TTA AAA GAA 147
Phe Phe Lys Lys Ala Ser Leu Gln Lys Gly Gly Val Ala Leu Lys Glu
30 35 40 45

TTG CCT AAA GGC GTG TAT TTG TAT TAT TCC AAA ACC TAT CCC AAA CAC 195
Leu Pro Lys Gly Val Tyr Leu Tyr Tyr Ser Lys Thr Tyr Pro Lys His
50 55 60

GCC AAA GTC ATC AAA TCC GAT CCC TTT GTA GGG TTG TAT TTG TTG CAA 243
Ala Lys Val Ile Lys Ser Asp Pro Phe Val Gly Leu Tyr Leu Leu Gln
65 70 75

AGC GCA CCA AGC GAG TAT GTT TAT ACC TTA AGG GAT TTA GAC AAA GAC 291
Ser Ala Pro Ser Glu Tyr Val Tyr Thr Leu Arg Asp Leu Asp Lys Asp
80 85 90

GCC CTT ATA AGG CCA ATG GCT AGC ATA GGG GAT AAA GAA GCC CTA GAA 339
Ala Leu Ile Arg Pro Met Ala Ser Ile Gly Asp Lys Glu Ala Leu Glu
95 100 105

ACG CGA TTA TTG GTG GGG CAA AGA GGC TAT GAG CGC TAC GCT CAA ATT 387
Thr Arg Leu Leu Val Gly Gln Arg Gly Tyr Glu Arg Tyr Ala Gln Ile
110 115 120 125
130 135 140

Ser Val Asp Leu Asp Pro Tyr Asp Pro Ile Lys Val Val Asp Phe Phe
145 150 155 160

His Pro Gln Pro Glu Asn Gly Lys Glu Thr Thr Lys Phe Lys Asn Tyr
165 170 175

Lys Asp Arg Ile Glu Asn Ile Tyr Ala Thr Leu Tyr Asn Thr Leu Gly
180 185 190

Arg Gly Tyr Val Asp Lys Phe Phe Lys Lys Glu Ala Thr Met Arg Asp
195 200 205

Phe Met Ser Ser Asp Lys Phe Val Glu Arg Tyr Arg Tyr Thr Arg Lys
210 215 220

Glu Asn Met Ala Arg Thr Gln Ala Leu Lys Asp Ile Met Asn Ile Asp
225 230 235 240

Arg Asp Phe Ile Gly Tyr Ile Glu Val Leu Gly Tyr Trp Lys Asp Asn
245 250 255

Pro Lys Asp Asn Ile Leu Pro Asp Lys Glu Val Ser Phe Phe Val Phe
260 265 270

Gln Asn Glu Pro Ser Ser Thr Phe Asp Leu Lys Asn His Leu Leu Ile
275 280 285

Trp Gly Lys Gln Phe Arg Gln Val Ala Ile Cys Tyr Gly Gly Gln Leu
290 295 300

Ile Ala Asn Lys Asn Lys Thr Tyr Arg Ile Asp Leu Ile Ser Cys Arg
305 310 315 320

Pro Asp Asn Phe Gly Glu Val Trp Ala Lys Phe Thr Gly Ile Lys Phe
325 330 335

Ser Val Pro Ser Asp Leu Pro Gln Ala Leu Thr Arg Ile Asn Asp Ser
340 345 350

Val Tyr Thr Phe Leu Ser Arg Asn Lys Glu Gly Ile Gly Leu Asn Lys
355 360 365

Leu Ala Leu Asn Lys Val Val Lys Thr Glu Leu Lys Ala Thr Cys Met
370 375 380

Pro Tyr Asp Tyr Ser Lys Leu Gly Ile Glu Thr Ile Gly Glu Asp Ile
385 390 395 400

Arg Ser Asn Ile Lys Ala Leu Gln Lys Met Ser Arg Gly Tyr Gly His
405 410 415

Pro Lys Glu Phe Phe Leu Asp Ala Met Ile Lys Lys Gln Glu Asn Ala
420 425 430

Ile Lys Arg Ile Glu Ala Arg Lys Cys Ala Val Ser Asp Asp Phe Lys
435 440 445

Gln Gly Met Lys Arg Asn Ile Lys Val Asn Asn Leu Val Lys Ala Met
450 455 460

Arg Gln Gly Lys Lys Val Ser Arg Thr Leu Ile Ala Lys Val Leu Ala
465 470 475 480

Asn Thr Ile Asp Thr Asp Ala Gly Tyr Cys Phe Ile Ser Pro Thr Asp
485 490 495

Leu Ala Thr Gln Leu Gly Asn Ile Ser Pro Arg Leu Ser Lys Ser Ile
500 505 510

Val Thr Ala Ile Glu Gln Ala Glu Gly Val Arg Leu Asn Tyr Ala Leu
515 520 525

Ile Asp Lys Ile Thr Tyr Asn Ser Leu His Asn Ile Leu Ser Phe Ile
530 535 540

Phe Asp Ile Asp Asn Pro Leu Ser Asp Gln Val Phe Glu Arg Leu Val
545 550 555 560

Ile Glu Val Pro Arg Glu Ala Leu Lys Asn Val Lys Leu Pro Gln Ile
565 570 575
CAA CTT GGC AAC ATC AGC CCT AGA CTA TCT AAA AGC ATA GTT ACC GCC 1593
Gln Leu Gly Asn Ile Ser Pro Arg Leu Ser Lys Ser Ile Val Thr Ala
500 505 510 515

ATA GAG CAA GCA GAG GGC GTG AGA CTG AAT TAT GCG TTG ATT GAG AAA 1641
Ile Glu Gln Ala Glu Gly Val Arg Leu Asn Tyr Ala Leu Ile Asp Lys
520 525 530

ATC ACC TAT AAC TCA CTC CAC AAT ATC TTA AGT TTC ATT TTT GAT ATT 1689
Ile Thr Tyr Asn Ser Leu His Asn Ile Leu Ser Phe Ile Phe Asp Ile
535 540 545

GAT AAC CCT TTA AGC GAC CAA GTG TTT GAG AGA TTA GTC ATT GAA GTC 1737
Asp Asn Pro Leu Ser Asp Gln Val Phe Glu Arg Leu Val Ile Glu Val
550 555 560

CCA AGA GAA GCA CTT AAA AAT GTG AAG TTG CCA CAA ATC AAA AAT GTA 1785
Pro Arg Glu Ala Leu Lys Asn Val Lys Leu Pro Gln Ile Lys Asn Val
565 570 575

TTG ACT TCT CAA ATC TTT GAT GGC GCT TAC CAC TTT AAA AGT TAAACCATG 1836
Leu Thr Ser Gln Ile Phe Asp Gly Ala Tyr His Phe Lys Ser
580 585 590

CTCTTTATCA GCGCAACTAA CACGAATGCC GGAAAAACCA CAT 1879

(2) INFORMATION FOR SEQ ID NO:1164:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 593 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1164:



Met Leu Leu Asn Tyr Asp Phe Leu Glu Phe Val Asp Glu Pro Lys Arg
1 5 10 15

Asn Thr Ser Leu Thr Ala Ser Ile Asp Lys Ala Leu Ala Asp Arg Lys
20 25 30

Leu Ala Arg Gln Asn Lys Pro Ser Val Arg Val Leu Gly Lys Ala Met
35 40 45

Pro Leu Ser Lys Phe Leu Asp Ala Val Gly Asp Glu Ile Ser Arg Leu
50 55 60

Lys Tyr Asp Met Ser His Lys Thr Ile Lys Gly Ser Thr Ile Glu Ser
65 70 75 80

Ser Asn Leu Ile Ser Ile Tyr Lys Lys Ile Ala Ser Gly Leu Pro Phe
85 90 95

Gly Thr Ile Ser Ala Phe Arg Pro Phe Lys Asp Ala Phe Tyr Lys Asp
100 105 110

Phe Thr Glu Lys Glu Gln Asn Ala Leu Ile Tyr Ala Tyr Lys Ser Gly
115 120 125

Ala Asp Pro Lys Asn Ala Asp Ile Ile Ala Lys Tyr Trp Leu Ser Gln
Pro Ser Ser Thr Phe Asp Leu Lys Asn His Leu Leu Ile Trp Gly Lys
280 285 290

CAA TTC AGA CAA GTA GCG ATT TGC TAT GGC GGA CAA TTG ATT GCT AAT 969
Gln Phe Arg Gln Val Ala Ile Cys Tyr Gly Gly Gln Leu Ile Ala Asn
295 300 305

AAG AAT AAG ACT TAT AGG ATA GAT TTG ATA AGT TGC AGA CCT GAT AAT 1017
Lys Asn Lys Thr Tyr Arg Ile Asp Leu Ile Ser Cys Arg Pro Asp Asn
310 315 320

TTT GGT GAG GTT TGG GCT AAA TTC ACA GGG ATT AAA TTT TCA GTT CCT 1065
Phe Gly Glu Val Trp Ala Lys Phe Thr Gly Ile Lys Phe Ser Val Pro
325 330 335

AGC GAC TTA CCA CAA GCT CTC ACA CGC ATA AAT GAC AGC GTT TAT ACT 1113
Ser Asp Leu Pro Gln Ala Leu Thr Arg Ile Asn Asp Ser Val Tyr Thr
340 345 350 355

TTT CTC TCT AGG AAT AAA GAG GGT ATC GGT CTT AAT AAA CTC GCT CTC 1161
Phe Leu Ser Arg Asn Lys Glu Gly Ile Gly Leu Asn Lys Leu Ala Leu
360 365 370

AAT AAA GTC GTT AAG ACA GAA TTA AAA GCG ACT TGT ATG CCC TAT GAT 1209
Asn Lys Val Val Lys Thr Glu Leu Lys Ala Thr Cys Met Pro Tyr Asp
375 380 385

TAC TCT AAA TTG GGT ATA GAG ACT ATT GGC GAG GAC ATT AGA AGC AAT 1257
Tyr Ser Lys Leu Gly Ile Glu Thr Ile Gly Glu Asp Ile Arg Ser Asn
390 395 400

ATT AAA GCA TTA CAG AAA ATG TCT CGT GGG TAT GGA CAC CCT AAA GAG 1305
Ile Lys Ala Leu Gln Lys Met Ser Arg Gly Tyr Gly His Pro Lys Glu
405 410 415

TTC TTT TTG GAC GCA ATG ATA AAA AAA CAG GAA AAT GCG ATT AAA CGC 1353
Phe Phe Leu Asp Ala Met Ile Lys Lys Gln Glu Asn Ala Ile Lys Arg
420 425 430 435

ATA GAA GCA CGA AAA TGT GCG GTA AGC GAT GAC TTC AAA CAA GGT ATG 1401
Ile Glu Ala Arg Lys Cys Ala Val Ser Asp Asp Phe Lys Gln Gly Met
440 445 450

AAA CGA AAC ATT AAA GTT AAT AAC CTT GTT AAA GCT ATG CGA CAA GGC 1449
Lys Arg Asn Ile Lys Val Asn Asn Leu Val Lys Ala Met Arg Gln Gly
455 460 465

AAA AAA GTG AGT AGG ACA TTG ATT GCT AAA GTG CTT GCT AAC ACC ATA 1497
Lys Lys Val Ser Arg Thr Leu Ile Ala Lys Val Leu Ala Asn Thr Ile
470 475 480

GAC ACC GAT GCG GGT TAT TGC TTC ATT TCG CCG ACA GAT TTA GCG ACA 1545
Asp Thr Asp Ala Gly Tyr Cys Phe Ile Ser Pro Thr Asp Leu Ala Thr
485 490 495
55 60 65

ATG AGC CAC AAG ACT ATT AAA GGC TCT ACA ATT GAG AGT TCT AAT CTT 297
Met Ser His Lys Thr Ile Lys Gly Ser Thr Ile Glu Ser Ser Asn Leu
70 75 80

ATC AGC ATT TAT AAA AAG ATT GCG AGC GGA CTA CCT TTT GGG ACT ATC 345
Ile Ser Ile Tyr Lys Lys Ile Ala Ser Gly Leu Pro Phe Gly Thr Ile
85 90 95

TCG GCG TTT AGA CCT TTT AAA GAC GCT TTT TAT AAA GAC TTT ACC GAA 393
Ser Ala Phe Arg Pro Phe Lys Asp Ala Phe Tyr Lys Asp Phe Thr Glu
100 105 110 115

AAA GAA CAA AAC GCT CTA ATC TAT GCT TAT AAG AGC GGA GCA GAC CCT 441
Lys Glu Gln Asn Ala Leu Ile Tyr Ala Tyr Lys Ser Gly Ala Asp Pro
120 125 130

AAA AAT GCG GAC ATA ATA GCC AAA TAT TGG TTA AGT CAA TCT GTG GAT 489
Lys Asn Ala Asp Ile Ile Ala Lys Tyr Trp Leu Ser Gln Ser Val Asp
135 140 145

TTA GAC CCA TAC GAC CCT ATT AAA GTT GTA GAT TTC TTT CAC CCA CAA 537
Leu Asp Pro Tyr Asp Pro Ile Lys Val Val Asp Phe Phe His Pro Gln
150 155 160

CCT GAA AAT GGT AAA GAG ACT ACA AAA TTT AAG AAC TAC AAA GAT AGG 585
Pro Glu Asn Gly Lys Glu Thr Thr Lys Phe Lys Asn Tyr Lys Asp Arg
165 170 175

ATT GAG AAC ATT TAT GCG ACA CTC TAT AAC ACA TTG GGT AGG GGT TAT 633
Ile Glu Asn Ile Tyr Ala Thr Leu Tyr Asn Thr Leu Gly Arg Gly Tyr
180 185 190 195

GTG GAT AAA TTT TTT AAA AAA GAA GCC ACA ATG AGG GAC TTT ATG TCT 681
Val Asp Lys Phe Phe Lys Lys Glu Ala Thr Met Arg Asp Phe Met Ser
200 205 210

AGC GAT AAA TTT GTT GAG AGA TAC CGC TAC ACT AGA AAA GAA AAT ATG 729
Ser Asp Lys Phe Val Glu Arg Tyr Arg Tyr Thr Arg Lys Glu Asn Met
215 220 225

GCA AGG ACA CAA GCA TTA AAA GAC ATA ATG AAT ATT GAC AGA GAT TTC 777
Ala Arg Thr Gln Ala Leu Lys Asp Ile Met Asn Ile Asp Arg Asp Phe
230 235 240

ATT GGT TAT ATT GAA GTG TTA GGG TAT TGG AAA GAC AAC CCT AAA GAC 825
Ile Gly Tyr Ile Glu Val Leu Gly Tyr Trp Lys Asp Asn Pro Lys Asp
245 250 255

AAT ATC TTA CCA GAC AAA GAG GTT AGC TTT TTT GTA TTC CAA AAC GAA 873
Asn Ile Leu Pro Asp Lys Glu Val Ser Phe Phe Val Phe Gln Asn Glu
260 265 270 275

CCT AGT AGC ACA TTT GAT TTG AAA AAC CAC TTA TTG ATA TGG GGT AAA 921
195 200 205

Glu Leu Thr Leu Phe Asn Ala Thr Ser Cys Ala Thr Tyr Thr Thr Ala
210 215 220

Ser Leu Glu Cys Ile Leu Asp Ser Ser Phe Lys Asn Asn Ala Tyr Glu
225 230 235 240

Asn Leu Pro Thr Tyr Leu Thr Lys Ala Gly Ile Lys Val Phe Trp Tyr
245 250 255

Ser Ala Asn Asp Gly Glu Lys Asn Val Lys Val Thr Ser Tyr Leu Lys
260 265 270

Asn Tyr Glu Leu Ile Gln Lys Cys Pro Asn Cys Glu Ala Ile Ala Pro
275 280 285

Tyr Asp Glu Ser Leu Leu Tyr Asn Leu Pro Asp Leu Leu Lys Glu His
290 295 300

Ser Asn Glu Asn Val Leu Leu Ile Leu His Leu Gln Ala Arg Met Ala
305 310 315 320

Gln Thr Thr Thr Thr Lys Cys Leu
325



(2) INFORMATION FOR SEQ ID NO:1163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49...1827
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163: 

AAGCTCAAAG ATAAAGCGCT ACAATCTCGC TTAGAAAAAG GACACAAA ATG CTA TTG 57

Met Leu Leu 
1 

AAT TAC GAT TTT TTA GAA TTT GTT GAT GAG CCG AAA AGA AAC ACT TCT 105 
Asn Tyr Asp Phe Leu Glu Phe Val Asp Glu Pro Lys Arg Asn Thr Ser 
5 10 15 

TTG ACA GCA TCT ATT GAT AAA GCG TTA GCG GAC AGG AAG TTA GCT AGA 153

Leu Thr Ala Ser Ile Asp Lys Ala Leu Ala Asp Arg Lys Leu Ala Arg
20 25 30 35

CAA AAT AAA CCT AGC GTT AGG GTG CTT GGT AAG GCG ATG CCC TTA AGC 201

Gln Asn Lys Pro Ser Val Arg Val Leu Gly Lys Ala Met Pro Leu Ser
40 45 50

AAG TTT TTA GAT GCT GTT GGC GAT GAA ATC TCA CGA CTT AAA TAT GAT 249

Lys Phe Leu Asp Ala Val Gly Asp Glu Ile Ser Arg Leu Lys Tyr Asp
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TAT GAA TTG ATT CAA AAA TGC CCC AAT TOT GAA GCG ATC GCT CCT TAT 92 3 

Tyr Glu Leu Ile Gln Lys Cys Pro Asn Cys Glu Ala Ile Ala Pro Tyr
275 280 285 

GAT GAA TCT TTA CTT TAT AAT TTG CCT GAG CTT TTA AAA GAA CAC TCT 971 
Asp Glu Ser Leu Leu Tyr Asn Leu Pro Asp Leu Leu Lys Glu His Ser 
290 295 300 305 

AAT GAA AAT GTC TTG CTC ATC TTA CAC TTG CAG GCT CGC ATG GCC CAA 1019 
Asn Glu Asn Val Leu Leu Ile Leu His Leu Gln Ala Arg Met Ala Gln
310 315 320 

ACT ACG ACA ACA AAG TGC CTT TAAATTTTAG GGTGTTTAAG CCTTATTGCT CAAG 1074 
Thr Thr Thr Thr Lys Cys Leu 
325 

CGCTGATCTG TCTTCTT 1091

(2) INFORMATION FOR SEQ ID NO: 1162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162: 

Met Met Val Val Val Leu Phe Cys Val Asn Gly Ala Leu Phe Leu Ala

1 5 10 15

Leu Gly Leu Ile Ser Ala Ser Leu Met Arg Trp Ser Ala Ile Val Phe

20 25 30

Ser Leu Leu Asn Ser Val Ala Phe Tyr Phe Ile Ser Ala Tyr Lys Val

35 40 45

Phe Leu Asn Lys Ser Met Met Gly Asn Val Leu Asn Thr Asn Thr His

50 55 60

Glu Val Leu Gly Phe Leu Ser Val Lys Leu Phe Val Phe Ile Val Val
65 70 75 80

Phe Gly Val Leu Pro Gly Tyr Val Ile Tyr Lys Ile Pro Leu Lys Asn

85 90 95

Ser Ser Lys Lys Ala Pro Phe Leu Ala Ile Leu Ala Leu Val Phe Ile

100 105 110

Phe Ile Ala Ser Ala Leu Ala Asn Thr Lys Asn Trp Leu Trp Phe Asp

115 120 125

Lys His Ala Lys Phe Ile Gly Gly Leu Ile Leu Pro Phe Ala Tyr Ser

130 135 140

Val Asn Ala Phe Arg Val Ser Ala Leu Lys Phe Phe Ala Pro Thr Ile
145 150 155 160

Lys Pro Leu Pro Leu Phe Ser Pro Asn His Ser His Ser Phe Val Val

165 170 175

Leu Val Ile Gly Glu Ser Ala Arg Lys His Asn Tyr Ala Leu Tyr Gly

180 185 190

Tyr Gln Lys Pro Thr Thr Pro Arg Leu Ser Lys Arg Leu Ala Asp Asn
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Leu Asn Lys Ser Met Met Gly Asn Val Leu Asn Thr Asn Thr His Glu 
50 55 60 65 

GTT TTA GGC TTT TTG AGC GTC AAA TTA TTC GTT TTT ATC GTT GTT TTT 299

Val Leu Gly Phe Leu Ser Val Lys Leu Phe Val Phe Ile Val Val Phe
70 75 80

GGG GTG TTG CCT GGC TAT GTC ATC TAT AAA ATC CCC CTT AAA AAT TCT 347

Gly Val Leu Pro Gly Tyr Val Ile Tyr Lys Ile Pro Leu Lys Asn Ser
85 90 95

TCT AAA AAA GCG CCC TTT TTA GCG ATC TTG GCG TTA GTG TTT ATC TTT 395

Ser Lys Lys Ala Pro Phe Leu Ala Ile Leu Ala Leu Val Phe Ile Phe
100 105 110

ATC GCT AGC GCT TTA GCT AAC ACT AAA AAT TGG CTG TGG TTT GAC AAG 443

Ile Ala Ser Ala Leu Ala Asn Thr Lys Asn Trp Leu Trp Phe Asp Lys
115 120 125

CAT GCG AAA TTC ATA GGG GGC TTA ATT TTG CCC TTC GCT TAT AGC GTG 491

His Ala Lys Phe Ile Gly Gly Leu Ile Leu Pro Phe Ala Tyr Ser Val
130 135 140 145

AAC GCT TTT AGA GTG AGC GCT CTC AAA TTT TTC GCC CCC ACC ATC AAG 539

Asn Ala Phe Arg Val Ser Ala Leu Lys Phe Phe Ala Pro Thr Ile Lys
150 155 160

CCG CTC CCT CTT TTT TCA CCC AAT CAT TCC CAT TCG TTT GTG GTG CTA 587

Pro Leu Pro Leu Phe Ser Pro Asn His Ser His Ser Phe Val Val Leu
165 170 175 

GTC ATT GGC GAA AGC GCT AGG AAA CAT AAT TAC GCC CTT TAT GGC TAT 635

Val Ile Gly Glu Ser Ala Arg Lys His Asn Tyr Ala Leu Tyr Gly Tyr
180 185 190

CAA AAA CCC ACC ACC CCA AGA CTA AGC AAG CGT TTA GCG GAT AAT GAA 683

Gln Lys Pro Thr Thr Pro Arg Leu Ser Lys Arg Leu Ala Asp Asn Glu
195 200 205 

CTC ACT CTT TTC AAC GCC ACT TCT TGC GCC ACT TAC ACG ACA GCG AGT 731 
Leu Thr Leu Phe Asn Ala Thr Ser Cys Ala Thr Tyr Thr Thr Ala Ser 
210 215 220 225 

TTG GAA TGC ATT TTA GAT TCT TCT TTT AAA AAC AAC GCT TAT GAA AAT 779

Leu Glu Cys Ile Leu Asp Ser Ser Phe Lys Asn Asn Ala Tyr Glu Asn
230 235 240

TTG CCA ACT TAC TTG ACT AAA GCC GGT ATC AAA GTC TTT TGG TAT AGC 827

Leu Pro Thr Tyr Leu Thr Lys Ala Gly Ile Lys Val Phe Trp Tyr Ser
245 250 255

GCG AAC GAC GGC GAA AAG AAT GTT AAG GTT ACA AGC TAT CTT AAA AAC 875

Ala Asn Asp Gly Glu Lys Asn Val Lys Val Thr Ser Tyr Leu Lys Asn 
260 265 270 
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Leu Ser Val Ala lie Pro Leu lie Gin Arg Asp Ala Val Gly Leu Phe 
65 70 75 80 

Gln Val Ala Asn Ala Ser Ile Ala Thr Thr Leu Leu Thr His Thr Thr

85 90 95

Lys Arg Ala Leu Asn His Val Thr Ile Asn Asp Gln Arg Leu Gly Glu

100 105 110

Arg Pro Tyr Gly Gly Asn Phe Asn Met Pro Ser Gly His Ser Ser Met

115 120 125

Val Gly Leu Ala Val Ala Phe Leu Met Arg Arg Tyr Ser Phe Lys Lys

130 135 140

Tyr Phe Trp Leu Leu Pro Leu Val Pro Leu Thr Met Leu Ala Arg Ile
145 150 155 160

Tyr Leu Asp Met His Thr Ile Gly Ala Val Leu Thr Gly Leu Gly Val

165 170 175

Gly Met Leu Cys Val Xaa Phe Leu Gln Ala Pro Lys Ser Leu Asn Gln
180 185 190 

Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 1161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY : Coding Sequence 

(B) LOCATION: 57... 1040 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161: 

ATTTCCCTTT GTTCGTTTAT GTTTATAAAG AAAGCAACCA GGTCAGTTTT ATCGCC ATG 59

Met 
1 

ATG GTT GTG GTG CTT TTT TGC GTT AAT GGC GCT CTT TTT TTG GCG TTA 107 
Met Val Val Val Leu Phe Cys Val Asn Gly Ala Leu Phe Leu Ala Leu 
5 10 15 

GGC TTG ATC TCT GCT TCT TTG ATG CGT TGG AGT GCG ATA GTT TTT AGC 155 
Gly Leu Ile Ser Ala Ser Leu Met Arg Trp Ser Ala Ile Val Phe Ser
20 25 30

CTG CTC AAT TCC GTT GCT TTC TAT TTC ATT AGC GCT TAT AAG GTG TTT 203

Leu Leu Asn Ser Val Ala Phe Tyr Phe Ile Ser Ala Tyr Lys Val Phe
35 40 45 

TTA AAT AAG AGC ATG ATG GGT AAT GTC TTA AAC ACC AAC ACG CAT GAA 2 51 
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55 60 65 70 

TTA ATC CAA AGA GAT GCG GTA GGG CTT TTT CAA GTC GCT AAC GCT TCT 294

Leu Ile Gln Arg Asp Ala Val Gly Leu Phe Gln Val Ala Asn Ala Ser
75 80 85

ATC GCT ACA ACC CTT CTC ACG CAC ACC ACC AAA AGA GCC TTA AAC CAT 342

Ile Ala Thr Thr Leu Leu Thr His Thr Thr Lys Arg Ala Leu Asn His
90 95 100

GTA ACA ATC AAC GAT CAG CGT TTG GGC GAG CGC CCT TAT GGA GGT AAT 390

Val Thr Ile Asn Asp Gln Arg Leu Gly Glu Arg Pro Tyr Gly Gly Asn
105 110 115

TTC AAC ATG CCA AGC GGG CAT TCG TCT ATG GTG GGT TTG GCG GTG GCG 438

Phe Asn Met Pro Ser Gly His Ser Ser Met Val Gly Leu Ala Val Ala
120 125 130

TTT TTA ATG CGC CGC TAT TCT TTT AAA AAA TAC TTT TGG CTC TTG CCC 486

Phe Leu Met Arg Arg Tyr Ser Phe Lys Lys Tyr Phe Trp Leu Leu Pro
135 140 145 150

CTA GTC CCT TTG ACC ATG CTC GCT CGC ATT TAT TTA GAC ATG CAC ACC 534

Leu Val Pro Leu Thr Met Leu Ala Arg Ile Tyr Leu Asp Met His Thr
155 160 165

ATT GGC GCG GTG CTG ACC GGG CTT GGC GTT GGA ATG TTG TGC GTA ASC 582
Ile Gly Ala Val Leu Thr Gly Leu Gly Val Gly Met Leu Cys Val Xaa
170 175 180

TTT TTA CAA GCC CCA AAA AGC CTT AAT CAA AAG CTT TAGTTTCTGT TTTTA 633

Phe Leu Gln Ala Pro Lys Ser Leu Asn Gln Lys Leu
185 190 



(2) INFORMATION FOR SEQ ID NO: 1160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1160: 



Met Lys Lys Phe Leu Phe Lys Gln Lys Phe Cys Glu Ser Leu Pro Lys
1 5 10 15

Ser Phe Ser Lys Thr Leu Leu Ala Leu Ser Leu Gly Leu Ile Leu Leu
20 25 30

Gly Ile Phe Ala Pro Phe Pro Lys Val Pro Lys Gln Pro Ser Val Pro
35 40 45

Leu Met Phe His Phe Thr Glu His Tyr Ala Arg Phe Ile Pro Thr Ile
50 55 60
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195 200 205

Leu Lys Thr Met Gln Met Ile Leu Asp Lys Leu Gly Val Lys His Ile
210 215 220

Asp Phe Ile Asn Gly Lys Thr Leu Leu Glu His Leu Phe Asn Pro Thr
225 230 235 240

Thr Asp Val Ser Asn Ile Gly Leu Ile Ile Thr Asp Leu Glu Met Pro
245 250 255

Glu Ala Ser Gly Phe Glu Val Ile Lys Gln Val Lys Asn Asn Pro Leu
260 265 270

Thr Ser Lys Ile Pro Ile Val Val Asn Ser Ser Met Ser Gly Ser Ser
275 280 285

Asn Glu Asp Met Ala Arg Ser Leu Lys Ala Asp Asp Phe Ile Ser Lys
290 295 300

Ser Asn Pro Lys Asp Ile Gln Arg Val Val Lys Gln Phe Leu Glu Leu
305 310 315 320

Ala



(2) INFORMATION FOR SEQ ID NO: 1159:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37... 618 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159: 

ATCTGCTTAA ACACAAAAAA GAGTAAAATA ACACGC ATG AAA AAA TTC TTA TTT 54 

Met Lys Lys Phe Leu Phe 
1 5 

AAA CAA AAA TTT TGT GAA AGC CTG CCC AAA AGC TTT TCT AAA ACT TTG 102
Lys Gln Lys Phe Cys Glu Ser Leu Pro Lys Ser Phe Ser Lys Thr Leu
10 15 20

TTA GCG CTC AGT TTG GGC TTG ATT TTA TTA GGC ATT TTT GCG CCT TTC 150

Leu Ala Leu Ser Leu Gly Leu Ile Leu Leu Gly Ile Phe Ala Pro Phe
25 30 35

CCT AAA GTC CCT AAA CAG CCT AGC GTG CCT TTA ATG TTT CAT TTC ACC 198

Pro Lys Val Pro Lys Gln Pro Ser Val Pro Leu Met Phe His Phe Thr
40 45 50

GAG CAT TAT GCG CGC TTT ATC CCT ACG ATT TTA TCT GTG GCG ATT CCC 246

Glu His Tyr Ala Arg Phe Ile Pro Thr Ile Leu Ser Val Ala Ile Pro

WO 98/43478



PCT/US98/06371 



AAC AAT CCT TTG ACT TCA AAA ATC CCT ATC GTG GTC AAT TCT TCT ATG 866

Asn Asn Pro Leu Thr Ser Lys Ile Pro Ile Val Val Asn Ser Ser Met
270 275 280 

AGC GGG AGT TCT AAT GAA GAC ATG GCC AGG AGT TTG AAG GCC GAT GAT 914 
Ser Gly Ser Ser Asn Glu Asp Met Ala Arg Ser Leu Lys Ala Asp Asp 
285 290 295 300 

TTC ATT TCC AAG TCT AAC CCC AAA GAC ATC CAG CGA GTG GTT AAG CAA 962 
Phe Ile Ser Lys Ser Asn Pro Lys Asp Ile Gln Arg Val Val Lys Gln
305 310 315 

TTT TTG GAA TTA GCA TGAAAAAATA CAGCACTATC CCCACCCCTT GCTACGTGTT A 1018 
Phe Leu Glu Leu Ala 
320 

GAGAGCGAAC GCTTAGAAAA AAACGC 1044 
(2) INFORMATION FOR SEQ ID NO: 1158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158:

Met Ala Asp Ser Leu Ala Gly Ile Asp Gln Val Thr Ser Leu His Lys

1 5 10 15

Asn Asn Glu Leu Gln Leu Leu Cys Phe Arg Leu Gly Lys Asn Lys Asp

20 25 30

Leu Tyr Ala Val Asn Val Phe Lys Ile Arg Glu Val Val Lys Tyr His

35 40 45

Gly Asn Leu Thr Ile Ile Ser His Glu Asn Asn Ser Leu Val Glu Gly

50 55 60

Leu Ile Ile Ile Arg Glu Leu Thr Ile Pro Leu Ile Asp Met Lys Lys
65 70 75 80

Trp Phe Tyr Tyr Asp Ser Gln Asn Lys Asn Lys Asp Leu Arg Pro Tyr

85 90 95

Arg Ile Glu Lys Glu Lys Gly Glu Asp Asp Ile Val Met Ile Cys Glu

100 105 110

Phe Ser Arg Trp Thr Ile Gly Val Arg Ile Tyr Glu Ala Asp Arg Ile

115 120 125

Leu Ser Lys Lys Trp Thr Glu Met Glu Gln Ser Ala Gly Leu Gly Gly

130 135 140

Ser Ala Gly Asn Asn Lys Leu Val Ser Arg Thr Arg Tyr Phe Asp Gly
145 150 155 160

Arg Leu Val Gln Val Val Asp Ile Glu Lys Met Leu Ile Asp Val Phe

165 170 175

Pro Trp Ile Glu Asp Glu Lys His Asn Asp Leu Glu Thr Leu Ser Lys

180 185 190

Ile His Ser Asn Gln Cys Val Leu Leu Ala Asp Asp Ser Pro Ser Val
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GTG AAA TAC CAT GGC AAT CTC ACC ATC ATT AGC CAC GAA AAC AAT TCG 194 
Val Lys Tyr His Gly Asn Leu Thr lie lie Ser His Glu Asn Asn Ser 
45 50 55 60 

CTC GTT GAG GGG CTA ATC ATT ATA AGA GAA CTC ACC ATT CCC TTG ATT 24 2 

Leu Val Glu Gly Leu lie lie lie Arg Glu Leu Thr lie Pro Leu lie 
65 70 75 

GAT ATG AAA AAA TGG TTT TAT TAT GAC AGC CAA AAC AAA AAC AAG GAT 2 90 

Asp Met Lys Lys Trp Phe Tyr Tyr Asp Ser Gin Asn Lys Asn Lys Asp 
80 85 90 

TTA CGC CCT TAT AGG ATA GAA AAA GAA AAA GGC GAA GAT GAT ATT GTT 33 8 

Leu Arg Pro Tyr Arg He Glu Lys Glu Lys Gly Glu Asp Asp He Val 
95 100 105 

ATG ATT TGT GAG TTT TCT CGC TGG ACT ATA GGG GTT AGG ATC TAT GAA 3 86 

Met He Cys Glu Phe Ser Arg Trp Thr He Gly Val Arg He Tyr Glu 
110 115 120 

GCG GAT AGG ATT TTG AGC AAG AAA TGG ACT GAA ATG GAG CAA AGC GCT 4 34 

Ala Asp Arg He Leu Ser Lys Lys Trp Thr Glu Met Glu Gin Ser Ala 
125 130 135 140 

GGG CTA GGG GGA TCT GCA GGC AAT AAC AAA CTC GTG AGC CGC ACG CGC 4 82 

Gly Leu Gly Gly Ser Ala Gly Asn Asn Lys Leu Val Ser Arg Thr Arg 
145 150 155 

TAT TTT GAT GGG CGC TTG GTG CAA GTG GTG GAT ATT GAA AAA ATG CTT 53 0 

Tyr Phe Asp Gly Arg Leu Val Gin Val Val Asp He Glu Lys Met Leu 
160 165 170 

ATA GAC GTG TTC CCT TGG ATT GAA GAT GAA AAA CAC AAC GAT TTA GAG 57 8 

He Asp Val Phe Pro Trp He Glu Asp Glu Lys His Asn Asp Leu Glu 
175 180 185 

ACG CTT TCT AAA ATC CAT TCT AAC CAA TGC GTT TTG CTT GCT GAT GAC 62 6 

Thr Leu Ser Lys He His Ser Asn Gin Cys Val Leu Leu Ala Asp Asp 
190 195 200 

TCC CCA AGC GTT TTG AAA ACC ATG CAA ATG ATT TTA GAC AAG CTG GGC 6 74 

Ser Pro Ser Val Leu Lys Thr Met Gin Met He Leu Asp Lys Leu Gly 
205 210 215 220 

GTC AAG CAT ATA GAT TTT ATC AAT GGT AAA ACC TTA CTA GAG CAT TTA 72 2 

Val Lys His He Asp Phe He Asn Gly Lys Thr Leu Leu Glu His Leu 
225 230 235 

TTC AAC CCC ACA ACC GAT GTG AGT AAT ATT GGC CTG ATT ATT ACC GAT 77 0 

Phe Asn Pro Thr Thr Asp Val Ser Asn He Gly Leu He He Thr Asp 
240 245 250 

TTG GAA ATG CCA GAG GCG AGC GGT TTT GAA GTG ATC AAG CAG GTT AAA 818 
Leu Glu Met Pro Glu Ala Ser Gly Phe Glu Val He Lys Gin Val Lys 
255 260 265 
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(2) INFORMATION FOR SEQ ID NO: 1156: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156: 



Met Ile Ile Leu Ser Ala Ser Val Lys Asn Leu Arg Glu Ile Ser Val

1 5 10 15

Lys Glu Lys Phe Leu Trp Leu Asn Ala Lys Ser Tyr Leu Ile Ser Val

20 25 30

Phe Ala Pro Phe Ile Leu Leu Pro Trp Ile Asp Leu Leu Ser Ala Phe

35 40 45

Leu Leu Tyr Leu Gly Phe Leu Ala Leu Phe Ser Val Leu Glu Phe Phe

50 55 60

Asp Glu Asp Ile Ala Asp Ile Ile Val Ala Lys Ser Lys Ile Lys Thr
65 70 75 80 

Lys Thr Lys Cys Tyr Arg Ala 
85 



(2) INFORMATION FOR SEQ ID NO: 1157:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 15... 977 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157: 

ATTAAGGGGA AGTC ATG GCT GAT AGT TTA GCG GGC ATT GAT CAA GTT ACG 50

Met Ala Asp Ser Leu Ala Gly Ile Asp Gln Val Thr
1 5 10

AGT TTG CAT AAA AAT AAC GAG TTA CAA TTG TTG TGT TTC AGG CTG GGT 98

Ser Leu His Lys Asn Asn Glu Leu Gln Leu Leu Cys Phe Arg Leu Gly
15 20 25

AAA AAC AAG GAT TTG TAT GCG GTC AAT GTT TTT AAG ATC CGT GAA GTG 146

Lys Asn Lys Asp Leu Tyr Ala Val Asn Val Phe Lys Ile Arg Glu Val
30 35 40
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Asn His Leu Gin Tyr Ser Phe Glu Lys lie Gin Asn Leu Leu Glu Glu 

245 250 255 

Asp Ala Leu Gin Ala Lys Glu His Ala Asn Leu Ser Tyr Val Tyr Gin 
260 265 270 

lie Val Glu 
275 

(2) INFORMATION FOR SEQ ID NO: 1155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 40... 300 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1155 : 

AGCTCCGAGT TTAGCGAATT GGTTTATGGG AATTTTTTA ATG ATT ATC CTG TCA 54 

Met lie lie Leu Ser 
1 5 

GCG AGC GTG AAG AAT TTG CGT GAA ATT TCG GTT AAA GAA AAA TTT TTA 102 
Ala Ser Val Lys Asn Leu Arg Glu lie Ser Val Lys Glu Lys Phe Leu 
10 15 20 

TGG CTG AAC GCT AAG TCT TAT TTG ATT TCT GTT TTT GCG CCT TTT ATC 15 0 

Trp Leu Asn Ala Lys Ser Tyr Leu lie Ser Val Phe Ala Pro Phe lie 
25 30 35 

TTG CTC CCT TGG ATT GAT TTG TTG AGC GCT TTT TTA TTG TAT TTA GGG 19 8 

Leu Leu Pro Trp lie Asp Leu Leu Ser Ala Phe Leu Leu Tyr Leu Gly 
40 45 50 

TTT TTA GCG CTC TTT AGC GTG CTG GAA TTT TTT GAT GAA GAC ATT GCA 24 6 

Phe Leu Ala Leu Phe Ser Val Leu Glu Phe Phe Asp Glu Asp lie Ala 
55 60 65 

GAT ATT ATC GTG GCT AAA AGC AAA ATA AAG ACT AAA ACC AAA TGT TAT 2 94 

Asp lie lie Val Ala Lys Ser Lys lie Lys Thr Lys Thr Lys Cys Tyr 
70 75 80 85 

AGA GCG TAGAATGTTA GAAAAGCTTT TAAGCGCTAT CAAACAA 337 
Arg Ala 
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lie Val Ala Tyr Asn His Leu Gin Tyr Ser Phe Glu Lys lie Gin Asn 
240 245 250 

CTC TTA GAA GAG GAC GCC CTA CAA GCC AAA GAA CAC GCT AAT CTT TCT 817 
Leu Leu Glu Glu Asp Ala Leu Gin Ala Lys Glu His Ala Asn Leu Ser 
255 260 265 

TAT GTC TAT CAA ATC GTA GAA TAGTAAGGCT TTTACACTCT TTGGCTTTGC TTTT 872
Tyr Val Tyr Gln Ile Val Glu
270 275 

TTACCCTTT 881 

(2) INFORMATION FOR SEQ ID NO: 1154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1154 : 

Met Gin Glu Phe Leu Gly Phe Gly Val Val Gly Asn Phe Ala Gly His 

1 5 10 15

Leu Glu Gin Ala Gly Glu Ser His Ser Phe He Asn Met Lys Ser Glu 

20 25 30 

Glu Lys Asp Ala Pro Lys Gly Leu Phe Pro Phe Tyr He Pro Tyr Glu 

35 40 45 

Asn Cys Tyr Leu Gly Arg Cys Cys He Asp Asn His Lys He He Leu 

50 55 60 

Pro Ser Asp Leu Asp Leu Arg Val Gin Ala Glu Pro Glu He Ala Leu 
65 70 75 80 

Glu Cys Asp Val Lys Tyr Asp Glu Lys His Leu Val Ala Lys Leu Val 

85 90 95 

Pro Asn Phe Phe Met Ala Phe Asn Asp Ala Ser Val Arg Asn Leu Asp 

100 105 110 

Ala Ala Lys Leu Ser Gin Lys Lys Asn Phe Ser Pro Ala Ser Lys Gly 

115 120 125 

He Gly Gin Lys Leu Pro He Asp Arg Phe Val Tyr Gly Gly Val Cys 

130 135 140 

Asn Asn Phe Ser He Ala Ser Phe Leu Lys Tyr Asn Asn Val Trp His 
145 150 155 160 

He Tyr Gly Glu Asn Ser Lys Leu Leu Lys Tyr Glu Phe Phe Tyr Gin 

165 170 175 

Lys Leu Leu Asp Trp He Lys Asp Gin Leu Asn His Gin Gin Asp Gly 

180 185 190 

Asp Ser Leu Glu Ala Leu Arg Pro Phe Leu Glu Arg His Asn Phe Pro 

195 200 205 

Thr Lys Met He Phe Ala He Gly Ala Thr Pro Tyr Met Pro Phe Ala 

210 215 220 

Gin Glu His Phe Leu Gin Lys Gly Asp Glu Val Val He Val Ala Tyr 
225 230 235 240 
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ATG AAA AGC GAA GAA AAG GAC GCC CCT AAG GGG CTA TTC CCT TTT TAT 14 5 

Met Lys Ser Glu Glu Lys Asp Ala Pro Lys Gly Leu Phe Pro Phe Tyr 
30 35 40 

ATC CCC TAT GAA AAT TGT TAT TTG GGG CGT TGT TGC ATT GAT AAC CAT 193 
lie Pro Tyr Glu Asn Cys Tyr Leu Gly Arg Cys Cys lie Asp Asn His 
45 50 55 60 

AAG ATT ATT TTG CCT AGT GAT CTA GAT TTA AGG GTG CAA GCA GAG CCA 241 
Lys lie lie Leu Pro Ser Asp Leu Asp Leu Arg Val Gin Ala Glu Pro 
65 70 75 

GAA ATC GCT TTA GAA TGC GAT GTT AAA TAC GAT GAA AAA CAT TTG GTT 2 89 

Glu lie Ala Leu Glu Cys Asp Val Lys Tyr Asp Glu Lys His Leu Val 
80 85 90 

GCA AAG CTC GTG CCT AAT TTT TTC ATG GCG TTT AAT GAC GCT TCT GTG 3 37 

Ala Lys Leu Val Pro Asn Phe Phe Met Ala Phe Asn Asp Ala Ser Val 
95 100 105 

CGC AAT TTA GAC GCC GCA AAA CTC TCC CAA AAA AAG AAT TTT TCA CCG 3 85 

Arg Asn Leu Asp Ala Ala Lys Leu Ser Gin Lys Lys Asn Phe Ser Pro 
110 115 120 

GCT TCT AAA GGT ATA GGG CAG AAA TTG CCC ATT GAC AGG TTT GTT TAT 43 3 

Ala Ser Lys Gly lie Gly Gin Lys Leu Pro lie Asp Arg Phe Val Tyr 
125 130 135 140 

GGG GGG GTG TGT AAC AAT TTC TCT ATC GCG TCT TTT TTG AAA TAC AAT 4 81 

Gly Gly Val Cys Asn Asn Phe Ser lie Ala Ser Phe Leu Lys Tyr Asn 
145 150 155 

AAT GTT TGG CAC ATT TAT GGG GAA AAC AGC AAA TTG CTC AAA TAC GAG 52 9 

Asn Val Trp His lie Tyr Gly Glu Asn Ser Lys Leu Leu Lys Tyr Glu 
160 165 170 

TTT TTT TAT CAA AAG CTT TTA GAT TGG ATT AAA GAC CAA TTA AAC CAC 57 7 

Phe Phe Tyr Gin Lys Leu Leu Asp Trp lie Lys Asp Gin Leu Asn His 
175 180 185 

CAA CAA GAT GGC GAC TCT TTA GAG GCT CTA AGA CCT TTT TTA GAG CGC 62 5 

Gin Gin Asp Gly Asp Ser Leu Glu Ala Leu Arg Pro Phe Leu Glu Arg 
190 195 200 

CAT AAT TTC CCC ACT AAA ATG ATT TTT GCA ATA GGG GCT ACC CCT TAT 67 3 

His Asn Phe Pro Thr Lys Met lie Phe Ala lie Gly Ala Thr Pro Tyr 
205 210 215 220 

ATG CCT TTT GCG CAA GAG CAT TTT TTG CAA AAA GGC GAT GAG GTG GTG 721 
Met Pro Phe Ala Gin Glu His Phe Leu Gin Lys Gly Asp Glu Val Val 
225 230 235 

ATC GTT GCT TAC AAC CAT TTA CAA TAC AGT TTT GAA AAG ATT CAA AAC 76 9 
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325 330 335

His Ser His Asp Val Lys Asp Arg Val Ala Gln Ile Lys Thr Gln Ile
340 345 350

Ala Ser Thr Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gln Glu Arg Leu
355 360 365

Ala Lys Leu Ser Gly Gly Val Ala Val Ile Lys Val Gly Ala Ala Ser
370 375 380

Glu Val Glu Met Lys Glu Lys Lys Asp Arg Val Asp Asp Ala Leu Ser
385 390 395 400

Ala Thr Lys Ala Ala Val Glu Glu Gly Ile Val Ile Gly Gly Gly Ala
405 410 415

Ala Leu Ile Arg Ala Ala Gln Lys Val His Leu Asn Leu His Asp Asp
420 425 430

Glu Lys Val Gly Tyr Glu Ile Ile Met Arg Ala Ile Lys Ala Pro Leu
435 440 445

Ala Gln Ile Ala Ile Asn Ala Gly Tyr Asp Gly Gly Val Val Val Asn
450 455 460

Glu Val Glu Lys His Glu Gly His Phe Gly Phe Asn Ala Ser Asn Gly
465 470 475 480

Lys Tyr Val Asp Met Phe Lys Glu Gly Ile Ile Asp Pro Leu Lys Val
485 490 495

Glu Arg Ile Ala Leu Gln Asn Ala Val Ser Val Ser Ser Leu Leu Leu
500 505 510

Thr Thr Glu Ala Thr Val His Glu Ile Lys Glu Glu Lys Ala Ala Pro
515 520 525

Ala Met Pro Asp Met Gly Gly Met Gly Gly Met Gly Gly Met Gly Gly
530 535 540

Met Met
545



(2) INFORMATION FOR SEQ ID NO: 1153: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 14...838
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153: 



CTAGGAGGTT TGC ATG CAA GAG TTT TTA GGT TTT GGT GTG GTG GGG AAT 49

Met Gln Glu Phe Leu Gly Phe Gly Val Val Gly Asn
1 5 10

TTT GCA GGG CAT TTG GAG CAA GCA GGA GAG AGT CAT AGT TTT ATT AAC 97
Phe Ala Gly His Leu Glu Gln Ala Gly Glu Ser His Ser Phe Ile Asn
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AAAATCCCCT AAAATCCCCC CTTTCTAAAA TCTC 17 24 

(2) INFORMATION FOR SEQ ID NO:1152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152:



Met Ala Lys Glu Ile Lys Phe Ser Asp Ser Ala Arg Asn Leu Leu Phe
1 5 10 15

Glu Gly Val Arg Gln Leu His Asp Ala Val Lys Val Thr Met Gly Pro
20 25 30

Arg Gly Arg Asn Val Leu Ile Gln Lys Ser Tyr Gly Ala Pro Ser Ile
35 40 45

Thr Lys Asp Gly Val Ser Val Ala Lys Glu Ile Glu Leu Ser Cys Pro
50 55 60

Val Ala Asn Met Gly Ala Gln Leu Val Lys Glu Val Ala Ser Lys Thr
65 70 75 80

Ala Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Tyr
85 90 95

Ser Ile Phe Lys Glu Gly Leu Arg Asn Ile Thr Ala Gly Ala Asn Pro
100 105 110

Ile Glu Val Lys Arg Gly Met Asp Lys Ala Ala Glu Ala Ile Ile Asn
115 120 125

Glu Leu Lys Lys Ala Ser Lys Lys Val Gly Gly Lys Glu Glu Ile Thr
130 135 140

Gln Val Ala Thr Ile Ser Ala Asn Ser Asp His Asn Ile Gly Lys Leu
145 150 155 160

Ile Ala Asp Ala Met Glu Lys Val Gly Lys Asp Gly Val Ile Thr Val
165 170 175

Glu Glu Ala Lys Gly Ile Glu Asp Glu Leu Asp Val Val Glu Gly Met
180 185 190

Gln Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe Val Thr Asn Ala Glu
195 200 205

Lys Met Thr Ala Gln Leu Asp Asn Ala Tyr Ile Leu Leu Thr Asp Lys
210 215 220

Lys Ile Ser Ser Met Lys Asp Ile Leu Pro Leu Leu Glu Lys Thr Met
225 230 235 240

Lys Glu Gly Lys Pro Leu Leu Ile Ile Ala Glu Asp Ile Glu Gly Glu
245 250 255

Ala Leu Thr Thr Leu Val Val Asn Lys Leu Arg Gly Val Leu Asn Ile
260 265 270

Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Glu Met Leu
275 280 285

Lys Asp Ile Ala Ile Leu Thr Gly Gly Gln Val Ile Ser Glu Glu Leu
290 295 300

Gly Leu Ser Leu Glu Asn Ala Glu Val Glu Phe Leu Gly Lys Ala Gly
305 310 315 320

Arg Ile Val Ile Asp Lys Asp Asn Thr Thr Ile Val Asp Gly Lys Gly
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GTA GAT GGC AAA GGC CAT AGC CAT GAT GTC AAA GAC AGA GTC GCG CAA 105 9 

Val Asp Gly Lys Gly His Ser His Asp Val Lys Asp Arg Val Ala Gin 
335 340 345 

ATC AAA ACC CAA ATT GCA AGC ACG ACA AGC GAT TAT GAC AAA GAA AAA 110 7 

lie Lys Thr Gin lie Ala Ser Thr Thr Ser Asp Tyr Asp Lys Glu Lys 
350 355 360 

TTG CAA GAA AGG TTG GCT AAA CTC TCT GGC GGT GTG GCT GTG ATT AAA 115 5 

Leu Gin Glu Arg Leu Ala Lys Leu Ser Gly Gly Val Ala Val lie Lys 
365 370 375 

GTG GGC GCT GCG AGT GAA GTG GAA ATG AAA GAG AAA AAA GAC CGG GTT 12 0 3 

Val Gly Ala Ala Ser Glu Val Glu Met Lys Glu Lys Lys Asp Arg Val 
380 385 390 395 

GAT GAT GCG TTG AGC GCG ACT AAA GCG GCT GTT GAA GAA GGT ATT GTG 12 51 

Asp Asp Ala Leu Ser Ala Thr Lys Ala Ala Val Glu Glu Gly lie Val 
400 405 410 

ATT GGC GGC GGT GCG GCT CTC ATT CGC GCG GCT CAA AAA GTG CAT TTG 12 9 9 

lie Gly Gly Gly Ala Ala Leu lie Arg Ala Ala Gin Lys Val His Leu 
415 420 425 

AAT TTG CAC GAT GAT GAA AAA GTG GGC TAT GAA ATC ATC ATG CGC GCC 13 4 7 

Asn Leu His Asp Asp Glu Lys Val Gly Tyr Glu lie lie Met Arg Ala 
430 435 440 

ATT AAA GCC CCA TTA GCT CAA ATC GCT ATC AAT GCC GGT TAT GAT GGC 13 95 

lie Lys Ala Pro Leu Ala Gin lie Ala lie Asn Ala Gly Tyr Asp Gly 
445 450 455 

GGT GTG GTC GTG AAT GAA GTA GAA AAA CAC GAA GGG CAT TTT GGT TTT 144 3 

Gly Val Val Val Asn Glu Val Glu Lys His Glu Gly His Phe Gly Phe 
460 465 470 475 

AAC GCT AGC AAT GGC AAG TAT GTG GAT ATG TTT AAA GAA GGC ATT ATT 14 91 

Asn Ala Ser Asn Gly Lys Tyr Val Asp Met Phe Lys Glu Gly lie lie 
480 485 490 

GAC CCC TTA AAA GTA GAA AGG ATC GCT TTA CAA AAT GCG GTT TCG GTT 15 3 9 

Asp Pro Leu Lys Val Glu Arg lie Ala Leu Gin Asn Ala Val Ser Val 
495 500 505 

TCA AGC CTG CTT TTA ACC ACA GAA GCC ACC GTG CAT GAA ATC AAA GAA 15 8 7 

Ser Ser Leu Leu Leu Thr Thr Glu Ala Thr Val His Glu lie Lys Glu 
510 515 520 

GAA AAA GCG GCC CCA GCA ATG CCT GAT ATG GGT GGC ATG GGC GGT ATG 16 3 5 

Glu Lys Ala Ala Pro Ala Met Pro Asp Met Gly Gly Met Gly Gly Met 
525 530 535 

GGA GGC ATG GGT GGC ATG ATG TAAGCCCCCT TGCTTTTTAG TATCATCTGC TTTT 1690
Gly Gly Met Gly Gly Met Met 
540 545 
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Ala Gly Ala Asn Pro lie Glu Val Lys Arg Gly Met Asp Lys Ala Ala 
110 115 120 

GAA GCC ATT ATT AAT GAG CTT AAA AAA GCG AGC AAA AAA GTG GGC GGT 435 
Glu Ala lie lie Asn Glu Leu Lys Lys Ala Ser Lys Lys Val Gly Gly 
125 130 135 

AAA GAA GAA ATC ACC CAA GTG GCG ACC ATT TCT GCA AAC TCC GAT CAC 4 83 

Lys Glu Glu lie Thr Gin Val Ala Thr lie Ser Ala Asn Ser Asp His 
140 145 150 155 

AAT ATC GGG AAA CTC ATC GCT GAC GCT ATG GAA AAA GTG GGT AAA GAC 531 
Asn lie Gly Lys Leu lie Ala Asp Ala Met Glu Lys Val Gly Lys Asp 
160 165 170 

GGC GTG ATC ACC GTT GAA GAA GCT AAG GGC ATT GAA GAT GAA CTA GAT 57 9 

Gly Val lie Thr Val Glu Glu Ala Lys Gly lie Glu Asp Glu Leu Asp 
175 180 185 

GTT GTA GAA GGC ATG CAA TTT GAT AGA GGC TAC CTC TCC CCT TAT TTT 62 7 

Val Val Glu Gly Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe 
190 195 200 

GTA ACA AAC GCT GAG AAA ATG ACC GCT CAA TTG GAT AAC GCT TAC ATC 67 5 

Val Thr Asn Ala Glu Lys Met Thr Ala Gin Leu Asp Asn Ala Tyr lie 
205 210 215 

CTT TTA ACG GAT AAA AAA ATC TCT AGC ATG AAA GAC ATT CTC CCG CTA 72 3 

Leu Leu Thr Asp Lys Lys lie Ser Ser Met Lys Asp lie Leu Pro Leu 
220 225 230 235 

CTA GAA AAA ACC ATG AAA GAG GGC AAA CCG CTT TTA ATC ATC GCT GAA 771 
Leu Glu Lys Thr Met Lys Glu Gly Lys Pro Leu Leu lie lie Ala Glu 
240 245 250 

GAC ATT GAG GGC GAA GCT TTA ACG ACT CTA GTG GTG AAT AAA TTA AGA 819 
Asp lie Glu Gly Glu Ala Leu Thr Thr Leu Val Val Asn Lys Leu Arg 
255 260 265 

GGC GTG TTG AAT ATC GCA GCG GTT AAA GCT CCA GGC TTT GGG GAC AGA 867 
Gly Val Leu Asn lie Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg 
270 275 280 

AGA AAA GAA ATG CTC AAA GAC ATC GCT ATT TTA ACC GGC GGT CAA GTT 915 
Arg Lys Glu Met Leu Lys Asp lie Ala lie Leu Thr Gly Gly Gin Val 
285 290 295 

ATT AGC GAA GAA TTG GGC TTG AGT CTA GAA AAC GCT GAA GTG GAG TTT 96 3 

lie Ser Glu Glu Leu Gly Leu Ser Leu Glu Asn Ala Glu Val Glu Phe 
300 305 310 315 

TTA GGC AAA GCC GGA AGG ATT GTG ATT GAC AAA GAC AAC ACC ACG ATC 1011 
Leu Gly Lys Ala Gly Arg lie Val lie Asp Lys Asp Asn Thr Thr lie 
320 325 330 
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Ser Phe Gly Val Leu Thr Thr Asp Asn lie Glu Gin Ala lie Glu Arg 

115 120 125 

Ala Gly Ser Lys Ala Gly Asn Lys Gly Phe Glu Ala Met Ser Thr Leu 

130 135 140 

lie Glu Leu Leu Ser Leu Cys Gin Thr Leu Lys Gly 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 1151: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 1656 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151: 

TATTATTAAG GATACAAA ATG GCA AAA GAA ATC AAA TTT TCA GAT AGC GCG 51 

Met Ala Lys Glu Ile Lys Phe Ser Asp Ser Ala
1 5 10

AGA AAC CTT TTA TTT GAA GGC GTG AGA CAA CTC CAT GAC GCT GTT AAA 99

Arg Asn Leu Leu Phe Glu Gly Val Arg Gln Leu His Asp Ala Val Lys
15 20 25 

GTA ACC ATG GGG CCA AGA GGC AGG AAC GTG TTG ATC CAA AAA AGC TAT 14 7 

Val Thr Met Gly Pro Arg Gly Arg Asn Val Leu lie Gin Lys Ser Tyr 
30 35 40 

GGC GCT CCA AGC ATC ACT AAA GAT GGC GTG AGC GTG GCT AAA GAG ATT 195 
Gly Ala Pro Ser lie Thr Lys Asp Gly Val Ser Val Ala Lys Glu lie 
45 50 55 

GAA TTA AGT TGC CCG GTA GCT AAC ATG GGC GCT CAA CTC GTT AAA GAA 24 3 

Glu Leu Ser Cys Pro Val Ala Asn Met Gly Ala Gin Leu Val Lys Glu 
60 65 70 75 

GTA GCG AGC AAA ACC GCT GAT GCT GCC GGC GAT GGC ACG ACC ACA GCG 2 91 

Val Ala Ser Lys Thr Ala Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala 
80 85 90 

ACC GTG CTG GCT TAT AGC ATT TTT AAA GAA GGT TTG AGG AAC ATC ACG 339 
Thr Val Leu Ala Tyr Ser lie Phe Lys Glu Gly Leu Arg Asn lie Thr 
95 100 105 

GCT GGG GCT AAC CCT ATT GAA GTG AAA CGA GGC ATG GAT AAA GCC GCT 3 87 
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TTG CCT TTT ATT TTA GAC AAA TTA TTA GAG AGC GAA AAA TAG GAT GGC 24 3 

Leu Pro Phe Ile Leu Asp Lys Leu Leu Glu Ser Glu Lys Tyr Asp Gly
60 65 70 75 

GTG TGC GTT TTG GGA GCG ATC ATT AGA GGG GGG ACT CCG CAT TTT GAT 2 91 

Val Cys Val Leu Gly Ala lie lie Arg Gly Gly Thr Pro His Phe Asp 
80 85 90 

TAT GTG AGC GCG GAA GCG ACT AAG GGT ATT GCC CAT GCG ATG CTT AAA 33 9 

Tyr Val Ser Ala Glu Ala Thr Lys Gly lie Ala His Ala Met Leu Lys 
95 100 105 

TAC AGC ATG CCG GTA AGC TTT GGC GTG CTG ACC ACG GAC AAT ATT GAA 387

Tyr Ser Met Pro Val Ser Phe Gly Val Leu Thr Thr Asp Asn Ile Glu
110 115 120 

CAA GCG ATT GAA AGA GCG GGC AGT AAA GCC GGC AAT AAG GGC TTT GAA 435 
Gin Ala He Glu Arg Ala Gly Ser Lys Ala Gly Asn Lys Gly Phe Glu 
125 130 135 

GCG ATG AGC ACC CTC ATT GAA TTG TTG AGC TTG TGC CAA ACT CTC AAG 483

Ala Met Ser Thr Leu Ile Glu Leu Leu Ser Leu Cys Gln Thr Leu Lys
140 145 150 155 

GGT TAAAATGGCG ACACGAACTC AAGCCAGGGG GGCTGTG 523

Gly 



(2) INFORMATION FOR SEQ ID NO: 1150: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150: 



Met Gln Ile Ile Glu Gly Lys Leu Gln Leu Gln Gly Asn Glu Arg Val

1 5 10 15

Ala Ile Leu Thr Ser Arg Phe Asn His Ile Ile Thr Asp Arg Leu Gln

20 25 30 

Glu Gly Ala Met Asp Cys Phe Lys Arg His Gly Gly Asp Glu Asp Leu 

35 40 45 

Leu Asp Ile Val Leu Val Pro Gly Ala Tyr Glu Leu Pro Phe Ile Leu

50 55 60 

Asp Lys Leu Leu Glu Ser Glu Lys Tyr Asp Gly Val Cys Val Leu Gly 
65 70 75 80 

Ala Ile Ile Arg Gly Gly Thr Pro His Phe Asp Tyr Val Ser Ala Glu

85 90 95 

Ala Thr Lys Gly Ile Ala His Ala Met Leu Lys Tyr Ser Met Pro Val
100 105 110
1 5 10 15

Asp Tyr Ala Lys Tyr Pro Arg Arg Ile Ala Glu Glu Leu Gln His Tyr

20 25 30 

Gly Gly Asn Ser Phe Ala Asn Phe Phe Arg Asp Glu Gly Val Leu Tyr 

35 40 45 

Lys Glu Ile Leu Cys Asp Ala Cys Asp His Leu Lys Val Asn Tyr Asn

50 55 60 

Glu Glu Ser Ala Thr Ser Leu Ile Glu Gln Asn Met Leu Ser Lys Leu
65 70 75 80

Leu Lys Asp Ser Leu Glu Lys Met Ser Arg Arg Glu Ile Lys Glu Leu

85 90 95 

Cys Asn Glu Leu Gly Met Thr Asn Ile Asp Lys Val Ile Gly Glu Asn

100 105 110

Lys Gln Val Leu Ile Ala Ser Thr Leu Thr Leu Phe Lys Ala Gly Gly

115 120 125 

Ser His Ser Tyr Ala Leu Ala Val Ser Val Ala Asp Ala Met Val Arg 

130 135 140 

Gln Thr Leu Gly His Val Met Trp Trp Val Lys
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 1149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 486 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149:

TTATTTTAAA GGAATTTC ATG CAA ATC ATA GAA GGG AAA TTG CAA TTA CAA 51 

Met Gln Ile Ile Glu Gly Lys Leu Gln Leu Gln
1 5 10



GGG AAT GAA AGA GTC GCT ATT TTA ACA TCG CGC TTC AAT CAT ATC ATC 99
Gly Asn Glu Arg Val Ala Ile Leu Thr Ser Arg Phe Asn His Ile Ile
15 20 25



ACA GAC AGA TTG CAA GAA GGG GCG ATG GAC TGC TTT AAA AGG CAT GGG 147

Thr Asp Arg Leu Gln Glu Gly Ala Met Asp Cys Phe Lys Arg His Gly
30 35 40 

GGC GAT GAG GAT CTT TTA GAC ATC GTG CTG GTG CCT GGG GCT TAT GAA 195 

Gly Asp Glu Asp Leu Leu Asp Ile Val Leu Val Pro Gly Ala Tyr Glu

45 50 55
TCA ACA GAA TAT AAA AGA TAT GGC CAT GAT TAC GCC AAA TAC CCA AGA 102
Ser Thr Glu Tyr Lys Arg Tyr Gly His Asp Tyr Ala Lys Tyr Pro Arg 
10 15 20 

AGA ATC GCT GAA GAA TTG CAA CAT TAT GGG GGC AAT AGT TTT GCG AAT 150

Arg Ile Ala Glu Glu Leu Gln His Tyr Gly Gly Asn Ser Phe Ala Asn
25 30 35 

TTT TTT AGA GAT GAA GGG GTC TTA TAC AAA GAG ATT TTG TGC GAT GCG 198 
Phe Phe Arg Asp Glu Gly Val Leu Tyr Lys Glu Ile Leu Cys Asp Ala
40 45 50 55 

TGC GAT CAT TTA AAG GTT AAT TAC AAT GAA GAA TCT GCA ACC TCT TTG 246

Cys Asp His Leu Lys Val Asn Tyr Asn Glu Glu Ser Ala Thr Ser Leu 
60 65 70 

ATT GAG CAA AAC ATG CTT TCT AAA CTC TTG AAA GAT AGT TTA GAA AAA 294

Ile Glu Gln Asn Met Leu Ser Lys Leu Leu Lys Asp Ser Leu Glu Lys
75 80 85 

ATG AGT AGG AGA GAG ATT AAA GAA CTT TGC AAT GAA TTG GGC ATG ACA 342 
Met Ser Arg Arg Glu Ile Lys Glu Leu Cys Asn Glu Leu Gly Met Thr
90 95 100 

AAT ATT GAT AAA GTG ATT GGT GAA AAC AAA CAA GTC CTA ATC GCA TCT 390

Asn Ile Asp Lys Val Ile Gly Glu Asn Lys Gln Val Leu Ile Ala Ser
105 110 115 

ACT TTA ACG CTG TTT AAA GCG GGT GGC TCT CAT TCT TAT GCG TTG GCT 438

Thr Leu Thr Leu Phe Lys Ala Gly Gly Ser His Ser Tyr Ala Leu Ala 
120 125 130 135 

GTA TCT GTT GCA GAT GCA ATG GTA AGA CAA ACT CTA GGG CAT GTT ATG 486

Val Ser Val Ala Asp Ala Met Val Arg Gln Thr Leu Gly His Val Met
140 145 150 

TGG TGG GTA AAG TAGCACTTAA AAAAACTTTG GGCGTTTTGG CTGGCCCTAT TGGTT 543
Trp Trp Val Lys 
155 

GGGT 547

(2) INFORMATION FOR SEQ ID NO: 1148:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148:
Met Asn Glu Asp Leu Thr Asn Ser Thr Glu Tyr Lys Arg Tyr Gly His
100 105 110

Arg Gly His Val Tyr Ser Leu Phe Ile Asn Glu Lys Gln Thr Cys Met

115 120 125 

Pro Leu Ser Val Leu Leu Val Ser Gly Gly His Ser Leu Ile Leu Glu

130 135 140

Ala Arg Asp Tyr Glu Asn Ile Lys Ile Val Ala Thr Ser Leu Asp Asp
145 150 155 160 

Ser Phe Gly Glu Ser Phe Asp Lys Val Ser Lys Met Leu Asp Leu Gly 

165 170 175 

Tyr Pro Gly Gly Pro Ile Val Glu Lys Leu Ala Leu Asp Tyr Arg His

180 185 190

Pro Asn Glu Pro Leu Met Phe Pro Ile Pro Leu Lys Asn Ser Pro Asn

195 200 205 

Leu Ala Phe Ser Phe Ser Gly Leu Lys Asn Ala Val Arg Leu Glu Val 

210 215 220 

Glu Lys Asn Ala Pro Asn Leu Asn Glu Ala Ile Lys Gln Lys Ile Gly
225 230 235 240

Tyr His Phe Gln Ser Ala Ala Ile Glu His Leu Ile Gln Gln Thr Lys

245 250 255

Arg Tyr Phe Lys Ile Lys Arg Pro Lys Ile Phe Gly Ile Val Gly Gly

260 265 270 

Ala Ser Gln Asn Leu Ala Leu Arg Lys Ala Phe Glu Asn Leu Cys Asp

275 280 285 

Ala Phe Asp Cys Lys Leu Val Leu Ala Pro Leu Glu Phe Cys Ser Asp 

290 295 300 

Asn Ala Ala Met Ile Gly Arg Ser Ser Leu Glu Ala Tyr Gln Lys Lys
305 310 315 320

Arg Phe Val Pro Leu Glu Lys Ala Asn Ile Ser Pro Arg Thr Leu Leu
325 330 335 

Lys Ser Phe Glu 
340 

(2) INFORMATION FOR SEQ ID NO: 1147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34... 498 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147: 

TGCACTTGTT ATGATGAAGA TGGCGCACTA AGA ATG AAT GAA GAC TTG ACA AAT 54 

Met Asn Glu Asp Leu Thr Asn 
1 5 
AAT GAA GCG ATC AAA CAA AAG ATT GGC TAT CAT TTT CAA AGT GCA GCG 774

Asn Glu Ala Ile Lys Gln Lys Ile Gly Tyr His Phe Gln Ser Ala Ala
235 240 245 

ATT GAG CAT TTA ATC CAG CAG ACT AAA CGC TAT TTT AAA ATC AAA CGC 822
Ile Glu His Leu Ile Gln Gln Thr Lys Arg Tyr Phe Lys Ile Lys Arg
250 255 260 

CCT AAA ATT TTT GGC ATT GTG GGG GGA GCG AGC CAA AAT TTG GCT TTA 870 
Pro Lys Ile Phe Gly Ile Val Gly Gly Ala Ser Gln Asn Leu Ala Leu
265 270 275 

AGA AAG GCG TTT GAA AAT TTG TGC GAT GCG TTT GAT TGC AAG CTT GTT 918 
Arg Lys Ala Phe Glu Asn Leu Cys Asp Ala Phe Asp Cys Lys Leu Val 
280 285 290 295 

TTA GCC CCT TTA GAA TTT TGC AGC GAC AAT GCC GCC ATG ATA GGG CGA 966

Leu Ala Pro Leu Glu Phe Cys Ser Asp Asn Ala Ala Met Ile Gly Arg
300 305 310 

TCC AGC CTA GAA GCT TAT CAA AAA AAG CGC TTT GTC CCT TTA GAA AAG 1014 
Ser Ser Leu Glu Ala Tyr Gln Lys Lys Arg Phe Val Pro Leu Glu Lys
315 320 325 

GCT AAC ATT TCG CCA AGA ACG CTG TTA AAA AGT TTT GAG TGAATGGATA CA 1065
Ala Asn Ile Ser Pro Arg Thr Leu Leu Lys Ser Phe Glu
330 335 340 

AAAAGAAAGC GCATGAT7UUV AC 1087 

(2) INFORMATION FOR SEQ ID NO: 1146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146:

Met Ile Leu Ser Ile Glu Ser Ser Cys Asp Asp Ser Ser Leu Ala Leu

1 5 10 15

Thr Arg Ile Glu Asp Ala Gln Leu Ile Ala His Phe Lys Ile Ser Gln

20 25 30 

Glu Lys His His Ser Ser Tyr Gly Gly Val Val Pro Glu Leu Ala Ser 

35 40 45 

Arg Leu His Ala Glu Asn Leu Pro Leu Leu Leu Glu Arg Ile Lys Ile

50 55 60

Ser Leu Asn Lys Asp Phe Ser Lys Ile Lys Ala Ile Ala Ile Thr Asn

65 70 75 80 

Gln Pro Gly Leu Ser Val Thr Leu Ile Glu Gly Leu Met Met Ala Lys

85 90 95

Ala Leu Ser Leu Ser Leu Asn Leu Pro Leu Ile Leu Glu Asp His Leu
TCT TGC GAT GAC AGC TCT TTA GCC CTT ACA AGA ATA GAG GAC GCT CAA 102
Ser Cys Asp Asp Ser Ser Leu Ala Leu Thr Arg Ile Glu Asp Ala Gln
10 15 20 

CTC ATC GCT CAT TTT AAA ATC TCT CAA GAA AAG CAC CAT AGT TCT TAT 150

Leu Ile Ala His Phe Lys Ile Ser Gln Glu Lys His His Ser Ser Tyr
25 30 35 

GGG GGC GTT GTG CCT GAG CTT GCA TCA CGT TTG CAT GCT GAG AAT TTG 198 
Gly Gly Val Val Pro Glu Leu Ala Ser Arg Leu His Ala Glu Asn Leu 
40 45 50 55 

CCG CTT TTA TTA GAA CGC ATT AAA ATA AGC TTG AAT AAG GAT TTT TCC 246

Pro Leu Leu Leu Glu Arg Ile Lys Ile Ser Leu Asn Lys Asp Phe Ser
60 65 70 

AAA ATT AAA GCC ATC GCT ATC ACT AAT CAG CCA GGT TTG AGC GTT ACT 294

Lys Ile Lys Ala Ile Ala Ile Thr Asn Gln Pro Gly Leu Ser Val Thr
75 80 85 

TTA ATA GAA GGT TTG ATG ATG GCA AAA GCC TTG AGC TTG TCT TTG AAT 342 
Leu Ile Glu Gly Leu Met Met Ala Lys Ala Leu Ser Leu Ser Leu Asn
90 95 100 

TTG CCC TTG ATT TTA GAA GAT CAT TTG AGA GGG CAT GTG TAT TCG CTC 390

Leu Pro Leu Ile Leu Glu Asp His Leu Arg Gly His Val Tyr Ser Leu
105 110 115 

TTT ATC AAT GAA AAA CAA ACC TGC ATG CCT TTA AGC GTG CTC TTA GTC 438

Phe Ile Asn Glu Lys Gln Thr Cys Met Pro Leu Ser Val Leu Leu Val
120 125 130 135 

TCT GGG GGG CAT TCT TTG ATT TTA GAG GCT AGA GAT TAT GAG AAT ATT 486

Ser Gly Gly His Ser Leu Ile Leu Glu Ala Arg Asp Tyr Glu Asn Ile
140 145 150 

AAA ATC GTT GCC ACG AGT TTA GAC GAT AGC TTT GGG GAG AGT TTT GAT 534

Lys Ile Val Ala Thr Ser Leu Asp Asp Ser Phe Gly Glu Ser Phe Asp
155 160 165 

AAG GTT TCC AAA ATG CTT GAT TTA GGC TAT CCA GGA GGC CCT ATA GTG 582

Lys Val Ser Lys Met Leu Asp Leu Gly Tyr Pro Gly Gly Pro Ile Val
170 175 180 

GAA AAA TTA GCC CTT GAT TAT AGG CAC CCA AAC GAG CCT TTA ATG TTC 630

Glu Lys Leu Ala Leu Asp Tyr Arg His Pro Asn Glu Pro Leu Met Phe 
185 190 195 

CCT ATC CCT TTA AAA AAC AGC CCG AAT CTG GCT TTT AGT TTT TCA GGT 678

Pro Ile Pro Leu Lys Asn Ser Pro Asn Leu Ala Phe Ser Phe Ser Gly
200 205 210 215 

TTA AAA AAT GCG GTG CGT TTG GAG GTT GAA AAA AAC GCC CCC AAC TTG 726

Leu Lys Asn Ala Val Arg Leu Glu Val Glu Lys Asn Ala Pro Asn Leu
220 225 230
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144:



Met Leu Phe Asn Gly Leu Cys Leu Phe Glu Gln Ala Ser Leu Cys Phe

1 5 10 15

Arg Lys Ala Ser Val Ser Met Lys Lys Leu Lys Gly Leu Phe Leu Ile

20 25 30 

Leu Leu Leu Trp Val Tyr Pro Leu Arg Ser Glu Pro Ile Asn Glu Gly

35 40 45

Ala Tyr Ile Leu Glu Glu Ile Gly Asp Val Leu Arg Phe Leu Pro Ile

50 55 60 

Phe Val Gly Thr Val Ser Leu Ala Met Arg Asp Tyr Arg Gly Leu Gly 
65 70 75 80 

Glu Leu Ala Val Gly Thr Leu Val Thr Gln Gly Val Ile Tyr Gly Leu

85 90 95 

Lys Gly Ala Phe Ser Asn Ala His Lys Asp Gly Ala Arg Val Glu Phe 

100 105 110 

Ala Lys Arg Pro Cys Cys Asn Ser Trp Arg Gly Met Pro Ser Gly His 

115 120 125 

Ala Gly Gly Val Phe Ser Ala Ala Gly Phe Val Tyr Tyr Arg Tyr Gly 

130 135 140 

Trp Lys Pro Ala Leu Pro Val Ile Ala Leu Ala Ile Leu Thr Asp Ala
145 150 155 160

Ser Arg Val Val Ala Arg Gln His Thr Ile Leu Gln Val Thr Ile Gly

165 170 175

Ser Leu Ile Ala Trp Gly Phe Ala Tyr Leu Phe Thr Ser Arg Tyr Lys

180 185 190 

Pro Lys Gln Trp Met Leu Tyr Pro Glu Ile Ser Ser Asp Phe Lys Gly

195 200 205

Ser Ser Arg Tyr Gly Val Ser Phe Ser Tyr Gln Trp
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 1145:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34... 1053 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145: 



TTCAAGCAAA AACACCACCC AAATATAAAG ATA ATG ATT TTA AGC ATT GAA AGT 54 

Met Ile Leu Ser Ile Glu Ser
1 5
Pro Ile Asn Glu Gly Ala Tyr Ile Leu Glu Glu Ile Gly Asp Val Leu
45 50 55 

AGG TTT TTG CCT ATT TTT GTA GGC ACG GTC AGT TTG GCG ATG CGC GAT 243

Arg Phe Leu Pro Ile Phe Val Gly Thr Val Ser Leu Ala Met Arg Asp
60 65 70 75 

TAT AGA GGG TTA GGG GAA TTA GCG GTC GGC ACA TTG GTT ACT CAA GGC 291

Tyr Arg Gly Leu Gly Glu Leu Ala Val Gly Thr Leu Val Thr Gln Gly
80 85 90 

GTG ATT TAT GGC CTT AAA GGA GCT TTT AGC AAC GCC CAT AAA GAT GGG 339

Val Ile Tyr Gly Leu Lys Gly Ala Phe Ser Asn Ala His Lys Asp Gly
95 100 105 

GCT AGA GTG GAA TTT GCT AAA CGC CCG TGC TGT AAT TCT TGG AGA GGC 387

Ala Arg Val Glu Phe Ala Lys Arg Pro Cys Cys Asn Ser Trp Arg Gly 
110 115 120 

ATG CCA AGC GGG CAT GCT GGG GGG GTG TTT AGC GCG GCT GGG TTT GTG 435

Met Pro Ser Gly His Ala Gly Gly Val Phe Ser Ala Ala Gly Phe Val 
125 130 135 

TAT TAC CGC TAT GGG TGG AAG CCG GCT CTT CCT GTG ATC GCT CTT GCA 483

Tyr Tyr Arg Tyr Gly Trp Lys Pro Ala Leu Pro Val Ile Ala Leu Ala
140 145 150 155 

ATC CTC ACT GAC GCT AGC AGA GTG GTG GCA AGA CAA CAC ACG ATC TTG 531 
Ile Leu Thr Asp Ala Ser Arg Val Val Ala Arg Gln His Thr Ile Leu
160 165 170 

CAA GTT ACG ATC GGC AGC CTT ATC GCA TGG GGG TTT GCT TAT TTA TTC 579

Gln Val Thr Ile Gly Ser Leu Ile Ala Trp Gly Phe Ala Tyr Leu Phe
175 180 185 

ACT TCA CGC TAC AAA CCC AAG CAA TGG ATG CTC TAT CCT GAA ATT TCT 627

Thr Ser Arg Tyr Lys Pro Lys Gln Trp Met Leu Tyr Pro Glu Ile Ser
190 195 200 

AGC GAT TTT AAG GGC AGT AGC CGC TAT GGG GTG AGC TTT TCT TAT CAA 675 
Ser Asp Phe Lys Gly Ser Ser Arg Tyr Gly Val Ser Phe Ser Tyr Gln
205 210 215 

TGG TAAAGGGATA AAGTGCTAAA AAAATTATTA TTCATTGCAC T 719 

Trp 

220 



(2) INFORMATION FOR SEQ ID NO:1144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear




WO 98/43478



PCT/US98/06371 



Phe Leu Gln Leu Leu Arg Ala Tyr Ala Val Phe Ser Asn Glu Gly Lys

450 455 460

Leu Thr Thr Pro Tyr Leu Val Gln Arg Glu Thr Ala Pro Asn Gly Asp
465 470 475 480

Ile Tyr Ile Pro Ser Pro Lys Pro Thr Phe Gln Val Ile Ser Pro Lys

485 490 495 

Ser Ala Arg Lys Met Lys Glu Thr Leu Ile Lys Val Val Arg Tyr Gly

500 505 510

Thr Gly Lys Asn Ala Gln Phe Glu Gly Leu Tyr Ile Gly Gly Lys Thr

515 520 525 

Gly Thr Ala Arg Val Ala Lys Asn Gly Ser Tyr Ser Ala Gln Ser Tyr

530 535 540

Asn Ser Ser Phe Phe Gly Phe Ala Glu Asp Glu Arg Gln Val Phe Thr
545 550 555 560

Ile Gly Val Val Ile Leu Gly Ser His Gly Lys Glu Glu Tyr Tyr Ala

565 570 575 

Ser Lys Ile Ala Ala Pro Ile Phe Lys Glu Ile Thr Glu Ile Leu Val

580 585 590

Arg Tyr Asn Tyr Leu Ser Pro Ser Ile Ala Ile Gln Asn Ala Leu Glu

595 600 605

Lys Asn Arg Phe Lys Ile Lys
610 615 

(2) INFORMATION FOR SEQ ID NO: 1143:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19... 678 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143:

TAGCCTAGTC TTTATCGC ATG CTA TTT AAT GGG CTA TGC TTA TTT GAA CAG 51

Met Leu Phe Asn Gly Leu Cys Leu Phe Glu Gln
1 5 10

GCA AGT TTG TGC TTT AGA AAA GCG AGC GTT TCA ATG AAA AAG CTC AAA 99 
Ala Ser Leu Cys Phe Arg Lys Ala Ser Val Ser Met Lys Lys Leu Lys 
15 20 25 

GGT CTT TTT TTG ATC CTG CTC TTA TGG GTC TAT CCT TTA AGG AGT GAG 147

Gly Leu Phe Leu Ile Leu Leu Leu Trp Val Tyr Pro Leu Arg Ser Glu
30 35 40 

CCA ATC AAT GAG GGA GCA TAC ATT TTA GAA GAG ATT GGC GAT GTG CTT 195 
1 5 10 15

Glu Thr Gln Lys Tyr Lys Gly Thr Val Thr Ala Leu Ile Phe Leu Leu
20 25 30

Leu Phe Phe Ile Phe Leu Met Val Ala Phe Lys Lys Ala Phe Phe Ala
35 40 45

Gln Ala Asn Met Pro Asn Leu Val Met Ser Lys Gln Asp Thr Ala Ala
50 55 60

Arg Gly Thr Ile Tyr Ser Gln Asp Asn Tyr Ser Leu Ala Thr Ser Gln
65 70 75 80

Thr Leu Phe Lys Leu Gly Phe Asp Thr Arg Phe Leu Asn Pro Asp Lys
85 90 95

Glu Asp Phe Phe Ile Asp Phe Leu Ser Ile Tyr Ser Asn Ile Pro Lys
100 105 110

Lys Ser Leu Lys Asp Ala Ile Asn Thr Lys Gly Tyr Ile Ile Leu Ala
115 120 125

Tyr Asp Leu Thr Pro Asn Met Ala Ala Asn Ile Arg Asp Leu Asn Lys
130 135 140

Lys Phe Leu Ala Phe Gly Val Phe Gln Asn Phe Lys Asp Ala His Asp
145 150 155 160

Lys Val Trp Gln Lys Gln Gly Leu Asn Ile Glu Val Ser Gly Val Ser
165 170 175

Arg His Tyr Pro Tyr Gln Asn Ser Leu Glu Pro Ile Ile Gly Tyr Val
180 185 190

Gln Lys Gln Glu Glu Asp Lys Leu Thr Leu Thr Thr Gly Lys Lys Gly
195 200 205

Val Glu Lys Ser Gln Asp His Leu Leu Lys Ala Gln Gln Asn Gly Ile
210 215 220

Arg Thr Gly Lys Arg Asp Val Ser Phe Asn Phe Ile Gln Asn His Ser
225 230 235 240

Tyr Thr Glu Val Glu Arg Leu Asp Gly Tyr Glu Val Tyr Leu Ser Val
245 250 255

Pro Leu Lys Leu Gln Arg Glu Ile Glu Thr Leu Leu Asp Lys Thr Lys
260 265 270

Asp Lys Leu Lys Ala Lys Glu Ile Leu Val Gly Ile Ile Asn Pro Lys
275 280 285

Ser Gly Glu Ile Leu Ser Leu Ala Ser Ser Lys Arg Phe Asn Pro Asn
290 295 300

Ala Ile Lys Thr Ser Asp Tyr Glu Ser Leu Asn Leu Ser Val Ala Glu
305 310 315 320

Lys Val Phe Glu Pro Gly Ser Thr Ile Lys Pro Ile Val Tyr Ser Leu
325 330 335

Leu Leu Asp Lys Asn Leu Ile Asn Pro Lys Glu Arg Ile Asp Leu Asn
340 345 350

His Gly Tyr Tyr Gln Leu Gly Lys Tyr Thr Ile Lys Asp Asp Phe Ile
355 360 365

Pro Ser Lys Lys Ala Val Val Glu Asp Ile Leu Ile Gln Ser Ser Asn
370 375 380

Val Gly Met Ile Lys Ile Ser Lys Asn Leu Asn Pro Lys Asp Phe Tyr
385 390 395 400

Asn Gly Leu Leu Gly Tyr Gly Phe Ser Gln Lys Thr Gly Ile Asp Leu
405 410 415

Ser Leu Glu Ala Thr Gly Lys Ile Pro Pro Leu Ser Ala Phe Lys Arg
420 425 430

Glu Val Leu Lys Gly Ser Val Ser Tyr Gly Tyr Gly Leu Asn Ala Thr
435 440 445
GAA GGC AAA TTG ACT ACC CCC TAT TTA GTG CAA CGA GAA ACC GCC CCT 1443

Glu Gly Lys Leu Thr Thr Pro Tyr Leu Val Gln Arg Glu Thr Ala Pro
465 470 475 

AAT GGC GAT ATT TAC ATC CCT AGC CCC AAA CCC ACC TTT CAA GTC ATT 1491

Asn Gly Asp Ile Tyr Ile Pro Ser Pro Lys Pro Thr Phe Gln Val Ile
480 485 490 

AGC CCA AAA AGC GCT AGG AAA ATG AAA GAA ACC TTA ATC AAA GTA GTG 1539

Ser Pro Lys Ser Ala Arg Lys Met Lys Glu Thr Leu Ile Lys Val Val
495 500 505 

CGT TAT GGC ACA GGC AAA AAC GCT CAA TTT GAA GGG CTA TAC ATA GGG 1587

Arg Tyr Gly Thr Gly Lys Asn Ala Gln Phe Glu Gly Leu Tyr Ile Gly
510 515 520 525 

GGC AAA ACC GGC ACG GCT AGG GTC GCT AAA AAC GGG AGT TAT AGC GCG 1635

Gly Lys Thr Gly Thr Ala Arg Val Ala Lys Asn Gly Ser Tyr Ser Ala 
530 535 540 

CAG TCC TAC AAC AGC TCT TTT TTT GGG TTT GCT GAA GAT GAA AGG CAG 1683

Gln Ser Tyr Asn Ser Ser Phe Phe Gly Phe Ala Glu Asp Glu Arg Gln
545 550 555 

GTT TTT ACT ATC GGC GTG GTT ATC TTA GGT TCG CAT GGC AAG GAA GAA 1731

Val Phe Thr Ile Gly Val Val Ile Leu Gly Ser His Gly Lys Glu Glu
560 565 570 

TAT TAC GCC AGC AAG ATT GCA GCC CCC ATT TTT AAA GAA ATC ACC GAA 1779

Tyr Tyr Ala Ser Lys Ile Ala Ala Pro Ile Phe Lys Glu Ile Thr Glu
575 580 585 

ATT TTA GTG CGT TAC AAT TAT CTA TCG CCC TCT ATT GCG ATT CAA AAC 1827

Ile Leu Val Arg Tyr Asn Tyr Leu Ser Pro Ser Ile Ala Ile Gln Asn
590 595 600 605 

GCG CTC GAG AAA AAC CGC TTT AAG ATA AAA TAAAAGGCTC TTTTCAACCC AAA 1880
Ala Leu Glu Lys Asn Arg Phe Lys Ile Lys
610 615 

CTCCAAAAAA GGAGTCTTAA GTT 1903 

(2) INFORMATION FOR SEQ ID NO: 1142:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142:
Met Asp Asn Arg Asn Ile Asp Pro Tyr Phe Asn Pro Glu Gln Phe Leu
Asn His Ser Tyr Thr Glu Val Glu Arg Leu Asp Gly Tyr Glu Val Tyr
240 245 250

TTG AGC GTT CCT TTA AAA CTC CAA AGA GAA ATT GAA ACC CTA TTG GAT 819
Leu Ser Val Pro Leu Lys Leu Gln Arg Glu Ile Glu Thr Leu Leu Asp
255 260 265

AAA ACT AAA GAC AAA CTC AAG GCT AAA GAA ATC CTA GTG GGT ATC ATT 867
Lys Thr Lys Asp Lys Leu Lys Ala Lys Glu Ile Leu Val Gly Ile Ile
270 275 280 285

AAC CCT AAA AGC GGG GAA ATT TTA TCG CTA GCT TCA AGC AAG CGC TTC 915
Asn Pro Lys Ser Gly Glu Ile Leu Ser Leu Ala Ser Ser Lys Arg Phe
290 295 300

AAT CCT AAT GCG ATT AAA ACC AGC GAT TAT GAA AGC TTG AAT TTG AGC 963
Asn Pro Asn Ala Ile Lys Thr Ser Asp Tyr Glu Ser Leu Asn Leu Ser
305 310 315

GTT GCT G7^ AAG GTT TTT GAG CCA GGC AGC ACG ATC AAA CCC ATT GTT 1011
Val Ala Glu Lys Val Phe Glu Pro Gly Ser Thr Ile Lys Pro Ile Val
320 325 330

TAT TCC TTG CTG TTA GAC AAG AAT TTG ATC AAC CCC AAA GAA CGC ATT 1059
Tyr Ser Leu Leu Leu Asp Lys Asn Leu Ile Asn Pro Lys Glu Arg Ile
335 340 345

GAT TTA AAC CAT GGC TAT TAC CAA TTA GGA AAA TAC ACC ATT AAA GAC 1107
Asp Leu Asn His Gly Tyr Tyr Gln Leu Gly Lys Tyr Thr Ile Lys Asp
350 355 360 365

GAC TTT ATC CCC AGT AAA AAA GCC GTT GTG GAA GAC ATT TTG ATC CAA 1155
Asp Phe Ile Pro Ser Lys Lys Ala Val Val Glu Asp Ile Leu Ile Gln
370 375 380

TCT AGC AAT GTG GGC ATG ATA AAA ATC AGT AAA AAC TTA AAC CCA AAG 1203
Ser Ser Asn Val Gly Met Ile Lys Ile Ser Lys Asn Leu Asn Pro Lys
385 390 395

GAT TTC TAT AAT GGG CTT TTA GGC TAT GGA TTT TCT CAA AAA ACC GGC 1251
Asp Phe Tyr Asn Gly Leu Leu Gly Tyr Gly Phe Ser Gln Lys Thr Gly
400 405 410

ATT GAT TTA TCT CTA GAA GCC ACA GGA AAG ATC CCT CCT TTG TCC GCT 1299
Ile Asp Leu Ser Leu Glu Ala Thr Gly Lys Ile Pro Pro Leu Ser Ala
415 420 425

TTC AAG CGT GAA GTG TTA AAG GGG AGC GTT TCT TAT GGC TAT GGG CTG 1347
Phe Lys Arg Glu Val Leu Lys Gly Ser Val Ser Tyr Gly Tyr Gly Leu
430 435 440 445

AAC GCG ACT TTT TTG CAG CTT TTA AGG GCT TAT GCG GTG TTT TCT AAT 1395
Asn Ala Thr Phe Leu Gln Leu Leu Arg Ala Tyr Ala Val Phe Ser Asn
450 455 460
15 20 25

TTT TTA TTG CTT TTT TTT ATT TTT TTA ATG GTG GCT TTT AAA AAA GCT 147
Phe Leu Leu Leu Phe Phe Ile Phe Leu Met Val Ala Phe Lys Lys Ala
30 35 40 45

TTT TTT GCC CAA GCC AAC ATG CCT AAT CTA GTG ATG AGC AAA CAA GAC 195
Phe Phe Ala Gln Ala Asn Met Pro Asn Leu Val Met Ser Lys Gln Asp
50 55 60

ACT GCG GCT AGG GGG ACT ATC TAT AGT CAA GAC AAC TAC AGC CTA GCC 243
Thr Ala Ala Arg Gly Thr Ile Tyr Ser Gln Asp Asn Tyr Ser Leu Ala
65 70 75

ACT TCA CAA ACC CTT TTC AAA CTG GGC TTT GAT ACA AGG TTT TTA AAC 291
Thr Ser Gln Thr Leu Phe Lys Leu Gly Phe Asp Thr Arg Phe Leu Asn
80 85 90

CCG GAT AAA GAA GAT TTT TTC ATT GAT TTC CTT TCT ATT TAT AGC AAT 339
Pro Asp Lys Glu Asp Phe Phe Ile Asp Phe Leu Ser Ile Tyr Ser Asn
95 100 105

ATC CCT AAA AAG TCC TTA AAA GAC GCC ATC AAT ACA AAA GGC TAT ATC 387
Ile Pro Lys Lys Ser Leu Lys Asp Ala Ile Asn Thr Lys Gly Tyr Ile
110 115 120 125

ATT CTA GCC TAT GAT CTC ACG CCC AAT ATG GCT GCT AAT ATT AGA GAC 435
Ile Leu Ala Tyr Asp Leu Thr Pro Asn Met Ala Ala Asn Ile Arg Asp
130 135 140

TTA AAT AAG AAA TTT TTA GCC TTT GGG GTT TTT CAA AAT TTC AAA GAC 483
Leu Asn Lys Lys Phe Leu Ala Phe Gly Val Phe Gln Asn Phe Lys Asp
145 150 155

GCG CAC GAT AAG GTG TGG CAA AAG CAA GGG CTA AAC ATT GAA GTG AGC 531
Ala His Asp Lys Val Trp Gln Lys Gln Gly Leu Asn Ile Glu Val Ser
160 165 170

GGC GTT TCT AGG CAT TAC CCT TAT CAA AAT AGC CTA GAG CCA ATC ATT 579
Gly Val Ser Arg His Tyr Pro Tyr Gln Asn Ser Leu Glu Pro Ile Ile
175 180 185

GGC TAT GTG CAA AAA CAA GAA GAA GAC AAG CTC ACT TTA ACT ACC GGT 627
Gly Tyr Val Gln Lys Gln Glu Glu Asp Lys Leu Thr Leu Thr Thr Gly
190 195 200 205

AAA AAA GGC GTT GAA AAA TCT CAA GAT CAC TTG CTT AAA GCC CAA CAA 675
Lys Lys Gly Val Glu Lys Ser Gln Asp His Leu Leu Lys Ala Gln Gln
210 215 220

AAT GGC ATA AGA ACA GGC AAA AGA GAC GTG AGT TTT AAC TTT ATC CAA 723
Asn Gly Ile Arg Thr Gly Lys Arg Asp Val Ser Phe Asn Phe Ile Gln
225 230 235

AAC CAC TCT TAT ACA GAG GTT GAA CGC CTT GAT GGC TAT GAG GTG TAT 771
225 230 235 240

Ser Gly Ile His Ala Thr Ile Ala Ala Val Ile Leu Ala Phe Met Ile

245 250 255 

Pro Val Lys Ile Pro Lys Asp Ser Lys Asn Val Glu Leu Leu Glu Leu

260 265 270 

Gly Lys Arg Tyr Ala Glu Thr Ser Ser Gly Ala Leu Leu Ser Lys Glu 

275 280 285 

Gln Gln Glu Ile Leu His Ser Ile Glu Glu Lys Ala Ser Ala Leu Gln

290 295 300

Ser Pro Leu Glu Arg Leu Glu His Phe Leu Ala Pro Ile Ser Gly Tyr
305 310 315 320

Phe Ile Met Pro Leu Phe Ala Phe Ala Asn Ala Gly Val Ser Val Asp

325 330 335

Ser Ser Ile Asn Leu Glu Val Asp Lys Val Leu Leu Gly Val Ile Leu

340 345 350 

Gly Leu Cys Leu Gly Lys Pro Leu Gly Ile Phe Leu Ile Thr Phe Ile

355 360 365

Ser Glu Lys Leu Lys Ile Thr Ala Arg Pro Lys Gly Ile Ser Trp Trp

370 375 380

His Ile Leu Gly Ala Gly Leu Leu Ala Gly Ile Gly Phe Thr Met Ser
385 390 395 400

Met Phe Ile Ser Asn Leu Ala Phe Thr Ser Glu His Lys Asp Ala Met

405 410 415

Glu Val Ala Lys Ile Ala Ile Leu Leu Gly Ser Leu Ile Ser Gly Ile

420 425 430

Ile Gly Ala Leu Tyr Leu Phe Ala Leu Asp Lys Arg Ala Ala Leu Lys
435 440 445 

Lys 



(2) INFORMATION FOR SEQ ID NO: 1141: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1903 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 13... 1857
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141: 



AAGTAAGTGC TT ATG GAT AAT AGG AAT ATT GAT CCT TAC TTC AAC CCA GAG 51
Met Asp Asn Arg Asn Ile Asp Pro Tyr Phe Asn Pro Glu
1 5 10

CAA TTT TTA GAA ACC CAA AAA TAC AAA GGC ACG GTT ACA GCA TTA ATC 99

Gln Phe Leu Glu Thr Gln Lys Tyr Lys Gly Thr Val Thr Ala Leu Ile
CAT AAA GAC GCT ATG GAA GTG GCA AAA ATT GCG ATT TTA CTC GGA TCT 1299

His Lys Asp Ala Met Glu Val Ala Lys Ile Ala Ile Leu Leu Gly Ser
415 420 425 

TTG ATT TCT GGG ATC ATA GGG GCT TTG TAT TTA TTT GCG CTA GAC AAA 1347

Leu Ile Ser Gly Ile Ile Gly Ala Leu Tyr Leu Phe Ala Leu Asp Lys
430 435 440 

AGA GCG GCT TTA AAG AAA TAGCTAAAAA TGCTATAATT TGAGATTAAA ACATCTTT 1403
Arg Ala Ala Leu Lys Lys 
445 

TAAGGAAATT AAATGGGACA AATT 1427

(2) INFORMATION FOR SEQ ID NO: 1140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 449 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140: 

Met Gly Leu Lys Leu Lys Ile Leu Arg Leu Ser Met Asn Leu Lys Lys

1 5 10 15

Thr Glu Asn Ala Leu Ser Leu Thr Leu Lys Asn Phe Ile Lys Ser Glu

20 25 30

Ser Phe Gly Gly Ile Phe Leu Phe Leu Asn Ala Val Leu Ala Met Val

35 40 45 

Val Ala Asn Ser Phe Leu Lys Glu Ser Tyr Phe Ala Leu Trp His Thr 

50 55 60 

Pro Phe Gly Phe Gln Ile Gly Asp Phe Phe Ile Gly Phe Ser Leu His
65 70 75 80

Asn Trp Ile Asp Asp Val Leu Met Ala Leu Phe Phe Leu Met Ile Gly

85 90 95

Leu Glu Ile Lys Arg Glu Leu Leu Phe Gly Glu Leu Ser Ser Phe Lys

100 105 110

Lys Ala Ser Phe Pro Val Ile Ala Ala Ile Gly Gly Met Ile Ala Pro

115 120 125

Gly Leu Ile Tyr Phe Phe Leu Asn Ala Asn Thr Pro Ser Gln His Gly

130 135 140 

Phe Gly Ile Pro Met Ala Thr Asp Ile Ala Phe Ala Leu Gly Val Ile
145 150 155 160

Met Leu Leu Gly Lys Arg Val Pro Thr Ala Leu Lys Val Phe Leu Ile

165 170 175

Thr Leu Ala Val Ala Asp Asp Leu Gly Ala Ile Val Val Ile Ala Leu

180 185 190 

Phe Tyr Thr Thr Asn Leu Lys Phe Ala Trp Leu Leu Gly Ala Leu Gly 

195 200 205 

Val Val Leu Val Leu Ala Val Leu Asn Arg Leu Asn Met Arg Ser Leu 

210 215 220 

Ile Pro Tyr Leu Leu Leu Gly Val Leu Leu Trp Phe Cys Val His Gln
GTG GTG ATC GCG CTC TTT TAT ACC ACG AAT TTA AAA TTC GCA TGG CTT 627

Val Val Ile Ala Leu Phe Tyr Thr Thr Asn Leu Lys Phe Ala Trp Leu
190 195 200 

TTA GGG GCT TTA GGG GTG GTT CTT GTT TTA GCC GTA TTA AAC CGC CTG 675

Leu Gly Ala Leu Gly Val Val Leu Val Leu Ala Val Leu Asn Arg Leu 
205 210 215 

AAT ATG CGC TCG CTC ATC CCT TAC TTG CTT TTA GGG GTG TTG CTT TGG 723

Asn Met Arg Ser Leu Ile Pro Tyr Leu Leu Leu Gly Val Leu Leu Trp
220 225 230 235 

TTT TGC GTG CAT CAA AGC GGT ATC CAT GCG ACG ATT GCT GCA GTG ATT 771 
Phe Cys Val His Gln Ser Gly Ile His Ala Thr Ile Ala Ala Val Ile
240 245 250 

CTA GCT TTT ATG ATA CCG GTG AAG ATC CCT AAA GAT TCT AAA AAT GTA 819 
Leu Ala Phe Met Ile Pro Val Lys Ile Pro Lys Asp Ser Lys Asn Val
255 260 265 

GAG CTT TTG GAA CTA GGC AAA CGA TAC GCA GAA ACG AGT TCA GGA GCG 867 
Glu Leu Leu Glu Leu Gly Lys Arg Tyr Ala Glu Thr Ser Ser Gly Ala 
270 275 280 

CTT TTG AGT AAA GAG CAG CAA GAA ATC TTG CAT TCT ATT GAA GAA AAA 915 
Leu Leu Ser Lys Glu Gln Gln Glu Ile Leu His Ser Ile Glu Glu Lys
285 290 295 

GCG AGC GCC TTA CAA AGC CCC TTA GAA AGA TTG GAG CAT TTT CTA GCC 963 
Ala Ser Ala Leu Gln Ser Pro Leu Glu Arg Leu Glu His Phe Leu Ala
300 305 310 315 

CCC ATT AGC GGG TAT TTC ATC ATG CCC TTA TTC GCG TTT GCA AAC GCT 1011 
Pro Ile Ser Gly Tyr Phe Ile Met Pro Leu Phe Ala Phe Ala Asn Ala
320 325 330 

GGG GTG AGC GTT GAT TCT AGC ATC AAT TTA GAA GTG GAT AAG GTG CTT 1059 
Gly Val Ser Val Asp Ser Ser Ile Asn Leu Glu Val Asp Lys Val Leu
335 340 345 

TTA GGG GTT ATT TTA GGG CTT TGC TTG GGC AAA CCT TTA GGG ATT TTC 1107

Leu Gly Val Ile Leu Gly Leu Cys Leu Gly Lys Pro Leu Gly Ile Phe
350 355 360 

TTA ATC ACT TTT ATA AGC GAA AAG CTT AAA ATC ACC GCA CGC CCT AAA 1155

Leu Ile Thr Phe Ile Ser Glu Lys Leu Lys Ile Thr Ala Arg Pro Lys
365 370 375 

GGC ATC AGC TGG TGG CAT ATT TTA GGG GCT GGG CTT TTA GCA GGG ATT 1203

Gly Ile Ser Trp Trp His Ile Leu Gly Ala Gly Leu Leu Ala Gly Ile
380 385 390 395 

GGC TTT ACC ATG TCT ATG TTT ATT TCT AAT CTG GCC TTC ACG AGC GAG 1251

Gly Phe Thr Met Ser Met Phe Ile Ser Asn Leu Ala Phe Thr Ser Glu
400 405 410 
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19... 1365 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1139: 

AAGACTGCTT GAAAAATT ATG GGT CTG AAA TTA AAA ATT TTA AGG TTG TCT 51 

Met Gly Leu Lys Leu Lys Ile Leu Arg Leu Ser
1 5 10

ATG AAT CTC AAA AAA ACA GAA AAC GCG CTC AGT TTG ACG CTT AAG AAC 99

Met Asn Leu Lys Lys Thr Glu Asn Ala Leu Ser Leu Thr Leu Lys Asn 
15 20 25 

TTC ATT AAA AGC GAG TCT TTT GGA GGG ATT TTC CTC TTT TTA AAC GCT 147

Phe Ile Lys Ser Glu Ser Phe Gly Gly Ile Phe Leu Phe Leu Asn Ala
30 35 40 

GTT TTA GCG ATG GTG GTG GCT AAT TCG TTT TTA AAA GAA AGT TAT TTT 195 
Val Leu Ala Met Val Val Ala Asn Ser Phe Leu Lys Glu Ser Tyr Phe 
45 50 55 

GCA CTA TGG CAC ACC CCT TTT GGG TTT CAA ATA GGG GAT TTT TTC ATC 243

Ala Leu Trp His Thr Pro Phe Gly Phe Gln Ile Gly Asp Phe Phe Ile
60 65 70 75 

GGC TTT AGT TTG CAC AAC TGG ATT GAT GAT GTC TTA ATG GCG TTA TTC 291

Gly Phe Ser Leu His Asn Trp Ile Asp Asp Val Leu Met Ala Leu Phe
80 85 90 

TTT TTA ATG ATA GGC TTA GAA ATC AAA CGA GAA TTG TTG TTT GGG GAA 339

Phe Leu Met Ile Gly Leu Glu Ile Lys Arg Glu Leu Leu Phe Gly Glu
95 100 105 

TTA TCC AGT TTC AAA AAA GCT TCT TTT CCT GTG ATT GCG GCC ATA GGG 387

Leu Ser Ser Phe Lys Lys Ala Ser Phe Pro Val Ile Ala Ala Ile Gly
110 115 120 

GGC ATG ATA GCC CCA GGA TTG ATT TAT TTT TTT CTT AAC GCT AAC ACG 435

Gly Met Ile Ala Pro Gly Leu Ile Tyr Phe Phe Leu Asn Ala Asn Thr
125 130 135 

CCT TCC CAG CAT GGT TTT GGG ATC CCT ATG GCG ACG GAT ATT GCG TTC 483

Pro Ser Gln His Gly Phe Gly Ile Pro Met Ala Thr Asp Ile Ala Phe
140 145 150 155 

GCT TTA GGC GTG ATC ATG CTT TTA GGC AAG AGG GTG CCA ACC GCT TTA 531 
Ala Leu Gly Val lie Met Leu Leu Gly Lys Arg Val Pro Thr Ala Leu 
160 165 170 

AAG GTT TTT TTA ATC ACT CTA GCG GTG GCT GAT GAC TTG GGG GCT ATT 57 9 

Lys Val Phe Leu lie Thr Leu Ala Val Ala Asp Asp Leu Gly Ala He 
175 ISO 185 
Ile Ile Lys Ala Arg Ile Thr Ala Ala Ile Val Ala Trp Ile Pro Leu
180 185 190

GATGCACGCA AAACCAAAGC AACACCCCTA AAAGC 670

(2) INFORMATION FOR SEQ ID NO: 1138:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1138: 

Met Ile Pro Glu Ile Lys Asp Pro Ser Lys Ile Ala Ile Phe Ala Thr

1 5 10 15

Ser Ile Ala Ser Leu Cys Ser Leu Val Lys Ala Arg Leu Glu Ile Asn

20 25 30

Ile Asp Met Val Lys Pro Ile Pro Ala Lys Ser Pro Ala Pro Lys Ile

35 40 45

Cys His Gln Leu Met Pro Leu Gly Arg Ala Val Ile Leu Ser Phe Ser

50 55 60

Leu Ile Lys Val Ile Lys Lys Ile Pro Lys Gly Leu Pro Lys Gln Ser
65 70 75 80

Pro Lys Ile Thr Pro Lys Ser Thr Leu Ser Thr Ser Lys Leu Met Leu

85 90 95

Glu Ser Thr Leu Thr Pro Ala Phe Ala Asn Ala Asn Lys Gly Met Met

100 105 110

Lys Tyr Pro Leu Met Gly Ala Arg Lys Cys Ser Asn Leu Ser Lys Gly

115 120 125

Leu Cys Lys Ala Leu Ala Phe Ser Ser Ile Glu Cys Lys Ile Ser Cys

130 135 140

Cys Ser Leu Leu Lys Ser Ala Pro Glu Leu Val Ser Ala Tyr Arg Leu
145 150 155 160

Pro Ser Ser Lys Ser Ser Thr Phe Leu Glu Ser Leu Gly Ile Phe Thr

165 170 175

Gly Ile Ile Lys Ala Arg Ile Thr Ala Ala Ile Val Ala Trp Ile Pro
180 185 190 

Leu 



(2) INFORMATION FOR SEQ ID NO: 1139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137: 

TAGCTATTTC TTTAAAGCCG CTCTTTTGTC TAGCGCAAAT AAATACAAAG CCCCT ATG 58

Met
1

ATC CCA GAA ATC AAA GAT CCG AGT AAA ATC GCA ATT TTT GCC ACT TCC 106
Ile Pro Glu Ile Lys Asp Pro Ser Lys Ile Ala Ile Phe Ala Thr Ser
5 10 15

ATA GCG TCT TTA TGC TCG CTC GTG AAG GCC AGA TTA GAA ATA AAC ATA 154
Ile Ala Ser Leu Cys Ser Leu Val Lys Ala Arg Leu Glu Ile Asn Ile
20 25 30

GAC ATG GTA AAG CCA ATC CCT GCT AAA AGC CCA GCC CCT AAA ATA TGC 202

Asp Met Val Lys Pro Ile Pro Ala Lys Ser Pro Ala Pro Lys Ile Cys
35 40 45

CAC CAG CTG ATG CCT TTA GGG CGT GCG GTG ATT TTA AGC TTT TCG CTT 250

His Gln Leu Met Pro Leu Gly Arg Ala Val Ile Leu Ser Phe Ser Leu
50 55 60 65

ATA AAA GTG ATT AAG AAA ATC CCT AAA GGT TTG CCC AAG CAA AGC CCT 298

Ile Lys Val Ile Lys Lys Ile Pro Lys Gly Leu Pro Lys Gln Ser Pro
70 75 80

AAA ATA ACC CCT AAA AGC ACC TTA TCC ACT TCT AAA TTG ATG CTA GAA 346

Lys Ile Thr Pro Lys Ser Thr Leu Ser Thr Ser Lys Leu Met Leu Glu
85 90 95

TCA ACG CTC ACC CCA GCG TTT GCA AAC GCG AAT AAG GGC ATG ATG AAA 394

Ser Thr Leu Thr Pro Ala Phe Ala Asn Ala Asn Lys Gly Met Met Lys
100 105 110

TAC CCG CTA ATG GGG GCT AGA AAA TGC TCC AAT CTT TCT AAG GGG CTT 442

Tyr Pro Leu Met Gly Ala Arg Lys Cys Ser Asn Leu Ser Lys Gly Leu
115 120 125

TGT AAG GCG CTC GCT TTT TCT TCA ATA GAA TGC AAG ATT TCT TGC TGC 490

Cys Lys Ala Leu Ala Phe Ser Ser Ile Glu Cys Lys Ile Ser Cys Cys
130 135 140 145

TCT TTA CTC AAA AGC GCT CCT GAA CTC GTT TCT GCG TAT CGT TTG CCT 538

Ser Leu Leu Lys Ser Ala Pro Glu Leu Val Ser Ala Tyr Arg Leu Pro
150 155 160

AGT TCC AAA AGC TCT ACA TTT TTA GAA TCT TTA GGG ATC TTC ACC GGT 586

Ser Ser Lys Ser Ser Thr Phe Leu Glu Ser Leu Gly Ile Phe Thr Gly
165 170 175

ATC ATA AAA GCT AGA ATC ACT GCA GCA ATC GTC GCA TGG ATA CCG CTT T 635
1 5 10 15

Asn Tyr Gly Val Ile Val Ser Ala Ile Leu Ala Leu Leu Ala Leu Gly
20 25 30

Leu Leu Phe Phe Lys Gly Phe Ser Leu Gly Ile Asp Phe Ala Gly Gly
35 40 45

Ser Leu Val Gln Val Arg Tyr Thr Gln Asn Ala Pro Ile Lys Glu Val
50 55 60

Arg Asp Leu Phe Glu Lys Glu Ala Arg Phe Lys Gly Val Gln Val Ser
65 70 75 80

Glu Phe Gly Ser Lys Glu Glu Ile Leu Ile Lys Phe Pro Phe Val Glu
85 90 95

Thr Ala Glu Asn Glu Asp Leu Asn Ala Ile Val Ala Asn Ile Leu Lys
100 105 110

Pro Ser Gly Asp Phe Glu Ile Arg Lys Phe Asp Thr Val Gly Pro Arg
115 120 125

Val Gly Ser Glu Leu Lys Glu Lys Gly Ile Leu Ser Leu Ile Leu Ala
130 135 140

Leu Ile Ala Ile Met Val Tyr Val Ser Phe Arg Tyr Glu Trp Arg Phe
145 150 155 160

Ala Leu Ala Ser Val Ile Ala Leu Val His Asp Val Ile Leu Val Ala
165 170 175

Ser Ser Val Ile Val Phe Lys Ile Asp Met Asn Leu Glu Val Ile Ala
180 185 190

Ala Leu Leu Thr Leu Ile Gly Tyr Ser Ile Asn Asp Thr Ile Ile Ile
195 200 205

Phe Asp Arg Ile Arg Glu Glu Met Xaa Ser Gln Lys Thr Lys Asn Ala
210 215 220

Thr Gln Ala Ile Asp Glu Ala Ile Ser Ser Thr Leu Thr Arg Thr Leu
225 230 235 240

Leu Thr Ser Leu Thr Val Phe Phe Val Val Leu Ile Leu Cys Val Phe
245 250 255

Gly Ser Lys Ile Ile Ile Gly Phe Ser Leu Pro Met Leu Ile Gly Thr
260 265 270

Ile Val Gly Thr Tyr Ser Ser Ile Phe Ile Ala Pro Lys Val Ala Leu
275 280 285

Leu Leu Gly Phe Asp Met Asp Lys Tyr Tyr Glu Asn Glu Thr Arg Lys
290 295 300

Ile Lys Lys Ala Gln Glu Lys Glu Lys Met Arg Arg Leu Tyr Glu Ser
305 310 315 320

Gly Gln Val




(2) INFORMATION FOR SEQ ID NO: 1137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 56... 634 
CAT GAT GTG ATT TTA GTG GCA AGC TCG GTG ATT GTT TTT AAG ATT GAT 579

His Asp Val Ile Leu Val Ala Ser Ser Val Ile Val Phe Lys Ile Asp
170 175 180 185

ATG AAT TTG GAA GTG ATT GCG GCC TTG CTC ACC TTG ATT GGG TAT TCC 627

Met Asn Leu Glu Val Ile Ala Ala Leu Leu Thr Leu Ile Gly Tyr Ser
190 195 200

ATT AAT GAT ACG ATC ATT ATT TTT GAC AGG ATC AGA GAA GAG ATG CTY 675

Ile Asn Asp Thr Ile Ile Ile Phe Asp Arg Ile Arg Glu Glu Met Xaa
205 210 215

TCT CAA AAA ACC AAA AAC GCC ACT CAA GCC ATT GAT GAA GCC ATT TCT 723

Ser Gln Lys Thr Lys Asn Ala Thr Gln Ala Ile Asp Glu Ala Ile Ser
220 225 230

AGC ACG CTC ACG CGC ACG CTT TTA ACT TCT TTA ACC GTG TTT TTT GTG 771
Ser Thr Leu Thr Arg Thr Leu Leu Thr Ser Leu Thr Val Phe Phe Val
235 240 245

GTG TTG ATT TTG TGC GTG TTT GGG AGT AAG ATC ATC ATT GGC TTT TCA 819
Val Leu Ile Leu Cys Val Phe Gly Ser Lys Ile Ile Ile Gly Phe Ser
250 255 260 265

TTG CCC ATG TTA ATA GGC ACG ATT GTA GGG ACT TAT AGC TCT ATT TTC 867
Leu Pro Met Leu Ile Gly Thr Ile Val Gly Thr Tyr Ser Ser Ile Phe
270 275 280

ATC GCC CCT AAA GTG GCG TTA TTG TTA GGC TTT GAT ATG GAT AAA TAT 915
Ile Ala Pro Lys Val Ala Leu Leu Leu Gly Phe Asp Met Asp Lys Tyr
285 290 295

TAT GAG AAT GAG ACT AGA AAA ATT AAA AAA GCT CAA GAG AAA GAA AAA 963

Tyr Glu Asn Glu Thr Arg Lys Ile Lys Lys Ala Gln Glu Lys Glu Lys
300 305 310

ATG CGC CGT TTG TAT GAG AGC GGT CAA GTT TAAGGAGTTT CTATGGATTG GGG 1016
Met Arg Arg Leu Tyr Glu Ser Gly Gln Val
315 320

TCGGGTCGTT CATGTGCTG 1035

(2) INFORMATION FOR SEQ ID NO: 1136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 323 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136:
Met Glu Leu Phe Lys Arg Thr Arg Ile Leu Ser Phe Met Arg Tyr Ser
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25... 993 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135:

GAATAAAAGA GCTTAGGAGG TTTT ATG GAA TTA TTC AAA CGA ACT AGA ATC 51

Met Glu Leu Phe Lys Arg Thr Arg Ile
1 5

TTA AGC TTC ATG CGT TAT TCC AAT TAT GGG GTG ATC GTT TCA GCA ATT 99

Leu Ser Phe Met Arg Tyr Ser Asn Tyr Gly Val Ile Val Ser Ala Ile
10 15 20 25

TTA GCG CTT CTA GCG TTG GGG CTT TTG TTT TTC AAA GGG TTT TCT TTA 147

Leu Ala Leu Leu Ala Leu Gly Leu Leu Phe Phe Lys Gly Phe Ser Leu
30 35 40

GGG ATT GAT TTT GCG GGG GGG AGT TTG GTG CAA GTG CGC TAC ACT CAA 195
Gly Ile Asp Phe Ala Gly Gly Ser Leu Val Gln Val Arg Tyr Thr Gln
45 50 55

AAC GCC CCC ATT AAA GAA GTG CGC GAT CTG TTT GAA AAA GAA GCT CGC 243

Asn Ala Pro Ile Lys Glu Val Arg Asp Leu Phe Glu Lys Glu Ala Arg
60 65 70

TTC AAA GGC GTG CAA GTG AGC GAA TTT GGC TCT AAA GAA GAA ATT TTA 291

Phe Lys Gly Val Gln Val Ser Glu Phe Gly Ser Lys Glu Glu Ile Leu
75 80 85

ATC AAA TTC CCT TTT GTA GAA ACG GCT GAA AAT GAA GAT CTG AAC GCT 339

Ile Lys Phe Pro Phe Val Glu Thr Ala Glu Asn Glu Asp Leu Asn Ala
90 95 100 105

ATC GTG GCC AAC ATT CTA AAA CCC AGC GGC GAT TTT GAA ATC CGT AAA 387

Ile Val Ala Asn Ile Leu Lys Pro Ser Gly Asp Phe Glu Ile Arg Lys
110 115 120

TTT GAC ACC GTG GGC CCT AGA GTG GGG AGC GAA TTG AAA GAG AAA GGC 435

Phe Asp Thr Val Gly Pro Arg Val Gly Ser Glu Leu Lys Glu Lys Gly
125 130 135

ATT TTG TCG CTG ATT TTA GCA TTA ATA GCG ATC ATG GTT TAT GTG AGT 483

Ile Leu Ser Leu Ile Leu Ala Leu Ile Ala Ile Met Val Tyr Val Ser
140 145 150

TTC CGC TAT GAA TGG CGT TTT GCT TTA GCG AGC GTC ATT GCG CTT GTG 531
Phe Arg Tyr Glu Trp Arg Phe Ala Leu Ala Ser Val Ile Ala Leu Val
155 160 165
ATT GGG GTG ATC AAG CGT TTC GCT TCA GAG CTA TTG GCC AGC TCT TTA 195
Ile Gly Val Ile Lys Arg Phe Ala Ser Glu Leu Leu Ala Ser Ser Leu
50 55 60

GCC ACC GTA TTG CAT CTC ATA CCG GCA TTT GTG TTT TTA CAG ATT TTA 243

Ala Thr Val Leu His Leu Ile Pro Ala Phe Val Phe Leu Gln Ile Leu
65 70 75

AAT AAT TTG GTT ACC GCT TAC ATG CTC ATG ATC GGG GCG TTG ATT AGC 291

Asn Asn Leu Val Thr Ala Tyr Met Leu Met Ile Gly Ala Leu Ile Ser
80 85 90

AAC GCT TTC AGT CTC ATC TTT TTG TTG ATT GAA AGC GTT GTA ACG AGC 339

Asn Ala Phe Ser Leu Ile Phe Leu Leu Ile Glu Ser Val Val Thr Ser
95 100 105

GAA ACG GAT TAAGGGGTAG TGATGGATTT TATCAATATA GAAAAAAAAT GGC 391

Glu Thr Asp
110



(2) INFORMATION FOR SEQ ID NO: 1134:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134: 



Met Asp Trp Gly Arg Val Val His Val Leu Phe Ser Leu Ile Ser Leu
1 5 10 15

Thr Thr Ile Ala Gly Phe Leu Tyr Glu Pro Asn Thr Val Val Leu Phe
20 25 30

Val Ala Leu Ala Leu Asn Leu Ile Ser Val Thr Leu Lys Ile Gly Val
35 40 45

Ile Lys Arg Phe Ala Ser Glu Leu Leu Ala Ser Ser Leu Ala Thr Val
50 55 60

Leu His Leu Ile Pro Ala Phe Val Phe Leu Gln Ile Leu Asn Asn Leu
65 70 75 80

Val Thr Ala Tyr Met Leu Met Ile Gly Ala Leu Ile Ser Asn Ala Phe
85 90 95

Ser Leu Ile Phe Leu Leu Ile Glu Ser Val Val Thr Ser Glu Thr Asp
100 105 110



(2) INFORMATION FOR SEQ ID NO: 1135:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1132: 



Met Ala Ile Phe Asp Asn Asn Asn Lys Ser Ala Asn Ala Lys Thr Gly

1 5 10 15

Pro Ala Thr Ile Ile Ala Gln Gly Thr Lys Ile Lys Gly Glu Leu His

20 25 30

Leu Asp Tyr His Leu His Val Asp Gly Glu Leu Glu Gly Val Val His

35 40 45

Ser Lys Ser Thr Val Val Ile Gly Gln Thr Gly Ser Val Val Gly Glu

50 55 60

Ile Phe Thr Asn Lys Leu Val Val Ser Gly Lys Phe Thr Gly Thr Val
65 70 75 80

Glu Ala Glu Val Val Glu Ile Met Pro Leu Gly His Leu Asp Gly Lys

85 90 95

Ile Ser Ser Gln Glu Leu Val Val Glu Arg Lys Gly Ile Leu Ile Gly

100 105 110

Glu Thr Arg Pro Lys Asn Ile Gln Gly Gly Ala Leu Leu Ile Asn Glu

115 120 125

Gln Glu Lys Lys Ile Glu Asn Lys
130 135 



(2) INFORMATION FOR SEQ ID NO:1133: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 13... 348 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1133: 



TAAGGAGTTT CT ATG GAT TGG GGT CGG GTC GTT CAT GTG CTG TTC AGC CTT 51 
Met Asp Trp Gly Arg Val Val His Val Leu Phe Ser Leu 
1 5 10 

ATT TCT TTA ACC ACC ATT GCA GGG TTT TTG TAT GAG CCT AAT ACG GTG 99

Ile Ser Leu Thr Thr Ile Ala Gly Phe Leu Tyr Glu Pro Asn Thr Val
15 20 25

GTG TTG TTT GTA GCG TTA GCT TTA AAC CTT ATT TCT GTT ACG CTT AAA 147

Val Leu Phe Val Ala Leu Ala Leu Asn Leu Ile Ser Val Thr Leu Lys
30 35 40 45 
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22... 429 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131: 

CAATAAAGAA AGGAGCATCA G ATG GCA ATC TTT GAT AAC AAT AAT AAA TCG 51 

Met Ala Ile Phe Asp Asn Asn Asn Lys Ser
1 5 10

GCT AAT GCA AAA ACA GGA CCA GCG ACT ATC ATC GCT CAA GGC ACA AAA 99

Ala Asn Ala Lys Thr Gly Pro Ala Thr Ile Ile Ala Gln Gly Thr Lys
15 20 25

ATA AAG GGG GAG CTT CAT TTA GAT TAC CAT TTG CAC GTA GAT GGC GAA 147

Ile Lys Gly Glu Leu His Leu Asp Tyr His Leu His Val Asp Gly Glu
30 35 40

TTA GAA GGG GTG GTG CAT TCT AAA AGC ACG GTG GTG ATC GGG CAA ACC 195
Leu Glu Gly Val Val His Ser Lys Ser Thr Val Val Ile Gly Gln Thr
45 50 55

GGC TCG GTA GTG GGT GAG ATT TTT ACT AAT AAA TTA GTG GTC AGT GGC 243

Gly Ser Val Val Gly Glu Ile Phe Thr Asn Lys Leu Val Val Ser Gly
60 65 70

AAG TTC ACT GGC ACG GTG GAG GCG GAA GTG GTA GAA ATC ATG CCT TTA 291

Lys Phe Thr Gly Thr Val Glu Ala Glu Val Val Glu Ile Met Pro Leu
75 80 85 90

GGG CAC CTT GAT GGC AAA ATC TCT AGC CAA GAG CTT GTG GTG GAA AGA 339

Gly His Leu Asp Gly Lys Ile Ser Ser Gln Glu Leu Val Val Glu Arg
95 100 105

AAG GGG ATT TTG ATT GGG GAA ACT CGC CCT AAG AAT ATT CAA GGG GGG 387

Lys Gly Ile Leu Ile Gly Glu Thr Arg Pro Lys Asn Ile Gln Gly Gly
110 115 120

GCG TTG TTA ATC AAT GAG CAA GAA AAG AAA ATT GAA AAT AAA TAGGGAATG 438
Ala Leu Leu Ile Asn Glu Gln Glu Lys Lys Ile Glu Asn Lys
125 130 135

ATCCAATCCA GCCTTTATAG AGCC 462

(2) INFORMATION FOR SEQ ID NO:1132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
Thr Leu Phe Gly Val Leu Val Val Ser Gly Ile Phe Leu Leu Met Tyr
50 55 60

Tyr Lys Pro Asp Ala Lys Met Ala Phe Asp Ser Val Asn Phe Thr Ile
65 70 75 80

Met Gln Glu Val Ala Tyr Gly Trp Leu Trp Arg His Met His Ala Thr
85 90 95

Ala Ala Ser Met Ile Phe Val Ile Ile Tyr Ile His Met Phe Val Gly
100 105 110

Ile Tyr Tyr Gly Ser Tyr Lys Lys Gly Arg Glu Met Ile Trp Ile Ser
115 120 125

Gly Met Ile Leu Phe Val Val Phe Ser Ala Glu Ala Phe Ser Gly Tyr
130 135 140

Met Leu Pro Trp Gly Gln Met Ser Tyr Trp Ala Ala Ala Val Ile Thr
145 150 155 160

Asn Leu Phe Gly Gly Ile Pro Phe Ile Gly Ala Asp Val Val Glu Trp
165 170 175

Ile Arg Gly Asn Tyr Val Val Ala Asp Ser Thr Leu Thr Arg Phe Phe
180 185 190

Met Leu His Val Phe Leu Leu Pro Ile Ala Ile Ile Leu Leu Val Gly
195 200 205

Val His Phe Tyr Ser Leu Arg Ile Pro His Val Asn Asn Gln Glu Gly
210 215 220

Glu Glu Ile Asp Phe Glu Leu Glu Glu Lys Lys Phe Ile Glu Gly Lys
225 230 235 240

Lys Lys Glu Ser Lys Val Ile Pro Phe Trp Pro Val Phe Leu Ser Lys
245 250 255

Asp Ile Phe Val Val Cys Ala Phe Met Val Phe Phe Phe Tyr Leu Val
260 265 270

Cys Tyr His Tyr Asp Phe Ala Met Asp Pro Ile Asn Phe Glu Arg Ala
275 280 285

Asn Ser Leu Lys Thr Pro Pro His Ile Tyr Pro Glu Trp Tyr Phe Leu
290 295 300

Trp Ser Tyr Glu Val Leu Arg Gly Phe Phe Phe Ser Ala Asp Leu Gly
305 310 315 320

Leu Met Ala Phe Gly Val Ala Gln Val Ile Phe Phe Leu Leu Pro Phe
325 330 335

Leu Asp Arg Ser Pro Val Val Ala Pro Ala His Lys Arg Pro Ala Phe
340 345 350

Met Val Trp Phe Trp Leu Val Ile Ile Asp Met Ile Val Leu Thr Ile
355 360 365

Tyr Gly Lys Leu Pro Pro Leu Gly Ile Gly Lys Tyr Ile Gly Leu Ala
370 375 380

Gly Ser Ile Thr Phe Leu Ala Leu Phe Phe Val Val Leu Pro Ile Ile
385 390 395 400

Thr Ile Ala Glu Ser Lys Lys Gln Gly Gly Val Arg
405 410




(2) INFORMATION FOR SEQ ID NO: 1131: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
Pro Ile Asn Phe Glu Arg Ala Asn Ser Leu Lys Thr Pro Pro His Ile
285 290 295

TAC CCT GAA TGG TAT TTC TTA TGG AGC TAT GAA GTC TTA AGA GGC TTT 963
Tyr Pro Glu Trp Tyr Phe Leu Trp Ser Tyr Glu Val Leu Arg Gly Phe
300 305 310

TTC TTT AGC GCT GAT TTA GGG CTA ATG GCC TTT GGC GTG GCG CAA GTG 1011
Phe Phe Ser Ala Asp Leu Gly Leu Met Ala Phe Gly Val Ala Gln Val
315 320 325

ATT TTC TTT TTG CTA CCC TTC TTG GAT CGA AGT CCA GTG GTC GCT CCT 1059

Ile Phe Phe Leu Leu Pro Phe Leu Asp Arg Ser Pro Val Val Ala Pro
330 335 340 345

GCG CAC AAA CGG CCG GCG TTT ATG GTG TGG TTT TGG CTT GTA ATC ATT 1107
Ala His Lys Arg Pro Ala Phe Met Val Trp Phe Trp Leu Val Ile Ile
350 355 360

GAT ATG ATT GTT TTA ACG ATC TAT GGT AAA TTG CCT CCG CTT GGG ATT 1155

Asp Met Ile Val Leu Thr Ile Tyr Gly Lys Leu Pro Pro Leu Gly Ile
365 370 375

GGT AAA TAC ATT GGC TTA GCG GGT TCA ATC ACT TTT TTG GCC CTT TTC 1203

Gly Lys Tyr Ile Gly Leu Ala Gly Ser Ile Thr Phe Leu Ala Leu Phe
380 385 390

TTT GTG GTA TTG CCC ATC ATC ACT ATC GCT GAG AGC AAG AAA CAA GGG 1251

Phe Val Val Leu Pro Ile Ile Thr Ile Ala Glu Ser Lys Lys Gln Gly
395 400 405

GGT GTT AGA TGAAAGAGTT TAAGATTCTA ATCATCCTCA TTGTGGTGGT AGGCGTGAT 1309

Gly Val Arg

410

TTATTATGGG GTTGA 1324

(2) INFORMATION FOR SEQ ID NO: 1130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 412 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130:

Met Ala Glu Ile Lys Lys Ala Lys Asn Leu Gly Glu Trp Leu Asp Met

1 5 10 15

Arg Leu Gly Thr Asn Lys Leu Val Lys Val Leu Met Thr Glu Tyr Trp

20 25 30

Ile Pro Lys Asn Ile Asn Phe Leu Trp Ala Met Gly Val Ile Leu Leu
35 40 45 
60 65 70

GAT AGC GTG AAT TTC ACC ATC ATG CAA GAA GTG GCT TAT GGC TGG CTT 291

Asp Ser Val Asn Phe Thr Ile Met Gln Glu Val Ala Tyr Gly Trp Leu
75 80 85

TGG CGC CAC ATG CAT GCC ACG GCA GCG AGC ATG ATT TTT GTC ATC ATT 339

Trp Arg His Met His Ala Thr Ala Ala Ser Met Ile Phe Val Ile Ile
90 95 100 105

TAT ATC CAC ATG TTT GTT GGC ATC TAT TAT GGC TCT TAC AAA AAG GGT 387

Tyr Ile His Met Phe Val Gly Ile Tyr Tyr Gly Ser Tyr Lys Lys Gly
110 115 120

CGT GAG ATG ATT TGG ATT AGC GGG ATG ATT TTG TTT GTG GTC TTT AGC 435

Arg Glu Met Ile Trp Ile Ser Gly Met Ile Leu Phe Val Val Phe Ser
125 130 135

GCG GAA GCC TTT AGC GGG TAT ATG CTG CCT TGG GGG CAG ATG AGT TAT 483

Ala Glu Ala Phe Ser Gly Tyr Met Leu Pro Trp Gly Gln Met Ser Tyr
140 145 150

TGG GCC GCA GCG GTT ATC ACG AAT TTA TTT GGA GGC ATT CCT TTC ATT 531
Trp Ala Ala Ala Val Ile Thr Asn Leu Phe Gly Gly Ile Pro Phe Ile
155 160 165

GGG GCT GAT GTG GTG GAG TGG ATT AGA GGC AAT TAT GTT GTG GCG GAT 579

Gly Ala Asp Val Val Glu Trp Ile Arg Gly Asn Tyr Val Val Ala Asp
170 175 180 185

TCC ACT TTA ACG CGC TTT TTC ATG CTC CAT GTG TTT TTA CTG CCC ATT 627

Ser Thr Leu Thr Arg Phe Phe Met Leu His Val Phe Leu Leu Pro Ile
190 195 200

GCG ATC ATT CTA CTT GTT GGG GTG CAT TTT TAT TCT TTA CGC ATC CCG 675
Ala Ile Ile Leu Leu Val Gly Val His Phe Tyr Ser Leu Arg Ile Pro
205 210 215

CAT GTC AAT AAC CAA GAA GGC GAA GAG ATT GAC TTT GAA TTA GAA GAG 723

His Val Asn Asn Gln Glu Gly Glu Glu Ile Asp Phe Glu Leu Glu Glu
220 225 230

AAG AAA TTC ATT GAA GGC AAG AAA AAA GAA TCC AAA GTC ATT CCT TTT 771
Lys Lys Phe Ile Glu Gly Lys Lys Lys Glu Ser Lys Val Ile Pro Phe
235 240 245

TGG CCG GTG TTC TTG TCT AAA GAT ATT TTT GTG GTT TGC GCG TTC ATG 819
Trp Pro Val Phe Leu Ser Lys Asp Ile Phe Val Val Cys Ala Phe Met
250 255 260 265 

GTC TTT TTC TTT TAC TTG GTG TGT TAC CAC TAT GAT TTT GCG ATG GAT 867 
Val Phe Phe Phe Tyr Leu Val Cys Tyr His Tyr Asp Phe Ala Met Asp 
270 275 280 

CCT ATC AAC TTT GAA AGG GCT AAC AGC CTT AAA ACG CCG CCT CAC ATT 915 
325 330 335

Thr Val Leu Glu Asp Leu Gln Lys Asp His Ala Glu Gly Ser Ser Leu
340 345 350

Glu Asn Ile Leu Leu Ala Val Ala Gln Ser Leu Asn Leu Lys Ser Ser
355 360 365

Glu Ile Lys Val Ser Ser Arg Gln Lys Asn Val Ala Leu Ala Arg Lys
370 375 380

Leu Val Val Tyr Phe Ala Arg Leu Tyr Thr Pro Asn Pro Thr Leu Ser
385 390 395 400

Leu Ala Gln Phe Leu Asp Leu Lys Asp His Ser Ser Ile Ser Lys Met
405 410 415

Tyr Ser Gly Val Lys Lys Met Leu Glu Glu Glu Lys Ser Pro Phe Val
420 425 430

Leu Ser Leu Arg Glu Glu Ile Lys Asn Arg Leu Asn Glu Leu Asn Asp
435 440 445

Lys Lys Thr Ala Phe Asn Ser Ser Glu
450 455



(2) INFORMATION FOR SEQ ID NO: 1129:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25... 1260
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129:

TGGCTAAAGC GTAAGGAGAG TTTA ATG GCA GAG ATA AAA AAA GCG AAA AAT 51 

Met Ala Glu Ile Lys Lys Ala Lys Asn
1 5

TTA GGC GAA TGG CTG GAC ATG CGT CTT GGC ACT AAC AAG CTT GTT AAA 99

Leu Gly Glu Trp Leu Asp Met Arg Leu Gly Thr Asn Lys Leu Val Lys
10 15 20 25

GTG CTA ATG ACA GAA TAT TGG ATC CCT AAA AAC ATC AAT TTT TTA TGG 147

Val Leu Met Thr Glu Tyr Trp Ile Pro Lys Asn Ile Asn Phe Leu Trp
30 35 40

GCG ATG GGG GTG ATT TTA TTA ACC CTT TTT GGC GTG CTT GTG GTC TCA 195
Ala Met Gly Val Ile Leu Leu Thr Leu Phe Gly Val Leu Val Val Ser
45 50 55

GGG ATT TTC TTG CTC ATG TAT TAC AAG CCT GAT GCG AAA ATG GCG TTT 243

Gly Ile Phe Leu Leu Met Tyr Tyr Lys Pro Asp Ala Lys Met Ala Phe
TTTCAAAT 1463

(2) INFORMATION FOR SEQ ID NO: 1128:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 457 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128:

Met Asp Thr Asn Asn Asn Ile Glu Lys Glu Ile Leu Ala Leu Val Lys

1 5 10 15

Gln Asn Pro Lys Val Ser Leu Ile Glu Tyr Glu Asn Tyr Phe Ser Gln

20 25 30

Leu Lys Tyr Asn Pro Asn Ala Ser Lys Ser Asp Ile Ala Phe Phe Tyr

35 40 45

Ala Pro Asn Gln Val Leu Cys Thr Thr Ile Thr Ala Lys Tyr Gly Ala

50 55 60

Leu Leu Lys Glu Ile Leu Ser Gln Asn Lys Val Gly Met His Leu Ala
65 70 75 80

His Ser Val Asp Val Arg Ile Glu Val Ala Pro Lys Ile Gln Ile Asn

85 90 95

Ala Gln Ser Asn Ile Asn Tyr Lys Ala Ile Lys Thr Ser Val Lys Asp

100 105 110

Ser Tyr Thr Phe Glu Asn Phe Val Val Gly Ser Cys Asn Asn Thr Val

115 120 125

Tyr Glu Ile Ala Lys Lys Val Ala Gln Ser Asp Thr Pro Pro Tyr Asn

130 135 140

Pro Val Leu Phe Tyr Gly Gly Thr Gly Leu Gly Lys Thr His Ile Leu
145 150 155 160

Asn Ala Ile Gly Asn His Ala Leu Glu Lys His Lys Lys Val Val Leu

165 170 175

Val Thr Ser Glu Asp Phe Leu Thr Asp Phe Leu Lys His Leu Asp Asn

180 185 190

Lys Thr Met Asp Ser Phe Lys Ala Lys Tyr Arg His Cys Asp Phe Phe

195 200 205

Leu Leu Asp Asp Ala Gln Phe Leu Gln Gly Lys Pro Lys Leu Glu Glu

210 215 220

Glu Phe Phe His Thr Phe Asn Glu Leu His Ala Asn Ser Lys Gln Ile
225 230 235 240

Val Leu Ile Ser Asp Arg Ser Pro Lys Asn Ile Ala Gly Leu Glu Asp

245 250 255

Arg Leu Lys Ser Arg Phe Glu Trp Gly Ile Thr Ala Lys Val Met Pro

260 265 270

Pro Asp Leu Glu Thr Lys Leu Ser Ile Val Lys Gln Lys Cys Gln Leu

275 280 285

Asn Gln Ile Thr Leu Pro Glu Glu Val Met Glu Tyr Ile Ala Gln His

290 295 300

Ile Ser Asp Asn Ile Arg Gln Met Glu Gly Ala Ile Ile Lys Ile Ser
305 310 315 320

Val Asn Ala Asn Leu Met Asn Ala Ser Ile Asp Leu Asn Leu Ala Lys
Ser Asp Arg Ser Pro Lys Asn Ile Ala Gly Leu Glu Asp Arg Leu Lys
245 250 255

TCG CGC TTT GAA TGG GGG ATA ACC GCT AAA GTC ATG CCC CCT GAT TTA 871
Ser Arg Phe Glu Trp Gly Ile Thr Ala Lys Val Met Pro Pro Asp Leu
260 265 270 275

GAA ACC AAA CTT TCC ATT GTC AAA CAA AAA TGC CAG CTC AAT CAA ATC 919
Glu Thr Lys Leu Ser Ile Val Lys Gln Lys Cys Gln Leu Asn Gln Ile
280 285 290

ACT TTG CCT GAA GAG GTG ATG GAA TAC ATC GCC CAA CAC ATC AGC GAC 967

Thr Leu Pro Glu Glu Val Met Glu Tyr Ile Ala Gln His Ile Ser Asp
295 300 305

AAT ATC CGC CAA ATG GAA GGC GCG ATC ATT AAA ATC AGC GTG AAC GCG 1015
Asn Ile Arg Gln Met Glu Gly Ala Ile Ile Lys Ile Ser Val Asn Ala
310 315 320

AAC TTG ATG AAC GCT TCC ATT GAT TTG AAC CTC GCT AAA ACC GTT TTA 1063

Asn Leu Met Asn Ala Ser Ile Asp Leu Asn Leu Ala Lys Thr Val Leu
325 330 335

GAA GAT TTG CAA AAA GAT CAT GCT GAA GGT TCA AGC TTG GAA AAT ATC 1111
Glu Asp Leu Gln Lys Asp His Ala Glu Gly Ser Ser Leu Glu Asn Ile
340 345 350 355

CTA CTC GCT GTC GCG CAA AGC CTG AAT CTC AAA TCC AGC GAA ATC AAA 1159

Leu Leu Ala Val Ala Gln Ser Leu Asn Leu Lys Ser Ser Glu Ile Lys
360 365 370

GTC TCT TCG CGC CAA AAA AAT GTC GCT TTG GCG AGG AAA TTA GTC GTG 1207

Val Ser Ser Arg Gln Lys Asn Val Ala Leu Ala Arg Lys Leu Val Val
375 380 385

TAT TTC GCC AGG CTT TAT ACC CCT AAC CCC ACG CTC TCG CTC GCT CAA 1255

Tyr Phe Ala Arg Leu Tyr Thr Pro Asn Pro Thr Leu Ser Leu Ala Gln
390 395 400

TTT TTG GAT TTA AAG GAT CAT TCA AGC ATT TCT AAA ATG TAT TCT GGC 1303

Phe Leu Asp Leu Lys Asp His Ser Ser Ile Ser Lys Met Tyr Ser Gly
405 410 415

GTT AAA AAA ATG CTT GAA GAA GAA AAA AGC CCT TTT GTC TTA AGC CTT 1351
Val Lys Lys Met Leu Glu Glu Glu Lys Ser Pro Phe Val Leu Ser Leu
420 425 430 435

AGA GAA GAA ATC AAA AAC CGC TTG AAC GAA TTG AAC GAC AAA AAA ACC 1399

Arg Glu Glu Ile Lys Asn Arg Leu Asn Glu Leu Asn Asp Lys Lys Thr
440 445 450

GCT TTC AAT TCA AGT GAA TGAAAAAAGG CTTATGAAAA AGCGTTTCAT TCACTTCT 1455
Ala Phe Asn Ser Ser Glu 
455 
20 25 30 35

AAC CCT AAC GCA AGC AAG AGC GAT ATT GCC TTT TTT TAT GCC CCC AAC 199
Asn Pro Asn Ala Ser Lys Ser Asp Ile Ala Phe Phe Tyr Ala Pro Asn
40 45 50

CAA GTC TTA TGC ACC ACG ATT ACA GCT AAA TAC GGC GCG TTG CTT AAA 247

Gln Val Leu Cys Thr Thr Ile Thr Ala Lys Tyr Gly Ala Leu Leu Lys
55 60 65

GAA ATT TTA AGC CAG AAT AAA GTC GGC ATG CAT TTA GCC CAC AGC GTG 295

Glu Ile Leu Ser Gln Asn Lys Val Gly Met His Leu Ala His Ser Val
70 75 80

GAT GTG CGT ATT GAA GTA GCG CCT AAA ATC CAA ATT AAC GCC CAA TCT 343

Asp Val Arg Ile Glu Val Ala Pro Lys Ile Gln Ile Asn Ala Gln Ser
85 90 95

AAT ATC AAT TAC AAA GCC ATA AAA ACG AGC GTC AAA GAC TCT TAC ACT 391

Asn Ile Asn Tyr Lys Ala Ile Lys Thr Ser Val Lys Asp Ser Tyr Thr
100 105 110 115

TTT GAA AAT TTT GTC GTA GGC TCA TGC AAT AAC ACC GTT TAT GAA ATC 439

Phe Glu Asn Phe Val Val Gly Ser Cys Asn Asn Thr Val Tyr Glu Ile
120 125 130

GCT AAA AAA GTC GCC CAA AGC GAT ACC CCC CCT TAT AAC CCG GTG CTT 487

Ala Lys Lys Val Ala Gln Ser Asp Thr Pro Pro Tyr Asn Pro Val Leu
135 140 145

TTT TAT GGC GGC ACA GGG TTA GGC AAA ACG CAC ATT TTA AAC GCT ATC 535
Phe Tyr Gly Gly Thr Gly Leu Gly Lys Thr His Ile Leu Asn Ala Ile
150 155 160

GGC AAC CAT GCC CTA GAA AAG CAT AAA AAA GTC GTG TTA GTC ACT TCA 583
Gly Asn His Ala Leu Glu Lys His Lys Lys Val Val Leu Val Thr Ser
165 170 175

GAA GAC TTT TTG ACA GAC TTT TTA AAG CAT TTA GAC AAC AAA ACC ATG 631
Glu Asp Phe Leu Thr Asp Phe Leu Lys His Leu Asp Asn Lys Thr Met
180 185 190 195

GAT TCT TTT AAA GCA AAA TAC CGC CAT TGC GAC TTT TTC TTG TTA GAT 679

Asp Ser Phe Lys Ala Lys Tyr Arg His Cys Asp Phe Phe Leu Leu Asp
200 205 210

GAC GCT CAA TTT TTG CAA GGA AAA CCC AAG CTA GAA GAA GAA TTT TTC 727

Asp Ala Gln Phe Leu Gln Gly Lys Pro Lys Leu Glu Glu Glu Phe Phe
215 220 225

CAC ACC TTT AAC GAA TTG CAC GCC AAC AGC AAA CAA ATC GTA TTG ATT 775
His Thr Phe Asn Glu Leu His Ala Asn Ser Lys Gln Ile Val Leu Ile
230 235 240

TCA GAC CGA TCG CCT AAA AAC ATC GCC GGC TTA GAA GAT CGC TTA AAA 823
225 230 235 240

Ile Ala Lys Asn His Gln Val Ile Val Val Tyr Pro Leu Val Asn Glu
245 250 255

Ser Glu Lys Ile Pro Tyr Leu Ser Leu Ser Glu Gly Ala Ser Phe Trp
260 265 270

Gln Lys Arg Phe Lys Lys Val Tyr Thr Thr Ser Gly Gln Asp Lys Asn
275 280 285

Lys Glu Glu Val Ile Glu Glu Phe Arg Glu Ser Gly Ser Ile Leu Leu
290 295 300

Ala Thr Thr Leu Ile Glu Val Gly Ile Ser Leu Pro Arg Leu Ser Val
305 310 315 320

Met Val Ile Leu Ala Pro Glu Arg Leu Gly Leu Ala Thr Leu His Gln
325 330 335

Leu Arg Gly Arg Val Ser Arg Asn Gly Leu Lys Gly Tyr Cys Phe Leu
340 345 350

Cys Thr Ile Gln Glu Glu Asn Glu Arg Leu Glu Lys Phe Ala Asp Glu
355 360 365

Leu Asp Gly Phe Lys Ile Ala Glu Leu Asp Leu Glu Tyr Arg Lys Ser
370 375 380

Gly Asp Leu Leu Gln Gly Gly Glu Gln Ser Gly Asn Ser Phe Glu Tyr
385 390 395 400

Ile Asp Leu Ala Lys Asp Glu Asn Ile Ile Ala Glu Val Lys Arg Asp
405 410 415

Phe Leu Lys Ala Ala Ser Val Ser Arg Gly Thr Phe Glu Asn
420 425 430



(2) INFORMATION FOR SEQ ID NO: 1127: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 47... 1417 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127:

TGTTATTCCT TTTTCATTCA CCACTTATTC ACGCTATAAT AACGCC ATG GAT ACC 55

Met Asp Thr
1

AAC AAC AAT ATT GAA AAA GAA ATC TTG GCG CTA GTC AAA CAA AAT CCT 103
Asn Asn Asn Ile Glu Lys Glu Ile Leu Ala Leu Val Lys Gln Asn Pro
5 10 15

AAA GTT AGT CTC ATA GAG TAT GAA AAT TAC TTT AGC CAA CTC AAA TAC 151
Lys Val Ser Leu Ile Glu Tyr Glu Asn Tyr Phe Ser Gln Leu Lys Tyr
GAT TTA CTC CAG GGA GGG GAG CAG AGC GGG AAT ACT TTT GAA TAC ATT 1257

Asp Leu Leu Gln Gly Gly Glu Gln Ser Gly Asn Ser Phe Glu Tyr Ile
390 395 400

GAG TTA GCC AAA GAT GAA AAC ATT ATC GCT GAA GTG AAA CGG GAT TTT 1305

Asp Leu Ala Lys Asp Glu Asn Ile Ile Ala Glu Val Lys Arg Asp Phe
405 410 415

TTA AAG GCC GCT AGC GTT TCA CGG GGA ACA TTT GAA AAT TGAAAATTAA GG 1356
Leu Lys Ala Ala Ser Val Ser Arg Gly Thr Phe Glu Asn
420 425 430

CAGAATTGGG TAATTTAAAT CATTTAAAAA AAG 1389

(2) INFORMATION FOR SEQ ID NO: 1126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 430 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1126: 

Met Leu Phe Tyr Met Lys Asn Leu Glu Arg Lys Lys Leu Gln Phe Gly

1 5 10 15

Ala Lys Ile Ala Cys Pro Asn Asn Asn Glu Arg Leu Lys Ala Phe Ile

20 25 30

Ala Ser Leu Pro Phe Lys Leu Thr Arg Asp Gln Gln Asn Ala Ile Lys

35 40 45

Glu Ile Gln Asn Asp Leu Thr Ser Ser Ile Ala Cys Lys Arg Leu Ile

50 55 60

Ile Gly Asp Val Gly Cys Gly Lys Thr Met Val Ile Leu Ala Ser Met
65 70 75 80

Val Leu Thr Tyr Pro Asn Lys Thr Leu Leu Met Ala Pro Thr Ser Ile

85 90 95

Leu Ala Lys Gln Leu Tyr Asn Glu Ala Leu Lys Phe Leu Pro Pro Tyr

100 105 110

Phe Glu Val Glu Leu Leu Leu Gly Gly Ser Tyr Lys Lys Arg Ser Asn

115 120 125

His Leu Phe Glu Thr Ile Thr His Val Val Ile Gly Thr Gln Ala Leu

130 135 140

Leu Phe Asp Lys Arg Asp Leu Asn Glu Phe Ala Leu Val Ile Thr Asp
145 150 155 160

Glu Gln His Arg Phe Gly Thr Lys Gln Arg Tyr Gln Leu Glu Lys Met

165 170 175

Ala Ser Ser Lys Gly Asn Lys Pro His Ser Leu Gln Phe Ser Ala Thr

180 185 190

Pro Ile Pro Arg Thr Leu Ala Leu Ala Lys Ser Ala Phe Val Lys Thr

195 200 205

Thr Met Ile Arg Glu Ile Pro Tyr Pro Lys Glu Ile Glu Thr Leu Val

210 215 220

Leu His Lys Arg Asp Phe Lys Ile Val Met Glu Lys Ile Ser Glu Glu
Gln His Arg Phe Gly Thr Lys Gln Arg Tyr Gln Leu Glu Lys Met Ala
165 170 175

AGC AGT AAG GGT AAT AAA CCC CAT TCT TTG CAA TTT TCC GCT ACC CCC 633
Ser Ser Lys Gly Asn Lys Pro His Ser Leu Gln Phe Ser Ala Thr Pro
180 185 190

ATT CCT CGC ACG CTC GCC CTA GCC AAA AGC GCG TTT GTG AAA ACG ACC 681
Ile Pro Arg Thr Leu Ala Leu Ala Lys Ser Ala Phe Val Lys Thr Thr
195 200 205

ATG ATT AGA GAA ATC CCT TAT CCT AAA GAG ATT GAA ACT CTA GTC TTG 729
Met Ile Arg Glu Ile Pro Tyr Pro Lys Glu Ile Glu Thr Leu Val Leu
210 215 220 225

CAT AAA AGA GAT TTT AAA ATA GTG ATG GAG AAA ATC AGC GAA GAA ATC 777
His Lys Arg Asp Phe Lys Ile Val Met Glu Lys Ile Ser Glu Glu Ile
230 235 240

GCT AAA AAC CAT CAA GTC ATT GTC GTC TAT CCG CTG GTG AAT GAG AGC 825
Ala Lys Asn His Gln Val Ile Val Val Tyr Pro Leu Val Asn Glu Ser
245 250 255

GAA AAA ATC CCG TAT TTA TCG CTC AGT GAG GGG GCG AGT TTT TGG CAA 873
Glu Lys Ile Pro Tyr Leu Ser Leu Ser Glu Gly Ala Ser Phe Trp Gln
260 265 270

AAA CGC TTT AAA AAG GTT TAT ACC ACT TCA GGG CAA GAT AAA AAT AAA 921
Lys Arg Phe Lys Lys Val Tyr Thr Thr Ser Gly Gln Asp Lys Asn Lys
275 280 285

GAA GAA GTG ATT GAA GAA TTT AGA GAA TCC GGG AGC ATT CTT TTA GCG 969
Glu Glu Val Ile Glu Glu Phe Arg Glu Ser Gly Ser Ile Leu Leu Ala
290 295 300 305

ACT ACG CTC ATT GAG GTG GGC ATT TCT TTA CCA CGA TTG AGC GTG ATG 1017
Thr Thr Leu Ile Glu Val Gly Ile Ser Leu Pro Arg Leu Ser Val Met
310 315 320

GTG ATT TTA GCG CCC GAA AGG TTA GGC TTA GCG ACT TTA CAC CAG TTA 1065
Val Ile Leu Ala Pro Glu Arg Leu Gly Leu Ala Thr Leu His Gln Leu
325 330 335

AGG GGG CGC GTT TCT CGT AAC GGC TTG AAA GGC TAT TGT TTT TTA TGC 1113
Arg Gly Arg Val Ser Arg Asn Gly Leu Lys Gly Tyr Cys Phe Leu Cys
340 345 350

ACG ATC CAA GAA GAA AAC GAA CGA TTA GAA AAG TTT GCT GAT GAA TTG 1161
Thr Ile Gln Glu Glu Asn Glu Arg Leu Glu Lys Phe Ala Asp Glu Leu
355 360 365

GAC GGC TTT AAA ATC GCT GAA TTG GAT TTA GAA TAC AGA AAA AGC GGG 1209
Asp Gly Phe Lys Ile Ala Glu Leu Asp Leu Glu Tyr Arg Lys Ser Gly
370 375 380 385
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 55...1344
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1125: 

GAAACGAATA AAAATTTTCC TTCACAACAT TTAAATGCAT TAAAATACAT TGAA ATG 57

Met
1

CTT TTT TAT ATG AAA AAT TTA GAG CGC AAA AAA TTG CAA TTT GGC GCT 105
Leu Phe Tyr Met Lys Asn Leu Glu Arg Lys Lys Leu Gln Phe Gly Ala
5 10 15

AAA ATC GCA TGC CCC AAT AAT AAC GAG CGC TTG AAA GCG TTT ATC GCT 153
Lys Ile Ala Cys Pro Asn Asn Asn Glu Arg Leu Lys Ala Phe Ile Ala
20 25 30

TCT TTA CCC TTT AAA CTC ACA CGC GAT CAA CAA AAC GCC ATT AAA GAA 201
Ser Leu Pro Phe Lys Leu Thr Arg Asp Gln Gln Asn Ala Ile Lys Glu
35 40 45

ATC CAA AAC GAT CTC ACT AGC TCC ATA GCG TGC AAG CGT TTG ATT ATA 249
Ile Gln Asn Asp Leu Thr Ser Ser Ile Ala Cys Lys Arg Leu Ile Ile
50 55 60 65

GGC GAT GTG GGG TGC GGG AAA ACG ATG GTG ATT TTA GCG AGC ATG GTA 297
Gly Asp Val Gly Cys Gly Lys Thr Met Val Ile Leu Ala Ser Met Val
70 75 80

TTA ACT TAC CCA AAT AAA ACC CTT TTA ATG GCG CCC ACT TCC ATT CTC 345
Leu Thr Tyr Pro Asn Lys Thr Leu Leu Met Ala Pro Thr Ser Ile Leu
85 90 95

GCT AAA CAG CTT TAT AAC GAA GCC TTA AAA TTT TTA CCC CCT TAT TTT 393
Ala Lys Gln Leu Tyr Asn Glu Ala Leu Lys Phe Leu Pro Pro Tyr Phe
100 105 110

GAA GTG GAA TTG CTG CTC GGC GGG AGT TAC AAG AAG CGA TCC AAT CAT 441
Glu Val Glu Leu Leu Leu Gly Gly Ser Tyr Lys Lys Arg Ser Asn His
115 120 125

TTG TTT GAA ACA ATC ACG CAT GTG GTT ATC GGC ACG CAA GCG TTG TTG 489
Leu Phe Glu Thr Ile Thr His Val Val Ile Gly Thr Gln Ala Leu Leu
130 135 140 145

TTT GAT AAG CGC GAT TTG AAT GAA TTC GCT CTA GTG ATC ACT GAT GAA 537
Phe Asp Lys Arg Asp Leu Asn Glu Phe Ala Leu Val Ile Thr Asp Glu
150 155 160

CAG CAC CGA TTT GGC ACC AAG CAG CGC TAC CAA TTA GAA AAA ATG GCA 585
100 105 110

Lys Met Ile Tyr Ile Asp Pro Pro Tyr Asn Thr Lys Asn Glu Asn Phe
115 120 125

Ile Tyr Gly Asp Asp Phe Ser Gln Ser Asn Glu Glu Val Leu Lys Thr
130 135 140

Leu Asp Tyr Ser Lys Glu Lys Leu Asp Tyr Ile Lys Asn Leu Phe Gly
145 150 155 160

Ser Lys Cys His Ser Gly Trp Leu Ser Phe Met Tyr Pro Arg Leu Leu
165 170 175

Leu Ala Lys Asp Leu Leu Lys Gln Asp Gly Val Ile Phe Ile Ser Ile
180 185 190

Asp Asp Asn Glu Cys Ala Gln Leu Lys Leu Leu Cys Asp Glu Ile Phe
195 200 205

Gly Glu Gly Asn Phe Val Ala Cys Leu Lys Trp Lys Lys Lys Lys Gln
210 215 220

Pro Ser Phe Leu Ser Lys Val Ala Val Ile Leu Glu Tyr Ile Leu Val
225 230 235 240

Tyr Ala Lys Asp Phe Ser Leu Ile Asp Lys Leu Gly Leu Asp Asn Val
245 250 255

Ser Asp Ser Asp Lys Pro Ile Ile Asn Thr Ser Asn Asn Leu Ser Lys
260 265 270

Arg Tyr Phe Lys Lys Gly Ile Arg Val Lys Ser Asp Leu Asn Phe Ile
275 280 285

Lys Ser Gly Lys Tyr Gln Asn Lys Thr Met Thr Ile Glu Phe Met Asn
290 295 300

Asp Ile Phe Ile Glu Asn Gly Arg Thr Lys Asn Asp Phe Glu Cys Ile
305 310 315 320

Gly Lys Phe Arg Thr Gly Gln Glu Asn Ile Asn Glu Phe Ile Glu Lys
325 330 335

Asp Leu Ile Phe Ile Thr Lys Asn Leu Gly Ile Arg Arg Asp Leu Leu
340 345 350

Glu Glu Glu Gln Ser Asn Lys Lys Thr Ile Thr Asp Leu Leu Thr Glu
355 360 365

Trp Gly Gln Asn Gln Asp Ala Thr Asn Glu Leu Asn Ile Leu Phe Asn
370 375 380

Asn Ser Ser Asp Glu Ser Ile Phe Ser Asn Pro Lys Pro Thr Lys Leu
385 390 395 400

Ile Asn Arg Leu Ile Glu Leu Ser Thr Asn Glu Gly Asp Ile Ile Leu
405 410 415

Asp Phe Phe Ala Gly Ser Gly Thr Thr Ala His Ala Val Leu Glu Ser
420 425 430

Asn Lys Ser Asp Tyr Gln Lys Leu Ser Glu Gly Gly Gly Gly Tyr Leu
435 440 445

Met Val
450

(2) INFORMATION FOR SEQ ID NO:1125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1389 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
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GGG ATT AGA AGA GAT TTA TTA GAA GAA GAG CAA TCA AAT AAA AAA ACA 1110 
Gly Ile Arg Arg Asp Leu Leu Glu Glu Glu Gln Ser Asn Lys Lys Thr
350 355 360

ATT ACA GAT TTA TTA ACA GAA TGG GGA CAA AAT CAA GAT GCT ACT AAT 1158
Ile Thr Asp Leu Leu Thr Glu Trp Gly Gln Asn Gln Asp Ala Thr Asn
365 370 375

GAA TTA AAT ATT TTA TTT AAT AAT TCT AGC GAT GAA AGT ATT TTT TCA 1206
Glu Leu Asn Ile Leu Phe Asn Asn Ser Ser Asp Glu Ser Ile Phe Ser
380 385 390

AAT CCT AAA CCT ACA AAA CTC ATC AAC CGA TTG ATT GAA TTA TCC ACC 1254
Asn Pro Lys Pro Thr Lys Leu Ile Asn Arg Leu Ile Glu Leu Ser Thr
395 400 405

AAC GAG GGC GAC ATC ATC TTA GAC TTT TTT GCC GGG AGC GGG ACA ACC 1302
Asn Glu Gly Asp Ile Ile Leu Asp Phe Phe Ala Gly Ser Gly Thr Thr
410 415 420 425

GCG CAT GCG GTG TTA GAG AGT AAT AAG AGC GAT TAT CAA AAA TTA AGT 1350
Ala His Ala Val Leu Glu Ser Asn Lys Ser Asp Tyr Gln Lys Leu Ser
430 435 440

GAG GGG GGG GGG GGT TAT TTA ATG GTT TGAACGCCGC ATTTAAAGAA AGGCGCT 1404
Glu Gly Gly Gly Gly Tyr Leu Met Val
445 450

TCATTCTCGT C 1415

(2) INFORMATION FOR SEQ ID NO:1124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1124: 

Met Gln Asn Lys Glu Ile Gly Glu Glu Lys Ser Val Asn Glu Lys Asn
1 5 10 15

Val Glu Val Phe Asn Arg Tyr Phe Pro Gly Cys Leu Ser Ile Glu Asn
20 25 30

Asp Asn Lys Leu Thr Leu Asp Thr Gly Lys Leu Lys Ala Leu Leu Gly
35 40 45

Asp Phe Ser Glu Ile Lys Glu Glu Gly Tyr Gly Leu Asp Phe Val Gly
50 55 60

Lys Lys Ile Ala Leu Asn Gln Ala Phe Lys Lys Asn His Lys Ile Leu
65 70 75 80

Lys Pro Leu Asn Glu Ser Thr Ser Lys His Val Leu Ile Lys Gly Asp
85 90 95

Asn Leu Asp Ala Leu Lys Ile Leu Lys Gln Ser Tyr Ser Glu Lys Ile
Asn Thr Lys Asn Glu Asn Phe Ile Tyr Gly Asp Asp Phe Ser Gln Ser
125 130 135

AAT GAA GAG GTT TTA AAA ACA TTG GAT TAT TCT AAA GAA AAA TTG GAT 486
Asn Glu Glu Val Leu Lys Thr Leu Asp Tyr Ser Lys Glu Lys Leu Asp
140 145 150

TAG ATC AAG AAC CTT TTT GGG TCA AAA TGC CAT AGC GGG TGG CTT AGT 534
Tyr Ile Lys Asn Leu Phe Gly Ser Lys Cys His Ser Gly Trp Leu Ser
155 160 165

TTC ATG TAT CCC AGA TTG TTG CTC GCT AAA GAT TTG CTC AAA CAA GAC 582
Phe Met Tyr Pro Arg Leu Leu Leu Ala Lys Asp Leu Leu Lys Gln Asp
170 175 180 185

GGC GTG ATT TTC ATT TCT ATT GAC GAT AAC GAA TGC GCT CAA CTC AAA 630
Gly Val Ile Phe Ile Ser Ile Asp Asp Asn Glu Cys Ala Gln Leu Lys
190 195 200

CTT TTA TGC GAT GAA ATT TTT GGG GAG GGG AAT TTT GTG GCG TGT TTA 678
Leu Leu Cys Asp Glu Ile Phe Gly Glu Gly Asn Phe Val Ala Cys Leu
205 210 215

AAA TGG AAA AAG AAA AAA CAA CCA AGT TTT TTA TCA AAA GTA GCC GTA 726
Lys Trp Lys Lys Lys Lys Gln Pro Ser Phe Leu Ser Lys Val Ala Val
220 225 230

ATA TTA GAA TAT ATT TTA GTA TAT GCA AAA GAT TTT AGT CTA ATT GAT 774
Ile Leu Glu Tyr Ile Leu Val Tyr Ala Lys Asp Phe Ser Leu Ile Asp
235 240 245

AAG TTA GGT TTA GAT AAT GTA TCT GAT AGC GAT AAA CCT ATC ATT AAT 822
Lys Leu Gly Leu Asp Asn Val Ser Asp Ser Asp Lys Pro Ile Ile Asn
250 255 260 265

ACC TCT AAT AAT TTA TCA AAA AGA TAT TTT AAA AAA GGT ATT AGG GTT 870
Thr Ser Asn Asn Leu Ser Lys Arg Tyr Phe Lys Lys Gly Ile Arg Val
270 275 280

AAA TCT GAT TTA AAT TTT ATA AAG AGT GGA AAG TAT CAA AAT AAG ACA 918
Lys Ser Asp Leu Asn Phe Ile Lys Ser Gly Lys Tyr Gln Asn Lys Thr
285 290 295

ATG ACG ATT GAA TTT ATG AAT GAT ATT TTT ATT GAA AAT GGC AGA ACT 966
Met Thr Ile Glu Phe Met Asn Asp Ile Phe Ile Glu Asn Gly Arg Thr
300 305 310

AAA AAT GAT TTT GAA TGT ATA GGT AAA TTT AGA ACA GGA CAA GAA AAT 1014
Lys Asn Asp Phe Glu Cys Ile Gly Lys Phe Arg Thr Gly Gln Glu Asn
315 320 325

ATT AAT GAA TTT ATT GAA AAA GAT TTA ATT TTT ATA ACA AAA AAT TTA 1062
Ile Asn Glu Phe Ile Glu Lys Asp Leu Ile Phe Ile Thr Lys Asn Leu
330 335 340 345
Tyr His Leu Thr Asp Glu Glu Ile Lys Ile Ile Glu Glu Gly Gln
1265 1270 1275 1

(2) INFORMATION FOR SEQ ID NO:1123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28...1377
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1123: 

TTTAAATCAT TTAAAAAAAG GATAGAG ATG CAA AAT AAA GAA ATT GGT GAA GAA 54
Met Gln Asn Lys Glu Ile Gly Glu Glu
1 5

AAA AGC GTT AAT GAA AAA AAT GTA GAG GTT TTT AAT CGT TAT TTT CCC 102
Lys Ser Val Asn Glu Lys Asn Val Glu Val Phe Asn Arg Tyr Phe Pro
10 15 20 25

GGT TGC TTG AGT ATA GAA AAT GAT AAC AAG CTC ACG CTG GAT ACA GGA 150
Gly Cys Leu Ser Ile Glu Asn Asp Asn Lys Leu Thr Leu Asp Thr Gly
30 35 40

AAA TTA AAA GCG TTA CTA GGG GAT TTT AGC GAG ATA AAA GAA GAG GGC 198
Lys Leu Lys Ala Leu Leu Gly Asp Phe Ser Glu Ile Lys Glu Glu Gly
45 50 55

TAT GGG TTG GAT TTT GTG GGT AAG AAA ATC GCC TTA AAC CAA GCT TTT 246
Tyr Gly Leu Asp Phe Val Gly Lys Lys Ile Ala Leu Asn Gln Ala Phe
60 65 70

AAG AAA AAT CAT AAG ATT TTA AAG CCC TTA AAC GAA TCC ACT AGC AAG 294
Lys Lys Asn His Lys Ile Leu Lys Pro Leu Asn Glu Ser Thr Ser Lys
75 80 85

CAC GTT CTC ATC AAG GGC GAT AAT TTA GAC GCT CTC AAA ATC TTA AAA 342
His Val Leu Ile Lys Gly Asp Asn Leu Asp Ala Leu Lys Ile Leu Lys
90 95 100 105

CAA AGC TAT AGT GAA AAA ATC AAA ATG ATT TAG ATT GAC CCG CCT TAC 390
Gln Ser Tyr Ser Glu Lys Ile Lys Met Ile Tyr Ile Asp Pro Pro Tyr
110 115 120

AAC ACG AAA AAC GAG AAT TTT ATC TAT GGC GAT GAT TTC TCG CAA TCC 438






WO 98/43478 PCT/US98/06371



820 825 830

Val Leu Asp Asp Glu Gly Asp Phe Ser Gly Phe Asp Cys Ile Ile Gly
835 840 845

Asn Pro Pro Tyr Ile Arg Gln Glu His Ile Lys Asp Leu Lys Pro Leu
850 855 860

Leu Glu Lys Gln Tyr Gln Asp Phe Tyr Asn Ser Thr Ala Asp Ile Tyr
865 870 875 880

Thr Tyr Phe Phe Ala Leu Ala Phe His Leu Leu Lys Glu Lys Gly Phe
885 890 895

Ser Ala Phe Ile Thr Ser Asn Lys Tyr Thr Arg Ala Lys Tyr Gly Ala
900 905 910

Lys Leu Arg Glu Trp Leu Leu Lys Lys Thr Thr Ile Val Ser Tyr Met
915 920 925

Glu Leu Asn Ala Leu Lys Val Phe Glu Ser Ala Ala Val Asp Thr Ser
930 935 940

Ile Ile His Phe Ile Lys Gln Thr Pro Ser Lys Glu Ser Glu Phe Lys
945 950 955 960

Tyr Tyr Glu Pro Thr Pro Asn Asp Lys Asp Asp Leu Lys Ser Thr Pro
965 970 975

His Leu Leu Met Lys Gln Asn Val Leu Ser Thr Glu Ser Phe Ile Phe
980 985 990

Ala Asn Ala Thr Leu Leu Asp Leu Arg Asp Lys Ile Glu Ser Val Gly
995 1000 1005

Thr Pro Leu Lys Asp Trp Asp Ile Gln Ile Asn Tyr Gly Ile Lys Thr
1010 1015 1020

Gly Ala Asn Glu Ala Phe Ile Ile Pro Thr Glu Lys Arg Glu Glu Ile
1025 1030 1035 1040

Leu Asn Ala Cys Lys Thr Gln Glu Glu Arg Glu Arg Thr Glu Arg Leu
1045 1050 1055

Ile Lys Pro Ile Leu Arg Gly Lys Asp Ile Lys Arg Tyr Ser Tyr Glu
1060 1065 1070

Trp Ala His Leu Trp Val Ile Asn Thr His Asn Gly Tyr Thr Ser Ser
1075 1080 1085

Leu Lys Ser Lys Ile Pro Pro Ile Asp Ile Glu Lys Tyr Pro Ala Ile
1090 1095 1100

Lys Ala His Leu Asp Ala His Tyr Asp Thr Ile Ala Thr Arg Cys Asp
1105 1110 1115 1120

Gln Gly Asp Thr Pro Tyr His Leu Arg Asn Cys Ala Tyr Leu Glu Asp
1125 1130 1135

Phe Glu Lys Glu Lys Ile Val Trp Ala Ser Val Gly Phe Val Glu Tyr
1140 1145 1150

Cys Met Ile Pro Gly Leu Leu Ile Leu Asp Thr Asn Tyr Phe Phe Glu
1155 1160 1165

Val Ser Lys Phe Gly Asn Thr Lys Asn Tyr Leu Leu Gly Leu Leu Asn
1170 1175 1180

Ser Lys Leu Leu Thr Phe Trp Leu Lys Ala Lys Asn Thr Pro Leu Gly
1185 1190 1195 1200

Asp Met Gly Ala Tyr Arg Asn Tyr Lys Tyr Asn Ile Met Glu Leu Pro
1205 1210 1215

Met Val Lys Ile Thr Ala Lys Asn Lys Lys Ile Ala Asp Lys Ile Ile
1220 1225 1230

Ala Leu Val Asp Lys Ile Leu Gln Ala Lys Glu Lys Asp Pro Lys Ala
1235 1240 1245

Asn Thr Gln Lys Leu Glu Lys Glu Ile Asp Ala Leu Val Tyr Gln Leu
1250 1255 1260
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Pro Leu Pro Leu Leu Lys Tyr Leu Phe Lys Phe Leu Arg Leu Tyr Lys 
385 390 395 400 

Phe Thr Thr Thr Pro Lys Asp Ile Lys Asp Asn Thr Asp Thr Ser Glu
405 410 415

Ser Arg Leu Ile Asn Pro Ser Val Leu Gly Leu Val Phe Glu Lys Leu
420 425 430

Asn Gly Tyr Lys Glu Gly Ser Phe Tyr Thr Pro Ser Phe Ile Thr Ser
435 440 445

Tyr Met Cys Lys Glu Ser Ile Thr Pro Ile Val Leu Asp Lys Phe Asn
450 455 460

Ala Ile Tyr Gln Trp Asp Cys Glu Asn Leu Lys Ala Leu Arg Gly Glu
465 470 475 480

Ile Asp Arg Asn Phe Ser Asn Glu Lys Ala Lys Glu Tyr Leu Asn Thr
485 490 495

Leu Leu Thr Leu Arg Ile Cys Asp Pro Ala Val Gly Ser Gly His Phe
500 505 510

Leu Val Ser Ala Leu Asn Glu Met Val Arg Val Ala Tyr Glu Leu Gly
515 520 525

Leu Ile Ala Ser Leu Tyr Arg Tyr Asp Leu Lys Leu Glu Asn Asp Glu
530 535 540

Ile Ile Ile His His Thr Pro Thr Gly Glu Ile Phe Asn Tyr Ile Lys
545 550 555 560

Pro Asp Ser Glu Asn Asp Pro His His His Ile Gln Lys Glu Leu Phe
565 570 575

Asn Leu Lys Lys Ser Ile Ile Glu Asn Cys Leu Phe Gly Val Asp Ile
580 585 590

Asn Pro Asn Ser Cys Glu Ile Thr Lys Leu Arg Leu Trp Ile Glu Leu
595 600 605

Leu Lys Tyr Ser Tyr Tyr Ile Phe Glu Lys Gly Lys Asn Thr Asn Ala
610 615 620

Leu Glu Thr Leu Pro Asn Ile Asp Ile Asn Ile Lys Cys Ala Asn Ser
625 630 635 640

Leu Ile Ser Arg Phe Ala Leu Lys Asp Lys Ala Leu Leu Lys Ser Glu
645 650 655

Lys Asn Lys Asn Leu Glu Tyr Ser Ile Ala Glu Tyr Lys Glu Leu Val
660 665 670

Lys Ile Tyr Lys Asp Pro Lys Ile Leu Glu Thr Leu Thr His Pro Ile
675 680 685

Lys Asp Ser Asn Ala Val Arg Lys Tyr Ala Lys Glu Arg Leu Tyr Gln
690 695 700

Glu Leu Lys Gln Asn Pro Asn Lys Asp Phe Lys Lys Ala Leu Asn Asp
705 710 715 720

Arg Ile Glu Lys Ile Lys Lys Ala Phe Lys Leu Thr Leu Asn Pro Pro
725 730 735

Pro Lys Glu Leu Lys Phe Lys Lys Phe Leu Lys Glu His Leu Glu Leu
740 745 750

Tyr Gly Lys Ser Ile Leu Glu Glu Ala Asn Tyr Asn Gly Leu Glu Leu
755 760 765

Glu Ala Leu Ala Leu Glu Lys Gln Met Ala Asn Leu Phe Phe Asp Tyr
770 775 780

Arg Pro Tyr Pro Lys Leu Asp Lys Ser Asp Lys Val Val Gly Leu Glu
785 790 795 800

His Phe Asn Arg Tyr Val Leu Thr Ser Tyr Lys Asp Leu Gln Asp Glu
805 810 815

Asn Glu Arg Tyr Ala Asn Ala Leu Glu Trp Arg Phe Glu Phe Pro Glu
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1122:

Met Asp Tyr Lys Lys Leu Asp Leu Pro Asn Thr Asn Tyr Pro Asn Gln
1 5 10 15

Glu Gln Leu Lys Ala Phe Glu Thr Ala Phe Asp Ala Phe Leu Glu Thr
20 25 30

Asn Gln Gln Glu Asn Glu Asn His Gln Asn Asp Ala Phe Asn Asp Leu
35 40 45

Leu Lys Gly Val Phe Lys Tyr Lys Val Lys Pro Thr Lys Lys Ile Asp
50 55 60

Ser Thr Ile Leu Asn Glu Asn Asn Glu Val Glu Val Ile Ile Glu Phe
65 70 75 80

Lys Ala Leu Lys Asn Pro Asn Glu Phe Ile Lys Lys Gly Asp Leu Asn
85 90 95

Val Lys Ala Phe His Glu Ser Leu Leu Ser Tyr Leu Thr Glu Arg Lys
100 105 110

Glu Gly Asn Asn Asn Leu Lys His Leu Ile Leu Ala Thr Ile Lys Glu
115 120 125

Leu Tyr Ile Ile Asp Ala Asn Glu Phe Glu Val Phe Asn Lys Asp Lys
130 135 140

Glu Ile Glu Asn Ala Phe Lys Asn Cys His Asp Arg Lys Gly Asn Asp
145 150 155 160

Thr Arg Thr Lys Ala Phe Tyr Asp Ala Cys Gln Lys Arg Leu Asn Glu
165 170 175

Phe Asp Arg Ser Leu Lys Tyr His Tyr Ile Pro Leu Lys Lys Glu Asn
180 185 190

Leu Ala Leu Ile Tyr Gln Ala Leu Ser Pro Asn Phe Leu Leu Lys Ile
195 200 205

Pro Lys Tyr Ser Asp Ala Asn Thr Leu Asn Lys Asp Phe Tyr Glu Glu
210 215 220

Leu Leu Tyr Ile Leu Gly Leu Glu Glu Gln Asn Asp Lys Gly Lys Ile
225 230 235 240

Leu Ile Lys Pro Ser Arg Thr Gln Asn Ser Leu Ser Asp Ala Leu Lys
245 250 255

Lys Glu Tyr Lys Asn Leu Asp Asp Glu Glu Val Met Ala Leu Leu Ile
260 265 270

Ala Trp Asn Asn Arg Ile Leu Phe Leu Arg Leu Leu Glu Ser Leu Leu
275 280 285

Ile Ser Phe Lys His Phe Glu Asn Pro Phe Leu Thr Thr Glu Asn Phe
290 295 300

Glu Asn Phe Asn Asp Leu Asn Thr Leu Phe Phe Glu Val Leu Ala Lys
305 310 315 320

Lys Asn Ser Glu Arg Leu Pro Glu Ile Lys Glu Asp Lys Ile Leu Glu
325 330 335

Lys Ile Pro Tyr Leu Asn Ser Ser Leu Phe Asp Lys Thr Pro Leu Glu
340 345 350

Leu Lys Gly His Glu Ile Lys Leu Leu Asp Asn Lys Lys Leu Glu Ile
355 360 365

Tyr Lys Asn Ser Val Leu Lys Lys His Lys Asp Tyr Gln Lys Glu Lys
370 375 380
GAA AAA TAG CCC GCA ATT AAA GCG CAT TTA GAC GCT CAT TAC GAC ACT 3362
Glu Lys Tyr Pro Ala Ile Lys Ala His Leu Asp Ala His Tyr Asp Thr
1100 1105 1110

ATT GCA ACA CGA TGC GAT CAA GGA GAC ACC CCC TAT CAC TTA AGG AAT 3410
Ile Ala Thr Arg Cys Asp Gln Gly Asp Thr Pro Tyr His Leu Arg Asn
1115 1120 1125 1130

TGC GCG TAT TTA GAG GAT TTT GAA AAA GAG AAA ATT GTG TGG GCA AGT 3458
Cys Ala Tyr Leu Glu Asp Phe Glu Lys Glu Lys Ile Val Trp Ala Ser
1135 1140 1145

GTG GGA TTT GTT GAA TAT TGT ATG ATC CCA GGA TTA TTG ATA CTT GAT 3506
Val Gly Phe Val Glu Tyr Cys Met Ile Pro Gly Leu Leu Ile Leu Asp
1150 1155 1160

ACA AAT TAT TTT TTT GAA GTC AGT AAA TTT GGC AAT ACA AAA AAC TAT 3554
Thr Asn Tyr Phe Phe Glu Val Ser Lys Phe Gly Asn Thr Lys Asn Tyr
1165 1170 1175

TTG CTT GGA CTT TTA AAT TCA AAA TTG CTA ACT TTT TGG TTA AAA GCT 3602
Leu Leu Gly Leu Leu Asn Ser Lys Leu Leu Thr Phe Trp Leu Lys Ala
1180 1185 1190

AAA AAT ACA CCA TTA GGC GAT ATG GGA GCT TAT AGA AAT TAT AAG TAT 3650
Lys Asn Thr Pro Leu Gly Asp Met Gly Ala Tyr Arg Asn Tyr Lys Tyr
1195 1200 1205 1210

AAT ATT ATG GAG TTA CCG ATG GTA AAA ATA ACG GCA AAA AAT AAA AAA 3698
Asn Ile Met Glu Leu Pro Met Val Lys Ile Thr Ala Lys Asn Lys Lys
1215 1220 1225

ATC GCC GAT AAA ATC ATC GCT TTA GTG GAT AAA ATC CTA CAA GCA AAA 3746
Ile Ala Asp Lys Ile Ile Ala Leu Val Asp Lys Ile Leu Gln Ala Lys
1230 1235 1240

GAA AAA GAC CCT AAA GCC AAC ACC CAA AAG TTA GAA AAA GAA ATT GAC 3794
Glu Lys Asp Pro Lys Ala Asn Thr Gln Lys Leu Glu Lys Glu Ile Asp
1245 1250 1255

GCC TTA GTC TAT CAG CTC TAC CAC CTC ACC GAT GAA GAA ATT AAG ATC 3842
Ala Leu Val Tyr Gln Leu Tyr His Leu Thr Asp Glu Glu Ile Lys Ile
1260 1265 1270

ATT GAA GAG GGG CAG TGAATGGAAA AGTTATTTGA AAAGATATTG CATGAAATGA G 3898
Ile Glu Glu Gly Gln
1275 1

ATCAA 3903

(2) INFORMATION FOR SEQ ID NO:1122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1279 amino acids

(B) TYPE: amino acid
AGC ACC GCT GAC ATT TAG ACC TAG TTT TTT GCC GTG GCT TTG GAG CTT 2690
Ser Thr Ala Asp Ile Tyr Thr Tyr Phe Phe Ala Leu Ala Phe His Leu
875 880 885 890

TTA AAA GAA AAG GGG TTT AGC GCT TTC ATC ACT TCT AAC AAA TAT ACG 2738
Leu Lys Glu Lys Gly Phe Ser Ala Phe Ile Thr Ser Asn Lys Tyr Thr
895 900 905

CGA GCC AAA TAG GGC GCT AAA TTG AGG GAA TGG CTG CTC AAA AAA ACC 2786
Arg Ala Lys Tyr Gly Ala Lys Leu Arg Glu Trp Leu Leu Lys Lys Thr
910 915 920

ACC ATC GTC AGC TAG ATG GAA CTA AAC GCC TTA AAA GTC TTT GAG AGC 2834
Thr Ile Val Ser Tyr Met Glu Leu Asn Ala Leu Lys Val Phe Glu Ser
925 930 935

GCT GCA GTG GAT ACC AGC ATC ATT CAT TTC ATC AAA GAA ACG CCC TCT 2882
Ala Ala Val Asp Thr Ser Ile Ile His Phe Ile Lys Gln Thr Pro Ser
940 945 950

AAA GAG AGC GAA TTT AAA TAT TAC GAA CCC ACC CCA AAC GAT AAA GAC 2930
Lys Glu Ser Glu Phe Lys Tyr Tyr Glu Pro Thr Pro Asn Asp Lys Asp
955 960 965 970

GAT TTG AAA AGC ACC CCA GAC CTT TTG ATG AAA GAA AAC GTG CTT TCA 2978
Asp Leu Lys Ser Thr Pro His Leu Leu Met Lys Gln Asn Val Leu Ser
975 980 985

ACA GAA AGC TTT ATT TTT GCC AAC GCC ACG CTT TTA GAT TTG AGG GAC 3026
Thr Glu Ser Phe Ile Phe Ala Asn Ala Thr Leu Leu Asp Leu Arg Asp
990 995 1000

AAA ATA GAG AGT GTT GGC ACC CCG CTT AAA GAC TGG GAC ATT GAA ATC 3074
Lys Ile Glu Ser Val Gly Thr Pro Leu Lys Asp Trp Asp Ile Gln Ile
1005 1010 1015

AAT TAT GGG ATA AAA ACC GGC GCG AAC GAA GCC TTT ATC ATT CCC ACT 3122
Asn Tyr Gly Ile Lys Thr Gly Ala Asn Glu Ala Phe Ile Ile Pro Thr
1020 1025 1030

GAA AAA AGA GAA GAG ATC TTA AAC GCT TGC AAG ACG CAA GAA GAA AGG 3170
Glu Lys Arg Glu Glu Ile Leu Asn Ala Cys Lys Thr Gln Glu Glu Arg
1035 1040 1045 1050

GAG GGC ACA GAG AGG CTT ATT AAG GCT ATT TTA AGA GGG AAA GAC ATT 3218
Glu Arg Thr Glu Arg Leu Ile Lys Pro Ile Leu Arg Gly Lys Asp Ile
1055 1060 1065

AAA AGG TAT TCT TAT GAG TGG GCG CAT TTG TGG GTT ATC AAC ACC CAT 3266
Lys Arg Tyr Ser Tyr Glu Trp Ala His Leu Trp Val Ile Asn Thr His
1070 1075 1080

AAC GGC TAC ACT TCT TCT CTC AAA TCC AAA ATC GCT CCC ATT GAT ATA 3314
Asn Gly Tyr Thr Ser Ser Leu Lys Ser Lys Ile Pro Pro Ile Asp Ile
1085 1090 1095
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GCC TTG TTA AAA AGC GAA AAA AAT AAA AAC CTA GAA TAG TCT ATC GCT 2018 
Ala Leu Leu Lys Ser Glu Lys Asn Lys Asn Leu Glu Tyr Ser Ile Ala
655 660 665

GAA TAG AAA GAA CTC GTT AAA ATC TAT AAA GAG CCT AAA ATC TTA GAA 2066
Glu Tyr Lys Glu Leu Val Lys Ile Tyr Lys Asp Pro Lys Ile Leu Glu
670 675 680

ACC CTA ACG CAC CCC ATA AAA GAC TCT AAC GCC GTT AGA AAA TAG GCT 2114
Thr Leu Thr His Pro Ile Lys Asp Ser Asn Ala Val Arg Lys Tyr Ala
685 690 695

AAA GAA CGC CTT TAT GAA GAA CTA AAA CAA AAT CCT AAC AAA GAT TTT 2162
Lys Glu Arg Leu Tyr Gln Glu Leu Lys Gln Asn Pro Asn Lys Asp Phe
700 705 710

AAA AAG GCT CTC AAT GAT AGG ATA GAG AAA ATT AAA AAA GCT TTT AAA 2210
Lys Lys Ala Leu Asn Asp Arg Ile Glu Lys Ile Lys Lys Ala Phe Lys
715 720 725 730

CTC ACT TTA AAC CCC CCT CCA AAA GAA TTA AAA TTT AAA AAA TTT TTA 2258
Leu Thr Leu Asn Pro Pro Pro Lys Glu Leu Lys Phe Lys Lys Phe Leu
735 740 745

AAA GAG CAT TTA GAA CTC TAT GGC AAG AGT ATC TTA GAA GAG GCA AAC 2306
Lys Glu His Leu Glu Leu Tyr Gly Lys Ser Ile Leu Glu Glu Ala Asn
750 755 760

TAC AAC GGC TTA GAA TTG GAA GCC CTA GCA TTA GAA AAG CAA ATG GCG 2354
Tyr Asn Gly Leu Glu Leu Glu Ala Leu Ala Leu Glu Lys Gln Met Ala
765 770 775

AAT CTT TTT TTT GAT TAT AGA CCC TAC CCC AAA CTA GAC AAA TCG GAT 2402
Asn Leu Phe Phe Asp Tyr Arg Pro Tyr Pro Lys Leu Asp Lys Ser Asp
780 785 790

AAA GTA GTA GGA CTA GAA CAT TTT AAC CGC TAT GTC CTA ACA TCT TAT 2450
Lys Val Val Gly Leu Glu His Phe Asn Arg Tyr Val Leu Thr Ser Tyr
795 800 805 810

AAA GAT TTA CAA GAT GAA AAC GAA CGC TAC GCT AAC GCT CTT GAA TGG 2498
Lys Asp Leu Gln Asp Glu Asn Glu Arg Tyr Ala Asn Ala Leu Glu Trp
815 820 825

CGC TTT GAA TTC CCT GAA GTT TTA GAT GAT GAG GGG GAT TTT TCA GGC 2546
Arg Phe Glu Phe Pro Glu Val Leu Asp Asp Glu Gly Asp Phe Ser Gly
830 835 840

TTT GAT TGC ATC ATT GGG AAT CCA CCT TAT ATC CGC CAA GAA CAC ATC 2594
Phe Asp Cys Ile Ile Gly Asn Pro Pro Tyr Ile Arg Gln Glu His Ile
845 850 855

AAA GAC TTA AAG GCT TTA TTA GAA AAG CAA TAC CAA GAT TTC TAT AAC 2642
Lys Asp Leu Lys Pro Leu Leu Glu Lys Gln Tyr Gln Asp Phe Tyr Asn
860 865 870




CTT GTT TTT GAA AAA CTC AAC GGC TAT AAA GAG GGG AGC TTT TAT ACC 1346
Leu Val Phe Glu Lys Leu Asn Gly Tyr Lys Glu Gly Ser Phe Tyr Thr
430 435 440

CCA AGC TTT ATC ACA AGC TAG ATG TGC AAA GAG AGC ATC ACG CCC ATC 1394
Pro Ser Phe Ile Thr Ser Tyr Met Cys Lys Glu Ser Ile Thr Pro Ile
445 450 455

GTG TTG GAT AAA TTC AAC GCC ATT TAT CAG TGG GAC TGC GAA AAT CTA 1442
Val Leu Asp Lys Phe Asn Ala Ile Tyr Gln Trp Asp Cys Glu Asn Leu
460 465 470

AAA GCG TTG CGA GGA GAA ATA GAC AGA AAT TTT TCA AAT GAA AAA GCT 1490
Lys Ala Leu Arg Gly Glu Ile Asp Arg Asn Phe Ser Asn Glu Lys Ala
475 480 485 490

AAA GAA TAC CTA AAC ACG CTT TTA ACC TTG CGT ATT TGC GAT CCG GCG 1538
Lys Glu Tyr Leu Asn Thr Leu Leu Thr Leu Arg Ile Cys Asp Pro Ala
495 500 505

GTG GGG AGC GGG CAT TTC TTG GTT TCA GCG CTC AAT GAA ATG GTG CGG 1586
Val Gly Ser Gly His Phe Leu Val Ser Ala Leu Asn Glu Met Val Arg
510 515 520

GTT GCT TAT GAG CTA GGA CTT ATT GCT TCC TTG TAT CGC TAC GAT CTT 1634
Val Ala Tyr Glu Leu Gly Leu Ile Ala Ser Leu Tyr Arg Tyr Asp Leu
525 530 535

AAA TTA GAA AAC GAT GAA ATC ATC ATT CAC CAC ACG CCA ACG GGT GAA 1682
Lys Leu Glu Asn Asp Glu Ile Ile Ile His His Thr Pro Thr Gly Glu
540 545 550

ATC TTT AAC TAC ATA AAA CCA GAT AGC GAA AAC GAC CCC CAC CAC CAC 1730
Ile Phe Asn Tyr Ile Lys Pro Asp Ser Glu Asn Asp Pro His His His
555 560 565 570

ATC CAA AAA GAA CTT TTT AAT CTT AAA AAA TCC ATT ATT GAA AAC TGC 1778
Ile Gln Lys Glu Leu Phe Asn Leu Lys Lys Ser Ile Ile Glu Asn Cys
575 580 585

CTT TTT GGC GTG GAT ATT AAC CCC AAT TCT TGC GAA ATC ACC AAG CTC 1826
Leu Phe Gly Val Asp Ile Asn Pro Asn Ser Cys Glu Ile Thr Lys Leu
590 595 600

AGG CTA TGG ATA GAG CTT TTA AAA TAC AGC TAT TAT ATT TTT GAA AAG 1874
Arg Leu Trp Ile Glu Leu Leu Lys Tyr Ser Tyr Tyr Ile Phe Glu Lys
605 610 615

GGC AAG AAC ACT AAC GCG CTT GAA ACC CTC CCC AAC ATT GAT ATT AAC 1922
Gly Lys Asn Thr Asn Ala Leu Glu Thr Leu Pro Asn Ile Asp Ile Asn
620 625 630

ATT AAG TGC GCT AAT TCG CTC ATT TCT AGG TTT GCC CTC AAA GAT AAA 1970
Ile Lys Cys Ala Asn Ser Leu Ile Ser Arg Phe Ala Leu Lys Asp Lys
635 640 645 650
Asn Phe Leu Leu Lys Ile Pro Lys Tyr Ser Asp Ala Asn Thr Leu Asn
205 210 215

AAA GAT TTT TAT GAA GAA TTG CTT TAC ATT TTA GGG TTA GAA GAG CAA 722
Lys Asp Phe Tyr Glu Glu Leu Leu Tyr Ile Leu Gly Leu Glu Glu Gln
220 225 230

AAT GAC AAA GGG AAA ATT TTA ATC AAG CCC AGC CGC ACC CAA AAT TCC 770
Asn Asp Lys Gly Lys Ile Leu Ile Lys Pro Ser Arg Thr Gln Asn Ser
235 240 245 250

CTA AGC GAT GCT TTA AAA AAG GAA TAC AAA AAT TTA GAC GAT GAA GAA 818
Leu Ser Asp Ala Leu Lys Lys Glu Tyr Lys Asn Leu Asp Asp Glu Glu
255 260 265

GTC ATG GCG TTG CTC ATC GCT TGG AAT AAC CGC ATC TTG TTT TTA CGG 866
Val Met Ala Leu Leu Ile Ala Trp Asn Asn Arg Ile Leu Phe Leu Arg
270 275 280

CTT TTA GAA AGC CTT TTA ATT TCT TTT AAG CAT TTT GAA AAT CCT TTC 914
Leu Leu Glu Ser Leu Leu Ile Ser Phe Lys His Phe Glu Asn Pro Phe
285 290 295

TTA ACC ACA GAA AAC TTT GAA AAT TTC AAC GAT TTA AAC ACG CTC TTT 962
Leu Thr Thr Glu Asn Phe Glu Asn Phe Asn Asp Leu Asn Thr Leu Phe
300 305 310

TTT GAA GTC CTA GCC AAG AAA AAC AGC GAG CGC TTA CCA GAA ATT AAA 1010
Phe Glu Val Leu Ala Lys Lys Asn Ser Glu Arg Leu Pro Glu Ile Lys
315 320 325 330

GAA GAC AAG ATT TTA GAA AAA ATC CCT TAT TTG AAT TCC AGT TTG TTT 1058
Glu Asp Lys Ile Leu Glu Lys Ile Pro Tyr Leu Asn Ser Ser Leu Phe
335 340 345

GAT AAA ACG CCT TTA GAA TTA AAG GGG CAT GAA ATC AAG CTT TTA GAC 1106
Asp Lys Thr Pro Leu Glu Leu Lys Gly His Glu Ile Lys Leu Leu Asp
350 355 360

AAT AAA AAG CTA GAA ATC TAT AAA AAT TCC GTT CTC AAA AAA CAT AAA 1154
Asn Lys Lys Leu Glu Ile Tyr Lys Asn Ser Val Leu Lys Lys His Lys
365 370 375

GAT TAT CAA AAA GAA AAA CCT TTG CCC TTG CTA AAA TAC CTT TTT AAA 1202
Asp Tyr Gln Lys Glu Lys Pro Leu Pro Leu Leu Lys Tyr Leu Phe Lys
380 385 390

TTT TTG CGT CTT TAT AAA TTC ACC ACC ACC CCT AAA GAC ATT AAA GAT 1250
Phe Leu Arg Leu Tyr Lys Phe Thr Thr Thr Pro Lys Asp Ile Lys Asp
395 400 405 410

AAT ACC GAT ACC AGC GAA AGC CGT TTG ATT AAC CCT AGC GTT TTA GGG 1298
Asn Thr Asp Thr Ser Glu Ser Arg Leu Ile Asn Pro Ser Val Leu Gly
415 420 425
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121: 

AAGCGATGTA AGGAATTAAC ATG GAT TAT AAA AAA TTA GAT TTA CCC AAC 50

Met Asp Tyr Lys Lys Leu Asp Leu Pro Asn
1 5 10

ACA AAC TAC CCA AAT CAA GAG CAA CTG AAA GCT TTT GAA ACC GCT TTT 98

Thr Asn Tyr Pro Asn Gln Glu Gln Leu Lys Ala Phe Glu Thr Ala Phe
15 20 25

GAC GCC TTT TTA GAA ACC AAC CAA CAA GAA AAT GAA AAT CAC CAA AAC 146

Asp Ala Phe Leu Glu Thr Asn Gln Gln Glu Asn Glu Asn His Gln Asn
30 35 40

GAC GCT TTT AAT GAT TTA TTG AAA GGC GTT TTT AAA TAC AAG GTT AAG 194
Asp Ala Phe Asn Asp Leu Leu Lys Gly Val Phe Lys Tyr Lys Val Lys
45 50 55

CCC ACC AAA AAA ATA GAC AGC ACT ATT CTT AAT GAA AAT AAC GAA GTG 242

Pro Thr Lys Lys Ile Asp Ser Thr Ile Leu Asn Glu Asn Asn Glu Val
60 65 70

GAG GTG ATC ATT GAA TTT AAA GCC CTT AAA AAC CCC AAC GAA TTT ATT 290

Glu Val Ile Ile Glu Phe Lys Ala Leu Lys Asn Pro Asn Glu Phe Ile
75 80 85 90

AAA AAG GGC GAT TTG AAT GTT AAA GCC TTT CAT GAA AGC CTT TTG TCT 338

Lys Lys Gly Asp Leu Asn Val Lys Ala Phe His Glu Ser Leu Leu Ser
95 100 105

TAT CTC ACA GAA AGA AAA GAG GGT AAT AAC AAC CTT AAG CAT CTT ATC 386

Tyr Leu Thr Glu Arg Lys Glu Gly Asn Asn Asn Leu Lys His Leu Ile
110 115 120

TTA GCC ACT ATT AAA GAG CTT TAT ATC ATT GAT GCA AAC GAA TTT GAG 434
Leu Ala Thr Ile Lys Glu Leu Tyr Ile Ile Asp Ala Asn Glu Phe Glu
125 130 135 

GTT TTT AAT AAA GAT AAA GAA ATT GAA AAC GCC TTT AAA AAT TGC CAC 482

Val Phe Asn Lys Asp Lys Glu Ile Glu Asn Ala Phe Lys Asn Cys His
140 145 150

GAT AGA AAG GGT AAC GAT ACA CGC ACA AAA GCG TTT TAT GAT GCT TGC 530

Asp Arg Lys Gly Asn Asp Thr Arg Thr Lys Ala Phe Tyr Asp Ala Cys
155 160 165 170

CAA AAG CGC CTT AAT GAG TTT GAT CGT TCT TTG AAA TAC CAC TAT ATC 578
Gln Lys Arg Leu Asn Glu Phe Asp Arg Ser Leu Lys Tyr His Tyr Ile
175 180 185

CCC CTC AAA AAA GAA AAT TTA GCC CTA ATC TAT CAA GCC CTA AGC CCT 626

Pro Leu Lys Lys Glu Asn Leu Ala Leu Ile Tyr Gln Ala Leu Ser Pro
190 195 200 

AAT TTT TTG CTC AAA ATT CCA AAA TAT TCT GAC GCT AAC ACG CTT AAC 674 
100 105 110

Phe Lys Asp Leu Gln Tyr Gln Leu Glu Lys Ala Leu Glu Asp Ser His

115 120 125

Phe Glu Ala Phe Gln Lys Arg Leu Asn Ser Val Leu Met Gly Gln Val

130 135 140

Ile Leu Val Asp His Asn Gln Asn Thr Phe Ile Glu Ile Glu Gln Gln
145 150 155 160

Phe Gln Gly Val Leu Ser Met Arg His Arg Ile Lys Gly Glu Ser Phe

165 170 175

Lys Val Gly Asp Ser Ile Lys Ala Val Leu Thr Gln Val Lys Arg Thr

180 185 190

Lys Lys Gly Leu Leu Leu Glu Leu Ser Arg Thr Thr Pro Lys Met Leu

195 200 205

Glu Ala Leu Leu Glu Leu Glu Val Pro Glu Ile Lys Asp Lys Glu Ile

210 215 220

Glu Ile Ile His Cys Ala Arg Ile Pro Gly Asn Arg Ala Lys Val Ser
225 230 235 240

Phe Phe Ser His Asn Ala Arg Ile Asp Pro Ile Gly Ala Ala Val Gly

245 250 255

Val Lys Gly Val Arg Ile Asn Ala Ile Ser Asn Glu Leu Asn Lys Glu

260 265 270

Asn Ile Asp Cys Ile Glu Tyr Ser Asn Val Pro Glu Ile Tyr Ile Thr

275 280 285

Leu Ala Leu Ala Pro Ala Lys Ile Leu Ser Val Glu Ile Lys Lys Ile

290 295 300

Pro Ile Glu Glu Leu Asn Ala Glu Glu Lys Glu Ser Ile Gln Glu Arg
305 310 315 320

Phe Ile Val Asn Asn His Leu Gln Lys Ala Lys Val Arg Leu Leu Asp

325 330 335

Ile Glu Lys Ser Lys Ala Ile Gly Lys Gly Gly Val Asn Val Cys Leu

340 345 350

Ala Ser Met Leu Thr Gly Tyr His Ile Glu Phe Glu Thr Ile Pro Ser

355 360 365 

Val Lys Glu Asn Ala Glu Asn Glu Ser Glu Lys Glu Thr Pro Lys Val 

370 375 380 

Gly Val Glu Ala Leu Glu Ser Leu Phe Lys Asn 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 1121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3903 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 21...3857
(D) OTHER INFORMATION: 
TAC ATC ACT CTC GCA CTC GCT CCA GCC AAA ATT TTA AGC GTT GAA ATC 915
Tyr Ile Thr Leu Ala Leu Ala Pro Ala Lys Ile Leu Ser Val Glu Ile
290 295 300

AAA AAA ATC CCT ATA GAA GAA TTG AAT GCT GAA GAA AAA GAA TCC ATT 963

Lys Lys Ile Pro Ile Glu Glu Leu Asn Ala Glu Glu Lys Glu Ser Ile
305 310 315

CAA GAG CGT TTT ATC GTC AAT AAC CAT TTG CAA AAG GCT AAA GTG CGT 1011
Gln Glu Arg Phe Ile Val Asn Asn His Leu Gln Lys Ala Lys Val Arg
320 325 330

TTA TTG GAC ATT GAA AAA TCT AAG GCT ATC GGT AAG GGC GGG GTG AAT 1059

Leu Leu Asp Ile Glu Lys Ser Lys Ala Ile Gly Lys Gly Gly Val Asn
335 340 345

GTG TGC TTA GCG TCC ATG CTT ACA GGC TAT CAC ATA GAG TTT GAA ACC 1107
Val Cys Leu Ala Ser Met Leu Thr Gly Tyr His Ile Glu Phe Glu Thr
350 355 360 365

ATT CCT AGC GTG AAA GAA AAC GCA GAA AAT GAA AGC GAA AAA GAA ACG 1155

Ile Pro Ser Val Lys Glu Asn Ala Glu Asn Glu Ser Glu Lys Glu Thr
370 375 380

CCA AAA GTG GGG GTA GAA GCT TTA GAG TCT TTG TTT AAG AAT TAAGGGTAT 1206
Pro Lys Val Gly Val Glu Ala Leu Glu Ser Leu Phe Lys Asn
385 390 395

CTAAAATTCA ATCTCTAAAA AAGCTTTTAA CT 1238

(2) INFORMATION FOR SEQ ID NO: 1120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 395 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120:

Met Glu Lys Ile Ser Asp Leu Ile Glu Cys Ile Ala Tyr Glu Lys Asn

1 5 10 15

Leu Pro Lys Glu Met Ile Ser Lys Val Ile Gln Gly Cys Leu Leu Lys

20 25 30

Met Ala Gln Asn Glu Leu Asp Pro Leu Ala Arg Tyr Leu Val Val Glu

35 40 45

Glu Asn Lys Gln Leu Gln Leu Ile Gln Leu Val Glu Val Leu Glu Asp

50 55 60

Gly Asp Glu Arg Leu Val Asn Asp Pro Ser Lys Tyr Ile Ser Leu Ser

65 70 75 80

Lys Ala Lys Glu Met Asp Pro Ser Val Lys Ile Lys Asp Glu Leu Ser

85 90 95

Tyr Ser Leu Ser Leu Glu Ser Met Lys Gln Gly Ala Ile Asn Arg Leu
TTA GAA GAT GGT GAT GAA AGA TTG GTT AAC GAC CCT TCT AAA TAC ATC 243

Leu Glu Asp Gly Asp Glu Arg Leu Val Asn Asp Pro Ser Lys Tyr Ile
65 70 75 

AGC CTG TCT AAA GCC AAA GAA ATG GAT CCA AGC GTT AAG ATT AAA GAC 291

Ser Leu Ser Lys Ala Lys Glu Met Asp Pro Ser Val Lys Ile Lys Asp
80 85 90

GAA TTG TCC TAT AGC TTG AGT TTG GAG AGC ATG AAA CAA GGA GCG ATC 339

Glu Leu Ser Tyr Ser Leu Ser Leu Glu Ser Met Lys Gln Gly Ala Ile
95 100 105

AAC CGC CTT TTT AAA GAT TTG CAA TAC CAG TTA GAA AAA GCG TTA GAA 387
Asn Arg Leu Phe Lys Asp Leu Gln Tyr Gln Leu Glu Lys Ala Leu Glu
110 115 120 125

GAC AGC CAC TTT GAA GCG TTT CAA AAG CGT CTT AAC AGC GTT TTA ATG 435
Asp Ser His Phe Glu Ala Phe Gln Lys Arg Leu Asn Ser Val Leu Met
130 135 140

GGG CAA GTG ATT TTA GTG GAT CAC AAC CAA AAC ACC TTT ATT GAG ATT 483

Gly Gln Val Ile Leu Val Asp His Asn Gln Asn Thr Phe Ile Glu Ile
145 150 155

GAG CAG CAA TTT CAG GGC GTT CTT TCC ATG CGC CAT CGC ATC AAG GGC 531
Glu Gln Gln Phe Gln Gly Val Leu Ser Met Arg His Arg Ile Lys Gly
160 165 170

GAG AGT TTT AAA GTG GGC GAT AGC ATT AAA GCG GTT TTA ACG CAA GTC 579

Glu Ser Phe Lys Val Gly Asp Ser Ile Lys Ala Val Leu Thr Gln Val
175 180 185

AAA CGC ACG AAA AAA GGC TTA TTA TTA GAG CTG AGC CGC ACC ACC CCT 627

Lys Arg Thr Lys Lys Gly Leu Leu Leu Glu Leu Ser Arg Thr Thr Pro
190 195 200 205

AAA ATG CTT GAA GCT TTG TTG GAA TTG GAA GTC CCT GAA ATT AAA GAC 675
Lys Met Leu Glu Ala Leu Leu Glu Leu Glu Val Pro Glu Ile Lys Asp
210 215 220

AAA GAA ATT GAA ATC ATC CAT TGT GCG CGA ATC CCA GGC AAC AGA GCG 723

Lys Glu Ile Glu Ile Ile His Cys Ala Arg Ile Pro Gly Asn Arg Ala
225 230 235

AAA GTG AGC TTT TTT TCC CAT AAC GCT AGG ATT GAC CCC ATA GGC GCG 771
Lys Val Ser Phe Phe Ser His Asn Ala Arg Ile Asp Pro Ile Gly Ala
240 245 250

GCT GTG GGG GTT AAG GGC GTG CGC ATT AAT GCG ATC AGT AAC GAA TTG 819
Ala Val Gly Val Lys Gly Val Arg Ile Asn Ala Ile Ser Asn Glu Leu
255 260 265

AAT AAA GAA AAC ATT GAT TGC ATA GAA TAT TCT AAT GTG CCT GAA ATT 867
Asn Lys Glu Asn Ile Asp Cys Ile Glu Tyr Ser Asn Val Pro Glu Ile
270 275 280 285
Leu Lys Asn His Pro Leu Tyr Arg Val Leu Arg Val Gly Lys Ile Thr

245 250 255

Leu Thr Leu Leu Phe Cys Ser Leu Lys Ala Trp Ile Asn His Gln Glu

260 265 270

Asp Ile Thr Ile His Ala Leu Leu Asn Gln Thr Lys Asp Ala Leu Leu

275 280 285

Gln Lys Ala Leu Lys Leu Tyr Ala Leu Leu Lys Pro Leu Glu Leu Asn

290 295 300

Val Ser Ile Ala Ser Ser Phe Ser Lys Ile Gly Asn Leu Phe Gly Arg
305 310 315 320

Glu Leu Glu Ser Phe Cys Val Lys Ile Gln Pro Lys Asn Thr Arg Ala

325 330 335

Leu Asn Ser Glu Lys Leu Tyr Leu Lys Leu Phe Gln Lys Gly Val Ile

340 345 350

Ala Arg Ile Ser Cys Glu Phe Val Cys Phe Glu Val Phe Ser Leu Asn

355 360 365

Glu Lys Asp Phe Glu Lys Ile Ala Leu Val Leu Glu Glu Ile Leu Asn

370 375 380
Lys Ala
385



(2) INFORMATION FOR SEQ ID NO: 1119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1238 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 13...1197
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119: 

AGGAACTTAA GA ATG GAA AAA ATC AGC GAT CTT ATA GAA TGC ATT GCG TAT 51 
Met Glu Lys Ile Ser Asp Leu Ile Glu Cys Ile Ala Tyr
1 5 10

GAA AAA AAT TTG CCT AAA GAG ATG ATT TCA AAA GTG ATT CAA GGC TGT 99
Glu Lys Asn Leu Pro Lys Glu Met Ile Ser Lys Val Ile Gln Gly Cys
15 20 25

TTG TTA AAA ATG GCG CAA AAT GAG TTA GAC CCC CTA GCA CGC TAC TTG 147

Leu Leu Lys Met Ala Gln Asn Glu Leu Asp Pro Leu Ala Arg Tyr Leu
30 35 40 45

GTG GTT GAA GAA AAC AAG CAG CTC CAG CTT ATC CAG TTG GTA GAA GTT 195
Val Val Glu Glu Asn Lys Gln Leu Gln Leu Ile Gln Leu Val Glu Val
50 55 60 
Ser Glu Lys Leu Tyr Leu Lys Leu Phe Gln Lys Gly Val Ile Ala Arg
340 345 350

ATT TCA TGC GAA TTC GTG TGC TTT GAA GTC TTT AGC TTG AAT GAA AAA 1161
Ile Ser Cys Glu Phe Val Cys Phe Glu Val Phe Ser Leu Asn Glu Lys
355 360 365 370

GAT TTT GAA AAA ATC GCT CTG GTT TTA GAA GAA ATT CTT AAT AAA GCT T 1210
Asp Phe Glu Lys Ile Ala Leu Val Leu Glu Glu Ile Leu Asn Lys Ala
375 380 385

AAAAATTCGC TATAATA 1227

(2) INFORMATION FOR SEQ ID NO: 1118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118: 

Met Ala Lys Glu Thr Leu Glu Ile Thr Pro Asp Leu Leu Lys Asn Pro

1 5 10 15

Tyr Gln Lys Ile Ile Asn Ala Ser Ala Ser Val Phe Asp Glu Lys His

20 25 30

Gly Arg Ser Phe Phe Ser Thr Gln Phe Tyr Glu Lys Ile Glu Pro Tyr

35 40 45

Leu Lys Glu Val Leu Thr His Pro Ile Asp Leu Glu Cys Asp Leu Asn

50 55 60

Thr Ala Lys Lys Lys Asn Arg Leu Thr Pro Leu Lys Gln Leu Phe Lys
65 70 75 80

Ala Cys Phe Asn Thr Glu Glu Ile Leu Ile Val Asn Asn Asn Thr Ser

85 90 95

Ala Ile Phe Leu Ile Ala Asn Ala Leu Ala Gln Glu Lys Glu Ile Ile

100 105 110

Val Ser Tyr Gly Glu Leu Val Gly Gly Asp Phe Asn Leu Lys Asp Ile

115 120 125

Leu Leu Asn Ser Gly Ala Arg Leu His Leu Val Gly Asn Ile Asn Arg

130 135 140

Ala Tyr Leu Arg Asp Tyr Arg Leu Ala Leu Asn Glu Asn Ser Lys Ile
145 150 155 160

Leu Phe Lys Thr His Asn Pro His Phe Lys Lys Asp Thr Pro Phe Lys

165 170 175

Asp Leu Gln Thr Leu Ala Lys Glu His Asp Leu Ile Asp Tyr Tyr Asn

180 185 190

Leu Gly Asp Val Asp Leu Ser Asn Arg Val Ala Leu Glu Glu Ile Leu

195 200 205

Ala Leu Lys Pro Ser Leu Leu Ser Phe Ser Ala Asp Lys Phe Phe Asn

210 215 220

Ser Ala Gln Ala Gly Ile Ile Met Gly Gln Lys Glu Arg Val Glu Ala
225 230 235 240 
115 120 125 130

AAT AGT GGG GCT AGG CTG CAT TTA GTG GGG AAT ATT AAT CGC GCT TAT 489

Asn Ser Gly Ala Arg Leu His Leu Val Gly Asn Ile Asn Arg Ala Tyr
135 140 145

TTA AGG GAT TAC CGC TTA GCC TTG AAT GAA AAC AGC AAA ATA CTC TTT 537
Leu Arg Asp Tyr Arg Leu Ala Leu Asn Glu Asn Ser Lys Ile Leu Phe
150 155 160

AAA ACC CAC AAC CCC CAT TTT AAA AAA GAC ACG CCC TTT AAA GAT TTA 585

Lys Thr His Asn Pro His Phe Lys Lys Asp Thr Pro Phe Lys Asp Leu
165 170 175

CAA ACT CTT GCT AAA GAG CAT GAT CTC ATT GAT TAT TAC AAT TTA GGG 633

Gln Thr Leu Ala Lys Glu His Asp Leu Ile Asp Tyr Tyr Asn Leu Gly
180 185 190

GAT GTG GAT TTG TCA AAC AGA GTG GCT TTG GAA GAA ATT TTA GCC CTA 681
Asp Val Asp Leu Ser Asn Arg Val Ala Leu Glu Glu Ile Leu Ala Leu
195 200 205 210

AAA CCA TCG CTT TTA AGC TTT AGC GCG GAT AAA TTC TTT AAC AGT GCG 729

Lys Pro Ser Leu Leu Ser Phe Ser Ala Asp Lys Phe Phe Asn Ser Ala
215 220 225

CAA GCG GGC ATT ATT ATG GGG CAA AAA GAA CGG GTT GAA GCG TTA AAA 777
Gln Ala Gly Ile Ile Met Gly Gln Lys Glu Arg Val Glu Ala Leu Lys
230 235 240

AAC CAC CCC CTT TAT AGA GTT TTA AGG GTG GGT AAA ATC ACG CTC ACC 825

Asn His Pro Leu Tyr Arg Val Leu Arg Val Gly Lys Ile Thr Leu Thr
245 250 255

TTG CTT TTT TGC AGC CTA AAA GCA TGG ATA AAT CAT CAA GAA GAC ATT 873

Leu Leu Phe Cys Ser Leu Lys Ala Trp Ile Asn His Gln Glu Asp Ile
260 265 270

ACA ATC CAT GCG TTA TTG AAC CAA ACT AAA GAC GCA TTA TTG CAA AAA 921
Thr Ile His Ala Leu Leu Asn Gln Thr Lys Asp Ala Leu Leu Gln Lys
275 280 285 290

GCC CTC AAA CTC TAC GCT CTT TTA AAG CCT TTA GAA TTG AAT GTG AGC 969

Ala Leu Lys Leu Tyr Ala Leu Leu Lys Pro Leu Glu Leu Asn Val Ser
295 300 305

ATA GCC TCT AGC TTT TCT AAA ATA GGG AAT TTG TTT GGT AGG GAA TTA 1017
Ile Ala Ser Ser Phe Ser Lys Ile Gly Asn Leu Phe Gly Arg Glu Leu
310 315 320

GAA TCC TTT TGC GTG AAA ATC CAG CCC AAA AAC ACC CGT GCT TTA AAT 1065
Glu Ser Phe Cys Val Lys Ile Gln Pro Lys Asn Thr Arg Ala Leu Asn
325 330 335 

AGT GAG AAA CTT TAT TTA AAG CTT TTC CAA AAA GGC GTT ATC GCA AGG 1113 
115

(2) INFORMATION FOR SEQ ID NO: 1117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 52...1209
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117: 

TAAAATAACG CTTATTTTAA ACTCTCAAAA AAGGAATCAA ACGCACTCAT C ATG GCT 57 

Met Ala 
1 

AAA GAA ACG CTT GAA ATA ACC CCG GAT CTT TTG AAA AAC CCT TAT CAA 105

Lys Glu Thr Leu Glu Ile Thr Pro Asp Leu Leu Lys Asn Pro Tyr Gln
5 10 15

AAA ATC ATC AAT GCG AGC GCG AGC GTT TTT GAT GAA AAG CAT GGG CGA 153
Lys Ile Ile Asn Ala Ser Ala Ser Val Phe Asp Glu Lys His Gly Arg
20 25 30

TCG TTT TTT AGC ACG CAA TTT TAT GAA AAA ATT GAA CCT TAT TTA AAA 201

Ser Phe Phe Ser Thr Gln Phe Tyr Glu Lys Ile Glu Pro Tyr Leu Lys
35 40 45 50

GAA GTT TTA ACC CAT CCC ATT GAT TTA GAA TGC GAT CTA AAC ACC GCT 249

Glu Val Leu Thr His Pro Ile Asp Leu Glu Cys Asp Leu Asn Thr Ala
55 60 65

AAA AAA AAG AAC CGC TTA ACC CCT TTA AAA CAG CTT TTT AAA GCG TGT 297

Lys Lys Lys Asn Arg Leu Thr Pro Leu Lys Gln Leu Phe Lys Ala Cys
70 75 80

TTT AAC ACC GAA GAA ATT TTG ATT GTG AAT AAT AAC ACC AGC GCG ATT 345

Phe Asn Thr Glu Glu Ile Leu Ile Val Asn Asn Asn Thr Ser Ala Ile
85 90 95

TTC CTC ATC GCT AAC GCT TTA GCG CAA GAA AAA GAA ATC ATT GTT TCT 393

Phe Leu Ile Ala Asn Ala Leu Ala Gln Glu Lys Glu Ile Ile Val Ser
100 105 110

TAT GGC GAA TTA GTG GGG GGG GAT TTT AAC CTT AAA GAT ATT TTA TTA 441
Tyr Gly Glu Leu Val Gly Gly Asp Phe Asn Leu Lys Asp Ile Leu Leu
10 15 20

CGC TTA AAA CCC CAA AAA ATA AGC GTG AAT TTG GAT CTT TTC TAC ACG 150

Arg Leu Lys Pro Gln Lys Ile Ser Val Asn Leu Asp Leu Phe Tyr Thr
25 30 35 40

CAA TTA CCC AAT AAG GTT TAT TTA GAC TAC ATG GAA ATT CAA GAG CTT 198
Gln Leu Pro Asn Lys Val Tyr Leu Asp Tyr Met Glu Ile Gln Glu Leu
45 50 55

ATT CAA AAG ATG ATG CAA GAA AAC CAA TAC CTT CTC ATT GAA GAC GCC 246

Ile Gln Lys Met Met Gln Glu Asn Gln Tyr Leu Leu Ile Glu Asp Ala
60 65 70

CTG AAA GAT TTG AGC CAT GCT TTA AAA ACG CGC TAC AAG GAG ATC ACT 294

Leu Lys Asp Leu Ser His Ala Leu Lys Thr Arg Tyr Lys Glu Ile Thr
75 80 85

GAA CTT TAT TTA AAA ATC AGC AAG TTA GAG ATT TCT CCC AAT TCT CAA 342

Glu Leu Tyr Leu Lys Ile Ser Lys Leu Glu Ile Ser Pro Asn Ser Gln
90 95 100

GTG GGA GCG AGC GTG AAA ATC CGC TAT GAA AGC AAT CTT TAGCCTCTTT TT 393
Val Gly Ala Ser Val Lys Ile Arg Tyr Glu Ser Asn Leu
105 110 115

CCTTCTTATT G 404

(2) INFORMATION FOR SEQ ID NO: 1116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116:

Met Lys Thr Lys Gln Gly Val His Ile His Asn Leu Val Phe Glu Ala

1 5 10 15

Ile Leu Gly Ile Leu Glu Phe Glu Arg Leu Lys Pro Gln Lys Ile Ser

20 25 30

Val Asn Leu Asp Leu Phe Tyr Thr Gln Leu Pro Asn Lys Val Tyr Leu

35 40 45

Asp Tyr Met Glu Ile Gln Glu Leu Ile Gln Lys Met Met Gln Glu Asn

50 55 60

Gln Tyr Leu Leu Ile Glu Asp Ala Leu Lys Asp Leu Ser His Ala Leu
65 70 75 80

Lys Thr Arg Tyr Lys Glu Ile Thr Glu Leu Tyr Leu Lys Ile Ser Lys

85 90 95

Leu Glu Ile Ser Pro Asn Ser Gln Val Gly Ala Ser Val Lys Ile Arg

100 105 110 

Tyr Glu Ser Asn Leu 
35 40 45

Phe Leu Thr Asn Ile Asn Val Ile Phe Thr Leu Leu Gly Tyr Leu Ile

50 55 60

Gly Gly Ile Pro Phe Gly Tyr Ala Leu Met Lys Ile Phe Tyr Gly Met
65 70 75 80

Asp Ile Thr Lys Ile Gly Ser Gly Gly Ile Gly Ala Thr Asn Val Leu

85 90 95

Arg Ala Leu Gln Ser Lys Gly Val Ser Asn Ala Lys Gln Met Ala Leu

100 105 110

Leu Val Leu Ile Leu Asp Leu Phe Lys Gly Met Phe Ala Val Phe Leu

115 120 125

Ser Lys Leu Phe Gly Leu Asp Tyr Ser Leu Gln Trp Met Val Ala Ile

130 135 140

Ala Ser Ile Leu Gly His Cys Tyr Ser Pro Phe Leu Asn Phe Asn Gly
145 150 155 160

Gly Lys Gly Val Ser Thr Ile Met Gly Ser Val Val Leu Leu Ile Pro

165 170 175

Ile Glu Ser Leu Ile Gly Leu Thr Val Trp Phe Phe Val Gly Lys Val

180 185 190

Leu Lys Ile Ser Ser Leu Ala Ser Ile Leu Gly Val Gly Thr Ala Thr

195 200 205

Val Leu Ile Phe Phe Val Pro Tyr Met His Ile Pro Asp Ser Val Asn

210 215 220

Ile Leu Lys Glu Val Gly Thr Gln Thr Pro Met Val Leu Ile Phe Ile
225 230 235 240

Phe Thr Leu Ile Lys His Ala Gly Asn Ile Phe Asn Leu Leu Ala Gly

245 250 255 

Lys Glu Lys Lys Val Leu 
260 

(2) INFORMATION FOR SEQ ID NO: 1115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...381
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115: 

TTTATTGGCC GGCAAGGAAA AGAAAGTCTT ATG AAA ACT AAA CAA GGC GTT CAT 54 

Met Lys Thr Lys Gln Gly Val His
1 5 

ATC CAT AAC TTG GTG TTT GAG GCG ATT TTG GGG ATT TTA GAA TTT GAA 102 
Ile His Asn Leu Val Phe Glu Ala Ile Leu Gly Ile Leu Glu Phe Glu
GCA GTA TTT TTG AGC AAA TTG TTT GGG TTG GAT TAT AGT TTG CAA TGG 433

Ala Val Phe Leu Ser Lys Leu Phe Gly Leu Asp Tyr Ser Leu Gln Trp
125 130 135 140

ATG GTC GCT ATC GCT AGC ATT TTA GGG CAT TGC TAT TCG CCT TTT TTG 481

Met Val Ala Ile Ala Ser Ile Leu Gly His Cys Tyr Ser Pro Phe Leu
145 150 155

AAT TTC AAT GGA GGT AAG GGC GTT TCT ACG ATC ATG GGC TCT GTG GTG 529

Asn Phe Asn Gly Gly Lys Gly Val Ser Thr Ile Met Gly Ser Val Val
160 165 170

TTG CTC ATC CCT ATT GAA AGT CTC ATC GGC TTA ACG GTG TGG TTT TTT 577

Leu Leu Ile Pro Ile Glu Ser Leu Ile Gly Leu Thr Val Trp Phe Phe
175 180 185

GTG GGT AAG GTG CTT AAA ATC TCT TCA CTC GCT AGC ATT CTA GGG GTA 625

Val Gly Lys Val Leu Lys Ile Ser Ser Leu Ala Ser Ile Leu Gly Val
190 195 200

GGC ACA GCG ACT GTT CTT ATC TTT TTT GTG CCT TAT ATG CAT ATC CCA 673
Gly Thr Ala Thr Val Leu Ile Phe Phe Val Pro Tyr Met His Ile Pro
205 210 215 220

GAC AGC GTC AAT ATC CTT AAA GAA GTC GGC ACG CAA ACG CCG ATG GTG 721
Asp Ser Val Asn Ile Leu Lys Glu Val Gly Thr Gln Thr Pro Met Val
225 230 235

CTT ATT TTT ATT TTC ACC CTT ATC AAG CAT GCG GGT AAT ATT TTT AAT 769

Leu Ile Phe Ile Phe Thr Leu Ile Lys His Ala Gly Asn Ile Phe Asn
240 245 250 

TTA TTG GCC GGC AAG GAA AAG AAA GTC TTA TGAAAACTAA ACAAGGCGTT CAT 822 
Leu Leu Ala Gly Lys Glu Lys Lys Val Leu 
255 260 



(2) INFORMATION FOR SEQ ID NO: 1114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 262 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114: 

Met Ala Arg Ser Phe Lys His Ser Gln Tyr Pro Lys Ile Phe Lys Pro

1 5 10 15

Leu Tyr Pro Asn Asn Leu Thr Leu Ser Leu Lys Lys Gln His Val Ile

20 25 30

Met Ile Ala Ile Leu Phe Glu Arg Val Phe Met Glu Ser Val Leu Asn
325 330 335

Ala Leu Leu Trp Ile Glu Asn Ser
340 

(2) INFORMATION FOR SEQ ID NO: 1113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 14...799
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113: 

ACGATTTTAA AAA ATG GCT AGA AGT TTC AAG CAT TCT CAA TAT CCT AAA 49

Met Ala Arg Ser Phe Lys His Ser Gln Tyr Pro Lys
1 5 10

ATT TTT AAG CCA CTA TAC CCT AAC AAC TTA ACG CTT TCA CTT AAA AAG 97
Ile Phe Lys Pro Leu Tyr Pro Asn Asn Leu Thr Leu Ser Leu Lys Lys
15 20 25

CAA CAT GTT ATA ATG ATC GCT ATT TTA TTT GAA AGG GTA TTT ATG GAA 145
Gln His Val Ile Met Ile Ala Ile Leu Phe Glu Arg Val Phe Met Glu
30 35 40

AGC GTT TTA AAT TTC CTA ACC AAT ATC AAT GTG ATT TTC ACC CTT TTG 193
Ser Val Leu Asn Phe Leu Thr Asn Ile Asn Val Ile Phe Thr Leu Leu
45 50 55 60

GGC TAT TTG ATT GGG GGG ATT CCT TTT GGC TAT GCG TTA ATG AAA ATC 241
Gly Tyr Leu Ile Gly Gly Ile Pro Phe Gly Tyr Ala Leu Met Lys Ile
65 70 75

TTT TAC GGC ATG GAT ATT ACT AAA ATC GGA TCG GGG GGC ATT GGC GCA 289

Phe Tyr Gly Met Asp Ile Thr Lys Ile Gly Ser Gly Gly Ile Gly Ala
80 85 90

ACG AAT GTC TTG CGT GCT TTA CAA AGT AAG GGC GTG AGT AAC GCT AAA 337

Thr Asn Val Leu Arg Ala Leu Gln Ser Lys Gly Val Ser Asn Ala Lys
95 100 105

CAA ATG GCC CTA TTA GTT TTA ATC TTG GAT CTC TTC AAA GGC ATG TTT 385

Gln Met Ala Leu Leu Val Leu Ile Leu Asp Leu Phe Lys Gly Met Phe
110 115 120 
AAGATTATAC CAGCCTTTGA ACGCTTATTC TTACAACAGC GATTC 1108



(2) INFORMATION FOR SEQ ID NO: 1112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 344 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112:

Met Arg Leu Tyr Glu Ser Leu Leu Glu Met Cys Leu Asn Lys Ala Trp 

1 5 10 15

Glu His Gln Thr Leu Ala Leu Glu Asn Pro Ser Val Ala Cys Met Val

20 25 30

Leu Asp Lys Asn His Glu Ile Leu Ser Leu Glu Thr His Lys Lys Ala

35 40 45

Lys Thr Pro His Ala Glu Val Leu Ala Ala Gln Ser Ala Leu Lys Ile

50 55 60 

Leu Arg Pro Ser Leu Lys Asn Asp Leu Glu Lys Leu Glu Asp Pro Lys 
65 70 75 80 

Thr Leu Ser Asp Phe Leu Lys Thr His His Asp Asn Ala Phe Thr Asp 

85 90 95 

Cys Val Phe Leu Ile Thr Leu Glu Pro Cys Asn Ser Tyr Gly Lys Thr

100 105 110

Pro Ala Cys Ser Glu Leu Leu Glu Ile Leu Lys Pro Lys Arg Val Val

115 120 125

Ile Ala Thr Glu Glu Asn Glu Ala Lys Lys Gly Gly Leu Ala Arg Leu

130 135 140

Gln Lys Ala Arg Ile Glu Thr Ile Ile Cys His Asn Leu Glu Asn Lys
145 150 155 160

Ala Lys Asp Leu Leu Leu Pro Phe Arg Val Met Glu Gln Lys Gly Arg

165 170 175

Phe Asn Leu Phe Lys Leu Ala Leu Arg Met Asn Gly Asp Tyr His His

180 185 190

Gly Lys Ile Thr Gly Gln Lys Ser Val Ile Phe Thr His Asn Gln Arg

195 200 205

Ala Ile Cys Asp Thr Leu Ile Val Ser Gly Lys Thr Ile Arg Thr Asp

210 215 220

Asn Pro Leu Leu Asp Ala Arg Phe Cys Asp Ser Phe Tyr Gln Asn Lys
225 230 235 240

Asn Pro Asn Ile Ala Ile Leu Ser Lys Arg Ser Ile Asp Pro Asn Ser

245 250 255

Lys Val Phe Ser Ala Pro Asn Arg Leu Val Asn Thr Phe His Asp Pro

260 265 270

Lys Asp Leu Pro Leu Glu Lys Gly Phe Asn Phe Ile Glu Gly Gly Trp

275 280 285

Glu Leu Phe Glu Ser Leu Arg Asp Lys Ile Asp Ala Leu Leu Leu His

290 295 300

Ser His Ala Ser Met Ile Gly Glu Ala Phe Lys Ala Leu Ala Leu Lys
305 310 315 320

Thr Pro Phe Lys Gly Arg Leu Leu His Ala Gln Ile Leu Glu Asn Glu
Ile Leu Lys Pro Lys Arg Val Val Ile Ala Thr Glu Glu Asn Glu Ala
125 130 135

AAA AAA GGG GGT TTA GCA AGG CTA CAA AAG GCT CGT ATT GAA ACA ATA 486

Lys Lys Gly Gly Leu Ala Arg Leu Gln Lys Ala Arg Ile Glu Thr Ile
140 145 150

ATT TGC CAC AAT TTA GAA AAC AAA GCT AAA GAC TTG CTC TTG CCT TTT 534

Ile Cys His Asn Leu Glu Asn Lys Ala Lys Asp Leu Leu Leu Pro Phe
155 160 165

AGG GTA ATG GAA CAA AAG GGG CGT TTT AAT TTG TTC AAA CTC GCT TTA 582

Arg Val Met Glu Gln Lys Gly Arg Phe Asn Leu Phe Lys Leu Ala Leu
170 175 180

AGA ATG AAT GGG GAT TAC CAT CAT GGC AAG ATC ACC GGG CAA AAA AGC 630

Arg Met Asn Gly Asp Tyr His His Gly Lys Ile Thr Gly Gln Lys Ser
185 190 195 200

GTT ATT TTC ACG CAC AAC CAG CGA GCA ATA TGC GAC ACG CTT ATT GTT 678

Val Ile Phe Thr His Asn Gln Arg Ala Ile Cys Asp Thr Leu Ile Val
205 210 215

TCT GGG AAA ACC ATA AGA ACG GAC AAC CCC TTA TTG GAC GCT CGC TTT 726
Ser Gly Lys Thr Ile Arg Thr Asp Asn Pro Leu Leu Asp Ala Arg Phe
220 225 230

TGC GAC AGC TTT TAT CAA AAT AAA AAC CCC AAT ATC GCT ATT TTA TCC 774

Cys Asp Ser Phe Tyr Gln Asn Lys Asn Pro Asn Ile Ala Ile Leu Ser
235 240 245

AAG CGC TCA ATT GAC CCT AAT TCA AAA GTT TTT TCT GCG CCT AAT CGT 822
Lys Arg Ser Ile Asp Pro Asn Ser Lys Val Phe Ser Ala Pro Asn Arg
250 255 260

TTA GTT AAC ACT TTC CAT GAC CCC AAA GAT TTA CCC CTA GAG AAG GGG 870

Leu Val Asn Thr Phe His Asp Pro Lys Asp Leu Pro Leu Glu Lys Gly
265 270 275 280

TTT AAT TTC ATT GAA GGG GGG TGG GAA TTG TTT GAG AGC TTG AGG GAT 918
Phe Asn Phe Ile Glu Gly Gly Trp Glu Leu Phe Glu Ser Leu Arg Asp
285 290 295

AAA ATA GAC GCG TTG CTT TTG CAT TCG CAT GCG TCT ATG ATT GGC GAA 966

Lys Ile Asp Ala Leu Leu Leu His Ser His Ala Ser Met Ile Gly Glu
300 305 310

GCG TTT AAG GCA CTC GCT CTA AAA ACC CCT TTT AAA GGA CGG TTG TTG 1014
Ala Phe Lys Ala Leu Ala Leu Lys Thr Pro Phe Lys Gly Arg Leu Leu
315 320 325

CAT GCG CAA ATC TTA GAA AAT GAA GCC CTT TTA TGG ATA GAA AAC TCT T 1063
His Ala Gln Ile Leu Glu Asn Glu Ala Leu Leu Trp Ile Glu Asn Ser
330 335 340 
Glu Lys Ile Ser Ala Ile
100 

(2) INFORMATION FOR SEQ ID NO: 1111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...1062
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111: 

AGGCTTTTTG CTCTTGCCTT TTTTCCCATC ATG AGA CTT TAT GAG AGT TTA TTA 54

Met Arg Leu Tyr Glu Ser Leu Leu 
1 5 

GAA ATG TGC TTG AAT AAG GCA TGG GAG CAT CAA ACC CTA GCC TTA GAA 102 
Glu Met Cys Leu Asn Lys Ala Trp Glu His Gln Thr Leu Ala Leu Glu
10 15 20

AAC CCA AGC GTA GCT TGC ATG GTG TTG GAT AAA AAC CAT GAG ATC TTG 150

Asn Pro Ser Val Ala Cys Met Val Leu Asp Lys Asn His Glu Ile Leu
25 30 35 40

AGT TTA GAA ACC CAC AAA AAA GCC AAA ACC CCG CAT GCA GAA GTC TTA 198
Ser Leu Glu Thr His Lys Lys Ala Lys Thr Pro His Ala Glu Val Leu
45 50 55

GCC GCC CAA TCA GCG CTA AAG ATT TTA CGC CCC AGT TTG AAA AAC GAT 246

Ala Ala Gln Ser Ala Leu Lys Ile Leu Arg Pro Ser Leu Lys Asn Asp
60 65 70

TTA GAA AAG TTA GAA GAC CCT AAA ACT TTA AGC GAT TTT TTA AAA ACG 294

Leu Glu Lys Leu Glu Asp Pro Lys Thr Leu Ser Asp Phe Leu Lys Thr
75 80 85

CAC CAC GAT AAC GCT TTT ACA GAC TGC GTT TTT TTA ATC ACC TTA GAG 342
His His Asp Asn Ala Phe Thr Asp Cys Val Phe Leu Ile Thr Leu Glu
90 95 100

CCA TGC AAT TCT TAT GGC AAA ACC CCG GCT TGT AGC GAA TTG TTA GAA 390

Pro Cys Asn Ser Tyr Gly Lys Thr Pro Ala Cys Ser Glu Leu Leu Glu
105 110 115 120

ATT TTA AAG CCT AAA AGA GTG GTC ATT GCC ACA GAA GAA AAC GAA GCT 438
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109:



TCAAATTCTT CATAA ATG ATT TTT TCT TTT AAA AGG ACT TCT TTT TGC GTG 51
Met Ile Phe Ser Phe Lys Arg Thr Ser Phe Cys Val
1 5 10



AGC GTG CCG GTT TTG TCT ATA AAG ATT TTT TTC ACT TTA GCC AGA GTT 99 

Ser Val Pro Val Leu Ser Ile Lys Ile Phe Phe Thr Leu Ala Arg Val
15 20 25

TCT AAA AAC AAC GCT TCT TTA AAC ACG ATC AAA GGG TTT TTA AAC ACC 147

Ser Lys Asn Asn Ala Ser Leu Asn Thr Ile Lys Gly Phe Leu Asn Thr
30 35 40

CCT ATC ACT AAC GCA ATG GGC GTA GCC AGA GCG AAC GCG CAA GGG CAG 195
Pro Ile Thr Asn Ala Met Gly Val Ala Arg Ala Asn Ala Gln Gly Gln
45 50 55 60

CTG ATG ACT AGC ACG CTA ATA CAC ACC ATT AAG GCT TTT TCA AAA TTA 243

Leu Met Thr Ser Thr Leu Ile His Thr Ile Lys Ala Phe Ser Lys Leu
65 70 75

CCC CCC AAA CCA AAT TGC CAT AAC AAA AAG CTT ACA AAG GCT AAA AAC 291

Pro Pro Lys Pro Asn Cys His Asn Lys Lys Leu Thr Lys Ala Lys Asn
80 85 90

AAC ACC GCT TTA GAA AAA ATA TCC GCA ATT TGATTCGCGC TACTCTCAAT T 342
Asn Thr Ala Leu Glu Lys Ile Ser Ala Ile
95 100 



(2) INFORMATION FOR SEQ ID NO: 1110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110: 



Met Ile Phe Ser Phe Lys Arg Thr Ser Phe Cys Val Ser Val Pro Val

1 5 10 15

Leu Ser Ile Lys Ile Phe Phe Thr Leu Ala Arg Val Ser Lys Asn Asn

20 25 30

Ala Ser Leu Asn Thr Ile Lys Gly Phe Leu Asn Thr Pro Ile Thr Asn

35 40 45

Ala Met Gly Val Ala Arg Ala Asn Ala Gln Gly Gln Leu Met Thr Ser

50 55 60

Thr Leu Ile His Thr Ile Lys Ala Phe Ser Lys Leu Pro Pro Lys Pro
65 70 75 80 

Asn Cys His Asn Lys Lys Leu Thr Lys Ala Lys Asn Asn Thr Ala Leu 
85 90 95 
70 75 80

CGC TCC CTT AAA AAG CCC ACC CCA AAC ACC GCG CCC ACC ACA ATA TGC 343

Arg Ser Leu Lys Lys Pro Thr Pro Asn Thr Ala Pro Thr Thr Ile Cys
85 90 95

GTA GAG CTT ACG GGC AAG CCT AAT TGAGAGGCTA AAAGCACGGT GATGACT 394

Val Glu Leu Thr Gly Lys Pro Asn 
100 105 



(2) INFORMATION FOR SEQ ID NO: 1108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108: 



Met Glu Arg Leu Ile Thr Ser Ser Leu Tyr Thr Phe Leu Ser Asp Phe

1 5 10 15

Phe Ser Phe Phe Phe Asn Ser Lys Ala Met Ala Val Phe Leu Leu Phe 

20 25 30 

Phe Lys Leu Ser Ser Met Ser Asp Phe Ser Phe Lys Leu Ala Leu Ser 

35 40 45 

Lys Arg Ser Lys Lys Pro Ser Ile Ser Ser Lys Ser Ser Pro Lys Cys

50 55 60

Ala Ala Thr Met Leu Ser Leu Ile Leu Ala Lys Arg Leu Leu Asp Cys
65 70 75 80 

Ser Leu Lys Arg Ser Leu Lys Lys Pro Thr Pro Asn Thr Ala Pro Thr 

85 90 95 

Thr Ile Cys Val Glu Leu Thr Gly Lys Pro Asn
100 105 



(2) INFORMATION FOR SEQ ID NO: 1109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16...321
(D) OTHER INFORMATION: 
355 360 365

Gly Val Phe Glu Leu Glu Gly Met Leu Asp Leu Glu Ser Val Glu Glu 

370 375 380 

Ala Leu His Ile Glu Phe Asp Lys Glu Cys Glu Gln Val Thr Leu Gly
385 390 395 400 

Gly Tyr Val Phe Ser Leu Leu Glu Arg Met Pro Met Glu Gly Asp Thr 

405 410 415 

Ile Val Ser His Gly Tyr Ser Phe Glu Val Leu Ser Val Asp Gly Ala

420 425 430 

Arg Ile Lys Arg Leu Lys Ala Val Lys Gln Asp Gln Gly Glu Asn Glu
435 440 445 

Ala 



(2) INFORMATION FOR SEQ ID NO: 1107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 47...367
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107: 

AGAAACCGGC ACGGTTACCA ACCAAGCGGT AACAATCTTT TTTAAA ATG GAG CGT 55

Met Glu Arg 
1 

TTG ATC ACT TCT TCT TTA TAC ACT TTT TTA AGC GAC TTT TTT TCT TTC 103 
Leu Ile Thr Ser Ser Leu Tyr Thr Phe Leu Ser Asp Phe Phe Ser Phe
5 10 15 

TTT TTC AAT TCC AAA GCG ATG GCG GTG TTC TTG CTT TTT TTT AAG CTC 151 
Phe Phe Asn Ser Lys Ala Met Ala Val Phe Leu Leu Phe Phe Lys Leu 
20 25 30 35 

TCT AGC ATG AGC GAT TTT TCT TTC AAA TTG GCT TTA TCA AAG CGC TCT 199

Ser Ser Met Ser Asp Phe Ser Phe Lys Leu Ala Leu Ser Lys Arg Ser 
40 45 50 

AAA AAG CCT TCA ATT TCT TCT AAA TCT TCC CCA AAG TGC GCG GCT ACA 247

Lys Lys Pro Ser Ile Ser Ser Lys Ser Ser Pro Lys Cys Ala Ala Thr
55 60 65

ATG TTG TCT CTG ATT CTA GCA AAA CGC CTT CTT GAT TGC TCT CTT AAG 295

Met Leu Ser Leu Ile Leu Ala Lys Arg Leu Leu Asp Cys Ser Leu Lys
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 449 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1106: 

Met Gly Arg Asn Gin Gly Ala Tyr Leu Asp Pro Ser Glu Ser lie Leu 

15 10 15 

Met Leu Met Val Ala Phe Leu Leu Val Leu Leu Asn Ala Phe Phe Val 

20 25 30 

Leu Ser Glu Phe Ala Leu Val Lys Val Arg Lys Thr Arg Leu Glu Glu 

35 40 45 

Leu Val Lys lie Gly Asn Ser Asn Ala Lys Leu Ala Leu Lys Met Ser 

50 55 60 

Gin Arg Leu Asp Thr Tyr Leu Ser Ala Thr Gin Leu Gly lie Thr Leu 
65 70 75 80 

Ser Ser Leu Ala Leu Gly Trp Val Gly Glu Pro Ala lie Ala Lys Leu 

85 90 95 

Leu Ala Ala Leu Phe Glu Ser Met Asp Leu Arg Glu Asn Pro lie Phe 

100 105 110 

lie His Ser Met Ser Val Val lie Ala Phe Leu Ser lie Thr Phe Leu 

115 120 125 

His Val Val Leu Gly Glu lie Val Pro Lys Ser Leu Ala He Ala Lys 

130 135 140 

Ser Glu Lys Ala Thr Leu Phe Ala Ala Arg Pro Leu His Val Phe Trp 
145 150 155 160 

Val Val Phe Tyr Pro Val Val Arg Leu Phe Asp Val He Ala His Phe 

165 170 175 

Phe Leu Lys Lys Met Gly He Asn Pro Lys Glu His Asp Gly Thr His 

180 185 190 

Ser Glu Glu Glu Leu Lys He He Val Gly Glu Ser Leu Arg Glu Gly 

195 200 205 

He He Asp Ser Val Glu Gly Glu He He Lys Asn Ala Val Asp Phe 

210 215 220 

Ser Asp Thr Ser Ala Lys Glu He Met Thr Pro Arg Lys Asp Met Val 
225 230 235 240 

Cys Leu Asp Glu Glu Asn Ser Tyr Glu Glu Asn He Asp He Val Leu 

245 250 255 

Lys Gly His Phe Thr Arg Tyr Pro Tyr Cys Lys Gly Ser Lys Asp Asn 

260 265 270 

He He Gly Met Val His He Arg Asp Leu Leu Ser Arg Ser He Phe 

275 280 285 

Thr Pro Lys Met His Asp Phe Asn Gin He Val Arg Lys Met He He 

290 295 300 

Val Pro Glu Ser Ala Ser He Ser Gin He Leu He Lys Met Lys Lys 
305 310 315 320 

Glu Gin He His Thr Ala Leu Val He Asp Glu Tyr Gly Gly Thr Ala 

325 330 335 

Gly Leu Leu Thr Met Glu Asp He He Glu Glu He Met Gly Glu He 

340 345 350 

Ser Asp Glu Tyr Asp Leu Lys Gln Glu Gly Ile Asn Lys Leu Glu Glu

TTG GAT GAA GAA AAC AGC TAT GAA GAA AAT ATA GAC ATT GTT TTA AAA 827
Leu Asp Glu Glu Asn Ser Tyr Glu Glu Asn Ile Asp Ile Val Leu Lys
245 250 255

GGC CAT TTC ACG CGC TAC CCT TAT TGC AAG GGT TCT AAG GAT AAC ATT 875
Gly His Phe Thr Arg Tyr Pro Tyr Cys Lys Gly Ser Lys Asp Asn Ile
260 265 270

ATC GGC ATG GTG CAT ATT AGG GAT TTG CTT TCG CGC TCT ATT TTT ACC 923
Ile Gly Met Val His Ile Arg Asp Leu Leu Ser Arg Ser Ile Phe Thr
275 280 285

CCC AAA ATG CAT GAT TTC AAT CAA ATC GTT AGG AAA ATG ATC ATC GTC 971
Pro Lys Met His Asp Phe Asn Gln Ile Val Arg Lys Met Ile Ile Val
290 295 300 305

CCC GAA AGC GCT TCC ATT TCT CAA ATC CTT ATT AAA ATG AAA AAA GAG 1019
Pro Glu Ser Ala Ser Ile Ser Gln Ile Leu Ile Lys Met Lys Lys Glu
310 315 320

CAA ATC CAT ACC GCT TTG GTG ATT GAT GAA TAC GGC GGC ACA GCC GGG 1067
Gln Ile His Thr Ala Leu Val Ile Asp Glu Tyr Gly Gly Thr Ala Gly
325 330 335

TTG CTC ACT ATG GAA GAC ATC ATT GAA GAG ATC ATG GGC GAG ATT AGC 1115
Leu Leu Thr Met Glu Asp Ile Ile Glu Glu Ile Met Gly Glu Ile Ser
340 345 350

GAC GAA TAC GAC TTA AAA CAA GAG GGC ATA AAC AAG CTT GAA GAG GGC 1163
Asp Glu Tyr Asp Leu Lys Gln Glu Gly Ile Asn Lys Leu Glu Glu Gly
355 360 365

GTG TTT GAA TTA GAG GGC ATG CTG GAT TTA GAG AGC GTA GAA GAA GCG 1211
Val Phe Glu Leu Glu Gly Met Leu Asp Leu Glu Ser Val Glu Glu Ala
370 375 380 385

CTT CAC ATT GAA TTT GAT AAA GAA TGC GAG CAG GTA ACG CTT GGG GGC 1259
Leu His Ile Glu Phe Asp Lys Glu Cys Glu Gln Val Thr Leu Gly Gly
390 395 400

TAT GTT TTT AGC TTG TTA GAG CGC ATG CCT ATG GAG GGA GAT ACA ATC 1307
Tyr Val Phe Ser Leu Leu Glu Arg Met Pro Met Glu Gly Asp Thr Ile
405 410 415

GTT TCG CAT GGG TAT TCT TTT GAA GTC TTA AGC GTG GAT GGG GCT AGG 1355
Val Ser His Gly Tyr Ser Phe Glu Val Leu Ser Val Asp Gly Ala Arg
420 425 430

ATA AAA CGC TTA AAA GCG GTT AAA CAA GAT CAG GGA GAA AAT GAA GCA T 1404
Ile Lys Arg Leu Lys Ala Val Lys Gln Asp Gln Gly Glu Asn Glu Ala
435 440 445

GAAAAAAACA ACCCTCTTTG TATTGGGCTT ATTATTTAA 1443

(2) INFORMATION FOR SEQ ID NO: 1106:

Leu Met Val Ala Phe Leu Leu Val Leu Leu Asn Ala Phe Phe Val Leu
20 25 30

TCA GAG TTT GCC CTT GTG AAA GTG CGT AAA ACC CGC TTA GAA GAG CTG 203
Ser Glu Phe Ala Leu Val Lys Val Arg Lys Thr Arg Leu Glu Glu Leu
35 40 45

GTT AAA ATC GGT AAT TCC AAC GCT AAA CTC GCT TTA AAG ATG AGT CAA 251
Val Lys Ile Gly Asn Ser Asn Ala Lys Leu Ala Leu Lys Met Ser Gln
50 55 60 65

AGA CTA GAC ACT TAT TTG AGC GCG ACG CAG TTA GGC ATC ACC CTT TCT 299
Arg Leu Asp Thr Tyr Leu Ser Ala Thr Gln Leu Gly Ile Thr Leu Ser
70 75 80

TCA TTA GCT TTA GGC TGG GTG GGT GAG CCC GCT ATC GCA AAA TTG TTA 347
Ser Leu Ala Leu Gly Trp Val Gly Glu Pro Ala Ile Ala Lys Leu Leu
85 90 95

GCC GCG CTG TTT GAG TCT ATG GAT TTG AGA GAA AAT CCT ATT TTT ATC 395
Ala Ala Leu Phe Glu Ser Met Asp Leu Arg Glu Asn Pro Ile Phe Ile
100 105 110

CAT TCA ATG AGC GTG GTC ATA GCG TTT TTA AGC ATC ACT TTT TTG CAT 443
His Ser Met Ser Val Val Ile Ala Phe Leu Ser Ile Thr Phe Leu His
115 120 125

GTC GTG TTG GGC GAG ATT GTG CCT AAA TCT TTA GCG ATC GCT AAA TCT 491
Val Val Leu Gly Glu Ile Val Pro Lys Ser Leu Ala Ile Ala Lys Ser
130 135 140 145

GAA AAA GCC ACC CTT TTT GCC GCA CGC CCT TTG CAT GTG TTT TGG GTG 539
Glu Lys Ala Thr Leu Phe Ala Ala Arg Pro Leu His Val Phe Trp Val
150 155 160

GTG TTT TAT CCG GTG GTG CGT TTG TTT GAT GTG ATC GCT CAT TTT TTT 587
Val Phe Tyr Pro Val Val Arg Leu Phe Asp Val Ile Ala His Phe Phe
165 170 175

TTG AAA AAG ATG GGC ATC AAT CCT AAA GAG CAT GAC GGC ACG CAT TCT 635
Leu Lys Lys Met Gly Ile Asn Pro Lys Glu His Asp Gly Thr His Ser
180 185 190

GAA GAA GAG TTA AAA ATC ATT GTG GGC GAG AGT TTG AGA GAG GGC ATT 683
Glu Glu Glu Leu Lys Ile Ile Val Gly Glu Ser Leu Arg Glu Gly Ile
195 200 205

ATT GAT TCA GTG GAG GGC GAA ATC ATT AAA AAC GCA GTG GAT TTT TCT 731
Ile Asp Ser Val Glu Gly Glu Ile Ile Lys Asn Ala Val Asp Phe Ser
210 215 220 225

GAC ACG AGC GCT AAA GAA ATC ATG ACC CCA CGA AAA GAC ATG GTG TGT 779
Asp Thr Ser Ala Lys Glu Ile Met Thr Pro Arg Lys Asp Met Val Cys
230 235 240

Phe Leu Glu Asn Lys Lys Ala Leu Arg Leu Val Asp Val Ala Cys Gly
50 55 60

Thr Gly Asp Met Leu Val Ala Trp Gln Lys Ser Ala Leu Asn Cys Gly
65 70 75 80

Ile Glu Phe Lys Glu Cys Leu Gly Ile Asp Pro Ser Asn Asn Met Leu
85 90 95

Glu Leu Ala Ile Lys Lys Cys Glu Glu Leu Glu Asn Lys Ala Ser Phe
100 105 110

Ile Gln Ala Gln Ala Lys Asp Leu Lys Gly Val Glu Asn Asn Ser Val
115 120 125

Asp Ile Leu Ser Ile Ala Tyr Gly Leu Arg Asn Val Val Glu Arg Gln
130 135 140

Glu Ala Leu Lys Glu Phe Phe Arg Val Leu Lys Pro Arg Gly Val Leu
145 150 155 160

Val Ile Leu Glu Phe Leu Lys Lys Asp Asn Pro Thr Trp Leu Asp Lys
165 170 175

Ile Ser Gly Phe Tyr Thr Asn Lys Val Leu Pro Leu Val Gly Gly Ala
180 185 190

Ile Ser Lys Asn Tyr Gly Ala Tyr Ser Tyr Leu Pro Gln Ser Ile Glu
195 200 205

Gly Phe Leu Ser Leu Glu Gly Leu Lys His Glu Leu Arg Asn Ala Gly
210 215 220

Phe Glu Ile Leu Arg Thr Glu Asp Ser Ile Ala Gln Ile Ser Thr Thr
225 230 235 240

Met Leu Val Lys Lys Asn
245


(2) INFORMATION FOR SEQ ID NO: 1105:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1443 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 57...1403
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105:

AAAATTCTGA GATTTTATAT ATTTTATATT TATCGTTAGG TTTTAGGTTT AAAGTT ATG 59
Met
1

GGG AGG AAT CAA GGA GCT TAT TTG GAT CCG TCT GAA TCG ATT TTG ATG 107
Gly Arg Asn Gln Gly Ala Tyr Leu Asp Pro Ser Glu Ser Ile Leu Met
5 10 15

TTG ATG GTT GCT TTT TTA TTG GTG CTG TTG AAC GCT TTT TTT GTG CTT 155

Leu Glu Asn Lys Ala Ser Phe Ile Gln Ala Gln Ala Lys Asp Leu Lys
110 115 120

GGC GTT GAA AAT AAC AGC GTG GAT ATC CTC TCT ATT GCG TAT GGC TTG 495
Gly Val Glu Asn Asn Ser Val Asp Ile Leu Ser Ile Ala Tyr Gly Leu
125 130 135

CGT AAT GTC GTG GAA AGA CAA GAG GCC TTA AAA GAG TTT TTT AGG GTG 543
Arg Asn Val Val Glu Arg Gln Glu Ala Leu Lys Glu Phe Phe Arg Val
140 145 150

TTA AAA CCC AGG GGC GTT TTA GTG ATT TTA GAA TTT TTA AAA AAA GAC 591
Leu Lys Pro Arg Gly Val Leu Val Ile Leu Glu Phe Leu Lys Lys Asp
155 160 165

AAC CCC ACA TGG CTG GAT AAA ATC TCA GGG TTT TAC ACG AAT AAG GTT 639
Asn Pro Thr Trp Leu Asp Lys Ile Ser Gly Phe Tyr Thr Asn Lys Val
170 175 180 185

TTG CCT TTA GTG GGA GGG GCT ATC AGT AAG AAT TAT GGT GCT TAT TCT 687
Leu Pro Leu Val Gly Gly Ala Ile Ser Lys Asn Tyr Gly Ala Tyr Ser
190 195 200

TAT TTA CCG CAA TCC ATT GAG GGG TTT TTG AGT TTA GAG GGT TTG AAG 735
Tyr Leu Pro Gln Ser Ile Glu Gly Phe Leu Ser Leu Glu Gly Leu Lys
205 210 215

CAT GAA TTA AGA AAC GCA GGG TTT GAG ATT TTA AGG ACT GAA GAT TCT 783
His Glu Leu Arg Asn Ala Gly Phe Glu Ile Leu Arg Thr Glu Asp Ser
220 225 230

ATC GCT CAA ATT TCA ACG ACC ATG CTT GTT AAA AAA AAC TAAAGGAATG TT 834
Ile Ala Gln Ile Ser Thr Thr Met Leu Val Lys Lys Asn
235 240 245

ATGCAAGATG AATTATTTGA AACC 858

(2) INFORMATION FOR SEQ ID NO:1104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104:

Met Lys Lys Glu Lys His Leu Lys Gln Glu Lys Ile Ile Asn Met Phe
1 5 10 15

Asp Asp Ile Ala Ser Ser Tyr Asp Gln Ala Asn Arg Leu Met Ser Phe
20 25 30

Gly Leu Asp Val Lys Trp Arg Glu Arg Ala Cys Glu His Ala Phe Leu
35 40 45

65 70 75 80

Leu Glu Leu Leu Asn Thr Phe Ala Leu Ser Ile Leu Leu Leu Ser Lys
85 90 95 

Phe Phe Ala 



(2) INFORMATION FOR SEQ ID NO: 1103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 858 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 85...822
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1103: 

GGCGCCAACG ATTTTAATGA TCCTCATTGT GATTTTAGTG GTTGTCAAGC CTTTTTAAAG 60

ACAAGCCATG AAAAAAGAAA AGTC ATG AAA AAA GAA AAG CAT CTC AAG CAA 111
Met Lys Lys Glu Lys His Leu Lys Gln
1 5

GAA AAA ATC ATC AAC ATG TTT GAT GAT ATA GCC AGC TCT TAC GAT CAA 159
Glu Lys Ile Ile Asn Met Phe Asp Asp Ile Ala Ser Ser Tyr Asp Gln
10 15 20 25

GCC AAC CGC TTG ATG AGT TTT GGC TTA GAC GTT AAA TGG CGA GAA AGG 207
Ala Asn Arg Leu Met Ser Phe Gly Leu Asp Val Lys Trp Arg Glu Arg
30 35 40

GCT TGC GAG CAT GCG TTT TTA TTT TTA GAA AAC AAG AAA GCG TTA AGG 255
Ala Cys Glu His Ala Phe Leu Phe Leu Glu Asn Lys Lys Ala Leu Arg
45 50 55

CTT GTG GAT GTG GCA TGC GGG ACG GGG GAT ATG CTT GTG GCT TGG CAA 303
Leu Val Asp Val Ala Cys Gly Thr Gly Asp Met Leu Val Ala Trp Gln
60 65 70

AAA AGC GCT CTC AAT TGC GGT ATA GAG TTT AAG GAA TGT TTG GGG ATT 351
Lys Ser Ala Leu Asn Cys Gly Ile Glu Phe Lys Glu Cys Leu Gly Ile
75 80 85

GAC CCC TCT AAT AAC ATG CTT GAA TTA GCC ATC AAA AAA TGT GAA GAG 399
Asp Pro Ser Asn Asn Met Leu Glu Leu Ala Ile Lys Lys Cys Glu Glu
90 95 100 105

CTT GAA AAC AAA GCT TCT TTC ATC CAA GCT CAA GCC AAA GAT TTA AAA 447

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101:

CTCCCTGAAG CG ATG CTC GCA TGG ATG TCT TGC TCG TTG AAA AAA GTA CCA 51
Met Leu Ala Trp Met Ser Cys Ser Leu Lys Lys Val Pro
1 5 10

CTA GGC CGT AAA TCT TTC ATG GCG TTT TTA AGG TTA ATA TTA GAT TCT 99
Leu Gly Arg Lys Ser Phe Met Ala Phe Leu Arg Leu Ile Leu Asp Ser
15 20 25

TCT AAC GCC ACG ACA TGC GCG TTT TTA GAC GCC ACG ACA AAA ACT TCA 147
Ser Asn Ala Thr Thr Cys Ala Phe Leu Asp Ala Thr Thr Lys Thr Ser
30 35 40 45

ATT TCT TTG GGT AAT TTT TCT AAA AAC CGC AAG GCT AGG GGT ATC CCG 195
Ile Ser Leu Gly Asn Phe Ser Lys Asn Arg Lys Ala Arg Gly Ile Pro
50 55 60

CTC GCT CCA CTG ATG CCT AAA ACC AAT TTC ATG AAT GTC CTT TAT AAG 243
Leu Ala Pro Leu Met Pro Lys Thr Asn Phe Met Asn Val Leu Tyr Lys
65 70 75

ATT TGC GCT TTA GAG CTG CTC AAC ACT TTT GCT TTG AGT ATT TTA TTG 291
Ile Cys Ala Leu Glu Leu Leu Asn Thr Phe Ala Leu Ser Ile Leu Leu
80 85 90

CTT TCT AAA TTT TTC GCT TGAATGATTT GATTAAGCGC GCCATTTTCT AG 341
Leu Ser Lys Phe Phe Ala
95



(2) INFORMATION FOR SEQ ID NO: 1102: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1102: 



Met Leu Ala Trp Met Ser Cys Ser Leu Lys Lys Val Pro Leu Gly Arg
1 5 10 15

Lys Ser Phe Met Ala Phe Leu Arg Leu Ile Leu Asp Ser Ser Asn Ala
20 25 30

Thr Thr Cys Ala Phe Leu Asp Ala Thr Thr Lys Thr Ser Ile Ser Leu
35 40 45

Gly Asn Phe Ser Lys Asn Arg Lys Ala Arg Gly Ile Pro Leu Ala Pro
50 55 60

Leu Met Pro Lys Thr Asn Phe Met Asn Val Leu Tyr Lys Ile Cys Ala

Asn



(2) INFORMATION FOR SEQ ID NO: 1100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100: 



Met Lys Leu Val Leu Gly Ile Ser Gly Ala Ser Gly Ile Pro Leu Ala
1 5 10 15

Leu Arg Phe Leu Glu Lys Leu Pro Lys Glu Ile Glu Val Phe Val Val
20 25 30

Ala Ser Lys Asn Ala His Val Val Ala Leu Glu Glu Ser Asn Ile Asn
35 40 45

Leu Lys Asn Ala Met Lys Asp Leu Arg Pro Ser Gly Thr Phe Phe Asn
50 55 60

Glu Gln Asp Ile His Ala Ser Ile Ala Ser Gly Ser Tyr Gly Ile His
65 70 75 80

Lys Met Ala Ile Ile Pro Ala Ser Met Asp Met Val Ala Lys Ile Ala
85 90 95

His Gly Phe Gly Gly Asp Leu Ile Ser Arg Ser Ala Ser Val Met Leu
100 105 110

Lys Glu Lys Arg Pro Leu Leu Ile Ala Pro Arg Glu Met Pro Leu Ser
115 120 125

Ala Ile Met Leu Glu Asn Leu Leu Lys Leu Ser His Ser Asn Ala Ile
130 135 140

Ile Ala Pro Pro Met Met Thr Tyr Tyr Thr Gln Ser Lys Thr Leu Glu
145 150 155 160

Ala Met Gln Asp Phe Leu Val Gly Lys Trp Phe Asp Ser Leu Gly Ile
165 170 175

Glu Asn Asp Leu Tyr Pro Arg Trp Gly Met Asn
180 185



(2) INFORMATION FOR SEQ ID NO: 1101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 13...309
(D) OTHER INFORMATION:

(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099:

GCAAATCTTA TAAAGGACAT TC ATG AAA TTG GTT TTA GGC ATC AGT GGA GCG 52
Met Lys Leu Val Leu Gly Ile Ser Gly Ala
1 5 10

AGC GGG ATA CCC CTA GCC TTG CGG TTT TTA GAA AAA TTA CCC AAA GAA 100
Ser Gly Ile Pro Leu Ala Leu Arg Phe Leu Glu Lys Leu Pro Lys Glu
15 20 25

ATT GAA GTT TTT GTC GTG GCG TCT AAA AAC GCG CAT GTC GTG GCG TTA 148
Ile Glu Val Phe Val Val Ala Ser Lys Asn Ala His Val Val Ala Leu
30 35 40

GAA GAA TCT AAT ATT AAC CTT AAA AAC GCC ATG AAA GAT TTA CGG CCT 196
Glu Glu Ser Asn Ile Asn Leu Lys Asn Ala Met Lys Asp Leu Arg Pro
45 50 55

AGT GGT ACT TTT TTC AAC GAG CAA GAC ATC CAT GCG AGC ATC GCT TCA 244
Ser Gly Thr Phe Phe Asn Glu Gln Asp Ile His Ala Ser Ile Ala Ser
60 65 70

GGG AGT TAT GGT ATC CAT AAA ATG GCG ATC ATT CCA GCG AGC ATG GAC 292
Gly Ser Tyr Gly Ile His Lys Met Ala Ile Ile Pro Ala Ser Met Asp
75 80 85 90

ATG GTG GCT AAA ATC GCG CAT GGC TTT GGG GGG GAT TTG ATT TCT AGG 340
Met Val Ala Lys Ile Ala His Gly Phe Gly Gly Asp Leu Ile Ser Arg
95 100 105

AGT GCG TCT GTG ATG CTT AAA GAA AAG CGC CCC TTA CTC ATT GCC CCT 388
Ser Ala Ser Val Met Leu Lys Glu Lys Arg Pro Leu Leu Ile Ala Pro
110 115 120

AGA GAA ATG CCT TTA AGC GCT ATC ATG TTA GAA AAT TTG CTC AAA CTC 436
Arg Glu Met Pro Leu Ser Ala Ile Met Leu Glu Asn Leu Leu Lys Leu
125 130 135

TCC CAT TCT AAT GCA ATC ATT GCG CCG CCG ATG ATG ACT TAT TAC ACC 484
Ser His Ser Asn Ala Ile Ile Ala Pro Pro Met Met Thr Tyr Tyr Thr
140 145 150

CAG AGC AAG ACT TTA GAA GCG ATG CAA GAT TTT TTA GTG GGG AAG TGG 532
Gln Ser Lys Thr Leu Glu Ala Met Gln Asp Phe Leu Val Gly Lys Trp
155 160 165 170

TTT GAC AGC TTA GGG ATA GAA AAT GAC TTA TAC CCA CGA TGG GGA ATG 580
Phe Asp Ser Leu Gly Ile Glu Asn Asp Leu Tyr Pro Arg Trp Gly Met
175 180 185

AAC TGATGCAAAA AATCGGCATT TACCCGGGCA CTTTTGA 620

Phe Ile Lys Cys Ala Val
190 

AATTGCTTG 644 
(2) INFORMATION FOR SEQ ID NO: 1098: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098: 

Met Tyr Val Val Leu Glu Gly Val Asp Gly Ala Gly Lys Ser Thr Gln
1 5 10 15

Val Glu Leu Leu Lys Asp Arg Phe Lys Asn Ala Leu Phe Thr Lys Glu
20 25 30

Pro Gly Gly Thr Arg Met Gly Glu Ser Leu Arg Arg Ile Ala Leu Asn
35 40 45

Glu Asn Ile Ser Glu Leu Ala Arg Ala Phe Leu Phe Leu Ser Asp Arg
50 55 60

Ala Glu His Thr Glu Ser Val Ile Lys Pro Ala Leu Lys Glu Lys Lys
65 70 75 80

Leu Ile Ile Ser Asp Arg Ser Leu Ile Ser Gly Met Ala Tyr Ser Gln
85 90 95

Phe Ser Ser Leu Glu Leu Asn Leu Leu Ala Thr Gln Ser Val Leu Pro
100 105 110

Ala Lys Ile Ile Leu Leu Leu Ile Asp Lys Glu Gly Leu Lys Gln Arg
115 120 125

Leu Ser Leu Lys Ser Leu Asp Lys Ile Glu Asn Gln Gly Ile Glu Lys
130 135 140

Leu Leu His Ile Gln Gln Lys Leu Lys Thr His Ala Tyr Ala Leu Gln
145 150 155 160

Glu Lys Phe Gly Cys Glu Val Leu Glu Leu Asp Ala Lys Glu Ser Val
165 170 175

Lys Asn Leu His Glu Lys Ile Ala Ala Phe Ile Lys Cys Ala Val
180 185 190

(2) INFORMATION FOR SEQ ID NO: 1099: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 23...583

(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1097:

ATCCTTTGAT TTCAAAGGCT TAAA ATG TAT GTG GTG TTA GAA GGC GTT GAT 51
Met Tyr Val Val Leu Glu Gly Val Asp
1 5

GGC GCG GGC AAA AGC ACT CAA GTA GAA TTA TTA AAA GAC CGG TTT AAA 99
Gly Ala Gly Lys Ser Thr Gln Val Glu Leu Leu Lys Asp Arg Phe Lys
10 15 20 25

AAC GCC CTT TTT ACC AAA GAG CCA GGG GGG ACG AGA ATG GGC GAG AGT 147
Asn Ala Leu Phe Thr Lys Glu Pro Gly Gly Thr Arg Met Gly Glu Ser
30 35 40

TTA AGG CGT ATC GCT TTG AAT GAA AAC ATT AGC GAA TTG GCT AGA GCG 195
Leu Arg Arg Ile Ala Leu Asn Glu Asn Ile Ser Glu Leu Ala Arg Ala
45 50 55

TTT TTA TTC TTA AGC GAT AGG GCT GAG CAT ACA GAA AGC GTG ATA AAA 243
Phe Leu Phe Leu Ser Asp Arg Ala Glu His Thr Glu Ser Val Ile Lys
60 65 70

CCG GCA TTG AAA GAA AAA AAG CTC ATC ATT AGC GAC AGG AGC TTG ATC 291
Pro Ala Leu Lys Glu Lys Lys Leu Ile Ile Ser Asp Arg Ser Leu Ile
75 80 85

TCT GGC ATG GCT TAT AGC CAA TTT TCA AGC TTA GAA TTA AAC CTG CTT 339
Ser Gly Met Ala Tyr Ser Gln Phe Ser Ser Leu Glu Leu Asn Leu Leu
90 95 100 105

GCC ACC CAA AGC GTC TTG CCT GCA AAA ATC ATT CTT TTA CTC ATA GAC 387
Ala Thr Gln Ser Val Leu Pro Ala Lys Ile Ile Leu Leu Leu Ile Asp
110 115 120

AAA GAG GGC TTA AAA CAG CGC TTA AGC CTT AAA AGT TTA GAT AAA ATA 435
Lys Glu Gly Leu Lys Gln Arg Leu Ser Leu Lys Ser Leu Asp Lys Ile
125 130 135

GAA AAC CAA GGC ATA GAA AAA TTA CTT CAT ATC CAG CAA AAG CTC AAA 483
Glu Asn Gln Gly Ile Glu Lys Leu Leu His Ile Gln Gln Lys Leu Lys
140 145 150

ACC CAC GCT TAT GCG TTA CAA GAA AAA TTT GGG TGC GAA GTT TTG GAA 531
Thr His Ala Tyr Ala Leu Gln Glu Lys Phe Gly Cys Glu Val Leu Glu
155 160 165

TTA GAC GCT AAA GAA AGC GTT AAA AAC TTG CAC GAA AAA ATC GCC GCT 579
Leu Asp Ala Lys Glu Ser Val Lys Asn Leu His Glu Lys Ile Ala Ala
170 175 180 185

TTT ATA AAA TGC GCT GTT TAACCTGTTT GAAGCTTTCT TTTAAGCCTC TTTGCCCA 635

355 360 365

Pro Tyr Val Ala Thr Leu Leu Ala Arg Leu Ser Lys Val Asn Lys Asp
370 375 380

Ser Phe Val Trp Asp Phe Ala Thr Gly Ser Ala Gly Leu Leu Val Ala
385 390 395 400

Ser Met Asn Leu Met Ile Glu Asp Ala Lys Lys Arg Ile Thr Ser Pro
405 410 415

Glu Glu Leu Glu Gln Lys Ile Ala His Ile Lys Ala Lys Gln Leu Leu
420 425 430

Gly Ile Glu Ile Leu Ser Asp Ile His Thr Leu Ala Val Leu Asn Met
435 440 445

Ile Leu Met Gly Asp Gly Ser Ser Gln Ile Leu Asn Gln Asp Gly Leu
450 455 460

Ser Gly Phe Asp Gly Lys Val Asn Asn Glu Ala Phe Lys Ala Asn Ala
465 470 475 480

Phe Val Leu Asn Pro Pro Tyr Ser Ala Ser Gly Asn Gly Met Val Phe
485 490 495

Val Glu Gln Ala Leu Glu Lys Met Gln Ser Gly Tyr Ala Ser Val Ile
500 505 510

Ile Gln Ser Ser Ala Gly Ser Gly Lys Ala Lys Glu Tyr Asn Val Arg
515 520 525

Ile Leu Glu Lys His Thr Leu Leu Ala Ser Ile Lys Met Pro Leu Asp
530 535 540

Leu Phe Ile Gly Lys Ser Ser Val Gln Thr His Ile Tyr Val Phe Arg
545 550 555 560

Val Asn Glu Lys His Asp Ala Lys Gln Arg Val Lys Phe Ile Asn Phe
565 570 575

Ser Asn Asp Gly Tyr Ala Arg Ala Asn Arg Lys Lys Ala Lys Ala Ser
580 585 590

His Asn Leu Lys Asp Thr His Asn Ala Lys Glu Arg Tyr Asn Glu Val
595 600 605

Val Asp Leu Val His Ile Gly Gln Ser Cys Leu Lys Phe Leu Ser Glu
610 615 620

Asp Asp Tyr Tyr Glu Asn Thr Ile Asp Pro Lys Asn Gly Ser Asp Trp
625 630 635 640

Asn Gln Asn Lys Pro Thr Asp Thr Lys Pro Glu Leu Glu Asp Phe Lys
645 650 655

Arg Thr Ile Ala Asp Tyr Leu Ser Tyr Glu Val Ser Leu Ile Leu Lys
660 665 670

Asn Gln Met Pro Pro Lys Arg
675



(2) INFORMATION FOR SEQ ID NO: 1097:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 644 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25...597

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 679 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096:

Met Asn Lys Val Gln Ser Ile Asp Pro Leu Ile Ala Asp Lys Phe Asn
1 5 10 15

Asn Glu Leu Arg Ser Tyr Asn Leu Glu Tyr Lys Leu Glu Gln Glu Ser
20 25 30

Leu Asn Lys Glu Ile Asp Glu Ala Leu Lys Asn Tyr Ala Ser Lys Asn
35 40 45

Gly Gly Leu Gly Gly Asn Arg Pro Asp Val Lys Leu Leu Leu Asn Thr
50 55 60

Gln Asp Pro Asn Arg Arg Val Pro Ile Leu Ile Glu Tyr Lys Gly Leu
65 70 75 80

Lys Asp Lys Leu Ile Lys Leu Asp Lys Asn Lys Leu Val Glu Asn Phe
85 90 95

Lys Asn His Glu Pro His Tyr Lys Asn Ile Arg Glu Tyr Ala Leu Asn
100 105 110

Gly Ala Leu His Tyr Ala Asn Ala Ile Leu His His Thr Ser Tyr Thr
115 120 125

Glu Cys Ile Ala Ile Gly Ile Thr Gly Tyr Lys Asp Asn Lys Gly Gly
130 135 140

Ile Cys Ser Gln Ile Ala Val Tyr Tyr Val Asn Lys Ser Asn Leu Gly
145 150 155 160

Met Gly Ile Asp Val Ser Lys Gly Glu Gln Gly Tyr Ser Asp Leu Ser
165 170 175

Phe Leu Ser Arg Lys His Phe Asn Asp Phe Ile Lys Arg Val Asp Thr
180 185 190

Leu Ser Leu Ser Asp Glu Asp Leu Glu Arg Ile Arg Glu Lys Lys Asn
195 200 205

Gln Glu Ile Glu Asp Cys Leu Met Arg Leu Asn Asn Asn Ile Tyr Asn
210 215 220

Lys Glu Lys Asn Phe Leu Ser Glu His Asn Arg Val Tyr Leu Val Ile
225 230 235 240

Ala Ser Ile Ile Ala Asn Leu Gly Ile Pro Asn Leu Val Thr Pro Leu
245 250 255

Asn Lys Glu Asp Leu Lys Ser Ser Asp Glu Val His Gln Arg Asp Gly
260 265 270

Asp Ile Met Leu Arg Lys Ile Gln Ser Phe Leu Glu Asn Lys Asp Leu
275 280 285

Ser Pro Glu Lys Arg Gln Ser Ile Ile Ser Ser Leu Glu Thr Leu Leu
290 295 300

Arg Asn Glu Asn Asn Asn Lys Ala Thr Asn Gly Glu Ser Cys Leu Lys
305 310 315 320

Arg Cys Phe Ser Glu Ile Val Asp Ser Leu Gly Ile Tyr Tyr Lys Ile
325 330 335

Gly Leu Ser Thr Asp Phe Thr Gly Lys Leu Phe Asn Glu Met Tyr Arg
340 345 350

Trp Leu Gly Phe Thr Lys Asp Gln Leu Asn Asp Val Val Leu Thr Pro

AAC GAA GCG TTT AAG GCT AAT GCC TTT GTT TTA AAC CCG CCT TAT TCC 1494
Asn Glu Ala Phe Lys Ala Asn Ala Phe Val Leu Asn Pro Pro Tyr Ser
475 480 485

GCT AGC GGT AAT GGC ATG GTG TTT GTG GAG CAG GCT TTA GAA AAA ATG 1542
Ala Ser Gly Asn Gly Met Val Phe Val Glu Gln Ala Leu Glu Lys Met
490 495 500

CAA AGC GGT TAT GCG AGC GTG ATC ATC CAA TCA AGC GCC GGC AGT GGT 1590
Gln Ser Gly Tyr Ala Ser Val Ile Ile Gln Ser Ser Ala Gly Ser Gly
505 510 515 520

AAA GCC AAA GAA TAC AAT GTA AGG ATT TTG GAA AAA CAC ACG CTT TTA 1638
Lys Ala Lys Glu Tyr Asn Val Arg Ile Leu Glu Lys His Thr Leu Leu
525 530 535

GCG AGC ATT AAA ATG CCT TTA GAT TTA TTC ATC GGT AAA AGC AGC GTT 1686
Ala Ser Ile Lys Met Pro Leu Asp Leu Phe Ile Gly Lys Ser Ser Val
540 545 550

CAA ACC CAT ATC TAT GTT TTT AGG GTC AAT GAA AAG CAT GAC GCT AAG 1734
Gln Thr His Ile Tyr Val Phe Arg Val Asn Glu Lys His Asp Ala Lys
555 560 565

CAA AGG GTG AAA TTT ATT AAT TTC AGT AAC GAC GGC TAC GCT AGA GCG 1782
Gln Arg Val Lys Phe Ile Asn Phe Ser Asn Asp Gly Tyr Ala Arg Ala
570 575 580

AAT CGC AAA AAA GCC AAA GCC AGC CAC AAT TTA AAA GAC ACG CAT AAC 1830
Asn Arg Lys Lys Ala Lys Ala Ser His Asn Leu Lys Asp Thr His Asn
585 590 595 600

GCC AAA GAG CGC TAC AAC GAA GTC GTG GAT TTA GTC CAT ATT GGC CAA 1878
Ala Lys Glu Arg Tyr Asn Glu Val Val Asp Leu Val His Ile Gly Gln
605 610 615

TCA TGT TTG AAA TTT CTA AGC GAA GAT GAC TAT TAT GAA AAC ACC ATA 1926
Ser Cys Leu Lys Phe Leu Ser Glu Asp Asp Tyr Tyr Glu Asn Thr Ile
620 625 630

GAT CCC AAA AAC GGC AGC GAT TGG AAC CAA AAC AAA CCC ACT GAC ACC 1974
Asp Pro Lys Asn Gly Ser Asp Trp Asn Gln Asn Lys Pro Thr Asp Thr
635 640 645

AAA CCC GAA TTA GAG GAT TTT AAA AGA ACG ATA GCC GAT TAC CTT TCT 2022
Lys Pro Glu Leu Glu Asp Phe Lys Arg Thr Ile Ala Asp Tyr Leu Ser
650 655 660

TAT GAA GTA AGC TTG ATT TTA AAA AAC CAA ATG CCC CCA AAG CGA TAGGC 2072
Tyr Glu Val Ser Leu Ile Leu Lys Asn Gln Met Pro Pro Lys Arg
665 670 675

CCCCTTAATA GCCAACTCAA CGCTATTAAG TGGGGCGAG 2111
(2) INFORMATION FOR SEQ ID NO: 1096:

ATC CCT AAT TTG GTA ACG CCC CTA AAC AAA GAA GAT CTA AAA TCC AGC 822
Ile Pro Asn Leu Val Thr Pro Leu Asn Lys Glu Asp Leu Lys Ser Ser
250 255 260

GAT GAG GTC CAT CAA AGA GAT GGC GAC ATC ATG CTC AGA AAA ATC CAA 870
Asp Glu Val His Gln Arg Asp Gly Asp Ile Met Leu Arg Lys Ile Gln
265 270 275 280

TCC TTT TTA GAG AAT AAG GAT TTG TCT CCA GAG AAA AGG CAA AGC ATT 918
Ser Phe Leu Glu Asn Lys Asp Leu Ser Pro Glu Lys Arg Gln Ser Ile
285 290 295

ATT TCT TCA TTA GAG ACT TTA TTA AGA AAC GAA AAC AAC AAC AAA GCC 966
Ile Ser Ser Leu Glu Thr Leu Leu Arg Asn Glu Asn Asn Asn Lys Ala
300 305 310

ACT AAT GGC GAA AGC TGT TTG AAG CGT TGT TTT AGT GAG ATT GTG GAT 1014
Thr Asn Gly Glu Ser Cys Leu Lys Arg Cys Phe Ser Glu Ile Val Asp
315 320 325

AGT TTG GGC ATT TAT TAT AAA ATC GGT CTT AGC ACG GAT TTT ACC GGT 1062
Ser Leu Gly Ile Tyr Tyr Lys Ile Gly Leu Ser Thr Asp Phe Thr Gly
330 335 340

AAA TTG TTC AAT GAA ATG TAT CGC TGG CTG GGT TTC ACG AAA GAC CAA 1110
Lys Leu Phe Asn Glu Met Tyr Arg Trp Leu Gly Phe Thr Lys Asp Gln
345 350 355 360

TTA AAC GAT GTG GTG CTC ACA CCC CCT TAT GTC GCC ACG CTT TTA GCT 1158
Leu Asn Asp Val Val Leu Thr Pro Pro Tyr Val Ala Thr Leu Leu Ala
365 370 375

AGA CTT TCT AAA GTC AAT AAG GAT AGT TTC GTG TGG GAT TTT GCC ACC 1206
Arg Leu Ser Lys Val Asn Lys Asp Ser Phe Val Trp Asp Phe Ala Thr
380 385 390

GGA AGC GCT GGG CTA TTA GTC GCA AGC ATG AAT TTG ATG ATA GAA GAC 1254
Gly Ser Ala Gly Leu Leu Val Ala Ser Met Asn Leu Met Ile Glu Asp
395 400 405

GCT AAA AAG CGT ATC ACT AGT CCA GAG GAA TTA GAG CAA AAA ATC GCC 1302
Ala Lys Lys Arg Ile Thr Ser Pro Glu Glu Leu Glu Gln Lys Ile Ala
410 415 420

CAC ATT AAA GCC AAG CAA CTT TTA GGG ATA GAA ATC TTA TCG GAT ATC 1350
His Ile Lys Ala Lys Gln Leu Leu Gly Ile Glu Ile Leu Ser Asp Ile
425 430 435 440

CAT ACT TTA GCG GTG TTA AAC ATG ATT TTA ATG GGC GAT GGG AGC AGT 1398
His Thr Leu Ala Val Leu Asn Met Ile Leu Met Gly Asp Gly Ser Ser
445 450 455

CAA ATC TTA AAC CAA GAC GGC TTG AGC GGT TTT GAT GGC AAA GTC AAT 1446
Gln Ile Leu Asn Gln Asp Gly Leu Ser Gly Phe Asp Gly Lys Val Asn
460 465 470

Glu Tyr Lys Leu Glu Gln Glu Ser Leu Asn Lys Glu Ile Asp Glu Ala
25 30 35 40

TTA AAA AAT TAC GCT TCT AAA AAT GGG GGT TTA GGG GGT AAC CGC CCT 198
Leu Lys Asn Tyr Ala Ser Lys Asn Gly Gly Leu Gly Gly Asn Arg Pro
45 50 55

GAT GTG AAA CTT TTA TTA AAC ACA CAA GAC CCC AAC AGA AGA GTC CCT 246
Asp Val Lys Leu Leu Leu Asn Thr Gln Asp Pro Asn Arg Arg Val Pro
60 65 70

ATT TTA ATA GAA TAC AAA GGG CTA AAA GAT AAG CTC ATT AAA TTA GAC 294
Ile Leu Ile Glu Tyr Lys Gly Leu Lys Asp Lys Leu Ile Lys Leu Asp
75 80 85

AAA AAC AAA CTG GTA GAA AAC TTT AAA AAC CAT GAG CCT CAT TAT AAA 342
Lys Asn Lys Leu Val Glu Asn Phe Lys Asn His Glu Pro His Tyr Lys
90 95 100

AAC ATT AGA GAA TAC GCC CTA AAT GGG GCT TTG CAT TAC GCT AAT GCG 390
Asn Ile Arg Glu Tyr Ala Leu Asn Gly Ala Leu His Tyr Ala Asn Ala
105 110 115 120

ATT TTA CAC CAC ACG AGC TAC ACT GAA TGC ATC GCC ATA GGC ATT ACA 438
Ile Leu His His Thr Ser Tyr Thr Glu Cys Ile Ala Ile Gly Ile Thr
125 130 135

GGC TAT AAA GAC AAT AAG GGC GGC ATA TGC TCT CAA ATC GCT GTC TAT 486
Gly Tyr Lys Asp Asn Lys Gly Gly Ile Cys Ser Gln Ile Ala Val Tyr
140 145 150

TAT GTG AAT AAA AGC AAT CTA GGC ATG GGG ATA GAT GTT TCA AAA GGC 534
Tyr Val Asn Lys Ser Asn Leu Gly Met Gly Ile Asp Val Ser Lys Gly
155 160 165

GAG CAA GGT TAT AGC GAT CTC TCC TTT TTA AGC CGT AAG CAT TTT AAC 582
Glu Gln Gly Tyr Ser Asp Leu Ser Phe Leu Ser Arg Lys His Phe Asn
170 175 180

GAC TTT ATT AAA CGA GTA GAC ACC CTT TCT TTA AGC GAT GAA GAT TTA 630
Asp Phe Ile Lys Arg Val Asp Thr Leu Ser Leu Ser Asp Glu Asp Leu
185 190 195 200

GAG CGC ATT AGA GAA AAG AAA AAC CAA GAA ATA GAA GAC TGC TTA ATG 678
Glu Arg Ile Arg Glu Lys Lys Asn Gln Glu Ile Glu Asp Cys Leu Met
205 210 215

CGG CTC AAC AAC AAT ATT TAC AAC AAA GAA AAG AAT TTT TTA AGC GAA 726
Arg Leu Asn Asn Asn Ile Tyr Asn Lys Glu Lys Asn Phe Leu Ser Glu
220 225 230

CAC AAT CGG GTA TAT TTA GTG ATT GCG AGC ATT ATC GCT AAT TTA GGC 774
His Asn Arg Val Tyr Leu Val Ile Ala Ser Ile Ile Ala Asn Leu Gly
235 240 245

Glu Tyr Leu Phe Ile Val Phe Cys Ala Pro Val Gly Ala Tyr Phe Lys
145 150 155 160

Gly Gly Ile Glu Lys Gly Gly Ala Arg Phe Ile Thr Thr Ile Phe Asp
165 170 175

Arg Ala Ala Pro Lys Gly Thr Gly Gly Val Lys Val Gly Gly Asn Tyr
180 185 190

Ala Ala Ser Leu Leu Ala His Lys Met Ala Thr Glu Gln Gly Tyr Asp
195 200 205

Asp Cys Ile Tyr Leu Asp Pro Thr Thr His Thr Lys Ile Glu Glu Val
210 215 220

Gly Ala Ala Asn Phe Phe Gly Ile Thr His Asp Asp Ala Phe Ile Thr
225 230 235 240

Pro His Ser Pro Ser Ile Leu Pro Ser Ile Thr Lys Lys Ser Leu Met
245 250 255

Val Leu Ala Lys Glu Tyr Leu Asn Leu Lys Val Glu Glu Arg Glu Ile
260 265 270

Leu Met Asp Glu Leu Asp Ala Phe Lys Glu Ala Gly Ala Cys Gly Thr
275 280 285

Ala Ala Ile Ile Thr Pro Ile Lys Glu Ile Val His Asn Asn Lys Ser
290 295 300

Tyr Phe Phe Glu Ala Pro Gly His Ile Thr Lys Arg Leu Tyr Asp Leu
305 310 315 320

Leu Leu Ser Ile Gln Gln Gly Glu Gln Glu Ala Pro Lys Asp Trp Ile
325 330 335

Phe Glu Val Gly
340

(2) INFORMATION FOR SEQ ID NO: 1095:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2111 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...2067
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095: 

GTTATAATTT TATTTTTTAA AAGGATACCC ATG AAT AAA GTT CAA TCT ATT GAT 54 

Met Asn Lys Val Gln Ser Ile Asp
1 5

CCT TTA ATC GCT GAT AAG TTC AAC AAC GAG TTA AGA ACT TAT AAC CTA 102

Pro Leu Ile Ala Asp Lys Phe Asn Asn Glu Leu Arg Ser Tyr Asn Leu
10 15 20

GAA TAC AAA CTA GAG CAA GAA AGC CTG AAT AAA GAA ATT GAT GAA GCT 150
Ile Thr Lys Lys Ser Leu Met Val Leu Ala Lys Glu Tyr Leu Asn Leu
250 255 260 265

AAA GTA GAA GAG AGG GAA ATC CTA ATG GAT GAG TTG GAT GCG TTT AAA 870
Lys Val Glu Glu Arg Glu Ile Leu Met Asp Glu Leu Asp Ala Phe Lys
270 275 280

GAA GCT GGA GCG TGC GGG ACA GCT GCG ATC ATT ACG CCC ATT AAA GAA 918
Glu Ala Gly Ala Cys Gly Thr Ala Ala Ile Ile Thr Pro Ile Lys Glu
285 290 295

ATC GTG CAC AAC AAC AAG TCT TAT TTT TTT GAA GCG CCG GGC CAT ATT 966

Ile Val His Asn Asn Lys Ser Tyr Phe Phe Glu Ala Pro Gly His Ile
300 305 310

ACT AAA CGA CTC TAT GAT TTG CTT TTA TCC ATC CAA CAA GGC GAA CAA 1014
Thr Lys Arg Leu Tyr Asp Leu Leu Leu Ser Ile Gln Gln Gly Glu Gln
315 320 325

GAA GCC CCC AAA GAT TGG ATT TTT GAA GTT GGC TAAAAGGTTA AAATTTATAG 1067
Glu Ala Pro Lys Asp Trp Ile Phe Glu Val Gly
330 335 340

CTGTATGCCG CATAAAATAA GGGCG 1092

(2) INFORMATION FOR SEQ ID NO: 1094:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 340 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1094:

Met Ala Asn Leu Glu Asn Leu Asp Trp Lys Asn Leu Gly Phe Ser Tyr

1 5 10 15

Ile Lys Thr Asp Phe Arg Phe Ile Ala Thr Tyr Lys Asn Gly Ser Trp

20 25 30

Ser Gln Gly Gly Leu Val Ser Glu Asn Met Leu Gln Leu Ser Glu Gly

35 40 45

Ser Pro Val Leu His Tyr Gly Gln Ala Cys Phe Glu Gly Leu Lys Ala

50 55 60

Tyr Arg Ser Gln Lys Gly Lys Ala Leu Leu Phe Arg Pro Leu Glu Asn
65 70 75 80

Ala Lys Arg Leu Gln Thr Ser Cys Glu Arg Leu Leu Met Pro Lys Val

85 90 95

Ser Glu Glu Leu Phe Leu Arg Ala Cys Ala Glu Val Val Lys Ala Asn

100 105 110

Gln Lys Trp Leu Ala Pro Tyr Lys Ser Gly Ala Ser Leu Tyr Leu Arg

115 120 125

Pro Phe Val Ile Gly Val Gly Asp Asn Leu Gly Val Lys Pro Ala Asn
130 135 140 
30 35 40

ATG TTA CAA CTC AGC GAA GGC TCG CCG GTC TTG CAC TAG GGG GAG GGT 198
Met Leu Gln Leu Ser Glu Gly Ser Pro Val Leu His Tyr Gly Gln Ala
45 50 55

TGT TTT GAA GGC TTG AAG GGT TAG GGG TCT CAA AAG GGG AAA GGT TTA 246

Cys Phe Glu Gly Leu Lys Ala Tyr Arg Ser Gln Lys Gly Lys Ala Leu
60 65 70

CTC TTT GGG GCT TTA GAA AAC GGC AAA CGC TTG CAA ACT TCA TGC GAA 294

Leu Phe Arg Pro Leu Glu Asn Ala Lys Arg Leu Gln Thr Ser Cys Glu
75 80 85

AGA CTG CTC ATG CCC AAA GTG AGC GAA GAG CTG TTT TTA AGG GCA TGC 342

Arg Leu Leu Met Pro Lys Val Ser Glu Glu Leu Phe Leu Arg Ala Cys
90 95 100 105

GCT GAA GTG GTG AAA GCG AAT CAA AAA TGG CTC GCT CCT TAT AAA AGC 390

Ala Glu Val Val Lys Ala Asn Gln Lys Trp Leu Ala Pro Tyr Lys Ser
110 115 120

GGG GCG AGT TTG TAT TTG CGC CCT TTT GTC ATA GGC GTA GGG GAT AAT 438
Gly Ala Ser Leu Tyr Leu Arg Pro Phe Val Ile Gly Val Gly Asp Asn
125 130 135

TTG GGG GTG AAG CCG GCT AAT GAA TAG CTT TTT ATG GTG TTT TGT GGG 486

Leu Gly Val Lys Pro Ala Asn Glu Tyr Leu Phe Ile Val Phe Cys Ala
140 145 150

GGT GTG GGG GGG TAT TTT AAG GGG GGT ATA GAA AAA GGG GGG GCT AGG 534

Pro Val Gly Ala Tyr Phe Lys Gly Gly Ile Glu Lys Gly Gly Ala Arg
155 160 165

TTT ATG ACT AGG ATT TTT GAT AGG GCG GCG CCT AAA GGC ACC GGT GGG 582

Phe Ile Thr Thr Ile Phe Asp Arg Ala Ala Pro Lys Gly Thr Gly Gly
170 175 180 185

GTG AAA GTG GGA GGG AAT TAG GCT GCA AGC CTG TTA GGC CAT AAA ATG 630

Val Lys Val Gly Gly Asn Tyr Ala Ala Ser Leu Leu Ala His Lys Met
190 195 200

GCC ACA GAG CAA GGC TAT GAT GAT TGC ATT TAT TTA GAG CCT ACT ACG 678

Ala Thr Glu Gln Gly Tyr Asp Asp Cys Ile Tyr Leu Asp Pro Thr Thr
205 210 215

CAC ACT AAA ATT GAA GAA GTG GGG GCG GCG AAT TTT TTT GGC ATG ACG 726

His Thr Lys Ile Glu Glu Val Gly Ala Ala Asn Phe Phe Gly Ile Thr
220 225 230

GAT GAT GAT GGC TTT ATG AGG CCG GAT TGG CCA AGG ATT CTG GGA AGC 774
His Asp Asp Ala Phe Ile Thr Pro His Ser Pro Ser Ile Leu Pro Ser
235 240 245

ATT ACC AAA AAA AGC TTG ATG GTT TTG GCT AAA GAA TAT TTG AAC CTC 822
35 40 45

Lys Asn Phe Ser His Ser Val Ile Lys Ser Leu Asp Thr Leu Ala Ile

50 55 60

Glu Val Glu Lys Asn Ile Pro Thr Gln Ala Gly Leu Gly Gly Gly Ser
65 70 75 80

Thr Asp Ala Gly Gly Leu Leu Tyr His Leu Asn Gln Ile Phe Asp Trp

85 90 95

Arg Leu Ser Leu Glu Glu Leu Tyr Ser Met Gly Ser Leu Val Gly Ala

100 105 110

Asp Thr Asn Phe Phe Ile Ser Gln Tyr Lys Ser Thr Asn Ala Thr Ser

115 120 125

Tyr Gly Glu Val Ile Glu Asn Phe Glu Glu Glu Pro Leu Glu Asn Arg

130 135 140

Leu Glu Ile Tyr Ala Pro Asn His Val Phe Cys Ser Thr Lys Ala Val
145 150 155 160

Tyr Gln Ala Tyr Lys Pro Glu Thr Cys Phe Ser Gln Ala Lys Glu Trp

165 170 175

Leu Lys Lys Pro Ser Leu Glu Cys Leu Lys Thr Tyr Asp Arg Asn Gly

180 185 190

Leu Asn Asp Leu Leu Lys Pro Ala Leu Leu Thr Asn Gln Ala Leu Lys

195 200 205

Asp Ile Glu Ser Glu Leu Gly Lys Glu Trp Phe Phe Ser Gly Ser Gly

210 215 220

Ser Ala Phe Phe Arg Leu Lys Pro Met Gln Lys Gly Gly Glu
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 1093: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28...1047
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093: 

TGAAACTATC CATTTAAAGG TGTGAAA ATG GCA AAT TTA GAA AAT TTA GAC TGG 54 

Met Ala Asn Leu Glu Asn Leu Asp Trp 
1 5

AAA AAT TTA GGC TTT AGC TAC ATT AAA ACG GAT TTT CGC TTC ATC GCC 102
Lys Asn Leu Gly Phe Ser Tyr Ile Lys Thr Asp Phe Arg Phe Ile Ala
10 15 20 25

ACT TAT AAA AAC GGC TCT TGG TCG CAA GGC GGA TTG GTG AGC GAA AAC 150

Thr Tyr Lys Asn Gly Ser Trp Ser Gln Gly Gly Leu Val Ser Glu Asn
TAT AGC ATG GGA TCT TTA GTG GGC GCG GAC ACC AAT TTT TTC ATC TCG 450

Tyr Ser Met Gly Ser Leu Val Gly Ala Asp Thr Asn Phe Phe Ile Ser
105 110 115

CAA TAC AAA AGC ACT AAC GCC ACT TCT TAT GGC GAA GTC ATT GAA AAT 498

Gln Tyr Lys Ser Thr Asn Ala Thr Ser Tyr Gly Glu Val Ile Glu Asn
120 125 130 135

TTT GAA GAA GAG CCT TTA GAA AAT CGC CTA GAA ATC TAT GCA CCA AAT 546

Phe Glu Glu Glu Pro Leu Glu Asn Arg Leu Glu Ile Tyr Ala Pro Asn
140 145 150

CAT GTT TTT TGC AGC ACC AAA GCC GTT TAT CAA GCT TAT AAG CCT GAA 594
His Val Phe Cys Ser Thr Lys Ala Val Tyr Gln Ala Tyr Lys Pro Glu
155 160 165

ACT TGT TTT TCT CAA GCT AAA GAA TGG CTT AAA AAG CCG AGT TTG GAA 642

Thr Cys Phe Ser Gln Ala Lys Glu Trp Leu Lys Lys Pro Ser Leu Glu
170 175 180

TGC CTA AAA ACT TAT GAT AGA AAC GGA TTA AAC GAC CTT TTA AAG CCG 690

Cys Leu Lys Thr Tyr Asp Arg Asn Gly Leu Asn Asp Leu Leu Lys Pro
185 190 195

GCT TTA CTC ACT AAC CAA GCC TTA AAA GAT ATA GAA AGC GAA CTA GGC 738

Ala Leu Leu Thr Asn Gln Ala Leu Lys Asp Ile Glu Ser Glu Leu Gly
200 205 210 215

AAG GAG TGG TTT TTT AGC GGG AGC GGG AGC GCG TTT TTT AGG CTA AAG 786

Lys Glu Trp Phe Phe Ser Gly Ser Gly Ser Ala Phe Phe Arg Leu Lys
220 225 230

CCT ATG CAA AAA GGG GGC GAA TGAAACTCAT TGCCAGCAAC AAAAAAGCCT ATTT 841
Pro Met Gln Lys Gly Gly Glu
235

TGACTA 847

(2) INFORMATION FOR SEQ ID NO: 1092: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092:

Met Cys Leu Val Lys Asp Lys Leu Lys Asp Ile Ile Ser Val Lys Ser

1 5 10 15

Ala Leu Ser Phe Ser Leu Lys Gly Asp Phe Asp Cys Pro Leu Glu Glu

20 25 30

Asn Ser Leu Phe Lys Ala Leu Gln Ile Leu Lys Asn Phe Leu Lys Ser
290 295 300

Leu Lys Phe Gly Glu Ser Phe Thr Ala His Tyr Lys Cys Gln Asn Leu
305 310 315 320

Ile Glu Val Gly Val Lys Thr Asp Lys Gly Ser Trp Thr Phe Asn Phe
325 330 335

Asn Arg



(2) INFORMATION FOR SEQ ID NO: 1091: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 94...807
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091: 

GCCTTGACGC ATGTTTTTGA AGTTTATCCT AAAGTCAATA TTTTTTTAAA AATCCTTCAC 60

AAAGAAGGGG CTTACCACAA GCTTATTTCT CGC ATG TGT TTG GTC AAA GAC AAG 114

Met Cys Leu Val Lys Asp Lys
1 5

CTC AAA GAC ATT ATC AGC GTC AAA AGC GCG CTT TCT TTT TCG TTA AAA 162
Leu Lys Asp Ile Ile Ser Val Lys Ser Ala Leu Ser Phe Ser Leu Lys
10 15 20

GGG GAT TTT GAC TGC CCT TTA GAA GAA AAC TCG CTC TTT AAA GCC CTC 210
Gly Asp Phe Asp Cys Pro Leu Glu Glu Asn Ser Leu Phe Lys Ala Leu
25 30 35

CAA ATT TTA AAG AAT TTT TTA AAA TCA AAA AAT TTC TCT CAT TCT GTC 258

Gln Ile Leu Lys Asn Phe Leu Lys Ser Lys Asn Phe Ser His Ser Val
40 45 50 55

ATC AAA TCC CTA GAC ACC CTA GCG ATT GAA GTG GAA AAA AAC ATC CCC 306

Ile Lys Ser Leu Asp Thr Leu Ala Ile Glu Val Glu Lys Asn Ile Pro
60 65 70

ACT CAA GCC GGA TTA GGC GGT GGG AGC ACT GAT GCT GGG GGG CTA TTG 354

Thr Gln Ala Gly Leu Gly Gly Gly Ser Thr Asp Ala Gly Gly Leu Leu
75 80 85

TAT CAT TTA AAT CAG ATT TTT GAC TGG CGT TTG AGT TTA GAA GAG CTT 402

Tyr His Leu Asn Gln Ile Phe Asp Trp Arg Leu Ser Leu Glu Glu Leu
90 95 100 
ACC GAT AAA GGC TCC TGG ACT TTC AAC TTT AAC AGA TAAATCAGGC AAATAT 1066
Thr Asp Lys Gly Ser Trp Thr Phe Asn Phe Asn Arg
330 335

GGACAATAGC ACAGACAGAG CAAAAATCCT TATAGAAGAG CTTAAAAT 1114

(2) INFORMATION FOR SEQ ID NO: 1090:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 338 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090: 



Met Phe Phe Lys Thr Tyr Gln Lys Leu Leu Gly Ala Ser Cys Leu Thr
1 5 10 15

Leu Tyr Leu Ala Gly Cys Gly Ser Asp Ser Ser Glu Pro Leu Val Gly
20 25 30

Ile Glu Lys Asn Ser Phe Asn Ser Thr Val Lys Ile Ile Ser Lys Thr
35 40 45

Asp Asn Ile Glu Ile Gln Asp Leu Lys Leu Asn Arg Gly Asn Cys Glu
50 55 60

His Asp Gln Asn Phe Leu Val Lys Leu Ile Gln Glu Thr Ala Asn Thr
65 70 75 80

Tyr Leu Phe Ala Ser Glu Lys Glu Lys Ala Ile Lys Asn His Gln Ala
85 90 95

Lys Ile Ala Arg Leu Gln Lys Asp Leu Glu Glu Leu Thr Gln His Val
100 105 110

Gln Gln Ser Asn Asn Leu Asp Lys Leu Leu Glu Asn Gly Gly Leu Phe
115 120 125

Val Ser Gly His Asp Tyr Lys Tyr Thr Lys Asp Asp Asn Pro Ile Tyr
130 135 140

Val Val Lys Arg Met Leu Asp Asn Leu Asp Ser Tyr Lys Tyr Glu Ser
145 150 155 160

Asp Asp Val Leu Asp Val Pro Tyr Glu Lys Leu Leu Glu Ile Ser Ile
165 170 175

Ala Ile Glu Asp Thr Lys Asn Pro Lys Asp Tyr Pro Tyr Ile Asn Leu
180 185 190

Lys Glu Leu Lys Lys Leu Ile Asp Ser Ile Ile Asp Asp His Gly Tyr
195 200 205

Met Ala Asp Gly Phe Leu Asn Glu Tyr Ser Asn Arg Val Ser Lys Lys
210 215 220

Gly Leu Gln Ile Leu Ala Lys Leu Lys Ser Met Trp Pro Ser Val Gly
225 230 235 240

Lys Phe Tyr Phe Ala Ser Leu Lys Glu Ala Ile Pro Arg His Ala Lys
245 250 255

Glu Val Thr Asp Lys Met Ile Ser Ser Glu Glu Lys Ser Ile Lys Ala
260 265 270

Asn Gln Val Lys Leu Thr Glu Ala Lys Gln Asp Ile Asp Lys Met Glu
275 280 285

Lys Ile Ile Lys Asp Leu Glu Ser Lys Lys Asn Thr Leu Ser Val Tyr
AAA GAT TTA GAA GAA CTC ACA CAG CAT GTG CAA CAA TCC AAT AAT CTT 390
Lys Asp Leu Glu Glu Leu Thr Gln His Val Gln Gln Ser Asn Asn Leu
105 110 115

GAT AAA TTG TTA GAA AAT GGA GGA CTA TTC GTT AGT GGC CAT GAT TAT 438
Asp Lys Leu Leu Glu Asn Gly Gly Leu Phe Val Ser Gly His Asp Tyr
120 125 130

AAA TAT ACA AAA GAT GAT AAC CCA ATA TAT GTT GTT AAG AGG ATG CTT 486
Lys Tyr Thr Lys Asp Asp Asn Pro Ile Tyr Val Val Lys Arg Met Leu
135 140 145 150

GAT AAC CTT GAT AGC TAT AAA TAT GAA TCA GAC GAC GTG CTA GAC GTG 534
Asp Asn Leu Asp Ser Tyr Lys Tyr Glu Ser Asp Asp Val Leu Asp Val
155 160 165

CCA TAT GAG AAG CTA TTG GAA ATA AGC ATT GCT ATT GAA GAC ACT AAA 582
Pro Tyr Glu Lys Leu Leu Glu Ile Ser Ile Ala Ile Glu Asp Thr Lys
170 175 180

AAC CCC AAA GAC TAC CCT TAT ATC AAC CTT AAA GAA CTC AAA AAA TTA 630
Asn Pro Lys Asp Tyr Pro Tyr Ile Asn Leu Lys Glu Leu Lys Lys Leu
185 190 195

ATA GAT AGT ATT ATT GAT GAT CAT GGT TAT ATG GCC GAT GGC TTT TTG 678
Ile Asp Ser Ile Ile Asp Asp His Gly Tyr Met Ala Asp Gly Phe Leu
200 205 210

AAT GAA TAT TCT AAT AGG GTA TCA AAA AAA GGT CTC CAA ATC CTT GCT 726
Asn Glu Tyr Ser Asn Arg Val Ser Lys Lys Gly Leu Gln Ile Leu Ala
215 220 225 230

AAA CTA AAA TCC ATG TGG CCT AGC GTA GGG AAA TTT TAT TTC GCC TCT 774
Lys Leu Lys Ser Met Trp Pro Ser Val Gly Lys Phe Tyr Phe Ala Ser
235 240 245

TTG AAA GAG GCT ATC CCA AGG CAT GCC AAA GAA GTT ACT GAC AAG ATG 822
Leu Lys Glu Ala Ile Pro Arg His Ala Lys Glu Val Thr Asp Lys Met
250 255 260

ATT AGC TCT GAA GAA AAA TCT ATC AAA GCC AAT CAA GTC AAA CTC ACT 870
Ile Ser Ser Glu Glu Lys Ser Ile Lys Ala Asn Gln Val Lys Leu Thr
265 270 275

GAA GCG AAG CAA GAT ATT GAC AAA ATG GAA AAA ATC ATT AAA GAT TTA 918
Glu Ala Lys Gln Asp Ile Asp Lys Met Glu Lys Ile Ile Lys Asp Leu
280 285 290

GAA AGC AAG AAA AAC ACC TTA TCA GTG TAT TTA AAA TTT GGA GAA AGT 966
Glu Ser Lys Lys Asn Thr Leu Ser Val Tyr Leu Lys Phe Gly Glu Ser
295 300 305 310

TTC ACA GCG CAT TAT AAG TGT CAA AAT CTC ATA GAA GTT GGA GTC AAA 1014
Phe Thr Ala His Tyr Lys Cys Gln Asn Leu Ile Glu Val Gly Val Lys
315 320 325
Thr Ile Lys Gln Ala Gln Asp Arg Leu Met Glu Leu Ser Lys Val Lys

245 250 255

Lys Ala Tyr Trp Leu Glu Lys Ser Pro Met Glu Arg Phe Ile Glu Lys

260 265 270

Ala Thr Gln Ser Ala Thr Asn Ile Ile Thr Gln Ala Phe Gly Tyr Gln

275 280 285

Leu Leu Met Arg
290

(2) INFORMATION FOR SEQ ID NO: 1089:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37...1050
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1089: 

CGAGCTATCA CAACAAATCA ATTTGTAGGA ACAAGC ATG TTT TTT AAA ACT TAT 54 

Met Phe Phe Lys Thr Tyr 
1 5 

CAA AAA TTA TTG GGT GCG AGC TGT TTG ACG TTG TAT TTA GCG GGC TGT 102 
Gln Lys Leu Leu Gly Ala Ser Cys Leu Thr Leu Tyr Leu Ala Gly Cys
10 15 20

GGG AGT GAT ACT AGC GAG CCA TTG GTG GGA ATT GAA AAA AAT AGC TTC 150

Gly Ser Asp Ser Ser Glu Pro Leu Val Gly Ile Glu Lys Asn Ser Phe
25 30 35

AAT TCT ACC GTG AAA ATC ATT TCT AAA ACC GAC AAC ATA GAA ATC CAA 198
Asn Ser Thr Val Lys Ile Ile Ser Lys Thr Asp Asn Ile Glu Ile Gln
40 45 50

GAC TTG AAG CTC AAT CGT GGC AAT TGT GAG CAT GAT CAA AAT TTC TTG 246

Asp Leu Lys Leu Asn Arg Gly Asn Cys Glu His Asp Gln Asn Phe Leu
55 60 65 70

GTA AAG TTA ATC CAA GAA ACA GCC AAT ACA TAC CTG TTT GCA TCA GAA 294

Val Lys Leu Ile Gln Glu Thr Ala Asn Thr Tyr Leu Phe Ala Ser Glu
75 80 85

AAA GAA AAA GCG ATC AAA AAC CAC CAA GCA AAA ATC GCA AGA CTT CAA 342

Lys Glu Lys Ala Ile Lys Asn His Gln Ala Lys Ile Ala Arg Leu Gln
90 95 100 
Lys Lys Ala Tyr Trp Leu Glu Lys Ser Pro Met Glu Arg Phe Ile Glu
260 265 270

AAA GCC ACG CAA TCA GCC ACA AAC ATC ATC ACA CAA GCC TTT GGC TAT 924

Lys Ala Thr Gln Ser Ala Thr Asn Ile Ile Thr Gln Ala Phe Gly Tyr
275 280 285

CAA TTA TTA ATG AGA TAAAGATGTT AGAATTTATT TTAAAAATTC AAGCTAGAGA C 980
Gln Leu Leu Met Arg
290

TCTAAAGGCT T 991

(2) INFORMATION FOR SEQ ID NO: 1088:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 292 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1088: 



Met Trp Ser Phe Ile Gln Lys Ile Phe Lys Ala Leu Ile Ile Ala Pro
1 5 10 15

Leu Asp Phe Ile Thr Lys Tyr Phe Lys Ser Phe Val Leu Leu Leu Ile
20 25 30

Val Leu Val Phe Phe Ser Ala Lys Glu Ser Ala Pro Ser Ala Pro Pro
35 40 45

Asn Leu Ala Lys Leu Tyr Leu Asn Gly Ala Ile Phe Ser Thr Glu Asp
50 55 60

Phe Asp Lys Glu Val Asp Lys Ile Leu Lys Thr Pro Ser Ile Lys Gly
65 70 75 80

Val Leu Leu Leu Ile Asp Ser Pro Gly Gly Ala Val Ser Ala Ser Val
85 90 95

Glu Leu Ser Glu Lys Ile Ala Asp Leu Lys Gln Lys Met Pro Val Leu
100 105 110

Ala Tyr Ala Arg Gly Val Met Ala Ser Gly Ser Tyr Tyr Ala Gly Met
115 120 125

Gln Ala Ser Glu Val Tyr Ala Ser Lys Ala Ser Leu Ile Gly Ser Ile
130 135 140

Gly Val Ile Phe Ser Gly Ala Asn Val Glu Asn Leu Leu Asn Lys Val
145 150 155 160

Gly Val Ala Thr Gln Gly Val His Ala Gly Glu Tyr Lys Glu Ile Gly
165 170 175

Thr Phe Thr Arg Ala Trp Lys Pro Asn Glu Lys Asp Phe Leu Gln Asn
180 185 190

Leu Val Asn Glu Gln Tyr Gln Met Phe Val Asn Asp Val Ala Lys Ala
195 200 205

Arg Lys Leu Asp Ala Lys Asp Tyr Lys Asp Phe Ala Glu Gly Lys Val
210 215 220

Phe Ser Ala Gln Lys Ala Leu Lys Leu Lys Leu Ile Asp Lys Ile Ser
225 230 235 240
35 40 45

CCT AAT CTC GCT AAA CTC TAT TTA AAT GGG GCG ATT TTT AGC ACC GAG 252
Pro Asn Leu Ala Lys Leu Tyr Leu Asn Gly Ala Ile Phe Ser Thr Glu
50 55 60

GAT TTT GAC AAA GAA GTG GAT AAA ATC CTA AAA ACC CCT AGC ATT AAG 300
Asp Phe Asp Lys Glu Val Asp Lys Ile Leu Lys Thr Pro Ser Ile Lys
65 70 75

GGC GTT TTG CTT TTG ATT GAC TCT CCT GGT GGG GCG GTG TCA GCG AGC 348
Gly Val Leu Leu Leu Ile Asp Ser Pro Gly Gly Ala Val Ser Ala Ser
80 85 90 95

GTG GAA TTG AGC GAA AAA ATC GCT GAT TTG AAG CAA AAA ATG CCC GTT 396
Val Glu Leu Ser Glu Lys Ile Ala Asp Leu Lys Gln Lys Met Pro Val
100 105 110

TTA GCG TAT GCT AGG GGG GTT ATG GCG AGC GGG AGC TAT TAT GCG GGC 444
Leu Ala Tyr Ala Arg Gly Val Met Ala Ser Gly Ser Tyr Tyr Ala Gly
115 120 125

ATG CAA GCG AGC GAA GTT TAT GCC TCT AAA GCG AGT TTG ATA GGA TCC 492
Met Gln Ala Ser Glu Val Tyr Ala Ser Lys Ala Ser Leu Ile Gly Ser
130 135 140

ATT GGG GTG ATT TTT TCA GGT GCG AAT GTG GAA AAT TTG CTC AAT AAA 540
Ile Gly Val Ile Phe Ser Gly Ala Asn Val Glu Asn Leu Leu Asn Lys
145 150 155

GTC GGC GTA GCC ACT CAA GGC GTG CAT GCG GGC GAA TAC AAA GAA ATA 588
Val Gly Val Ala Thr Gln Gly Val His Ala Gly Glu Tyr Lys Glu Ile
160 165 170 175

GGC ACT TTC ACC AGA GCG TGG AAA CCC AAC GAA AAA GAT TTT TTG CAA 636
Gly Thr Phe Thr Arg Ala Trp Lys Pro Asn Glu Lys Asp Phe Leu Gln
180 185 190

AAT TTA GTC AAT GAG CAA TAC CAA ATG TTT GTG AAT GAT GTC GCA AAA 684
Asn Leu Val Asn Glu Gln Tyr Gln Met Phe Val Asn Asp Val Ala Lys
195 200 205

GCT AGG AAA TTA GAC GCT AAG GAT TAT AAG GAT TTT GCT GAA GGG AAG 732
Ala Arg Lys Leu Asp Ala Lys Asp Tyr Lys Asp Phe Ala Glu Gly Lys
210 215 220

GTC TTT AGC GCT CAA AAG GCT CTG AAA TTA AAA CTC ATT GAT AAA ATC 780
Val Phe Ser Ala Gln Lys Ala Leu Lys Leu Lys Leu Ile Asp Lys Ile
225 230 235

AGC ACG ATT AAG CAA GCG CAA GAT CGC TTA ATG GAA TTG AGT AAG GTT 828
Ser Thr Ile Lys Gln Ala Gln Asp Arg Leu Met Glu Leu Ser Lys Val
240 245 250 255

AAA AAA GCT TAT TGG CTA GAA AAA AGC CCT ATG GAG CGC TTC ATT GAA 876
Leu Ser Leu Lys Ile Gln Lys Glu Ala Cys Pro Ile Asn Thr Ala Pro
145 150 155 160

Thr Thr Ser Thr Thr Leu Thr Leu Ala Leu Gly Asp Val Leu Met Ala
165 170 175

Cys Leu Met Arg Ala Lys Asn Phe Ser Gln Glu Asp Phe Ala Ser Phe
180 185 190

His Pro Gly Gly Leu Leu Gly Lys Lys Leu Phe Val Lys Val Lys Asp
195 200 205

Leu Leu Gln Thr Thr Asn Leu Pro Leu Ile Ala Pro Ser Thr Ser Phe
210 215 220

Lys Asp Ala Leu Ile Glu Met Ser Glu Lys Arg Leu Gly Ser Ala Ile
225 230 235 240

Leu Val Asn Glu Ala Asn Glu Leu Val Gly Val Leu Ser Asp Gly Asp
245 250 255

Val Arg Arg Ala Leu Leu Lys Gly Val Ser Leu Lys Ser Glu Val Arg
260 265 270

His Phe Ala Thr Leu Lys Pro Lys Ser Phe Lys Asn Leu Asp Ala Leu
275 280 285

Leu Leu Glu Ala Leu Glu Phe Leu Glu Arg His Lys Ile Gln Leu Leu
290 295 300

Val Cys Val Asp Asp His Asn Lys Val Leu Gly Val Leu His Leu His
305 310 315 320

Gln Leu Leu Glu Leu Gly Leu Lys Ala
325



(2) INFORMATION FOR SEQ ID NO: 1087:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 64...939
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1087: 

TCCTTAAAAC ATTTGAAGTA TAATACACTC TTATGGTCAA ATATTAAGTT TAGGAAAAGC 60

TGC ATG TGG AGT TTC ATT CAA AAA ATC TTT AAG GCT TTA ATC ATC GCA 108

Met Trp Ser Phe Ile Gln Lys Ile Phe Lys Ala Leu Ile Ile Ala
1 5 10 15

CCT TTA GAT TTT ATC ACG AAG TAT TTC AAG TCG TTT GTG CTG TTA CTC 156

Pro Leu Asp Phe Ile Thr Lys Tyr Phe Lys Ser Phe Val Leu Leu Leu
20 25 30

ATT GTA TTA GTC TTT TTT AGC GCT AAA GAA AGC GCG CCA AGC GCC CCG 204

Ile Val Leu Val Phe Phe Ser Ala Lys Glu Ser Ala Pro Ser Ala Pro
TTA GTC AAT GAA GCT AAC GAG CTT GTG GGG GTG TTA AGC GAT GGC GAT 768
Leu Val Asn Glu Ala Asn Glu Leu Val Gly Val Leu Ser Asp Gly Asp
245 250 255

GTC CGT AGG GCG TTA TTA AAA GGG GTG AGT TTA AAG AGC GAA GTG AGG 816
Val Arg Arg Ala Leu Leu Lys Gly Val Ser Leu Lys Ser Glu Val Arg
260 265 270

CAT TTT GCC ACT TTA AAA CCT AAA AGC TTT AAG AAT TTA GAC GCT CTT 864
His Phe Ala Thr Leu Lys Pro Lys Ser Phe Lys Asn Leu Asp Ala Leu
275 280 285

CTT TTA GAA GCG TTG GAA TTT TTA GAG CGC CAT AAG ATC CAG CTT TTA 912
Leu Leu Glu Ala Leu Glu Phe Leu Glu Arg His Lys Ile Gln Leu Leu
290 295 300

GTG TGC GTA GAT GAT CAT AAT AAG GTT TTA GGG GTC TTG CAC TTG CAC 960
Val Cys Val Asp Asp His Asn Lys Val Leu Gly Val Leu His Leu His
305 310 315 320

CAA CTT TTA GAA TTA GGG CTT AAA GCA TGA 990
Gln Leu Leu Glu Leu Gly Leu Lys Ala
325



(2) INFORMATION FOR SEQ ID NO: 1086:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 329 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086: 

Met Pro Ile Leu Phe Asp Cys Asn Ala Ile Ala Ser Gln Val Leu Lys
1 5 10 15

Asp Glu Ala Ser Ala Leu Leu Glu Ser Val Gly Gln Phe Gln Lys Pro
20 25 30

Asn Asp Leu Glu Ala Ile Val Lys Leu Ile Leu Lys Ser Gln Glu Asn
35 40 45

Gly Gly Lys Leu Val Ile Val Gly Val Gly Lys Ser Ala Leu Val Ala
50 55 60

Gln Lys Ile Val Ala Ser Met Leu Ser Thr Gly Asn Arg Ser Ala Phe
65 70 75 80

Leu His Pro Thr Glu Ala Met His Gly Asp Leu Gly Met Val Glu Lys
85 90 95

Asn Asp Val Val Leu Met Ile Ser Tyr Gly Gly Glu Ser Leu Glu Leu
100 105 110

Leu Asn Leu Val Ser His Leu Lys Arg Leu Ser His Lys Ile Ile Thr
115 120 125

Phe Thr Lys Ser Pro Asn Ser Ser Leu Ser Lys Leu Gly Asp Tyr Tyr
130 135 140
GAT GAA GCG AGC GCG CTT TTA GAA AGC GTT GGA CAA TTC CAA AAA CCC 96
Asp Glu Ala Ser Ala Leu Leu Glu Ser Val Gly Gln Phe Gln Lys Pro
20 25 30

AAC GAT TTA GAA GCG ATT GTC AAA CTC ATT TTA AAA AGC CAA GAA AAT 144
Asn Asp Leu Glu Ala Ile Val Lys Leu Ile Leu Lys Ser Gln Glu Asn
35 40 45

GGG GGT AAG CTT GTG ATA GTG GGT GTG GGT AAG AGC GCT TTA GTG GCG 192
Gly Gly Lys Leu Val Ile Val Gly Val Gly Lys Ser Ala Leu Val Ala
50 55 60

CAA AAA ATC GTT GCT TCC ATG CTA AGC ACC GGT AAC AGG AGC GCG TTT 240
Gln Lys Ile Val Ala Ser Met Leu Ser Thr Gly Asn Arg Ser Ala Phe
65 70 75 80

TTA CAC CCC ACA GAA GCC ATG CAT GGG GAT TTG GGC ATG GTG GAA AAA 288
Leu His Pro Thr Glu Ala Met His Gly Asp Leu Gly Met Val Glu Lys
85 90 95

AAC GAT GTG GTT TTA ATG ATT AGC TAT GGG GGC GAG TCT TTA GAA TTA 336
Asn Asp Val Val Leu Met Ile Ser Tyr Gly Gly Glu Ser Leu Glu Leu
100 105 110

TTG AAT CTG GTG AGC CAT TTA AAA CGC TTG AGC CAT AAA ATC ATC ACT 384
Leu Asn Leu Val Ser His Leu Lys Arg Leu Ser His Lys Ile Ile Thr
115 120 125

TTC ACT AAA AGC CCT AAT AGC TCG CTC TCT AAA CTC GGC GAT TAT TAT 432
Phe Thr Lys Ser Pro Asn Ser Ser Leu Ser Lys Leu Gly Asp Tyr Tyr
130 135 140

TTG AGC TTG AAA ATT CAA AAA GAA GCT TGC CCG ATT AAC ACC GCT CCA 480
Leu Ser Leu Lys Ile Gln Lys Glu Ala Cys Pro Ile Asn Thr Ala Pro
145 150 155 160

ACG ACT TCT ACC ACC CTA ACT CTA GCG TTA GGC GAT GTT TTA ATG GCA 528
Thr Thr Ser Thr Thr Leu Thr Leu Ala Leu Gly Asp Val Leu Met Ala
165 170 175

TGC TTG ATG CGA GCG AAA AAC TTT AGC CAA GAA GAT TTT GCC TCC TTT 576
Cys Leu Met Arg Ala Lys Asn Phe Ser Gln Glu Asp Phe Ala Ser Phe
180 185 190

CAT CCG GGC GGG CTT TTA GGC AAA AAA CTT TTT GTC AAG GTT AAA GAT 624
His Pro Gly Gly Leu Leu Gly Lys Lys Leu Phe Val Lys Val Lys Asp
195 200 205

TTA CTG CAA ACC ACG AAC CTC CCC CTA ATC GCT CCT AGC ACA AGT TTT 672
Leu Leu Gln Thr Thr Asn Leu Pro Leu Ile Ala Pro Ser Thr Ser Phe
210 215 220

AAA GAC GCG CTC ATA GAA ATG AGT GAA AAA CGC TTA GGC AGC GCG ATT 720
Lys Asp Ala Leu Ile Glu Met Ser Glu Lys Arg Leu Gly Ser Ala Ile
225 230 235 240
Tyr Thr Val Cys Val Ser Cys Gln Ile Gly Cys Gln Val Gly Cys Ser
115 120 125

Phe Cys Phe Thr Gln Lys Gly Gly Phe Val Arg Asn Leu Lys Ala Ser
130 135 140

Glu Ile Ile Gln Gln Ala Leu Leu Ile Lys Glu Asp Asn Asn Leu Pro
145 150 155 160

Leu Glu Lys Ala Leu Asn Ile Val Phe Met Gly Met Gly Glu Pro Leu
165 170 175

Asn Asn Leu Asp Glu Val Cys Lys Ala Ile Glu Ile Phe Asn Thr Gly
180 185 190

Met Gln Ile Ser Pro Lys Arg Ile Thr Ile Ser Thr Ser Gly Val Ala
195 200 205

Asp Lys Ile Pro Ile Leu Ala Gly Lys Asn Leu Gly Val Gln Leu Ala
210 215 220

Ile Ser Leu His Ala Val Asp Asp Lys Thr Arg Ser Ser Leu Met Pro
225 230 235 240

Leu Asn Lys Lys Tyr Asn Ile Glu Cys Val Leu Asn Glu Val Arg Lys
245 250 255

Trp Pro Leu Glu Gln Arg Lys Arg Val Met Phe Glu Tyr Leu Leu Ile
260 265 270

Lys Asp Leu Asn Asp Ser Leu Asp Cys Ala Lys Lys Leu Leu Lys Leu
275 280 285

Leu Asn Gly Ile Lys Ser Lys Val Asn Leu Ile Leu Phe Asn Pro His
290 295 300

Glu Gly Ser Lys Phe Glu Arg Pro Ser Leu Glu Asn Ala Arg Met Phe
305 310 315 320

Ala Asp Phe Leu Asn Ser Lys Gly Leu Leu Cys Thr Ile Arg Glu Ser
325 330 335

Lys Ala Leu Asp Ile Glu Ala Ala Cys Gly Gln Leu Arg Glu Lys Lys
340 345 350

Leu Ser Gln Gln Ile
355



(2) INFORMATION FOR SEQ ID NO: 1085:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...987 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1085: 

ATG CCC ATT CTT TTT GAT TGT AAC GCT ATT GCT TCA CAA GTT TTA AAA 48

Met Pro Ile Leu Phe Asp Cys Asn Ala Ile Ala Ser Gln Val Leu Lys

1 5 10 15
Leu Asn Glu Val Arg Lys Trp Pro Leu Glu Gln Arg Lys Arg Val Met
255 260 265

TTT GAA TAC CTT TTG ATC AAA GAT TTG AAC GAT AGC CTA GAC TGC GCT 869
Phe Glu Tyr Leu Leu Ile Lys Asp Leu Asn Asp Ser Leu Asp Cys Ala
270 275 280

AAA AAA CTT TTA AAA CTT TTA AAC GGC ATT AAA TCC AAA GTG AAT TTG 917
Lys Lys Leu Leu Lys Leu Leu Asn Gly Ile Lys Ser Lys Val Asn Leu
285 290 295

ATC TTA TTC AAC CCG CAT GAA GGC TCT AAG TTT GAA CGC CCT AGC TTA 965
Ile Leu Phe Asn Pro His Glu Gly Ser Lys Phe Glu Arg Pro Ser Leu
300 305 310

GAG AAC GCT AGA ATG TTT GCG GAT TTT TTA AAC TCT AAA GGC TTA TTA 1013
Glu Asn Ala Arg Met Phe Ala Asp Phe Leu Asn Ser Lys Gly Leu Leu
315 320 325 330

TGC ACC ATT AGA GAG TCT AAA GCC TTG GAT ATT GAA GCG GCT TGC GGG 1061
Cys Thr Ile Arg Glu Ser Lys Ala Leu Asp Ile Glu Ala Ala Cys Gly
335 340 345

CAG TTG AGG GAG AAA AAA CTC TCT CAG CAA ATT TGAAAACTTT TTTGTGGTGT 1114
Gln Leu Arg Glu Lys Lys Leu Ser Gln Gln Ile
350 355

TTGTCTTTTT TCTAATGGGG GGTGTTGG 1142

(2) INFORMATION FOR SEQ ID NO: 1084:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 357 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084:

Met Lys Ala Ser Ile Tyr Asp Phe Thr Leu Lys Glu Leu Ser Gln Leu

1 5 10 15

Leu Lys Pro Ser Phe Arg Ala Lys Gln Leu Tyr Leu Trp Leu Tyr Ala

20 25 30

Lys Tyr Lys Thr Ser Phe Lys Asp Met Gln Asn Asn Phe Ser Lys Asp

35 40 45

Phe Ile Ala Tyr Leu Glu Arg Glu Phe Ala Leu Arg Thr Ile Glu Ile

50 55 60

Thr His Val Arg Glu Ser Val Asp Gly Ser Lys Lys Tyr Leu Phe Lys
65 70 75 80

Ser Leu Arg Asp Asn His Thr Phe Glu Ala Val Leu Leu Lys Met Lys

85 90 95

Asp Lys Lys Ile Asp Ala Glu Thr Asn Ala Ile Leu Glu Arg Glu Lys
100 105 110 
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AAT AAT TTT 
Asn Asn Phe 
45 

TTG CGC ACG 
Leu Arg Thr 
60 

AAA AAA TAC 
Lys Lys Tyr 
75 

GTG TTG TTG 
Val Leu Leu 



ATT TTA GAG 
lie Leu Glu 



TGT CAA GTG 
Cys Gin Val 
125 

AGG AAC TTA 
Arg Asn Leu 
140 

GAA GAC AAC 
Glu Asp Asn 
155 

GGA ATG GGC 
Gly Met Gly 



GAG ATT TTC 
Glu lie Phe 



TCC ACT AGC 
Ser Thr Ser 
205 

TTA GGC GTG 
Leu Gly Val 
220 

CGC TCG TCT 
Arg Ser Ser 
235 

TTG AAT GAA 



30 

TCA A7^ GAT 
Ser Lys Asp 



ATA GAA ATC 
lie Glu lie 



CTT TTT AAA 
Leu Phe Lys 
80 

AAA ATG AAG 
Lys Met Lys 
95 

AGG GAA AAA 
Arg Glu Lys 
110 

GGT TGC TCG 
Gly Cys Ser 



AAA GCG AGC 
Lys Ala Ser 

AAC CTC CCC 
Asn Leu Pro 
160 

GAG CCT TTA 
Glu Pro Leu 
175 

AAT ACC GGC 
Asn Thr Gly 
190 

GGC GTA GCC 
Gly Val Ala 



CAA TTA GCC 
Gin Leu Ala 



TTA ATG CCC 
Leu Met Pro 
240 

GTG AGG AAA 



35 

TTT ATC GCT 
Phe lie Ala 
50 

ACG CAT GTG 
Thr His Val 
65 

TCT TTA AGA 
Ser Leu Arg 

GAT AAA AAG 
Asp Lys Lys 

TAC ACC GTA 
Tyr Thr Val 
115 

TTT TGT TTC 
Phe Cys Phe 
130 

GAG ATC ATC 
Glu lie lie 
145 

CTT GAA AAA 
Leu Glu Lys 

AAC AAT TTA 
Asn Asn Leu 



ATG CAA ATT 
Met Gin lie 
195 

GAT AAA ATC 
Asp Lys lie 
210 

ATA TCC TTA 
lie Ser Leu 
225 

TTG AAT AAA 
Leu Asn Lys 



TGG CCT TTA 



TAT TTG GAG 
Tyr Leu Glu 



AGG GAG AGC 
Arg Glu Ser 
70 

GAC AAC CAC 
Asp Asn His 
85 

ATT GAT GCA 
lie Asp Ala 
100 

TGC GTG TCT 
Cys Val Ser 



ACT CAA AAA 
Thr Gin Lys 



CAA CAA GCC 
Gin Gin Ala 
150 

GCG CTC AAC 
Ala Leu Asn 
165 

GAT GAG GTG 
Asp Glu Val 
180 

TCG CCT AAA 
Ser Pro Lys 



CCT ATT TTA 
Pro lie Leu 



CAC GCC GTA 
His Ala Val 
230 

AAA TAC AAT 
Lys Tyr Asn 
245 

GAG CAG CGC 



40 

CGA GAA TTT 
Arg Glu Phe 
55 

GTT GAT GGC 
Val Asp Gly 



ACT TTT GAA 
Thr Phe Glu 



GAA ACG AAC 
Glu Thr Asn 
105 

TGT CAA ATC 
Cys Gin lie 
120 

GGC GGT TTT 
Gly Gly Phe 
135 

CTA CTC ATT 
Leu Leu lie 



ATT GTT TTT 
lie Val Phe 



TGT AAA GCG 
Cys Lys Ala 
185 

AGA ATC ACC 
Arg lie Thr 
200 

GCG GGC AAA 
Ala Gly Lys 
215 

GAT GAC AAA 
Asp Asp Lys 

ATT GAA TGC 
lie Glu Cys 



AAA AGA GTG 



GCT 197 
Ala 



TCT 24 5 

Ser 



GCG 2 93 

Ala 

90 

GCT 341 
Ala 



GGC 389
Gly



GTA 437 
Val 



AAA 485 
Lys 



ATG 533
Met

170

ATT 581
Ile



ATT 629
Ile



AAC 677 
Asn 



ACG 725 
Thr 



GTT 773 

Val 

250 

ATG 821

TTG CTT TAAACTTGCT ATGGACGATT AGAAATCG 232

Leu Leu 

55 



(2) INFORMATION FOR SEQ ID NO: 1082:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082: 



Met Ser Leu Gly Ala Val Ile Lys Leu Ile Phe Cys Tyr Lys Leu Glu
1 5 10 15

Gly Val Ile Leu Asp Leu Lys Arg Ile Asn Phe Lys Ser Tyr Tyr Pro
20 25 30

Asn Asn Lys Asn Ala Leu Phe Ile Asn Asn Lys Lys Ile His Tyr Leu
35 40 45

Val Pro Gln Arg Phe Ile Leu Leu
50 55



(2) INFORMATION FOR SEQ ID NO: 1083:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE : 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 24... 1094 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1083: 



CTTTTAGAAT TAGGGCTTAA AGC ATG AAA GCT AGT ATT TAT GAT TTC ACT CTA 53

Met Lys Ala Ser Ile Tyr Asp Phe Thr Leu
1 5 10

AAG GAA TTG AGC CAG CTT TTA AAA CCA AGC TTT AGG GCT AAA CAG CTT 101
Lys Glu Leu Ser Gln Leu Leu Lys Pro Ser Phe Arg Ala Lys Gln Leu
15 20 25

TAT TTG TGG CTC TAT GCG AAG TAT AAA ACA AGC TTT AAG GAC ATG CAA 149
Tyr Leu Trp Leu Tyr Ala Lys Tyr Lys Thr Ser Phe Lys Asp Met Gln

580

Phe Pro Phe Val 
595 

Trp Arg Lys Thr 
610 

Tyr Ser Gly Ile
625

Tyr Tyr Ser Gly

Gln Cys Glu Ala
660 

Tyr Trp Val Trp 
675 

Arg His Arg Val 
690 

Lys Tyr Tyr Phe 
705 

Ala Pro Gly Arg 



Ser Pro Phe Gln
600

Thr Ile Gly Ile
615 

Ser Asn Ser Ala 
630 

Gly Asn Asn Tyr 
645 

Trp Cys Met Thr 

Asn Ile Gln Val
680

Thr Gly Ser Leu
695

Thr Gly Ile Gly
710 

Ser Val Thr Ala 
725 



585 

Phe Ile Leu Asp

Ser Ser Tyr Phe 
620 

Ala Gly Gly Tyr 
635 

Glu Ser Val Leu 
650 

Gln His Glu Gly
665

Ser Gln Ile Phe

Gln Ile Asn Asn
700 

Ser Ser Pro Ala 
715 

Tyr Leu Asn Tyr 
730 



590 

Ala Arg Tyr Asn 
605 

Tyr Ser Arg Ala 

Tyr Gly Met Gln
640 

Asn Ser Gly Tyr 
655 

Leu Leu Pro Trp 
670 

Trp Glu Asn Gly 
685 

Ile Phe Asn Met

Gly Leu Gln Pro
720 

Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 1081: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37... 204 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081: 



CATTGGAGAT TGTGCCATTC TTTGATTTTA TCTAAA ATG TCT TTA GGG GCA GTG 54 

Met Ser Leu Gly Ala Val 
1 5 



ATT AAG CTT ATT TTT TGT TAT AAA TTA GAG GGG GTA ATA TTA GAT TTA 102 
Ile Lys Leu Ile Phe Cys Tyr Lys Leu Glu Gly Val Ile Leu Asp Leu
10 15 20 



AAG CGC ATC AAT TTC AAA TCC TAT TAT CCC AAT AAT AAA AAT GCA TTA 150

Lys Arg Ile Asn Phe Lys Ser Tyr Tyr Pro Asn Asn Lys Asn Ala Leu
25 30 35 

TTT ATC AAC AAT AAA AAA ATC CAT TAT CTA GTG CCT CAA AGG TTC ATA 198
Phe Ile Asn Asn Lys Lys Ile His Tyr Leu Val Pro Gln Arg Phe Ile
40 45 50

Asn Trp Val Arg
145 

Ser Asn Tyr Trp 

Ile Lys Ala Tyr
180 

Ser Leu Ser Glu 
195 

Pro Leu Asn Gln
210 

Glu Asn Arg Phe 
225 

Thr Tyr Tyr Gly 

Tyr Asn Ser Ala 
260 

Ala Ala Gly Leu 
275 

Thr Asn Tyr Asn 
290 

Asn Asn Ala Phe 
305 

Val Lys Gln Thr

Leu Gln Arg Gln
340 

Gly Glu Gly Ala 
355 

Trp Lys Pro His 
370 

Arg Asn Asn Tyr 
385 

Asp Gly Arg Phe 

Tyr Asn Asn Glu 
420 

Leu Arg Lys Ile
435

Gly Phe Ile Pro
450

Tyr Gln Arg Ser
465 

Gly Ala Glu Tyr 

Arg Tyr Thr Tyr 
500 

Ile Trp Ala Arg
515 

Gly Pro Met Lys 
530 

Val Glu Leu Glu 
545 

Ala Ala Phe Asn 

Thr Asp Leu Asn 



Gly Gln Gly Phe
150 

Leu Asp Gly Val 
165 

Tyr Gln Tyr Tyr

Gln Asp Tyr Lys
200 

Lys Gly Gly Arg 
215 

Gly Asp Leu Asp 
230 

Gln Leu Met Thr
245 

Asn Met Val Thr 

Pro Ala Gly Tyr 
280 

Gly Trp Ala Glu 
295 

Glu Pro Lys Val 
310 

Phe Ile Met Gly
325 

Tyr Leu Asn Thr 

Gly Phe Leu Cys 
360 

Ile Lys His Gly
375 

Thr Ala Val Tyr 
390 

Phe Ile Val Pro
405 

Asn Ala Ser Asn 

Lys His Met Asn 
440 

Val Gln Gly Asp
455 

Phe Val Pro Pro 
470 

Phe Thr Gln His
485 

Lys Asp Lys Phe 

Asp Phe Ala Thr 
520 

Gly Asn Val Arg 
535 

Leu Tyr Tyr Arg 
550 

Tyr Ile Asp Thr
565 

Gly Asp Val Leu 



Arg Asp Asn Ser 
155 

Tyr Asp Ile Asn
170

Asp Phe Ala Ile
185

Ile Asn Arg Phe

Ser Gln Arg Phe
220 

Lys Val Gly Gly 
235 

Arg Asp Phe Gln
250 

Cys Phe Ser Glu 
265 

Asn Leu Ala Val 

Val Glu Asn Pro 
300 

Asn Leu Ile Val

315 

Leu Arg Phe Met 
330 

Asn Glu Cys Ala 
345 

Glu Gly Ala Asn 

Val Tyr Arg Asn 
380 

Leu Ser Asp Arg 
395 

Gly Leu Arg Tyr 
410 

Trp Met Gln Ile
425 

Asn Trp Met Pro 

His Asn Val Leu 
460 

Gln Leu Asp Val
475 

Phe Asp Thr Val 
490 

Ser Phe Asn Ala 
505 

Gly Gln Tyr Ser

Pro Ile Asn Gly
540

Pro Ile Arg Gly
555 

Arg Val Thr Ser 
570 

Lys Gly Thr Ser 



Pro Ser Asn Ile
160 

Glu Asn Asn Gly 
175 

Ala Gln Pro Gly
190 

Ala Asn Leu Arg 
205 

Gly Ala Val Tyr 

Thr Phe Ser Phe 
240 

Val Ser Ser Ser 
255 

Ala Ala Cys Arg 
270 

Pro Tyr Tyr Ala 
285 

Val Arg Ser Ile

Asn Thr Gly Lys 
320 

Thr Thr Thr Phe 
335 

Thr Lys Thr Ser 
350 

Val Met Ser Gly 
365 

Trp Asn Asn Trp 

Ile Glu Ala Trp
400

Ala Phe Val Gln
415 

Pro Glu Lys Asp 
430 

Ser Thr Asn Ile
445 

Thr Tyr Phe Asn 

Leu Ser Tyr Gly 
480 

Glu Ala Gly Ala 
495 

Asp Tyr Phe Arg 
510 

Val Tyr Thr Ser 
525 

Tyr Ser Gln Gly

Leu Gln Phe His
560 

His Gly Pro Leu 
575 

Tyr Asn Lys His

GTT CTT AAT AGC GGT TAT CAA TGC GAA GCT TGG TGT ATG ACC CAA CAT 2079

Val Leu Asn Ser Gly Tyr Gln Cys Glu Ala Trp Cys Met Thr Gln His
655 660 665

GAA GGG CTC TTG CCT TGG TAT TGG GTG TGG AAT ATC CAA GTG AGC CAA 2127

Glu Gly Leu Leu Pro Trp Tyr Trp Val Trp Asn Ile Gln Val Ser Gln
670 675 680

ATT TTC TGG GAA AAC GGG AGA CAC AGA GTT ACA GGA AGC TTG CAA ATC 2175

Ile Phe Trp Glu Asn Gly Arg His Arg Val Thr Gly Ser Leu Gln Ile
685 690 695

AAT AAT ATC TTC AAC ATG AAG TAT TAT TTT ACA GGG ATT GGC TCT AGC 2223

Asn Asn Ile Phe Asn Met Lys Tyr Tyr Phe Thr Gly Ile Gly Ser Ser
700 705 710

CCT GCA GGC TTG CAA CCT GCG CCT GGA AGA TCG GTT ACA GCG TAT TTG 2271

Pro Ala Gly Leu Gln Pro Ala Pro Gly Arg Ser Val Thr Ala Tyr Leu
715 720 725 730

AAC TAC ACT TTC TAAAGGCTTT AAAAAGGAGG GGGTTATTGC GCGATGATGA GCCG 2327

Asn Tyr Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 1080: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080:



Met Pro Thr Ile Gln Ile Arg Gly Phe Gly Ala Gly Gly Ser Gly His
1 5 10 15

Ser Asp Ala Thr Leu Met Leu Val Asn Gly Ile Pro Val Tyr Met Ala
20 25 30

Pro Tyr Ala His Ile Glu Leu Asp Ile Phe Pro Val Thr Phe Gln Ala
35 40 45

Ile Asp Arg Ile Asp Val Ile Lys Gly Gly Gly Ser Val Gln Tyr Gly
50 55 60

Pro Asn Thr Tyr Gly Gly Ile Val Asn Ile Ile Thr Lys Pro Ile Pro
65 70 75 80

Asn Gln Trp Glu Asn Gln Ala Ala Glu Arg Ile Thr Tyr Trp Ala Lys
85 90 95

Ala Arg Asn Ala Gly Phe Ala Ala Pro Pro Asp Lys Thr Gly Asp Pro
100 105 110

Ser Phe Ile Lys Ser Leu Gly Asn Asn Leu Leu Tyr Asn Thr Tyr Val
115 120 125

Arg Ser Gly Gly Met Ile Asn Lys His Val Gly Ile Gln Ala Gln Ala
130 135 140

CAA ATC CCT GAG AAG GAT TTA AGA AAA ATC AAG CAC ATG AAC AAT TGG 1407

Gln Ile Pro Glu Lys Asp Leu Arg Lys Ile Lys His Met Asn Asn Trp
430 435 440

ATG CCC TCA ACC AAC ATT GGC TTT ATC CCC GTG CAA GGC GAT CAC AAT 1455

Met Pro Ser Thr Asn Ile Gly Phe Ile Pro Val Gln Gly Asp His Asn
445 450 455

GTG CTT ACC TAC TTT AAC TAC CAA CGC TCT TTC GTC CCG CCT CAA TTA 1503

Val Leu Thr Tyr Phe Asn Tyr Gln Arg Ser Phe Val Pro Pro Gln Leu
460 465 470

GAC GTT TTG AGC TAT GGA GGA GCG GAG TAT TTT ACC CAG CAC TTT GAC 1551

Asp Val Leu Ser Tyr Gly Gly Ala Glu Tyr Phe Thr Gln His Phe Asp
475 480 485 490 

ACG GTG GAA GCA GGA GCG CGC TAC ACC TAT AAG GAT AAA TTC AGC TTC 1599

Thr Val Glu Ala Gly Ala Arg Tyr Thr Tyr Lys Asp Lys Phe Ser Phe
495 500 505

AAT GCG GAC TAC TTC AGG ATT TGG GCG CGC GAT TTT GCC ACC GGG CAG 1647

Asn Ala Asp Tyr Phe Arg Ile Trp Ala Arg Asp Phe Ala Thr Gly Gln
510 515 520

TAT TCA GTC TAT ACA AGC GGT CCC ATG AAG GGT AAT GTG CGC CCC ATT 1695

Tyr Ser Val Tyr Thr Ser Gly Pro Met Lys Gly Asn Val Arg Pro Ile
525 530 535 

AAT GGC TAT TCT CAA GGC GTG GAG CTG GAA TTG TAT TAC AGG CCT ATT 1743

Asn Gly Tyr Ser Gln Gly Val Glu Leu Glu Leu Tyr Tyr Arg Pro Ile
540 545 550

AGA GGG TTG CAA TTC CAT GCC GCT TTC AAC TAC ATT GAC ACT CGT GTA 1791

Arg Gly Leu Gln Phe His Ala Ala Phe Asn Tyr Ile Asp Thr Arg Val
555 560 565 570

ACC AGC CAT GGC CCT TTA ACC GAC TTG AAC GGG GAT GTG CTA AAA GGG 1839

Thr Ser His Gly Pro Leu Thr Asp Leu Asn Gly Asp Val Leu Lys Gly
575 580 585

ACT AGC TAT AAC AAG CAT TTC CCT TTT GTA AGC CCT TTC CAA TTC ATT 1887

Thr Ser Tyr Asn Lys His Phe Pro Phe Val Ser Pro Phe Gln Phe Ile
590 595 600 

CTT GAC GCT CGT TAC AAT TGG CGT AAA ACC ACC ATC GGT ATT TCT AGC 1935

Leu Asp Ala Arg Tyr Asn Trp Arg Lys Thr Thr Ile Gly Ile Ser Ser
605 610 615

TAT TTT TAC AGC CGT GCT TAT AGC GGG ATT AGC AAC AGT GCA GCA GGA 1983

Tyr Phe Tyr Ser Arg Ala Tyr Ser Gly Ile Ser Asn Ser Ala Ala Gly
620 625 630

GGC TAT TAT GGG ATG CAA TAT TAT AGT GGG GGG AAC AAC TAT GAA AGC 2031

Gly Tyr Tyr Gly Met Gln Tyr Tyr Ser Gly Gly Asn Asn Tyr Glu Ser
635 640 645 650

CGC TTC GCT AAT TTG CGC CCC TTA AAC CAA AAA GGC GGG CGT TCA CAA 735

Arg Phe Ala Asn Leu Arg Pro Leu Asn Gln Lys Gly Gly Arg Ser Gln
205 210 215

CGC TTT GGG GCT GTG TAT GAA AAC CGC TTC GGG GAT TTA GAC AAA GTG 783

Arg Phe Gly Ala Val Tyr Glu Asn Arg Phe Gly Asp Leu Asp Lys Val
220 225 230

GGC GGG ACT TTT AGC TTC ACT TAC TAT GGG CAG TTG ATG ACT AGG GAT 831
Gly Gly Thr Phe Ser Phe Thr Tyr Tyr Gly Gln Leu Met Thr Arg Asp
235 240 245 250

TTT CAA GTG AGC TCT AGC TAC AAT AGC GCT AAC ATG GTT ACT TGT TTT 879

Phe Gln Val Ser Ser Ser Tyr Asn Ser Ala Asn Met Val Thr Cys Phe
255 260 265

AGC GAA GCG GCA TGC AGG GCG GCA GGA CTT CCG GCA GGG TAT AAC TTG 927

Ser Glu Ala Ala Cys Arg Ala Ala Gly Leu Pro Ala Gly Tyr Asn Leu 
270 275 280 

GCT GTG CCT TAT TAT GCC ACT AAC TAC AAT GGC TGG GCA GAA GTA GAA 975

Ala Val Pro Tyr Tyr Ala Thr Asn Tyr Asn Gly Trp Ala Glu Val Glu
285 290 295

AAC CCT GTG CGC TCC ATT AAC AAC GCT TTT GAG CCT AAA GTG AAT TTG 1023

Asn Pro Val Arg Ser Ile Asn Asn Ala Phe Glu Pro Lys Val Asn Leu
300 305 310

ATC GTC AAT ACC GGG AAA GTC AAG CAA ACC TTT ATC ATG GGC TTG CGT 1071
Ile Val Asn Thr Gly Lys Val Lys Gln Thr Phe Ile Met Gly Leu Arg
315 320 325 330

TTC ATG ACC ACC ACT TTT TTA CAG CGC CAA TAC TTA AAC ACC AAT GAA 1119
Phe Met Thr Thr Thr Phe Leu Gln Arg Gln Tyr Leu Asn Thr Asn Glu
335 340 345 

TGC GCC ACC AAA ACG AGC GGT GAG GGG GCA GGA TTC TTG TGT GAG GGC 1167 
Cys Ala Thr Lys Thr Ser Gly Glu Gly Ala Gly Phe Leu Cys Glu Gly 
350 355 360 

GCT AAT GTG ATG AGC GGT TGG AAA CCT CAC ATC AAG CAT GGC GTT TAT 1215 
Ala Asn Val Met Ser Gly Trp Lys Pro His Ile Lys His Gly Val Tyr
365 370 375

AGA AAC TGG AAT AAC TGG CGT AAC AAT TAC ACA GCG GTT TAT TTG AGC 1263

Arg Asn Trp Asn Asn Trp Arg Asn Asn Tyr Thr Ala Val Tyr Leu Ser
380 385 390

GAT CGC ATT GAA GCT TGG GAT GGG CGC TTT TTC ATC GTG CCT GGT TTG 1311
Asp Arg Ile Glu Ala Trp Asp Gly Arg Phe Phe Ile Val Pro Gly Leu
395 400 405 410

CGC TAC GCT TTT GTG CAA TAC AAC AAC GAA AAT GCG TCT AAC TGG ATG 1359

Arg Tyr Ala Phe Val Gln Tyr Asn Asn Glu Asn Ala Ser Asn Trp Met
415 420 425

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079:

CAACAAGCTA ACCAAAGCAT TGAAGAAGCT TTACAGAATG TCCCGGGTCT GCAAATTAGG 60

AATGCGACAG GTGTAGGGGC T ATG CCT ACT ATC CAA ATC CGT GGC TTT GGA 111

Met Pro Thr Ile Gln Ile Arg Gly Phe Gly
1 5 10

GCT GGG GGT TCA GGG CAT AGC GAT GCG ACG CTG ATG TTA GTC AAT GGT 159

Ala Gly Gly Ser Gly His Ser Asp Ala Thr Leu Met Leu Val Asn Gly
15 20 25

ATT CCT GTT TAT ATG GCC CCC TAC GCT CAC ATT GAG CTA GAC ATT TTC 207

Ile Pro Val Tyr Met Ala Pro Tyr Ala His Ile Glu Leu Asp Ile Phe
30 35 40

CCC GTT ACC TTT CAA GCC ATT GAT CGC ATT GAT GTG ATC AAG GGT GGA 255
Pro Val Thr Phe Gln Ala Ile Asp Arg Ile Asp Val Ile Lys Gly Gly
45 50 55

GGC AGC GTG CAA TAC GGG CCT AAC ACT TAT GGG GGT ATT GTC AAT ATC 303

Gly Ser Val Gln Tyr Gly Pro Asn Thr Tyr Gly Gly Ile Val Asn Ile
60 65 70 

ATC ACT AAG CCT ATC CCT AAT CAA TGG GAA AAC CAA GCG GCT GAA AGG 351

Ile Thr Lys Pro Ile Pro Asn Gln Trp Glu Asn Gln Ala Ala Glu Arg
75 80 85 90

ATC ACT TAT TGG GCT AAG GCT AGA AAC GCT GGG TTT GCC GCT CCT CCT 399

Ile Thr Tyr Trp Ala Lys Ala Arg Asn Ala Gly Phe Ala Ala Pro Pro
95 100 105

GAT AAA ACC GGC GAT CCT TCT TTC ATC AAG TCT TTA GGC AAC AAC CTC 447

Asp Lys Thr Gly Asp Pro Ser Phe Ile Lys Ser Leu Gly Asn Asn Leu
110 115 120

CTC TAT AAC ACT TAT GTG AGG AGT GGA GGG ATG ATC AAT AAG CAT GTG 495

Leu Tyr Asn Thr Tyr Val Arg Ser Gly Gly Met Ile Asn Lys His Val
125 130 135 

GGT ATC CAA GCG CAA GCT AAC TGG GTT AGA GGA CAA GGC TTT AGG GAC 543

Gly Ile Gln Ala Gln Ala Asn Trp Val Arg Gly Gln Gly Phe Arg Asp
140 145 150

AAT AGC CCC TCT AAC ATT TCA AAC TAT TGG CTA GAT GGA GTC TAT GAC 591

Asn Ser Pro Ser Asn Ile Ser Asn Tyr Trp Leu Asp Gly Val Tyr Asp
155 160 165 170

ATC AAT GAA AAC AAT GGG ATT AAA GCC TAT TAC CAA TAC TAC GAT TTT 639

Ile Asn Glu Asn Asn Gly Ile Lys Ala Tyr Tyr Gln Tyr Tyr Asp Phe
175 180 185

GCT ATC GCT CAA CCA GGA TCA CTC AGC GAG CAA GAT TAC AAA ATA AAC 687

Ala Ile Ala Gln Pro Gly Ser Leu Ser Glu Gln Asp Tyr Lys Ile Asn
190 195 200

155 160 165

AAAAATTCAA ACTCTAAGGA TCTATCTTTT CGTTAG 572
(2) INFORMATION FOR SEQ ID NO: 1078:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078: 

Met Ser Glu Pro Leu Glu Thr Leu Asp Lys Asp Lys Gln Ala Met Ser
1 5 10 15

Glu Ala Ile Lys Lys Asp Ile Glu Lys Asp Lys Glu Asn Leu Ala Arg
20 25 30

Val Lys Ala Asp Lys Lys Val Lys Ala Asp Glu Ser Glu Lys Gly Tyr
35 40 45

Glu Lys Asp Asp Asp Lys Lys Ala Glu Asn Leu Asp Lys Glu Ile Ala
50 55 60

Lys Asp Lys Ala Ser Pro Asn Asp Asn Glu Leu Tyr Glu Glu Asp Asp
65 70 75 80

Arg Val Lys Arg Asp Lys Glu Arg Asp Asp Ala Leu Arg Asp Lys Glu
85 90 95

Lys Ala Lys Asp Asp Ala Cys Met Val Arg Ala Asp Asp Asp Thr Ile
100 105 110

Glu Asp Asp Glu Glu Tyr Gly Asp Asp Asp Lys Leu Arg Asp Glu Ile
115 120 125

Leu Gly Val Met Glu Glu Leu Cys Asp Thr Leu Asn Asp Asn Leu Asn
130 135 140

Phe Lys Lys Val Val Cys Met Gly Gly Lys Val Ser Ile Ala Phe Lys
145 150 155 160

Phe Leu Ile Phe Cys Ser
165

(2) INFORMATION FOR SEQ ID NO: 1079: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2327 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 82... 2283
(D) OTHER INFORMATION:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 27... 524 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077: 

AAAAGGCTTT TTAAAAGGAC ACACCA ATG AGC GAA CCA TTA GAA ACA TTA GAC 53

Met Ser Glu Pro Leu Glu Thr Leu Asp
1 5

AAG GAT AAA CAA GCT ATG AGT GAA GCA ATT AAA AAA GAT ATT GAA AAA 101
Lys Asp Lys Gln Ala Met Ser Glu Ala Ile Lys Lys Asp Ile Glu Lys
10 15 20 25

GAC AAA GAA AAC CTC GCA CGA GTC AAA GCA GAC AAA AAA GTC AAA GCC 149

Asp Lys Glu Asn Leu Ala Arg Val Lys Ala Asp Lys Lys Val Lys Ala
30 35 40

GAT GAA AGT GAA AAA GGC TAC GAA AAA GAC GAT GAC AAA AAA GCC GAG 197
Asp Glu Ser Glu Lys Gly Tyr Glu Lys Asp Asp Asp Lys Lys Ala Glu
45 50 55 

AAT CTT GAC AAA GAA ATC GCT AAA GAC AAA GCT AGC CCT AAC GAT AAT 245

Asn Leu Asp Lys Glu Ile Ala Lys Asp Lys Ala Ser Pro Asn Asp Asn
60 65 70

GAG CTT TAT GAA GAG GAC GAT AGA GTT AAA CGA GAC AAA GAA AGA GAC 293

Glu Leu Tyr Glu Glu Asp Asp Arg Val Lys Arg Asp Lys Glu Arg Asp
75 80 85

GAT GCC TTG CGT GAT AAA GAA AAA GCC AAA GAT GAC GCA TGC ATG GTA 341
Asp Ala Leu Arg Asp Lys Glu Lys Ala Lys Asp Asp Ala Cys Met Val
90 95 100 105

AGA GCG GAC GAT GAC ACC ATA GAG GAC GAT GAG GAA TAT GGT GAT GAT 389

Arg Ala Asp Asp Asp Thr Ile Glu Asp Asp Glu Glu Tyr Gly Asp Asp
110 115 120

GAT AAG TTA AGA GAC GAA ATA CTC GGT GTT ATG GAG GAG TTA TGC GAT 437
Asp Lys Leu Arg Asp Glu Ile Leu Gly Val Met Glu Glu Leu Cys Asp
125 130 135

ACC CTT AAT GAT AAC CTT AAC TTC AAA AAA GTC GTC TGT ATG GGC GGT 485

Thr Leu Asn Asp Asn Leu Asn Phe Lys Lys Val Val Cys Met Gly Gly
140 145 150

AAG GTT TCA ATT GCG TTC AAA TTT CTA ATT TTT TGC TCT TAATCTTTTA GA 536
Lys Val Ser Ile Ala Phe Lys Phe Leu Ile Phe Cys Ser

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075:

AGCTCACATT TTAGAAAAAT ATTTAAAGGG AGA ATG ATG CAA AAT AGC GTT AAA 54

Met Met Gln Asn Ser Val Lys
1 5

AAA TTA GAA TAT GAA GAG CGT TTC AAT GAC GCT CTT TTG AAA TTA CAA 102
Lys Leu Glu Tyr Glu Glu Arg Phe Asn Asp Ala Leu Leu Lys Leu Gln
10 15 20

GCA TGC CAA GAA GAA AAG CAG GTA ACG AGT TGT TTG AAA TGC GAG CAG 150

Ala Cys Gln Glu Glu Lys Gln Val Thr Ser Cys Leu Lys Cys Glu Gln
25 30 35

GTT TTG AAT TGC AAG ATC CGC AAC AGC TAT GTG GAT GCG GCT TAT GAG 198
Val Leu Asn Cys Lys Ile Arg Asn Ser Tyr Val Asp Ala Ala Tyr Glu
40 45 50 55

AGC ATG AGT TTA GGC GAA CGG GGC GGG TTT GAT TTC AAT TAAATGGGAT TA 249
Ser Met Ser Leu Gly Glu Arg Gly Gly Phe Asp Phe Asn
60 65 

AAATGGCTAG TAATACTACC TT 271 

(2) INFORMATION FOR SEQ ID NO:1076: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1076:

Met Met Gln Asn Ser Val Lys Lys Leu Glu Tyr Glu Glu Arg Phe Asn
1 5 10 15

Asp Ala Leu Leu Lys Leu Gln Ala Cys Gln Glu Glu Lys Gln Val Thr
20 25 30

Ser Cys Leu Lys Cys Glu Gln Val Leu Asn Cys Lys Ile Arg Asn Ser
35 40 45

Tyr Val Asp Ala Ala Tyr Glu Ser Met Ser Leu Gly Glu Arg Gly Gly
50 55 60

Phe Asp Phe Asn 
65 

(2) INFORMATION FOR SEQ ID NO:1077: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

CCC AGC AAC CAG CAA ACA ACC ATA AGA GAT TAT CTT AAA CCC GGA TCT 159

Pro Ser Asn Gln Gln Thr Thr Ile Arg Asp Tyr Leu Lys Pro Gly Ser
15 20 25

ATT GTT GTG TTT GCC ATA ATT GTA ATA ATA ATT TCA TCT CAT TTC TCC 207

Ile Val Val Phe Ala Ile Ile Val Ile Ile Ile Ser Ser His Phe Ser
30 35 40 45

AAC GCC TAT AAA ACC CTT ATC GCT TCT AAT AAA AAA CCA GTT TTA AGC 255
Asn Ala Tyr Lys Thr Leu Ile Ala Ser Asn Lys Lys Pro Val Leu Ser
50 55 60

CAT TTA GAA ATT TGATTTCTTA AACCTTTTTA TCAAAAATAC CGGTGTT 304

His Leu Glu Ile
65



(2) INFORMATION FOR SEQ ID NO:1074: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1074: 

Met Asp Tyr Arg Leu Phe His Met Asp Ser Met Asp Leu Pro Ser Asn
1 5 10 15

Gln Gln Thr Thr Ile Arg Asp Tyr Leu Lys Pro Gly Ser Ile Val Val
20 25 30

Phe Ala Ile Ile Val Ile Ile Ile Ser Ser His Phe Ser Asn Ala Tyr
35 40 45

Lys Thr Leu Ile Ala Ser Asn Lys Lys Pro Val Leu Ser His Leu Glu
50 55 60

Ile
65

(2) INFORMATION FOR SEQ ID NO: 1075: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34... 237 
(D) OTHER INFORMATION:

35

Glu Cys Val Gly 
50 

Ala Ile Pro Val
65

Ile Lys Lys Ser

Ile Ile Arg Asn
100 

His Pro Met Cys 
115 

Gly Leu Tyr Glu 
130 

Gly Thr Glu Gln
145

Ala Arg Leu Ile

Tyr Ile Ser His
180

Val Leu Lys Gln
195 

Gly Phe Arg Asp 
210 

Lys Asp Ile Phe
225 

Lys Cys Glu Lys 

Asp Tyr Glu Ser 
260 

Glu Phe Met 
275 



40 

Phe Glu Lys Ile
55

Glu Gly Ile Ile
70

Ala Thr Ile Ile
85

Ile Pro Lys Ser

Gly Thr Glu Phe 
120 

Asn Ala Leu Val 
135 

Val Glu Ile Ala
150 

Lys Met Lys Ser 
165 

Leu Pro His Val 

Asn Asp Pro Glu 
200 

Met Ser Arg Leu 

215 

Lys Gln Asn Arg
230

Glu Ile Val Gln
245 

Leu Ala Glu Trp 



Leu Glu Cys Asp 
60 

Gly Cys Leu Lys 
75 

Asp Leu Gly Gly 
90 

Ile Arg Lys Asn
105

Tyr Gly Pro Lys

Ile Leu Cys Asp
140

Lys Glu Ile Phe
155 

Asn Glu His Asp 
170 

Leu Ser Tyr Ala 
185 

Met Ile Leu Ser

Ser Lys Ser Ser 
220 

Asp Asn Val Leu 
235 

Ala Lys Ala Trp 
250 

Met Ala Gln Ala
265 



45 

Val Ile Phe Leu

Lys Met Thr Ser 
80 

Ala Lys Ala Gln
95

Phe Ile Ala Ala
110 

Ala Ser Val Lys 
125 

Leu Glu Asp Ser 

Leu Gly Val Lys 
160 

Thr His Val Ala 
175 

Leu Ala Asn Ser 
190 

Leu Ala Gly Gly 
205 

Pro Leu Met Trp 

Glu Ala Ile Lys
240

Ile Glu Asn Asn
255

Asn Lys Leu Gln
270 



(2) INFORMATION FOR SEQ ID NO: 1073: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 73... 267 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1073: 



AATAATTTAA AATTTTTTAT ATTGTTGTTT TTAGGGGTGC GAGGAGCGAA ATGGGGTATT 60 
TGGATTGTTT TT ATG GAT TAT AGG CTG TTT CAT ATG GAT AGC ATG GAT TTA 111 
Met Asp Tyr Arg Leu Phe His Met Asp Ser Met Asp Leu 
1 5 10

GTG ATA TTG TGC GAT TTA GAA GAT TCA GGG ACT GAG CAA GTA GAG ATC 486

Val Ile Leu Cys Asp Leu Glu Asp Ser Gly Thr Glu Gln Val Glu Ile
140 145 150 

GCT AAA GAA ATC TTT TTA GGC GTT AAA GCG CGC TTG ATT AAA ATG AAA 534
Ala Lys Glu Ile Phe Leu Gly Val Lys Ala Arg Leu Ile Lys Met Lys
155 160 165

TCC AAT GAG CAT GAC ACC CAT GTG GCT TAT ATC AGC CAT TTA CCC CAT 582
Ser Asn Glu His Asp Thr His Val Ala Tyr Ile Ser His Leu Pro His
170 175 180

GTT TTG AGC TAT GCG TTA GCC AAT AGC GTT TTA AAG CAA AAC GAC CCA 630

Val Leu Ser Tyr Ala Leu Ala Asn Ser Val Leu Lys Gln Asn Asp Pro
185 190 195

GAG ATG ATT TTA TCT TTA GCG GGT GGG GGT TTT AGG GAT ATG AGC CGT 678
Glu Met Ile Leu Ser Leu Ala Gly Gly Gly Phe Arg Asp Met Ser Arg
200 205 210 215

CTG TCC AAA AGC TCG CCT TTA ATG TGG AAA GAT ATT TTC AAA CAA AAC 726

Leu Ser Lys Ser Ser Pro Leu Met Trp Lys Asp Ile Phe Lys Gln Asn
220 225 230 

CGA GAC AAT GTC TTA GAA GCG ATT AAA AAA TGC GAA AAA GAA ATC GTG 774

Arg Asp Asn Val Leu Glu Ala Ile Lys Lys Cys Glu Lys Glu Ile Val
235 240 245

CAA GCT AAG GCG TGG ATA GAA AAT AAC GAT TAT GAA AGC CTT GCA GAA 822

Gln Ala Lys Ala Trp Ile Glu Asn Asn Asp Tyr Glu Ser Leu Ala Glu
250 255 260

TGG ATG GCG CAA GCG AAC AAA CTC CAG GAG TTC ATG TAAAGTAAAA TGATGT 874
Trp Met Ala Gln Ala Asn Lys Leu Gln Glu Phe Met
265 270 275 

AAAATAATTT AAAATTTTTT ATATTGTTGT TTTTAGG 911 

(2) INFORMATION FOR SEQ ID NO: 1072: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072:

Met Lys Ala Gly Ile Ile Gly Leu Gly Leu Met Gly Gly Ser Leu Gly
1 5 10 15

Leu Ala Leu Gln Glu Trp Gly Arg Phe Lys Ser Val Ile Gly Tyr Asp
20 25 30

His Asn Ala Leu His Ala Lys Leu Ala Leu Thr Leu Gly Leu Val Asp

(2) INFORMATION FOR SEQ ID NO: 1071:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34... 858 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1071: 

AGTGTTAGAA AAAACTTTGC TTTGAAATTT GGC ATG AAA GCA GGC ATT ATT GGT 54 

Met Lys Ala Gly Ile Ile Gly
1 5

TTA GGG CTT ATG GGG GGG AGT TTA GGG CTA GCC TTG CAA GAA TGG GGG 102
Leu Gly Leu Met Gly Gly Ser Leu Gly Leu Ala Leu Gln Glu Trp Gly
10 15 20

CGT TTT AAA AGC GTT ATA GGC TAT GAT CAT AAC GCT TTG CAT GCT AAA 150

Arg Phe Lys Ser Val Ile Gly Tyr Asp His Asn Ala Leu His Ala Lys
25 30 35

TTG GCT TTG ACT TTG GGG CTT GTA GAT GAA TGC GTG GGA TTT GAA AAG 198
Leu Ala Leu Thr Leu Gly Leu Val Asp Glu Cys Val Gly Phe Glu Lys
40 45 50 55

ATT TTA GAA TGC GAT GTG ATT TTT TTG GCC ATT CCG GTT GAG GGC ATC 246

Ile Leu Glu Cys Asp Val Ile Phe Leu Ala Ile Pro Val Glu Gly Ile
60 65 70 

ATT GGA TGT CTG AAA AAA ATG ACC TCT ATC AAA AAA AGC GCG ACC ATT 294

Ile Gly Cys Leu Lys Lys Met Thr Ser Ile Lys Lys Ser Ala Thr Ile
75 80 85

ATT GAT TTA GGG GGC GCT AAA GCG CAA ATC ATT CGC AAT ATC CCT AAA 342
Ile Asp Leu Gly Gly Ala Lys Ala Gln Ile Ile Arg Asn Ile Pro Lys
90 95 100

AGC ATT CGT AAG AAT TTC ATC GCT GCG CAC CCC ATG TGC GGG ACA GAG 390

Ser Ile Arg Lys Asn Phe Ile Ala Ala His Pro Met Cys Gly Thr Glu
105 110 115

TTT TAT GGC CCT AAA GCG AGC GTT AAG GGG CTG TAT GAA AAC GCT CTA 438

Phe Tyr Gly Pro Lys Ala Ser Val Lys Gly Leu Tyr Glu Asn Ala Leu
120 125 130 135

1 5 10 15

Asp Pro Lys Lys Arg Arg Ile Ile Phe Ala Ser Asn Leu Asn Lys Asp
20 25 30

Val Cys Val Cys Glu Tyr Cys Ile Asp Val Met His Gly Glu Leu His
35 40 45

Lys Tyr Asp Asn Ser Leu Leu Ala Leu Lys Arg Asp Arg Leu Arg Arg
50 55 60

Met Glu Ser Ser Ala Tyr Glu Glu Glu Phe Leu Leu Ser Tyr Ile Pro
65 70 75 80

Ala Pro Lys Glu Leu Lys Ala Val Leu Asp Asn Tyr Val Ile Gly Gln
85 90 95

Glu Gln Ala Lys Lys Val Phe Ser Val Ala Val Tyr Asn His Tyr Lys
100 105 110

Arg Leu Ser Phe Lys Glu Lys Leu Lys Lys Gln Asp Asn Gln Asp Ser
115 120 125

Asn Val Glu Leu Glu His Leu Glu Glu Val Glu Leu Ser Lys Ser Asn
130 135 140

Ile Leu Leu Ile Gly Pro Thr Gly Ser Gly Lys Thr Leu Met Ala Gln
145 150 155 160

Thr Leu Ala Lys His Leu Asp Ile Pro Ile Ala Ile Ser Asp Ala Thr
165 170 175

Ser Leu Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn Ile Leu
180 185 190

Thr Arg Leu Leu Gln Ala Ser Asp Trp Asn Val Gln Lys Ala Gln Lys
195 200 205

Gly Ile Val Phe Ile Asp Glu Ile Asp Lys Ile Ser Arg Leu Ser Glu
210 215 220

Asn Arg Ser Ile Thr Arg Asp Val Ser Gly Glu Gly Val Gln Gln Ala
225 230 235 240

Leu Leu Lys Ile Val Glu Gly Ser Leu Val Asn Ile Pro Pro Lys Gly
245 250 255

Gly Arg Lys His Pro Glu Gly Asn Phe Ile Gln Ile Asp Thr Ser Asp
260 265 270

Ile Leu Phe Ile Cys Ala Gly Ala Phe Asp Gly Leu Ala Glu Ile Ile
275 280 285

Lys Lys Arg Thr Thr Gln Asn Val Leu Gly Phe Thr Gln Glu Lys Met
290 295 300

Ser Lys Lys Glu Gln Glu Ala Ile Leu His Leu Val Gln Thr His Asp
305 310 315 320

Leu Val Thr Tyr Gly Leu Ile Pro Glu Leu Ile Gly Arg Leu Pro Val
325 330 335

Leu Ser Thr Leu Asp Ser Ile Ser Leu Glu Ala Met Val Asp Ile Leu
340 345 350

Gln Lys Pro Lys Asn Ala Leu Ile Lys Gln Tyr Gln Gln Leu Phe Lys
355 360 365

Met Asp Glu Val Asp Leu Ile Phe Glu Glu Glu Ala Ile Lys Glu Ile
370 375 380

Ala Gln Leu Ala Leu Glu Arg Lys Thr Gly Ala Arg Gly Leu Arg Ala
385 390 395 400

Ile Ile Glu Asp Phe Cys Leu Asp Ile Met Phe Asp Leu Pro Lys Leu
405 410 415

Lys Gly Ser Glu Val Arg Ile Thr Lys Asp Cys Val Leu Lys Gln Ala
420 425 430

Glu Pro Leu Ile Ile Ala Lys Thr His Ser Lys Ile Leu Pro
435 440 445

ATC ATT AAA AAA CGC ACC ACG CAG AAT GTG TTG GGT TTC ACT CAA GAA 972

Ile Ile Lys Lys Arg Thr Thr Gln Asn Val Leu Gly Phe Thr Gln Glu
290 295 300



AAG ATG AGC AAA AAA GAG CAA GAA GCG ATC TTG CAT TTA GTC CAA ACC 1020

Lys Met Ser Lys Lys Glu Gln Glu Ala Ile Leu His Leu Val Gln Thr
305 310 315

CAT GAC CTG GTT ACT TAT GGG CTT ATC CCT GAG CTT ATT GGC CGT TTG 1068

His Asp Leu Val Thr Tyr Gly Leu Ile Pro Glu Leu Ile Gly Arg Leu
320 325 330

CCG GTT TTA AGC ACG CTA GAT AGC ATC AGT TTA GAA GCG ATG GTG GAT 1116
Pro Val Leu Ser Thr Leu Asp Ser Ile Ser Leu Glu Ala Met Val Asp
335 340 345 350

ATT TTA CAA AAA CCT AAA AAC GCT CTT ATC AAG CAA TAC CAG CAG CTT 1164
Ile Leu Gln Lys Pro Lys Asn Ala Leu Ile Lys Gln Tyr Gln Gln Leu
355 360 365 



TTC AAA ATG GAT GAG GTG GAT TTG ATC TTT GAA GAA GAA GCC ATT AAA 1212
Phe Lys Met Asp Glu Val Asp Leu Ile Phe Glu Glu Glu Ala Ile Lys
370 375 380

GAA ATC GCT CAA CTC GCA TTA GAA AGA AAA ACC GGG GCT AGG GGC TTA 1260

Glu Ile Ala Gln Leu Ala Leu Glu Arg Lys Thr Gly Ala Arg Gly Leu
385 390 395

AGG GCG ATC ATT GAA GAT TTT TGT TTG GAT ATT ATG TTT GAT TTA CCC 1308

Arg Ala Ile Ile Glu Asp Phe Cys Leu Asp Ile Met Phe Asp Leu Pro
400 405 410

AAG CTT AAA GGA TCG GAA GTG CGT ATC ACT AAA GAT TGT GTT TTA AAA 1356

Lys Leu Lys Gly Ser Glu Val Arg Ile Thr Lys Asp Cys Val Leu Lys
415 420 425 430

CAG GCT GAA CCT TTG ATC ATT GCT AAA ACG CAT TCT AAA ATT CTT CCT T 1405
Gln Ala Glu Pro Leu Ile Ile Ala Lys Thr His Ser Lys Ile Leu Pro
435 440 445

AAGGAACACG CTTATAAATT TAACGATAAA GGATTAGAAA GGG 1448

(2) INFORMATION FOR SEQ ID NO:1070:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070: 
Met Asn Glu Thr Leu Tyr Cys Ser Phe Cys Lys Lys Pro Glu Ser Arg

Arg Arg Met Glu Ser Ser Ala Tyr Glu Glu Glu Phe Leu Leu Ser Tyr
65 70 75 

ATT CCA GCC CCT AAA GAG CTT AAG GCG GTT TTA GAC AAT TAT GTG ATA 348

Ile Pro Ala Pro Lys Glu Leu Lys Ala Val Leu Asp Asn Tyr Val Ile
80 85 90

GGG CAA GAG CAG GCT AAA AAG GTT TTT TCC GTA GCC GTG TAT AAC CAT 396

Gly Gln Glu Gln Ala Lys Lys Val Phe Ser Val Ala Val Tyr Asn His
95 100 105 110

TAC AAA CGC TTA TCT TTT AAA GAA AAA CTC AAA AAA CAA GAC AAC CAA 444
Tyr Lys Arg Leu Ser Phe Lys Glu Lys Leu Lys Lys Gln Asp Asn Gln
115 120 125

GAC AGC AAT GTG GAG TTA GAG CAT TTA GAA GAA GTG GAG TTG AGC AAG 492

Asp Ser Asn Val Glu Leu Glu His Leu Glu Glu Val Glu Leu Ser Lys
130 135 140

TCT AAT ATT TTA CTA ATC GGC CCT ACA GGA TCA GGC AAA ACT TTA ATG 540

Ser Asn Ile Leu Leu Ile Gly Pro Thr Gly Ser Gly Lys Thr Leu Met
145 150 155

GCG CAA ACT CTG GCC AAG CAT TTG GAT ATT CCT ATC GCC ATT AGC GAT 588

Ala Gln Thr Leu Ala Lys His Leu Asp Ile Pro Ile Ala Ile Ser Asp
160 165 170

GCG ACT AGC TTG ACT GAA GCG GGC TAT GTG GGC GAA GAC GTG GAA AAT 636

Ala Thr Ser Leu Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn
175 180 185 190

ATT CTC ACA AGA TTG TTG CAA GCG AGC GAC TGG AAT GTC CAA AAA GCC 684

Ile Leu Thr Arg Leu Leu Gln Ala Ser Asp Trp Asn Val Gln Lys Ala
195 200 205

CAA AAA GGC ATT GTG TTT ATT GAT GAG ATT GAT AAA ATC AGC CGT TTG 732

Gln Lys Gly Ile Val Phe Ile Asp Glu Ile Asp Lys Ile Ser Arg Leu
210 215 220

TCA GAA AAC CGC TCT ATC ACT AGA GAT GTT TCT GGC GAG GGC GTT CAG 780
Ser Glu Asn Arg Ser Ile Thr Arg Asp Val Ser Gly Glu Gly Val Gln
225 230 235

CAA GCG TTG TTG AAA ATC GTT GAA GGT TCT TTA GTG AAT ATC CCC CCC 828

Gln Ala Leu Leu Lys Ile Val Glu Gly Ser Leu Val Asn Ile Pro Pro
240 245 250

AAA GGC GGC AGA AAG CAC CCT GAG GGC AAT TTC ATT CAA ATT GAC ACG 876
Lys Gly Gly Arg Lys His Pro Glu Gly Asn Phe Ile Gln Ile Asp Thr
255 260 265 270

AGC GAT ATT TTA TTC ATT TGT GCT GGA GCG TTT GAT GGG TTA GCT GAA 924
Ser Asp Ile Leu Phe Ile Cys Ala Gly Ala Phe Asp Gly Leu Ala Glu
275 280 285

65

70 75 80

Tyr Glu Ser Ala Glu Arg Ile Lys His Ala Gln Ser Lys Gly Ile Leu
85 90 95

Lys Asn Asn Lys Arg Trp Phe Pro Asn Pro Lys Gly Lys Leu Cys Leu
100 105 110

Asp Val Trp Glu Ile Thr Ser Gln Arg His Val Glu Lys Glu Lys Gly
115 120 125

Lys Ile Leu Lys Pro Lys His Pro Ser Ile Lys Pro Lys Ala Leu Ile
130 135 140

Glu Arg Met Ile Lys Ala Ser Ser His Lys Asn Asp Leu Ile Leu Asp
145 150 155 160

Leu Phe Ser Gly Ser Gly Met Thr Ser Leu Val Ala Lys Ser Leu Glu
165 170 175

Arg Asn Phe Ile Gly Cys Glu Ser His Ala Glu Tyr Val His Gly Ser
180 185 190

Leu Glu Met Phe Arg Tyr Asn Glu Cys Glu
195 200



(2) INFORMATION FOR SEQ ID NO:1069:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1448 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 67...1404
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069:

AGAGTTCTAG GGGCGTGGCG TATAAGTCAA GCGAATATTC TAGCGAAGAA AAACAAGAGG 60

AATAAC ATG AAC GAA ACG CTT TAT TGC AGT TTT TGC AAA AAA CCA GAA 108
Met Asn Glu Thr Leu Tyr Cys Ser Phe Cys Lys Lys Pro Glu
1 5 10

TCA AGA GAT CCC AAA AAA CGC CGC ATT ATT TTT GCG AGC AAT CTC AAT 156
Ser Arg Asp Pro Lys Lys Arg Arg Ile Ile Phe Ala Ser Asn Leu Asn
15 20 25 30

AAA GAT GTG TGC GTG TGC GAA TAT TGT ATA GAT GTG ATG CAT GGG GAA 204
Lys Asp Val Cys Val Cys Glu Tyr Cys Ile Asp Val Met His Gly Glu
35 40 45

TTG CAC AAA TAG GAC AAT TCT TTA TTG GCG CTC AAA AGA GAC CGA TTG 252
Leu His Lys Tyr Asp Asn Ser Leu Leu Ala Leu Lys Arg Asp Arg Leu
50 55 60

AGA AGA ATG GAA TCT AGC GCT TAT GAA GAA GAG TTT TTA CTC TCT TAG 300
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TAT GAA TCC GCT GAA CGC ATC AAA CAT GCT CAA AGT AAG GGG ATT TTA 3 47 

Tyr Glu Ser Ala Glu Arg Ile Lys His Ala Gln Ser Lys Gly Ile Leu
85 90 95

AAA AAT AAC AAA CGC TGG TTC CCT AAC CCT AAG GGC AAA TTA TGC CTT 395
Lys Asn Asn Lys Arg Trp Phe Pro Asn Pro Lys Gly Lys Leu Cys Leu
100 105 110

GAT GTG TGG GAA ATC ACT TCA CAA AGG CAT GTT GAA AAA GAG AAG GGT 443
Asp Val Trp Glu Ile Thr Ser Gln Arg His Val Glu Lys Glu Lys Gly
115 120 125

AAA ATC CTT AAG CCC AAA CAC CCC AGC ATC AAA CCT AAA GCG CTC ATT 491
Lys Ile Leu Lys Pro Lys His Pro Ser Ile Lys Pro Lys Ala Leu Ile
130 135 140

GAA CGC ATG ATA AAA GCT AGC TCT CAC AAA AAC GAT TTG ATT TTA GAT 539
Glu Arg Met Ile Lys Ala Ser Ser His Lys Asn Asp Leu Ile Leu Asp
145 150 155 160

TTG TTT AGC GGC AGT GGC ATG ACT AGC TTA GTG GCT AAA AGT TTG GAG 587
Leu Phe Ser Gly Ser Gly Met Thr Ser Leu Val Ala Lys Ser Leu Glu
165 170 175

CGT AAT TTT ATA GGG TGT GAG AGC CAT GCT GAA TAC GTG CAT GGG AGT 635
Arg Asn Phe Ile Gly Cys Glu Ser His Ala Glu Tyr Val His Gly Ser
180 185 190

TTG GAA ATG TTT AGG TAT AAT GAA TGC GAA TAAAAAAGGA TATTTGACAT GCCA 689
Leu Glu Met Phe Arg Tyr Asn Glu Cys Glu
195 200

AAATTAGAAA AAAT 703

(2) INFORMATION FOR SEQ ID NO: 1068:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 202 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1068:

Met Leu Pro Lys Leu Lys Asp Thr Gly Ser Phe Tyr Ile Phe Asn Thr
1 5 10 15

Pro Phe Asn Cys Ala Leu Phe Leu Ala Tyr Leu His His Lys Lys Val
20 25 30

His Phe Leu Asn Phe Ile Thr Trp Val Lys Lys Asp Gly Phe Ala Asn
35 40 45

Ala Lys Lys Arg Tyr Asn His Ala Gln Glu Ser Ile Leu Phe Tyr Ser
50 55 60

Met His Lys Lys Asn Tyr Thr Phe Asn Ala Asp Glu Ile Arg Ile Ala
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290 295 300 

Tyr Glu Ile Pro Lys Asp Lys Ile Thr Glu Leu Ser Val Arg Tyr Val
305 310 315 320

Arg Phe Asn Ser Gly Val Leu Gly Tyr Gly Ile Gly Leu Gly Leu Val
325 330 335

Lys Lys Val Cys Glu Lys Tyr Lys Met Arg Leu Glu Ile His Ser Glu
340 345 350

Pro Ser Leu Lys Gly Ser Phe Tyr Glu Asn Ser Phe Cys Val Gln Phe
355 360 365

Gln Gly
370

(2) INFORMATION FOR SEQ ID NO: 1067:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 703 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 60...665
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1067:

GGGATAGTTT TAAAAATGAT GAAGAGTTTT TAACATTTTC TTACGCTTGG ATTGATAAA 59

ATG CTG CCC AAA CTT AAA GAC ACA GGG AGT TTT TAT ATC TTT AAT ACC 107
Met Leu Pro Lys Leu Lys Asp Thr Gly Ser Phe Tyr Ile Phe Asn Thr
1 5 10 15

CCT TTT AAT TGC GCT TTA TTT TTA GCG TAT TTG CAC CAT AAA AAA GTG 155
Pro Phe Asn Cys Ala Leu Phe Leu Ala Tyr Leu His His Lys Lys Val
20 25 30

CAT TTT TTA AAT TTT ATC ACT TGG GTT AAA AAA GAT GGG TTT GCC AAC 203
His Phe Leu Asn Phe Ile Thr Trp Val Lys Lys Asp Gly Phe Ala Asn
35 40 45

GCC AAA AAG CGT TAT AAC CAC GCG CAA GAA AGC ATT TTA TTT TAT AGC 251
Ala Lys Lys Arg Tyr Asn His Ala Gln Glu Ser Ile Leu Phe Tyr Ser
50 55 60

ATG CAC AAG AAA AAC TAC ACC TTT AAT GCC GAT GAG ATT CGC ATC GCT 299
Met His Lys Lys Asn Tyr Thr Phe Asn Ala Asp Glu Ile Arg Ile Ala
65 70 75 80
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TTT TAT GAA AAT TCG TTT TGC GTT CAA TTT CAA GGA TAAAGATGCT TTCAGT 122 3 
Phe Tyr Glu Asn Ser Phe Cys Val Gln Phe Gln Gly
360 365 370

GTATGAAAAA GTGAATGCTC TAG 1246

(2) INFORMATION FOR SEQ ID NO: 1066:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 370 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066:

Met Leu Val Ile Ser Val Leu Ala Phe Asn Tyr Glu Lys Asn Glu Lys
1 5 10 15

Ile Lys Met Ile Arg Met Asp Met Asp Lys Met Ala Ser Lys Ile Ala
20 25 30

Ser Glu Val Ile Ala Leu His Met Gln Thr His Gly Asp Tyr Gln Asn
35 40 45

Ala Leu Asn Ala Leu Ile Ser Arg Tyr Lys Asp Ala Ser Ile Ala Leu
50 55 60

Phe Asp Ser Lys Lys Arg Val Leu Tyr Ser Asn Ile Pro Glu Ser Ala
65 70 75 80

Asn Leu Ile Lys Asn His Lys Glu Ala Gly Phe Phe Ser Phe Arg Gly
85 90 95

Glu Tyr Tyr Leu Leu Ser Asp Glu Thr Phe Ala His Leu Gly Val Ala
100 105 110

Lys Met Leu Phe Lys Asn Ser Lys Pro Leu His Phe Ser Ser Leu Tyr
115 120 125

Arg Asn Ile Val Leu Val Phe Val Val Ala Phe Leu Cys Val Ile Gly
130 135 140

Val Ser Val Phe Leu Gly Arg Leu Phe Leu Lys Pro Ile Arg Asn Glu
145 150 155 160

Ile Thr Arg Ile Asp His Phe Leu Lys Asn Thr Thr His Glu Leu Asn
165 170 175

Thr Pro Met Ser Ala Leu Val Leu Ser Leu Lys Thr Leu Glu Asp Asn
180 185 190

Gln Gln His Arg Arg Ile Lys Ile Ala Ile Gln Arg Met Ser Phe Leu
195 200 205

Tyr Arg Ser Leu Ser Tyr Leu Val Met Gln Asp Ile Glu Arg Glu Ser
210 215 220

Phe Val Leu Leu Asp Leu Lys Ala Leu Ile Ile Lys Glu Asn Thr Leu
225 230 235 240

Phe Ser Glu Met Ile Asp Tyr His Lys Leu Glu Phe Lys Ser Asp Leu
245 250 255

Val Glu Val Glu Leu Lys Ala Lys Glu Gln Asp Phe Ile Ser Leu Tyr
260 265 270

Ser Asn Leu Leu Met Asn Ala Ile Lys Tyr Ser Val Met Asn Gly Tyr
275 280 285

Ile His Ile Glu Leu Thr His Ala Phe Leu Lys Val Lys Asn Leu Gly
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TTT GTT GTA GCG TTT TTA TGC GTG ATA GGG GTT TCT GTG TTT TTG GGG 54 7 

Phe Val Val Ala Phe Leu Cys Val Ile Gly Val Ser Val Phe Leu Gly
135 140 145 150

CGT TTG TTT TTA AAG CCC ATT AGG AAT GAA ATC ACC CGC ATT GAT CAT 595
Arg Leu Phe Leu Lys Pro Ile Arg Asn Glu Ile Thr Arg Ile Asp His
155 160 165

TTT TTA AAA AAC ACC ACG CAT GAA TTA AAC ACC CCC ATG AGC GCT TTA 643
Phe Leu Lys Asn Thr Thr His Glu Leu Asn Thr Pro Met Ser Ala Leu
170 175 180

GTC TTG TCT TTA AAA ACC TTA GAA GAC AAC CAA CAA CAC CGC CGC ATT 691
Val Leu Ser Leu Lys Thr Leu Glu Asp Asn Gln Gln His Arg Arg Ile
185 190 195

AAA ATC GCT ATC CAG CGC ATG AGT TTT TTA TAC CGC TCG CTC TCG TAT 739
Lys Ile Ala Ile Gln Arg Met Ser Phe Leu Tyr Arg Ser Leu Ser Tyr
200 205 210

TTA GTG ATG CAA GAT ATT GAG CGC GAA TCT TTT GTG CTT TTA GAT TTA 787
Leu Val Met Gln Asp Ile Glu Arg Glu Ser Phe Val Leu Leu Asp Leu
215 220 225 230

AAA GCC CTG ATT ATT AAA GAA AAC ACG CTT TTT AGC GAG ATG ATA GAC 835
Lys Ala Leu Ile Ile Lys Glu Asn Thr Leu Phe Ser Glu Met Ile Asp
235 240 245

TAC CAC AAG CTG GAA TTT AAA AGC GAT TTA GTG GAA GTG GAA CTT AAA 883
Tyr His Lys Leu Glu Phe Lys Ser Asp Leu Val Glu Val Glu Leu Lys
250 255 260

GCT AAA GAG CAG GAT TTC ATT TCG CTT TAT AGC AAT TTG CTC ATG AAC 931
Ala Lys Glu Gln Asp Phe Ile Ser Leu Tyr Ser Asn Leu Leu Met Asn
265 270 275

GCG ATC AAA TAC AGC GTC ATG AAT GGG TAT ATC CAC ATA GAG CTA ACG 979
Ala Ile Lys Tyr Ser Val Met Asn Gly Tyr Ile His Ile Glu Leu Thr
280 285 290

CAT GCG TTT TTG AAA GTG AAA AAT TTA GGG TAT GAA ATC CCT AAA GAC 1027
His Ala Phe Leu Lys Val Lys Asn Leu Gly Tyr Glu Ile Pro Lys Asp
295 300 305 310

AAG ATC ACA GAA TTA AGC GTT CGT TAT GTG CGT TTC AAT TCT GGC GTG 1075
Lys Ile Thr Glu Leu Ser Val Arg Tyr Val Arg Phe Asn Ser Gly Val
315 320 325

TTG GGT TAT GGT ATA GGG TTA GGT TTG GTG AAA AAA GTG TGC GAA AAG 1123
Leu Gly Tyr Gly Ile Gly Leu Gly Leu Val Lys Lys Val Cys Glu Lys
330 335 340

TAT AAA ATG CGT TTA GAA ATT CAT AGC GAA CCC TCT TTA AAG GGA TCG 1171
Tyr Lys Met Arg Leu Glu Ile His Ser Glu Pro Ser Leu Lys Gly Ser
345 350 355
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(2) INFORMATION FOR SEQ ID NO: 1065: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1246 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 98...1207
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1065:

GAAACGCATA AGGGGGTTGG CTATCGCTTT AACCCACTAT GAAAAAAAAT CCCTTAAACT 60
CTTTTTAGGG ATTTATTTAG GCTCTTCGTT TGTGTTG ATG CTA GTG ATT AGC GTT 115
Met Leu Val Ile Ser Val
1 5

TTA GCG TTT AAC TAT GAA AAA AAC GAA AAA ATC AAA ATG ATA CGC ATG 163
Leu Ala Phe Asn Tyr Glu Lys Asn Glu Lys Ile Lys Met Ile Arg Met
10 15 20

GAC ATG GAC AAA ATG GCT TCT AAG ATC GCT AGC GAA GTG ATT GCC TTG 211
Asp Met Asp Lys Met Ala Ser Lys Ile Ala Ser Glu Val Ile Ala Leu
25 30 35

CAC ATG CAA ACG CAT GGG GAT TAT CAA AAC GCT TTA AAC GCT CTC ATT 259
His Met Gln Thr His Gly Asp Tyr Gln Asn Ala Leu Asn Ala Leu Ile
40 45 50

TCA CGC TAT AAA GAC GCT TCC ATA GCC CTT TTT GAT AGT AAA AAG CGT 307
Ser Arg Tyr Lys Asp Ala Ser Ile Ala Leu Phe Asp Ser Lys Lys Arg
55 60 65 70

GTT TTG TAT TCT AAT ATC CCT GAA AGC GCC AAT TTG ATT AAA AAC CAT 355
Val Leu Tyr Ser Asn Ile Pro Glu Ser Ala Asn Leu Ile Lys Asn His
75 80 85

AAA GAA GCG GGC TTT TTT AGT TTT AGG GGA GAG TAT TAC CTA TTG AGC 403
Lys Glu Ala Gly Phe Phe Ser Phe Arg Gly Glu Tyr Tyr Leu Leu Ser
90 95 100

GAT GAA ACT TTC GCT CAC TTA GGC GTG GCT AAA ATG CTT TTT AAA AAT 451
Asp Glu Thr Phe Ala His Leu Gly Val Ala Lys Met Leu Phe Lys Asn
105 110 115

TCT AAA CCC CTT CAT TTT TCT TCT TTG TAT CGT AAC ATT GTT TTA GTG 499
Ser Lys Pro Leu His Phe Ser Ser Leu Tyr Arg Asn Ile Val Leu Val
120 125 130
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50 

55 60

Asn Phe Ile Arg Gln Lys Met Pro Lys Asp Lys Gln Ile Lys Glu Glu
65 70 75 80

Asp Leu Val Ala Ile Phe Ala Ala Ser Pro Ile Asp Asn Ile Glu Ala
85 90 95

Tyr Val Glu Glu Ile Lys Asn Ala Ser Ile Lys Arg Lys Leu Phe Gly
100 105 110

Leu Ala Asn Thr Ile Arg Glu Gln Ala Leu Glu Ser Ala Gln Lys Ser
115 120 125

Ser Asp Ile Leu Gly Ala Val Glu Arg Glu Val Tyr Ala Leu Leu Asn
130 135 140

Gly Ser Thr Ile Glu Gly Phe Arg Asn Ile Lys Glu Val Leu Glu Ser
145 150 155 160

Ala Met Asp Leu Ile Thr Glu Asn Gln Arg Lys Gly Ser Leu Glu Val
165 170 175

Thr Gly Ile Pro Thr Gly Phe Val Gln Leu Asp Asn Tyr Thr Ser Gly
180 185 190

Phe Asn Lys Gly Ser Leu Val Ile Ile Gly Ala Arg Pro Ser Met Gly
195 200 205

Lys Thr Ser Leu Met Met Asn Met Val Leu Ser Ala Leu Asn Asp Asp
210 215 220

Arg Gly Val Ala Val Phe Ser Leu Glu Met Ser Ala Glu Gln Leu Ala
225 230 235 240

Leu Arg Ala Leu Ser Asp Leu Thr Ser Ile Asn Met His Asp Leu Glu
245 250 255

Ser Gly Arg Leu Asp Asp Asp Gln Trp Glu Asn Leu Ala Lys Cys Phe
260 265 270

Asp His Leu Ser Gln Lys Lys Leu Phe Phe Tyr Asp Lys Ser Tyr Val
275 280 285

Arg Ile Glu Gln Ile Arg Leu Gln Leu Arg Lys Leu Lys Ser Gln His
290 295 300

Lys Glu Leu Gly Ile Ala Phe Ile Asp Tyr Leu Gln Leu Met Ser Gly
305 310 315 320

Ser Lys Ala Thr Lys Glu Arg His Glu Gln Ile Ala Glu Ile Ser Arg
325 330 335

Glu Leu Lys Thr Leu Ala Arg Glu Leu Glu Ile Pro Ile Ile Ala Leu
340 345 350

Val Gln Leu Asn Arg Ser Leu Glu Asn Arg Asp Asp Lys Arg Pro Ile
355 360 365

Leu Ser Asp Ile Lys Asp Ser Gly Gly Ile Glu Gln Asp Ala Asp Ile
370 375 380

Val Leu Phe Leu Tyr Arg Gly Tyr Ile Tyr Gln Met Arg Ala Glu Asp
385 390 395 400

Asn Lys Ile Asp Lys Leu Lys Lys Glu Gly Lys Ile Glu Glu Ala Gln
405 410 415

Glu Leu Tyr Leu Lys Val Asn Glu Glu Arg Arg Ile His Lys Gln Asn
420 425 430

Gly Ser Ile Glu Glu Ala Glu Ile Ile Val Ala Lys Asn Arg Asn Gly
435 440 445

Ala Thr Gly Thr Val Tyr Thr Arg Phe Asn Ala Pro Phe Thr Arg Tyr
450 455 460

Glu Asp Met Pro Ile Asp Ser His Leu Glu Glu Gly Gln Glu Thr Lys
465 470 475 480

Val Asp Tyr Asp Ile Val Thr Thr
485






WO 98/43478



PCT/US98/06371 



Glu Asn Arg Asp Asp Lys Arg Pro Ile Leu Ser Asp Ile Lys Asp Ser
360 365 370 375

GGG GGG ATT GAA CAA GAC GCT GAT ATT GTT TTA TTT TTA TAT AGA GGC 1206
Gly Gly Ile Glu Gln Asp Ala Asp Ile Val Leu Phe Leu Tyr Arg Gly
380 385 390

TAT ATC TAT CAA ATG AGG GCT GAA GAC AAC AAA ATA GAC AAA CTC AAA 1254
Tyr Ile Tyr Gln Met Arg Ala Glu Asp Asn Lys Ile Asp Lys Leu Lys
395 400 405

AAA GAA GGT AAA ATT GAA GAG GCG CAA GAG TTG TAC TTA AAA GTT AAT 1302
Lys Glu Gly Lys Ile Glu Glu Ala Gln Glu Leu Tyr Leu Lys Val Asn
410 415 420

GAA GAA AGG CGT ATC CAC AAG CAA AAT GGC AGC ATT GAA GAG GCT GAA 1350
Glu Glu Arg Arg Ile His Lys Gln Asn Gly Ser Ile Glu Glu Ala Glu
425 430 435

ATC ATT GTG GCT AAA AAC AGG AAT GGG GCT ACA GGA ACQ GTT TAT ACG 1398
Ile Ile Val Ala Lys Asn Arg Asn Gly Ala Thr Gly Thr Val Tyr Thr
440 445 450 455

CGC TTT AAC GCT CCT TTC ACG CGC TAT GAA GAC ATG CCC ATA GAT TCC 1446
Arg Phe Asn Ala Pro Phe Thr Arg Tyr Glu Asp Met Pro Ile Asp Ser
460 465 470

CAT TTA GAA GAA GGG CAA GAA ACT AAA GTG GAT TAT GAT ATA GTT ACA 1494
His Leu Glu Glu Gly Gln Glu Thr Lys Val Asp Tyr Asp Ile Val Thr
475 480 485

ACT TGAAAGACAA AACTTTTCAG GGGGCGTTTG AACTTCTTA 1536
Thr



(2) INFORMATION FOR SEQ ID NO: 1064:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 488 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064:

Met Asp His Leu Lys His Leu Gln Gln Leu Gln Asn Ile Glu Arg Ile
1 5 10 15

Val Leu Ser Gly Ile Val Leu Ala Asn His Lys Ile Glu Glu Val His
20 25 30

Ser Val Leu Glu Pro Ser Asp Phe Tyr Tyr Pro Pro Asn Gly Leu Phe
35 40 45

Phe Glu Ile Ala Leu Lys Leu His Glu Glu Asp Cys Pro Ile Asp Glu
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140 145 150 

AGG AAT ATT AAA GAA GTG CTT G7^ AGC GCA ATG GAT CTC ATT ACA GAA 534
Arg Asn Ile Lys Glu Val Leu Glu Ser Ala Met Asp Leu Ile Thr Glu
155 160 165

AAC CAA AGA AAG GGG AGT TTG GAA GTT ACT GGC ATA CCG ACT GGC TTT 582
Asn Gln Arg Lys Gly Ser Leu Glu Val Thr Gly Ile Pro Thr Gly Phe
170 175 180

GTC CAA TTG GAT AAC TAT ACG AGC GGT TTC AAT AAG GGG AGT TTA GTC 630
Val Gln Leu Asp Asn Tyr Thr Ser Gly Phe Asn Lys Gly Ser Leu Val
185 190 195

ATT ATA GGG GCA AGG CCG TCT ATG GGT AAA ACT AGT TTG ATG ATG AAC 678
Ile Ile Gly Ala Arg Pro Ser Met Gly Lys Thr Ser Leu Met Met Asn
200 205 210 215

ATG GTC TTA AGC GCG CTC AAT GAC GAT AGG GGG GTA GCG GTT TTT AGT 726
Met Val Leu Ser Ala Leu Asn Asp Asp Arg Gly Val Ala Val Phe Ser
220 225 230

TTA GAA ATG TCC GCA GAG CAA CTC GCT TTA AGG GCG TTA TCG GAT CTC 774
Leu Glu Met Ser Ala Glu Gln Leu Ala Leu Arg Ala Leu Ser Asp Leu
235 240 245

ACC TCT ATT AAC ATG CAT GAT TTA GAA AGC GGG AGG CTT GAT GAT GAT 822
Thr Ser Ile Asn Met His Asp Leu Glu Ser Gly Arg Leu Asp Asp Asp
250 255 260

CAA TGG GAA AAT TTA GCC AAA TGC TTT GAT CAC CTT TCG CAA AAA AAA 870
Gln Trp Glu Asn Leu Ala Lys Cys Phe Asp His Leu Ser Gln Lys Lys
265 270 275

CTC TTT TTC TAC GAT AAA AGT TAT GTG AGG ATA GAG CAA ATC CGC TTG 918
Leu Phe Phe Tyr Asp Lys Ser Tyr Val Arg Ile Glu Gln Ile Arg Leu
280 285 290 295

CAA CTA CGA AAG CTT AAA TCC CAA CAC AAG GAA TTG GGT ATC GCT TTT 966
Gln Leu Arg Lys Leu Lys Ser Gln His Lys Glu Leu Gly Ile Ala Phe
300 305 310

ATT GAC TAT TTG CAG CTC ATG TCA GGG AGT AAA GCC ACT AAA GAG CGC 1014
Ile Asp Tyr Leu Gln Leu Met Ser Gly Ser Lys Ala Thr Lys Glu Arg
315 320 325

CAT GAG CAA ATC GCT GAA ATT TCA AGG GAG CTT AAA ACT TTA GCC AGA 1062
His Glu Gln Ile Ala Glu Ile Ser Arg Glu Leu Lys Thr Leu Ala Arg
330 335 340

GAA TTA GAA ATC CCT ATC ATA GCG TTA GTG CAA CTC AAC CGC AGC CTA 1110
Glu Leu Glu Ile Pro Ile Ile Ala Leu Val Gln Leu Asn Arg Ser Leu
345 350 355

GAA AAC CGA GAC GAT AAA CGG CCC ATT CTT TCG GAT ATC AAA GAC AGC 1158
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1497
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063:

TAGAAAAGAT CAAACAATTA TAAAAGGATA AAA ATG GAT CAT TTA AAG CAT TTG 54
Met Asp His Leu Lys His Leu
1 5

CAG CAA TTG CAA AAC ATT GAA AGG ATC GTG CTT TCA GGC ATT GTG TTG 102
Gln Gln Leu Gln Asn Ile Glu Arg Ile Val Leu Ser Gly Ile Val Leu
10 15 20

GCC AAT CAT AAG ATT GAA GAG GTC CAT AGC GTT TTA GAG CCT AGC GAT 150
Ala Asn His Lys Ile Glu Glu Val His Ser Val Leu Glu Pro Ser Asp
25 30 35

TTT TAC TAC CCG CCT AAC GGC TTA TTT TTT GAA ATC GCT TTA AAA CTG 198
Phe Tyr Tyr Pro Pro Asn Gly Leu Phe Phe Glu Ile Ala Leu Lys Leu
40 45 50 55

CAT GAA GAA GAT TGC CCC ATT GAT GAG AAT TTT ATC CGC CAA AAA ATG 246
His Glu Glu Asp Cys Pro Ile Asp Glu Asn Phe Ile Arg Gln Lys Met
60 65 70

CCT AAA GAC AAG CAG ATC AAA GAA GAA GAT CTA GTC GCT ATT TTT GCG 294
Pro Lys Asp Lys Gln Ile Lys Glu Glu Asp Leu Val Ala Ile Phe Ala
75 80 85

GCA AGC CCC ATA GAT AAT ATT GAA GCC TAT GTG GAA GAG ATT AAA AAC 342
Ala Ser Pro Ile Asp Asn Ile Glu Ala Tyr Val Glu Glu Ile Lys Asn
90 95 100

GCT TCC ATT AAA CGA AAA CTT TTT GGC TTG GCT AAC ACC ATT AGA GAG 390
Ala Ser Ile Lys Arg Lys Leu Phe Gly Leu Ala Asn Thr Ile Arg Glu
105 110 115

CAA GCC CTA GAA AGC GCG CAA AAA TCC AGC GAT ATT TTA GGC GCT GTG 438
Gln Ala Leu Glu Ser Ala Gln Lys Ser Ser Asp Ile Leu Gly Ala Val
120 125 130 135

GAG CGA GAA GTC TAT GCG TTA TTG AAT GGC AGC ACC ATA GAA GGC TTT 486
Glu Arg Glu Val Tyr Ala Leu Leu Asn Gly Ser Thr Ile Glu Gly Phe
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1062: 



Met Asp Phe Leu Lys Glu Asn Leu Asn Thr Ile Ile Glu Gly Asp Cys
1 5 10 15

Leu Glu Lys Leu Lys Asp Phe Pro Asn Arg Ser Val Asp Phe Ile Phe
20 25 30

Ala Asp Pro Pro Tyr Phe Met Gln Thr Glu Gly Glu Leu Lys Arg Phe
35 40 45

Glu Gly Thr Lys Phe Gln Gly Val Glu Asp Tyr Trp Asp Lys Phe Gly
50 55 60

Ser Phe Lys Glu Tyr Asp Ala Phe Cys Leu Gly Trp Leu Lys Glu Cys
65 70 75 80

Gln Arg Ile Leu Lys Asp Asn Gly Ser Ile Cys Val Ile Gly Ser Phe
85 90 95

Gln Asn Ile Phe Arg Ile Gly Phe His Leu Gln Asn Leu Gly Phe Trp
100 105 110

Ile Leu Asn Asp Ile Ile Trp His Lys Ser Asn Pro Val Pro Asn Phe
115 120 125

Ala Gly Lys Arg Leu Cys Asn Ala His Glu Thr Leu Ile Trp Cys Ala
130 135 140

Lys His Lys Asn Ser Lys Val Ala Phe Asn Tyr Lys Thr Met Lys Tyr
145 150 155 160

Leu Asn Asn Asp Lys Gln Glu Lys Ser Val Trp Gln Ile Pro Ile Cys
165 170 175

Met Gly Asn Glu Arg Leu Lys Asp Ala Gln Gly Lys Lys Val His Ser
180 185 190

Thr Gln Lys Pro Glu Ala Leu Leu Lys Lys Ile Ile Leu Ser Ala Thr
195 200 205

Lys Pro Lys Asp Ile Ile Leu Asp Pro Phe Phe Gly Thr Gly Thr Thr
210 215 220

Gly Ala Val Ala Lys Ser Met Asn Arg Tyr Phe Ile Gly Ile Glu Lys
225 230 235 240

Asp Ser Phe Tyr Ile Lys Glu Ala Ala Lys Arg Leu Asn Asn Thr Arg
245 250 255

Asp Lys Ser Asp Phe Ile Thr Asn Leu Asp Leu Glu Thr Lys Pro Pro
260 265 270

Lys Ile Pro Met Ser Leu Leu Ile Ser Lys Gln Leu Leu Lys Ile Gly
275 280 285

Asp Phe Leu Tyr Ser Pro Asn Lys Glu Lys Ile Cys Gln Val Leu Glu
290 295 300

Asn Gly Gln Val Arg Asp Asn Glu Asn Tyr Glu Thr Ser Ile His Lys
305 310 315 320

Met Ser Ala Lys Tyr Leu Asn Lys Thr Asn His Asn Gly Trp Lys Phe
325 330 335

Phe Tyr Ala Tyr Tyr Gln Asn Gln Phe Leu Leu Leu Asp Glu Leu Arg
340 345 350

Tyr Ile Cys Gln Lys Asp Ser
355



(2) INFORMATION FOR SEQ ID NO: 1063:
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Trp Gin lie Pro lie Cys Met Gly Asn Glu Arg Leu Lys Asp Ala Gin 
175 180 185 

GGT AAA AAA GTG CAT TCC ACG CAA AAA CCA GAA GCG CTT TTA AAA AAA 627
Gly Lys Lys Val His Ser Thr Gln Lys Pro Glu Ala Leu Leu Lys Lys
190 195 200

ATC ATT TTA AGC GCG ACT AAA CCT AAA GAT ATT ATT TTA GAT CCC TTT 675
Ile Ile Leu Ser Ala Thr Lys Pro Lys Asp Ile Ile Leu Asp Pro Phe
205 210 215

TTT GGC ACA GGC ACA ACA GGG GCT GTG GCT AAA TCC ATG AAC AGG TAT 723
Phe Gly Thr Gly Thr Thr Gly Ala Val Ala Lys Ser Met Asn Arg Tyr
220 225 230

TTT ATT GGT ATT GAA AAA GAT TCT TTT TAT ATT AAA GAA GCG GCA AAA 771
Phe Ile Gly Ile Glu Lys Asp Ser Phe Tyr Ile Lys Glu Ala Ala Lys
235 240 245 250

CGC CTG AAT AAC ACT AGG GAT AAA AGC GAT TTT ATC ACT AAT TTA GAT 819
Arg Leu Asn Asn Thr Arg Asp Lys Ser Asp Phe Ile Thr Asn Leu Asp
255 260 265

TTA GAA ACT AAA CCC CCA AAA ATA CCT ATG AGT CTT TTA ATT TCT AAA 867
Leu Glu Thr Lys Pro Pro Lys Ile Pro Met Ser Leu Leu Ile Ser Lys
270 275 280

CAA TTA TTA AAA ATC GGG GAT TTT TTA TAG TCA CCT AAC AAA GAA AAA 915
Gln Leu Leu Lys Ile Gly Asp Phe Leu Tyr Ser Pro Asn Lys Glu Lys
285 290 295

ATT TGT CAA GTT TTA GAA AAC GGA CAA GTG AGG GAT AAT GAA AAC TAT 963
Ile Cys Gln Val Leu Glu Asn Gly Gln Val Arg Asp Asn Glu Asn Tyr
300 305 310

GAA ACT TCT ATT CAT AAG ATG AGC GCT AAA TAT TTG AAT AAA ACC AAC 1011
Glu Thr Ser Ile His Lys Met Ser Ala Lys Tyr Leu Asn Lys Thr Asn
320 325 330

CAT AAT GGC TGG AAA TTT TTT TAT GCG TAT TAC CAA AAT CAA TTT TTA 1059
His Asn Gly Trp Lys Phe Phe Tyr Ala Tyr Tyr Gln Asn Gln Phe Leu
335 340 345

TTG CTA GAT GAA TTG CGT TAT ATC TGC CAA AAG GAC TCT TAATGGACTA TC 1110
Leu Leu Asp Glu Leu Arg Tyr Ile Cys Gln Lys Asp Ser
350 355

AAACCTTTAA CGAGATTTTT AATCGTTTTG TCTTTGGAAC 1150

(2) INFORMATION FOR SEQ ID NO:1062:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 359 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...1098
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061:

GATATAAAAG GTTAGTTAAT C ATG GAT TTT TTA AAA GAA AAC TTA AAC ACT 51
Met Asp Phe Leu Lys Glu Asn Leu Asn Thr
1 5 10

ATC ATA GAG GGG GAT TGT TTA GAA AAA TTG AAA GAC TTT CCT AAC AGA 99
Ile Ile Glu Gly Asp Cys Leu Glu Lys Leu Lys Asp Phe Pro Asn Arg
15 20 25

AGC GTT GAT TTT ATC TTT GCT GAC CCC CCA TAT TTT ATG CAA ACA GAG 147
Ser Val Asp Phe Ile Phe Ala Asp Pro Pro Tyr Phe Met Gln Thr Glu
30 35 40

GGG GAA TTG AAG CGT TTT GAA GGC ACA AAA TTT CAA GGC GTT GAG GAT 195
Gly Glu Leu Lys Arg Phe Glu Gly Thr Lys Phe Gln Gly Val Glu Asp
45 50 55

TAT TGG GAT AAA TTT GGC TCT TTT AAG GAA TAC GAT GCC TTT TGT TTG 243
Tyr Trp Asp Lys Phe Gly Ser Phe Lys Glu Tyr Asp Ala Phe Cys Leu
60 65 70

GGT TGG TTG AAA GAA TGC CAA AGO ATT TTA AAA GAT AAT GGC AGT ATT 291
Gly Trp Leu Lys Glu Cys Gln Arg Ile Leu Lys Asp Asn Gly Ser Ile
75 80 85 90

TGT GTG ATA GGG AGT TTT CAA AAT ATT TTT AGA ATT GGT TTT CAT TTG 339
Cys Val Ile Gly Ser Phe Gln Asn Ile Phe Arg Ile Gly Phe His Leu
95 100 105

CAA AAT TTA GGG TTT TGG ATA CTC AAT GAT ATT ATT TGG CAC AAG AGT 387
Gln Asn Leu Gly Phe Trp Ile Leu Asn Asp Ile Ile Trp His Lys Ser
110 115 120

AAT CCG GTG CCT AAT TTT GCT GGC AAG AGA TTA TGC AAC GCC CAT GAG 435
Asn Pro Val Pro Asn Phe Ala Gly Lys Arg Leu Cys Asn Ala His Glu
125 130 135

ACG CTT ATT TGG TGT GCT AAA CAC AAA AAC AGC AAA GTT GCC TTT AAT 483
Thr Leu Ile Trp Cys Ala Lys His Lys Asn Ser Lys Val Ala Phe Asn
140 145 150

TAT AAA ACA ATG AAG TAC CTC AAT AAC GAC AAA CAA GAA AAA TCG GTT 531
Tyr Lys Thr Met Lys Tyr Leu Asn Asn Asp Lys Gln Glu Lys Ser Val
155 160 165 170

TGG CAA ATC CCT ATT TGC ATG GGT AAC GAA AGA CTA AAA GAT GCG CAA 579
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100 

105 110

Gly Met Arg Asp Gly Val Leu Thr Val Ile Ala Pro Leu Glu Gly Thr
115 120 125

Pro Ala Tyr Lys Ala Gly Val Lys Ser Gly Asp Asn Ile Leu Lys Ile
130 135 140

Asn Asn Glu Ser Thr Leu Ser Met Ser Ile Asp Asp Ala Ile Asn Leu
145 150 155 160

Met Arg Gly Lys Pro Lys Thr Pro Ile Gln Ile Thr Val Val Arg Lys
165 170 175

Asn Glu Pro Lys Pro Leu Val Phe Asn Ile Ile Arg Asp Ile Ile Lys
180 185 190

Leu Pro Ser Val Tyr Val Lys Lys Ile Lys Glu Thr Pro Tyr Leu Tyr
195 200 205

Val Arg Val Ser Gly Phe Asp Lys Asn Val Thr Lys Ser Val Leu Glu
210 215 220

Gly Leu Lys Ala Asn Pro Lys Ala Lys Gly Ile Val Leu Asp Leu Arg
225 230 235 240

Gly Asn Pro Gly Gly Leu Leu Asn Gln Ala Val Gly Leu Ser Asn Leu
245 250 255

Phe Ile Lys Glu Gly Val Leu Val Ser Gln Lys Gly Lys Asn Lys Glu
260 265 270

Glu Asn Leu Glu Tyr Lys Ala Asn Gly Arg Ala Pro Tyr Thr Asn Leu
275 280 285

Pro Ile Ala Val Leu Val Asn Gly Gly Ser Ala Ser Ala Ser Glu Ile
290 295 300

Val Ala Gly Ala Leu Gln Asp His Lys Arg Ala Val Ile Ile Gly Glu
305 310 315 320

Lys Thr Phe Gly Lys Gly Ser Val Gln Met Leu Leu Pro Val Asn Lys
325 330 335

Asp Glu Ala Ile Lys Ile Thr Thr Ala Arg Tyr Tyr Leu Pro Ser Gly
340 345 350

Arg Thr Ile Gln Ala Lys Gly Ile Thr Pro Asp Ile Val Ile Tyr Pro
355 360 365

Gly Lys Val Pro Glu Asn Glu Asn Lys Phe Ser Leu Lys Glu Ala Asp
370 375 380

Leu Lys His His Leu Glu Gln Glu Leu Lys Lys Ile Asp Asp Lys Thr
385 390 395 400

Pro Asn Ser Lys Glu Ala Asp Lys Asp Lys Lys Asn Glu Glu Glu Lys
405 410 415

Glu Ile Thr Pro Lys Met Ile Asn Asp Asp Ile Gln Leu Lys Thr Ala
420 425 430

Ile Asp Ser Leu Lys Thr Trp Ser Ile Val Asp Glu Lys Met Asp Glu
435 440 445

Lys Ala Pro Lys Lys Lys
450



(2) INFORMATION FOR SEQ ID NO: 1061: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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ACC GCA CGC TAG TAT TTG CCG AGC GGG CGT ACC ATT CAA GCT AAG GGG 1110 
Thr Ala Arg Tyr Tyr Leu Pro Ser Gly Arg Thr Ile Gln Ala Lys Gly
345 350 355

ATC ACG CCT GAT ATT GTG ATT TAT CCG GGT AAA GTG CCA GAA AAT GAA 1158
Ile Thr Pro Asp Ile Val Ile Tyr Pro Gly Lys Val Pro Glu Asn Glu
360 365 370 375

AAC AAA TTC AGC TTG AAA GAA GCG GAT CTA AAA CAC CAT TTA GAG CAA 1206
Asn Lys Phe Ser Leu Lys Glu Ala Asp Leu Lys His His Leu Glu Gln
380 385 390

GAG CTT AAA AAG ATT GAT GAT AAA ACC CCC AAT TCC AAA GAG GCG GAT 1254
Glu Leu Lys Lys Ile Asp Asp Lys Thr Pro Asn Ser Lys Glu Ala Asp
395 400 405

AAA GAC AAG AAA AAC GAA GAG GAA AAA GAG ATT ACT CCT AAA ATG ATC 1302
Lys Asp Lys Lys Asn Glu Glu Glu Lys Glu Ile Thr Pro Lys Met Ile
410 415 420

AAC GAT GAT ATT CAG CTA AAA ACC GCT ATT GAC AGC TTG AAA ACC TGG 1350
Asn Asp Asp Ile Gln Leu Lys Thr Ala Ile Asp Ser Leu Lys Thr Trp
425 430 435

TCT ATC GTT GAT GAG AAA ATG GAT GAA AAA GCG CCT AAG AAG AAA TAAAA 1400
Ser Ile Val Asp Glu Lys Met Asp Glu Lys Ala Pro Lys Lys Lys
440 445 450

ACTCATGGGG TTTTTATTTG AAAAATCGTT AATGAGTTTT TTCGCTCATC CAATC 1455

(2) INFORMATION FOR SEQ ID NO: 1060:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 454 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060:

Met Thr Lys Arg Leu Phe Lys Gly Leu Leu Ala Val Ser Leu Ala Val
1 5 10 15

Ser Leu His Gly Gly Glu Val Lys Glu Lys Lys Pro Val Lys Pro Val
20 25 30

Lys Glu Asp Pro Gln Glu Leu Ala Ala Lys Arg Val Glu Ala Phe Ser
35 40 45

Arg Phe Ser Asn Val Val Ser Glu Ile Glu Lys Lys Tyr Val Asp Lys
50 55 60

Ile Ser Ile Ser Glu Ile Met Thr Lys Ala Ile Glu Gly Leu Leu Ser
65 70 75 80

Asn Leu Asp Ala His Ser Ala Tyr Leu Asn Glu Lys Lys Phe Lys Glu
85 90 95

Phe Gln Ala Gln Thr Glu Gly Glu Phe Gly Gly Leu Gly Ile Thr Val
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ACC GTT ATT GCC CCT TTA GAA GGC ACT CCA GCT TAG AAG GCT GGG GTT 43 8 

Thr Val lie Ala Pro Leu Glu Gly Thr Pro Ala Tyr Lys Ala Gly Val 
120 125 130 135 

AAG TCA GGC GAT AAC ATT TTA AAA ATC AAT AAC GAA AGC ACG CTG AGC 4 86 

Lys Ser Gly Asp Asn lie Leu Lys lie Asn Asn Glu Ser Thr Leu Ser 
140 145 150 

ATG AGC ATT GAT GAT GCG ATC AAC CTC ATG CGC GGC AAG CCA AAA ACC 534 
Met Ser lie Asp Asp Ala lie Asn Leu Met Arg Gly Lys Pro Lys Thr 
155 160 165 

CCT ATT CAG ATC ACC GTT GTA AGA AAA AAC GAG CCA AAA CCT TTA GTG 5 82 

Pro lie Gin lie Thr Val Val Arg Lys Asn Glu Pro Lys Pro Leu Val 
170 175 180 

TTT AAC ATC ATT AGA GAC ATC ATT AAA CTC CCC TCT GTC TAT GTG AAA 63 0 

Phe Asn lie lie Arg Asp lie lie Lys Leu Pro Ser Val Tyr Val Lys 
185 190 195 

AAG ATT AAA GAA ACC CCT TAT CTG TAT GTG AGA GTG AGT GGT TTT GAC 67 8 

Lys lie Lys Glu Thr Pro Tyr Leu Tyr Val Arg Val Ser Gly Phe Asp 
200 205 210 215 

AAG AAT GTT ACC AAA TCG GTT TTA GAA GGC TTA AAA GCT AAC CCT AAG 726

Lys Asn Val Thr Lys Ser Val Leu Glu Gly Leu Lys Ala Asn Pro Lys
220 225 230

GCT AAG GGG ATC GTG TTG GAT TTA AGG GGC AAT CCT GGA GGG CTA TTA 774

Ala Lys Gly Ile Val Leu Asp Leu Arg Gly Asn Pro Gly Gly Leu Leu
235 240 245

AAC CAA GCG GTG GGC TTG TCT AAC CTC TTC ATT AAA GAG GGG GTT TTA 822

Asn Gln Ala Val Gly Leu Ser Asn Leu Phe Ile Lys Glu Gly Val Leu
250 255 260

GTC TCT CAA AAA GGC AAA AAT AAA GAA GAA AAT TTA GAA TAC AAG GCT 870

Val Ser Gln Lys Gly Lys Asn Lys Glu Glu Asn Leu Glu Tyr Lys Ala
265 270 275 

AAC GGC AGA GCC CCT TAT ACC AAT TTG CCT ATT GCG GTG TTA GTC AAT 918 
Asn Gly Arg Ala Pro Tyr Thr Asn Leu Pro Ile Ala Val Leu Val Asn
280 285 290 295

GGC GGT TCA GCG AGC GCG AGC GAG ATC GTC GCA GGG GCA CTG CAA GAT 966

Gly Gly Ser Ala Ser Ala Ser Glu Ile Val Ala Gly Ala Leu Gln Asp
300 305 310

CAC AAA CGG GCC GTG ATT ATC GGT GAA AAA ACC TTT GGT AAG GGA AGC 1014
His Lys Arg Ala Val Ile Ile Gly Glu Lys Thr Phe Gly Lys Gly Ser
315 320 325

GTG CAG ATG CTG CTC CCT GTC AAT AAA GAC GAA GCC ATT AAA ATC ACA 1062

Val Gln Met Leu Leu Pro Val Asn Lys Asp Glu Ala Ile Lys Ile Thr
330 335 340 

WO 98/43478 PCT/US98/06371



Tyr Lys Ile Leu Thr Pro His Glu Gln Ile Gly Trp Val Lys Lys Asp

370 375 380 

Glu Val Lys 
385 

(2) INFORMATION FOR SEQ ID NO: 1059:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34... 1395
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059: 

TTCTAATCTC AAAAATGGGT GGTGTTATTA ACA ATG ACA AAA CGA CTT TTT AAA 54 

Met Thr Lys Arg Leu Phe Lys 
1 5 

GGG TTG TTA GCG GTT TCT CTT GCT GTG AGT TTG CAT GGT GGT GAA GTT 102 
Gly Leu Leu Ala Val Ser Leu Ala Val Ser Leu His Gly Gly Glu Val 
10 15 20 

AAG GAA AAA AAG CCG GTT AAG CCG GTT AAA GAA GAT CCG CAA GAA TTA 150

Lys Glu Lys Lys Pro Val Lys Pro Val Lys Glu Asp Pro Gln Glu Leu
25 30 35

GCG GCT AAA AGG GTG GAA GCG TTC AGT CGT TTC TCT AAT GTG GTT TCA 198

Ala Ala Lys Arg Val Glu Ala Phe Ser Arg Phe Ser Asn Val Val Ser
40 45 50 55

GAA ATT GAA AAA AAA TAT GTG GAT AAA ATC AGC ATT TCT GAG ATC ATG 246

Glu Ile Glu Lys Lys Tyr Val Asp Lys Ile Ser Ile Ser Glu Ile Met
60 65 70

ACT AAA GCG ATT GAA GGC TTG CTC TCT AAT TTG GAC GCG CAT TCA GCG 294

Thr Lys Ala Ile Glu Gly Leu Leu Ser Asn Leu Asp Ala His Ser Ala
75 80 85

TAT TTG AAT GAA AAG AAG TTT AAG GAA TTT CAA GCC CAA ACC GAG GGC 342

Tyr Leu Asn Glu Lys Lys Phe Lys Glu Phe Gln Ala Gln Thr Glu Gly
90 95 100

GAA TTT GGG GGG CTT GGG ATC ACG GTG GGC ATG CGC GAT GGC GTT TTA 390

Glu Phe Gly Gly Leu Gly Ile Thr Val Gly Met Arg Asp Gly Val Leu
105 110 115 
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(A) LENGTH: 387 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058: 

Met Gly Phe Leu Phe Glu Lys Ser Leu Met Ser Phe Phe Ala His Pro 

1 5 10 15

Ile Lys Ile Leu Lys Ile Ile Ser Leu Ile Leu Ser Phe Leu Val Ser

20 25 30

Phe Leu Val Ala Glu Asn Ala His Glu Pro Glu Glu Ile Lys Ala Lys

35 40 45

Val Ala Tyr Val Lys Ile Pro Gln Leu Glu Asp Leu Glu Asn Asn Pro

50 55 60

Val Tyr Ile Gly Gln Ile Ile Gly Val Thr Tyr Asp Leu Leu Leu Phe
65 70 75 80

Asp Ala Glu Phe Leu Glu Ala Lys Ile Lys Asp Gly Leu Asp Lys Thr

85 90 95

Gln Ile Glu Leu Leu Asn Lys Met Pro Lys Trp Lys Lys Val Glu Lys

100 105 110 

Glu Leu Phe Arg Ala Thr Tyr Tyr Tyr Lys Ile Lys Gly Ile Lys Ala

115 120 125

Ile Ile Pro Ser Leu Glu Val Ser Ala Phe Ser Asn Lys Asp Lys Tyr

130 135 140

Ile Asp His Ser Ile Ala Pro Lys Val Thr Leu Gln Val Thr Asp Leu
145 150 155 160

Ser Lys Asn Pro Arg Tyr Ala Asn Val Met Ala Lys Asp Leu Gln Val

165 170 175

Leu Gln Tyr Lys Thr Lys Asp Tyr Asp Asp Lys Asn Asn Ile Leu Val

180 185 190

Met Glu Ile Ala Phe Lys Glu Ala Thr Trp Glu Asp Phe His Ile Lys

195 200 205

Glu Ala Ile Lys Gln Gly Phe Asp Asn Ala Ser Leu Asn Gln Ile Lys

210 215 220

Ala Lys Glu Gly Ser Val Phe Tyr Tyr Cys Val Leu Pro Lys Thr Ile
225 230 235 240 

Gln Asn Leu Ser Phe Asp Tyr Phe Ser Leu Ser Asn Lys Gln Phe Lys

245 250 255

Thr Leu Ser Phe Ser Thr Ile Pro Thr Gln Asp Thr Thr Gly Ile Gln

260 265 270

Ser Asp Leu Ile Pro Lys Asn Asn Phe Leu Val Phe Ser Asn Val Ala

275 280 285

Leu Leu Ala Leu Cys Val Phe Phe Leu Val Leu Phe Phe Ile Phe Gly

290 295 300

Arg Lys Leu Ile Phe Leu Gly Leu Gly Ile Leu Cys Leu Gly Phe Val
305 310 315 320

Leu Tyr His Leu Leu Phe Thr Gln Lys Ser Ala Leu Leu Leu Ala His

325 330 335

Lys Lys Ile Arg Ile Leu Pro Thr Gln Asn Ser Thr Ile Leu Gly Leu

340 345 350

Ser Lys Asn Glu Met Pro Ile Lys Ile Leu Gly Ser His Asp Asp Tyr
355 360 365 
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190 195 200 

GAT TTT CAC ATC AAA GAA GCG ATC AAG CAA GGG TTT GAT AAC GCC TCT 675 
Asp Phe His Ile Lys Glu Ala Ile Lys Gln Gly Phe Asp Asn Ala Ser
205 210 215

TTA AAC CAG ATC AAG GCT AAA GAA GGG AGC GTT TTT TAT TAT TGC GTG 723

Leu Asn Gln Ile Lys Ala Lys Glu Gly Ser Val Phe Tyr Tyr Cys Val
220 225 230 235

TTG CCT AAG ACT ATT CAA AAC CTT TCT TTT GAT TAT TTC TCG CTT TCA 771
Leu Pro Lys Thr Ile Gln Asn Leu Ser Phe Asp Tyr Phe Ser Leu Ser
240 245 250

AAT AAG CAA TTT AAA ACC TTA TCT TTT TCA ACC ATT CCC ACT CAA GAC 819
Asn Lys Gln Phe Lys Thr Leu Ser Phe Ser Thr Ile Pro Thr Gln Asp
255 260 265

ACT ACC GGT ATT CAA AGC GAT CTC ATC CCT AAA AAC AAT TTT TTA GTC 867

Thr Thr Gly Ile Gln Ser Asp Leu Ile Pro Lys Asn Asn Phe Leu Val
270 275 280 

TTT TCT AAT GTG GCG TTG CTC GCT TTG TGC GTG TTT TTC TTG GTG CTG 915 
Phe Ser Asn Val Ala Leu Leu Ala Leu Cys Val Phe Phe Leu Val Leu 
285 290 295 

TTT TTC ATT TTT GGG CGC AAA CTC ATT TTT TTA GGG CTT GGG ATT TTG 963 
Phe Phe Ile Phe Gly Arg Lys Leu Ile Phe Leu Gly Leu Gly Ile Leu
300 305 310 315

TGC TTA GGG TTT GTT TTG TAT CAC CTT TTA TTC ACG CAA AAA TCA GCC 1011
Cys Leu Gly Phe Val Leu Tyr His Leu Leu Phe Thr Gln Lys Ser Ala
320 325 330

CTA TTG CTC GCT CAT AAA AAA ATC CGC ATT CTG CCC ACG CAA AAT TCC 1059

Leu Leu Leu Ala His Lys Lys Ile Arg Ile Leu Pro Thr Gln Asn Ser
335 340 345

ACC ATT TTA GGG CTT TCT AAA AAT GAA ATG CCG ATT AAA ATC TTA GGC 1107
Thr Ile Leu Gly Leu Ser Lys Asn Glu Met Pro Ile Lys Ile Leu Gly
350 355 360

TCG CAT GAT GAT TAT TAT AAA ATC CTA ACG CCG CAT GAA CAA ATA GGA 1155

Ser His Asp Asp Tyr Tyr Lys Ile Leu Thr Pro His Glu Gln Ile Gly
365 370 375

TGG GTC AAA AAA GAT GAA GTC AAA TAAAAAGTCC AATCGTTTAA GAGCGATTTA 1209
Trp Val Lys Lys Asp Glu Val Lys
380 385

TAGAGCTTTA GTGATCGCTA TAGGACTAGC TGT 1242

(2) INFORMATION FOR SEQ ID NO: 1058:
(i) SEQUENCE CHARACTERISTICS:
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1057: 

AAGAAGAAAT AAAAACTC ATG GGG TTT TTA TTT GAA AAA TCG TTA ATG AGT 51 

Met Gly Phe Leu Phe Glu Lys Ser Leu Met Ser 
1 5 10

TTT TTC GCT CAT CCA ATC AAA ATC CTT AAA ATC ATC AGT TTG ATT TTA 99
Phe Phe Ala His Pro Ile Lys Ile Leu Lys Ile Ile Ser Leu Ile Leu
15 20 25

AGT TTT TTG GTA AGC TTT TTG GTT GCT GAA AAC GCT CAT GAG CCA GAA 147

Ser Phe Leu Val Ser Phe Leu Val Ala Glu Asn Ala His Glu Pro Glu
30 35 40

GAA ATC AAG GCT AAA GTG GCT TAT GTG AAA ATC CCC CAA TTA GAA GAT 195
Glu Ile Lys Ala Lys Val Ala Tyr Val Lys Ile Pro Gln Leu Glu Asp
45 50 55

TTG GAA AAC AAC CCG GTT TAT ATC GGT CAA ATT ATA GGC GTA ACT TAT 243

Leu Glu Asn Asn Pro Val Tyr Ile Gly Gln Ile Ile Gly Val Thr Tyr
60 65 70 75

GAT TTA TTG CTG TTT GAC GCT GAG TTT TTG GAA GCC AAA ATC AAA GAC 291

Asp Leu Leu Leu Phe Asp Ala Glu Phe Leu Glu Ala Lys Ile Lys Asp
80 85 90

GGG TTG GAT AAA ACC CAA ATT GAG CTT TTA AAC AAG ATG CCT AAA TGG 339

Gly Leu Asp Lys Thr Gln Ile Glu Leu Leu Asn Lys Met Pro Lys Trp
95 100 105 

AAA AAG GTG GAA AAA GAG CTT TTC AGA GCG ACT TAT TAT TAC AAG ATT 387

Lys Lys Val Glu Lys Glu Leu Phe Arg Ala Thr Tyr Tyr Tyr Lys Ile
110 115 120

AAG GGC ATA AAA GCG ATT ATT CCG TCC TTA GAA GTG AGC GCG TTT TCC 435

Lys Gly Ile Lys Ala Ile Ile Pro Ser Leu Glu Val Ser Ala Phe Ser
125 130 135

AAT AAA GAC AAA TAC ATA GAT CAT TCC ATA GCC CCA AAA GTT ACT TTG 483

Asn Lys Asp Lys Tyr Ile Asp His Ser Ile Ala Pro Lys Val Thr Leu
140 145 150 155

CAG GTA ACG GAT TTG TCC AAA AAC CCT CGT TAT GCG AAT GTC ATG GCT 531
Gln Val Thr Asp Leu Ser Lys Asn Pro Arg Tyr Ala Asn Val Met Ala
160 165 170

AAA GAT TTA CAA GTC TTG CAA TAC AAA ACC AAA GAT TAT GAC GAT AAA 579

Lys Asp Leu Gln Val Leu Gln Tyr Lys Thr Lys Asp Tyr Asp Asp Lys
175 180 185

AAC AAT ATT TTG GTG ATG GAA ATA GCG TTC AAA GAA GCC ACT TGG GAA 627

Asn Asn Ile Leu Val Met Glu Ile Ala Phe Lys Glu Ala Thr Trp Glu
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1056: 



Met Lys Leu Phe Asp Tyr Ala Pro Leu Ser Leu Ala Trp Arg Glu Phe 

1 5 10 15

Leu Gln Ser Glu Phe Lys Lys Pro Tyr Phe Leu Glu Ile Glu Lys Arg

20 25 30

Tyr Leu Glu Ala Leu Lys Ile Pro Lys Thr Ile Phe Pro Lys Ser Ser

35 40 45

Asn Leu Phe Tyr Ala Leu Asn Leu Thr Pro Pro Cys Ala Val Lys Ile

50 55 60

Ile Leu Leu Gly Gln Asp Pro Tyr His Ser Thr Tyr Leu Glu Asn Asp
65 70 75 80

Gln Glu Leu Pro Val Ala Met Gly Leu Ser Phe Ser Val Glu Lys Asn

85 90 95

Ala Pro Ile Pro Pro Ser Leu Lys Asn Ile Phe Lys Glu Leu His Ala

100 105 110 

Asn Leu Gly Val Pro Val Pro Cys Cys Gly Asp Leu Ser Ala Trp Ala 

115 120 125 

Lys Arg Gly Met Leu Leu Leu Asn Ala Ile Leu Ser Val Glu Lys Asn

130 135 140

Gln Ala Ala Ser His Gln Tyr Ile Gly Trp Glu Ala Phe Ser Asp Gln
145 150 155 160

Ile Leu Met Arg Leu Phe Glu Thr Thr Ala Pro Leu Ile Val Val Leu

165 170 175

Leu Gly Lys Val Ala Gln Lys Lys Ile Ala Leu Ile Pro Lys Asn Lys

180 185 190

His Ile Ile Ile Thr Ala Pro His Pro Ser Pro Leu Ser Arg Gly Phe

195 200 205

Leu Gly Ser Gly Val Phe Thr Ser Val Gln Lys Ala Tyr Arg Glu Val

210 215 220 

Tyr Arg Lys Asp Phe Asp Phe Ser Leu 
225 230 



(2) INFORMATION FOR SEQ ID NO: 1057: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 1179 
(D) OTHER INFORMATION: 
Phe Leu Glu Ile Glu Lys Arg Tyr Leu Glu Ala Leu Lys Ile Pro Lys
30 35 40 

ACC ATT TTC CCT AAA AGC TCT AAT CTG TTT TAT GCG CTC AAT CTA ACG 195
Thr Ile Phe Pro Lys Ser Ser Asn Leu Phe Tyr Ala Leu Asn Leu Thr
45 50 55

CCC CCT TGT GCG GTT AAA ATC ATC CTT TTA GGG CAA GAC CCC TAC CAT 243

Pro Pro Cys Ala Val Lys Ile Ile Leu Leu Gly Gln Asp Pro Tyr His
60 65 70

TCC ACC TAC CTA GAA AAT GAT CAA GAA TTG CCG GTG GCG ATG GGG TTG 291

Ser Thr Tyr Leu Glu Asn Asp Gln Glu Leu Pro Val Ala Met Gly Leu
75 80 85

AGC TTT AGC GTG GAA AAA AAC GCC CCT ATT CCT CCA AGT TTA AAA AAT 339

Ser Phe Ser Val Glu Lys Asn Ala Pro Ile Pro Pro Ser Leu Lys Asn
90 95 100 105

ATT TTT AAA GAA TTG CAT GCG AAT TTA GGC GTG CCT GTG CCT TGT TGT 387

Ile Phe Lys Glu Leu His Ala Asn Leu Gly Val Pro Val Pro Cys Cys
110 115 120 

GGG GAT TTG AGC GCA TGG GCT AAA AGG GGC ATG CTC TTA TTG AAC GCC 435 
Gly Asp Leu Ser Ala Trp Ala Lys Arg Gly Met Leu Leu Leu Asn Ala 
125 130 135 

ATT TTA AGC GTG GAA AAA AAC CAA GCC GCT TCG CAC CAA TAT ATT GGC 483

Ile Leu Ser Val Glu Lys Asn Gln Ala Ala Ser His Gln Tyr Ile Gly
140 145 150

TGG GAA GCT TTT AGC GAT CAA ATA CTG ATG CGC CTT TTT GAA ACG ACC 531
Trp Glu Ala Phe Ser Asp Gln Ile Leu Met Arg Leu Phe Glu Thr Thr
155 160 165

GCC CCT TTA ATC GTG GTG TTA CTA GGG AAA GTC GCC CAA AAA AAG ATC 579

Ala Pro Leu Ile Val Val Leu Leu Gly Lys Val Ala Gln Lys Lys Ile
170 175 180 185

GCG TTA ATC CCC AAA AAC AAA CAC ATC ATC ATC ACA GCC CCT CAC CCT 627

Ala Leu Ile Pro Lys Asn Lys His Ile Ile Ile Thr Ala Pro His Pro
190 195 200 

AGC CCA CTA TCT AGG GGG TTT TTA GGG AGT GGG GTT TTT ACA AGC GTT 675 
Ser Pro Leu Ser Arg Gly Phe Leu Gly Ser Gly Val Phe Thr Ser Val 
205 210 215 

CAA AAA GCT TAT AGA GAG GTT TAT CGC AAG GAT TTT GAT TTT AGT TTA T 724 
Gln Lys Ala Tyr Arg Glu Val Tyr Arg Lys Asp Phe Asp Phe Ser Leu
220 225 230

GATTGATGCT TAATGAGACA GAACCCCTTA AGAATGCCTT TATTTAAGAG CAT 777

(2) INFORMATION FOR SEQ ID NO: 1056:
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1054: 



Met Ser Val Ser His Val Ala Leu Ile Leu Arg Lys Leu Phe Tyr His

1 5 10 15

Arg Gln Gly Val Phe Met Gly Gly Phe Ser Val Gly Met Leu Lys Asp

20 25 30

Tyr Val Asp Ile Phe Val Phe Ala Val Leu Gly Val Ala Ser Phe Leu

35 40 45

Ala Leu Trp Phe Ala Ile Glu Arg Val Ile Phe Tyr Ser Lys Val Asp

50 55 60

Leu Lys Ala Tyr Asp Asp Ile Asp Ala Leu Asn Leu Asp Leu Thr Lys
65 70 75 80

Asn Leu Thr Ile Leu Tyr Val Ile Phe Ser Asn Ala Pro Tyr Val Gly

85 90 95

Leu Leu Gly Thr Val Leu Gly Ile Met Val Ile Phe Tyr Asp Met Gly

100 105 110

Val Ser Gly Gly Met Asp Ala Lys Thr Ile Met Val Gly Leu Ser Leu

115 120 125

Ala Leu Lys Ala Thr Ala Leu Gly Leu Ala Val Ala Ile Pro Thr Leu

130 135 140

Ile Ala Tyr Asn Ser Leu Leu Arg Lys Ser Asp Val Leu Ser Glu Lys
145 150 155 160

Phe Arg Ile Met Lys Lys
165 



(2) INFORMATION FOR SEQ ID NO: 1055: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25... 723 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055: 

TCATGAATTA AACCCTAGCG AACA ATG AAG CTT TTT GAC TAC GCT CCT TTG 51 

Met Lys Leu Phe Asp Tyr Ala Pro Leu 
1 5 

AGT TTG GCT TGG CGG GAG TTT TTG CAA AGC GAA TTT AAA AAG CCT TAT 99 
Ser Leu Ala Trp Arg Glu Phe Leu Gln Ser Glu Phe Lys Lys Pro Tyr
10 15 20 25 

TTT TTA GAA ATA GAA AAA CGC TAC CTA GAA GCC CTA AAA ATC CCT AAA 147






(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1053: 

GCATGCTCTT AGAGT ATG TCT GTA TCG CAT GTT GCT TTA ATC TTA AGG AAA 51 
Met Ser Val Ser His Val Ala Leu Ile Leu Arg Lys
1 5 10

TTG TTT TAT CAT AGA CAA GGA GTT TTT ATG GGC GGT TTT TCA GTG GGA 99

Leu Phe Tyr His Arg Gln Gly Val Phe Met Gly Gly Phe Ser Val Gly
15 20 25

ATG TTG AAA GAT TAT GTG GAC ATA TTT GTT TTT GCG GTG CTT GGC GTG 147

Met Leu Lys Asp Tyr Val Asp Ile Phe Val Phe Ala Val Leu Gly Val
30 35 40

GCC AGT TTT TTA GCT TTG TGG TTT GCG ATT GAA AGG GTT ATT TTT TAT 195
Ala Ser Phe Leu Ala Leu Trp Phe Ala Ile Glu Arg Val Ile Phe Tyr
45 50 55 60

TCT AAA GTC GAT TTG AAA GCT TAT GAC GAT ATA GAT GCC CTG AAT TTG 243

Ser Lys Val Asp Leu Lys Ala Tyr Asp Asp Ile Asp Ala Leu Asn Leu
65 70 75

GAT TTA ACC AAG AAT CTA ACC ATT CTC TAT GTG ATT TTT TCT AAC GCG 291

Asp Leu Thr Lys Asn Leu Thr Ile Leu Tyr Val Ile Phe Ser Asn Ala
80 85 90 

CCT TAT GTG GGC TTA TTA GGG ACG GTT TTA GGG ATT ATG GTG ATT TTC 339

Pro Tyr Val Gly Leu Leu Gly Thr Val Leu Gly Ile Met Val Ile Phe
95 100 105

TAT GAC ATG GGC GTG AGC GGC GGG ATG GAC GCT AAA ACG ATC ATG GTA 387

Tyr Asp Met Gly Val Ser Gly Gly Met Asp Ala Lys Thr Ile Met Val
110 115 120

GGT TTG TCT TTG GCT TTA AAA GCG ACC GCT CTA GGG CTT GCT GTG GCG 435

Gly Leu Ser Leu Ala Leu Lys Ala Thr Ala Leu Gly Leu Ala Val Ala
125 130 135 140

ATT CCC ACT TTG ATC GCT TAT AAT AGC TTG TTG AGA AAA TCC GAT GTT 483

Ile Pro Thr Leu Ile Ala Tyr Asn Ser Leu Leu Arg Lys Ser Asp Val
145 150 155

TTG AGC GAA AAA TTC AGG ATC ATG AAA AAA TGAAAAGCAT CAGAAGAGGC GAT 536
Leu Ser Glu Lys Phe Arg Ile Met Lys Lys
160 165

GGGC 540

(2) INFORMATION FOR SEQ ID NO: 1054:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID NO:1052: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1052: 



Met Ile Phe Tyr Arg Lys Glu Ala Thr Met Asn Ala Leu Lys Lys Leu

1 5 10 15

Ser Phe Cys Ala Leu Leu Ser Leu Gly Leu Phe Ala Gln Thr Ala His

20 25 30

Ala Lys His Leu Lys Gly Thr Ile Asn Tyr Pro Asp Trp Leu Glu Ile

35 40 45

Asn Phe Phe Asp Glu Lys Asn Pro Pro Asn Gln Tyr Val Gly Ser Ala

50 55 60

Ser Ile Ser Gly Lys Arg Asn Asp Phe Tyr Ala Asn Tyr Ile Pro Tyr
65 70 75 80

Asp Asp Gln Leu Pro Pro Glu Gln Asn Ala Glu Lys Ile Ala Leu Leu

85 90 95

Arg Ala Arg Ile Asn Ala Tyr Ser Thr Leu Glu Ser Ile Leu Leu Thr

100 105 110

Lys Met His Asn Arg Ile Val Lys Val Leu Gln Val Lys Asn Asn Val

115 120 125

Ile Ser His Leu Phe Gly Leu Val Asp Phe Leu Thr Ser Lys Ser Ile

130 135 140

Leu Ala Lys Arg Phe Val Asp Thr Thr Asn His Arg Val Tyr Val Met
145 150 155 160

Val Gln Phe Pro Phe Ile Gln Pro Glu Asp Leu Ile Ala Tyr Phe Lys

165 170 175

Ala Lys Arg Ile Asp Leu Ser Ser Ala Ser Ala Thr His Leu Ser Ala

180 185 190 

Leu Leu Asn Lys Ala Leu Phe His Leu 
195 200 



(2) INFORMATION FOR SEQ ID NO: 1053: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16... 513 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1051: 

AAGCGTAGGG TGTTTTTA ATG ATT TTT TAT AGA AAG GAA GCT ACA ATG AAC 51 

Met Ile Phe Tyr Arg Lys Glu Ala Thr Met Asn
1 5 10

GCA TTG AAA AAA TTA AGT TTC TGC GCC TTG TTA TCC CTA GGC CTC TTC 99

Ala Leu Lys Lys Leu Ser Phe Cys Ala Leu Leu Ser Leu Gly Leu Phe
15 20 25

GCT CAA ACA GCG CAT GCT AAG CAT TTA AAG GGC ACG ATT AAC TAT CCT 147

Ala Gln Thr Ala His Ala Lys His Leu Lys Gly Thr Ile Asn Tyr Pro
30 35 40

GAT TGG CTT GAA ATC AAT TTT TTT GAC GAA AAA AAC CCG CCC AAT CAA 195
Asp Trp Leu Glu Ile Asn Phe Phe Asp Glu Lys Asn Pro Pro Asn Gln
45 50 55

TAT GTC GGA TCG GCT TCA ATT TCT GGT AAA AGG AAC GAT TTT TAC GCC 243

Tyr Val Gly Ser Ala Ser Ile Ser Gly Lys Arg Asn Asp Phe Tyr Ala
60 65 70 75

AAT TAC ATC CCC TAT GAT GAC CAA TTG CCC CCT GAA CAA AAC GCT GAA 291

Asn Tyr Ile Pro Tyr Asp Asp Gln Leu Pro Pro Glu Gln Asn Ala Glu
80 85 90 

AAA ATC GCT CTT TTA AGG GCC AGA ATA AAC GCT TAC AGC ACT TTA GAG 339

Lys Ile Ala Leu Leu Arg Ala Arg Ile Asn Ala Tyr Ser Thr Leu Glu
95 100 105

AGC ATT TTA CTC ACT AAA ATG CAC AAT CGT ATT GTT AAG GTG CTT CAA 387

Ser Ile Leu Leu Thr Lys Met His Asn Arg Ile Val Lys Val Leu Gln
110 115 120

GTT AAA AAT AAT GTT ATC AGC CAT TTA TTC GGG CTT GTT GAT TTT TTA 435
Val Lys Asn Asn Val Ile Ser His Leu Phe Gly Leu Val Asp Phe Leu
125 130 135

ACC TCT AAA TCC ATT TTG GCT AAA AGG TTC GTG GAT ACC ACA AAT CAT 483

Thr Ser Lys Ser Ile Leu Ala Lys Arg Phe Val Asp Thr Thr Asn His
140 145 150 155

CGT GTG TAT GTC ATG GTG CAA TTC CCT TTC ATT CAG CCT GAA GAC TTG 531
Arg Val Tyr Val Met Val Gln Phe Pro Phe Ile Gln Pro Glu Asp Leu
160 165 170

ATC GCT TAC TTT AAA GCC AAA CGC ATC GAC CTT TCT TCA GCG AGC GCT 579

Ile Ala Tyr Phe Lys Ala Lys Arg Ile Asp Leu Ser Ser Ala Ser Ala
175 180 185

ACC CAT CTC AGC GCC CTT TTA AAT AAG GCG TTG TTC CAC CTC TAAGAGTTT 630
Thr His Leu Ser Ala Leu Leu Asn Lys Ala Leu Phe His Leu
190 195 200

GGGATTTAAG ATGCGGTTT 649
CGT AAG GGT CTT CCT GTG AGA GGT CAA ACC ACT AAA AAT AAC GCT AGG 339

Arg Lys Gly Leu Pro Val Arg Gly Gln Thr Thr Lys Asn Asn Ala Arg
95 100 105 

ACT CGT AAG GGT AAG AAA AAA ACC GTG GGT AGC AAG TAGCGAATAA GGAGAT 391
Thr Arg Lys Gly Lys Lys Lys Thr Val Gly Ser Lys 
110 115 120 

GATGATTTAA TGGCTAAGAG AAATGTAACG GC 423

(2) INFORMATION FOR SEQ ID NO: 1050: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1050: 

Met Ala Arg Ile Ala Gly Val Asp Leu Pro Lys Lys Lys Arg Val Glu

1 5 10 15

Tyr Ala Leu Thr Tyr Ile Tyr Gly Ile Gly Leu Lys Ser Ser Arg Glu

20 25 30

Ile Leu Glu Ala Val Gly Ile Ser Phe Asp Lys Arg Val His Glu Leu

35 40 45

Ser Glu Asp Glu Val Ser Ser Ile Ala Lys Lys Ile Gln Gln Ser Tyr

50 55 60

Leu Val Glu Gly Asp Leu Arg Lys Lys Val Gln Met Asp Ile Lys Ser
65 70 75 80

Leu Met Asp Leu Gly Asn Tyr Arg Gly Ile Arg His Arg Lys Gly Leu

85 90 95

Pro Val Arg Gly Gln Thr Thr Lys Asn Asn Ala Arg Thr Arg Lys Gly

100 105 110 

Lys Lys Lys Thr Val Gly Ser Lys 
115 120 

(2) INFORMATION FOR SEQ ID NO: 1051: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 649 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 621 
(D) OTHER INFORMATION: 
260 265 270

Cys Leu Asp Lys Ile Gly Ile Lys Tyr Val Gly Glu Leu Val Leu Met

275 280 285

Ser Glu Glu Glu Leu Lys Gly Val Lys Asn Met Gly Lys Lys Ser Tyr

290 295 300

Asp Glu Ile Ala Glu Lys Leu Asn Asp Leu Gly Tyr Pro Val Gly Thr
305 310 315 320

Glu Leu Ser Pro Glu Gln Arg Glu Ser Leu Lys Lys Arg Leu Glu Lys

325 330 335

Leu Glu Asp Lys Gly Gly Asn Asp
340

(2) INFORMATION FOR SEQ ID NO: 1049:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16... 375 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1049: 

AGACAAGGAT AAAGC ATG GCA AGG ATT GCT GGT GTA GAT TTA CCA AAA AAG 51 
Met Ala Arg Ile Ala Gly Val Asp Leu Pro Lys Lys
1 5 10

AAG AGA GTG GAG TAT GCC CTT ACC TAT ATT TAT GGG ATT GGG CTT AAG 99

Lys Arg Val Glu Tyr Ala Leu Thr Tyr Ile Tyr Gly Ile Gly Leu Lys
15 20 25

AGT TCC AGA GAG ATT TTA GAA GCG GTA GGC ATT TCT TTT GAC AAG CGC 147

Ser Ser Arg Glu Ile Leu Glu Ala Val Gly Ile Ser Phe Asp Lys Arg
30 35 40

GTG CAT GAA TTG AGC GAA GAT GAA GTG TCT AGC ATC GCT AAA AAA ATC 195
Val His Glu Leu Ser Glu Asp Glu Val Ser Ser Ile Ala Lys Lys Ile
45 50 55 60

CAA CAA AGC TAC CTA GTA GAG GGC GAT TTG CGT AAA AAA GTT CAA ATG 243

Gln Gln Ser Tyr Leu Val Glu Gly Asp Leu Arg Lys Lys Val Gln Met
65 70 75

GAT ATT AAA TCT TTA ATG GAC TTG GGG AAT TAT CGT GGG ATC AGG CAT 291

Asp Ile Lys Ser Leu Met Asp Leu Gly Asn Tyr Arg Gly Ile Arg His
80 85 90 
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CCG GTA GGC ACA GAA TTA AGC CCT GAA CAA AGA GAG AGT TTA AAG AAA 1012 
Pro Val Gly Thr Glu Leu Ser Pro Glu Gln Arg Glu Ser Leu Lys Lys
320 325 330 

AGA TTA GAA AAA TTA GAA GAT AAA GGA GGT AAC GAC TGATGAGACA CAAACA 1064 
Arg Leu Glu Lys Leu Glu Asp Lys Gly Gly Asn Asp 
335 340 

CGGATACCGC AAGCTTGGGA GAACCAGCTC GCACAGAAAG GC 1106 

(2) INFORMATION FOR SEQ ID NO: 1048: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1048:

Met Lys Val Ile Lys Thr Ala Pro Leu Ile Pro Ser Glu Ile Lys Val

1 5 10 15

Leu Glu Lys Glu Gly Asn Arg Val Lys Ile Ser Leu Ala Pro Phe Glu

20 25 30

Phe Gly Tyr Ala Val Thr Leu Ala His Pro Ile Arg Arg Leu Leu Leu

35 40 45

Leu Ser Ser Val Gly Tyr Ala Pro Val Gly Leu Lys Ile Glu Gly Val

50 55 60

His His Glu Phe Asp Ser Leu Arg Gly Val Thr Glu Asp Val Ser Leu
65 70 75 80

Phe Ile Met Asn Leu Lys Asn Ile Arg Phe Ile Ala Lys Ala Leu Val

85 90 95

Gly Gln Asp Ser Ser Leu Glu Asn Gln Ser Val Val Val Asp Tyr Ser

100 105 110

Phe Lys Gly Pro Met Glu Leu Arg Ala Arg Asp Leu Asn Ser Glu Gln

115 120 125

Ile Glu Ile Val Asn Pro Glu Met Pro Leu Ala Thr Ile Asn Glu Asp

130 135 140

Ala Gln Leu Asn Phe Ser Leu Ile Ile Tyr Lys Gly Met Gly Tyr Val
145 150 155 160 

Pro Ser Glu Asn Thr Arg Glu Leu Met Pro Glu Gly Tyr Met Pro Leu 

165 170 175 

Asp Gly Ser Phe Thr Pro Ile Lys Lys Val Val Tyr Glu Ile Glu Asn

180 185 190

Val Leu Val Glu Gly Asp Pro Asn Tyr Glu Lys Ile Ile Phe Asp Ile

195 200 205

Glu Thr Asp Gly Gln Ile Asp Pro Tyr Lys Ala Phe Leu Ser Ala Val

210 215 220

Lys Val Met Ser Lys Gln Leu Gly Val Phe Gly Glu Arg Pro Ile Ala
225 230 235 240

Asn Thr Glu Tyr Ser Gly Asp Tyr Ala Gln Arg Asp Asp Ala Lys Asp

245 250 255

Leu Ser Ala Lys Ile Glu Ser Met Asn Leu Ser Ala Arg Cys Phe Asn
Lys Ala Leu Val Gly Gln Asp Ser Ser Leu Glu Asn Gln Ser Val Val
95 100 105 

GTG GAT TAT TCT TTT AAA GGG CCT ATG GAG CTT AGG GCT AGG GAT TTG 388

Val Asp Tyr Ser Phe Lys Gly Pro Met Glu Leu Arg Ala Arg Asp Leu 
110 115 120 

AAT TCT GAG CAG ATA GAA ATC GTC AAT CCG GAA ATG CCC CTA GCG ACA 436

Asn Ser Glu Gln Ile Glu Ile Val Asn Pro Glu Met Pro Leu Ala Thr
125 130 135 140

ATC AAT GAA GAC GCT CAA TTG AAT TTT TCG CTC ATT ATT TAT AAA GGA 484

Ile Asn Glu Asp Ala Gln Leu Asn Phe Ser Leu Ile Ile Tyr Lys Gly
145 150 155

ATG GGG TAT GTC CCA AGC GAA AAC ACA AGG GAA TTG ATG CCT GAG GGC 532

Met Gly Tyr Val Pro Ser Glu Asn Thr Arg Glu Leu Met Pro Glu Gly 
160 165 170 

TAC ATG CCG CTA GAC GGC TCT TTC ACG CCG ATT AAA AAG GTC GTT TAT 580

Tyr Met Pro Leu Asp Gly Ser Phe Thr Pro Ile Lys Lys Val Val Tyr
175 180 185 

GAG ATT GAA AAC GTT CTG GTT GAG GGC GAT CCC AAC TAT GAA AAA ATC 628

Glu Ile Glu Asn Val Leu Val Glu Gly Asp Pro Asn Tyr Glu Lys Ile
190 195 200

ATT TTT GAT ATT GAA ACA GAC GGG CAG ATT GAC CCT TAT AAA GCG TTT 676

Ile Phe Asp Ile Glu Thr Asp Gly Gln Ile Asp Pro Tyr Lys Ala Phe
205 210 215 220 

TTA TCA GCG GTG AAA GTG ATG AGC AAG CAA TTG GGT GTT TTT GGC GAA 724 
Leu Ser Ala Val Lys Val Met Ser Lys Gln Leu Gly Val Phe Gly Glu
225 230 235 

AGA CCC ATT GCT AAC ACG GAG TAT TCA GGC GAT TAC GCT CAA AGA GAT 772

Arg Pro Ile Ala Asn Thr Glu Tyr Ser Gly Asp Tyr Ala Gln Arg Asp
240 245 250

GAC GCT AAA GAC TTG AGC GCT AAG ATT GAA AGC ATG AAT TTG AGC GCT 820

Asp Ala Lys Asp Leu Ser Ala Lys Ile Glu Ser Met Asn Leu Ser Ala
255 260 265

AGG TGT TTT AAT TGC TTG GAT AAA ATC GGC ATC AAG TAT GTG GGC GAA 868

Arg Cys Phe Asn Cys Leu Asp Lys Ile Gly Ile Lys Tyr Val Gly Glu
270 275 280 

CTC GTG TTG ATG AGC GAA GAA GAG CTT AAG GGC GTG AAA AAC ATG GGT 916 
Leu Val Leu Met Ser Glu Glu Glu Leu Lys Gly Val Lys Asn Met Gly 
285 290 295 300 

AAA AAA TCC TAT GAT GAA ATC GCT GAA AAA TTG AAT GAT TTG GGC TAT 964 
Lys Lys Ser Tyr Asp Glu Ile Ala Glu Lys Leu Asn Asp Leu Gly Tyr
305 310 315 
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130 135 140 

Gln Ile Ser Leu Phe Ser Leu Asp Leu Lys Ala Gln Phe Thr Ser Lys
145 150 155 160 

Asn Leu Lys Tyr Pro Leu Lys Asn Leu Arg Leu Lys Thr Leu Phe Ser 

165 170 175 

Gly Ser Leu Asn Glu Ala Thr Asp Ser Tyr Phe Ser Leu Ser Ser Thr 

180 185 190 

Pro Lys Ser Val Val Leu Val Tyr Gln Lys Phe Leu
195 200 



(2) INFORMATION FOR SEQ ID NO: 1047:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 17... 1048 
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047: 

AGTAAGGGGT TAGAGC ATG AAA GTT ATC AAA ACA GCA CCT TTG ATC CCA TCA 52 

Met Lys Val Ile Lys Thr Ala Pro Leu Ile Pro Ser
1 5 10

GAA ATT AAG GTG CTA GAG AAA GAG GGC AAT CGG GTT AAG ATT TCT CTG 100
Glu Ile Lys Val Leu Glu Lys Glu Gly Asn Arg Val Lys Ile Ser Leu
15 20 25

GCT CCA TTT GAG TTT GGT TAC GCT GTT ACG CTC GCT CAT CCT ATT AGA 148

Ala Pro Phe Glu Phe Gly Tyr Ala Val Thr Leu Ala His Pro Ile Arg
30 35 40 

AGG CTC TTG CTT TTA AGC TCT GTG GGG TAT GCT CCT GTA GGT TTA AAG 196 
Arg Leu Leu Leu Leu Ser Ser Val Gly Tyr Ala Pro Val Gly Leu Lys 
45 50 55 60 

ATT GAA GGC GTC CAT CAT GAG TTT GAC TCT TTA AGG GGG GTT ACT GAA 244 
Ile Glu Gly Val His His Glu Phe Asp Ser Leu Arg Gly Val Thr Glu
65 70 75

GAC GTG TCG CTT TTT ATC ATG AAT TTA AAA AAT ATC CGC TTT ATA GCC 292

Asp Val Ser Leu Phe Ile Met Asn Leu Lys Asn Ile Arg Phe Ile Ala
80 85 90 

AAG GCG TTA GTG GGG CAG GAT AGC TCT TTA GAA AAC CAA TCG GTT GTG 340
TTA TTG TTG GAA TAT TTT AAA TTT TGC CAA AAA ATC CAA GCC ATA AGC 387

Leu Leu Leu Glu Tyr Phe Lys Phe Cys Gln Lys Ile Gln Ala Ile Ser
110 115 120 

GAC TAT GGT CTT TTT AGG GTG TTA GAA ACC CCT TTC ACT TTG CCC AGT 435

Asp Tyr Gly Leu Phe Arg Val Leu Glu Thr Pro Phe Thr Leu Pro Ser 
125 130 135 140 

TTT AAA GGG GAA CAA ATC TCG CTT TTT AGC CTG GAT CTT AAA GCC CAA 483

Phe Lys Gly Glu Gln Ile Ser Leu Phe Ser Leu Asp Leu Lys Ala Gln
145 150 155 

TTC ACT TCT AAA AAC CTC AAA TAC CCC TTA AAA AAC TTG CGT TTA AAA 531 
Phe Thr Ser Lys Asn Leu Lys Tyr Pro Leu Lys Asn Leu Arg Leu Lys 
160 165 170 

ACG CTC TTT TCT GGC TCG CTC AAT GAA GCT ACA GAT AGT TAT TTT AGC 579

Thr Leu Phe Ser Gly Ser Leu Asn Glu Ala Thr Asp Ser Tyr Phe Ser 
175 180 185 

CTT AGC TCT ACA CCT AAA TCG GTG GTG TTG GTG TAT CAA AAA TTC TTA T 628
Leu Ser Ser Thr Pro Lys Ser Val Val Leu Val Tyr Gln Lys Phe Leu
190 195 200

AAGCGGGTTT TGTTAGGCAA GTTTTTGTCT GTATA 663

(2) INFORMATION FOR SEQ ID NO: 1046: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1046: 

Met Gln Ala Val Ile Leu Ala Asn Gly Glu Phe Pro Lys Ser Gln Lys

1 5 10 15

Cys Leu Asp Leu Leu Lys Asn Ala Pro Phe Leu Ile Ala Cys Asp Gly

20 25 30

Ala Val Thr Ser Leu His Ala Leu Gln Phe Lys Pro Ser Val Val Ile

35 40 45

Gly Asp Leu Asp Ser Ile Asp Ser His Leu Lys Ala Leu Tyr Asn Pro

50 55 60

Ile Arg Met Ser Glu Gln Asn Ser Asn Asp Leu Ser Lys Ala Phe Phe
65 70 75 80

Tyr Ala Leu Asn Lys Gly Cys Asp Asp Phe Ile Phe Leu Gly Leu Asn

85 90 95 

Gly Lys Arg Glu Asp His Ala Leu Ala Asn Thr Phe Leu Leu Leu Glu 

100 105 110 

Tyr Phe Lys Phe Cys Gln Lys Ile Gln Ala Ile Ser Asp Tyr Gly Leu

115 120 125 

Phe Arg Val Leu Glu Thr Pro Phe Thr Leu Pro Ser Phe Lys Gly Glu 
Ile Ser Lys Tyr Thr His Asn Leu Asp Gly Gln Met Phe Ala Leu Phe

50 55 60 

Ile Ile Ser Ile Ala Ala Ser Glu Val Ala Ile Gly Leu Gly Leu Val
65 70 75 80

Ile Leu Trp Phe Lys Lys Phe Lys Ser Leu Asp Ile Asp Ser Leu Asn
85 90 95 

Ala Met Lys Gly 
100 

(2) INFORMATION FOR SEQ ID NO: 1045: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16... 627
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045: 

CCGGTGGAAT AAGTC ATG CAA GCA GTG ATT TTA GCG AAT GGG GAG TTT CCT 51 
Met Gln Ala Val Ile Leu Ala Asn Gly Glu Phe Pro
1 5 10

AAA TCT CAA AAA TGC TTA GAC CTT TTA AAA AAC GCT CCC TTT TTA ATC 99
Lys Ser Gln Lys Cys Leu Asp Leu Leu Lys Asn Ala Pro Phe Leu Ile
15 20 25

GCA TGC GAT GGG GCT GTT ACC TCA TTA CAT GCG CTT CAA TTC AAA CCC 147

Ala Cys Asp Gly Ala Val Thr Ser Leu His Ala Leu Gln Phe Lys Pro
30 35 40 

AGC GTT GTT ATA GGC GAT CTA GAT AGC ATT GAT TCG CAT TTG AAA GCT 195 
Ser Val Val He Gly Asp Leu Asp Ser He Asp Ser His Leu Lys Ala 
45 50 55 60 

TTG TAT AAC CCT ATA CGC ATG AGT GAA CAA AAC AGC AAC GAT TTG TCC 243 
Leu Tyr Asn Pro He Arg Met Ser Glu Gin Asn Ser Asn Asp Leu Ser 
65 70 75 

AAA GCC TTT TTT TAT GCT TTA AAT AAA GGC TGT GAT GAC TTT ATT TTT 2 91 

Lys Ala Phe Phe Tyr Ala Leu Asn Lys Gly Cys Asp Asp Phe He Phe 
80 85 90 

TTA GGG TTG AAT GGC AAG CGA GAA GAT CAC GCT TTA GCG AAC ACT TTT 33 9 

Leu Gly Leu Asn Gly Lys Arg Glu Asp His Ala Leu Ala Asn Thr Phe 
95 100 105 
(B) LOCATION: 44...343
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043:

AAATCCATGG GAAAAATCAC ACGCAATTTA TAAAGGAATC TCT ATG ATA GGG TTA 55
Met Ile Gly Leu
1

AAC CAC TAT TTG ATT GTT TCA GGG TTG CTC TTT TGC ATT GGT TTA GCG 103
Asn His Tyr Leu Ile Val Ser Gly Leu Leu Phe Cys Ile Gly Leu Ala
5 10 15 20

GGC ATG CTG AAA CGC AAA AAC ATT CTG TTA CTC TTT TTT TCT ACA GAA 151
Gly Met Leu Lys Arg Lys Asn Ile Leu Leu Leu Phe Phe Ser Thr Glu
25 30 35

ATC ATG CTC AAT GCG ATC AAT ATC GGT TTT GTA GCG ATC TCT AAA TAC 199
Ile Met Leu Asn Ala Ile Asn Ile Gly Phe Val Ala Ile Ser Lys Tyr
40 45 50

ACG CAT AAT TTA GAC GGG CAG ATG TTT GCG CTC TTT ATT ATC TCT ATT 247
Thr His Asn Leu Asp Gly Gln Met Phe Ala Leu Phe Ile Ile Ser Ile
55 60 65

GCC GCT AGT GAG GTG GCT ATT GGT TTG GGC TTG GTG ATT TTG TGG TTT 295
Ala Ala Ser Glu Val Ala Ile Gly Leu Gly Leu Val Ile Leu Trp Phe
70 75 80

AAG AAA TTC AAA AGC TTA GAT ATT GAT TCT TTA AAC GCT ATG AAA GGT T 344
Lys Lys Phe Lys Ser Leu Asp Ile Asp Ser Leu Asn Ala Met Lys Gly
85 90 95 100

GAGCATGCAA TATTCTTCTT TGCTGTCAGT GGTG 378

(2) INFORMATION FOR SEQ ID NO:1044: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044:

Met Ile Gly Leu Asn His Tyr Leu Ile Val Ser Gly Leu Leu Phe Cys
1 5 10 15

Ile Gly Leu Ala Gly Met Leu Lys Arg Lys Asn Ile Leu Leu Leu Phe
20 25 30

Phe Ser Thr Glu Ile Met Leu Asn Ala Ile Asn Ile Gly Phe Val Ala
35 40 45

Glu Val Pro Thr Lys Thr Thr Tyr Leu Leu Leu Glu Glu Ala Gly Ile
515 520 525

Asn Leu Glu Thr Tyr Glu Lys Ile Leu Ala Leu Leu Gln Lys Ser Asn
530 535 540

Asn Thr Leu Leu Val Val Gly Glu Glu Ile Tyr Ser His Lys Gln Ala
545 550 555 560

His Asn Ile Ala Lys Met Leu Arg Leu Leu Ala Gln Lys Ser Ala Ile
565 570 575

Lys Leu Ile Leu Ile Pro Pro Ser Ala Asn Ala Leu Gly Ile Ala Ser
580 585 590

Ile Cys Gln Leu Ser Glu Glu Ile Phe Glu His Glu Lys Ile Val Gly
595 600 605

Ile Arg Ala Gln Gly Asp Phe Thr Ile Asn Ser Asp Asp Arg Val Phe
610 615 620

Gly Lys Asp Ala Ala Ser Lys Val Asp Phe Ile Leu Pro Ser Leu Asn
625 630 635 640

Gln Leu Glu Gly Thr Ile Thr Asn Ile Glu Gly Arg Val Leu Pro Leu
645 650 655

Lys Pro Ala Leu Arg Phe Glu Gly Tyr Asp Leu Ser Asp Ile Met Gln
660 665 670

Gly Phe Gly Phe Val Glu Glu Asn Leu Ile Glu Cys Thr His Lys Leu
675 680 685

Pro Thr Glu Ala Gly Phe Lys Ala Ile Glu Phe Asp Tyr Leu Thr Asn
690 695 700

Tyr Phe Ala Asn Asp Arg Val Asn His Arg Gly Tyr Leu Leu Gly Thr
705 710 715 720

Ser His Phe Glu Lys Ser Ala Lys Glu Cys Glu Thr Ile Glu Cys Glu
725 730 735

Pro Ile Lys Pro Leu Lys Glu Lys Ile Ala Phe Asn Ala Tyr Leu Lys
740 745 750

Tyr Pro Glu Thr Gln Phe Asn Asn Ala Thr Asn Lys Ser Glu Asn Leu
755 760 765

Gln Leu Lys Ala Gly Val Tyr Val Ser Lys Ala Phe Leu Lys Lys Leu
770 775 780

Asn Lys Glu Val Gly Gln Asn Ile Thr Leu Ser Lys Glu Glu Glu Glu
785 790 795 800

Leu Thr Gly Val Leu Tyr Leu Asp Glu Ser Leu Asp Gln Glu Val Phe
805 810 815

Val Ile Ser Pro Ser Leu Leu Lys Asn His Ser Gly Phe Phe Arg Glu
820 825 830

Gly Val Phe Asp Ser Val Asp Leu Lys Glu Gln Ala
835 840

(2) INFORMATION FOR SEQ ID NO: 1043:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 378 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

65 70 75 80

Arg Lys Ser Ile Met Gln Thr Tyr Asp Val Asn His Pro Leu Glu Cys
85 90 95

Gly Val Cys Asp Lys Ser Gly Glu Cys Glu Leu Gln Asp Met Thr His
100 105 110

Leu Thr Gly Val Glu His Gln Pro Tyr Ala Val Ala Asp Asp Phe Lys
115 120 125

Ala Leu Asp Phe Trp Ala Lys Ala Leu Tyr Asp Pro Asn Leu Cys Ile
130 135 140

Met Cys Glu Arg Cys Val Thr Thr Cys Lys Asp Asn Val Gly Glu Asn
145 150 155 160

Asn Leu Lys Ala Thr Lys Ala Asp Leu His Ala Pro Asp Lys Phe Lys
165 170 175

Asp Ser Met Ser Lys Asp Ala Phe Ser Val Trp Ser Arg Lys Gln Lys
180 185 190

Gly Ile Ile Ser Phe Val Gly Ser Val Pro Cys Tyr Asp Cys Gly Glu
195 200 205

Cys Ile Ala Val Cys Pro Val Gly Ala Leu Ser Tyr Lys Asp Phe Ala
210 215 220

Tyr Thr Ala Asn Ala Trp Glu Leu Lys Lys Ile His Ser Thr Cys Ser
225 230 235 240

His Cys Ser Ala Gly Cys Leu Ile Ser Tyr Asp Val Arg His Phe Asp
245 250 255

Thr Leu Gly Glu Glu Ser Lys Ile Phe Arg Val Leu Asn Asp Phe Tyr
260 265 270

His Asn Pro Ile Cys Gly Ala Gly Arg Phe Ala Phe Asp Val Ser Ser
275 280 285

Ser Pro Lys Gly Ser Ala Asn Leu Lys Glu Ala Gln Asn Ala Leu Lys
290 295 300

Glu Cys Glu Ala Val Arg Ile Gly Gly Asp Ile Thr Asn Glu Glu Ala
305 310 315 320

Phe Leu Ile Glu Arg Leu Arg Lys Glu Leu Asp Phe Lys Ile Tyr Asn
325 330 335

Gln Glu Ala Tyr Arg Phe Gln Gln Phe Leu Lys Val Leu Gly Glu Ile
340 345 350

Lys Arg Pro Ser Val Glu Glu Ile Lys Thr Ser His Leu Val Val Thr
355 360 365

Ile Gly Ser Ser Ile Lys Thr Glu Asn Pro Leu Val Arg Tyr Ala Ile
370 375 380

Asn Asn Ala Leu Lys Leu Asn Lys Ala Ser Leu Ile Ala Met His Pro
385 390 395 400

Ile Lys Asp Asn Ala Leu Ala Asn Leu Cys Arg Ser Ser Phe Cys Ile
405 410 415

Thr His Glu Val Gly Ala Glu Glu Ile Leu Leu Gly Met Leu Leu Lys
420 425 430

Met Leu Asn Ile Glu Ser Ala Ala Leu Lys Ser Leu Glu Asp Ser Lys
435 440 445

Gln Asn Ile Val Asp Glu Ala Ala Leu Lys Ala Leu Glu Glu Glu Arg
450 455 460

Lys Lys Ala Leu Glu Gln Ala Glu Gln Gly Cys Ser Ile Gly Glu Asn
465 470 475 480

Lys Ala Glu Asn Gln Glu Glu Asn Lys Thr Glu Ala Thr Thr Pro Lys
485 490 495

Glu Glu Asn Gln Glu Glu Asn Lys Thr Glu Val Lys Glu Glu Lys Ile
500 505 510

GAA AAG AGC GCT AAA GAA TGC GAA ACC ATA GAA TGC GAG CCT ATC AAG 2265
Glu Lys Ser Ala Lys Glu Cys Glu Thr Ile Glu Cys Glu Pro Ile Lys
725 730 735

CCT TTA AAA GAA AAA ATC GCT TTC AAC GCG TAT TTA AAA TAC CCA GAA 2313
Pro Leu Lys Glu Lys Ile Ala Phe Asn Ala Tyr Leu Lys Tyr Pro Glu
740 745 750 755

ACG CAA TTC AAT AAC GCT ACT AAT AAA AGC GAG AAT TTG CAA TTA AAA 2361
Thr Gln Phe Asn Asn Ala Thr Asn Lys Ser Glu Asn Leu Gln Leu Lys
760 765 770

GCC GGT GTC TAT GTG TCT AAA GCT TTC TTA AAG AAA TTG AAT AAA GAA 2409
Ala Gly Val Tyr Val Ser Lys Ala Phe Leu Lys Lys Leu Asn Lys Glu
775 780 785

GTG GGG CAA AAC ATC ACT TTA TCT AAA GAA GAA GAG GAA TTA ACA GGC 2457
Val Gly Gln Asn Ile Thr Leu Ser Lys Glu Glu Glu Glu Leu Thr Gly
790 795 800

GTT TTG TAT CTT GAT GAG AGC TTG GAT CAG GAA GTG TTT GTT ATC TCG 2505
Val Leu Tyr Leu Asp Glu Ser Leu Asp Gln Glu Val Phe Val Ile Ser
805 810 815

CCT TCT CTT TTG AAA AAC CAT TCT GGC TTT TTT AGA GAG GGC GTG TTT 2553
Pro Ser Leu Leu Lys Asn His Ser Gly Phe Phe Arg Glu Gly Val Phe
820 825 830 835

GAT AGC GTG GAT TTA AAG GAG CAA GCA TGAGCGCTTA TATCATTGAA ACCCTGA 2607
Asp Ser Val Asp Leu Lys Glu Gln Ala
840

TTAAAATTTT GATTTT 2623

(2) INFORMATION FOR SEQ ID NO: 1042: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042: 

Met Ile Thr Met Asn Ile Asn Gly Lys Thr Ile Glu Cys Gln Glu Gly
1 5 10 15

Gln Ser Val Leu Glu Ala Ala Arg Ser Ala Gly Ile Tyr Ile Pro Thr
20 25 30

Ile Cys Tyr Leu Ser Gly Cys Ser Pro Thr Val Ala Cys Lys Met Cys
35 40 45

Met Val Glu Met Asp Gly Lys Arg Val Tyr Ser Cys Asn Thr Lys Ala
50 55 60

Lys Asn Asn Ala Thr Ile Leu Thr Asn Thr Pro Thr Leu Met Asp Glu

CAA GAA GAA AAC AAG ACA GAG GTT AAA GAA GAA AAA ATT GAA GTC CCT 1593
Gln Glu Glu Asn Lys Thr Glu Val Lys Glu Glu Lys Ile Glu Val Pro
500 505 510 515

ACC AAA ACC ACT TAT TTG CTG CTT GAA GAA GCG GGC ATC AAT TTA GAA 1641
Thr Lys Thr Thr Tyr Leu Leu Leu Glu Glu Ala Gly Ile Asn Leu Glu
520 525 530

ACT TAT GAA AAA ATT CTG GCT CTT TTG CAA AAA TCA AAT AAC ACC CTG 1689
Thr Tyr Glu Lys Ile Leu Ala Leu Leu Gln Lys Ser Asn Asn Thr Leu
535 540 545

CTA GTG GTT GGC GAA GAA ATC TAT AGC CAT AAG CAA GCC CAC AAT ATC 1737
Leu Val Val Gly Glu Glu Ile Tyr Ser His Lys Gln Ala His Asn Ile
550 555 560

GCT AAA ATG TTG CGT TTG CTA GCC CAA AAA AGC GCT ATT AAA CTC ATT 1785
Ala Lys Met Leu Arg Leu Leu Ala Gln Lys Ser Ala Ile Lys Leu Ile
565 570 575

CTT ATC CCC CCA AGC GCC AAC GCT TTA GGC ATC GCT TCT ATT TGT CAA 1833
Leu Ile Pro Pro Ser Ala Asn Ala Leu Gly Ile Ala Ser Ile Cys Gln
580 585 590 595

TTG AGC GAA GAA ATT TTT GAA CAT GAA AAA ATT GTA GGC ATT CGC GCT 1881
Leu Ser Glu Glu Ile Phe Glu His Glu Lys Ile Val Gly Ile Arg Ala
600 605 610

CAA GGG GAT TTC ACT ATC AAT AGC GAT GAT AGG GTT TTT GGA AAA GAC 1929
Gln Gly Asp Phe Thr Ile Asn Ser Asp Asp Arg Val Phe Gly Lys Asp
615 620 625

GCT GCC AGC AAA GTG GAT TTT ATT TTA CCC AGT CTC AAC CAG CTA GAA 1977
Ala Ala Ser Lys Val Asp Phe Ile Leu Pro Ser Leu Asn Gln Leu Glu
630 635 640

GGC ACG ATC ACC AAT ATT GAA GGG CGT GTG TTG CCC TTA AAA CCG GCT 2025
Gly Thr Ile Thr Asn Ile Glu Gly Arg Val Leu Pro Leu Lys Pro Ala
645 650 655

TTG AGG TTT GAG GGC TAT GAT TTG AGC GAT ATT ATG CAA GGC TTT GGC 2073
Leu Arg Phe Glu Gly Tyr Asp Leu Ser Asp Ile Met Gln Gly Phe Gly
660 665 670 675

TTT GTG GAA GAA AAC CTC ATA GAA TGC ACC CAC AAA CTC CCT ACA GAA 2121
Phe Val Glu Glu Asn Leu Ile Glu Cys Thr His Lys Leu Pro Thr Glu
680 685 690

GCG GGC TTT AAA GCC ATA GAA TTT GAT TAT TTA ACC AAC TAT TTC GCT 2169
Ala Gly Phe Lys Ala Ile Glu Phe Asp Tyr Leu Thr Asn Tyr Phe Ala
695 700 705

AAC GAC AGA GTC AAC CAC AGA GGC TAT CTG CTA GGA ACA AGC CAT TTT 2217
Asn Asp Arg Val Asn His Arg Gly Tyr Leu Leu Gly Thr Ser His Phe
710 715 720

ATT TGT GGG GCA GGC CGT TTC GCT TTT GAT GTG AGC TCT AGC CCT AAA 921
Ile Cys Gly Ala Gly Arg Phe Ala Phe Asp Val Ser Ser Ser Pro Lys
280 285 290

GGC AGT GCT AAT CTT AAA GAA GCG CAA AAC GCC CTC AAA GAA TGC GAA 969
Gly Ser Ala Asn Leu Lys Glu Ala Gln Asn Ala Leu Lys Glu Cys Glu
295 300 305

GCG GTG CGA ATA GGT GGG GAT ATT ACG AAT GAA GAG GCG TTT TTA ATA 1017
Ala Val Arg Ile Gly Gly Asp Ile Thr Asn Glu Glu Ala Phe Leu Ile
310 315 320

GAG CGT TTA AGA AAA GAG CTT GAT TTT AAA ATC TAC AAT CAA GAA GCG 1065
Glu Arg Leu Arg Lys Glu Leu Asp Phe Lys Ile Tyr Asn Gln Glu Ala
325 330 335

TAT CGT TTC CAG CAA TTC TTA AAA GTA TTG GGC GAA ATT AAA CGC CCC 1113
Tyr Arg Phe Gln Gln Phe Leu Lys Val Leu Gly Glu Ile Lys Arg Pro
340 345 350 355

AGC GTT GAA GAG ATT AAA ACT TCT CAT TTA GTC GTT ACG ATA GGA TCT 1161
Ser Val Glu Glu Ile Lys Thr Ser His Leu Val Val Thr Ile Gly Ser
360 365 370

TCT ATC AAA ACA GAA AAC CCT TTG GTG CGC TAT GCC ATC AAT AAC GCT 1209
Ser Ile Lys Thr Glu Asn Pro Leu Val Arg Tyr Ala Ile Asn Asn Ala
375 380 385

CTC AAA CTC AAT AAA GCT TCT TTA ATC GCT ATG CAC CCT ATT AAG GAT 1257
Leu Lys Leu Asn Lys Ala Ser Leu Ile Ala Met His Pro Ile Lys Asp
390 395 400

AAC GCG CTA GCG AAT TTG TGC CGA AGC TCT TTT TGC ATC ACC CAT GAA 1305
Asn Ala Leu Ala Asn Leu Cys Arg Ser Ser Phe Cys Ile Thr His Glu
405 410 415

GTG GGG GCT GAA GAA ATC CTT TTA GGC ATG CTT TTA AAA ATG CTT AAC 1353
Val Gly Ala Glu Glu Ile Leu Leu Gly Met Leu Leu Lys Met Leu Asn
420 425 430 435

ATT GAA AGC GCG GCC CTA AAA AGC TTA GAA GAT TCC AAG CAA AAT ATT 1401
Ile Glu Ser Ala Ala Leu Lys Ser Leu Glu Asp Ser Lys Gln Asn Ile
440 445 450

GTA GAT GAA GCG GCT CTT AAA GCC TTA GAA GAA GAG CGA AAA AAA GCT 1449
Val Asp Glu Ala Ala Leu Lys Ala Leu Glu Glu Glu Arg Lys Lys Ala
455 460 465

TTA GAA CAA GCC GAG CAA GGG TGC AGT ATT GGA GAA AAT AAG GCA GAA 1497
Leu Glu Gln Ala Glu Gln Gly Cys Ser Ile Gly Glu Asn Lys Ala Glu
470 475 480

AAT CAA GAA GAG AAT AAA ACA GAA GCG ACT ACC CCA AAA GAA GAA AAT 1545
Asn Gln Glu Glu Asn Lys Thr Glu Ala Thr Thr Pro Lys Glu Glu Asn
485 490 495

ATG GAT GGC AAA CGG GTT TAT AGC TGC AAC ACG AAA GCC AAA AAC AAC 249
Met Asp Gly Lys Arg Val Tyr Ser Cys Asn Thr Lys Ala Lys Asn Asn
55 60 65

GCC ACC ATT CTC ACT AAC ACC CCA ACG CTC ATG GAT GAA AGA AAA AGC 297
Ala Thr Ile Leu Thr Asn Thr Pro Thr Leu Met Asp Glu Arg Lys Ser
70 75 80

ATC ATG CAA ACT TAT GAT GTC AAC CAC CCC CTA GAG TGT GGC GTG TGC 345
Ile Met Gln Thr Tyr Asp Val Asn His Pro Leu Glu Cys Gly Val Cys
85 90 95

GAT AAG AGT GGG GAG TGC GAA TTG CAA GAC ATG ACG CAT TTA ACC GGC 393
Asp Lys Ser Gly Glu Cys Glu Leu Gln Asp Met Thr His Leu Thr Gly
100 105 110 115

GTA GAG CAC CAA CCC TAT GCG GTG GCT GAT GAT TTT AAA GCA CTG GAT 441
Val Glu His Gln Pro Tyr Ala Val Ala Asp Asp Phe Lys Ala Leu Asp
120 125 130

TTT TGG GCA AAA GCC TTG TAT GAT CCT AAT TTG TGC ATC ATG TGT GAA 489
Phe Trp Ala Lys Ala Leu Tyr Asp Pro Asn Leu Cys Ile Met Cys Glu
135 140 145

AGG TGC GTA ACC ACT TGT AAG GAC AAT GTG GGC GAA AAC AAC CTT AAA 537
Arg Cys Val Thr Thr Cys Lys Asp Asn Val Gly Glu Asn Asn Leu Lys
150 155 160

GCC ACT AAA GCC GAC TTG CAT GCT CCG GAT AAA TTT AAA GAC AGC ATG 585
Ala Thr Lys Ala Asp Leu His Ala Pro Asp Lys Phe Lys Asp Ser Met
165 170 175

TCC AAA GAC GCT TTT AGC GTG TGG AGT CGT AAG CAA AAA GGC ATT ATT 633
Ser Lys Asp Ala Phe Ser Val Trp Ser Arg Lys Gln Lys Gly Ile Ile
180 185 190 195

TCT TTT GTG GGC AGC GTG CCT TGC TAT GAT TGC GGG GAA TGC ATT GCA 681
Ser Phe Val Gly Ser Val Pro Cys Tyr Asp Cys Gly Glu Cys Ile Ala
200 205 210

GTA TGC CCT GTG GGC GCT TTG AGC TAT AAA GAT TTC GCT TAC ACG GCT 729
Val Cys Pro Val Gly Ala Leu Ser Tyr Lys Asp Phe Ala Tyr Thr Ala
215 220 225

AAC GCA TGG GAG TTA AAA AAG ATC CAT TCT ACT TGT TCG CAT TGC TCG 777
Asn Ala Trp Glu Leu Lys Lys Ile His Ser Thr Cys Ser His Cys Ser
230 235 240

GCC GGG TGT TTG ATT TCT TAT GAT GTG CGC CAT TTT GAT ACT CTA GGC 825
Ala Gly Cys Leu Ile Ser Tyr Asp Val Arg His Phe Asp Thr Leu Gly
245 250 255

GAA GAA TCT AAA ATT TTT AGA GTG CTT AAT GAT TTT TAC CAT AAC CCT 873
Glu Glu Ser Lys Ile Phe Arg Val Leu Asn Asp Phe Tyr His Asn Pro
260 265 270 275

Lys Asn Arg Arg Arg Xaa Arg Val Lys Cys Val Leu Leu Pro Asn Glu
115 120 125

Ser Val Asp Ser Leu Ser Phe Leu Tyr Arg Ser Ala Asn Trp Ser Glu
130 135 140

Arg Glu Ala Tyr Asp Met Leu Gly Ile Val Phe Asp Lys His Pro Tyr
145 150 155 160

Leu Lys Arg Leu Ile Met Pro His Asp Trp Val Gly His Pro Leu Leu
165 170 175

Arg Ser Tyr Pro Leu Lys Gly Asp Glu Phe Ala Gln Trp Tyr Glu Val
180 185 190

Asp Lys Ile Phe Gly Lys Glu Tyr Arg Glu Val Val Gly Lys Glu Gln
195 200 205

Arg Asp Ser Ala Arg Val Asp Glu Lys Asp Thr Phe Asn Phe Ala Lys
210 215 220

Ile Gly Tyr Glu Gln Gly Lys Gly Glu Glu Leu Lys Glu Val Glu Glu
225 230 235 240

Lys His Ala Phe Lys Lys Ile Pro Phe Val Lys Asp Leu His Lys Ile
245 250 255

Ala Pro Thr Ile Leu Lys Lys Arg Leu
260 265

(2) INFORMATION FOR SEQ ID NO: 1041: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 2580 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041: 

TGACGCTTAC CCTTATATCC CTATTTTATC CCACTCTCAA GGAATTTC ATG ATC ACA 57
Met Ile Thr
1

ATG AAT ATC AAT GGC AAA ACG ATT GAA TGC CAA GAG GGA CAA AGC GTT 105
Met Asn Ile Asn Gly Lys Thr Ile Glu Cys Gln Glu Gly Gln Ser Val
5 10 15

TTA GAG GCT GCT AGG AGC GCT GGG ATC TAC ATC CCT ACC ATT TGC TAT 153
Leu Glu Ala Ala Arg Ser Ala Gly Ile Tyr Ile Pro Thr Ile Cys Tyr
20 25 30 35

TTA AGC GGT TGC TCG CCC ACA GTC GCA TGC AAA ATG TGC ATG GTT GAA 201
Leu Ser Gly Cys Ser Pro Thr Val Ala Cys Lys Met Cys Met Val Glu
40 45 50

CCG CAT GAT TGG GTA GGC CAC CCA TTA TTG CGC TCT TAC CCG CTC AAA 582
Pro His Asp Trp Val Gly His Pro Leu Leu Arg Ser Tyr Pro Leu Lys
170 175 180

GGC GAT GAA TTC GCC CAA TGG TAT GAA GTG GAT AAA ATT TTT GGT AAA 630
Gly Asp Glu Phe Ala Gln Trp Tyr Glu Val Asp Lys Ile Phe Gly Lys
185 190 195

GAA TAC CGA GAA GTG GTG GGT AAA GAG CAG AGA GAC AGC GCA AGA GTG 678
Glu Tyr Arg Glu Val Val Gly Lys Glu Gln Arg Asp Ser Ala Arg Val
200 205 210

GAT GAA AAA GAC ACT TTC AAT TTT GCA AAA ATT GGC TAT GAA CAG GGC 726
Asp Glu Lys Asp Thr Phe Asn Phe Ala Lys Ile Gly Tyr Glu Gln Gly
215 220 225 230

AAG GGC GAA GAA TTA AAA GAA GTA GAA GAA AAG CAT GCG TTT AAG AAA 774
Lys Gly Glu Glu Leu Lys Glu Val Glu Glu Lys His Ala Phe Lys Lys
235 240 245

ATC CCT TTT GTC AAA GAT TTG CAC AAA ATC GCC CCC ACT ATC TTA AAA 822
Ile Pro Phe Val Lys Asp Leu His Lys Ile Ala Pro Thr Ile Leu Lys
250 255 260

AAG AGG CTA TAAAATGGCT CAAAATTTCA CGAAACTCAA CCC 864
Lys Arg Leu
265



(2) INFORMATION FOR SEQ ID NO: 1040: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040: 



Met Val Arg Lys Gln Ser Pro Tyr Glu Asp Val Gln Lys Gln Ser Arg
1 5 10 15

Gln His Asp Pro Tyr Lys Ile Ile Glu Pro Thr Pro Lys Lys Tyr Leu
20 25 30

Glu Gly Ser Ala Tyr Glu Val Ile Tyr Asn His Leu Ser Tyr Lys His
35 40 45

Glu Ile Leu Asp Lys Tyr Ile Glu Thr Asn Thr Ala Val Phe Trp Ile
50 55 60

Lys Lys Asp Asp Ile Phe Ser Val Ala Thr Ile Leu Arg His Leu Gly
65 70 75 80

Tyr Glu Cys Leu Ser Glu Met Ser Ala Ile Asp Leu Cys Ala Lys Lys
85 90 95

Gly His Phe Glu Leu Phe Tyr Gln Phe Val Gly Phe Ser Asp Ser Cys
100 105 110

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37... 831 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039: 

AGCGATCAAA CAAGACGCTC CCAAAAGGTT AGTGTG ATG GTA AGA AAA CAA TCC 54
Met Val Arg Lys Gln Ser
1 5

CCC TAT GAA GAT GTG CAA AAA CAA TCG CGC CAG CAT GAC CCC TAT AAA 102
Pro Tyr Glu Asp Val Gln Lys Gln Ser Arg Gln His Asp Pro Tyr Lys
10 15 20

ATC ATA GAA CCC ACC CCT AAA AAA TAT TTA GAG GGC AGC GCT TAT GAG 150
Ile Ile Glu Pro Thr Pro Lys Lys Tyr Leu Glu Gly Ser Ala Tyr Glu
25 30 35

GTC ATT TAC AAC CAC CTT TCT TAC AAA CAT GAG ATT TTA GAC AAA TAC 198
Val Ile Tyr Asn His Leu Ser Tyr Lys His Glu Ile Leu Asp Lys Tyr
40 45 50

ATA GAG ACT AAC ACG GCT GTG TTT TGG ATC AAA AAA GAC GAT ATT TTT 246
Ile Glu Thr Asn Thr Ala Val Phe Trp Ile Lys Lys Asp Asp Ile Phe
55 60 65 70

TCT GTC GCT ACG ATT TTA AGG CAT TTG GGT TAT GAG TGT TTG AGC GAA 294
Ser Val Ala Thr Ile Leu Arg His Leu Gly Tyr Glu Cys Leu Ser Glu
75 80 85

ATG AGC GCG ATA GAT TTG TGC GCT AAA AAA GGG CAT TTT GAA TTG TTT 342
Met Ser Ala Ile Asp Leu Cys Ala Lys Lys Gly His Phe Glu Leu Phe
90 95 100

TAT CAG TTC GTG GGC TTT AGC GAT AGC TGC AAG AAC CGC CGT AGG NTG 390
Tyr Gln Phe Val Gly Phe Ser Asp Ser Cys Lys Asn Arg Arg Arg Xaa
105 110 115

CGC GTG AAG TGC GTT TTG TTG CCT AAT GAG AGC GTG GAT TCT TTG AGT 438
Arg Val Lys Cys Val Leu Leu Pro Asn Glu Ser Val Asp Ser Leu Ser
120 125 130

TTT TTA TAC CGA TCG GCT AAT TGG AGC GAA AGG GAA GCG TAT GAC ATG 486
Phe Leu Tyr Arg Ser Ala Asn Trp Ser Glu Arg Glu Ala Tyr Asp Met
135 140 145 150

CTT GGT ATT GTG TTT GAC AAA CAC CCC TAT TTG AAA CGC CTT ATT ATG 534
Leu Gly Ile Val Phe Asp Lys His Pro Tyr Leu Lys Arg Leu Ile Met
155 160 165

75 80 85 90

GCG ATT GGT TTT AAA AAA TTA GGC TTG TTT GGA CTC GTT GAA ATG CTA 340
Ala Ile Gly Phe Lys Lys Leu Gly Leu Phe Gly Leu Val Glu Met Leu
95 100 105

GGC TTT GTC TTC TTT TTA ACC ATT GGT TTT ATT TAC GCT TTA AAG CGA 388
Gly Phe Val Phe Phe Leu Thr Ile Gly Phe Ile Tyr Ala Leu Lys Arg
110 115 120

AAC GCT TTG AGC TGG CAA AAA TTA GAG GTG AAA TAATGCAACA AGCACCGGTT 441
Asn Ala Leu Ser Trp Gln Lys Leu Glu Val Lys
125 130

GTTCTAAGCA CTTTGGATAA ATTATTG 468

(2) INFORMATION FOR SEQ ID NO: 1038: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038:

Met Gln Gln Ala Thr Glu Ala Leu Asn His Pro Tyr Phe Gly Val Phe
1 5 10 15

Val Leu Leu Val Phe Thr Phe Trp Val Phe Asn Leu Thr Leu Arg Ile
20 25 30

Gln Arg Phe Leu Ser Arg Lys Met Ala Gln Lys Lys Gly Glu Lys Leu
35 40 45

Lys Leu Ala Pro Tyr Glu Cys Gly Pro Val Ala Leu Lys Gln Pro Asn
50 55 60

Arg Val Ser His His Phe Tyr Ile Met Ala Met Leu Phe Ile Leu Phe
65 70 75 80

Asp Val Glu Ile Val Phe Met Phe Pro Trp Ala Ile Gly Phe Lys Lys
85 90 95

Leu Gly Leu Phe Gly Leu Val Glu Met Leu Gly Phe Val Phe Phe Leu
100 105 110

Thr Ile Gly Phe Ile Tyr Ala Leu Lys Arg Asn Ala Leu Ser Trp Gln
115 120 125

Lys Leu Glu Val Lys
130

(2) INFORMATION FOR SEQ ID NO: 1039: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
100 105 110

Gln Leu Arg Pro Asp Ile Val Trp Phe Gly Glu Ala Val Pro Leu Leu
115 120 125

Lys Glu Ala Ile Ser Leu Val Lys Gln Ala His Leu Leu Ile Ile Ile
130 135 140

Gly Thr Ser Leu Gln Val Tyr Pro Ala Ala Ser Leu Tyr Thr His Ala
145 150 155 160

His Lys Asp Ala Leu Ile Tyr Tyr Ile Asp Pro Lys Ala Lys Asn Ala
165 170 175

His Leu Pro Gln Asn Val Gln Cys Ile Asn Glu Ser Ala Val His Ala
180 185 190

Met Gln Asp Leu Met Pro Lys Leu Ile Glu Met Ala Ser
195 200 205



(2) INFORMATION FOR SEQ ID NO: 1037: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 23... 421 
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037:

ATTAAAGGAG TTTGAGAGTC TG ATG CAA CAA GCC ACA GAA GCA TTG AAT CAC 52
Met Gln Gln Ala Thr Glu Ala Leu Asn His
1 5 10

CCC TAT TTT GGC GTT TTT GTT TTA TTG GTA TTC ACC TTT TGG GTG TTT 100
Pro Tyr Phe Gly Val Phe Val Leu Leu Val Phe Thr Phe Trp Val Phe
15 20 25

AAC TTA ACC TTA AGG ATC CAA AGG TTT TTA AGC CGT AAA ATG GCT CAA 148
Asn Leu Thr Leu Arg Ile Gln Arg Phe Leu Ser Arg Lys Met Ala Gln
30 35 40

AAA AAG GGC GAA AAG CTC AAG CTC GCT CCC TAT GAA TGC GGG CCT GTG 196
Lys Lys Gly Glu Lys Leu Lys Leu Ala Pro Tyr Glu Cys Gly Pro Val
45 50 55

GCT CTC AAA CAG CCT AAT AGG GTG TCG CAC CAT TTC TAT ATC ATG GCC 244
Ala Leu Lys Gln Pro Asn Arg Val Ser His His Phe Tyr Ile Met Ala
60 65 70

ATG CTT TTT ATT TTA TTT GAT GTA GAA ATC GTT TTC ATG TTC CCT TGG 292
Met Leu Phe Ile Leu Phe Asp Val Glu Ile Val Phe Met Phe Pro Trp

GAA AAG GAC TTG AAT TTA GGC GAC TTG GCC AAA GAC AAA TCG CAA TTA 394
Glu Lys Asp Leu Asn Leu Gly Asp Leu Ala Lys Asp Lys Ser Gln Leu
100 105 110

CGC CCT GAT ATT GTG TGG TTT GGC GAA GCG GTG CCT TTG CTT AAA GAA 442
Arg Pro Asp Ile Val Trp Phe Gly Glu Ala Val Pro Leu Leu Lys Glu
115 120 125 130

GCG ATT TCT TTA GTG AAA CAA GCG CAT CTT TTA ATC ATC ATT GGC ACT 490
Ala Ile Ser Leu Val Lys Gln Ala His Leu Leu Ile Ile Ile Gly Thr
135 140 145

TCT TTG CAA GTC TAT CCC GCC GCT AGC CTC TAC ACG CAT GCG CAT AAA 538
Ser Leu Gln Val Tyr Pro Ala Ala Ser Leu Tyr Thr His Ala His Lys
150 155 160

GAC GCT CTC ATT TAT TAC ATT GAC CCT AAG GCT AAA AAC GCC CAT TTA 586
Asp Ala Leu Ile Tyr Tyr Ile Asp Pro Lys Ala Lys Asn Ala His Leu
165 170 175

CCC CAG AAT GTC CAA TGC ATT AAT GAA AGC GCG GTG CAT GCC ATG CAA 634
Pro Gln Asn Val Gln Cys Ile Asn Glu Ser Ala Val His Ala Met Gln
180 185 190

GAT TTA ATG CCC AAA CTC ATA GAA ATG GCT TCT TAAGAAATGT TAAAATAATT 687
Asp Leu Met Pro Lys Leu Ile Glu Met Ala Ser
195 200 205

TTTATTTTTT CAGCTAACGA TTAGCAAAAA 717
(2) INFORMATION FOR SEQ ID NO: 1036: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036:

Met Ala Cys Gly Lys Gly His Asp Ile Met Glu Val Ala Ser Pro Tyr
1 5 10 15

Gly Trp Lys Lys Asn Pro Gln Lys Val Leu Asp Phe Tyr Asn Gln Arg
20 25 30

Arg Arg Gln Leu Phe Glu Val Tyr Pro Asn Lys Ala His Lys Ala Leu
35 40 45

Ala Glu Leu Glu Lys His Tyr Gln Val Asn Ile Ile Thr Gln Asn Val
50 55 60

Asp Asp Leu His Glu Arg Ala Gly Ser Ser Arg Ile Leu His Leu His
65 70 75 80

Gly Glu Leu Leu Ser Val Arg Ser Glu Lys Asp Pro Asn Leu Val Tyr
85 90 95

Arg Trp Glu Lys Asp Leu Asn Leu Gly Asp Leu Ala Lys Asp Lys Ser

65 70 75 80

Leu Ser Met Gly Leu Leu Ile Gly Phe Val Thr Pro Phe Ile Phe Lys
85 90 95

Phe Phe Leu His Asp Lys Leu Ser Gly Thr His Ile Glu Thr Ile Lys
100 105 110

Glu Ala Thr
115



(2) INFORMATION FOR SEQ ID NO: 1035: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 53...667
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1035: 



AGCGGGGCTG GTATTTCAGC AGAAAGCGGG ATTAAAACCT TTAGAGACGC TG ATG GCT 58
Met Ala
1

TGT GGG AAA GGG CAT GAC ATC ATG GAA GTT GCC TCG CCT TAT GGC TGG 106
Cys Gly Lys Gly His Asp Ile Met Glu Val Ala Ser Pro Tyr Gly Trp
5 10 15

AAA AAG AAC CCG CAA AAG GTG TTG GAT TTT TAC AAC CAA AGG CGC CGA 154
Lys Lys Asn Pro Gln Lys Val Leu Asp Phe Tyr Asn Gln Arg Arg Arg
20 25 30

CAG CTT TTT GAA GTT TAT CCT AAC AAA GCC CAT AAG GCT TTA GCG GAA 202
Gln Leu Phe Glu Val Tyr Pro Asn Lys Ala His Lys Ala Leu Ala Glu
35 40 45 50

TTG GAA AAA CAC TAT CAA GTC AAT ATC ATC ACC CAA AAT GTA GAT GAT 250
Leu Glu Lys His Tyr Gln Val Asn Ile Ile Thr Gln Asn Val Asp Asp
55 60 65

TTG CAT GAA AGA GCG GGT TCT TCT CGC ATT TTG CAC TTG CAT GGG GAA 298
Leu His Glu Arg Ala Gly Ser Ser Arg Ile Leu His Leu His Gly Glu
70 75 80

TTA TTG AGC GTT CGC AGC GAG AAA GAT CCT AAT TTA GTT TAT AGG TGG 346
Leu Leu Ser Val Arg Ser Glu Lys Asp Pro Asn Leu Val Tyr Arg Trp
85 90 95

CTCTCTTTGC GCCTAAAGGC CTTTATCACC GATATTTTT ATG ATT TAT ACC CCC 54
Met Ile Tyr Thr Pro
1 5

ATG CTT TAT ATA ATG ACT TAT GCG ATT TTA GGG AGC GCG AAG GAT TTT 102
Met Leu Tyr Ile Met Thr Tyr Ala Ile Leu Gly Ser Ala Lys Asp Phe
10 15 20

AGG GAA AAC CAG AGC GCG ATT TTT TTA TGC CTG CTT TTT TAC GCC CTA 150
Arg Glu Asn Gln Ser Ala Ile Phe Leu Cys Leu Leu Phe Tyr Ala Leu
25 30 35

ACA CAC AGC TTT TTT ATC GCT TTT AAA TCC CAA AGC CCT GGC ATG CGT 198
Thr His Ser Phe Phe Ile Ala Phe Lys Ser Gln Ser Pro Gly Met Arg
40 45 50

TAC GCT CGG TTT AAA TTA ATC AAA AAT AAT GGC GAA AAA GTG GGC TTT 246
Tyr Ala Arg Phe Lys Leu Ile Lys Asn Asn Gly Glu Lys Val Gly Phe
55 60 65

TTT TTA GCT TTG TGG CGC TTT GTT TTG TGG GTG TTG AGC ATG GGG TTA 294
Phe Leu Ala Leu Trp Arg Phe Val Leu Trp Val Leu Ser Met Gly Leu
70 75 80 85

CTC ATA GGG TTT GTT ACG CCT TTT ATT TTT AAG TTT TTT TTG CAT GAC 342
Leu Ile Gly Phe Val Thr Pro Phe Ile Phe Lys Phe Phe Leu His Asp
90 95 100

AAA CTC AGC GGC ACT CAT ATT GAA ACC ATC AAG GAG GCA ACA TGAAAAATT 393
Lys Leu Ser Gly Thr His Ile Glu Thr Ile Lys Glu Ala Thr
105 110 115

TAGTAATC 401

(2) INFORMATION FOR SEQ ID NO: 1034: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034:

Met Ile Tyr Thr Pro Met Leu Tyr Ile Met Thr Tyr Ala Ile Leu Gly
1 5 10 15

Ser Ala Lys Asp Phe Arg Glu Asn Gln Ser Ala Ile Phe Leu Cys Leu
20 25 30

Leu Phe Tyr Ala Leu Thr His Ser Phe Phe Ile Ala Phe Lys Ser Gln
35 40 45

Ser Pro Gly Met Arg Tyr Ala Arg Phe Lys Leu Ile Lys Asn Asn Gly
50 55 60

Glu Lys Val Gly Phe Phe Leu Ala Leu Trp Arg Phe Val Leu Trp Val

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032: 

Met Asp Ile Lys Ala Cys Tyr Gln Asn Ala Lys Ala Leu Leu Glu Gly
1 5 10 15

His Phe Leu Leu Ser Ser Gly Phe His Ser Asn Tyr Tyr Leu Gln Ser
20 25 30

Ala Lys Val Leu Glu Asp Pro Lys Leu Ala Glu Gln Leu Ala Leu Glu
35 40 45

Leu Ala Lys Gln Ile Gln Glu Ala His Leu Asn Ile Glu Cys Val Cys
50 55 60

Ser Pro Ala Ile Gly Gly Ile Leu Ala Gly Tyr Glu Leu Ala Arg Ala
65 70 75 80

Leu Gly Val Arg Phe Ile Phe Thr Glu Arg Val Asp Asn Thr Met Ala
85 90 95

Leu Arg Arg Gly Phe Glu Val Lys Lys Asn Glu Lys Ile Leu Val Cys
100 105 110

Glu Asp Ile Ile Thr Thr Gly Lys Ser Ala Met Glu Cys Ala Lys Val
115 120 125

Leu Glu Glu Lys Gly Ala Gln Ile Val Ala Phe Gly Ala Leu Ala Asn
130 135 140

Arg Gly Ile Cys Lys Arg Ala His Ser His Leu Lys Ala Gln Glu Gly
145 150 155 160

Ala Cys Leu Pro Ser His Leu Pro Leu Phe Ala Leu Glu Asp Phe Val
165 170 175

Phe Asp Met His Lys Pro Ser Ser Cys Pro Leu Cys Ala Thr Ser Val
180 185 190

Ala Ile Lys Pro Gly Ser Arg Gly Asn
195 200

(2) INFORMATION FOR SEQ ID NO: 1033: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 40... 384 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033: 


Met Asp Ile Lys Ala Cys Tyr Gln Asn Ala Lys Ala
1 5 10

TTA TTA GAG GGG CAT TTC TTG CTC AGC AGT GGG TTT CAT TCC AAT TAT 99
Leu Leu Glu Gly His Phe Leu Leu Ser Ser Gly Phe His Ser Asn Tyr
15 20 25

TAT TTG CAA TCC GCT AAA GTT TTA GAA GAT CCC AAA CTA GCC GAA CAA 147
Tyr Leu Gln Ser Ala Lys Val Leu Glu Asp Pro Lys Leu Ala Glu Gln
30 35 40

TTA GCG CTA GAA TTA GCC AAA CAA ATC CAA GAA GCT CAT TTG AAT ATT 195
Leu Ala Leu Glu Leu Ala Lys Gln Ile Gln Glu Ala His Leu Asn Ile
45 50 55 60

GAA TGC GTG TGC TCA CCG GCT ATT GGG GGG ATT TTG GCT GGG TAT GAG 243
Glu Cys Val Cys Ser Pro Ala Ile Gly Gly Ile Leu Ala Gly Tyr Glu
65 70 75

CTT GCA AGG GCT TTG GGC GTG CGT TTT ATC TTC ACC GAA AGG GTG GAT 291
Leu Ala Arg Ala Leu Gly Val Arg Phe Ile Phe Thr Glu Arg Val Asp
80 85 90

AAT ACC ATG GCG TTA AGG CGT GGC TTT GAA GTC AAA AAA AAC GAA AAA 339
Asn Thr Met Ala Leu Arg Arg Gly Phe Glu Val Lys Lys Asn Glu Lys
95 100 105

ATT TTA GTG TGT GAG GAC ATT ATC ACT ACG GGA AAA TCC GCT ATG GAA 387
Ile Leu Val Cys Glu Asp Ile Ile Thr Thr Gly Lys Ser Ala Met Glu
110 115 120

TGC GCT AAA GTT TTA GAA GAA AAG GGT GCT CAA ATC GTG GCT TTT GGT 435
Cys Ala Lys Val Leu Glu Glu Lys Gly Ala Gln Ile Val Ala Phe Gly
125 130 135 140

GCT TTA GCT AAT CGG GGC ATT TGC AAG CGT GCT CAT TCT CAT TTA AAA 483
Ala Leu Ala Asn Arg Gly Ile Cys Lys Arg Ala His Ser His Leu Lys
145 150 155

GCC CAA GAG GGA GCG TGT TTG CCT AGC CAT TTG CCC CTT TTT GCT TTA 531
Ala Gln Glu Gly Ala Cys Leu Pro Ser His Leu Pro Leu Phe Ala Leu
160 165 170

GAA GAT TTT GTT TTT GAC ATG CAC AAG CCT AGT TCT TGC CCT TTA TGC 579
Glu Asp Phe Val Phe Asp Met His Lys Pro Ser Ser Cys Pro Leu Cys
175 180 185

GCT ACT AGC GTT GCT ATA AAG CCA GGA AGT CGT GGC AAC TAAAAAAACA AA 630
Ala Thr Ser Val Ala Ile Lys Pro Gly Ser Arg Gly Asn
190 195 200

AAAAAATAAA ACCCCAAAAA AAAAGCAAGC GT 662

(2) INFORMATION FOR SEQ ID NO: 1032:
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Lys Pro Ile Gly Glu Arg Tyr Leu Leu Met Leu Lys Lys Tyr Leu Gln

85 90 95 

Phe Asp Phe Gly Glu Ser Phe Tyr Arg Gln Ile Lys Val Ile Asp Leu
100 105 110

Ile Lys Glu Lys Leu Pro Val Ser Ile Ser Leu Gly Leu Phe Ser Thr
115 120 125

Leu Leu Ile Tyr Leu Ile Ser Ile Pro Leu Gly Ile Phe Lys Ala Lys
130 135 140

Arg Asn Asn Glu Pro Leu Asp Val Leu Ser Ser Val Val Ile Ile Val
145 150 155 160

Ala Asn Ala Ile Pro Ala Phe Leu Phe Ala Val Val Leu Ile Val Phe
165 170 175

Phe Ala Gly Gly Asn Tyr Trp His Trp Phe Pro Leu Lys Gly Leu Val
180 185 190

Ser Asp Asn Phe Glu Ser Leu Ser Ala Leu Gly Lys Ile Lys Asp Tyr
195 200 205

Leu Trp His Ile Thr Leu Pro Val Leu Cys Ile Ser Leu Gly Gly Phe
210 215 220

Ala Ser Leu Thr Leu Leu Val Lys Asn Ser Phe Leu Asp Glu Met Gly
225 230 235 240

Lys Leu Tyr Val Leu Ser Ala Lys Ala Lys Gly Cys Ser Val Gly Arg
245 250 255

Ile Phe Tyr Ala His Val Phe Arg Asn Ala Ile Leu Leu Val Val Ala
260 265 270

Gly Phe Pro Gln Ala Phe Leu Gly Met Phe Phe Ser Ser Ser Leu Leu
275 280 285

Ile Glu Ile Val Phe Ser Leu Asp Gly Leu Gly Leu Leu Gly Tyr Glu
290 295 300

Ser Ile Val Ser Arg Asp Tyr Pro Val Val Phe Gly Ser Leu Tyr Ile
305 310 315 320

Phe Thr Leu Leu Gly Leu Val Ala Ser Leu Ile Ser Asp Leu Leu Cys
325 330 335

Val Val Ile Asp Pro Arg Ile Asp Phe Glu Lys Arg
340 345

(2) INFORMATION FOR SEQ ID NO: 1031: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...618
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1031: 
TTAAAGGTCT AAACC ATG GAT ATT AAG GCA TGT TAT CAA AAC GCT AAA GCG 51 
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Leu Thr Leu Leu Val Lys Asn Ser Phe Leu Asp Glu Met Gly Lys Leu 
230 235 240

TAT GTA CTG AGC GCT AAG GCT AAG GGT TGT TCA GTG GGG CGT ATT TTT 825
Tyr Val Leu Ser Ala Lys Ala Lys Gly Cys Ser Val Gly Arg Ile Phe
245 250 255

TAT GCG CAT GTG TTC CGT AAT GCG ATT TTA TTA GTG GTG GCG GGT TTC 873
Tyr Ala His Val Phe Arg Asn Ala Ile Leu Leu Val Val Ala Gly Phe
260 265 270

CCG CAA GCT TTT TTG GGC ATG TTC TTT AGC TCA AGT TTG TTG ATA GAG 921
Pro Gln Ala Phe Leu Gly Met Phe Phe Ser Ser Ser Leu Leu Ile Glu
275 280 285 290

ATT GTT TTT AGC CTA GAC GGG TTA GGG CTT TTA GGG TAT GAA AGC ATT 969
Ile Val Phe Ser Leu Asp Gly Leu Gly Leu Leu Gly Tyr Glu Ser Ile
295 300 305

GTG AGT AGG GAT TAT CCC GTT GTG TTT GGT TCG CTT TAT ATT TTC ACG 1017
Val Ser Arg Asp Tyr Pro Val Val Phe Gly Ser Leu Tyr Ile Phe Thr
310 315 320

CTT TTA GGT TTG GTA GCG AGT TTG ATA AGC GAT TTG CTC TGT GTG GTG 1065
Leu Leu Gly Leu Val Ala Ser Leu Ile Ser Asp Leu Leu Cys Val Val
325 330 335

ATT GAC CCT AGG ATT GAT TTT GAA AAG CGT TGAGGGTAGG AATGAAAACT GAG 1118
Ile Asp Pro Arg Ile Asp Phe Glu Lys Arg
340 345

ATGAAATCTT CTTTAAAACT TTTTATGCGG CCTT 1152

(2) INFORMATION FOR SEQ ID NO: 1030: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1030: 

Met Ile Ala Tyr Ile Leu Lys Arg Leu Leu Leu Ile Ile Pro Thr Leu
1 5 10 15

Leu Ala Ile Met Thr Ile Asn Phe Phe Leu Ile Gln Ser Ala Pro Gly
20 25 30

Gly Pro Ile Glu Gln Met Met Ala Lys Ile Asn Asn Thr Gln Ser Lys
35 40 45

Glu Ile Gln Gly Val Val Lys Glu Arg Ser Tyr Arg Ala Ser Gln Gly
50 55 60

Leu Glu Ser Asp Leu Leu Glu Asn Leu Lys Lys Leu Tyr Gly Phe Asp
65 70 75 80
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5 10 15 

ATC ATG ACC ATT AAT TTC TTT TTG ATC CAA TCG GCT CCT GGA GGC CCT 153
Ile Met Thr Ile Asn Phe Phe Leu Ile Gln Ser Ala Pro Gly Gly Pro
20 25 30

ATA GAG CAG ATG ATG GCT AAA ATC AAT AAC ACG CAG TCC AAA GAG ATT 201
Ile Glu Gln Met Met Ala Lys Ile Asn Asn Thr Gln Ser Lys Glu Ile
35 40 45 50

CAA GGC GTT GTT AAA GAG CGT TCG TAT AGG GCG TCT CAA GGG TTG GAG 249
Gln Gly Val Val Lys Glu Arg Ser Tyr Arg Ala Ser Gln Gly Leu Glu
55 60 65

AGC GAT TTG TTA GAA AAT TTA AAA AAA CTC TAT GGT TTT GAC AAG CCC 297
Ser Asp Leu Leu Glu Asn Leu Lys Lys Leu Tyr Gly Phe Asp Lys Pro
70 75 80

ATA GGG GAG CGC TAC CTT CTC ATG CTC AAA AAA TAT CTG CAA TTT GAT 345
Ile Gly Glu Arg Tyr Leu Leu Met Leu Lys Lys Tyr Leu Gln Phe Asp
85 90 95

TTT GGG GAG AGC TTT TAT CGC CAG ATT AAA GTG ATA GAT TTG ATT AAG 393
Phe Gly Glu Ser Phe Tyr Arg Gln Ile Lys Val Ile Asp Leu Ile Lys
100 105 110

GAA AAA TTG CCC GTA TCC ATT TCG TTA GGG CTT TTT AGC ACG CTT TTG 441
Glu Lys Leu Pro Val Ser Ile Ser Leu Gly Leu Phe Ser Thr Leu Leu
115 120 125 130

ATT TAT CTT ATT TCT ATC CCT TTA GGG ATT TTC AAG GCC AAA CGC AAT 489
Ile Tyr Leu Ile Ser Ile Pro Leu Gly Ile Phe Lys Ala Lys Arg Asn
135 140 145

AAC GAG CCT TTA GAC GTG TTA AGC AGC GTG GTG ATC ATT GTC GCT AAC 537
Asn Glu Pro Leu Asp Val Leu Ser Ser Val Val Ile Ile Val Ala Asn
150 155 160

GCT ATC CCG GCC TTT TTG TTT GCG GTG GTG TTG ATC GTG TTT TTT GCT 585
Ala Ile Pro Ala Phe Leu Phe Ala Val Val Leu Ile Val Phe Phe Ala
165 170 175

GGA GGG AAT TAT TGG CAT TGG TTC CCT TTA AAG GGG CTA GTG AGC GAT 633
Gly Gly Asn Tyr Trp His Trp Phe Pro Leu Lys Gly Leu Val Ser Asp
180 185 190

AAT TTT GAA AGT TTG AGC GCG TTA GGT AAA ATC AAG GAT TAT TTA TGG 681
Asn Phe Glu Ser Leu Ser Ala Leu Gly Lys Ile Lys Asp Tyr Leu Trp
195 200 205 210

CAT ATC ACT TTG CCC GTT CTT TGC ATT TCT TTA GGG GGT TTT GCA AGC 729
His Ile Thr Leu Pro Val Leu Cys Ile Ser Leu Gly Gly Phe Ala Ser
215 220 225

CTT ACG CTT TTA GTG AAA AAC TCT TTT TTA GAT GAA ATG GGC AAG CTC 777
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100 105 110

Asn Pro Tyr Asn Thr Ala Ile Ile Ile Asp Pro Gln Gly Glu Ile Ile
115 120 125

Leu Lys Tyr Arg Lys Leu Phe Pro Trp Asn Pro Ile Glu Pro Trp Tyr
130 135 140

Pro Gly Asp Leu Gly Met Pro Val Cys Glu Gly Pro Gly Gly Ser Lys
145 150 155 160

Leu Ala Val Cys Ile Cys His Asp Gly Met Ile Pro Glu Leu Ala Arg
165 170 175

Glu Ala Ala Tyr Lys Gly Cys Asn Val Tyr Ile Arg Ile Ser Gly Tyr
180 185 190

Ser Thr Gln Val Asn Asp Gln Trp Ile Leu Thr Asn Arg Ser Asn Ala
195 200 205

Trp His Asn Leu Met Tyr Thr Val Ser Val Asn Leu Ala Gly Tyr Asp
210 215 220

Asn Val Phe Tyr Tyr Phe Gly Glu Gly Gln Ile Cys Asn Phe Asp Gly
225 230 235 240

Thr Thr Leu Val Gln Gly His Arg Asn Pro Trp Glu Ile Val Thr Gly
245 250 255

Glu Ile Tyr Pro Lys Met Ala Asp Asn Ala Arg Leu Ser Trp Gly Leu
260 265 270

Glu Asn Asn Ile Tyr Asn Leu Gly His Arg Gly Tyr Val Ala Lys Pro
275 280 285

Gly Gly Glu His Asp Ala Gly Leu Thr Tyr Ile Lys Asp Leu Ala Ala
290 295 300

Gly Lys Tyr Lys Leu Pro Trp Glu Asp His Met Lys Ile Lys Asp Gly
305 310 315 320

Ser Ile Tyr Gly Tyr Pro Thr Thr Gly Gly Arg Phe Gly Lys
325 330



(2) INFORMATION FOR SEQ ID NO: 1029:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1152 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 52...1095
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029:

TAGCTATGGA TTTTCGCCGT ATTTGTGGTG GATAAAAGAA AAGGATCTTC A ATG ATT 57
Met Ile
1

GCT TAC ATT CTC AAA CGC TTG CTT TTG ATT ATC CCT ACT TTA TTA GCT 105
Ala Tyr Ile Leu Lys Arg Leu Leu Leu Ile Ile Pro Thr Leu Leu Ala
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GTC TTT TAC TAC TTT GGT GAG GGG CAA ATC TGT AAC TTT GAT GGC ACG 780

Val Phe Tyr Tyr Phe Gly Glu Gly Gln Ile Cys Asn Phe Asp Gly Thr
230 235 240

ACT CTT GTT CAA GGG CAC CGC AAC CCT TGG GAG ATT GTA ACC GGG GAA 828
Thr Leu Val Gln Gly His Arg Asn Pro Trp Glu Ile Val Thr Gly Glu
245 250 255

ATC TAT CCC AAA ATG GCA GAC AAC GCT CGC TTA AGC TGG GGA TTA GAA 876
Ile Tyr Pro Lys Met Ala Asp Asn Ala Arg Leu Ser Trp Gly Leu Glu
260 265 270

AAC AAC ATT TAC AAC CTA GGC CAT AGA GGG TAT GTG GCT AAA CCG GGC 924
Asn Asn Ile Tyr Asn Leu Gly His Arg Gly Tyr Val Ala Lys Pro Gly
275 280 285

GGA GAA CAT GAC GCA GGC TTA ACC TAT ATC AAA GAC TTA GCG GCC GGT 972
Gly Glu His Asp Ala Gly Leu Thr Tyr Ile Lys Asp Leu Ala Ala Gly
290 295 300 305

AAA TAC AAA TTG CCT TGG GAA GAT CAC ATG AAA ATC AAA GAC GGC TCT 1020
Lys Tyr Lys Leu Pro Trp Glu Asp His Met Lys Ile Lys Asp Gly Ser
310 315 320

ATT TAT GGC TAC CCT ACC ACC GGT GGG CGT TTT GGG AAA TAATCCCTAA CC 1071
Ile Tyr Gly Tyr Pro Thr Thr Gly Gly Arg Phe Gly Lys
325 330

TTGCATTTTT GCTAGAACCC GTTTTTAAGG G 1102

(2) INFORMATION FOR SEQ ID NO: 1028: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1028:

Met Gly Ser Ile Gly Ser Met Gly Lys Pro Ile Glu Gly Phe Leu Val
1 5 10 15

Ala Ala Ile Gln Phe Pro Val Pro Ile Val Asn Ser Arg Lys Asp Ile
20 25 30

Asp His Asn Ile Glu Ser Ile Ile Arg Thr Leu His Ala Thr Lys Ala
35 40 45

Gly Tyr Pro Gly Val Glu Leu Ile Ile Phe Pro Glu Tyr Ser Thr Gln
50 55 60

Gly Leu Asn Thr Ala Lys Trp Leu Ser Glu Glu Phe Leu Leu Asp Val
65 70 75 80

Pro Gly Lys Glu Thr Glu Leu Tyr Ala Lys Ala Cys Lys Glu Ala Lys
85 90 95

Val Tyr Gly Val Phe Ser Ile Met Glu Arg Asn Pro Asp Ser Asn Lys
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GGA AGT ATC GGT AGT ATG GGC AAA CCT ATT GAA GGG TTT TTA GTG GCA 108

Gly Ser Ile Gly Ser Met Gly Lys Pro Ile Glu Gly Phe Leu Val Ala
5 10 15

GCC ATT CAG TTT CCT GTG CCA ATT GTC AAT AGC CGT AAG GAT ATT GAT 156
Ala Ile Gln Phe Pro Val Pro Ile Val Asn Ser Arg Lys Asp Ile Asp
20 25 30

CAC AAT ATT GAA AGC ATT ATT AGA ACC TTG CAT GCG ACT AAA GCG GGG 204
His Asn Ile Glu Ser Ile Ile Arg Thr Leu His Ala Thr Lys Ala Gly
35 40 45

TAT CCG GGA GTG GAG CTT ATC ATT TTC CCT GAG TAT AGC ACG CAA GGT 252
Tyr Pro Gly Val Glu Leu Ile Ile Phe Pro Glu Tyr Ser Thr Gln Gly
50 55 60 65

TTG AAT ACC GCT AAG TGG CTT AGC GAA GAG TTT TTA TTA GAT GTC CCG 300
Leu Asn Thr Ala Lys Trp Leu Ser Glu Glu Phe Leu Leu Asp Val Pro
70 75 80

GGT AAA GAG ACA GAG CTA TAC GCT AAG GCG TGT AAA GAG GCG AAA GTT 348
Gly Lys Glu Thr Glu Leu Tyr Ala Lys Ala Cys Lys Glu Ala Lys Val
85 90 95

TAT GGT GTT TTT TCA ATC ATG GAA CGC AAT CCT GAT TCT AAC AAA AAC 396
Tyr Gly Val Phe Ser Ile Met Glu Arg Asn Pro Asp Ser Asn Lys Asn
100 105 110

CCC TAC AAC ACC GCC ATT ATC ATT GAT CCG CAA GGT GAA ATC ATT TTA 444
Pro Tyr Asn Thr Ala Ile Ile Ile Asp Pro Gln Gly Glu Ile Ile Leu
115 120 125

AAA TAC CGC AAG CTA TTC CCA TGG AAT CCC ATT GAG CCA TGG TAT CCT 492
Lys Tyr Arg Lys Leu Phe Pro Trp Asn Pro Ile Glu Pro Trp Tyr Pro
130 135 140 145

GGG GAT TTA GGA ATG CCT GTG TGC GAG GGT CCG GGC GGA TCA AAA TTA 540
Gly Asp Leu Gly Met Pro Val Cys Glu Gly Pro Gly Gly Ser Lys Leu
150 155 160

GCC GTG TGC ATT TGC CAT GAC GGC ATG ATT CCA GAG CTC GCC AGA GAA 588
Ala Val Cys Ile Cys His Asp Gly Met Ile Pro Glu Leu Ala Arg Glu
165 170 175

GCG GCC TAT AAA GGG TGC AAT GTG TAT ATC CGC ATT TCA GGC TAT AGC 636
Ala Ala Tyr Lys Gly Cys Asn Val Tyr Ile Arg Ile Ser Gly Tyr Ser
180 185 190

ACT CAA GTC AAT GAT CAA TGG ATT TTG ACC AAC CGC TCC AAC GCA TGG 684
Thr Gln Val Asn Asp Gln Trp Ile Leu Thr Asn Arg Ser Asn Ala Trp
195 200 205

CAC AAT TTG ATG TAT ACC GTG AGC GTG AAT TTA GCC GGC TAT GAT AAT 732
His Asn Leu Met Tyr Thr Val Ser Val Asn Leu Ala Gly Tyr Asp Asn
210 215 220 225


(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026: 

Gly Tyr Ser Val Lys Asn Ser Asn Arg Leu Ile Tyr Thr Asp Asn Leu
1 5 10 15

Glu Glu Ser Leu Glu Glu Thr Ala Ser Leu Phe Glu His His Ile Lys
20 25 30

Phe Tyr Thr Glu Ile Ile Glu Lys Asp Lys Lys Val Ile Lys Thr Phe
35 40 45

Asn Lys Asp Phe Lys Ile Glu His Ala Lys Glu Val Ile Ser Lys Ala
50 55 60

His Leu Lys His Ser Glu Leu Asn Ala Phe Leu Ile Ala Ala Pro Ser
65 70 75 80

Tyr Gly Ile Glu Ala Gln Asn Ala Leu Leu Lys Ile Leu Glu Glu Pro
85 90 95

Pro Asn Asn Val Cys Phe Ile Met Phe Ala Lys Ser Gln Asn His Val
100 105 110

Leu Ala Thr Ile Lys Ser Arg Leu Ile Lys Glu Asp Lys Arg Gln Lys
115 120 125

Ile Pro Leu Lys Pro Leu Asp Leu Asp Leu Ser Lys Leu Asp Leu Lys
130 135 140

Asp Ile Tyr Ala Phe Leu Lys Asn Leu Asp Lys Glu Asn Phe Asp Ser
145 150 155 160

Arg Glu Asn Gln Arg Glu Arg Ile Glu Ser Leu Leu Glu Ser Val Asn
165 170 175

Arg His Lys Ile Pro Leu Asn Glu Gln Glu Leu Gln Ala Phe Asp Leu
180 185 190

Ala Ile Lys Ala Asn Ser Ser Tyr Tyr Lys Leu Ser Tyr Asn Leu Leu
195 200 205

Pro Leu Leu Leu Ser Leu Leu Ser Lys Lys Lys Thr Pro
210 215 220



(2) INFORMATION FOR SEQ ID NO:1027: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 58...1059
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027:

TATTCTCTCG CAATAATTGT TATTGTTATT GCGACAAAAC TTTTAGAAGG AGTTATT ATG 60

Met 
1 
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35 40 45 

AAC AAG GAT TTT AAA ATA GAG CAT GCC AAA GAA GTC ATT TCC AAA GCT 192
Asn Lys Asp Phe Lys Ile Glu His Ala Lys Glu Val Ile Ser Lys Ala
50 55 60

CAC CTA AAA CAC AGC GAA TTA AAC GCT TTT TTA ATC GCC GCT CCT AGT 240
His Leu Lys His Ser Glu Leu Asn Ala Phe Leu Ile Ala Ala Pro Ser
65 70 75 80

TAT GGT ATA GAA GCC CAA AAC GCG CTT TTA AAA ATC TTA GAA GAA CCC 288
Tyr Gly Ile Glu Ala Gln Asn Ala Leu Leu Lys Ile Leu Glu Glu Pro
85 90 95

CCG AAT AAC GTT TGT TTT ATC ATG TTC GCT AAA AGC CAA AAC CAT GTG 336
Pro Asn Asn Val Cys Phe Ile Met Phe Ala Lys Ser Gln Asn His Val
100 105 110

TTA GCC ACC ATT AAA TCC CGC CTA ATT AAA GAA GAC AAA CGC CAA AAA 384
Leu Ala Thr Ile Lys Ser Arg Leu Ile Lys Glu Asp Lys Arg Gln Lys
115 120 125

ATC CCC CTA AAA CCT TTA GAT TTG GAT TTA TCC AAG CTG GAT TTG AAA 432
Ile Pro Leu Lys Pro Leu Asp Leu Asp Leu Ser Lys Leu Asp Leu Lys
130 135 140

GAC ATT TAT GCG TTT TTA AAA AAT TTA GAC AAA GAA AAT TTT GAT TCC 480
Asp Ile Tyr Ala Phe Leu Lys Asn Leu Asp Lys Glu Asn Phe Asp Ser
145 150 155 160

AGA GAA AAT CAG AGG GAA AGG ATT GAA AGC CTG TTA GAG AGC GTT AAC 528
Arg Glu Asn Gln Arg Glu Arg Ile Glu Ser Leu Leu Glu Ser Val Asn
165 170 175

AGG CAT AAG ATC CCC TTA AAC GAG CAA GAA TTG CAA GCC TTT GAT TTA 576
Arg His Lys Ile Pro Leu Asn Glu Gln Glu Leu Gln Ala Phe Asp Leu
180 185 190

GCG ATC AAG GCT AAC AGC TCT TAT TAC AAG CTC AGC TAT AAT CTT TTA 624
Ala Ile Lys Ala Asn Ser Ser Tyr Tyr Lys Leu Ser Tyr Asn Leu Leu
195 200 205

CCC CTG CTT TTA AGC CTT TTA TCC AAA AAG AAA ACG CCA TGATTGTAAA AC 675
Pro Leu Leu Leu Ser Leu Leu Ser Lys Lys Lys Thr Pro
210 215 220

GCCTTAACCC TGATGCGCTC AAAAACGCTC T 706

(2) INFORMATION FOR SEQ ID NO:1026: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
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195 200 205

Lys Leu Ser Val Lys Leu Val Thr Arg Asn Ser Phe Asn His Ser Glu 

210 215 220 

Gly Thr Leu Ile Val Ala Glu Lys Asp Phe Lys Gly Glu Arg Met Glu
225 230 235 240

Thr Pro Ile Val Ser Gly Ile Ala Leu Asp Lys Asn Gln Ala Arg Val
245 250 255

Ser Met Glu Gly Val Glu Asp Arg Pro Gly Ile Ala Ala Glu Ile Phe
260 265 270

Gly Ala Leu Ala Glu Tyr Arg Ile Asn Val Asp Met Ile Val Gln Thr
275 280 285

Ile Gly Arg Asp Gly Lys Thr Asp Leu Asp Phe Thr Ile Val Lys Thr
290 295 300

Gln Ile Glu Glu Thr Lys Gln Ala Leu Lys Pro Phe Leu Ala Gln Met
305 310 315 320

Asp Ser Ile Asp Tyr Asp Glu Asn Ile Ala Lys Val Ser Ile Val Gly
325 330 335

Val Gly Met Lys Ser His Ser Gly Val Ala Ser Ile Ala Phe Lys Ala
340 345 350

Leu Ala Lys Asp Asn Ile Asn Ile Met Met Ile Ser Thr Ser Glu Ile
355 360 365

Lys Ile Ser Val Leu Ile Asp Ile Lys Tyr Ala Glu Leu Ala Val Arg
370 375 380

Thr Leu His Ala Val Tyr Gln Leu Asp Gln
385 390

(2) INFORMATION FOR SEQ ID NO: 1025: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...663
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025:

GGT TAC TCT GTG AAA AAC TCC AAC CGC CTT ATT TAT ACG GAC AAT CTT 48
Gly Tyr Ser Val Lys Asn Ser Asn Arg Leu Ile Tyr Thr Asp Asn Leu
1 5 10 15

GAA GAG AGC CTA GAA GAG ACT GCA AGC CTT TTT GAA CAC CAC ATT AAA 96
Glu Glu Ser Leu Glu Glu Thr Ala Ser Leu Phe Glu His His Ile Lys
20 25 30

TTC TAC ACG GAG ATT ATT GAA AAA GAC AAA AAG GTG ATC AAA ACT TTC 144
Phe Tyr Thr Glu Ile Ile Glu Lys Asp Lys Lys Val Ile Lys Thr Phe
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TCC ATA GTG GGC GTG GGC ATG AAG TCG CAT TCT GGG GTG GCG AGT ATC 1119 
Ser Ile Val Gly Val Gly Met Lys Ser His Ser Gly Val Ala Ser Ile
335 340 345

GCT TTT AAA GCC CTA GCC AAA GAC AAT ATC AAT ATC ATG ATG ATT TCT 1167
Ala Phe Lys Ala Leu Ala Lys Asp Asn Ile Asn Ile Met Met Ile Ser
350 355 360

ACA AGC GAG ATT AAA ATT TCG GTT TTG ATT GAC ATT AAA TAC GCT GAA 1215
Thr Ser Glu Ile Lys Ile Ser Val Leu Ile Asp Ile Lys Tyr Ala Glu
365 370 375 380

TTA GCT GTT AGA ACT TTG CAT GCG GTG TAT CAA TTA GAT CAA TGAAAAATT 1266
Leu Ala Val Arg Thr Leu His Ala Val Tyr Gln Leu Asp Gln
385 390

TCTACGATTG GATCAAGGAA TTTGT 1291

(2) INFORMATION FOR SEQ ID NO: 1024: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024:

Met Gly Ser Ile Glu Arg Ile His Asn Val Ala Gln Arg Val Leu Glu
1 5 10 15

Ser Val Thr Leu Gly His Gln Val Val Val Val Val Ser Ala Met Ser
20 25 30

Gly Glu Thr Asp Arg Leu Leu Glu Phe Gly Lys Asn Phe Ser His Asn
35 40 45

Pro Asn Lys Arg Glu Met Asp Arg Ile Val Ser Val Gly Glu Leu Val
50 55 60

Ser Ser Ala Ala Leu Ser Met Ala Leu Glu Arg Tyr Gly His Arg Ala
65 70 75 80

Ile Ser Leu Ser Gly Lys Glu Ala Gly Ile Leu Thr Ser Ser His Phe
85 90 95

Gln Asn Ala Val Ile Gln Ser Ile Asp Thr Lys Arg Ile Thr Glu Leu
100 105 110

Leu Glu Lys Asn Tyr Ile Val Val Ile Ala Gly Phe Gln Gly Ala Asp
115 120 125

Ile Gln Gly Glu Thr Thr Thr Leu Gly Arg Gly Gly Ser Asp Leu Ser
130 135 140

Ala Val Ala Leu Ala Gly Ala Leu Lys Ala His Leu Cys Glu Ile Tyr
145 150 155 160

Thr Asp Val Asp Gly Val Tyr Thr Thr Asp Pro Arg Ile Glu Glu Lys
165 170 175

Ala Gln Lys Ile Ala Gln Ile Ser Tyr Asp Glu Met Leu Glu Leu Ala
180 185 190

Ser Met Gly Ala Lys Val Leu Leu Asn Arg Ser Val Glu Leu Ala Lys
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ATC ACA GAG CTT TTA GAA AAA AAC TAC ATT GTG GTG ATC GCT GGG TTT 447

Ile Thr Glu Leu Leu Glu Lys Asn Tyr Ile Val Val Ile Ala Gly Phe
110 115 120

CAA GGC GCT GAT ATT CAA GGT GAA ACA ACG ACT TTA GGG CGT GGG GGG 495
Gln Gly Ala Asp Ile Gln Gly Glu Thr Thr Thr Leu Gly Arg Gly Gly
125 130 135 140

AGC GAT TTG AGC GCG GTT GCT TTG GCC GGG GCT TTA AAA GCG CAT TTG 543
Ser Asp Leu Ser Ala Val Ala Leu Ala Gly Ala Leu Lys Ala His Leu
145 150 155

TGC GAA ATC TAT ACG GAT GTG GAT GGC GTT TAT ACC ACC GAT CCG CGC 591
Cys Glu Ile Tyr Thr Asp Val Asp Gly Val Tyr Thr Thr Asp Pro Arg
160 165 170

ATT GAA GAA AAG GCT CAA AAA ATC GCG CAA ATC AGC TAT GAT GAA ATG 639
Ile Glu Glu Lys Ala Gln Lys Ile Ala Gln Ile Ser Tyr Asp Glu Met
175 180 185

CTT GAA CTG GCT TCT ATG GGG GCT AAA GTT TTA TTA AAC CGC TCG GTG 687
Leu Glu Leu Ala Ser Met Gly Ala Lys Val Leu Leu Asn Arg Ser Val
190 195 200

GAA TTA GCC AAA AAG CTC AGC GTG AAG TTA GTG ACT CGC AAT TCG TTT 735
Glu Leu Ala Lys Lys Leu Ser Val Lys Leu Val Thr Arg Asn Ser Phe
205 210 215 220

AAC CAT AGC GAA GGC ACG CTC ATT GTG GCT GAA AAA GAC TTT AAA GGA 783
Asn His Ser Glu Gly Thr Leu Ile Val Ala Glu Lys Asp Phe Lys Gly
225 230 235

GAA CGC ATG GAA ACC CCT ATA GTG AGT GGG ATC GCA TTG GAT AAA AAT 831
Glu Arg Met Glu Thr Pro Ile Val Ser Gly Ile Ala Leu Asp Lys Asn
240 245 250

CAG GCT CGT GTG AGC ATG GAG GGC GTG GAA GAT CGG CCA GGC ATT GCC 879
Gln Ala Arg Val Ser Met Glu Gly Val Glu Asp Arg Pro Gly Ile Ala
255 260 265

GCT GAA ATC TTT GGC GCT TTA GCG GAG TAT CGC ATT AAC GTG GAT ATG 927
Ala Glu Ile Phe Gly Ala Leu Ala Glu Tyr Arg Ile Asn Val Asp Met
270 275 280

ATC GTC CAA ACG ATC GGC AGA GAC GGC AAA ACC GAT TTG GAT TTT ACG 975
Ile Val Gln Thr Ile Gly Arg Asp Gly Lys Thr Asp Leu Asp Phe Thr
285 290 295 300

ATC GTT AAA ACC CAA ATA GAA GAA ACC AAG CAA GCC TTA AAG CCT TTT 1023
Ile Val Lys Thr Gln Ile Glu Glu Thr Lys Gln Ala Leu Lys Pro Phe
305 310 315

TTA GCG CAA ATG GAT TCC ATT GAT TAT GAT GAA AAT ATC GCT AAA GTC 1071
Leu Ala Gln Met Asp Ser Ile Asp Tyr Asp Glu Asn Ile Ala Lys Val
320 325 330
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Cys Ala His Phe Asp Asn Val Val Arg 

210 215 
Phe Lys Gln Tyr Glu Cys Leu Ile Thr
225 230 
Thr Thr Trp Leu Thr Gly Gly Leu Lys 
245 
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Glu Ile Leu Glu Gly Phe Ser
220 

Leu Glu Ile Ile Lys Ser Asn
235 240
Ile Ile
250 



(2) INFORMATION FOR SEQ ID NO: 1023:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1291 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 76...1257
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023:

GCCAAGTCAT TGCTTATTTC AAAAGAGAGG GGTATTTATA GGGTGTTAAT CGTTCAAAAA 60

TACGGCGGCA CGAGC ATG GGC AGC ATA GAA AGG ATC CAC AAT GTC GCT CAA 111
Met Gly Ser Ile Glu Arg Ile His Asn Val Ala Gln
1 5 10

AGG GTT TTA GAA AGC GTT ACA TTA GGG CAT CAA GTC GTG GTG GTG GTT 159
Arg Val Leu Glu Ser Val Thr Leu Gly His Gln Val Val Val Val Val
15 20 25

TCA GCG ATG AGC GGC GAA ACC GAC AGG CTT TTA GAA TTT GGC AAG AAT 207
Ser Ala Met Ser Gly Glu Thr Asp Arg Leu Leu Glu Phe Gly Lys Asn
30 35 40

TTT AGC CAT AAC CCT AAC AAG CGA GAG ATG GAC AGG ATT GTA AGC GTG 255
Phe Ser His Asn Pro Asn Lys Arg Glu Met Asp Arg Ile Val Ser Val
45 50 55 60

GGG GAA TTG GTT TCA AGT GCG GCT TTG AGC ATG GCG TTA GAA AGG TAT 303
Gly Glu Leu Val Ser Ser Ala Ala Leu Ser Met Ala Leu Glu Arg Tyr
65 70 75

GGG CAT AGA GCC ATT TCC TTG AGC GGG AAA GAA GCG GGC ATT TTA ACC 351
Gly His Arg Ala Ile Ser Leu Ser Gly Lys Glu Ala Gly Ile Leu Thr
80 85 90

AGC TCG CAT TTT CAA AAC GCC GTG ATC CAA TCC ATT GAC ACC AAA CGC 399
Ser Ser His Phe Gln Asn Ala Val Ile Gln Ser Ile Asp Thr Lys Arg
95 100 105
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Ala Tyr Ala Leu Ser Val Glu Lys Asn Thr Asn Leu Glu Lys Asn Ala 
190 195 200 

AAA AAA CCC TCA TGC GCT CAT TTT GAC AAT GTG GTG AGA GAG ATT TTA 676
Lys Lys Pro Ser Cys Ala His Phe Asp Asn Val Val Arg Glu Ile Leu
205 210 215 220

GAG GGC TTT TCT TTC AAG CAA TAC GAG TGT CTA ATT ACG CTA GAA ATT 724
Glu Gly Phe Ser Phe Lys Gln Tyr Glu Cys Leu Ile Thr Leu Glu Ile
225 230 235

ATC AAG TCA AAC ACA ACT TGG CTT ACT GGG GGG CTA AAG ATT ATT TAGGG 774
Ile Lys Ser Asn Thr Thr Trp Leu Thr Gly Gly Leu Lys Ile Ile
240 245 250

TGCGGGGCTG GGGCTGTGGG CTGCGTGGCG AATGAGCGCT TTTTTGCAAA AAA 827

(2) INFORMATION FOR SEQ ID NO:1022: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022:

Met Arg Glu Ile Asn Met Ile Leu Tyr Ile His Ile Pro Phe Cys Glu
1 5 10 15

Asn Lys Cys Gly Tyr Cys Ala Phe Asn Ser Tyr Glu Asn Lys His Gly
20 25 30

Leu Lys Glu Glu Tyr Thr Gln Ala Leu Cys Leu Asp Leu Lys His Ala
35 40 45

Leu Ser Gln Thr Asp Glu Pro Ile Glu Ser Val Phe Ile Gly Gly Gly
50 55 60

Thr Pro Asn Thr Leu Ser Val Lys Ala Phe Glu Arg Ile Phe Glu Ser
65 70 75 80

Ile Tyr Gln His Ala Ser Leu Ser Leu Asp Cys Glu Ile Thr Thr Glu
85 90 95

Ala Asn Pro Glu Leu Ile Thr Lys Ala Trp Cys Gln Gly Leu Lys Gly
100 105 110

Leu Gly Ile Asn Arg Leu Ser Leu Gly Val Gln Ser Phe Arg Glu Asp
115 120 125

Lys Leu Leu Phe Leu Glu Arg Gln His Ser Lys Asn Ile Ala Pro Ala
130 135 140

Ile Glu Thr Ile Leu Lys Ser Gly Ile Glu Asn Ile Ser Ile Asp Leu
145 150 155 160

Ile Tyr Asn Thr Pro Leu Asp Asn Glu Asn Ser Leu Lys Glu Glu Leu
165 170 175

Lys Leu Ala Lys Glu Leu Pro Ile Asn His Leu Ser Ala Tyr Ala Leu
180 185 190

Ser Val Glu Lys Asn Thr Asn Leu Glu Lys Asn Ala Lys Lys Pro Ser
195 200 205
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(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021: 

TTAAAATGAA GTGAAA ATG AGA GAA ATA AAT ATG ATT TTA TAC ATT CAT ATC 52
Met Arg Glu Ile Asn Met Ile Leu Tyr Ile His Ile
1 5 10

CCC TTT TGT GAA AAT AAA TGC GGC TAT TGC GCT TTC AAT TCC TAT GAA 100
Pro Phe Cys Glu Asn Lys Cys Gly Tyr Cys Ala Phe Asn Ser Tyr Glu
15 20 25

AAC AAG CAT GGG TTA AAA GAA GAA TAC ACT CAA GCG TTA TGC CTG GAT 148
Asn Lys His Gly Leu Lys Glu Glu Tyr Thr Gln Ala Leu Cys Leu Asp
30 35 40

TTA AAG CAT GCG TTA AGT CAA ACT GAC GAA CCA ATT GAA AGC GTT TTT 196
Leu Lys His Ala Leu Ser Gln Thr Asp Glu Pro Ile Glu Ser Val Phe
45 50 55 60

ATT GGT GGC GGC ACG CCT AAC ACT TTA AGC GTG AAG GCT TTT GAA AGG 244
Ile Gly Gly Gly Thr Pro Asn Thr Leu Ser Val Lys Ala Phe Glu Arg
65 70 75

ATT TTT GAA AGC ATT TAT CAA CAT GCG AGC TTG AGC TTG GAT TGT GAG 292
Ile Phe Glu Ser Ile Tyr Gln His Ala Ser Leu Ser Leu Asp Cys Glu
80 85 90

ATC ACC ACT GAA GCT AAC CCC GAA TTG ATT ACT AAA GCT TGG TGT CAA 340
Ile Thr Thr Glu Ala Asn Pro Glu Leu Ile Thr Lys Ala Trp Cys Gln
95 100 105

GGC TTA AAA GGT TTA GGG ATC AAC CGC TTG AGT TTA GGG GTG CAA AGT 388
Gly Leu Lys Gly Leu Gly Ile Asn Arg Leu Ser Leu Gly Val Gln Ser
110 115 120

TTT AGG GAA GAT AAA TTA TTG TTT TTA GAG CGC CAA CAT TCC AAA AAT 436
Phe Arg Glu Asp Lys Leu Leu Phe Leu Glu Arg Gln His Ser Lys Asn
125 130 135 140

ATC GCT CCT GCG ATA GAA ACT ATT TTA AAA AGC GGG ATT GAA AAT ATC 484
Ile Ala Pro Ala Ile Glu Thr Ile Leu Lys Ser Gly Ile Glu Asn Ile
145 150 155

AGC ATT GAT TTG ATT TAT AAC ACC CCA TTA GAC AAT GAA AAC TCT CTA 532
Ser Ile Asp Leu Ile Tyr Asn Thr Pro Leu Asp Asn Glu Asn Ser Leu
160 165 170

AAA GAA GAA TTA AAA CTC GCT AAA GAA CTC CCT ATC AAC CAC TTG AGC 580
Lys Glu Glu Leu Lys Leu Ala Lys Glu Leu Pro Ile Asn His Leu Ser
175 180 185

GCT TAC GCT TTG AGC GTT GAA AAA AAC ACG AAT TTA GAA AAA AAC GCC 628
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100 105 110

Phe Val Lys Glu Cys Gly Ile Lys Ser Ala Ser Tyr Phe Glu Thr Asn
115 120 125

Asp Leu Lys Glu Ala Leu Ser Tyr Ile Gln Asn Ala Ser Phe Pro Leu
130 135 140

Val Ile Lys Ala Leu Asn Lys Asn Thr Ser Ile Val Tyr Gln Glu Glu
145 150 155 160

Glu Ala Ile Lys Ile Leu Glu Asp Ala Phe Lys Gln Ser Asn Glu Pro
165 170 175

Val Ile Ile Glu Pro Phe Leu Glu Gly Phe Glu Leu Ser Val Thr Ala
180 185 190

Leu Ile Ala Asn Asp Asp Phe Ile Leu Leu Pro Phe Cys Gln Asn Tyr
195 200 205

Lys Arg Leu Leu Glu Gly Asp Asn Gly Val Asn Thr Gly Gly Met Gly
210 215 220

Ala Ile Ala Pro Ala Asn Phe Phe Ser Asn Glu Leu Glu Glu Lys Ile
225 230 235 240

Lys Asn His Ile Phe Lys Pro Thr Leu Glu Lys Leu Gln Ala Asp Asn
245 250 255

Thr Pro Phe Lys Gly Val Leu Leu Ala Glu Ile Val Ile Ile Glu Glu
260 265 270

Lys Gly Val Leu Glu Pro Tyr Leu Leu Asp Phe Ser Val Arg Phe Lys
275 280 285

Asp Ile Glu Cys Gln Thr Ile Leu Pro Leu Leu Glu Ser Ser Leu Leu
290 295 300

Asp Leu Cys Leu Ala Thr Ala Lys Gly Glu Leu His Ser Leu Glu Leu
305 310 315 320

Val Phe Ser Lys Glu Phe Val Met Ser Val Ala Leu Val Ser Arg Asn
325 330 335

Tyr Pro Thr Ser Ser Ser Pro Lys Gln Thr Leu Tyr Ile Asp Pro Val
340 345 350

Asp Glu Lys Lys Gly His Leu Ile Leu Gly Glu Val Glu Gln Asp Asn
355 360 365

Gly Val Phe Glu Ser Ser Gly Gly Arg Val Ile Phe Ala Ile Gly Arg
370 375 380

Gly Lys Ser Leu Leu Glu Ala Arg Asn His Ala Tyr Glu Ile Ala Gln
385 390 395 400

Lys Val His Phe Glu Gly Met Phe Tyr Arg Lys Asp Ile Gly Phe Lys
405 410 415

Val Leu Asp Leu Lys Glu Tyr Ser
420

(2) INFORMATION FOR SEQ ID NO: 1021:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 17...769
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GAA TTA CAT TCT CTT GAA TTG GTG TTT TCT AAA GAA TTT GTG ATG AGT 1011 
Glu Leu His Ser Leu Glu Leu Val Phe Ser Lys Glu Phe Val Met Ser 
315 320 325 

GTG GCG CTT GTT TCT AGG AAT TAC CCC ACT AGC TCT TCG CCC AAA CAA 1059
Val Ala Leu Val Ser Arg Asn Tyr Pro Thr Ser Ser Ser Pro Lys Gln
330 335 340 345

ACC CTT TAT ATT GAT CCG GTT GAT GAA AAA AAG GGT CAT TTG ATT TTA 1107
Thr Leu Tyr Ile Asp Pro Val Asp Glu Lys Lys Gly His Leu Ile Leu
350 355 360

GGG GAG GTG GAG CAG GAT AAT GGC GTG TTT GAA AGC AGT GGG GGG AGG 1155
Gly Glu Val Glu Gln Asp Asn Gly Val Phe Glu Ser Ser Gly Gly Arg
365 370 375

GTG ATC TTT GCC ATT GGT AGG GGA AAA TCC TTA TTA GAA GCC AGA AAC 1203
Val Ile Phe Ala Ile Gly Arg Gly Lys Ser Leu Leu Glu Ala Arg Asn
380 385 390

CAT GCT TAT GAA ATC GCT CAA AAG GTG CAT TTT GAA GGC ATG TTT TAT 1251
His Ala Tyr Glu Ile Ala Gln Lys Val His Phe Glu Gly Met Phe Tyr
395 400 405

CGC AAG GAT ATT GGT TTT AAG GTG TTA GAT TTG AAA GAA TAT TCT TAAAG 1301
Arg Lys Asp Ile Gly Phe Lys Val Leu Asp Leu Lys Glu Tyr Ser
410 415 420

GTTAAAGTTT AAGACAAACC AAAGAGTTTG TCTTGTTTG 1340

(2) INFORMATION FOR SEQ ID NO: 1020: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1020:

Met Lys Asp Asn Asn Asn Tyr Asn Val Leu Ile Val Gly Asn Lys Gly
1 5 10 15

Arg Glu Tyr Ala Leu Ala Gln Arg Leu Gln Gln Asp Glu Arg Val Asn
20 25 30

Ala Leu Tyr Phe Cys Leu Gly Asn Gly Gly Thr Gln Asp Leu Gly Glu
35 40 45

Asn Leu Glu Cys Glu His Tyr Glu His Ile Val Glu Leu Ala Leu Lys
50 55 60

Lys Gln Ile His Leu Ala Ile Ile Ser Glu Glu Glu Phe Leu Val Leu
65 70 75 80

Gly Leu Thr Glu Met Leu Glu Lys Ala Gly Ile Leu Val Phe Gly Ala
85 90 95

Ser Lys Glu Ala Ala Lys Leu Glu Ala Ser Lys Ser Tyr Met Lys Ala
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Gly Ile Leu Val Phe Gly Ala Ser Lys Glu Ala Ala Lys Leu Glu Ala
90 95 100 105 

TCT AAA AGC TAT ATG AAA GCT TTT GTT AAA GAG TGT GGC ATC AAA AGT 387
Ser Lys Ser Tyr Met Lys Ala Phe Val Lys Glu Cys Gly Ile Lys Ser
110 115 120

GCG TCT TAC TTT GAA ACA AAC GAC TTA AAA GAA GCT TTG AGT TAC ATT 435
Ala Ser Tyr Phe Glu Thr Asn Asp Leu Lys Glu Ala Leu Ser Tyr Ile
125 130 135

CAA AAC GCT TCT TTC CCC TTA GTC ATT AAA GCG TTG AAT AAA AAC ACA 483
Gln Asn Ala Ser Phe Pro Leu Val Ile Lys Ala Leu Asn Lys Asn Thr
140 145 150

AGC ATT GTC TAT CAA GAA GAA GAA GCG ATA AAA ATC CTT GAA GAC GCT 531
Ser Ile Val Tyr Gln Glu Glu Glu Ala Ile Lys Ile Leu Glu Asp Ala
155 160 165

TTC AAA CAA AGC AAT GAG CCT GTG ATT ATA GAG CCT TTT TTA GAG GGA 579
Phe Lys Gln Ser Asn Glu Pro Val Ile Ile Glu Pro Phe Leu Glu Gly
170 175 180 185

TTT GAG CTT TCA GTT ACA GCG CTC ATA GCC AAT GAT GAT TTT ATC TTG 627
Phe Glu Leu Ser Val Thr Ala Leu Ile Ala Asn Asp Asp Phe Ile Leu
190 195 200

TTG CCC TTT TGC CAA AAC TAC AAA CGC TTA TTA GAG GGG GAT AAT GGG 675
Leu Pro Phe Cys Gln Asn Tyr Lys Arg Leu Leu Glu Gly Asp Asn Gly
205 210 215

GTC AAT ACG GGG GGT ATG GGG GCC ATC GCT CCT GCA AAC TTT TTC TCT 723
Val Asn Thr Gly Gly Met Gly Ala Ile Ala Pro Ala Asn Phe Phe Ser
220 225 230

AAT GAA TTA GAA GAG AAA ATA AAA AAT CAT ATC TTT AAA CCC ACT TTA 771
Asn Glu Leu Glu Glu Lys Ile Lys Asn His Ile Phe Lys Pro Thr Leu
235 240 245

GAG AAA CTT CAG GCT GAC AAC ACG CCT TTT AAA GGG GTT TTA CTC GCT 819
Glu Lys Leu Gln Ala Asp Asn Thr Pro Phe Lys Gly Val Leu Leu Ala
250 255 260 265

GAA ATT GTA ATC ATA GAA GAA AAA GGC GTT TTA GAG CCG TAT TTA TTG 867
Glu Ile Val Ile Ile Glu Glu Lys Gly Val Leu Glu Pro Tyr Leu Leu
270 275 280

GAT TTT AGC GTG CGT TTT AAA GAC ATT GAA TGC CAG ACG ATT TTA CCC 915
Asp Phe Ser Val Arg Phe Lys Asp Ile Glu Cys Gln Thr Ile Leu Pro
285 290 295

CTT TTA GAA AGC TCG CTT TTA GAT TTG TGT TTG GCC ACA GCC AAA GGG 963
Leu Leu Glu Ser Ser Leu Leu Asp Leu Cys Leu Ala Thr Ala Lys Gly
300 305 310
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lie Leu Asn Tyr Leu His Ser Leu Asp Gin Glu Leu Glu Gin Tyr Ala 

610 615 620 

Ile Asp Glu Val Leu Glu Ala Gln Val Lys Arg Ile Val Asp Phe Gly
625 630 635 640

Ala Phe Leu Ser Leu Pro Lys Gly Gly Glu Gly Leu Leu Arg Lys Gln

645 650 655

Asn Met Asp Lys Cys Gln Val Val Leu Lys Glu Gly Asp Ser Ile Arg

660 665 670

Cys Arg Val Ile Ser Phe Asn Lys Gly Lys Ile Ala Leu Asp Leu Ala
675 680 685 

(2) INFORMATION FOR SEQ ID NO: 1019: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25... 1296 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019: 

AAACAACCAC ATTGCAGGAA AGAC ATG AAA GAT AAC AAT AAC TAT AAT GTT 51 

Met Lys Asp Asn Asn Asn Tyr Asn Val 
1 5 

TTA ATT GTG GGG AAT AAG GGG CGA GAG TAT GCT TTG GCT CAA AGG CTT 99 
Leu Ile Val Gly Asn Lys Gly Arg Glu Tyr Ala Leu Ala Gln Arg Leu
10 15 20 25

CAG CAA GAT GAG CGA GTG AAT GCT TTG TAT TTT TGT TTG GGT AAT GGT 147

Gln Gln Asp Glu Arg Val Asn Ala Leu Tyr Phe Cys Leu Gly Asn Gly
30 35 40

GGC ACT CAA GAT TTA GGC GAG AAT CTG GAA TGC GAA CAT TAC GAG CAT 195
Gly Thr Gln Asp Leu Gly Glu Asn Leu Glu Cys Glu His Tyr Glu His
45 50 55

ATC GTG GAA TTA GCC CTG AAA AAA CAG ATC CAT TTA GCC ATC ATT TCA 243
Ile Val Glu Leu Ala Leu Lys Lys Gln Ile His Leu Ala Ile Ile Ser
60 65 70

GAA GAA GAG TTT TTG GTT TTA GGG CTT ACA GAA ATG CTA GAA AAA GCG 291

Glu Glu Glu Phe Leu Val Leu Gly Leu Thr Glu Met Leu Glu Lys Ala
75 80 85

GGG ATT TTA GTG TTT GGG GCT TCT AAA GAA GCG GCT AAG TTA GAG GCT 339
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Gly Thr Lys Glu 
Ser Leu Asn Met Ile Glu Met Arg Ser Leu Gly Gln
180 185 190

Lys Leu Asn Ala Leu Glu Glu Pro Leu Met Leu Glu Ala Leu Glu Leu
195 200 205

Ala Gln Lys Ser Leu Glu Glu Thr Cys Thr Leu Tyr Glu Glu Ile Phe
210 215 220

Thr Pro His Gln Asn Glu Leu Phe Phe Lys Glu Ser Gln Gly Ile Val
225 230 235 240

Phe Asn Glu Arg Leu Leu Asp Leu Leu Lys Asn Gln Tyr Phe Asp Glu
245 250 255

Ile Ile Lys Gly Ile Glu Ser Ser Ala Leu Ser Glu Arg Glu Asn Val
260 265 270

Phe Asn Glu Ile Ala Arg Lys Ile Ser Glu Ala His Ser Glu Phe Ser
275 280 285

Leu Glu Glu Ile Glu Leu Ser Leu Glu Lys Val Lys Lys Thr Glu Ile
290 295 300

Arg Arg Met Ile Ile Lys Asp Lys Ile Arg Pro Asp Lys Arg Ala Leu
305 310 315 320

Glu Glu Val Arg Pro Ile Leu Ile Glu Ser Asp Leu Leu Pro Met Ala
325 330 335

His Ser Ser Ile Leu Phe Thr Arg Gly Gln Thr Gln Ser Leu Val Val
340 345 350

Gly Val Leu Gly Thr Asp Asn Asp Ala Gln Thr His Glu Ser Leu Glu
355 360 365

His Lys Ala Pro Ile Lys Glu Arg Phe Met Phe His Tyr Asn Phe Pro
370 375 380

Pro Phe Cys Val Gly Glu Ala Ser Ser Ile Gly Ala Ala Ser Arg Arg
385 390 395 400

Glu Leu Gly His Gly Asn Leu Ala Lys Arg Ala Leu Glu Thr Ser Ile
405 410 415

Lys Asn Lys Glu Gln Val Ile Arg Leu Val Ser Glu Ile Leu Glu Ser
420 425 430

Asn Gly Ser Ser Ser Met Ala Ser Val Cys Ala Gly Ser Leu Ala Leu
435 440 445

Tyr Ala Ser Gly Val Glu Ile Tyr Asp Leu Val Ala Gly Val Ala Met
450 455 460

Gly Met Val Ser Glu Gly Gln Asp His Ala Ile Leu Ser Asp Ile Ser
465 470 475 480

Gly Leu Glu Asp Ala Glu Gly Asp Met Asp Phe Lys Ile Ala Gly Asn
485 490 495

Leu Glu Gly Ile Thr Ala Met Gln Met Asp Thr Lys Met Ser Gly Ile
500 505 510

Lys Leu Glu Ile Leu Tyr Gln Ala Leu Leu Gln Ala Lys Glu Ala Arg
515 520 525

Lys His Ile Leu Lys Ile Met His Glu Ala Lys Glu Lys Ile Val Ile
530 535 540

Asn Phe Ser His Leu Pro Thr Thr Glu Ile Phe Asn Val Ala Pro Asp
545 550 555 560

Lys Ile Val Glu Ile Ile Gly Gln Gly Gly Arg Val Ile Lys Glu Ile
565 570 575

Val Glu Lys Phe Glu Val Lys Ile Asp Leu Asn Lys Pro Ser Gly Glu
580 585 590

Val Lys Ile Met Gly Asn Lys Glu Arg Val Leu Lys Thr Lys Glu Phe
595 600 605
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CAA GAA TTG GAG CAA TAG GCT ATT GAT GAG GTA TTA GAA GCT CAA GTG 192 3 

Gln Glu Leu Glu Gln Tyr Ala Ile Asp Glu Val Leu Glu Ala Gln Val
620 625 630

AAA CGA ATC GTG GAT TTT GGG GCG TTT TTA AGC TTG CCT AAG GGG GGC 1971
Lys Arg Ile Val Asp Phe Gly Ala Phe Leu Ser Leu Pro Lys Gly Gly
635 640 645

GAA GGC TTG TTA AGA AAG CAA AAC ATG GAC AAG TGT CAA GTG GTT TTA 2019
Glu Gly Leu Leu Arg Lys Gln Asn Met Asp Lys Cys Gln Val Val Leu
650 655 660 665

AAA GAA GGC GAT AGC ATC AGG TGT AGG GTG ATT AGC TTC AAT AAG GGT 2067

Lys Glu Gly Asp Ser Ile Arg Cys Arg Val Ile Ser Phe Asn Lys Gly
670 675 680

AAA ATC GCT TTA GAT TTG GCT TAAAATTTTA AAAAGCGTTT TTTAAAAGCG TTTT 2122
Lys Ile Ala Leu Asp Leu Ala
685

TAAGCTAGTT T 2133

(2) INFORMATION FOR SEQ ID NO: 1018: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1018:

Met Asp Phe Ile Thr Ile Asn Ser Ser Asn Lys Thr Glu Glu Phe Ala

1 5 10 15

Leu Lys Gln Val Ala Lys Gln Ala Thr Ser Ser Leu Leu Tyr Arg Leu

20 25 30

Gly Lys Thr Ile Ile Leu Ala Ser Val Cys Val Glu Arg Glu Pro Val

35 40 45

Ser Glu Asp Phe Leu Pro Leu Val Val Gln Phe Leu Glu Lys Ser Tyr

50 55 60

Ala Ala Gly Lys Ile Pro Gly Gly Phe Val Lys Arg Glu Gly Arg Ala
65 70 75 80

Gln Asp Phe Glu Ile Leu Thr Ser Arg Leu Ile Asp Arg Thr Leu Arg

85 90 95

Pro Leu Phe Pro Lys Asp Tyr Arg Tyr Pro Thr Gln Ile Thr Leu Met

100 105 110

Val Leu Ser His Asp Ile Glu Asn Asp Leu Gln Val Ser Ala Leu Asn

115 120 125

Ala Ala Ser Ala Ala Leu Phe Leu Ala His Ile Ala Pro Ile Lys Ser

130 135 140

Val Ser Ala Cys Arg Ile Ala Arg Met Asp Asn Glu Phe Ile Ile Asn
145 150 155 160

Pro Ser Ala Ser Leu Leu Asn Gln Ser Ser Leu Asp Leu Phe Val Ser
165 170 175
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ATT GGC GCG GCT TCA AGG CGT GAA TTA GGG CAT GGG AAT TTG GCT AAA 1251 
Ile Gly Ala Ala Ser Arg Arg Glu Leu Gly His Gly Asn Leu Ala Lys
395 400 405

AGA GCC TTA GAA ACG AGC ATT AAA AAT AAA GAG CAG GTG ATA CGA TTG 1299

Arg Ala Leu Glu Thr Ser Ile Lys Asn Lys Glu Gln Val Ile Arg Leu
410 415 420 425

GTT TCT GAG ATT TTA GAA AGC AAT GGT TCA AGC TCA ATG GCG AGC GTG 1347

Val Ser Glu Ile Leu Glu Ser Asn Gly Ser Ser Ser Met Ala Ser Val
430 435 440

TGC GCA GGC TCT TTA GCC CTT TAT GCA AGC GGT GTG GAA ATT TAC GAT 1395

Cys Ala Gly Ser Leu Ala Leu Tyr Ala Ser Gly Val Glu Ile Tyr Asp
445 450 455

TTA GTC GCT GGG GTG GCT ATG GGC ATG GTG AGC GAA GGG CAA GAT CAC 1443

Leu Val Ala Gly Val Ala Met Gly Met Val Ser Glu Gly Gln Asp His
460 465 470

GCT ATT TTA AGC GAT ATT AGC GGC TTA GAA GAC GCA GAA GGC GAT ATG 1491

Ala Ile Leu Ser Asp Ile Ser Gly Leu Glu Asp Ala Glu Gly Asp Met
475 480 485

GAT TTT AAG ATT GCT GGG AAT TTA GAA GGC ATT ACG GCC ATG CAA ATG 1539

Asp Phe Lys Ile Ala Gly Asn Leu Glu Gly Ile Thr Ala Met Gln Met
490 495 500 505

GAT ACC AAA ATG AGC GGT ATC AAG CTA GAA ATT TTA TAC CAA GCC TTA 1587

Asp Thr Lys Met Ser Gly Ile Lys Leu Glu Ile Leu Tyr Gln Ala Leu
510 515 520

CTC CAA GCC AAA GAA GCA CGG AAA CAT ATT TTA AAA ATC ATG CAT GAA 1635

Leu Gln Ala Lys Glu Ala Arg Lys His Ile Leu Lys Ile Met His Glu
525 530 535

GCG AAA GAA AAG ATT GTG ATC AAT TTT TCC CAT TTG CCC ACA ACG GAG 1683

Ala Lys Glu Lys Ile Val Ile Asn Phe Ser His Leu Pro Thr Thr Glu
540 545 550

ATT TTT AAT GTC GCA CCC GAT AAA ATT GTA GAA ATT ATC GGT CAA GGG 1731

Ile Phe Asn Val Ala Pro Asp Lys Ile Val Glu Ile Ile Gly Gln Gly
555 560 565

GGG CGT GTG ATT AAA GAG ATA GTA GAA AAG TTT GAA GTT AAA ATT GAT 1779

Gly Arg Val Ile Lys Glu Ile Val Glu Lys Phe Glu Val Lys Ile Asp
570 575 580 585

TTG AAC AAA CCG AGC GGT GAA GTG AAA ATC ATG GGG AAT AAA GAG CGC 1827

Leu Asn Lys Pro Ser Gly Glu Val Lys Ile Met Gly Asn Lys Glu Arg
590 595 600

GTT TTA AAG ACT AAG GAA TTT ATT TTA AAC TAC TTG CAT TCT TTA GAT 1875
Val Leu Lys Thr Lys Glu Phe Ile Leu Asn Tyr Leu His Ser Leu Asp
605 610 615 
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AGT TTG GAT TTG TTC GTG TCT GGA ACG AAA GAG AGT TTG AAC ATG ATA 57 9 

Ser Leu Asp Leu Phe Val Ser Gly Thr Lys Glu Ser Leu Asn Met Ile
170 175 180 185

GAA ATG CGC TCT TTG GGG CAA AAA TTG AAC GCT TTA GAA GAG CCT TTA 627

Glu Met Arg Ser Leu Gly Gln Lys Leu Asn Ala Leu Glu Glu Pro Leu
190 195 200

ATG TTA GAA GCT TTA GAA TTG GCT CAA AAA AGT TTG GAA GAA ACT TGC 675

Met Leu Glu Ala Leu Glu Leu Ala Gln Lys Ser Leu Glu Glu Thr Cys
205 210 215

ACG CTT TAT GAA GAG ATT TTC ACG CCC CAC CAA AAC GAG CTG TTT TTC 723

Thr Leu Tyr Glu Glu Ile Phe Thr Pro His Gln Asn Glu Leu Phe Phe
220 225 230

AAA GAG AGC CAA GGA ATA GTC TTT AAT GAA AGG CTG TTA GAT TTA TTG 771
Lys Glu Ser Gln Gly Ile Val Phe Asn Glu Arg Leu Leu Asp Leu Leu
235 240 245

AAA AAT CAG TAT TTT GAT GAA ATC ATC AAA GGC ATT GAA AGT TCT GCT 819
Lys Asn Gln Tyr Phe Asp Glu Ile Ile Lys Gly Ile Glu Ser Ser Ala
250 255 260 265

TTG AGC GAG CGA GAA AAT GTT TTC AAT GAA ATT GCC AGA AAA ATC AGT 867
Leu Ser Glu Arg Glu Asn Val Phe Asn Glu Ile Ala Arg Lys Ile Ser
270 275 280

GAA GCC CAC TCA GAA TTC AGT TTA GAA GAA ATT GAA TTG TCT TTA GAA 915
Glu Ala His Ser Glu Phe Ser Leu Glu Glu Ile Glu Leu Ser Leu Glu
285 290 295

AAA GTG AAA AAG ACT GAG ATA AGA CGC ATG ATC ATT AAG GAT AAA ATC 963
Lys Val Lys Lys Thr Glu Ile Arg Arg Met Ile Ile Lys Asp Lys Ile
300 305 310

CGC CCG GAT AAG CGC GCG TTA GAA GAA GTG CGG CCC ATT TTG ATA GAG 1011
Arg Pro Asp Lys Arg Ala Leu Glu Glu Val Arg Pro Ile Leu Ile Glu
315 320 325

AGC GAT TTG CTC CCT ATG GCG CAT AGC TCC ATT TTA TTC ACT AGG GGG 1059
Ser Asp Leu Leu Pro Met Ala His Ser Ser Ile Leu Phe Thr Arg Gly
330 335 340 345

CAA ACT CAA AGC TTA GTG GTA GGG GTT TTA GGC ACG GAT AAT GAC GCT 1107
Gln Thr Gln Ser Leu Val Val Gly Val Leu Gly Thr Asp Asn Asp Ala
350 355 360

CAA ACC CAT GAG AGT TTG GAG CAT AAA GCT CCC ATT AAA GAG CGC TTC 1155
Gln Thr His Glu Ser Leu Glu His Lys Ala Pro Ile Lys Glu Arg Phe
365 370 375

ATG TTT CAT TAT AAT TTC CCT CCT TTC TGC GTG GGC GAA GCG AGT TCT 1203

Met Phe His Tyr Asn Phe Pro Pro Phe Cys Val Gly Glu Ala Ser Ser 
380 385 390 
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25... 2088 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1017: 

TAATTTAAAA AAGGAACATT AAAT ATG GAT TTT ATC ACC ATC AAT TCT AGT 51 

Met Asp Phe Ile Thr Ile Asn Ser Ser
1 5

AAC AAA ACC GAA GAG TTC GCT CTC AAA CAA GTG GCC AAA CAA GCC ACC 99
Asn Lys Thr Glu Glu Phe Ala Leu Lys Gln Val Ala Lys Gln Ala Thr
10 15 20 25

AGC TCT CTT TTA TAC CGA TTA GGA AAA ACC ATT ATT TTA GCG AGC GTG 147

Ser Ser Leu Leu Tyr Arg Leu Gly Lys Thr Ile Ile Leu Ala Ser Val
30 35 40

TGC GTG GAA AGA GAG CCT GTG AGT GAA GAT TTT CTG CCT TTA GTG GTG 195
Cys Val Glu Arg Glu Pro Val Ser Glu Asp Phe Leu Pro Leu Val Val
45 50 55

CAG TTT TTA GAA AAA TCT TAT GCA GCC GGA AAG ATC CCG GGC GGT TTT 243

Gln Phe Leu Glu Lys Ser Tyr Ala Ala Gly Lys Ile Pro Gly Gly Phe
60 65 70

GTT AAA AGA GAA GGC AGG GCG CAA GAT TTT GAA ATC TTA ACC TCT AGG 291

Val Lys Arg Glu Gly Arg Ala Gln Asp Phe Glu Ile Leu Thr Ser Arg
75 80 85

CTC ATA GAC AGG ACT TTA CGC CCT TTA TTC CCT AAA GAC TAC CGC TAC 339

Leu Ile Asp Arg Thr Leu Arg Pro Leu Phe Pro Lys Asp Tyr Arg Tyr
90 95 100 105

CCT ACA CAG ATC ACT TTA ATG GTT TTA AGC CAT GAT ATT GAA AAT GAC 387

Pro Thr Gln Ile Thr Leu Met Val Leu Ser His Asp Ile Glu Asn Asp
110 115 120

TTG CAG GTT TCT GCT TTA AAC GCC GCT TCA GCC GCT CTC TTT TTG GCC 435
Leu Gln Val Ser Ala Leu Asn Ala Ala Ser Ala Ala Leu Phe Leu Ala
125 130 135

CAT ATC GCT CCT ATT AAA AGC GTG AGC GCT TGC AGG ATC GCT AGG ATG 483

His Ile Ala Pro Ile Lys Ser Val Ser Ala Cys Arg Ile Ala Arg Met
140 145 150

GAT AAC GAA TTT ATC ATT AAC CCT AGC GCA AGC CTT TTG AAT CAA TCC 531
Asp Asn Glu Phe Ile Ile Asn Pro Ser Ala Ser Leu Leu Asn Gln Ser
155 160 165 
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TCT TGG AGT CAA AAA GAA TTG GAG ATT TTC CAA AAA GAA TTG ACT GCA 54 3 

Ser Trp Ser Gln Lys Glu Leu Glu Ile Phe Gln Lys Glu Leu Thr Ala
145 150 155

ATC AAA ATA GAA GGG GCC CGC TAC CCT GAA AGA ATC AAT GAA ATG GTG 591

Ile Lys Ile Glu Gly Ala Arg Tyr Pro Glu Arg Ile Asn Glu Met Val
160 165 170

AAT CAA TAAAAGTATT GGGTATTTAT AATTGCATTG GCTCTTTTAA AA 639

Asn Gln

175 



(2) INFORMATION FOR SEQ ID NO: 1016: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016: 



Met Ser Glu Ala Gly Leu Ser Ser Ile Gln Lys Ala His Gln Ile Cys

1 5 10 15

Pro Leu Ser Ala Leu Gln Ser Glu Tyr Ser Leu Trp Trp Arg Glu Pro

20 25 30

Glu Lys Glu Ile Leu Gly Phe Leu Glu Lys Glu Lys Ile Gly Phe Val

35 40 45

Ala Phe Ser Pro Leu Gly Lys Gly Phe Leu Gly Ala Lys Phe Glu Lys

50 55 60

Asn Ala Thr Phe Ala Ser Glu Asp Phe Arg Ser Val Ser Pro Arg Phe
65 70 75 80

Asn Gln Glu Asn Leu Ala Lys Asn Tyr Ala Leu Val Glu Leu Ile Gln

85 90 95

Asp His Ala His Ala Lys Gly Val Thr Pro Ala Gln Leu Ala Leu Ser

100 105 110

Trp Ile Leu His Thr Gln Lys Ile Ile Val Pro Leu Phe Gly Thr Thr

115 120 125

Lys Glu Ser Arg Leu Ile Glu Asn Ile Gly Ala Leu Gln Val Ser Trp

130 135 140

Ser Gln Lys Glu Leu Glu Ile Phe Gln Lys Glu Leu Thr Ala Ile Lys
145 150 155 160

Ile Glu Gly Ala Arg Tyr Pro Glu Arg Ile Asn Glu Met Val Asn Gln
165 170 175

(2) INFORMATION FOR SEQ ID NO: 1017:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID NO: 1015: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70... 597 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015: 

CACGCCCATA GGAGAAGTGG CAGAAGTTAT GCAGCTCTTA TTAAAGAAGG AAAAATTAAA 60

GCTTGGGGG ATG AGT GAG GCA GGG TTA TCT AGC ATC CAA AAA GCC CAT CAA 111
Met Ser Glu Ala Gly Leu Ser Ser Ile Gln Lys Ala His Gln
1 5 10

ATT TGC CCT TTA AGC GCG TTG CAG AGC GAA TAT TCC TTG TGG TGG CGC 159
Ile Cys Pro Leu Ser Ala Leu Gln Ser Glu Tyr Ser Leu Trp Trp Arg
15 20 25 30

GAA CCT GAA AAA GAG ATT TTA GGT TTT TTA GAA AAA GAA AAA ATT GGA 207

Glu Pro Glu Lys Glu Ile Leu Gly Phe Leu Glu Lys Glu Lys Ile Gly
35 40 45

TTT GTC GCT TTT TCG CCT TTG GGT AAG GGG TTT TTA GGC GCG AAA TTT 255

Phe Val Ala Phe Ser Pro Leu Gly Lys Gly Phe Leu Gly Ala Lys Phe
50 55 60

GAA AAA AAT GCC ACT TTC GCT AGT GAG GAT TTT AGA AGC GTT TCT CCT 303

Glu Lys Asn Ala Thr Phe Ala Ser Glu Asp Phe Arg Ser Val Ser Pro
65 70 75

AGG TTT AAT CAA GAA AAT CTA GCC AAA AAT TAC GCC TTG GTG GAA TTA 351
Arg Phe Asn Gln Glu Asn Leu Ala Lys Asn Tyr Ala Leu Val Glu Leu
80 85 90

ATC CAA GAT CAT GCA CAC GCT AAA GGC GTT ACA CCA GCC CAA CTG GCT 399

Ile Gln Asp His Ala His Ala Lys Gly Val Thr Pro Ala Gln Leu Ala
95 100 105 110

CTC TCA TGG ATT TTG CAC ACG CAA AAA ATC ATT GTC CCT CTC TTT GGC 447

Leu Ser Trp Ile Leu His Thr Gln Lys Ile Ile Val Pro Leu Phe Gly
115 120 125

ACC ACC AAA GAA TCT AGG CTC ATA GAA AAT ATA GGG GCT TTG CAG GTT 495

Thr Thr Lys Glu Ser Arg Leu Ile Glu Asn Ile Gly Ala Leu Gln Val
130 135 140 
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Asp Ala lie Gin 
Lys Ala Gln Leu Leu Ala Lys Val Ser Val Val Phe
20 25 30

Gln Ser Phe Gly Phe Val Pro Ile Glu Thr Pro His Leu Glu Tyr Ala
35 40 45

Gln Thr Leu Leu Pro Asp Ala Ser Ser Asp Ile Gln Lys Glu Ile Tyr
50 55 60

Arg Phe Lys Asp His Gly Asp Arg Asp Val Ala Leu Arg Phe Asp Leu
65 70 75 80

Thr Val Pro Leu Ala Arg Phe Val Ser Leu His His Gln Thr Leu Gly
85 90 95

Met Pro Phe Lys Arg Tyr Ala Ile Gly Asn Val Phe Arg Gly Glu Arg
100 105 110

Ala Gln Lys Gly Arg Tyr Arg Glu Phe Thr Gln Cys Asp Phe Asp Phe
115 120 125

Ile Gly Ser Glu Ser Leu Val Cys Asp Ala Glu Ile Ile Gln Val Ile
130 135 140

Val Ala Ser Leu Lys Ala Leu Asp Leu Glu Asp Phe Cys Val Ser Ile
145 150 155 160

Asn His Arg Lys Ile Leu Asn Gly Ile Cys Glu Tyr Phe Gly Ile Ser
165 170 175

Gln Val Asn Glu Ala Leu Arg Ile Val Asp Lys Leu Glu Lys Ile Gly
180 185 190

Leu Asn Gly Val Glu Glu Glu Leu Lys Lys Glu Cys Gly Leu Asn Ser
195 200 205

Asn Thr Ile Lys Glu Leu Leu Glu Leu Ile Gln Ile Lys Gln Asn Asp
210 215 220

Leu Ser His Ala Glu Phe Phe Glu Lys Ile Ala Tyr Leu Lys Asp Tyr
225 230 235 240

Asn Glu Asn Leu Lys Lys Gly Ile Gln Asp Leu Glu Arg Leu Tyr Gln
245 250 255

Leu Leu Gly Asp Leu Gln Ile Ser Gln Asn Leu Tyr Lys Ile Asp Phe
260 265 270

Ser Ile Ala Arg Gly Leu Gly Tyr Tyr Thr Gly Ile Val Tyr Glu Thr
275 280 285

Thr Leu Asn Glu Met Lys Ser Leu Gly Ser Val Cys Ser Gly Gly Arg
290 295 300

Tyr Asp His Leu Thr Lys Asn Phe Ser Lys Glu Asn Leu Gln Gly Val
305 310 315 320

Gly Ala Ser Ile Gly Ile Asp Arg Leu Ile Val Ala Leu Ser Glu Met
325 330 335

Gln Leu Leu Asp Glu Arg Ser Thr Gln Ala Lys Val Leu Ile Ala Cys
340 345 350

Met His Glu Glu Tyr Phe Ser Tyr Ala Asn Arg Leu Ala Glu Ser Leu
355 360 365

Arg Gln Ser Gly Ile Phe Ser Glu Val Tyr Pro Glu Ala Gln Lys Ile
370 375 380

Lys Lys Pro Phe Ser Tyr Ala Asn His Lys Gly His Glu Phe Val Ala
385 390 395 400

Val Ile Gly Glu Glu Glu Phe Lys Ser Glu Thr Leu Ser Leu Lys Asn
405 410 415

Met His Ser Gly Met Gln Leu Asn Cys Leu Ser Phe Leu Lys Ala Leu
420 425 430

Glu Ile Ile Gly Glu Asn Asp Glu Asp Leu
435 440
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TTA GGG AGC GTG TGT TCA GGG GGG CGT TAT GAT CAT TTG ACT AAA AAT 966 
Leu Gly Ser Val Cys Ser Gly Gly Arg Tyr Asp His Leu Thr Lys Asn 
300 305 310 

TTT TCT AAA GAG AAT TTA CAA GGG GTA GGG GCT TCT ATT GGG ATT GAT 1014 
Phe Ser Lys Glu Asn Leu Gln Gly Val Gly Ala Ser Ile Gly Ile Asp
315 320 325

CGA TTG ATT GTG GCT TTG AGT GAA ATG CAA TTA TTA GAC GAG CGC TCC 1062

Arg Leu Ile Val Ala Leu Ser Glu Met Gln Leu Leu Asp Glu Arg Ser
330 335 340

ACC CAA GCC AAA GTT TTA ATC GCT TGC ATG CAT GAA GAG TAT TTT TCT 1110
Thr Gln Ala Lys Val Leu Ile Ala Cys Met His Glu Glu Tyr Phe Ser
345 350 355

TAC GCC AAC CGC TTA GCG GAG TCT TTA AGG CAA AGC GGG ATT TTT AGT 1158
Tyr Ala Asn Arg Leu Ala Glu Ser Leu Arg Gln Ser Gly Ile Phe Ser
360 365 370 375

GAA GTC TAT CCA GAA GCT CAA AAA ATC AAA AAA CCC TTT TCT TAT GCC 1206

Glu Val Tyr Pro Glu Ala Gln Lys Ile Lys Lys Pro Phe Ser Tyr Ala
380 385 390

AAC CAT AAA GGG CAT GAG TTT GTG GCT GTC ATT GGC GAA GAA GAA TTT 1254

Asn His Lys Gly His Glu Phe Val Ala Val Ile Gly Glu Glu Glu Phe
395 400 405

AAA AGC GAA ACT TTA AGC TTG AAA AAC ATG CAT TCA GGC ATG CAG TTG 1302

Lys Ser Glu Thr Leu Ser Leu Lys Asn Met His Ser Gly Met Gln Leu
410 415 420

AAT TGC TTG AGT TTT TTA AAA GCC CTT GAA ATC ATT GGA GAA AAC GAT 1350

Asn Cys Leu Ser Phe Leu Lys Ala Leu Glu Ile Ile Gly Glu Asn Asp
425 430 435

GAA GAC TTA TAATGTCGCT ATTGTTGGGG CCAGTGGGGC GGTAGG 1395

Glu Asp Leu 

440 



(2) INFORMATION FOR SEQ ID NO: 1014: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 442 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014:

Met Ile Thr Pro Lys Val Leu Ser Gly Phe Lys Asp Arg Leu Pro Lys
1 5 10 15






WO 98/43478



PCT/US98/06371 



Arg Asp Val Ala Leu Arg Phe Asp Leu Thr Val Pro Leu Ala Arg Phe
75 80 85

GTC TCT TTG CAC CAC CAA ACG CTA GGC ATG CCC TTT AAA CGC TAC GCT 342
Val Ser Leu His His Gln Thr Leu Gly Met Pro Phe Lys Arg Tyr Ala
90 95 100

ATA GGC AAT GTC TTT AGG GGC GAA AGG GCG CAA AAA GGG CGT TAT AGG 390
Ile Gly Asn Val Phe Arg Gly Glu Arg Ala Gln Lys Gly Arg Tyr Arg
105 110 115

GAA TTT ACG CAA TGC GAT TTT GAT TTT ATA GGG AGC GAG AGT TTG GTG 438
Glu Phe Thr Gln Cys Asp Phe Asp Phe Ile Gly Ser Glu Ser Leu Val
120 125 130 135

TGC GAT GCT GAG ATC ATT CAA GTG ATT GTC GCT TCT TTA AAA GCC CTA 486
Cys Asp Ala Glu Ile Ile Gln Val Ile Val Ala Ser Leu Lys Ala Leu
140 145 150

GAT TTA GAA GAT TTT TGC GTC TCT ATC AAC CAC AGA AAA ATT TTG AAC 534
Asp Leu Glu Asp Phe Cys Val Ser Ile Asn His Arg Lys Ile Leu Asn
155 160 165

GGG ATA TGC GAA TAT TTT GGG ATC TCT CAA GTG AAT GAA GCG TTG CGC 582
Gly Ile Cys Glu Tyr Phe Gly Ile Ser Gln Val Asn Glu Ala Leu Arg
170 175 180

ATT GTG GAT AAA TTG GAA AAA ATT GGC TTG AAT GGG GTT GAA GAA GAA 630
Ile Val Asp Lys Leu Glu Lys Ile Gly Leu Asn Gly Val Glu Glu Glu
185 190 195

TTA AAA AAA GAG TGC GGT TTA AAT TCA AAC ACC ATT AAA GAG CTT TTA 678
Leu Lys Lys Glu Cys Gly Leu Asn Ser Asn Thr Ile Lys Glu Leu Leu
200 205 210 215

GAA TTA ATT CAA ATC AAA CAA AAC GAT TTA AGC CAT GCG GAA TTT TTT 726
Glu Leu Ile Gln Ile Lys Gln Asn Asp Leu Ser His Ala Glu Phe Phe
220 225 230

GAA AAA ATT GCT TAT TTG AAA GAC TAT AAT GAA AAT CTA AAA AAA GGC 774
Glu Lys Ile Ala Tyr Leu Lys Asp Tyr Asn Glu Asn Leu Lys Lys Gly
235 240 245

ATA CAG GAT TTA GAA AGG CTA TAC CAG TTG CTA GGG GAT TTG CAA ATT 822
Ile Gln Asp Leu Glu Arg Leu Tyr Gln Leu Leu Gly Asp Leu Gln Ile
250 255 260

TCT CAA AAC CTG TAT AAA ATT GAT TTT TCT ATC GCT AGG GGA TTA GGG 870
Ser Gln Asn Leu Tyr Lys Ile Asp Phe Ser Ile Ala Arg Gly Leu Gly
265 270 275

TAT TAT ACA GGG ATT GTG TAT GAA ACC ACG CTT AAT GAA ATG AAG TCT 918
Tyr Tyr Thr Gly Ile Val Tyr Glu Thr Thr Leu Asn Glu Met Lys Ser
280 285 290 295
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Ser Ala Thr Cys 

Val Arg Val Pro Val Leu Arg Ser His Ser Glu Ser
245 250 255

Leu Ser Ile Ala Phe Glu Lys Glu Phe Asp Leu Lys Glu Val Tyr Glu
260 265 270

Val Leu Lys Asn Ala Pro Ser Val Ala Val Cys Asp Asp Pro Ser His
275 280 285

Asn Leu Tyr Pro Thr Pro Leu Lys Ala Ser His Thr Asp Ser Val Phe
290 295 300

Ile Gly Arg Leu Arg Lys Asp Leu Phe Asp Lys Lys Thr Leu His Gly
305 310 315 320

Phe Cys Val Ala Asp Gln Leu Arg Val Gly Ala Ala Thr Asn Ala Leu
325 330 335

Lys Ile Ala Leu His Tyr Ile Lys Asn Ala
340 345


(2) INFORMATION FOR SEQ ID NO: 1013: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34...1359
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013: 

TAAAATTTTA GCATACAAAT ACAAGGAAAT GGA ATG ATT ACC CCT AAA GTG TTG 54 

Met Ile Thr Pro Lys Val Leu
1 5

AGC GGG TTT AAA GAC CGC TTG CCT AAA GAT GCG ATA CAA AAA GCC CAG 102
Ser Gly Phe Lys Asp Arg Leu Pro Lys Asp Ala Ile Gln Lys Ala Gln
10 15 20

TTG CTT GCG AAA GTT TCA GTC GTG TTT CAA AGT TTT GGT TTT GTG CCG 150
Leu Leu Ala Lys Val Ser Val Val Phe Gln Ser Phe Gly Phe Val Pro
25 30 35

ATT GAA ACC CCT CAT TTG GAA TAC GCT CAA ACG TTA TTG CCT GAT GCG 198
Ile Glu Thr Pro His Leu Glu Tyr Ala Gln Thr Leu Leu Pro Asp Ala
40 45 50 55

AGC AGT GAT ATT CAA AAA GAA ATT TAT CGT TTT AAA GAC CAT GGG GAT 246

Ser Ser Asp Ile Gln Lys Glu Ile Tyr Arg Phe Lys Asp His Gly Asp
60 65 70

AGA GAT GTG GCT TTA AGG TTT GAT TTG ACT GTG CCA TTA GCC CGC TTT 294
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GAT AAG AAA ACT TTG CAT GGC TTT TGT GTG GCG GAT CAA TTG AGA GTG 1014 
Asp Lys Lys Thr Leu His Gly Phe Cys Val Ala Asp Gln Leu Arg Val
315 320 325

GGG GCA GCC ACC AAC GCA CTC AAA ATC GCT CTG CAT TAC ATT AAG AAC 1062

Gly Ala Ala Thr Asn Ala Leu Lys Ile Ala Leu His Tyr Ile Lys Asn
330 335 340 345

GCT TGAGTTTATT CAAAGATAAC AAAGATGAAT GT 1097

Ala

(2) INFORMATION FOR SEQ ID NO: 1012:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 346 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1012: 



Met Lys Thr Tyr Asn Val Ala Ile Val Gly Ala Ser Gly Ala Val Gly
1 5 10 15

Gln Glu Leu Ile Lys Gly Leu Glu Asn Ser Phe Phe Pro Ile Lys Lys
20 25 30

Phe Val Pro Leu Ala Ser Thr Arg Ser Ala Gly Lys Lys Ile Lys Ala
35 40 45

Phe Asn Lys Asp Tyr Glu Ile Leu Glu Thr Thr His Glu Val Phe Glu
50 55 60

Arg Glu Lys Ile Asp Ile Ala Phe Phe Ser Ala Gly Gly Ser Val Ser
65 70 75 80

Xaa Glu Phe Ala Thr Ser Ala Ser Lys Thr Ala Leu Val Val Asp Asn
85 90 95

Thr Ser Phe Phe Arg Leu Asn Lys Asp Val Pro Leu Val Val Pro Glu
100 105 110

Ile Asn Ala Lys Glu Ile Phe Asn Ala Pro Leu Asn Ile Ile Ala Asn
115 120 125

Pro Asn Cys Ser Thr Ile Gln Met Thr Gln Ile Leu Asn Pro Leu His
130 135 140

Leu His Phe Lys Ile Lys Ser Val Ile Val Ser Thr Tyr Gln Ala Val
145 150 155 160

Ser Gly Ala Gly Asn Lys Gly Ile Glu Ser Leu Lys Asn Glu Leu Lys
165 170 175

Thr Ala Leu Glu Cys Leu Glu Lys Asp Pro Thr Ile Asp Leu Asn Gln
180 185 190

Val Leu Gln Ala Gly Ala Phe Ala Tyr Pro Ile Ala Phe Asn Ala Ile
195 200 205

Ala His Ile Asp Thr Phe Lys Glu Asn Gly Tyr Thr Lys Glu Glu Leu
210 215 220

Lys Met Leu His Glu Thr His Lys Ile Met Gly Val Asp Phe Pro Ile
225 230 235 240
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Thr Ala Leu Val Val Asp Asn Thr Ser Phe Phe Arg Leu Asn Lys Asp 
90 95 100 105 

GTG CCT TTA GTC GTT CCT GAA ATC AAC GCT AAA GAA ATT TTT AAC GCT 390

Val Pro Leu Val Val Pro Glu Ile Asn Ala Lys Glu Ile Phe Asn Ala
110 115 120

CCC TTG AAT ATC ATC GCT AAC CCT AAT TGC TCC ACC ATT CAA ATG ACG 438
Pro Leu Asn Ile Ile Ala Asn Pro Asn Cys Ser Thr Ile Gln Met Thr
125 130 135

CAA ATC TTA AAC CCC TTA CAT CTC CAT TTT AAG ATA AAA AGC GTG ATT 486

Gln Ile Leu Asn Pro Leu His Leu His Phe Lys Ile Lys Ser Val Ile
140 145 150

GTT AGC ACC TAT CAA GCC GTG AGT GGG GCA GGG AAC AAG GGC ATA GAG 534

Val Ser Thr Tyr Gln Ala Val Ser Gly Ala Gly Asn Lys Gly Ile Glu
155 160 165

AGT TTA AAA AAT GAG TTA AAA ACC GCT TTA GAG TGT TTG GAA AAA GAC 582

Ser Leu Lys Asn Glu Leu Lys Thr Ala Leu Glu Cys Leu Glu Lys Asp
170 175 180 185

CCC ACT ATT GAT TTA AAC CAA GTC TTG CAA GCT GGG GCT TTC GCT TAT 630

Pro Thr Ile Asp Leu Asn Gln Val Leu Gln Ala Gly Ala Phe Ala Tyr
190 195 200

CCG ATC GCT TTC AAT GCG ATC GCT CAT ATT GAT ACT TTT AAG GAG AAT 678
Pro Ile Ala Phe Asn Ala Ile Ala His Ile Asp Thr Phe Lys Glu Asn
205 210 215

GGT TAC ACG AAA GAA GAG CTA AAA ATG CTG CAT GAA ACC CAT AAA ATC 726

Gly Tyr Thr Lys Glu Glu Leu Lys Met Leu His Glu Thr His Lys Ile
220 225 230

ATG GGC GTG GAT TTC CCT ATC AGC GCG ACT TGC GTG CGC GTG CCG GTA 774
Met Gly Val Asp Phe Pro Ile Ser Ala Thr Cys Val Arg Val Pro Val
235 240 245

TTG AGG AGC CAT AGC GAG AGT TTG AGT ATC GCT TTT GAA AAA GAA TTC 822
Leu Arg Ser His Ser Glu Ser Leu Ser Ile Ala Phe Glu Lys Glu Phe
250 255 260 265

GAT CTC AAA GAA GTC TAT GAA GTT TTA AAA AAC GCC CCT AGC GTG GCT 870
Asp Leu Lys Glu Val Tyr Glu Val Leu Lys Asn Ala Pro Ser Val Ala
270 275 280

GTT TGC GAT GAT CCC AGT CAT AAT CTT TAC CCC ACG CCC CTA AAA GCG 918
Val Cys Asp Asp Pro Ser His Asn Leu Tyr Pro Thr Pro Leu Lys Ala
285 290 295

AGC CAC ACG GAT AGC GTC TTT ATA GGG CGC TTG AGG AAG GAT TTG TTT 966

Ser His Thr Asp Ser Val Phe Ile Gly Arg Leu Arg Lys Asp Leu Phe
300 305 310 
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He 


Ser Leu Gln Met Arg Val Leu Gln Leu Ala Pro Asp Ala Thr Asp
210 215 220

Val Ala Ser Ala Ile Tyr Ser Gly Ser Tyr Asn Val Gly Ile Gly Ser
225 230 235 240

Gly Ala Leu Phe Gly Ser Ile Val Ile His Gln Leu Gly Leu Gly Tyr
245 250 255

Ile Gly Phe Val Gly Gly Ala Leu Gly Leu Leu Ala Leu Phe Trp Leu
260 265 270

Arg Phe Ile Thr Ile Lys Phe Lys Lys Thr
275 280



(2) INFORMATION FOR SEQ ID NO: 1011: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28... 1065 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1011: 

AAGCCCTTGA AATCATTGGA GAAAACG ATG AAG ACT TAT AAT GTC GCT ATT GTT 54 

Met Lys Thr Tyr Asn Val Ala Ile Val
1 5

GGG GCC AGT GGG GCG GTA GGC CAA GAG CTG ATT AAG GGT TTA GAA AAT 102
Gly Ala Ser Gly Ala Val Gly Gln Glu Leu Ile Lys Gly Leu Glu Asn
10 15 20 25

TCT TTT TTC CCA ATT AAA AAA TTT GTC CCG CTC GCT AGC ACT AGG AGT 150

Ser Phe Phe Pro Ile Lys Lys Phe Val Pro Leu Ala Ser Thr Arg Ser
30 35 40

GCT GGT AAA AAG ATC AAA GCT TTC AAT AAA GAC TAT GAA ATT TTA GAA 198
Ala Gly Lys Lys Ile Lys Ala Phe Asn Lys Asp Tyr Glu Ile Leu Glu
45 50 55

ACC ACG CAT GAA GTT TTT GAA AGA GAA AAA ATA GAC ATC GCC TTT TTT 246

Thr Thr His Glu Val Phe Glu Arg Glu Lys Ile Asp Ile Ala Phe Phe
60 65 70

AGC GCT GGG GGG AGC GTG AGC GMA GAA TTT GCT ACA AGC GCT TCA AAA 294

Ser Ala Gly Gly Ser Val Ser Xaa Glu Phe Ala Thr Ser Ala Ser Lys
75 80 85

ACG GCC TTA GTG GTT GAT AAC ACG AGC TTT TTT AGA TTG AAT AAA GAT 342
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AGC TAT AAT GTG GGG ATT GGA TCA GGA GCG CTG TTT GGC AGT ATT GTG 773
Ser Tyr Asn Val Gly Ile Gly Ser Gly Ala Leu Phe Gly Ser Ile Val
235 240 245

ATC CAC CAA CTA GGG CTA GGA TAT ATT GGC TTT GTG GGT GGG GCT TTA 821
Ile His Gln Leu Gly Leu Gly Tyr Ile Gly Phe Val Gly Gly Ala Leu
250 255 260

GGT TTG TTG GCG CTC TTT TGG CTT AGA TTC ATT ACG ATA AAG TTT AAA 869
Gly Leu Leu Ala Leu Phe Trp Leu Arg Phe Ile Thr Ile Lys Phe Lys
265 270 275 280

AAA ACA TAAAGAGCGT TAAAAGGATT AGCCCAATAA AGGAGAATCC CTTTCGCACT 925
Lys Thr



(2) INFORMATION FOR SEQ ID NO: 1010: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010:

Met Gly Ile Ala Phe Ala His Ser Ile Phe Trp Ser Ile Thr Ala Ser
1 5 10 15

Leu Val Ile Arg Val Ala Pro Arg Asn Lys Lys Gln Gln Ala Leu Gly
20 25 30

Leu Leu Ala Leu Gly Ser Ser Leu Ala Met Ile Leu Gly Leu Pro Leu
35 40 45

Gly Arg Ile Ile Gly Gln Ile Leu Asp Trp Arg Ser Thr Phe Gly Val
50 55 60

Ile Gly Gly Val Ala Thr Leu Ile Ala Leu Leu Met Trp Lys Leu Leu
65 70 75 80

Pro His Leu Pro Ser Arg Asn Ala Gly Thr Leu Ala Ser Val Pro Val
85 90 95

Leu Met Lys Arg Pro Leu Leu Met Gly Ile Tyr Leu Leu Val Ile Met
100 105 110

Val Ile Ser Gly His Phe Thr Thr Tyr Ser Tyr Ile Glu Pro Phe Ile
115 120 125

Ile Gln Ile Ser Gln Phe Ser Pro Asp Ile Thr Thr Leu Met Leu Phe
130 135 140

Val Phe Gly Leu Ala Gly Val Val Gly Ser Phe Leu Phe Gly Arg Leu
145 150 155 160

Tyr Ala Lys Asn Ser Arg Lys Phe Ile Ala Phe Ala Met Val Leu Val
165 170 175

Ile Cys Pro Gln Leu Leu Leu Phe Val Phe Lys Asn Leu Glu Trp Val
180 185 190

Val Phe Leu Gln Ile Phe Leu Trp Gly Ile Gly Ile Thr Ser Leu Gly
195 200 205
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ATT TTT TGG TCC ATC ACG GCT TCT TTA GTC ATT CGT GTC GCG CCA AGA 101
Ile Phe Trp Ser Ile Thr Ala Ser Leu Val Ile Arg Val Ala Pro Arg
10 15 20

AAT AAA AAA CAA CAG GCC TTA GGG CTG TTA GCG TTA GGG AGT TCG TTA 149
Asn Lys Lys Gln Gln Ala Leu Gly Leu Leu Ala Leu Gly Ser Ser Leu
25 30 35 40

GCG ATG ATT TTA GGG TTG CCG CTT GGG AGG ATC ATT GGG CAA ATT CTA 197
Ala Met Ile Leu Gly Leu Pro Leu Gly Arg Ile Ile Gly Gln Ile Leu
45 50 55

GAT TGG CGT TCC ACT TTT GGC GTG ATC GGG GGC GTT GCG ACC CTT ATA 245
Asp Trp Arg Ser Thr Phe Gly Val Ile Gly Gly Val Ala Thr Leu Ile
60 65 70

GCG TTG CTT ATG TGG AAA TTG CTC CCG CAT CTA CCC AGT AGA AAC GCA 293
Ala Leu Leu Met Trp Lys Leu Leu Pro His Leu Pro Ser Arg Asn Ala
75 80 85

GGC ACG CTC GCA AGT GTC CCT GTA TTA ATG AAA CGG CCG CTT TTA ATG 341
Gly Thr Leu Ala Ser Val Pro Val Leu Met Lys Arg Pro Leu Leu Met
90 95 100

GGG ATT TAT TTG CTT GTG ATC ATG GTC ATC TCT GGG CAT TTC ACC ACT 389
Gly Ile Tyr Leu Leu Val Ile Met Val Ile Ser Gly His Phe Thr Thr
105 110 115 120

TAT AGT TAT ATT GAG CCT TTT ATC ATT CAA ATC AGC CAA TTT TCT CCT 437
Tyr Ser Tyr Ile Glu Pro Phe Ile Ile Gln Ile Ser Gln Phe Ser Pro
125 130 135

GAC ATT ACA ACG CTA ATG TTG TTT GTG TTT GGG TTA GCG GGC GTG GTG 485
Asp Ile Thr Thr Leu Met Leu Phe Val Phe Gly Leu Ala Gly Val Val
140 145 150

GGG AGT TTT TTG TTC GGC CGT TTG TAT GCA AAA AAT TCA AGA AAA TTT 533
Gly Ser Phe Leu Phe Gly Arg Leu Tyr Ala Lys Asn Ser Arg Lys Phe
155 160 165

ATC GCT TTT GCG ATG GTT TTA GTC ATT TGC CCG CAA CTC TTG CTT TTT 581
Ile Ala Phe Ala Met Val Leu Val Ile Cys Pro Gln Leu Leu Leu Phe
170 175 180

GTG TTT AAA AAC TTA GAG TGG GTG GTT TTC TTG CAA ATT TTC TTA TGG 629
Val Phe Lys Asn Leu Glu Trp Val Val Phe Leu Gln Ile Phe Leu Trp
185 190 195 200

GGG ATT GGG ATC ACT TCG CTT GGG ATT TCC TTG CAA ATG AGG GTG TTG 677
Gly Ile Gly Ile Thr Ser Leu Gly Ile Ser Leu Gln Met Arg Val Leu
205 210 215

CAG CTT GCG CCG GAT GCC ACG GAT GTT GCG AGT GCG ATT TAC TCA GGG 725
Gln Leu Ala Pro Asp Ala Thr Asp Val Ala Ser Ala Ile Tyr Ser Gly
220 225 230



-1480- 



BNSDOCID: <WO 9843478 A 1 _!_> 



wo 98/43478 



PCT/US98/06371 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008:



Met Ser Tyr Phe Tyr Lys His Cys Leu Lys Phe Ser Leu Val Gly Leu
1 5 10 15

Leu Gly Leu Leu Ser Val Gln Leu Asp Ala Arg Ser Phe Val Asp Gly
20 25 30

Asp Leu Asp Ile Gln Lys Phe Ser Tyr Glu Asp Ser Leu Leu Lys Lys
35 40 45

Gly Asp Pro Asn Gly Val His Lys Val Gln Val Arg Asp Tyr Lys Gly
50 55 60

Lys Met Gln Glu Ala Glu Ile His Ser Glu Ile Arg Ile Ala Leu Lys
65 70 75 80

Pro Gly Val Lys Lys Glu Val Lys Lys Gly Lys Ile Tyr Ser Ala Gln
85 90 95

Ile Asn Asp Gly Met Cys Tyr Ala Phe Arg Met Leu Gln Thr Gly Asp
100 105 110

Asn Thr Thr Gly Leu Asp Ser Lys Glu Phe Pro Lys Gln Ser Arg Glu
115 120 125

Lys Lys Gly Arg Val Ile Thr Leu Ile Gly Lys Gly Glu Val Pro Tyr
130 135 140

Leu Ile Leu Glu Thr Asp Cys Gln Val Gly Asp Ile Ala Lys Ile Ser
145 150 155 160

Leu Val Gly Asn Phe Asp Gly Thr Gly Phe Leu Thr Glu Tyr Lys Phe
165 170 175

Lys Asp Ala Lys Pro Ile Tyr
180

(2) INFORMATION FOR SEQ ID NO: 1009: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 30...875
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009:

GGAATTTTTG GGTGCTACTC CTTTCTCGT ATG GGT ATC GCT TTT GCC CAC TCT 53
Met Gly Ile Ala Phe Ala His Ser
1 5
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AAGGATTTGT TO ATG AGT TAT TTT TAT AAG CAC TGT TTG AAA TTT TCG TTG 51
Met Ser Tyr Phe Tyr Lys His Cys Leu Lys Phe Ser Leu
1 5 10

GTT GGG TTG CTA GGG CTT TTG AGC GTT CAG CTT GAC GCT AGG AGT TTT 99
Val Gly Leu Leu Gly Leu Leu Ser Val Gln Leu Asp Ala Arg Ser Phe
15 20 25

GTT GAT GGG GAT TTA GAC ATT CAG AAA TTC AGC TAT GAA GAT TCT CTA 147
Val Asp Gly Asp Leu Asp Ile Gln Lys Phe Ser Tyr Glu Asp Ser Leu
30 35 40 45

CTT AAA AAG GGA GAC CCT AAT GGC GTG CAT AAA GTG CAG GTG CGA GAT 195
Leu Lys Lys Gly Asp Pro Asn Gly Val His Lys Val Gln Val Arg Asp
50 55 60

TAT AAA GGC AAA ATG CAA GAA GCT GAG ATC CAC TCA GAA ATA CGC ATT 243
Tyr Lys Gly Lys Met Gln Glu Ala Glu Ile His Ser Glu Ile Arg Ile
65 70 75

GCG CTT AAA CCG GGG GTT AAA AAA GAA GTT AAA AAA GGC AAG ATT TAT 291
Ala Leu Lys Pro Gly Val Lys Lys Glu Val Lys Lys Gly Lys Ile Tyr
80 85 90

AGC GCT CAA ATC AAT GAT GGC ATG TGC TAT GCT TTT AGA ATG CTC CAA 339
Ser Ala Gln Ile Asn Asp Gly Met Cys Tyr Ala Phe Arg Met Leu Gln
95 100 105

ACC GGC GAT AAT ACC ACA GGC CTT GAT TCT AAA GAG TTC CCC AAG CAA 387
Thr Gly Asp Asn Thr Thr Gly Leu Asp Ser Lys Glu Phe Pro Lys Gln
110 115 120 125

AGT CGT GAG AAA AAG GGC CGA GTG ATC ACT TTA ATC GGT AAA GGT GAA 435
Ser Arg Glu Lys Lys Gly Arg Val Ile Thr Leu Ile Gly Lys Gly Glu
130 135 140

GTG CCT TAT CTT ATT TTA GAA ACC GAT TGC CAA GTG GGT GAT ATT GCA 483
Val Pro Tyr Leu Ile Leu Glu Thr Asp Cys Gln Val Gly Asp Ile Ala
145 150 155

AAG ATC TCT TTG GTG GGT AAT TTT GAT GGC ACT GGG TTT CTT ACG GAA 531
Lys Ile Ser Leu Val Gly Asn Phe Asp Gly Thr Gly Phe Leu Thr Glu
160 165 170

TAT AAA TTC AAA GAC GCT AAA CCC ATT TAC TAGTCTTTAT TCTTCGCTTC ATT 584
Tyr Lys Phe Lys Asp Ala Lys Pro Ile Tyr
175 180

CTTAA 589

(2) INFORMATION FOR SEQ ID NO: 1008:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006: 

Met Ser Val Ile Leu Glu Thr Lys Gly Leu Lys Lys Thr Tyr Gln Asn
1 5 10 15

His Leu Val Leu Asp Gly Ile Asn Phe Thr Leu Asn Lys Gly Glu Val
20 25 30

Ala Val Ile Leu Gly Pro Ser Gly Cys Gly Lys Ser Thr Phe Leu Lys
35 40 45

Cys Leu Asn Gly Leu Glu Lys Ile Asn Glu Gly Glu Ile Leu Phe Glu
50 55 60

Asn Thr Asn Leu Asn Asn Lys Ala Thr Asn Trp Asn Gln Met Arg Gln
65 70 75 80

Lys Ile Gly Met Val Phe Gln Asn Tyr Glu Leu Phe Pro His Leu Asn
85 90 95

Val Leu Asp Asn Ile Leu Leu Ala Pro Met Lys Val Gln Lys Arg Ser
100 105 110

Lys Asp Glu Val Ile Ser Gln Ala Ile Glu Leu Leu Lys Arg Val Gly
115 120 125

Leu Glu His Lys Gln Gln Ala Tyr Pro Lys Glu Leu Ser Gly Gly Gln
130 135 140

Lys Gln Arg Val Ala Ile Val Arg Ser Leu Cys Met Arg Pro Lys Ile
145 150 155 160

Met Leu Phe Asp Glu Val Thr Ala Ser Leu Asp Pro Glu Met Val Lys
165 170 175

Glu Val Leu Glu Val Ile Leu Glu Leu Ala Thr Thr Gly Met Ser Met
180 185 190

Val Ile Val Thr His Glu Met Lys Phe Ala Gln Lys Ile Ala His Lys
195 200 205

Ile Val Phe Phe Asp Ser Gly Lys Ile Ala Glu Glu Asn Asn Ala Lys
210 215 220

Glu Phe Phe Asn His Pro Lys Ser Gln Arg Ala Gln Lys Phe Leu Glu
225 230 235 240

Thr Phe His Phe Leu Gly Ser Cys
245

(2) INFORMATION FOR SEQ ID NO: 1007: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 13...561
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007: 
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60 65 70 

ACT AAC TGG AAT CAA ATG CGC CAA AAA ATA GGC ATG GTG TTT CAA AAT 294
Thr Asn Trp Asn Gln Met Arg Gln Lys Ile Gly Met Val Phe Gln Asn
75 80 85

TAT GAA TTG TTC CCG CAT TTA AAT GTG TTA GAT AAT ATC TTA CTC GCT 342
Tyr Glu Leu Phe Pro His Leu Asn Val Leu Asp Asn Ile Leu Leu Ala
90 95 100

CCT ATG AAA GTG CAA AAA CGA TCC AAA GAT GAG GTT ATT TCT CAA GCC 390
Pro Met Lys Val Gln Lys Arg Ser Lys Asp Glu Val Ile Ser Gln Ala
105 110 115 120

ATA GAG CTT TTA AAG CGA GTG GGT TTG GAG CAT AAA CAA CAA GCT TAC 438
Ile Glu Leu Leu Lys Arg Val Gly Leu Glu His Lys Gln Gln Ala Tyr
125 130 135

CCT AAA GAA TTG AGC GGC GGA CAA AAA CAA CGA GTA GCG ATC GTG CGC 486
Pro Lys Glu Leu Ser Gly Gly Gln Lys Gln Arg Val Ala Ile Val Arg
140 145 150

TCT TTA TGC ATG CGA CCA AAA ATC ATG CTT TTT GAT GAA GTA ACC GCC 534
Ser Leu Cys Met Arg Pro Lys Ile Met Leu Phe Asp Glu Val Thr Ala
155 160 165

TCT TTA GAC CCT GAA ATG GTT AAA GAA GTT TTA GAA GTG ATT TTA GAA 582
Ser Leu Asp Pro Glu Met Val Lys Glu Val Leu Glu Val Ile Leu Glu
170 175 180

TTA GCC ACA ACA GGC ATG AGC ATG GTG ATT GTA ACG CAT GAA ATG AAA 630
Leu Ala Thr Thr Gly Met Ser Met Val Ile Val Thr His Glu Met Lys
185 190 195 200

TTC GCG CAA AAA ATC GCT CAT AAA ATC GTG TTT TTT GAT AGC GGT AAA 678
Phe Ala Gln Lys Ile Ala His Lys Ile Val Phe Phe Asp Ser Gly Lys
205 210 215

ATC GCT GAA GAA AAC AAC GCT AAA GAA TTT TTT AAC CAC CCG AAA TCT 726
Ile Ala Glu Glu Asn Asn Ala Lys Glu Phe Phe Asn His Pro Lys Ser
220 225 230

CAA AGA GCG CAA AAA TTT TTA GAA ACT TTC CAT TTT TTA GGG AGC TGT T 775
Gln Arg Ala Gln Lys Phe Leu Glu Thr Phe His Phe Leu Gly Ser Cys
235 240 245

AAATAAAGTT TGCTAAAAAG ATGATTCTAA TT 807

(2) INFORMATION FOR SEQ ID NO: 1006:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 248 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
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85 90 95

Trp Gly Gly Ala Glu Met Met Asp Leu Thr Arg Gly Val Leu Thr Ser
100 105 110

Val Ser Lys His Gln Ile Glu Ser Ala Leu Ala Leu Gly Leu Asp Ser
115 120 125

Lys Lys Val Ile Phe Asn Ile Ile Phe Pro Gln Ser Phe Leu Ser Leu
130 135 140

Leu Pro Ser Ser Leu Asn Leu Phe Thr Arg Met Ile Lys Thr Thr Ala
145 150 155 160

Leu Val Ser Leu Ile Gly Ala Ile Asp Leu Leu Lys Val Gly Gln Gln
165 170 175

Ile Ile Glu Leu Asn Leu Leu Arg Met Pro Asn Ala Ser Phe Val Val
180 185 190

Tyr Gly Val Ile Leu Met Phe Tyr Phe Ser Leu Cys Tyr Ser Leu Ser
195 200 205

Leu Tyr Ser Ser Tyr Leu Glu Lys Lys Phe Gln His Ile Arg Gly
210 215 220



(2) INFORMATION FOR SEQ ID NO: 1005:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 807 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...774
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005:

AAAAAAAATT CCAACACATT AGAGGGTAAA ATG AGC GTG ATT TTA GAA ACC AAA 54
Met Ser Val Ile Leu Glu Thr Lys
1 5

GGG TTA AAA AAA ACC TAT CAA AAC CAT TTG GTT TTA GAC GGC ATC AAT 102
Gly Leu Lys Lys Thr Tyr Gln Asn His Leu Val Leu Asp Gly Ile Asn
10 15 20

TTC ACT TTA AAT AAG GGT GAA GTG GCA GTG ATT TTA GGG CCT AGC GGG 150
Phe Thr Leu Asn Lys Gly Glu Val Ala Val Ile Leu Gly Pro Ser Gly
25 30 35 40

TGC GGG AAA AGC ACT TTT TTA AAA TGC CTA AAC GGG CTT GAA AAG ATT 198
Cys Gly Lys Ser Thr Phe Leu Lys Cys Leu Asn Gly Leu Glu Lys Ile
45 50 55

AAT GAA GGT GAA ATC CTT TTT GAA AAC ACT AAC CTT AAC AAT AAG GCC 246
Asn Glu Gly Glu Ile Leu Phe Glu Asn Thr Asn Leu Asn Asn Lys Ala
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105 110 115 120 

GCT CTG GCT TTA GGC TTA GAT TCA AAA AAG GTG ATT TTT AAT ATT ATT 438
Ala Leu Ala Leu Gly Leu Asp Ser Lys Lys Val Ile Phe Asn Ile Ile
125 130 135

TTC CCT CAA AGC TTT TTG TCT TTA TTG CCC TCA AGC CTT AAT TTG TTC 486
Phe Pro Gln Ser Phe Leu Ser Leu Leu Pro Ser Ser Leu Asn Leu Phe
140 145 150

ACG CGC ATG ATC AAA ACC ACG GCT TTA GTT TCT CTC ATT GGA GCG ATT 534
Thr Arg Met Ile Lys Thr Thr Ala Leu Val Ser Leu Ile Gly Ala Ile
155 160 165

GAT TTG CTA AAA GTG GGC CAG CAA ATC ATA GAG CTT AAC CTC TTA CGC 582
Asp Leu Leu Lys Val Gly Gln Gln Ile Ile Glu Leu Asn Leu Leu Arg
170 175 180

ATG CCT AAT GCG AGC TTT GTG GTT TAT GGC GTT ATC TTA ATG TTT TAT 630
Met Pro Asn Ala Ser Phe Val Val Tyr Gly Val Ile Leu Met Phe Tyr
185 190 195 200

TTT AGT TTA TGC TAT AGT TTG AGC CTG TAT AGT TCC TAT TTA GAA AAA 678
Phe Ser Leu Cys Tyr Ser Leu Ser Leu Tyr Ser Ser Tyr Leu Glu Lys
205 210 215

AAA TTC CAA CAC ATT AGA GGG TAAAATGAGC GTGATTTTAG AAACCAAAGG GTTA 733
Lys Phe Gln His Ile Arg Gly
220

AAAA 737
(2) INFORMATION FOR SEQ ID NO: 1004:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004: 

Met Gly Val Leu Leu Glu Leu Asp Asn Leu Lys Arg Leu Leu Glu Gly
1 5 10 15

Phe Glu Thr Thr Leu Leu Ile Ala Leu Ser Ser Ala Met Ile Ser Ile
20 25 30

Ile Val Gly Met Leu Leu Gly Ser Leu Met Ala Phe Gly Ser Gln Ile
35 40 45

Val Val Leu Ala Cys Arg Val Tyr Leu Glu Ser Ile Arg Ile Ile Pro
50 55 60

Leu Leu Ala Trp Leu Phe Ile Val Tyr Phe Gly Leu Ala Ser Trp Phe
65 70 75 80

Asp Leu His Ile Ser Ala Val Leu Ala Ser Val Ile Val Phe Ser Leu






180 185 190 

Thr Tyr Leu Ile Ala Leu Leu Pro Leu Ser Val Leu Phe Val Ile Leu
195 200 205

Glu Arg Phe Phe Lys Lys Lys Val Ala
210 215



(2) INFORMATION FOR SEQ ID NO: 1003: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31...699
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003: 
AGCGTTTCTT TAAAAAGAAA GTGGCTTAAA ATG GGA GTT TTA CTA GAA TTA GAC 54
Met Gly Val Leu Leu Glu Leu Asp
1 5

AAC CTT AAG CGT TTG TTA GAA GGG TTT GAA ACC ACT CTT TTG ATC GCT 102
Asn Leu Lys Arg Leu Leu Glu Gly Phe Glu Thr Thr Leu Leu Ile Ala
10 15 20

CTT AGC TCT GCA ATG ATT TCA ATC ATT GTT GGA ATG CTT TTG GGG AGC 150
Leu Ser Ser Ala Met Ile Ser Ile Ile Val Gly Met Leu Leu Gly Ser
25 30 35 40

TTG ATG GCG TTT GGT TCT CAA ATA GTG GTT TTG GCG TGT CGT GTG TAT 198
Leu Met Ala Phe Gly Ser Gln Ile Val Val Leu Ala Cys Arg Val Tyr
45 50 55

TTA GAA AGC ATT CGC ATC ATC CCG CTT TTA GCA TGG CTT TTT ATT GTG 246
Leu Glu Ser Ile Arg Ile Ile Pro Leu Leu Ala Trp Leu Phe Ile Val
60 65 70

TAT TTC GGG TTA GCG AGC TGG TTT GAT TTG CAT ATT AGC GCG GTT TTG 294
Tyr Phe Gly Leu Ala Ser Trp Phe Asp Leu His Ile Ser Ala Val Leu
75 80 85

GCA AGC GTT ATT GTT TTT AGC TTG TGG GGT GGC GCT GAA ATG ATG GAT 342
Ala Ser Val Ile Val Phe Ser Leu Trp Gly Gly Ala Glu Met Met Asp
90 95 100

TTA ACT AGG GGG GTT TTA ACT TCC GTG AGC AAA CAC CAA ATA GAA AGC 390
Leu Thr Arg Gly Val Leu Thr Ser Val Ser Lys His Gln Ile Glu Ser
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140 145 150 155 

AAA GAA ACT TCG GTG GTG GGC GCG ATA GCC CTA ACC GAT ATT ATG TTT 532
Lys Glu Thr Ser Val Val Gly Ala Ile Ala Leu Thr Asp Ile Met Phe
160 165 170

GTG GCG AAA GAT TTT ATT GGC ATT TAT TAT AAA ACG ACT GAA AGC CTT 580
Val Ala Lys Asp Phe Ile Gly Ile Tyr Tyr Lys Thr Thr Glu Ser Leu
175 180 185

TTG ATG TTA AGC CTC ACT TAT TTG ATC GCT TTA CTC CCT TTA AGC GTT 628
Leu Met Leu Ser Leu Thr Tyr Leu Ile Ala Leu Leu Pro Leu Ser Val
190 195 200

TTG TTT GTG ATC TTA GAG CGT TTC TTT AAA AAG AAA GTG GCT TAAAATGGG 679
Leu Phe Val Ile Leu Glu Arg Phe Phe Lys Lys Lys Val Ala
205 210 215

AGTTTTACTA GAATTAGACA ACCTT 704

(2) INFORMATION FOR SEQ ID NO: 1002:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002: 

Met Ala Leu Asp Trp Asp Phe Met Phe His Ser Ile Pro Ala Phe Phe
1 5 10 15

Lys Gly Leu Glu Leu Thr Leu Tyr Ile Ser Phe Phe Gly Ile Leu Leu
20 25 30

Ser Leu Leu Val Gly Phe Leu Cys Ala Ile Val Leu Tyr Phe Lys Thr
35 40 45

Arg Phe Leu Ser Pro Val Val Tyr Ile Tyr Gly Glu Ile Ala Arg Asn
50 55 60

Thr Pro Leu Leu Ile Gln Leu Phe Phe Leu Tyr Tyr Gly Leu Asn Glu
65 70 75 80

Ile Gly Leu Ser Ala Leu Glu Cys Ala Ile Leu Ala Leu Gly Phe Leu
85 90 95

Gly Gly Gly Tyr Met Ser Gln Ser Phe Leu Leu Gly Phe Lys Ser Leu
100 105 110

Ala Ser Ile Gln Arg Glu Ser Ala Leu Ser Leu Gly Phe Ser Pro Leu
115 120 125

Lys Met Met Tyr Tyr Ile Ile Leu Pro Gln Ser Leu Ser Val Ser Met
130 135 140

Pro Ser Ile Gly Ala Asn Val Ile Phe Leu Leu Lys Glu Thr Ser Val
145 150 155 160

Val Gly Ala Ile Ala Leu Thr Asp Ile Met Phe Val Ala Lys Asp Phe
165 170 175

Ile Gly Ile Tyr Tyr Lys Thr Thr Glu Ser Leu Leu Met Leu Ser Leu




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 20...670
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001: 

ATTTTTAAGG TATAGCGTT ATG GCA TTA GAT TGG GAT TTT ATG TTT CAC TCC 52
Met Ala Leu Asp Trp Asp Phe Met Phe His Ser
1 5 10

ATC CCT GCG TTT TTT AAG GGG TTA GAA CTC ACG CTT TAT ATT TCT TTC 100
Ile Pro Ala Phe Phe Lys Gly Leu Glu Leu Thr Leu Tyr Ile Ser Phe
15 20 25

TTT GGG ATT TTG CTC TCT CTT TTG GTG GGG TTT TTG TGC GCG ATC GTT 148
Phe Gly Ile Leu Leu Ser Leu Leu Val Gly Phe Leu Cys Ala Ile Val
30 35 40

TTG TAT TTT AAA ACG CGC TTT CTC TCT CCT GTT GTC TAT ATC TAT GGC 196
Leu Tyr Phe Lys Thr Arg Phe Leu Ser Pro Val Val Tyr Ile Tyr Gly
45 50 55

GAA ATC GCT AGG AAC ACG CCC CTG CTC ATC CAG CTT TTC TTT TTG TAT 244
Glu Ile Ala Arg Asn Thr Pro Leu Leu Ile Gln Leu Phe Phe Leu Tyr
60 65 70 75

TAC GGG TTG AAT GAA ATC GGT TTG AGC GCT TTA GAG TGC GCG ATT TTA 292
Tyr Gly Leu Asn Glu Ile Gly Leu Ser Ala Leu Glu Cys Ala Ile Leu
80 85 90

GCG TTA GGG TTT TTG GGT GGG GGG TAT ATG AGT CAA AGT TTT TTG CTT 340
Ala Leu Gly Phe Leu Gly Gly Gly Tyr Met Ser Gln Ser Phe Leu Leu
95 100 105

GGG TTT AAG AGC CTA GCT TCC ATT CAA AGA GAA AGC GCT TTG AGT TTG 388
Gly Phe Lys Ser Leu Ala Ser Ile Gln Arg Glu Ser Ala Leu Ser Leu
110 115 120

GGG TTT AGC CCT TTG AAA ATG ATG TAT TAT ATT ATT CTG CCT CAA AGT 436
Gly Phe Ser Pro Leu Lys Met Met Tyr Tyr Ile Ile Leu Pro Gln Ser
125 130 135

TTA AGC GTT TCT ATG CCT TCC ATA GGG GCG AAT GTG ATT TTT TTA CTC 484
Leu Ser Val Ser Met Pro Ser Ile Gly Ala Asn Val Ile Phe Leu Leu
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1000: 

Met Asn Gln Glu Ile Leu Asp Val Leu Ile Val Gly Ala Gly Pro Gly
1 5 10 15

Gly Ile Ala Thr Ala Val Glu Cys Glu Ile Ala Gly Val Lys Lys Val
20 25 30

Leu Leu Cys Glu Lys Thr Glu Ser His Ser Gly Met Leu Glu Lys Phe
35 40 45

Tyr Lys Ala Gly Lys Arg Ile Asp Lys Asp Tyr Lys Lys Gln Val Val
50 55 60

Glu Leu Lys Gly His Ile Pro Phe Lys Asp Ser Phe Lys Glu Glu Thr
65 70 75 80

Leu Glu Asn Phe Thr Asn Leu Leu Lys Glu His His Ile Thr Pro Ser
85 90 95

Tyr Lys Thr Asp Ile Glu Ser Val Lys Lys Glu Gly Glu Tyr Phe Lys
100 105 110

Ile Thr Thr Thr Ser Asn Thr Thr Tyr His Ala Lys Phe Val Val Val
115 120 125

Ala Ile Gly Lys Met Gly Gln Pro Asn Arg Pro Thr Ala Tyr Lys Ile
130 135 140

Pro Val Ala Leu Ser Lys Gln Val Val Phe Ser Ile Asn Asp Cys Lys
145 150 155 160

Glu Asn Glu Lys Thr Leu Val Ile Gly Gly Gly Asn Ser Ala Val Glu
165 170 175

Tyr Ala Ile Ala Leu Cys Lys Thr Thr Pro Thr Thr Leu Asn Tyr Arg
180 185 190

Lys Lys Glu Phe Ser Arg Ile Asn Glu Asp Asn Ala Lys Asn Leu Gln
195 200 205

Glu Val Leu Asn Asn Asn Thr Leu Lys Ser Lys Leu Gly Val Asp Ile
210 215 220

Glu Ser Leu Glu Glu Asp Asn Thr Gln Ile Lys Val Asn Phe Thr Asp
225 230 235 240

Asn Thr Ser Glu Ser Phe Asp Arg Leu Leu Tyr Ala Ile Gly Gly Ser
245 250 255

Thr Pro Leu Glu Phe Phe Lys Arg Cys Ser Leu Glu Leu Asp Pro Ser
260 265 270

Thr Asn Ile Pro Val Val Lys Glu Asn Leu Glu Ser Asn Asn Ile Pro
275 280 285

Asn Leu Phe Ile Val Gly Asp Ile Leu Phe Lys Ser Gly Ala Ser Ile
290 295 300

Ala Thr Ala Leu Asn His Gly Tyr Asp Val Ala Ile Glu Ile Ala Lys
305 310 315 320

Arg Leu His Ser



(2) INFORMATION FOR SEQ ID NO: 1001: 
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CAT GCT AAA TTC GTG GTG GTT GCG ATC GGG AAA ATG GGC CAG CCA AAC 435
His Ala Lys Phe Val Val Val Ala Ile Gly Lys Met Gly Gln Pro Asn
125 130 135

CGC CCT ACT GCT TAT AAA ATC CCT GTT GCG CTC TCT AAA CAA GTG GTT 483
Arg Pro Thr Ala Tyr Lys Ile Pro Val Ala Leu Ser Lys Gln Val Val
140 145 150

TTT AGC ATC AAT GAT TGT AAG GAA AAT GAA AAA ACC CTT GTG ATC GGC 531
Phe Ser Ile Asn Asp Cys Lys Glu Asn Glu Lys Thr Leu Val Ile Gly
155 160 165

GGA GGC AAC TCA GCG GTG GAA TAC GCC ATT GCT TTG TGC AAA ACC ACC 579
Gly Gly Asn Ser Ala Val Glu Tyr Ala Ile Ala Leu Cys Lys Thr Thr
170 175 180 185

CCT ACC ACC CTC AAT TAC CGC AAA AAA GAA TTC AGC CGC ATC AAT GAA 627
Pro Thr Thr Leu Asn Tyr Arg Lys Lys Glu Phe Ser Arg Ile Asn Glu
190 195 200

GAC AAC GCT AAA AAC TTG CAA GAA GTC CTA AAC AAT AAC ACG CTT AAA 675
Asp Asn Ala Lys Asn Leu Gln Glu Val Leu Asn Asn Asn Thr Leu Lys
205 210 215

AGC AAG CTT GGA GTG GAT ATT GAA AGC CTA GAA GAA GAT AAC ACT CAG 723
Ser Lys Leu Gly Val Asp Ile Glu Ser Leu Glu Glu Asp Asn Thr Gln
220 225 230

ATT AAG GTT AAC TTC ACC GAT AAC ACG AGC GAA AGT TTT GAT CGT TTG 771
Ile Lys Val Asn Phe Thr Asp Asn Thr Ser Glu Ser Phe Asp Arg Leu
235 240 245

CTG TAT GCG ATC GGC GGC TCT ACC CCT TTA GAG TTT TTT AAA CGC TGT 819
Leu Tyr Ala Ile Gly Gly Ser Thr Pro Leu Glu Phe Phe Lys Arg Cys
250 255 260 265

TCT TTA GAG CTG GAT CCT AGC ACC AAT ATC CCT GTG GTG AAA GAA AAT 867
Ser Leu Glu Leu Asp Pro Ser Thr Asn Ile Pro Val Val Lys Glu Asn
270 275 280

TTA GAG AGC AAC AAT ATC CCT AAT TTG TTC ATC GTG GGC GAT ATT TTA 915
Leu Glu Ser Asn Asn Ile Pro Asn Leu Phe Ile Val Gly Asp Ile Leu
285 290 295

TTC AAA TCA GGG GCG AGC ATC GCT ACC GCT TTA AAC CAT GGC TAT GAT 963
Phe Lys Ser Gly Ala Ser Ile Ala Thr Ala Leu Asn His Gly Tyr Asp
300 305 310

GTT GCT ATA GAA ATC GCT AAA AGG TTG CAC TCT TAAAGCCGCT CACTCATCAA 1016
Val Ala Ile Glu Ile Ala Lys Arg Leu His Ser
315 320

ACGGCTTAGC CTTATACAAA AA 1038

(2) INFORMATION FOR SEQ ID NO: 1000:
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Asp Lys Gly Glu Gln Arg Gln Lys Glu Ser Glu Leu Ile Lys Ala Phe
115 120 125

Asn Leu Pro Leu Ser Leu Ile Glu Arg Thr Gln Asp
130 135 140

(2) INFORMATION FOR SEQ ID NO: 999: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25...996
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999: 

AAAATTAAAA TAAAAAGGAT AGAA ATG AAT CAA GAA ATT TTA GAC GTG TTG 51
Met Asn Gln Glu Ile Leu Asp Val Leu
1 5

ATA GTG GGT GCA GGG CCT GGG GGC ATT GCC ACG GCC GTA GAA TGC GAA 99
Ile Val Gly Ala Gly Pro Gly Gly Ile Ala Thr Ala Val Glu Cys Glu
10 15 20 25

ATA GCC GGC GTT AAA AAA GTG CTT TTA TGC GAA AAA ACC GAA AGC CAT 147
Ile Ala Gly Val Lys Lys Val Leu Leu Cys Glu Lys Thr Glu Ser His
30 35 40

TCA GGC ATG TTA GAG AAG TTT TAT AAA GCC GGT AAA AGG ATT GAT AAA 195
Ser Gly Met Leu Glu Lys Phe Tyr Lys Ala Gly Lys Arg Ile Asp Lys
45 50 55

GAT TAT AAA AAG CAA GTC GTA GAG CTT AAA GGG CAT ATC CCT TTT AAA 243
Asp Tyr Lys Lys Gln Val Val Glu Leu Lys Gly His Ile Pro Phe Lys
60 65 70

GAC AGC TTT AAA GAA GAA ACT TTA GAG AAT TTC ACT AAC CTT TTA AAA 291
Asp Ser Phe Lys Glu Glu Thr Leu Glu Asn Phe Thr Asn Leu Leu Lys
75 80 85

GAG CAT CAC ATC ACG CCA AGC TAT AAA ACC GAT ATT GAG AGC GTG AAA 339
Glu His His Ile Thr Pro Ser Tyr Lys Thr Asp Ile Glu Ser Val Lys
90 95 100 105

AAA GAG GGC GAA TAC TTT AAA ATC ACC ACC ACT TCT AAC ACA ACC TAT 387
Lys Glu Gly Glu Tyr Phe Lys Ile Thr Thr Thr Ser Asn Thr Thr Tyr
110 115 120
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Ile Arg Glu Ile Asn Lys Asp Leu Arg Gly Cys Asp Tyr Ala Thr Asp
40 45 50 55

GTT TTG AGC TTC CCT TTA GAA GCC ATT CCT CAC ACC CCT TTA GGG AGC 244
Val Leu Ser Phe Pro Leu Glu Ala Ile Pro His Thr Pro Leu Gly Ser
60 65 70

GTG GTG ATT AAT GCG CCA TTA GCT CAA ACT AAC GCT CTG AAA TTA GGA 292
Val Val Ile Asn Ala Pro Leu Ala Gln Thr Asn Ala Leu Lys Leu Gly
75 80 85

CAT AGC TTA GAA AAT GAG ATC GCT CTT TTA TTC ATT CAT GGG GTG TTG 340
His Ser Leu Glu Asn Glu Ile Ala Leu Leu Phe Ile His Gly Val Leu
90 95 100

CAT TTG TTG GGC TAT GAC CAT GAA AAA GAT AAG GGC GAA CAA CGC CAA 388
His Leu Leu Gly Tyr Asp His Glu Lys Asp Lys Gly Glu Gln Arg Gln
105 110 115

AAA GAG AGC GAA CTC ATT AAA GCG TTT AAC TTG CCT TTG AGT TTG ATT 436
Lys Glu Ser Glu Leu Ile Lys Ala Phe Asn Leu Pro Leu Ser Leu Ile
120 125 130 135

GAA CGC ACA CAG GAT TAGGTTTAGA TACTCTACTA ATGCTGACAA ATAAAGCTTT T 492
Glu Arg Thr Gln Asp
140

AATTTTTAAG AATGGAA 509

(2) INFORMATION FOR SEQ ID NO: 998:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 140 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998: 

Met Leu Glu Ile Asp Asn Gln Thr Pro Leu Glu Ser Asp Phe Leu Leu
1 5 10 15

Leu Glu Lys Ile Ala Asn Val Leu Ala Pro Thr Gln Ile Ile Glu Leu
20 25 30

Val Leu Val Ser Asp Glu Thr Ile Arg Glu Ile Asn Lys Asp Leu Arg
35 40 45

Gly Cys Asp Tyr Ala Thr Asp Val Leu Ser Phe Pro Leu Glu Ala Ile
50 55 60

Pro His Thr Pro Leu Gly Ser Val Val Ile Asn Ala Pro Leu Ala Gln
65 70 75 80

Thr Asn Ala Leu Lys Leu Gly His Ser Leu Glu Asn Glu Ile Ala Leu
85 90 95

Leu Phe Ile His Gly Val Leu His Leu Leu Gly Tyr Asp His Glu Lys
100 105 110
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Thr Glu Ser Ile Met Val Met Pro Tyr Pro Lys Asp Leu Ala Gln Asp
705 710 715 720

Glu Lys Leu Glu His Glu Phe Glu Val Ile Lys Asp Cys Ile Val Ser
725 730 735

Leu Arg Arg Leu Lys Ile Met Leu Glu Thr Pro Pro Ile Val Leu Lys
740 745 750

Glu Ala Ser Val Gly Leu Arg Glu Ala Ile Glu Asn Thr Glu Arg Leu
755 760 765

Gln Thr Tyr Ala Gln Lys Leu Ala Arg Leu Glu Lys Val Ser Val Ile
770 775 780

Ser Ser Lys Pro Leu Lys Ser Val Ser Asp Val Gly Glu Phe Cys Gln
785 790 795 800

Thr Tyr Ala Asn Leu Glu Asn Leu Asp Leu Ser Pro Leu Val Ala Arg
805 810 815

Leu Lys Lys Gln Leu Glu Lys Leu Glu Lys Glu Lys Leu Lys Leu Asn
820 825 830

Leu His Asn Glu Asn Phe Val Lys Asn Ala Pro Lys Ser Val Leu Glu
835 840 845

Lys Ala Lys Glu Ser Leu Lys Thr Leu Leu Glu Lys Glu Ser Lys Ile
850 855 860

Lys Gln Glu Leu Asp Leu Leu Glu Gln Pro
865 870



(2) INFORMATION FOR SEQ ID NO: 997:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 509 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 32...451
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997:

GGGCTTTTTT AAACTTCTTC TTATATCTTT A ATG CTA GAA ATA GAC AAC CAA 52
Met Leu Glu Ile Asp Asn Gln
1 5

ACC CCG CTA GAA TCA GAC TTT TTA TTA TTA GAA AAA ATC GCA AAT GTT 100
Thr Pro Leu Glu Ser Asp Phe Leu Leu Leu Glu Lys Ile Ala Asn Val
10 15 20

TTA GCC CCC ACT CAA ATC ATT GAG CTT GTT TTG GTG AGC GAT GAA ACC 148
Leu Ala Pro Thr Gln Ile Ile Glu Leu Val Leu Val Ser Asp Glu Thr
25 30 35

ATT CGA GAA ATC AAC AAG GAT TTA AGG GGT TGC GAT TAC GCT ACC GAT 196
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260 265 270

Gly Thr Gly Cys Val Lys Val Thr Pro Gly His Asp Phe Asn Asp Tyr
275 280 285

Glu Val Gly Lys Arg His His Leu Glu Thr Ile Lys Ile Phe Asp Glu
290 295 300

Lys Gly Ile Leu Asn Ala His Cys Gly Glu Phe Glu Asn Leu Glu Arg
305 310 315 320

Leu Glu Ala Arg Asp Lys Val Val Glu Arg Leu Lys Glu Asn Ala Leu
325 330 335

Leu Glu Lys Ile Glu Glu His Thr His Gln Val Gly His Cys Tyr Arg
340 345 350

Cys His Asn Val Val Glu Pro Tyr Val Ser Lys Gln Trp Phe Val Lys
355 360 365

Pro Glu Ile Ala Gln Ser Ser Ile Glu Lys Ile Gln Gln Gly Leu Ala
370 375 380

Arg Phe Tyr Pro Ser Asn Trp Ile Asn Asn Tyr Asn Ala Trp Met Arg
385 390 395 400

Glu Leu Arg Pro Trp Cys Ile Ser Arg Gln Leu Phe Trp Gly His Gln
405 410 415

Ile Pro Val Phe Thr Cys Glu Asn Asn His Gln Phe Val Ser Leu Asp
420 425 430

Thr Pro Leu Ser Cys Pro Thr Cys Lys Ser Glu Thr Leu Glu Gln Asp
435 440 445

Lys Asp Val Leu Asp Thr Trp Phe Ser Ser Gly Leu Trp Ala Phe Ser
450 455 460

Thr Leu Gly Trp Gly Gln Glu Lys Ser Gly Leu Phe Asn Glu Ser Asp
465 470 475 480

Leu Lys Asp Phe Tyr Pro Asn Thr Thr Leu Ile Thr Gly Phe Asp Ile
485 490 495

Leu Phe Phe Trp Val Ala Arg Met Leu Phe Cys Ser Glu Ser Leu Leu
500 505 510

Gly Glu Leu Pro Phe Lys Asp Ile Tyr Leu His Ala Leu Val Arg Asp
515 520 525

Glu Lys Gly Glu Lys Met Ser Lys Ser Lys Gly Asn Val Ile Asp Pro
530 535 540

Leu Glu Met Ile Glu Lys Tyr Gly Ala Asp Ser Leu Arg Phe Thr Leu
545 550 555 560

Ala Asn Leu Cys Ala Thr Gly Arg Asp Ile Lys Leu Ser Thr Thr His
565 570 575

Leu Glu Asn Asn Lys Asn Phe Ala Asn Lys Leu Phe Asn Ala Ala Ser
580 585 590

Tyr Leu Lys Leu Lys Gln Glu Ser Phe Lys Asp Lys Glu Arg Leu Asn
595 600 605

Glu Tyr Gln Thr Pro Leu Gly Arg Tyr Ala Lys Ser Arg Leu Asn Ser
610 615 620

Ala Thr Lys Glu Ala Arg Asn Ala Leu Asp Asn Tyr Arg Phe Asn Asp
625 630 635 640

Ala Thr Thr Leu Leu Tyr Arg Phe Leu Trp Gly Glu Phe Cys Asp Trp
645 650 655

Phe Ile Glu Phe Ser Lys Val Glu Asn Glu Ala Ile Asp Glu Leu Gly
660 665 670

Ser Val Leu Lys Glu Ala Leu Lys Leu Leu His Pro Phe Met Pro Phe
675 680 685

Ile Ser Glu Ser Leu Tyr His Lys Leu Ser Asn Thr Glu Leu Glu Asn
690 695 700
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TTA GAA AAA GAA AGT AAA ATT AAG CAA GAA TTG GAC TTG TTA GAA CAA 2 64 6 

Leu Glu Lys Glu Ser Lys Ile Lys Gln Glu Leu Asp Leu Leu Glu Gln
860 865 870

CCA TAATAAAAGG ATAGAAAATG TTTCAAGCGT TAAGCGATGG GTTTAAAAAC GCGCTC 2705
Pro 



AATAAAATCC G 2716 
(2) INFORMATION FOR SEQ ID NO: 996: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 874 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996: 

Met Ile Met Lys Gln Glu Pro Thr Thr Tyr Gln Pro Glu Glu Ile Glu
1 5 10 15

Lys Lys Ile Tyr Glu Ile Cys Ser His Arg Gly Tyr Phe Glu Ile Asp
20 25 30

Gly Asn Glu Ala Ile Gln Glu Lys Asn Lys Arg Phe Cys Leu Met Met
35 40 45

Pro Pro Pro Asn Val Thr Gly Val Leu His Ile Gly His Ala Leu Thr
50 55 60

Leu Ser Leu Gln Asp Ile Leu Ala Arg Tyr Lys Arg Met Asp Gly Tyr
65 70 75 80

Lys Thr Leu Tyr Gln Pro Gly Leu Asp His Ala Gly Ile Ala Thr Gln
85 90 95

Asn Val Val Glu Lys Gln Leu Leu Ser Gln Gly Ile Lys Lys Glu Asp
100 105 110

Leu Gly Arg Glu Glu Phe Ile Lys Lys Val Trp Glu Trp Lys Glu Lys
115 120 125

Ser Gly Gly Ala Ile Leu Glu Gln Met Lys Arg Leu Gly Val Ser Ala
130 135 140

Ala Phe Ser Arg Thr Arg Phe Thr Met Asp Lys Gly Leu Gln Arg Ala
145 150 155 160

Val Lys Leu Ala Phe Leu Lys Trp Tyr Glu Lys Gly Leu Ile Ile Gln
165 170 175

Asp Asn Tyr Met Val Asn Trp Cys Thr Lys Asp Gly Ala Leu Ser Asp
180 185 190

Ile Glu Val Glu Tyr Glu Glu Arg Lys Gly Ala Leu Tyr Tyr Ile Arg
195 200 205

Tyr Tyr Leu Glu Asn Gln Lys Asp Tyr Leu Val Val Ala Thr Thr Arg
210 215 220

Pro Glu Thr Leu Phe Gly Asp Ser Ala Leu Met Val Asn Pro Asn Asp
225 230 235 240

Glu Arg Tyr Lys His Leu Val Gly Gln Lys Ala Ile Leu Pro Leu Ile
245 250 255

His Arg Thr Ile Pro Ile Ile Ala Asp Glu His Val Glu Met Glu Phe
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GAT AAT TAT CGT TTT AAT GAC GCC ACG ACT TTG TTA TAG CGC TTT TTG 1974 
Asp Asn Tyr Arg Phe Asn Asp Ala Thr Thr Leu Leu Tyr Arg Phe Leu 
635 640 645 

TGG GGG GAA TTT TGC GAC TGG TTC ATT GAA TTT TCT AAA GTG GAA AAT 2022
Trp Gly Glu Phe Cys Asp Trp Phe Ile Glu Phe Ser Lys Val Glu Asn
650 655 660 665

GAA GCG ATA GAC GAA TTA GGG AGC GTG TTA AAA GAG GCT TTA AAA CTC 2070
Glu Ala Ile Asp Glu Leu Gly Ser Val Leu Lys Glu Ala Leu Lys Leu
670 675 680

TTG CAC CCT TTC ATG CCC TTT ATC AGC GAG TCT TTA TAC CAC AAG CTC 2118
Leu His Pro Phe Met Pro Phe Ile Ser Glu Ser Leu Tyr His Lys Leu
685 690 695

AGC AAT ACG GAA CTA GAA AAC ACT GAA TCT ATC ATG GTC ATG CCT TAC 2166
Ser Asn Thr Glu Leu Glu Asn Thr Glu Ser Ile Met Val Met Pro Tyr
700 705 710

CCT AAA GAT TTG GCG CAA GAT GAA AAA TTA GAG CAT GAA TTT GAA GTG 2214
Pro Lys Asp Leu Ala Gln Asp Glu Lys Leu Glu His Glu Phe Glu Val
715 720 725

ATT AAA GAT TGC ATT GTG TCT TTA AGG CGT TTA AAA ATC ATG CTA GAA 2262
Ile Lys Asp Cys Ile Val Ser Leu Arg Arg Leu Lys Ile Met Leu Glu
730 735 740 745

ACC CCA CCG ATT GTT CTA AAA GAA GCG AGC GTG GGA TTA AGA GAA GCC 2310
Thr Pro Pro Ile Val Leu Lys Glu Ala Ser Val Gly Leu Arg Glu Ala
750 755 760

ATA GAA AAC ACA GAG CGT TTG CAA ACT TAC GCC CAA AAA TTA GCG AGG 2358
Ile Glu Asn Thr Glu Arg Leu Gln Thr Tyr Ala Gln Lys Leu Ala Arg
765 770 775

TTG GAA AAA GTC AGC GTG ATT AGT TCT AAG CCT TTA AAA AGC GTG AGC 2406
Leu Glu Lys Val Ser Val Ile Ser Ser Lys Pro Leu Lys Ser Val Ser
780 785 790

GAT GTG GGG GAA TTT TGC CAG ACT TAT GCG AAT TTA GAA AAT CTT GAT 2454
Asp Val Gly Glu Phe Cys Gln Thr Tyr Ala Asn Leu Glu Asn Leu Asp
795 800 805

TTA AGC CCG CTT GTT GCG CGT TTG AAA AAG CAG TTG GAA AAA TTG GAA 2502
Leu Ser Pro Leu Val Ala Arg Leu Lys Lys Gln Leu Glu Lys Leu Glu
810 815 820 825

AAA GAA AAA TTA AAA CTC AAT TTG CAC AAT GAA AAT TTT GTC AAA AAC 2550
Lys Glu Lys Leu Lys Leu Asn Leu His Asn Glu Asn Phe Val Lys Asn
830 835 840

GCG CCT AAA AGC GTG CTA GAA AAA GCT AAA GAG AGT TTA AAA ACG CTT 2598
Ala Pro Lys Ser Val Leu Glu Lys Ala Lys Glu Ser Leu Lys Thr Leu
845 850 855
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CAA TTG TTT TGG GGG CAT CAA ATA CCG GTA TTC ACT TGC GAG AAT AAC 13 02 

Gln Leu Phe Trp Gly His Gln Ile Pro Val Phe Thr Cys Glu Asn Asn
410 415 420 425

CAC CAG TTC GTA AGC TTA GAC ACC CCC TTA AGT TGC CCT ACT TGT AAG 1350
His Gln Phe Val Ser Leu Asp Thr Pro Leu Ser Cys Pro Thr Cys Lys
430 435 440

AGC GAA ACA CTA GAG CAA GAT AAG GAT GTG CTA GAC ACA TGG TTT AGT 1398
Ser Glu Thr Leu Glu Gln Asp Lys Asp Val Leu Asp Thr Trp Phe Ser
445 450 455

TCA GGG CTA TGG GCG TTT TCC ACT CTA GGG TGG GGG CAA GAA AAA AGC 1446
Ser Gly Leu Trp Ala Phe Ser Thr Leu Gly Trp Gly Gln Glu Lys Ser
460 465 470

GGT TTG TTT AAT GAA AGC GAT TTG AAA GAT TTC TAC CCT AAC ACA ACG 1494
Gly Leu Phe Asn Glu Ser Asp Leu Lys Asp Phe Tyr Pro Asn Thr Thr
475 480 485

CTC ATT ACT GGG TTT GAC ATC CTC TTT TTT TGG GTG GCT AGG ATG CTT 1542
Leu Ile Thr Gly Phe Asp Ile Leu Phe Phe Trp Val Ala Arg Met Leu
490 495 500 505

TTT TGC AGC GAA TCG CTT TTA GGC GAA TTG CCC TTT AAA GAT ATT TAC 1590
Phe Cys Ser Glu Ser Leu Leu Gly Glu Leu Pro Phe Lys Asp Ile Tyr
510 515 520

TTG CAC GCC TTA GTG AGA GAT GAA AAG GGT GAA AAA ATG AGC AAA TCT 1638
Leu His Ala Leu Val Arg Asp Glu Lys Gly Glu Lys Met Ser Lys Ser
525 530 535

AAG GGT AAT GTG ATC GAT CCT TTA GAG ATG ATA GAA AAA TAC GGC GCG 1686
Lys Gly Asn Val Ile Asp Pro Leu Glu Met Ile Glu Lys Tyr Gly Ala
540 545 550

GAT AGC TTG CGT TTC ACT TTA GCC AAT TTG TGC GCT ACG GGT AGG GAC 1734
Asp Ser Leu Arg Phe Thr Leu Ala Asn Leu Cys Ala Thr Gly Arg Asp
555 560 565

ATT AAG CTT TCC ACT ACG CAT TTA GAA AAT AAC AAG AAT TTC GCC AAC 1782
Ile Lys Leu Ser Thr Thr His Leu Glu Asn Asn Lys Asn Phe Ala Asn
570 575 580 585

AAG CTT TTT AAT GCG GCG AGT TAC TTG AAG CTC AAA CAA GAA TCT TTC 1830
Lys Leu Phe Asn Ala Ala Ser Tyr Leu Lys Leu Lys Gln Glu Ser Phe
590 595 600

AAA GAT AAA GAG CGT TTG AAT GAA TAC CAA ACG CCT TTG GGG CGT TAT 1878
Lys Asp Lys Glu Arg Leu Asn Glu Tyr Gln Thr Pro Leu Gly Arg Tyr
605 610 615

GCG AAA TCG CGC TTG AAT TCA GCG ACT AAA GAG GCG CGT AAC GCT TTA 1926
Ala Lys Ser Arg Leu Asn Ser Ala Thr Lys Glu Ala Arg Asn Ala Leu
620 625 630
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AAA GAT GGG GCG TTG AGC GAT ATT GAA GTG GAG TAT GAA GAG CGT AAG 63 0 

Lys Asp Gly Ala Leu Ser Asp Ile Glu Val Glu Tyr Glu Glu Arg Lys
190 195 200

GGG GCG TTG TAT TAT ATT AGA TAT TAT TTA GAA AAT CAA AAA GAT TAT 678
Gly Ala Leu Tyr Tyr Ile Arg Tyr Tyr Leu Glu Asn Gln Lys Asp Tyr
205 210 215

TTA GTG GTG GCT ACC ACA CGC CCT GAA ACC TTG TTT GGC GAT AGC GCG 726
Leu Val Val Ala Thr Thr Arg Pro Glu Thr Leu Phe Gly Asp Ser Ala
220 225 230

CTT ATG GTC AAT CCT AAC GAT GAG AGA TAC AAG CAT TTG GTG GGG CAA 774
Leu Met Val Asn Pro Asn Asp Glu Arg Tyr Lys His Leu Val Gly Gln
235 240 245

AAA GCG ATC TTG CCT TTA ATC CAT CGC ACA ATC CCT ATT ATC GCT GAT 822
Lys Ala Ile Leu Pro Leu Ile His Arg Thr Ile Pro Ile Ile Ala Asp
250 255 260 265

GAA CAT GTT GAA ATG GAG TTT GGC ACA GGG TGT GTG AAA GTA ACC CCT 870
Glu His Val Glu Met Glu Phe Gly Thr Gly Cys Val Lys Val Thr Pro
270 275 280

GGG CAT GAT TTT AAC GAT TAT GAA GTG GGC AAA CGC CAC CAT TTG GAA 918
Gly His Asp Phe Asn Asp Tyr Glu Val Gly Lys Arg His His Leu Glu
285 290 295

ACG ATT AAA ATC TTT GAT GAA AAG GGG ATT TTA AAC GCG CAT TGC GGG 966
Thr Ile Lys Ile Phe Asp Glu Lys Gly Ile Leu Asn Ala His Cys Gly
300 305 310

GAG TTT GAA AAT TTA GAA CGA TTA GAA GCT AGA GAT AAG GTC GTA GAA 1014
Glu Phe Glu Asn Leu Glu Arg Leu Glu Ala Arg Asp Lys Val Val Glu
315 320 325

AGA TTA AAA GAA AAC GCC CTA TTG GAA AAA ATA GAA GAA CAC ACG CAT 1062
Arg Leu Lys Glu Asn Ala Leu Leu Glu Lys Ile Glu Glu His Thr His
330 335 340 345

CAA GTG GGG CAT TGC TAT CGT TGT CAT AAT GTG GTA GAA CCT TAT GTG 1110
Gln Val Gly His Cys Tyr Arg Cys His Asn Val Val Glu Pro Tyr Val
350 355 360

TCT AAG CAA TGG TTT GTC AAG CCT GAA ATC GCT CAA AGT TCT ATT GAA 1158
Ser Lys Gln Trp Phe Val Lys Pro Glu Ile Ala Gln Ser Ser Ile Glu
365 370 375

AAA ATC CAA CAA GGT TTG GCG CGA TTC TAC CCT TCT AAT TGG ATC AAT 1206
Lys Ile Gln Gln Gly Leu Ala Arg Phe Tyr Pro Ser Asn Trp Ile Asn
380 385 390

AAT TAC AAC GCT TGG ATG AGG GAA TTA CGC CCT TGG TGT ATC AGC AGG 1254
Asn Tyr Asn Ala Trp Met Arg Glu Leu Arg Pro Trp Cys Ile Ser Arg
395 400 405
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28...2649
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:995: 

TAAGGAACGC TCTATTTTAG GATAATA ATG ATA ATG AAA CAA GAA CCC ACC ACC 54
Met Ile Met Lys Gln Glu Pro Thr Thr
1 5

TAC CAA CCA GAA GAG ATA GAA AAA AAG ATT TAT GAA ATT TGC TCT CAT 102
Tyr Gln Pro Glu Glu Ile Glu Lys Lys Ile Tyr Glu Ile Cys Ser His
10 15 20 25

AGG GGG TAT TTT GAA ATT GAT GGC AAT GAA GCG ATC CAA GAA AAA AAC 150
Arg Gly Tyr Phe Glu Ile Asp Gly Asn Glu Ala Ile Gln Glu Lys Asn
30 35 40

AAA CGA TTT TGC TTG ATG ATG CCC CCT CCT AAT GTG ACC GGT GTG TTG 198
Lys Arg Phe Cys Leu Met Met Pro Pro Pro Asn Val Thr Gly Val Leu
45 50 55

CAC ATA GGG CAT GCC CTG ACT TTA AGC TTG CAA GAT ATT TTA GCG CGT 246
His Ile Gly His Ala Leu Thr Leu Ser Leu Gln Asp Ile Leu Ala Arg
60 65 70

TAC AAA CGC ATG GAT GGG TAT AAG ACT TTG TAT CAG CCC GGG TTG GAT 294
Tyr Lys Arg Met Asp Gly Tyr Lys Thr Leu Tyr Gln Pro Gly Leu Asp
75 80 85

CAC GCT GGC ATT GCA ACG CAA AAT GTC GTG GAA AAG CAG CTT TTA AGT 342
His Ala Gly Ile Ala Thr Gln Asn Val Val Glu Lys Gln Leu Leu Ser
90 95 100 105

CAA GGG ATT AAA AAA GAA GAT TTA GGG CGT GAA GAG TTC ATT AAA AAA 390
Gln Gly Ile Lys Lys Glu Asp Leu Gly Arg Glu Glu Phe Ile Lys Lys
110 115 120

GTG TGG GAA TGG AAA GAA AAG AGC GGG GGA GCG ATT TTA GAG CAA ATG 438
Val Trp Glu Trp Lys Glu Lys Ser Gly Gly Ala Ile Leu Glu Gln Met
125 130 135

AAG CGT TTA GGC GTG AGC GCG GCC TTT TCT AGG ACT CGT TTC ACG ATG 486
Lys Arg Leu Gly Val Ser Ala Ala Phe Ser Arg Thr Arg Phe Thr Met
140 145 150

GAT AAG GGC TTG CAA AGA GCG GTC AAA TTG GCG TTT TTG AAA TGG TAT 534
Asp Lys Gly Leu Gln Arg Ala Val Lys Leu Ala Phe Leu Lys Trp Tyr
155 160 165

GAA AAA GGT CTC ATT ATT CAA GAT AAT TAC ATG GTG AAT TGG TGC ACT 582
Glu Lys Gly Leu Ile Ile Gln Asp Asn Tyr Met Val Asn Trp Cys Thr
170 175 180 185



WO 98/43478

PCT/US98/06371



130 135 140

Tyr Ser Lys Gly Gly Lys Val Gly Leu Phe Gly Gly Ala Gly Val Gly
145 150 155 160

Lys Thr Val Ile Ile Met Glu Leu Ile His Asn Val Ala Tyr Lys His
165 170 175

Asn Gly Tyr Ser Val Phe Ala Gly Val Gly Glu Arg Thr Arg Glu Gly
180 185 190

Asn Asp Leu Tyr Phe Glu Met Lys Glu Gly Gly Val Leu Asp Lys Val
195 200 205

Ala Leu Cys Tyr Gly Gln Met Asn Glu Pro Pro Gly Ala Arg Asn Arg
210 215 220

Ile Ala Phe Thr Gly Leu Thr Met Ala Glu Tyr Phe Arg Asp Glu Lys
225 230 235 240

Gly Leu Asp Val Leu Met Phe Ile Asp Asn Ile Phe Arg Tyr Ala Gln
245 250 255

Ser Gly Ala Glu Met Ser Ala Leu Leu Gly Arg Ile Pro Ser Ala Val
260 265 270

Gly Tyr Gln Pro Thr Leu Ala Gly Glu Met Gly Lys Leu Gln Glu Arg
275 280 285

Ile Ala Ser Thr Lys Asn Gly Ser Ile Thr Ser Val Gln Ala Val Tyr
290 295 300

Val Pro Ala Asp Asp Leu Thr Asp Pro Ala Pro Ala Ser Val Phe Ala
305 310 315 320

His Leu Asp Ala Thr Thr Val Leu Asn Arg Lys Ile Ala Glu Lys Gly
325 330 335

Ile Tyr Pro Ala Val Asp Pro Leu Asp Ser Thr Ser Arg Ile Leu Ser
340 345 350

Pro Gln Met Ile Gly Glu Lys His Tyr Glu Val Ala Thr Gly Ile Gln
355 360 365

Gln Val Leu Gln Lys Tyr Lys Asp Leu Gln Asp Ile Ile Ala Ile Leu
370 375 380

Gly Leu Asp Glu Leu Ser Glu Glu Asp Lys Lys Thr Val Glu Arg Ala
385 390 395 400

Arg Lys Ile Glu Lys Phe Leu Ser Gln Pro Phe Phe Val Ala Glu Val
405 410 415

Phe Thr Gly Ser Pro Gly Lys Tyr Val Thr Leu Gln Glu Thr Leu Glu
420 425 430

Gly Phe Gly Gly Ile Leu Glu Gly Lys Tyr Asp His Ile Pro Glu Asn
435 440 445

Ala Phe Tyr Met Val Gly Ser Ile Gln Glu Val Leu Glu Lys Ala Lys
450 455 460

Asn Met Lys Asn Ser
465



(2) INFORMATION FOR SEQ ID NO: 995: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2716 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 






TTG CAA GAC ATT ATT GCG ATT TTG GGA TTA GAC GAA TTG AGC GAA GAG 1205
Leu Gln Asp Ile Ile Ala Ile Leu Gly Leu Asp Glu Leu Ser Glu Glu
380 385 390

GAT AAA AAA ACG GTT GAA AGG GCC AGA AAA ATT GAG AAG TTT TTA TCC 1253
Asp Lys Lys Thr Val Glu Arg Ala Arg Lys Ile Glu Lys Phe Leu Ser
395 400 405

CAG CCG TTC TTT GTG GCT GAA GTG TTT ACA GGA AGT CCT GGT AAA TAT 1301
Gln Pro Phe Phe Val Ala Glu Val Phe Thr Gly Ser Pro Gly Lys Tyr
410 415 420

GTA ACC CTT CAA GAG ACT TTA GAG GGC TTT GGA GGG ATT TTA GAG GGC 1349
Val Thr Leu Gln Glu Thr Leu Glu Gly Phe Gly Gly Ile Leu Glu Gly
425 430 435 440

AAA TAC GAT CAT ATT CCC GAG AAC GCG TTT TAT ATG GTG GGT AGC ATT 1397
Lys Tyr Asp His Ile Pro Glu Asn Ala Phe Tyr Met Val Gly Ser Ile
445 450 455

CAA GAG GTT TTA GAA AAA GCT AAA AAC ATG AAA AAT TCC TAAGGGTTTT GT 1448
Gln Glu Val Leu Glu Lys Ala Lys Asn Met Lys Asn Ser
460 465

GATGGCTTTG TTGAAAATTA 1468

(2) INFORMATION FOR SEQ ID NO: 994: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994: 

Met Lys Ala Met Glu Gly Lys Ile Ile Gln Val Leu Gly Pro Val Val
1 5 10 15

Asp Val Glu Phe Glu Ser Tyr Leu Pro Ala Ile Phe Glu Ala Leu Asp
20 25 30

Ile Asn Phe Glu Val Asn Gly Val Gln Lys Ser Leu Val Leu Glu Val
35 40 45

Ala Ala His Leu Gly Gly Asn Arg Val Arg Ala Ile Ala Met Asp Met
50 55 60

Thr Glu Gly Leu Val Arg Asn Gln Val Ile Lys Ala Arg Gly Lys Met
65 70 75 80

Ile Glu Val Pro Val Gly Glu Glu Val Leu Gly Arg Ile Phe Asn Val
85 90 95

Val Gly Glu Ser Ile Asp Asn Leu Glu Pro Leu Lys Pro Ser Leu Thr
100 105 110

Trp Pro Ile His Arg Lys Ala Pro Ser Phe Glu Gln Gln Ser Thr Lys
115 120 125

Thr Glu Met Phe Glu Thr Gly Ile Lys Val Ile Asp Leu Leu Ala Pro
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TTG TTT GGT GGG GCT GGC GTA GGC AAA ACG GTG ATC ATT ATG GAG CTT 53 3 

Leu Phe Gly Gly Ala Gly Val Gly Lys Thr Val Ile Ile Met Glu Leu
155 160 165

ATC CAT AAT GTG GCT TAT AAG CAT AAC GGG TAT TCG GTG TTT GCA GGT 581
Ile His Asn Val Ala Tyr Lys His Asn Gly Tyr Ser Val Phe Ala Gly
170 175 180

GTG GGG GAG CGC ACC AGA GAG GGG AAT GAT CTG TAT TTT GAA ATG AAA 629
Val Gly Glu Arg Thr Arg Glu Gly Asn Asp Leu Tyr Phe Glu Met Lys
185 190 195 200

GAA GGG GGC GTT TTA GAC AAA GTC GCA CTG TGT TAT GGG CAA ATG AAT 677
Glu Gly Gly Val Leu Asp Lys Val Ala Leu Cys Tyr Gly Gln Met Asn
205 210 215

GAG CCA CCA GGC GCG AGG AAC CGC ATC GCA TTC ACC GGC TTG ACG ATG 725
Glu Pro Pro Gly Ala Arg Asn Arg Ile Ala Phe Thr Gly Leu Thr Met
220 225 230

GCG GAG TAT TTT CGT GAT GAA AAG GGC TTA GAT GTG TTG ATG TTT ATT 773
Ala Glu Tyr Phe Arg Asp Glu Lys Gly Leu Asp Val Leu Met Phe Ile
235 240 245

GAC AAC ATC TTT AGA TAC GCT CAA AGC GGT GCG GAA ATG AGC GCG CTA 821
Asp Asn Ile Phe Arg Tyr Ala Gln Ser Gly Ala Glu Met Ser Ala Leu
250 255 260

TTA GGC CGT ATC CCT TCA GCG GTG GGG TAT CAG CCC ACG CTA GCC GGG 869
Leu Gly Arg Ile Pro Ser Ala Val Gly Tyr Gln Pro Thr Leu Ala Gly
265 270 275 280

GAA ATG GGG AAA CTT CAA GAG CGT ATC GCT TCC ACT AAA AAT GGC TCT 917
Glu Met Gly Lys Leu Gln Glu Arg Ile Ala Ser Thr Lys Asn Gly Ser
285 290 295

ATC ACT TCC GTT CAA GCG GTG TAT GTG CCA GCA GAT GAC TTG ACT GAC 965
Ile Thr Ser Val Gln Ala Val Tyr Val Pro Ala Asp Asp Leu Thr Asp
300 305 310

CCA GCC CCT GCT TCG GTG TTT GCG CAT TTG GAT GCG ACT ACG GTG TTG 1013
Pro Ala Pro Ala Ser Val Phe Ala His Leu Asp Ala Thr Thr Val Leu
315 320 325

AAT AGA AAG ATC GCT GAA AAA GGG ATT TAT CCG GCG GTG GAT CCT TTG 1061
Asn Arg Lys Ile Ala Glu Lys Gly Ile Tyr Pro Ala Val Asp Pro Leu
330 335 340

GAT TCC ACT TCA AGG ATT TTA AGC CCT CAA ATG ATC GGT GAG AAA CAC 1109
Asp Ser Thr Ser Arg Ile Leu Ser Pro Gln Met Ile Gly Glu Lys His
345 350 355 360

TAT GAA GTC GCT ACC GGT ATC CAG CAG GTT TTA CAA AAA TAC AAG GAT 1157
Tyr Glu Val Ala Thr Gly Ile Gln Gln Val Leu Gln Lys Tyr Lys Asp
365 370 375
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 30...1436
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993: 

AAAATAAAAA TTATATTAAT CAAGGAGCG ATG AAA GCG ATG GAA GGT AAA ATC 53
Met Lys Ala Met Glu Gly Lys Ile
1 5

ATT CAG GTT TTA GGC CCT GTG GTA GAT GTG GAG TTT GAA TCC TAT CTG 101
Ile Gln Val Leu Gly Pro Val Val Asp Val Glu Phe Glu Ser Tyr Leu
10 15 20

CCG GCG ATT TTT GAA GCG TTA GAC ATT AAT TTT GAA GTC AAT GGT GTT 149
Pro Ala Ile Phe Glu Ala Leu Asp Ile Asn Phe Glu Val Asn Gly Val
25 30 35 40

CAA AAG TCT TTA GTT TTA GAG GTG GCA GCC CAT TTG GGC GGT AAT CGG 197
Gln Lys Ser Leu Val Leu Glu Val Ala Ala His Leu Gly Gly Asn Arg
45 50 55

GTG CGA GCG ATT GCT ATG GAT ATG ACA GAA GGC TTA GTG CGT AAC CAA 245
Val Arg Ala Ile Ala Met Asp Met Thr Glu Gly Leu Val Arg Asn Gln
60 65 70

GTG ATC AAG GCT CGC GGC AAA ATG ATT GAA GTG CCT GTG GGC GAA GAA 293
Val Ile Lys Ala Arg Gly Lys Met Ile Glu Val Pro Val Gly Glu Glu
75 80 85

GTA TTA GGG CGT ATT TTT AAT GTT GTG GGC GAG AGC ATT GAC AAT TTA 341
Val Leu Gly Arg Ile Phe Asn Val Val Gly Glu Ser Ile Asp Asn Leu
90 95 100

GAG CCG CTT AAG CCG TCC TTA ACT TGG CCC ATT CAC AGA AAA GCC CCT 389
Glu Pro Leu Lys Pro Ser Leu Thr Trp Pro Ile His Arg Lys Ala Pro
105 110 115 120

AGT TTT GAG CAG CAA AGC ACT AAA ACA GAA ATG TTT GAA ACT GGT ATT 437
Ser Phe Glu Gln Gln Ser Thr Lys Thr Glu Met Phe Glu Thr Gly Ile
125 130 135

AAA GTC ATT GAC TTA CTC GCG CCT TAT TCT AAG GGC GGT AAA GTA GGC 485
Lys Val Ile Asp Leu Leu Ala Pro Tyr Ser Lys Gly Gly Lys Val Gly
140 145 150
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:992:



Met Asp Phe Cys Ser Gly Ile Gly Gly Gly Arg Leu Gly Leu Glu Gln
1 5 10 15

Cys His Leu Lys Cys Val Gly His Ala Glu Ile Asn His Glu Ala Leu
20 25 30

Arg Thr Tyr Glu Leu Phe Phe Lys Asp Thr His Asn Phe Gly Asp Leu
35 40 45

Met Arg Ile Asn Pro Asn Asp Leu Pro Asp Phe Asp Ala Leu Ile Ser
50 55 60

Gly Phe Pro Cys Gln Ala Phe Ser Ile Asn Gly Lys Arg Lys Gly Leu
65 70 75 80

Glu Asp Glu Arg Gly Thr Ile Ile Tyr Gly Leu Ile Arg Ile Leu Lys
85 90 95

Val Lys Gln Pro Glu Cys Phe Leu Leu Glu Asn Val Lys Gly Leu Ile
100 105 110

Asn His Asn Lys Lys Ala Thr Phe Asn Ile Ile Ile Lys Ala Leu Gln
115 120 125

Glu Val Gly Tyr Thr Thr Tyr Tyr Lys Ile Leu Asn Ser Ala Asp Phe
130 135 140

Gln Leu Ala Gln Asn Arg Glu Arg Leu Tyr Ile Val Gly Phe Arg Lys
145 150 155 160

Asp Leu Lys His Pro Phe Asn Phe Pro Leu Gly Leu Ala Asn Asp Tyr
165 170 175

Tyr Phe Lys Asp Phe Leu Asp Ala Asp Asn Glu Cys Tyr Leu Asp Val
180 185 190

Ser Asn Ala Ala Phe Gln Arg Tyr Leu His Asn Arg Tyr Asn His Asn
195 200 205

Arg Val Ser Leu Glu Asp Leu Leu Thr Leu Glu Asn Ala Val Leu Asp
210 215 220

Thr Arg Gln Ser Asp Leu Arg Leu Tyr Ser Asn Val Phe Pro Thr Leu
225 230 235 240

Arg Thr Ser Arg His Gly Leu Phe Tyr Thr Gln Lys Gly Lys Ile Lys
245 250 255

Arg Leu Asn Ala Ile Glu Ser Leu Leu Leu Gln Gly Phe Pro Arg Asp
260 265 270

Leu Ile Ala Lys Ile Lys Asp Asn Pro Asn Phe Lys Ala Ser His Leu
275 280 285

Leu Ser Gln Ala Gly Asn Ala Met Ser Val Asn Val Ile Ala Ala Ile
290 295 300

Ala Lys Gln Met Leu Lys Ala Ile
305 310



(2) INFORMATION FOR SEQ ID NO: 993:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1468 base pairs
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Val Lys Gly Leu lie Asn His Asn Lys Lys Ala Thr Phe Asn lie lie 
110 115 120 

ATC AAA GCC CTA CAA GAA GTG GGT TAT ACA ACT TAT TAT AAA ATT TTA 435
Ile Lys Ala Leu Gln Glu Val Gly Tyr Thr Thr Tyr Tyr Lys Ile Leu
125 130 135

AAC AGC GCT GAT TTT CAA TTA GCC CAA AAT AGA GAA CGC CTT TAT ATC 483
Asn Ser Ala Asp Phe Gln Leu Ala Gln Asn Arg Glu Arg Leu Tyr Ile
140 145 150 155

GTA GGG TTT AGG AAG GAT TTA AAA CAC CCA TTT AAT TTC CCT TTA GGT 531
Val Gly Phe Arg Lys Asp Leu Lys His Pro Phe Asn Phe Pro Leu Gly
160 165 170

TTA GCC AAT GAT TAT TAT TTC AAG GAT TTT TTA GAC GCT GAT AAT GAA 579
Leu Ala Asn Asp Tyr Tyr Phe Lys Asp Phe Leu Asp Ala Asp Asn Glu
175 180 185

TGT TAT TTG GAT GTG AGT AAC GCT GCA TTT CAA AGA TAC TTG CAC AAC 627
Cys Tyr Leu Asp Val Ser Asn Ala Ala Phe Gln Arg Tyr Leu His Asn
190 195 200

CGA TAC AAC CAT AAC CGG GTT TCT TTA GAG GAT CTC TTA ACT TTA GAA 675
Arg Tyr Asn His Asn Arg Val Ser Leu Glu Asp Leu Leu Thr Leu Glu
205 210 215

AAC GCT GTT TTA GAC ACA AGA CAA TCT GAT TTA AGG TTG TAT TCT AAT 723
Asn Ala Val Leu Asp Thr Arg Gln Ser Asp Leu Arg Leu Tyr Ser Asn
220 225 230 235

GTT TTT CCT ACT TTA AGG ACT TCT CGG CAT GGC CTG TTT TAT ACC CAA 771
Val Phe Pro Thr Leu Arg Thr Ser Arg His Gly Leu Phe Tyr Thr Gln
240 245 250

AAA GGC AAA ATC AAA AGA TTA AAC GCT ATT GAA AGC TTG CTT TTG CAA 819
Lys Gly Lys Ile Lys Arg Leu Asn Ala Ile Glu Ser Leu Leu Leu Gln
255 260 265

GGA TTT CCT AGG GAT TTG ATC GCT AAG ATT AAA GAT AAT CCT AAC TTT 867
Gly Phe Pro Arg Asp Leu Ile Ala Lys Ile Lys Asp Asn Pro Asn Phe
270 275 280

AAA GCA AGC CAT TTG CTA TCC CAA GCG GGG AAT GCG ATG AGC GTG AAT 915
Lys Ala Ser His Leu Leu Ser Gln Ala Gly Asn Ala Met Ser Val Asn
285 290 295

GTG ATT GCT GCA ATC GCT AAA CAA ATG TTA AAG GCG ATT TAATAAGGGA GC 966
Val Ile Ala Ala Ile Ala Lys Gln Met Leu Lys Ala Ile
300 305 310

TTTAAGGGGA GAATGATTTC AAAATACCCC CTATCCCCTT AA 1008

(2) INFORMATION FOR SEQ ID NO:992: 
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Val Ser Gly Met Leu Glu lie Gly Ala Phe Lys Glu Ala Phe Lys Lys 
145 150 155 160 

Val Leu Gly Lys Lys Leu Thr Gln Glu Val Ile Asp Ala Asn Met Leu
165 170 175

Ala Ile Gln Arg Ala Tyr Glu Glu Val Gln
180 185

(2) INFORMATION FOR SEQ ID NO: 991: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1008 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19...954
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991: 

TTGGGGATTT TAACTTTT ATG GAT TTT TGC TCT GGC ATT GGT GGA GGC CGT 51
Met Asp Phe Cys Ser Gly Ile Gly Gly Gly Arg
1 5 10

TTG GGC TTG GAG CAA TGC CAT TTA AAA TGC GTA GGG CAT GCA GAA ATC 99
Leu Gly Leu Glu Gln Cys His Leu Lys Cys Val Gly His Ala Glu Ile
15 20 25

AAT CAT GAA GCC CTT AGG ACT TAT GAA TTA TTT TTT AAA GAT ACC CAT 147
Asn His Glu Ala Leu Arg Thr Tyr Glu Leu Phe Phe Lys Asp Thr His
30 35 40

AAT TTT GGG GAT TTG ATG CGA ATC AAC CCT AAT GAT TTA CCC GAT TTT 195
Asn Phe Gly Asp Leu Met Arg Ile Asn Pro Asn Asp Leu Pro Asp Phe
45 50 55

GAT GCA CTC ATT AGC GGG TTT CCT TGT CAA GCT TTT TCT ATC AAT GGC 243
Asp Ala Leu Ile Ser Gly Phe Pro Cys Gln Ala Phe Ser Ile Asn Gly
60 65 70 75

AAA AGG AAG GGG CTT GAA GAT GAA AGA GGG ACG ATT ATT TAC GGG CTT 291
Lys Arg Lys Gly Leu Glu Asp Glu Arg Gly Thr Ile Ile Tyr Gly Leu
80 85 90

ATT CGC ATT TTA AAA GTT AAA CAG CCT GAA TGT TTC TTG CTT GAA AAT 339
Ile Arg Ile Leu Lys Val Lys Gln Pro Glu Cys Phe Leu Leu Glu Asn
95 100 105

GTT AAG GGC TTG ATC AAT CAT AAT AAA AAG GCA ACT TTT AAT ATT ATT 387
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Lys Glu Glu Leu Phe Glu Lys Lys Pro Glu Leu Lys Thr Arg Lys Val 
105 110 115 

TTT TTA GTG GAT TGT TTA AAA ATC TCT ATG GAA ACC TTA AAA CGC CCC 441
Phe Leu Val Asp Cys Leu Lys Ile Ser Met Glu Thr Leu Lys Arg Pro
120 125 130

ATC CCT AAC ACG CCC ATG TTA GGG GCG TTA ATG AAA GTG TCT GGC ATG 489
Ile Pro Asn Thr Pro Met Leu Gly Ala Leu Met Lys Val Ser Gly Met
135 140 145

CTT GAA ATT GGG GCT TTT AAA GAA GCT TTT AAG AAA GTT TTA GGC AAA 537
Leu Glu Ile Gly Ala Phe Lys Glu Ala Phe Lys Lys Val Leu Gly Lys
150 155 160

AAA CTC ACG CAA GAA GTC ATT GAC GCT AAC ATG CTC GCT ATC CAA AGA 585
Lys Leu Thr Gln Glu Val Ile Asp Ala Asn Met Leu Ala Ile Gln Arg
165 170 175 180

GCT TAT GAA GAA GTT CAA TAACATTAAG GAACAAAGAT GAAAGATTGG AACGAATT 641
Ala Tyr Glu Glu Val Gln
185

TGAAATGGG 650

(2) INFORMATION FOR SEQ ID NO: 990: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990: 

Met Phe Gln Ile Arg Trp His Ala Arg Ala Gly Gln Gly Ala Ile Thr
1 5 10 15

Gly Ala Lys Gly Leu Ala Asp Val Ile Ser Lys Thr Gly Lys Glu Val
20 25 30

Gln Ala Phe Ala Ser Tyr Gly Ser Ala Lys Arg Gly Ala Ala Met Met
35 40 45

Ala Tyr Asn Arg Val Asp Asp Glu Pro Ile Leu Asn His Glu Arg Phe
50 55 60

Met Gln Pro Asp Tyr Val Leu Val Ile Asp Pro Gly Leu Val Phe Ile
65 70 75 80

Glu Asn Ile Phe Ala Asn Glu Lys Glu Asp Thr Thr Tyr Ile Ile Thr
85 90 95

Ser Tyr Leu Asn Lys Glu Glu Leu Phe Glu Lys Lys Pro Glu Leu Lys
100 105 110

Thr Arg Lys Val Phe Leu Val Asp Cys Leu Lys Ile Ser Met Glu Thr
115 120 125

Leu Lys Arg Pro Ile Pro Asn Thr Pro Met Leu Gly Ala Leu Met Lys
130 135 140
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Arg Lys Arg Leu Phe Arg Val Phe Val Lys His Cys lie Leu lie Pro 
385 390 395 400 

Leu Lys Ser Leu Val Gly Lys Thr Leu Arg Leu Leu Lys Leu His Ala
405 410 415

Leu Ala Lys Lys Ile Leu Ile Gln Leu Lys Leu Leu Lys Lys Ser
420 425 430

(2) INFORMATION FOR SEQ ID NO: 989: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 650 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 46...603
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989:

GCTGAATTCA ATTTATTTTA TACGATATTA AGGAGACATA TTACC ATG TTT CAA ATT 57
Met Phe Gln Ile
1

AGA TGG CAT GCA CGA GCG GGT CAA GGT GCA ATC ACT GGC GCT AAA GGG 105
Arg Trp His Ala Arg Ala Gly Gln Gly Ala Ile Thr Gly Ala Lys Gly
5 10 15 20

TTG GCT GAT GTG ATT TCA AAA ACA GGC AAA GAA GTG CAA GCG TTC GCT 153
Leu Ala Asp Val Ile Ser Lys Thr Gly Lys Glu Val Gln Ala Phe Ala
25 30 35

TCT TAT GGT TCA GCT AAA AGG GGG GCT GCT ATG ATG GCT TAT AAC CGC 201
Ser Tyr Gly Ser Ala Lys Arg Gly Ala Ala Met Met Ala Tyr Asn Arg
40 45 50

GTT GAT GAT GAA CCT ATC TTA AAC CAT GAA CGC TTC ATG CAG CCT GAT 249
Val Asp Asp Glu Pro Ile Leu Asn His Glu Arg Phe Met Gln Pro Asp
55 60 65

TAT GTG CTG GTG ATT GAC CCT GGT TTG GTT TTC ATT GAA AAC ATC TTC 297
Tyr Val Leu Val Ile Asp Pro Gly Leu Val Phe Ile Glu Asn Ile Phe
70 75 80

GCC AAT GAA AAA GAA GAC ACG ACT TAT ATT ATC ACT AGC TAC CTT AAC 345
Ala Asn Glu Lys Glu Asp Thr Thr Tyr Ile Ile Thr Ser Tyr Leu Asn
85 90 95 100

AAA GAA GAA TTG TTT GAA AAA AAA CCT GAA TTA AAA ACC CGT AAG GTG 393
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988:

Met Thr Ser Ala Ser Ser His Ser Phe Lys Glu Gln Asp Phe His Ile
1 5 10 15

Pro Ile Ala Phe Ala Phe Asp Lys Asn Tyr Leu Ile Pro Ala Gly Ala
20 25 30

Cys Leu Tyr Ser Leu Leu Glu Ser Ile Ala Lys Ala Asn Lys Lys Ile
35 40 45

Arg Tyr Thr Leu His Ala Leu Val Val Gly Leu Asn Glu Glu Asp Lys
50 55 60

Ala Lys Leu Asn Gln Ile Thr Glu Pro Phe Lys Glu Phe Ala Ala Leu
65 70 75 80

Glu Val Arg Asp Ile Glu Ser Phe Leu Asp Thr Ile Pro Asn Pro Phe
85 90 95

Asp Glu Asp Phe Thr Lys Arg Phe Ser Lys Met Val Leu Val Lys Tyr
100 105 110

Phe Leu Ala Asp Leu Phe Pro Lys Tyr Ser Lys Met Val Trp Ser Asp
115 120 125

Val Asp Val Ile Phe Cys Asn Glu Phe Ser Ala Asp Phe Leu Asn Leu
130 135 140

Glu Glu Asn Asp Glu Asn Tyr Phe Tyr Gly Val Leu Glu Val Glu Lys
145 150 155 160

His His Met Met Glu Gly Phe Leu Phe Cys Asn Leu Asp Tyr Gln Arg
165 170 175

Lys Lys Asn Phe Thr Leu Arg Met His Glu Leu Leu Arg Gly Asn Glu
180 185 190

Ala Lys Gly Glu Leu Asp Phe Thr Lys Trp Cys Trp Pro Asn Met Lys
195 200 205

Ala Leu Gly Ile Glu Tyr Cys Val Phe Pro Tyr Tyr Tyr Thr Ile Lys
210 215 220

Asp Phe Ser Asn Ala Tyr Leu Asn Glu Asn Tyr Lys Lys Thr Ile Leu
225 230 235 240

Glu Ala Arg Glu Asn Pro Thr Ile Ile His Tyr Asp Ala Trp Trp Gly
245 250 255

Ala Val Lys Pro Trp Asp Tyr Pro Phe Gly Leu Lys Ala Asp Leu Trp
260 265 270

Leu Asn Ala Leu Ala Lys Thr Pro Phe Met Ser Asp Trp Ile Asp Ser
275 280 285

Ile Ala Arg Val Glu Ile Gly Ser Glu Lys Trp His Arg Tyr His Ser
290 295 300

Ile Val Ala Tyr His Tyr Tyr Phe Pro Leu Trp Lys Thr Glu Glu Gln
305 310 315 320

Ile Ala His Asp Ala Leu Lys Thr Phe Leu Asp His Tyr Phe Ser Cys
325 330 335

Ile His Ala Ala Ile Lys Gln Glu Asn Leu Gly Met Phe Leu Asn His
340 345 350

Tyr Phe Ser His Ala His Ala Glu Ile Lys Glu Asn Ser Leu Glu Met
355 360 365

Phe Leu Asn His Tyr Phe Ser His Val Tyr Arg Leu Pro Lys Lys Ala
370 375 380
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CAC TAT GAC GCT TGG TGG GGA GCG GTG AAG OCT TGG GAC TAT CCT TTT 819 
His Tyr Asp Ala Trp Trp Gly Ala Val Lys Pro Trp Asp Tyr Pro Phe 
250 255 260 265 

GGT TTA AAA GCG GAT TTA TGG CTG AAC GCT TTG GCT AAA ACC CCT TTT 867
Gly Leu Lys Ala Asp Leu Trp Leu Asn Ala Leu Ala Lys Thr Pro Phe
270 275 280

ATG AGC GAT TGG ATT GAT TCG ATC GCT AGG GTG GAA ATA GGC AGC GAA 915
Met Ser Asp Trp Ile Asp Ser Ile Ala Arg Val Glu Ile Gly Ser Glu
285 290 295

AAA TGG CAT CGT TAC CAC AGC ATC GTT GCC TAT CAC TAC TAC TTT CCC 963
Lys Trp His Arg Tyr His Ser Ile Val Ala Tyr His Tyr Tyr Phe Pro
300 305 310

CTA TGG AAG ACT GAA GAG CAG ATC GCC CAT GAC GCA CTC AAG ACC TTT 1011
Leu Trp Lys Thr Glu Glu Gln Ile Ala His Asp Ala Leu Lys Thr Phe
315 320 325

TTA GAC CAT TAT TTT TCG TGC ATC CAT GCC GCA ATC AAG CAA GAA AAT 1059
Leu Asp His Tyr Phe Ser Cys Ile His Ala Ala Ile Lys Gln Glu Asn
330 335 340 345

CTC GGA ATG TTC TTG AAC CAC TAC TTC TCG CAT GCC CAT GCA GAG ATC 1107
Leu Gly Met Phe Leu Asn His Tyr Phe Ser His Ala His Ala Glu Ile
350 355 360

AAA GAA AAC TCC CTT GAA ATG TTC TTG AAC CAC TAC TTC TCG CAT GTT 1155
Lys Glu Asn Ser Leu Glu Met Phe Leu Asn His Tyr Phe Ser His Val
365 370 375

TAT AGG CTC CCT AAA AAA GCA CGG AAG AGA CTC TTT AGG GTG TTT GTC 1203
Tyr Arg Leu Pro Lys Lys Ala Arg Lys Arg Leu Phe Arg Val Phe Val
380 385 390

AAA CAC TGC ATC CTC ATA CCA CTC AAG AGC CTT GTG GGT AAG ACT CTA 1251
Lys His Cys Ile Leu Ile Pro Leu Lys Ser Leu Val Gly Lys Thr Leu
395 400 405

CGA CTC TTA AAA CTC CAT GCG CTA GCT AAA AAA ATC CTA ATC CAA CTC 1299
Arg Leu Leu Lys Leu His Ala Leu Ala Lys Lys Ile Leu Ile Gln Leu
410 415 420 425

AAG CTC TTA AAA AAG AGC TAGAGCCAAA ACTTTTAATC AAACGATTTT TTCATATC 1355
Lys Leu Leu Lys Lys Ser
430

AATCACATAG CGGAATTTCG CTT 1378

(2) INFORMATION FOR SEQ ID NO: 988: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
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TAG CTC ATT CCT GCG GGC GCG TGT CTT TAT TCC TTG CTA GAA AGC ATC 14 7 

Tyr Leu Ile Pro Ala Gly Ala Cys Leu Tyr Ser Leu Leu Glu Ser Ile
30 35 40

GCT AAA GCC AAT AAA AAA ATC CGT TAC ACC CTA CAC GCT TTA GTG GTA 195
Ala Lys Ala Asn Lys Lys Ile Arg Tyr Thr Leu His Ala Leu Val Val
45 50 55

GGC TTG AAT GAA GAA GAT AAA GCA AAG CTT AAT CAA ATC ACA GAG CCT 243
Gly Leu Asn Glu Glu Asp Lys Ala Lys Leu Asn Gln Ile Thr Glu Pro
60 65 70

TTT AAA GAA TTT GCC GCT TTG GAA GTG AGA GAT ATT GAG TCT TTT TTA 291
Phe Lys Glu Phe Ala Ala Leu Glu Val Arg Asp Ile Glu Ser Phe Leu
75 80 85

GAC ACT ATC CCT AAC CCT TTT GAT GAG GAT TTC ACT AAG CGT TTT TCT 339
Asp Thr Ile Pro Asn Pro Phe Asp Glu Asp Phe Thr Lys Arg Phe Ser
90 95 100 105

AAA ATG GTG TTA GTG AAG TAT TTT TTG GCG GAT TTG TTC CCC AAA TAT 387
Lys Met Val Leu Val Lys Tyr Phe Leu Ala Asp Leu Phe Pro Lys Tyr
110 115 120

TCC AAA ATG GTG TGG AGC GAT GTG GAT GTC ATC TTT TGC AAT GAA TTT 435
Ser Lys Met Val Trp Ser Asp Val Asp Val Ile Phe Cys Asn Glu Phe
125 130 135

AGC GCT GAT TTC TTA AAC CTT GAA GAA AAT GAT GAG AAT TAT TTT TAT 483
Ser Ala Asp Phe Leu Asn Leu Glu Glu Asn Asp Glu Asn Tyr Phe Tyr
140 145 150

GGA GTT TTA GAA GTT GAA AAG CAC CAC ATG ATG GAA GGG TTT TTG TTT 531
Gly Val Leu Glu Val Glu Lys His His Met Met Glu Gly Phe Leu Phe
155 160 165

TGC AAT TTA GAT TAC CAG CGC AAG AAA AAT TTC ACC TTA AGA ATG CAT 579
Cys Asn Leu Asp Tyr Gln Arg Lys Lys Asn Phe Thr Leu Arg Met His
170 175 180 185

GAG CTT TTA AGG GGG AAT GAG GCT AAA GGG GAG TTG GAT TTC ACG AAA 627
Glu Leu Leu Arg Gly Asn Glu Ala Lys Gly Glu Leu Asp Phe Thr Lys
190 195 200

TGG TGT TGG CCT AAC ATG AAA GCT TTA GGG ATT GAA TAT TGC GTT TTC 675
Trp Cys Trp Pro Asn Met Lys Ala Leu Gly Ile Glu Tyr Cys Val Phe
205 210 215

CCT TAT TAT TAC ACC ATT AAA GAT TTT TCT AAC GCG TAT TTA AAC GAG 723
Pro Tyr Tyr Tyr Thr Ile Lys Asp Phe Ser Asn Ala Tyr Leu Asn Glu
220 225 230

AAT TAC AAG AAA ACC ATT TTA GAG GCA CGA GAA AAC CCT ACC ATT ATC 771
Asn Tyr Lys Lys Thr Ile Leu Glu Ala Arg Glu Asn Pro Thr Ile Ile
235 240 245 
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130 135 140 

Ser Val Asp Lys Asn Ala Pro Leu Glu Lys Val Ala Pro Leu Leu Cys
145 150 155 160

Ala Gly Ile Thr Thr Tyr Ser Pro Leu Lys Phe Ser Lys Val Thr Lys
165 170 175

Gly Thr Lys Val Gly Val Ala Gly Phe Gly Gly Leu Gly Ser Met Ala
180 185 190

Val Lys Tyr Ala Val Ala Met Gly Ala Glu Val Ser Val Phe Ala Arg
195 200 205

Asn Glu His Lys Lys Gln Asp Ala Leu Ser Met Gly Val Lys His Phe
210 215 220

Tyr Thr Asp Pro Lys Gln Cys Lys Glu Glu Leu Asp Phe Ile Ile Ser
225 230 235 240

Thr Ile Pro Thr His Tyr Asp Leu Lys Asp Tyr Leu Lys Leu Leu Thr
245 250 255

Tyr Asn Gly Asp Leu Ala Leu Val Gly Leu Pro Pro Val Glu Ile Ala
260 265 270

Pro Ala Leu Ser Val Phe Asp Phe Ile His Leu Gly Asn Arg Lys Val
275 280 285

Tyr Gly Ser Leu Ile Gly Gly Ile Lys Glu Thr Gln Glu Met Met Asp
290 295 300

Phe Ser Ile Lys His Asn Ile Tyr Pro Glu Ile Asp Leu Ile Leu Gly
305 310 315 320

Lys Asp Ile Asp Thr Ala Tyr His Asn Leu Thr His Gly Lys Ala Lys
325 330 335

Phe Arg Tyr Val Ile Asp Met Lys Lys Ser Phe Asp
340 345 

(2) INFORMATION FOR SEQ ID NO: 987: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25...1317
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987: 

TTAAAAAAGG GTGTTTAATT TTTT ATG ACT TCA GCT TCA AGC CAT TCT TTT 51 

Met Thr Ser Ala Ser Ser His Ser Phe 
1 5 

AAA GAA CAA GAT TTT CAT ATT CCT ATC GCT TTT GCT TTT GAT AAG AAT 99 
Lys Glu Gln Asp Phe His Ile Pro Ile Ala Phe Ala Phe Asp Lys Asn
10 15 20 25 
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GGC GAT CTA GCC CTT GTG GGA CTC CCC CCT GTA GAA ATC GCT CCA GCG 872 
Gly Asp Leu Ala Leu Val Gly Leu Pro Pro Val Glu Ile Ala Pro Ala
260 265 270

CTT AGC GTT TTT GAT TTT ATC CAT TTA GGC AAT CGC AAG GTT TAT GGC 920
Leu Ser Val Phe Asp Phe Ile His Leu Gly Asn Arg Lys Val Tyr Gly
275 280 285 290

TCA TTG ATT GGG GGC ATT AAA GAA ACC CAA GAA ATG ATG GAT TTT TCT 968
Ser Leu Ile Gly Gly Ile Lys Glu Thr Gln Glu Met Met Asp Phe Ser
295 300 305

ATC AAA CAC AAT ATT TAC CCT GAA ATA GAT TTG ATC TTA GGC AAG GAT 1016
Ile Lys His Asn Ile Tyr Pro Glu Ile Asp Leu Ile Leu Gly Lys Asp
310 315 320

ATT GAC ACC GCT TAT CAT AAT CTA ACC CAT GGG AAA GCG AAA TTC CGC 1064
Ile Asp Thr Ala Tyr His Asn Leu Thr His Gly Lys Ala Lys Phe Arg
325 330 335

TAT GTG ATT GAT ATG AAA AAA TCG TTT GAT TAAAAGTTTT GGCTCTAGCT CTT 1117
Tyr Val Ile Asp Met Lys Lys Ser Phe Asp
340 345

TTTTAAGAGC TTGAGTTGG 1136

(2) INFORMATION FOR SEQ ID NO: 986:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 348 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986:

Met Arg Val Gln Ser Lys Gly Phe Ala Ile Phe Ser Lys Asp Gly His
1 5 10 15

Phe Lys Pro His Asp Phe Ser Arg His Ala Val Gly Pro Lys Asp Val
20 25 30

Leu Ile Asp Ile Leu Tyr Ala Gly Ile Cys His Ser Asp Ile His Ser
35 40 45

Ala Tyr Ser Glu Trp Lys Glu Gly Ile Tyr Pro Met Val Pro Gly His
50 55 60

Glu Ile Ala Gly Ala Ile Lys Glu Val Gly Lys Glu Val Lys Lys Phe
65 70 75 80

Lys Val Gly Asp Val Val Gly Val Gly Cys Phe Val Asn Ser Cys Lys
85 90 95

Ala Cys Lys Pro Cys Lys Glu His Gln Glu Gln Phe Cys Ala Lys Val
100 105 110

Val Phe Thr Tyr Asp Cys Leu Asp Tyr Phe His Asp Asn Glu Pro His
115 120 125

Met Gly Gly Tyr Ser Asn Asn Ile Val Val Asp Glu Asn Tyr Val Ile
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GAC ATT CTT TAT GCA GGG ATT TGT CAT AGC GAT ATT CAT AGC GCT TAT 2 00 

Asp Ile Leu Tyr Ala Gly Ile Cys His Ser Asp Ile His Ser Ala Tyr
35 40 45 50

AGC GAA TGG AAA GAA GGC ATT TAC CCT ATG GTT CCT GGG CAT GAA ATT 248
Ser Glu Trp Lys Glu Gly Ile Tyr Pro Met Val Pro Gly His Glu Ile
55 60 65

GCT GGG GCC ATC AAA GAA GTG GGT AAG GAA GTT AAG AAA TTT AAG GTT 296
Ala Gly Ala Ile Lys Glu Val Gly Lys Glu Val Lys Lys Phe Lys Val
70 75 80

GGC GAT GTG GTG GGC GTG GGC TGT TTT GTC AAT TCA TGC AAA GCG TGT 344
Gly Asp Val Val Gly Val Gly Cys Phe Val Asn Ser Cys Lys Ala Cys
85 90 95

AAG CCC TGT AAA GAA CAC CAA GAG CAA TTT TGC GCC AAA GTG GTA TTC 392
Lys Pro Cys Lys Glu His Gln Glu Gln Phe Cys Ala Lys Val Val Phe
100 105 110

ACT TAC GAT TGT TTG GAT TAT TTC CAT GAC AAC GAA CCC CAC ATG GGC 440
Thr Tyr Asp Cys Leu Asp Tyr Phe His Asp Asn Glu Pro His Met Gly
115 120 125 130

GGA TAC TCT AAT AAT ATT GTA GTG GAT GAA AAC TAT GTG ATT AGC GTG 488
Gly Tyr Ser Asn Asn Ile Val Val Asp Glu Asn Tyr Val Ile Ser Val
135 140 145

GAT AAA AAC GCT CCT TTA GAA AAA GTA GCC CCC TTG CTT TGT GCG GGC 536
Asp Lys Asn Ala Pro Leu Glu Lys Val Ala Pro Leu Leu Cys Ala Gly
150 155 160

ATC ACC ACT TAT TCG CCC TTA AAA TTT TCT AAG GTT ACT AAA GGC ACA 584
Ile Thr Thr Tyr Ser Pro Leu Lys Phe Ser Lys Val Thr Lys Gly Thr
165 170 175

AAA GTT GGC GTC GCT GGG TTT GGC GGG CTA GGA AGC ATG GCG GTT AAA 632
Lys Val Gly Val Ala Gly Phe Gly Gly Leu Gly Ser Met Ala Val Lys
180 185 190

TAC GCT GTG GCT ATG GGG GCT GAA GTG AGC GTT TTT GCA AGA AAC GAA 680
Tyr Ala Val Ala Met Gly Ala Glu Val Ser Val Phe Ala Arg Asn Glu
195 200 205 210

CAC AAA AAG CAA GAC GCT TTG AGC ATG GGG GTT AAA CAT TTC TAC ACT 728
His Lys Lys Gln Asp Ala Leu Ser Met Gly Val Lys His Phe Tyr Thr
215 220 225

GAC CCC AAA CAA TGC AAA GAG GAA TTG GAC TTT ATC ATT TCA ACC ATT 776
Asp Pro Lys Gln Cys Lys Glu Glu Leu Asp Phe Ile Ile Ser Thr Ile
230 235 240

CCT ACC CAT TAT GAT TTA AAA GAC TAC CTC AAG CTC TTA ACT TAT AAT 824
Pro Thr His Tyr Asp Leu Lys Asp Tyr Leu Lys Leu Leu Thr Tyr Asn
245 250 255 
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lie Ala Gin Met Gly Arg Ala Ser Gly Leu His Leu lie Val Ala Thr 

675 680 685 

Gln Arg Pro Ser Val Asp Val Val Thr Gly Leu Ile Lys Thr Asn Leu
690 695 700

Pro Ser Arg Val Ser Phe Arg Val Gly Thr Lys Ile Asp Ser Lys Val
705 710 715 720

Ile Leu Asp Thr Asp Gly Ala Gln Ser Leu Leu Gly Arg Gly Asp Met
725 730 735

Leu Phe Thr Pro Pro Gly Ala Asn Gly Leu Val Arg Leu His Ala Pro
740 745 750

Phe Ala Thr Glu Asp Glu Ile Lys Lys Ile Val Asp Phe Ile Lys Ala
755 760 765

Gln Lys Glu Val Gln Tyr Asp Lys Asp Phe Leu Leu Glu Glu Ser Arg
770 775 780

Met Pro Leu Asp Thr Pro Asn Tyr Gln Gly Asp Asp Ile Leu Glu Arg
785 790 795 800

Ala Lys Ala Val Ile Leu Glu Lys Lys Ile Thr Ser Thr Ser Phe Leu
805 810 815

Gln Arg Gln Leu Lys Ile Gly Tyr Asn Gln Ala Ala Thr Ile Thr Asp
820 825 830

Glu Leu Glu Ala Gln Gly Phe Leu Ser Pro Arg Asn Ala Lys Gly Asn
835 840 845

Arg Glu Ile Leu Gln Asn Phe
850 855 

(2) INFORMATION FOR SEQ ID NO: 985: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1136 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1094
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 985:
AACCATAAAA ACGATACAAT AGCGGTATTT TAATAAAACA AGGAGTTTTA ATG AGA 56
Met Arg
1

GTT CAA TCT AAA GGT TTT GCT ATT TTT TCT AAA GAC GGG CAT TTC AAA 104
Val Gln Ser Lys Gly Phe Ala Ile Phe Ser Lys Asp Gly His Phe Lys
5 10 15

CCC CAT GAT TTT AGC CGC CAT GCT GTA GGC CCT AAA GAT GTG TTG ATT 152
Pro His Asp Phe Ser Arg His Ala Val Gly Pro Lys Asp Val Leu Ile
20 25 30
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225 

230 235 240

Pro Pro Thr His Tyr Thr Ile Tyr Pro Lys Arg Asn Arg Phe Asp Asp
245 250 255

Leu Thr Asn Pro Thr Asn Pro Pro Leu Lys Glu Ile Lys Gln Glu Thr
260 265 270

Lys Glu Arg Glu Pro Thr Pro Thr Lys Glu Thr Leu Thr Pro Thr Thr
275 280 285

Pro Lys Pro Ile Met Pro Thr Leu Ala Pro Ile Ile Glu Asn Asp Asn
290 295 300

Lys Thr Glu Asn Gln Lys Thr Pro Asn His Pro Lys Lys Glu Glu Asn
305 310 315 320

Pro Gln Glu Asn Thr Gln Glu Glu Met Ile Glu Gly Arg Ile Glu Glu
325 330 335

Met Ile Lys Glu Asn Leu Lys Lys Glu Glu Lys Glu Val Gln Asn Ala
340 345 350

Pro Asn Phe Ser Pro Val Thr Pro Thr Ser Ala Lys Lys Pro Val Met
355 360 365

Val Lys Glu Leu Ser Glu Asn Lys Glu Ile Leu Asp Gly Leu Asp Tyr
370 375 380

Gly Glu Val Gln Lys Pro Lys Asp Tyr Glu Leu Pro Thr Thr Gln Leu
385 390 395 400

Leu Asn Ala Val Cys Leu Lys Asp Thr Ser Leu Asp Glu Asn Glu Ile
405 410 415

Asp Gln Lys Ile Gln Asp Leu Leu Ser Lys Leu Arg Thr Phe Lys Ile
420 425 430

Asp Gly Asp Ile Ile Arg Thr Tyr Ser Gly Pro Ile Val Thr Thr Phe
435 440 445

Glu Phe Arg Pro Ala Pro Asn Val Lys Val Ser Arg Ile Leu Gly Leu
450 455 460

Ser Asp Asp Leu Ala Met Thr Leu Cys Ala Glu Ser Ile Arg Ile Gln
465 470 475 480

Ala Pro Ile Lys Gly Lys Asp Val Val Gly Ile Glu Ile Pro Asn Ser
485 490 495

Gln Ser Gln Ile Ile Tyr Leu Arg Glu Ile Leu Glu Ser Glu Leu Phe
500 505 510

Gln Lys Ser Ser Ser Pro Leu Thr Leu Ala Leu Gly Lys Asp Ile Val
515 520 525

Gly Asn Pro Phe Ile Thr Asp Leu Lys Lys Leu Pro His Leu Leu Ile
530 535 540

Ala Gly Thr Thr Gly Ser Gly Lys Ser Val Gly Val Asn Ala Met Ile
545 550 555 560

Leu Ser Leu Leu Tyr Lys Asn Pro Pro Asp Gln Leu Lys Leu Val Met
565 570 575

Ile Asp Pro Lys Met Val Glu Phe Ser Ile Tyr Ala Asp Ile Pro His
580 585 590

Leu Leu Thr Pro Ile Ile Thr Asp Pro Lys Lys Ala Ile Gly Ala Leu
595 600 605

Gln Ser Val Ala Lys Glu Met Glu Arg Arg Tyr Ser Leu Met Ser Glu
610 615 620

Tyr Lys Val Lys Thr Ile Asp Ser Tyr Asn Glu Gln Ala Pro Ser Asn
625 630 635 640

Gly Val Glu Ala Phe Pro Tyr Leu Ile Val Val Ile Asp Glu Leu Ala
645 650 655

Asp Leu Met Met Thr Gly Gly Lys Glu Ala Glu Phe Pro Ile Ala Arg
660 665 670
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TCT ACG AGT TTT TTA CAA CGC CAA TTA AAA ATC GGC TAG AAC CAA GCC 24 98 

Ser Thr Ser Phe Leu Gln Arg Gln Leu Lys Ile Gly Tyr Asn Gln Ala
815 820 825

GCT ACC ATT ACT GAC GAA TTA GAA GCT CAA GGC TTT TTA TCC CCA AGA 2546
Ala Thr Ile Thr Asp Glu Leu Glu Ala Gln Gly Phe Leu Ser Pro Arg
830 835 840

AAC GCT AAA GGC AAC AGA GAG ATT TTG CAA AAC TTT TAGGCTTTGT TTTCAT 2598
Asn Ala Lys Gly Asn Arg Glu Ile Leu Gln Asn Phe
845 850 855

TGGATATTGG CAAACATTAT TTTTGATTT 2627

(2) INFORMATION FOR SEQ ID NO:984: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984: 

Met Lys Ser Lys Lys Leu Tyr Leu Ala Leu Ile Ile Gly Val Leu Leu
1 5 10 15

Ala Phe Leu Thr Leu Ser Ser Trp Leu Gly Asn Ser Gly Leu Val Gly
20 25 30

Arg Phe Gly Val Trp Phe Ala Ala Leu Asn Lys Lys Tyr Phe Gly His
35 40 45

Leu Ser Phe Ile Asn Leu Pro Tyr Leu Ala Trp Val Leu Phe Leu Leu
50 55 60

Tyr Lys Thr Lys Asn Pro Phe Thr Glu Ile Val Leu Glu Lys Thr Leu
65 70 75 80

Gly His Leu Leu Gly Ile Leu Ser Leu Leu Phe Leu Gln Ser Ser Leu
85 90 95

Leu Asn Gln Gly Glu Ile Gly Asn Ser Ala Arg Leu Phe Leu Arg Pro
100 105 110

Phe Ile Gly Asp Phe Gly Leu Tyr Ala Leu Ile Thr Leu Met Val Val
115 120 125

Ile Ser Tyr Leu Ile Leu Phe Lys Leu Pro Pro Lys Ser Val Phe Tyr
130 135 140

Pro Tyr Met Asn Lys Thr Gln Asn Leu Leu Lys Glu Ile Tyr Lys Gln
145 150 155 160

Cys Leu Gln Ala Phe Ser Pro Asn Phe Ser Pro Lys Lys Glu Gly Phe
165 170 175

Glu Asn Thr Pro Ser Asp Ile Gln Lys Lys Glu Thr Lys Asn Asp Lys
180 185 190

Glu Lys Glu Asn Arg Lys Glu Asn Pro Ile Asn Glu Asn His Lys Thr
195 200 205

Pro Asn Glu Glu Pro Phe Leu Ala Ile Pro Thr Pro Tyr Asn Thr Thr
210 215 220

Leu Asn Asp Ser Glu Pro Gln Glu Gly Leu Val Gln Ile Ser Ser His
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GCG GAT ATC CCT CAT TTG CTC ACG CCC ATT ATC ACC GAG CCT AAA AAA 182 6 

Ala Asp Ile Pro His Leu Leu Thr Pro Ile Ile Thr Asp Pro Lys Lys
590 595 600

GCT ATT GGG GCT TTG CAA AGC GTG GCT AAA GAA ATG GAA CGC CGG TAT 1874
Ala Ile Gly Ala Leu Gln Ser Val Ala Lys Glu Met Glu Arg Arg Tyr
605 610 615

TCT TTA ATG AGC GAA TAC AAG GTT AAA ACC ATT GAT TCT TAT AAT GAA 1922
Ser Leu Met Ser Glu Tyr Lys Val Lys Thr Ile Asp Ser Tyr Asn Glu
620 625 630 635

CAA GCC CCA AGT AAC GGC GTT GAA GCG TTC CCC TAT TTG ATT GTG GTG 1970
Gln Ala Pro Ser Asn Gly Val Glu Ala Phe Pro Tyr Leu Ile Val Val
640 645 650

ATT GAT GAA TTA GCG GAT TTA ATG ATG ACA GGG GGC AAA GAA GCG GAG 2018
Ile Asp Glu Leu Ala Asp Leu Met Met Thr Gly Gly Lys Glu Ala Glu
655 660 665

TTT CCT ATC GCT AGA ATC GCT CAA ATG GGG CGC GCG AGC GGC TTA CAC 2066
Phe Pro Ile Ala Arg Ile Ala Gln Met Gly Arg Ala Ser Gly Leu His
670 675 680

CTC ATT GTA GCG ACC CAA CGC CCA AGC GTG GAT GTC GTA ACC GGC TTG 2114
Leu Ile Val Ala Thr Gln Arg Pro Ser Val Asp Val Val Thr Gly Leu
685 690 695

ATT AAA ACC AAC TTG CCT TCA AGG GTG AGT TTT AGG GTA GGC ACT AAG 2162
Ile Lys Thr Asn Leu Pro Ser Arg Val Ser Phe Arg Val Gly Thr Lys
700 705 710 715

ATT GAT TCT AAA GTG ATT TTA GAC ACT GAT GGG GCG CAA AGC TTG TTA 2210
Ile Asp Ser Lys Val Ile Leu Asp Thr Asp Gly Ala Gln Ser Leu Leu
720 725 730

GGA AGA GGC GAT ATG CTC TTT ACC CCC CCA GGA GCG AAC GGG TTA GTG 2258
Gly Arg Gly Asp Met Leu Phe Thr Pro Pro Gly Ala Asn Gly Leu Val
735 740 745

CGC TTG CAT GCC CCC TTT GCC ACT GAA GAT GAA ATC AAA AAA ATC GTG 2306
Arg Leu His Ala Pro Phe Ala Thr Glu Asp Glu Ile Lys Lys Ile Val
750 755 760

GAT TTT ATT AAA GCC CAA AAA GAA GTA CAA TAC GAT AAA GAT TTC TTG 2354
Asp Phe Ile Lys Ala Gln Lys Glu Val Gln Tyr Asp Lys Asp Phe Leu
765 770 775

CTA GAA GAA TCA CGC ATG CCT TTA GAC ACC CCT AAT TAT CAA GGC GAT 2402
Leu Glu Glu Ser Arg Met Pro Leu Asp Thr Pro Asn Tyr Gln Gly Asp
780 785 790 795

GAC ATT TTA GAA AGG GCT AAA GCG GTG ATT TTA GAA AAA AAG ATC ACT 2450
Asp Ile Leu Glu Arg Ala Lys Ala Val Ile Leu Glu Lys Lys Ile Thr
800 805 810 
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Lys Lys Pro Val Met Val Lys Glu Leu Ser Glu Asn Lys Glu lie Leu 
365 370 375 

GAC GGA TTG GAT TAT GGC GAA GTG CAA AAA CCC AAA GAT TAT GAG CTT 1202
Asp Gly Leu Asp Tyr Gly Glu Val Gln Lys Pro Lys Asp Tyr Glu Leu
380 385 390 395

CCC ACC ACG CAA TTA TTG AAT GCG GTT TGT TTG AAA GAC ACT TCT TTA 1250
Pro Thr Thr Gln Leu Leu Asn Ala Val Cys Leu Lys Asp Thr Ser Leu
400 405 410

GAC GAA AAC GAG ATT GAC CAA AAA ATC CAG GAT CTA TTG AGC AAA CTG 1298
Asp Glu Asn Glu Ile Asp Gln Lys Ile Gln Asp Leu Leu Ser Lys Leu
415 420 425

CGC ACC TTT AAA ATT GAT GGC GAT ATT ATC CGC ACT TAT TCA GGC CCT 1346
Arg Thr Phe Lys Ile Asp Gly Asp Ile Ile Arg Thr Tyr Ser Gly Pro
430 435 440

ATT GTA ACC ACT TTT GAA TTC CGC CCA GCC CCT AAC GTT AAG GTG AGT 1394
Ile Val Thr Thr Phe Glu Phe Arg Pro Ala Pro Asn Val Lys Val Ser
445 450 455

CGT ATT TTA GGC TTG AGC GAT GAT TTA GCG ATG ACT TTA TGC GCT GAA 1442
Arg Ile Leu Gly Leu Ser Asp Asp Leu Ala Met Thr Leu Cys Ala Glu
460 465 470 475

TCC ATC CGC ATT CAA GCC CCT ATT AAG GGT AAA GAT GTC GTT GGC ATT 1490
Ser Ile Arg Ile Gln Ala Pro Ile Lys Gly Lys Asp Val Val Gly Ile
480 485 490

GAA ATC CCT AAC AGC CAA AGC CAA ATT ATT TAT TTA AGA GAA ATT CTA 1538
Glu Ile Pro Asn Ser Gln Ser Gln Ile Ile Tyr Leu Arg Glu Ile Leu
495 500 505

GAG AGC GAA TTG TTT CAA AAA TCC AGC TCG CCC TTA ACT CTA GCT TTA 1586
Glu Ser Glu Leu Phe Gln Lys Ser Ser Ser Pro Leu Thr Leu Ala Leu
510 515 520

GGC AAA GAC ATT GTG GGT AAC CCT TTC ATC ACG GAT TTA AAA AAG CTC 1634
Gly Lys Asp Ile Val Gly Asn Pro Phe Ile Thr Asp Leu Lys Lys Leu
525 530 535

CCC CAT TTG CTC ATC GCT GGC ACG ACA GGA AGC GGT AAG AGC GTG GGC 1682
Pro His Leu Leu Ile Ala Gly Thr Thr Gly Ser Gly Lys Ser Val Gly
540 545 550 555

GTG AAT GCG ATG ATT TTA TCC TTA CTT TAT AAA AAC CCT CCC GAT CAA 1730
Val Asn Ala Met Ile Leu Ser Leu Leu Tyr Lys Asn Pro Pro Asp Gln
560 565 570

CTC AAA TTA GTG ATG ATC GAT CCC AAA ATG GTA GAA TTT AGT ATT TAT 1778
Leu Lys Leu Val Met Ile Asp Pro Lys Met Val Glu Phe Ser Ile Tyr
575 580 585 
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140 145 150 155 

GAG ATT TAC AAA CAA TGC TTA CAA GCC TTT AGC CCT AAT TTT AGC CCA 530
Glu Ile Tyr Lys Gln Cys Leu Gln Ala Phe Ser Pro Asn Phe Ser Pro
160 165 170

AAA AAA GAG GGT TTT GAA AAC ACC CCA TCA GAT ATT CAA AAA AAA GAA 578
Lys Lys Glu Gly Phe Glu Asn Thr Pro Ser Asp Ile Gln Lys Lys Glu
175 180 185

ACC AAA AAC GAC AAA GAA AAA GAA AAC CGC AAA GAA AAC CCT ATT AAT 626
Thr Lys Asn Asp Lys Glu Lys Glu Asn Arg Lys Glu Asn Pro Ile Asn
190 195 200

GAA AAC CAC AAA ACC CCT AAC GAA GAA CCG TTT TTA GCG ATC CCT ACC 674
Glu Asn His Lys Thr Pro Asn Glu Glu Pro Phe Leu Ala Ile Pro Thr
205 210 215

CCC TAT AAC ACG ACT TTA AAT GAT TCA GAG CCG CAA GAA GGC TTA GTC 722
Pro Tyr Asn Thr Thr Leu Asn Asp Ser Glu Pro Gln Glu Gly Leu Val
220 225 230 235

CAA ATT TCC TCC CAC CCC CCT ACC CAT TAC ACC ATT TAC CCT AAA AGA 770
Gln Ile Ser Ser His Pro Pro Thr His Tyr Thr Ile Tyr Pro Lys Arg
240 245 250

AAC CGA TTT GAT GAT TTG ACT AAC CCC ACT AAC CCC CCT TTA AAA GAA 818
Asn Arg Phe Asp Asp Leu Thr Asn Pro Thr Asn Pro Pro Leu Lys Glu
255 260 265

ATT AAA CAA GAA ACT AAA GAA AGA GAA CCC ACG CCT ACA AAA GAA ACT 866
Ile Lys Gln Glu Thr Lys Glu Arg Glu Pro Thr Pro Thr Lys Glu Thr
270 275 280

CTT ACG CCC ACC ACG CCC AAA CCT ATC ATG CCC ACA CTT GCA CCC ATA 914
Leu Thr Pro Thr Thr Pro Lys Pro Ile Met Pro Thr Leu Ala Pro Ile
285 290 295

ATA GAA AAT GAC AAC AAA ACA GAA AAC CAA AAA ACC CCC AAC CAC CCT 962
Ile Glu Asn Asp Asn Lys Thr Glu Asn Gln Lys Thr Pro Asn His Pro
300 305 310 315

AAA AAA GAA GAA AAC CCA CAA GAA AAC ACG CAA GAA GAA ATG ATA GAA 1010
Lys Lys Glu Glu Asn Pro Gln Glu Asn Thr Gln Glu Glu Met Ile Glu
320 325 330

GGA AGG ATA GAA GAA ATG ATA AAG GAA AAT CTA AAA AAA GAA GAA AAA 1058
Gly Arg Ile Glu Glu Met Ile Lys Glu Asn Leu Lys Lys Glu Glu Lys
335 340 345

GAA GTG CAA AAC GCT CCA AAC TTT AGC CCA GTA ACC CCC ACA AGC GCT 1106
Glu Val Gln Asn Ala Pro Asn Phe Ser Pro Val Thr Pro Thr Ser Ala
350 355 360

AAA AAA CCC GTT ATG GTT AAA GAA TTG AGC GAA AAT AAA GAG ATA TTA 1154
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 18...2582
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:983: 

AAAGACATGT GCAACCG ATG AAA TCT AAA AAA CTT TAT TTG GCT TTA ATC 50
Met Lys Ser Lys Lys Leu Tyr Leu Ala Leu Ile
1 5 10

ATA GGG GTT TTA TTA GCG TTT TTA ACC CTA TCT TCA TGG CTG GGT AAT 98
Ile Gly Val Leu Leu Ala Phe Leu Thr Leu Ser Ser Trp Leu Gly Asn
15 20 25

AGC GGT TTA GTG GGG CGT TTT GGG GTG TGG TTT GCC GCA CTC AAT AAA 146
Ser Gly Leu Val Gly Arg Phe Gly Val Trp Phe Ala Ala Leu Asn Lys
30 35 40

AAA TAT TTT GGG CAT CTT TCA TTC ATT AAT TTA CCC TAT TTA GCA TGG 194
Lys Tyr Phe Gly His Leu Ser Phe Ile Asn Leu Pro Tyr Leu Ala Trp
45 50 55

GTT TTA TTC CTT TTA TAC AAG ACT AAA AAC CCT TTT ACA GAA ATC GTT 242
Val Leu Phe Leu Leu Tyr Lys Thr Lys Asn Pro Phe Thr Glu Ile Val
60 65 70 75

TTA GAA AAA ACT TTA GGG CAT CTA TTA GGC ATT TTA TCT TTG CTC TTT 290
Leu Glu Lys Thr Leu Gly His Leu Leu Gly Ile Leu Ser Leu Leu Phe
80 85 90

TTA CAA TCT AGC CTA TTA AAT CAA GGG GAA ATC GGC AAC AGC GCG CGT 338
Leu Gln Ser Ser Leu Leu Asn Gln Gly Glu Ile Gly Asn Ser Ala Arg
95 100 105

TTG TTT TTA CGC CCT TTT ATA GGG GAT TTT GGG CTT TAT GCG CTG ATA 386
Leu Phe Leu Arg Pro Phe Ile Gly Asp Phe Gly Leu Tyr Ala Leu Ile
110 115 120

ACG CTT ATG GTA GTT ATT TCT TAT TTG ATT CTA TTC AAA CTA CCC CCT 434
Thr Leu Met Val Val Ile Ser Tyr Leu Ile Leu Phe Lys Leu Pro Pro
125 130 135

AAA AGC GTT TTT TAT CCT TAT ATG AAC AAA ACA CAA AAC CTT TTA AAA 482
Lys Ser Val Phe Tyr Pro Tyr Met Asn Lys Thr Gln Asn Leu Leu Lys
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lie Gin Gin Asp lie Leu Arg Ala Lys Glu lie Leu Arg 
270 275 280 



AAATGACTCT CAAAAACCTT AAAAATGGAA AAATTT 906



(2) INFORMATION FOR SEQ ID NO: 982:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 280 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982: 



Met Leu Asn Phe Leu Ser Ile Ser Ser Glu Pro Lys Ile Lys Ser Leu
1 5 10 15

Ala Ile Gly Lys Phe Asp Gly Leu His Leu Gly His Gln Ala Leu Phe
20 25 30

Lys Glu Leu Lys Asp Pro Lys Ala Leu Leu Ile Ile Glu Lys Lys His
35 40 45

Tyr Thr Lys Gly Tyr Leu Thr Pro Leu Lys Tyr Arg Ala Lys Leu Val
50 55 60

Gly Met Pro Leu Phe Phe Val Tyr Leu Glu Glu Ile Ser Gln Leu Asn
65 70 75 80

Ala Leu Asp Phe Leu Asp Leu Leu Lys Lys Lys Phe Pro His Leu Glu
85 90 95

Arg Leu Val Val Gly Tyr Asp Phe Arg Phe Gly His Glu Arg Gln Asn
100 105 110

Asp Ala Leu Phe Leu Lys Glu Arg Phe Glu Lys Thr Ile Ile Val Pro
115 120 125

Glu Val Lys Val Gln Glu Ile Ser Val His Ser Lys Met Ile Lys Leu
130 135 140

Ala Leu Ser His Gly Asp Leu Ser Leu Ala Asn Lys Leu Leu Gly Arg
145 150 155 160

Pro Tyr Glu Val Cys Gly Glu Val Ile Ser Asp Gln Gly Leu Gly His
165 170 175

Lys Glu Leu Ala Pro Thr Leu Asn Ile Lys Thr Lys Asp Phe Ile Leu
180 185 190

Pro Ser Phe Gly Val Tyr Ala Ser Leu Val Lys Ile Lys Asp Pro Ile
195 200 205

Tyr Gln Lys Ser Val Ser Phe Ile Gly Asn Arg Leu Ser Thr Asp Gln
210 215 220

Asn Phe Ala Ile Glu Cys His Val Leu Asp Thr Ile Ile Glu Asn Pro
225 230 235 240

Pro Gln Glu Ile Ala Leu Arg Trp Val Gln Lys Ile Arg Asp Asn Met
245 250 255

Arg Phe Ser Ser Leu Lys Glu Leu Lys Asn Gln Ile Gln Gln Asp Ile
260 265 270

Leu Arg Ala Lys Glu Ile Leu Arg
275 280



(2) INFORMATION FOR SEQ ID NO: 983: 
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45 5 50 55 

CGC GCT AAA CTC GTG GGC ATG CCT TTA TTT TTT GTG TAT TTA GAA GAG 243
Arg Ala Lys Leu Val Gly Met Pro Leu Phe Phe Val Tyr Leu Glu Glu
60 65 70 75

ATT TCA CAA TTA AAC GCC CTA GAT TTT TTA GAT CTT TTA AAA AAG AAA 291
Ile Ser Gln Leu Asn Ala Leu Asp Phe Leu Asp Leu Leu Lys Lys Lys
80 85 90

TTT CCC CAT TTA GAA CGC CTG GTC GTG GGC TAT GAT TTC AGG TTT GGG 339
Phe Pro His Leu Glu Arg Leu Val Val Gly Tyr Asp Phe Arg Phe Gly
95 100 105

CAT GAG AGG CAA AAT GAC GCT TTA TTT TTA AAA GAG CGT TTT GAA AAA 387
His Glu Arg Gln Asn Asp Ala Leu Phe Leu Lys Glu Arg Phe Glu Lys
110 115 120

ACC ATT ATT GTG CCT GAA GTG AAA GTC CAA GAG ATT AGC GTG CAT TCT 435
Thr Ile Ile Val Pro Glu Val Lys Val Gln Glu Ile Ser Val His Ser
125 130 135

AAG ATG ATC AAA CTA GCC CTA AGT CAT GGC GAC TTA TCT TTA GCT AAC 483
Lys Met Ile Lys Leu Ala Leu Ser His Gly Asp Leu Ser Leu Ala Asn
140 145 150 155

AAG CTC TTA GGC AGA CCT TAT GAA GTG TGT GGG GAA GTC ATT AGT GAT 531
Lys Leu Leu Gly Arg Pro Tyr Glu Val Cys Gly Glu Val Ile Ser Asp
160 165 170

CAA GGT TTG GGG CAT AAA GAA TTA GCA CCC ACT TTA AAT ATA AAA ACT 579
Gln Gly Leu Gly His Lys Glu Leu Ala Pro Thr Leu Asn Ile Lys Thr
175 180 185

AAA GAT TTT ATC CTC CCT AGT TTT GGG GTG TAT GCG AGT TTA GTG AAA 627
Lys Asp Phe Ile Leu Pro Ser Phe Gly Val Tyr Ala Ser Leu Val Lys
190 195 200

ATA AAA GAT CCA ATT TAT CAA AAA AGC GTG AGT TTT ATA GGC AAT CGC 675
Ile Lys Asp Pro Ile Tyr Gln Lys Ser Val Ser Phe Ile Gly Asn Arg
205 210 215

TTA AGC ACG GAT CAA AAT TTC GCC ATA GAA TGC CAT GTC CTT GAT ACC 723
Leu Ser Thr Asp Gln Asn Phe Ala Ile Glu Cys His Val Leu Asp Thr
220 225 230 235

ATC ATA GAA AAC CCG CCC CAA GAA ATC GCT TTG CGT TGG GTT CAA AAA 771
Ile Ile Glu Asn Pro Pro Gln Glu Ile Ala Leu Arg Trp Val Gln Lys
240 245 250

ATA CGA GAC AAC ATG CGT TTT TCT TCT TTA AAA GAG CTT AAA AAT CAG 819
Ile Arg Asp Asn Met Arg Phe Ser Ser Leu Lys Glu Leu Lys Asn Gln
255 260 265

ATC CAA CAA GAC ATC TTA AGA GCC AAA GAG ATT TTG AGA TAATTTGTGT TA 870
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65 70 75 80 



Leu Asp Val Gly Ala Ser Lys Gly Gly Phe Ser Gln Val Ala Leu Leu
85 90 95

Lys Gly Ala Lys Arg Val Leu Cys Val Asp Val Gly Lys Met Gln Leu
100 105 110

Asp Glu Ser Leu Lys Gln Asp Lys Arg Ile Glu Cys Tyr Glu Glu Cys
115 120 125

Asp Ile Arg Gly Phe Lys Thr Pro Glu Thr Ile Asp Leu Ala Leu Cys
130 135 140

Asp Val Ser Phe Ile Ser Leu Tyr Tyr Ile Leu Glu Ala Ile Leu Pro
145 150 155 160

Leu Ser Asp Glu Phe Leu Thr Leu Phe Lys Pro Gln Phe Glu Val Gly
165 170 175

Arg Gly Ile Lys Arg Asn Lys Lys Gly Val Val Val Asp Lys Glu Ala
180 185 190

Ile Leu Asn Ala Leu Glu Asn Phe Lys Asn His Leu Lys Thr Lys Asp
195 200 205

Phe Gln Ile Leu Lys Ile Gln Glu Ser Leu Val Lys Gly Lys Asn Gly
210 215 220

Asn Val Glu Phe Phe Ile His Phe Lys Arg Ala
225 230 235

(2) INFORMATION FOR SEQ ID NO: 981: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19...858
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981: 

TGAAAGGGAA AAACGGGA ATG TTG AAT TTT TTA TCC ATT TCA AGC GAG CCT 51
Met Leu Asn Phe Leu Ser Ile Ser Ser Glu Pro
1 5 10

AAA ATT AAA AGC CTA GCT ATC GGT AAA TTT GAC GGC TTG CAT TTA GGG 99
Lys Ile Lys Ser Leu Ala Ile Gly Lys Phe Asp Gly Leu His Leu Gly
15 20 25

CAT CAA GCC CTT TTT AAA GAG TTA AAA GAT CCC AAA GCC CTT TTA ATC 147
His Gln Ala Leu Phe Lys Glu Leu Lys Asp Pro Lys Ala Leu Leu Ile
30 35 40

ATA GAA AAA AAA CAT TAC ACT AAA GGC TAT TTA ACC CCC CTA AAA TAC 195
Ile Glu Lys Lys His Tyr Thr Lys Gly Tyr Leu Thr Pro Leu Lys Tyr
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AAG CGC ATA GAA TGT TAG GAA GAA TGC GAT ATT AGA GGG TTT AAA ACG 43 8 

Lys Arg Ile Glu Cys Tyr Glu Glu Cys Asp Ile Arg Gly Phe Lys Thr
120 125 130 135

CCA GAA ACA ATT GAT TTA GCG CTT TGC GAT GTG AGC TTT ATT TCT TTA 486
Pro Glu Thr Ile Asp Leu Ala Leu Cys Asp Val Ser Phe Ile Ser Leu
140 145 150

TAT TAT ATT TTA GAA GCG ATT TTG CCT TTA AGC GAT GAA TTT TTA ACA 534
Tyr Tyr Ile Leu Glu Ala Ile Leu Pro Leu Ser Asp Glu Phe Leu Thr
155 160 165

CTT TTC AAA CCG CAA TTT GAA GTG GGC AGA GGA ATA AAA CGC AAT AAA 582
Leu Phe Lys Pro Gln Phe Glu Val Gly Arg Gly Ile Lys Arg Asn Lys
170 175 180

AAA GGG GTG GTG GTG GAT AAA GAA GCC ATT TTG AAC GCT TTA GAA AAC 630
Lys Gly Val Val Val Asp Lys Glu Ala Ile Leu Asn Ala Leu Glu Asn
185 190 195

TTT AAA AAC CAT TTA AAA ACA AAG GAT TTT CAA ATC TTA AAG ATC CAA 678
Phe Lys Asn His Leu Lys Thr Lys Asp Phe Gln Ile Leu Lys Ile Gln
200 205 210 215

GAA AGC TTA GTG AAA GGG AAA AAC GGG AAT GTT GAA TTT TTT ATC CAT 726
Glu Ser Leu Val Lys Gly Lys Asn Gly Asn Val Glu Phe Phe Ile His
220 225 230

TTC AAG CGA GCC TAAAATTAAA AGCCTAGCTA TCGGTAAATT TGACGGCTTG CATTT 783
Phe Lys Arg Ala
235

AGGGCA 789

(2) INFORMATION FOR SEQ ID NO:980: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980: 

Met Arg Leu Asp Tyr Ala Leu Phe Ser Gln His Leu Val Asn Ser Arg
1 5 10 15

Glu Lys Ala Lys Ala Leu Val Leu Lys Asn Gln Val Leu Val Asn Lys
20 25 30

Met Val Val Ser Lys Pro Ser Phe Ile Val Lys Glu Asn Asp Lys Ile
35 40 45

Glu Leu Ile Ala Glu Lys Leu Phe Val Ser Arg Ala Gly Glu Lys Leu
50 55 60

Gly Ala Phe Leu Glu Thr His Phe Val Asp Phe Lys Gly Lys Val Val
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130 135 140 

Leu Ser Glu Val Phe Asn Asn Val Gly Ser Asp 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 979:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 789 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34... 738 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979: 

TTTCGCTTAT CAAGTCCCCC TACCTCAATT TTA ATG CGC TTA GAT TAC GCC TTA 54
Met Arg Leu Asp Tyr Ala Leu
1 5

TTC AGT CAG CAT TTA GTA AAT AGC AGA GAA AAA GCT AAA GCG TTG GTT 102
Phe Ser Gln His Leu Val Asn Ser Arg Glu Lys Ala Lys Ala Leu Val
10 15 20

TTA AAA AAT CAG GTT TTA GTC AAT AAA ATG GTG GTT TCC AAA CCC TCT 150
Leu Lys Asn Gln Val Leu Val Asn Lys Met Val Val Ser Lys Pro Ser
25 30 35

TTT ATA GTG AAA GAG AAC GAT AAA ATT GAA CTC ATC GCT GAA AAA CTT 198
Phe Ile Val Lys Glu Asn Asp Lys Ile Glu Leu Ile Ala Glu Lys Leu
40 45 50 55

TTC GTT AGC AGG GCT GGG GAA AAA TTA GGG GCT TTT TTA GAA ACC CAT 246
Phe Val Ser Arg Ala Gly Glu Lys Leu Gly Ala Phe Leu Glu Thr His
60 65 70

TTC GTG GAT TTT AAG GGA AAG GTG GTT TTA GAT GTG GGA GCG AGC AAA 294
Phe Val Asp Phe Lys Gly Lys Val Val Leu Asp Val Gly Ala Ser Lys
75 80 85

GGG GGC TTT AGT CAA GTG GCT CTT TTA AAA GGG GCT AAA AGA GTG CTT 342
Gly Gly Phe Ser Gln Val Ala Leu Leu Lys Gly Ala Lys Arg Val Leu
90 95 100

TGC GTG GAT GTG GGG AAA ATG CAA TTA GAT GAA AGT TTG AAA CAA GAC 390
Cys Val Asp Val Gly Lys Met Gln Leu Asp Glu Ser Leu Lys Gln Asp
105 110 115
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GGT TTG ATT AGA GCG AGT TTG TCC AAT TAG CTT TCA GAG AGO ATA AGA 3 56 

Gly Leu Ile Arg Ala Ser Leu Ser Asn Tyr Leu Ser Glu Ser Ile Arg
75 80 85

GAG GGG CTA ACG ATT GTT GGG TTA GTG GGG GTG GTG ATC TAT CAA AGC 404
Glu Gly Leu Thr Ile Val Gly Leu Val Gly Val Val Ile Tyr Gln Ser
90 95 100

CCT AAA TTA GCG TTA GTG GGG TTA GTC ATC ATG CCG TTA GCT GCT ATT 452
Pro Lys Leu Ala Leu Val Gly Leu Val Ile Met Pro Leu Ala Ala Ile
105 110 115

CCT ATC AGT AAA ATC ATT CGT AAG GTT AAA AAA CTC GCT AAA TCC CAT 500
Pro Ile Ser Lys Ile Ile Arg Lys Val Lys Lys Leu Ala Lys Ser His
120 125 130

CAA GAG AGT AAC GCC AAA ATC ACC GCT CGT TTG AGT GAA GTT TTT AAC 548
Gln Glu Ser Asn Ala Lys Ile Thr Ala Arg Leu Ser Glu Val Phe Asn
135 140 145 150

AAC GTG GGA AGC GAT TAAAATCTCT AATGGCGAAA AATTAGAGCA TAAGGCTTTT G 604
Asn Val Gly Ser Asp
155

(2) INFORMATION FOR SEQ ID NO: 978:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 155 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978:

Met Leu Lys Ile Leu Pro Phe Leu Val Ile Leu Ala Tyr Leu Gly Lys
1 5 10 15

Ser Gly Gly Met Tyr Leu Gly Thr Tyr Phe Thr Asn Phe Ile Gly Leu
20 25 30

Asp Ile Val Lys Lys Ile Arg Asn Thr Met Leu Glu Ser Leu Leu Lys
35 40 45

Met Glu Met Asp Phe Phe Asn Arg Thr Lys Lys Gly Glu Leu Ile Ala
50 55 60

Arg Ile Thr Asn Asp Ile Gly Leu Ile Arg Ala Ser Leu Ser Asn Tyr
65 70 75 80

Leu Ser Glu Ser Ile Arg Glu Gly Leu Thr Ile Val Gly Leu Val Gly
85 90 95

Val Val Ile Tyr Gln Ser Pro Lys Leu Ala Leu Val Gly Leu Val Ile
100 105 110

Met Pro Leu Ala Ala Ile Pro Ile Ser Lys Ile Ile Arg Lys Val Lys
115 120 125

Lys Leu Ala Lys Ser His Gln Glu Ser Asn Ala Lys Ile Thr Ala Arg
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195 200 205 

Val Glu Lys Met Pro His Gly Ile Glu Ser Val Leu Asp Glu Phe Gly
210 215 220

Ala Asn Leu Ser Gly Gly Gln Arg Gln Arg Ile Ala Ile Ala Arg Ala
225 230 235 240

Leu Tyr Lys Asp Val Gln Val Leu Ile Phe Asp Glu Ala Thr Ser Ala
245 250 255

Leu Asp Asn Asn Thr Glu Glu Ser Val Lys Gln Ser Ile Leu Glu Leu
260 265 270

Lys Gln Asn Arg Leu Ile Ile Leu Ile Ser His Asn Pro Ser Thr Leu
275 280 285

Lys Leu Ala Thr Lys His Val Lys Leu Glu His Gly Arg Leu Thr Glu
290 295 300

Cys
305

(2) INFORMATION FOR SEQ ID NO: 977:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 99... 563 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977:

TTTTATTTTC TTCTTTAGTG GTGGCTTTAA GCACGGCTTG GGGGACTTAT TTAGTCAAGC 60

CCACTTTAGA TGAAATTTTT ATCAATAAAG ACACTCAC ATG CTC AAA ATC CTG CCT 116
Met Leu Lys Ile Leu Pro
1 5

TTT TTA GTG ATT TTG GCG TAT TTG GGT AAG AGT GGG GGC ATG TAT TTA 164
Phe Leu Val Ile Leu Ala Tyr Leu Gly Lys Ser Gly Gly Met Tyr Leu
10 15 20

GGC ACT TAT TTC ACC AAC TTC ATT GGG CTT GAT ATT GTC AAA AAA ATA 212
Gly Thr Tyr Phe Thr Asn Phe Ile Gly Leu Asp Ile Val Lys Lys Ile
25 30 35

CGC AAC ACT ATG CTA GAA AGC CTT CTT AAA ATG GAA ATG GAT TTT TTT 260
Arg Asn Thr Met Leu Glu Ser Leu Leu Lys Met Glu Met Asp Phe Phe
40 45 50

AAC AGG ACG AAA AAG GGC GAA TTG ATC GCA AGG ATC ACC AAT GAT ATA 308
Asn Arg Thr Lys Lys Gly Glu Leu Ile Ala Arg Ile Thr Asn Asp Ile
55 60 65 70
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GCC ACT TCC GCT TTA GAC AAT AAC ACA GAA GAG AGC GTT AAA CAA AGC 87 8 

Ala Thr Ser Ala Leu Asp Asn Asn Thr Glu Glu Ser Val Lys Gln Ser
255 260 265

ATT TTA GAA TTG AAA CAA AAC CGC TTG ATC ATT CTT ATT TCG CAC AAC 926
Ile Leu Glu Leu Lys Gln Asn Arg Leu Ile Ile Leu Ile Ser His Asn
270 275 280

CCA AGC ACG CTA AAA TTA GCC ACT AAG CAT GTG AAA TTA GAG CAT GGG 974
Pro Ser Thr Leu Lys Leu Ala Thr Lys His Val Lys Leu Glu His Gly
285 290 295 300

CGT TTG ACA GAA TGC TAAGGGTTTT AAGCGTTGGT GTTGCTTTTA TTTTACTAGG G 1030
Arg Leu Thr Glu Cys
305

TGTC 1034
(2) INFORMATION FOR SEQ ID NO: 976:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 305 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976:

Met Glu Phe Leu Gly Ser Ile Ala Ile Ala Leu Val Ile Tyr Leu Gly
1 5 10 15

Gly Asn Glu Val Ile Arg Gly His Ile Ser Val Gly Ala Phe Phe Ser
20 25 30

Phe Ile Thr Ala Leu Phe Met Leu Tyr Thr Pro Ile Lys Arg Leu Thr
35 40 45

Arg Ile Val Ser Asn Phe Gln Glu Ala Leu Val Ala Ser Asp Arg Ile
50 55 60

His Glu Ile Leu Glu Arg Glu Pro Ala Ile Val Asp Gly Glu Leu Thr
65 70 75 80

Leu Asn Asn Ala Ile His Thr Ile Glu Phe Lys Lys Val Trp Leu Ala
85 90 95

Tyr Thr Leu Asp Asn Gln Glu Arg Tyr Val Leu Asn Asp Ile Ser Leu
100 105 110

Lys Phe Gln Gln Asn Glu Ile Ile Ala Leu Lys Gly Glu Ser Gly Ser
115 120 125

Gly Lys Ser Ser Leu Val Asn Leu Ile Leu Arg Leu Tyr Glu Pro Ser
130 135 140

Lys Gly Glu Ile Phe Ile Asn Asp Gln Lys Ile Glu Ser Ile Thr Gln
145 150 155 160

Lys Ser Leu Arg Glu Lys Ile Ser Val Val Thr Gln Arg Val Phe Ile
165 170 175

Phe Asn Gly Ser Val Ala Glu Asn Val Ala Tyr Gly Leu Glu Ile Asp
180 185 190

Glu Val Lys Ile Lys Glu Cys Leu Lys Lys Ala Gln Ala Leu Asp Phe
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GCG TTT TTT TCT TTC ATT ACG GCC CTT TTT ATG CTC TAT ACG CCG ATT 2 06 

Ala Phe Phe Ser Phe Ile Thr Ala Leu Phe Met Leu Tyr Thr Pro Ile
30 35 40

AAA CGC TTA ACT AGG ATT GTT TCT AAT TTT CAA GAA GCC TTA GTC GCT 254
Lys Arg Leu Thr Arg Ile Val Ser Asn Phe Gln Glu Ala Leu Val Ala
45 50 55 60

AGC GAC AGG ATC CAT GAG ATT TTA GAA AGA GAG CCG GCT ATT GTT GAT 302
Ser Asp Arg Ile His Glu Ile Leu Glu Arg Glu Pro Ala Ile Val Asp
65 70 75

GGG GAA TTG ACG CTA AAT AAC GCC ATA CAC ACC ATA GAA TTT AAA AAG 350
Gly Glu Leu Thr Leu Asn Asn Ala Ile His Thr Ile Glu Phe Lys Lys
80 85 90

GTA TGG CTG GCT TAT ACG CTA GAC AAT CAA GAG CGT TAT GTT TTA AAC 398
Val Trp Leu Ala Tyr Thr Leu Asp Asn Gln Glu Arg Tyr Val Leu Asn
95 100 105

GAT ATT AGT TTG AAG TTC CAA CAA AAT GAA ATC ATC GCT TTA AAG GGC 446
Asp Ile Ser Leu Lys Phe Gln Gln Asn Glu Ile Ile Ala Leu Lys Gly
110 115 120

GAA AGC GGG AGC GGT AAA AGC TCA TTA GTG AAT CTG ATC TTA CGC CTT 494
Glu Ser Gly Ser Gly Lys Ser Ser Leu Val Asn Leu Ile Leu Arg Leu
125 130 135 140

TAT GAG CCA AGC AAA GGC GAA ATT TTC ATC AAC GAT CAA AAA ATA GAG 542
Tyr Glu Pro Ser Lys Gly Glu Ile Phe Ile Asn Asp Gln Lys Ile Glu
145 150 155

AGC ATC ACT CAA AAA TCC TTA AGA GAA AAG ATT AGC GTT GTC ACT CAA 590
Ser Ile Thr Gln Lys Ser Leu Arg Glu Lys Ile Ser Val Val Thr Gln
160 165 170

AGG GTG TTT ATT TTT AAC GGG AGC GTG GCT GAA AAT GTG GCG TAT GGT 638
Arg Val Phe Ile Phe Asn Gly Ser Val Ala Glu Asn Val Ala Tyr Gly
175 180 185

TTA GAA ATT GAT GAG GTA AAA ATC AAA GAA TGC CTA AAA AAA GCT CAA 686
Leu Glu Ile Asp Glu Val Lys Ile Lys Glu Cys Leu Lys Lys Ala Gln
190 195 200

GCC TTA GAT TTT GTT GAA AAA ATG CCT CAT GGG ATA GAG AGC GTT TTA 734
Ala Leu Asp Phe Val Glu Lys Met Pro His Gly Ile Glu Ser Val Leu
205 210 215 220

GAT GAA TTT GGC GCT AAT CTT AGC GGC GGC CAA CGC CAA AGA ATC GCC 782
Asp Glu Phe Gly Ala Asn Leu Ser Gly Gly Gln Arg Gln Arg Ile Ala
225 230 235

ATT GCA AGA GCT TTG TAT AAA GAC GTT CAA GTT TTA ATC TTT GAT GAA 830
Ile Ala Arg Ala Leu Tyr Lys Asp Val Gln Val Leu Ile Phe Asp Glu
240 245 250
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Lys He Phe Lys Lys Ser His Ser Asn Glu Ser Leu Ser Gin Gin Gin 

115 120 125 

Asn Glu Glu Ile Glu Arg Leu Leu Thr Ser Arg Gly Leu Leu Asn Arg
130 135 140

Phe Phe Lys Pro Leu Phe Asn Phe Val Ser Lys Ser Trp His Ile Tyr
145 150 155 160

Pro Ile Gly Phe Leu Phe Gly Leu Gly Phe Asp Thr Ala Ser Glu Ile
165 170 175

Ala Leu Leu Ala Leu Ser Ser Ser Ala Ile Lys Val Ser Met Val Gly
180 185 190

Met Leu Ser Leu Pro Ile Leu Phe Ala Ala Gly Met Ser Leu Phe Asp
195 200 205

Thr Leu Asp Gly Ala Phe Met Leu Lys Ala Tyr Asp Trp Ala Phe Lys
210 215 220

Thr Pro Leu Arg Lys Ile Tyr Tyr Asn Ile Ser Ile Thr Ala Leu Ser
225 230 235 240

Val Phe Ile Ala Leu Phe Ile Gly Leu Ile Glu Leu Phe Gln Val Val
245 250 255

Ser Glu Lys Leu His Leu Lys Phe Glu Asn Arg Leu Leu Arg Ala Leu
260 265 270

Gln Ser Leu Glu Phe Thr Asp Leu Gly Tyr Tyr Leu Val Gly Leu Phe
275 280 285

Val Ile Ala Phe Leu Gly Ser Phe Phe Leu Trp Lys Ile Lys Phe Ser
290 295 300

Lys Leu Glu Ser
305

(2) INFORMATION FOR SEQ ID NO: 975: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 75... 989 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975: 

TTGTGAAAGA AAATGAAGCG TTTTTTAAAA TCGGTATCAA AAACATCGCC GTGGCTGAAA 60

TTTCTTCGCC TTTA ATG GAG TTT TTA GGT TCA ATC GCT ATA GCG CTA GTG 110
Met Glu Phe Leu Gly Ser Ile Ala Ile Ala Leu Val
1 5 10

ATT TAT TTA GGG GGG AAT GAA GTG ATT AGA GGC CAT ATT AGC GTG GGG 158
Ile Tyr Leu Gly Gly Asn Glu Val Ile Arg Gly His Ile Ser Val Gly
15 20 25
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GGG GCG TTC ATG CTC AAG GCG TAT GAG TGG GCG TTC AAA ACC OCT TTA 72 9 

Gly Ala Phe Met Leu Lys Ala Tyr Asp Trp Ala Phe Lys Thr Pro Leu 
215 220 225 

AGA AAA ATC TAT TAC AAT ATC TCT ATC ACG GCC TTA AGC GTG TTT ATC 777
Arg Lys Ile Tyr Tyr Asn Ile Ser Ile Thr Ala Leu Ser Val Phe Ile
230 235 240

GCG CTC TTT ATT GGC TTG ATT GAG CTT TTT CAA GTC GTT AGC GAG AAA 825
Ala Leu Phe Ile Gly Leu Ile Glu Leu Phe Gln Val Val Ser Glu Lys
245 250 255

CTC CAT TTA AAA TTT GAA AAC CGC CTT TTA AGA GCC TTA CAA AGC CTG 873
Leu His Leu Lys Phe Glu Asn Arg Leu Leu Arg Ala Leu Gln Ser Leu
260 265 270 275

GAA TTT ACA GAC TTG GGC TAT TAC TTG GTG GGC TTA TTT GTA ATA GCG 921
Glu Phe Thr Asp Leu Gly Tyr Tyr Leu Val Gly Leu Phe Val Ile Ala
280 285 290

TTT CTA GGA TCG TTC TTT TTA TGG AAA ATC AAA TTT TCT AAA CTA GAG 969
Phe Leu Gly Ser Phe Phe Leu Trp Lys Ile Lys Phe Ser Lys Leu Glu
295 300 305

AGC TGAATTCTAA GCCCTCAAAT TATCGCTTAA TAAATCTTTA AGGCTTTGAT TTG 1025
Ser



(2) INFORMATION FOR SEQ ID NO: 974:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 308 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974: 



Met Ala Asn Asn Ala Ser Phe Tyr Ala Ala Ala Ser Met Ala Tyr Met
1 5 10 15

Leu Gly Ala Lys His Ala Phe Asp Ala Asp His Ile Ala Cys Ile Asp
20 25 30

Asn Thr Ile Arg Lys Leu Thr Gln Gln Gly Lys Asn Ala Tyr Gly Val
35 40 45

Gly Phe Tyr Phe Ser Met Gly His Ser Ser Val Val Ile Leu Met Thr
50 55 60

Ile Ile Ser Ala Phe Ala Ile Ala Trp Ala Lys Glu His Thr Pro Met
65 70 75 80

Leu Glu Glu Ile Gly Gly Val Val Gly Thr Leu Val Ser Gly Leu Phe
85 90 95

Leu Leu Ile Ile Gly Leu Leu Asn Ala Ile Ile Leu Leu Asp Leu Leu
100 105 110
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Met Ala Asn 
1 

AAC GCT TCG TTT TAT GCG GCG GCG TCT ATG GCC TAC ATG CTA GGG GCA 105
Asn Ala Ser Phe Tyr Ala Ala Ala Ser Met Ala Tyr Met Leu Gly Ala
5 10 15

AAG CAT GCG TTT GAT GCG GAT CAC ATC GCT TGC ATA GAT AAC ACC ATT 153
Lys His Ala Phe Asp Ala Asp His Ile Ala Cys Ile Asp Asn Thr Ile
20 25 30 35

AGA AAG CTC ACC CAA CAA GGC AAA AAC GCC TAT GGT GTG GGG TTT TAC 201
Arg Lys Leu Thr Gln Gln Gly Lys Asn Ala Tyr Gly Val Gly Phe Tyr
40 45 50

TTT TCT ATG GGG CAT TCA AGC GTG GTG ATT TTA ATG ACC ATC ATC AGC 249
Phe Ser Met Gly His Ser Ser Val Val Ile Leu Met Thr Ile Ile Ser
55 60 65

GCG TTT GCG ATC GCT TGG GCT AAA GAA CAC ACG CCG ATG CTA GAA GAA 297
Ala Phe Ala Ile Ala Trp Ala Lys Glu His Thr Pro Met Leu Glu Glu
70 75 80

ATA GGG GGG GTA GTG GGG ACT TTA GTT TCT GGG CTT TTT TTG CTC ATT 345
Ile Gly Gly Val Val Gly Thr Leu Val Ser Gly Leu Phe Leu Leu Ile
85 90 95

ATA GGG CTA TTG AAT GCG ATT ATT CTC TTG GAT TTA TTA AAA ATA TTC 393
Ile Gly Leu Leu Asn Ala Ile Ile Leu Leu Asp Leu Leu Lys Ile Phe
100 105 110 115

AAA AAA TCG CAC TCT AAT GAA AGC CTA AGC CAG CAA CAA AAT GAA GAG 441
Lys Lys Ser His Ser Asn Glu Ser Leu Ser Gln Gln Gln Asn Glu Glu
120 125 130

ATC GAG CGG CTC TTA ACG AGT AGG GGC TTG CTC AAC CGC TTT TTT AAA 489
Ile Glu Arg Leu Leu Thr Ser Arg Gly Leu Leu Asn Arg Phe Phe Lys
135 140 145

CCC TTG TTT AAT TTT GTC TCC AAG TCG TGG CAT ATT TAT CCT ATC GGT 537
Pro Leu Phe Asn Phe Val Ser Lys Ser Trp His Ile Tyr Pro Ile Gly
150 155 160

TTT CTT TTT GGG CTG GGT TTT GAT ACC GCT AGT GAA ATC GCG CTT TTG 585
Phe Leu Phe Gly Leu Gly Phe Asp Thr Ala Ser Glu Ile Ala Leu Leu
165 170 175

GCC CTC TCT AGC AGC GCG ATT AAA GTG AGT ATG GTG GGC ATG CTC TCT 633
Ala Leu Ser Ser Ser Ala Ile Lys Val Ser Met Val Gly Met Leu Ser
180 185 190 195

TTA CCC ATT CTT TTT GCC GCT GGC ATG AGT TTG TTT GAC ACT TTA GAT 681
Leu Pro Ile Leu Phe Ala Ala Gly Met Ser Leu Phe Asp Thr Leu Asp
200 205 210
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(2) INFORMATION FOR SEQ ID NO: 972: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972: 



Met Asp Ile Leu Lys Thr Leu Gln Lys His Leu Gly Asp Val Glu Thr
1 5 10 15

Ser Asp Phe Thr Thr Asn Ala Ile Glu Lys Ser Gln Gln Ile Ala Lys
20 25 30

Phe Ser Arg Asp Met Lys Asn Ile Asn Glu Ser Val Gly Ala Leu Gln
35 40 45

Val Leu Gln Ile Ala Cys Lys Lys Leu Phe Asn Lys Ser Met Gly Leu
50 55 60

Glu Asp Lys Asp Ala Leu Gln Ala Ser Ile Ile Lys Gln Glu Leu Arg
65 70 75 80

Glu Ile Val Glu Asn Cys Gln Phe Leu Ala Ser Pro Leu Phe Asp Thr
85 90 95

Gln Leu Asn Ile Ala Ile Asn Asp Glu Ile Phe Ser Met Ile Val Val
100 105 110

Asn Pro Leu Asp Leu Leu Glu Asn Val Gly Glu Phe Gln Ala Tyr Leu
115 120 125

Glu Glu Lys Leu Asn Glu Ile Lys Glu Leu Leu Gly Tyr Leu Ser Glu
130 135 140

Ser Leu Ser Asn Pro Lys Ala Phe Met Pro Ser Phe Ser Asn Gln Ser
145 150 155 160

Leu Lys Asp Leu Leu Ser Asp Asn Leu Arg Ala
165 170



(2) INFORMATION FOR SEQ ID NO: 973: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 972
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973:

TTTTTAGCGA TTGTGTTCTT GCATGCATTG GGTTTAGCGT TGCTCTTT ATG GCC AAT 57
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(B) LOCATION: 25... 537 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971: 

TCAATTCTAT TTAAAAGGTT TTTT ATG GAT ATT TTA AAA ACT CTT CAA AAA 51 

Met Asp Ile Leu Lys Thr Leu Gln Lys
1 5

CAT TTG GGC GAT GTT GAA ACA AGC GAT TTT ACA ACC AAT GCG ATA GAA 99
His Leu Gly Asp Val Glu Thr Ser Asp Phe Thr Thr Asn Ala Ile Glu
10 15 20 25

AAA TCC CAA CAA ATC GCT AAA TTC AGT AGG GAC ATG AAA AAT ATA AAC 147
Lys Ser Gln Gln Ile Ala Lys Phe Ser Arg Asp Met Lys Asn Ile Asn
30 35 40

GAG AGC GTT GGA GCG TTA CAA GTC TTG CAA ATC GCT TGC AAA AAG CTT 195
Glu Ser Val Gly Ala Leu Gln Val Leu Gln Ile Ala Cys Lys Lys Leu
45 50 55

TTC AAT AAG AGC ATG GGT TTA GAA GAT AAA GAC GCT TTG CAA GCT TCT 243
Phe Asn Lys Ser Met Gly Leu Glu Asp Lys Asp Ala Leu Gln Ala Ser
60 65 70

ATC ATC AAA CAG GAA TTG CGA GAA ATT GTA GAA AAT TGC CAG TTT TTA 291
Ile Ile Lys Gln Glu Leu Arg Glu Ile Val Glu Asn Cys Gln Phe Leu
75 80 85

GCC TCC CCT TTG TTT GAC ACT CAG CTC AAC ATT GCA ATC AAT GAT GAA 339
Ala Ser Pro Leu Phe Asp Thr Gln Leu Asn Ile Ala Ile Asn Asp Glu
90 95 100 105

ATT TTT TCC ATG ATT GTG GTT AAT CCG TTG GAT TTA TTG GAA AAT GTG 387
Ile Phe Ser Met Ile Val Val Asn Pro Leu Asp Leu Leu Glu Asn Val
110 115 120

GGC GAG TTT CAA GCT TAT TTG GAA GAA AAA TTA AAC GAA ATT AAG GAA 435
Gly Glu Phe Gln Ala Tyr Leu Glu Glu Lys Leu Asn Glu Ile Lys Glu
125 130 135

TTA TTA GGT TAT TTG AGT GAA AGC CTT TCA AAC CCT AAA GCC TTC ATG 483
Leu Leu Gly Tyr Leu Ser Glu Ser Leu Ser Asn Pro Lys Ala Phe Met
140 145 150

CCA AGT TTT TCA AAT CAA AGC CTT AAA GAT TTA TTA AGC GAT AAT TTG 531
Pro Ser Phe Ser Asn Gln Ser Leu Lys Asp Leu Leu Ser Asp Asn Leu
155 160 165

AGG GCT TAGAATTCAG CTCTCTAGTT TAGAAAATTT GATTTTCC 575
Arg Ala
170
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Phe Lys Asp Ala Lys Ser Leu Glu Lys Ala Arg Leu Val Asn Thr lie 

20 25 30 

Val Phe Asp Lys Thr Gly Thr Leu Thr Asn Gly Lys Pro Val Val Lys
35 40 45

Ser Val His Ser Lys Ile Glu Leu Leu Glu Leu Leu Ser Leu Ala Leu
50 55 60

Ser Ile Glu Lys Ser Ser Glu His Val Ile Ala Lys Gly Ile Val Glu
65 70 75 80

Tyr Ala Lys Glu His Asn Ala Pro Leu Lys Glu Met Ser Gly Val Lys
85 90 95

Val Lys Thr Gly Phe Gly Ile Ser Ala Lys Thr Asp Tyr Gln Gly Thr
100 105 110

Lys Glu Ile Ile Lys Val Gly Asn Ser Glu Phe Phe Asn Pro Ile Asn
115 120 125

Thr Leu Glu Ile Lys Glu Asn Gly Ile Leu Val Phe Val Gly Arg Ala
130 135 140

Ile Ser Glu Lys Glu Asp Glu Leu Leu Gly Ala Phe Val Leu Glu Asp
145 150 155 160

Leu Pro Lys Lys Gly Val Lys Glu His Ile Ala Gln Ile Lys Asn Leu
165 170 175

Gly Ile Asn Thr Phe Leu Leu Ser Gly Asp Asn Arg Glu Asn Val Gln
180 185 190

Lys Cys Ala Phe Glu Leu Gly Ile Asp Gly Tyr Ile Ser Asn Ala Lys
195 200 205

Pro Gln Asp Lys Leu Asn Lys Ile Lys Glu Leu Lys Glu Lys Gly Gln
210 215 220

Ile Val Met Met Val Gly Asp Gly Leu Asn Asp Ala Pro Ser Leu Ala
225 230 235 240

Met Ser Asp Val Ala Val Val Met Ala Lys Gly Ser Asp Val Ser Val
245 250 255

Gln Ala Ala Asp Ile Val Ser Phe Asn Asn Asp Ile Lys Ser Val Tyr
260 265 270

Ser Ala Ile Lys Leu Ser Gln Ala Thr Ile Lys Asn Ile Lys Glu Asn
275 280 285

Leu Phe Trp Ala Phe Cys Tyr Asn Ser Val Phe Ile Pro Leu Ala Cys
290 295 300

Gly Val Leu Tyr Lys Ala Asn Leu Met Leu Ser Pro Ala Ile Ala Gly
305 310 315 320

Leu Ala Met Ser Leu Ser Ser Val Ser Val Val Leu Asn Ser Gln Arg
325 330 335

Leu Arg Asn Phe Lys Ile Lys Asp His
340 345

(2) INFORMATION FOR SEQ ID NO: 971:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 575 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

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195 200 205 

CCA CAA GAC AAG CTC AAT AAG ATC AAA GAG CTT AAG GAA AAA GGG CAG 730
Pro Gln Asp Lys Leu Asn Lys Ile Lys Glu Leu Lys Glu Lys Gly Gln
210 215 220

ATC GTT ATG ATG GTG GGC GAT GGC TTG AAT GAC GCT CCT AGT CTT GCT 778
Ile Val Met Met Val Gly Asp Gly Leu Asn Asp Ala Pro Ser Leu Ala
225 230 235 240

ATG AGC GAT GTG GCG GTG GTG ATG GCT AAA GGG AGC GAT GTG AGC GTG 826
Met Ser Asp Val Ala Val Val Met Ala Lys Gly Ser Asp Val Ser Val
245 250 255

CAA GCA GCG GAC ATT GTG AGT TTT AAT AAC GAT ATT AAA TCG GTT TAT 874
Gln Ala Ala Asp Ile Val Ser Phe Asn Asn Asp Ile Lys Ser Val Tyr
260 265 270

AGC GCG ATT AAA TTA AGC CAG GCG ACA ATT AAA AAT ATC AAA GAA AAT 922
Ser Ala Ile Lys Leu Ser Gln Ala Thr Ile Lys Asn Ile Lys Glu Asn
275 280 285

TTG TTT TGG GCT TTT TGT TAT AAT AGC GTG TTC ATC CCT TTA GCT TGT 970
Leu Phe Trp Ala Phe Cys Tyr Asn Ser Val Phe Ile Pro Leu Ala Cys
290 295 300

GGG GTT CTT TAT AAG GCT AAT CTC ATG TTA AGC CCG GCG ATA GCG GGT 1018
Gly Val Leu Tyr Lys Ala Asn Leu Met Leu Ser Pro Ala Ile Ala Gly
305 310 315 320

TTA GCG ATG AGT TTA AGC TCT GTG AGT GTG GTC TTA AAC TCC CAA AGG 1066
Leu Ala Met Ser Leu Ser Ser Val Ser Val Val Leu Asn Ser Gln Arg
325 330 335

CTA AGG AAT TTT AAA ATT AAG GAT CAT TGAATGAAAG CAACTTTTCA AGTGCCAAG 1122
Leu Arg Asn Phe Lys Ile Lys Asp His
340 345

CATTACTTGC AACCA 1137

(2) INFORMATION FOR SEQ ID NO: 970: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970: 

Met Ser Ile Leu Val Ala Asn Gln Lys Ala Ser Ser Leu Gly Leu Phe
1 5 10 15



WO 98/43478

PCT/US98/06371



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969: 
AAGTGTTTGT ATCGGTTTTA GTGATTTCTT GCCCTTGCGC TTTAGGATTG CTACGCCT 58

ATG AGC ATT TTA GTA GCG AAC CAG AAA GCG AGT TCT TTA GGG TTA TTT 106
Met Ser Ile Leu Val Ala Asn Gln Lys Ala Ser Ser Leu Gly Leu Phe
1 5 10 15

TTT AAA GAC GCT AAA AGT TTA GAA AAA GCA AGG CTA GTC AAT ACG ATC 154
Phe Lys Asp Ala Lys Ser Leu Glu Lys Ala Arg Leu Val Asn Thr Ile
20 25 30

GTT TTT GAT AAA ACC GGC ACG CTC ACT AAC GGC AAG CCT GTC GTT AAA 202
Val Phe Asp Lys Thr Gly Thr Leu Thr Asn Gly Lys Pro Val Val Lys
35 40 45

AGC GTT CAT TCT AAG ATA GAA TTA TTA GAG TTA TTG AGT TTA GCG CTC 250
Ser Val His Ser Lys Ile Glu Leu Leu Glu Leu Leu Ser Leu Ala Leu
50 55 60

AGT ATT GAA AAG AGT AGC GAA CAT GTC ATC GCT AAA GGG ATT GTA GAA 298
Ser Ile Glu Lys Ser Ser Glu His Val Ile Ala Lys Gly Ile Val Glu
65 70 75 80

TAC GCA AAA GAG CAT AAC GCT CCC TTA AAA GAA ATG AGC GGG GTT AAA 346
Tyr Ala Lys Glu His Asn Ala Pro Leu Lys Glu Met Ser Gly Val Lys
85 90 95

GTG AAA ACG GGT TTT GGC ATT AGT GCT AAA ACA GAT TAT CAA GGC ACT 394
Val Lys Thr Gly Phe Gly Ile Ser Ala Lys Thr Asp Tyr Gln Gly Thr
100 105 110

AAA GAG ATT ATT AAA GTA GGC AAC AGC GAG TTT TTT AAC CCT ATT AAC 442
Lys Glu Ile Ile Lys Val Gly Asn Ser Glu Phe Phe Asn Pro Ile Asn
115 120 125

ACG CTA GAA ATT AAA GAA AAC GGG ATT TTA GTG TTT GTT GGT AGA GCG 490
Thr Leu Glu Ile Lys Glu Asn Gly Ile Leu Val Phe Val Gly Arg Ala
130 135 140

ATC AGT GAA AAA GAA GAC GAG CTT TTA GGG GCG TTT GTT TTA GAA GAT 538
Ile Ser Glu Lys Glu Asp Glu Leu Leu Gly Ala Phe Val Leu Glu Asp
145 150 155 160

TTG CCC AAA AAA GGC GTG AAA GAG CAT ATC GCT CAA ATC AAA AAT TTA 586
Leu Pro Lys Lys Gly Val Lys Glu His Ile Ala Gln Ile Lys Asn Leu
165 170 175

GGC ATT AAC ACC TTT CTT TTA AGC GGA GAC AAT AGG GAG AAT GTC CAA 634
Gly Ile Asn Thr Phe Leu Leu Ser Gly Asp Asn Arg Glu Asn Val Gln
180 185 190

AAA TGC GCG TTT GAA TTA GGG ATT GAT GGT TAT ATC AGC AAC GCT AAA 682
Lys Cys Ala Phe Glu Leu Gly Ile Asp Gly Tyr Ile Ser Asn Ala Lys
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968: 



Met Lys Pro Thr Asn Glu Pro Lys Lys Pro Phe Phe Gln Ser Pro Ile
1 5 10 15

Ile Leu Ala Val Leu Gly Gly Ile Leu Leu Ile Phe Phe Leu Arg Ser
20 25 30

Phe Asn Ser Asp Gly Ser Phe Ser Asp Asn Phe Leu Ala Ser Ser Thr
35 40 45

Lys Asn Val Ser Tyr His Glu Ile Lys Gln Leu Ile Ser Asn Asn Glu
50 55 60

Val Glu Asn Val Ser Ile Gly Gln Thr Leu Ile Lys Ala Ser His Lys
65 70 75 80

Glu Gly Asn Asn Arg Val Ile Tyr Ile Ala Lys Arg Val Pro Asp Leu
85 90 95

Thr Leu Val Pro Leu Leu Asp Glu Lys Lys Ile Asn Tyr Ser Gly Phe
100 105 110

Ser Glu Ser Asn Phe Phe Thr Asp Met Leu Gly Trp Leu Met Pro Ile
115 120 125

Leu Val Ile Leu Gly Leu Trp Met Phe Met Ala Asn Arg Met Gln Lys
130 135 140

Asn Met Gly Gly Gly Ile Phe Gly Met Gly Ser Ala Lys Lys Leu Ile
145 150 155 160

Asn Ala Glu Lys Pro Asn Val Arg Phe Asn Asp Met Ala Gly Asn Glu
165 170 175

Glu Ala Lys Glu Glu Val Val Glu Ile Val Asp Phe Leu Lys Tyr Pro
180 185 190

Glu Arg Tyr Ala Asn Leu Gly Ala Lys Ile Pro Lys Gly Val Leu Leu
195 200 205

Val Gly Pro Pro Gly Thr Gly Lys Thr Leu Leu Ala Lys Ala Val Ala
210 215 220

Gly Glu Arg Met Cys Arg Phe Ser Leu Trp Glu Gly Ala Val Ser Leu
225 230 235 240

Lys Cys Leu Trp Ala
245



(2) INFORMATION FOR SEQ ID NO: 969: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1137 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 59... 1093
(D) OTHER INFORMATION: 
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Lys Gin Leu lie Ser Asn Asn Glu Val Glu Asn Val Ser lie Gly Gin 
60 65 70 

ACT TTG ATC AAA GCC AGC CAT AAA GAG GGC AAC AAT CGT GTG ATC TAT 294
Thr Leu Ile Lys Ala Ser His Lys Glu Gly Asn Asn Arg Val Ile Tyr
75 80 85

ATC GCT AAA CGG GTG CCT GAT TTG ACC TTA GTG CCT TTG TTA GAC GAG 342
Ile Ala Lys Arg Val Pro Asp Leu Thr Leu Val Pro Leu Leu Asp Glu
90 95 100

AAA AAA ATC AAT TAT TCT GGT TTT AGC GAG TCT AAC TTT TTT ACG GAC 390
Lys Lys Ile Asn Tyr Ser Gly Phe Ser Glu Ser Asn Phe Phe Thr Asp
105 110 115 120

ATG CTA GGG TGG CTC ATG CCT ATT TTA GTG ATT TTA GGG CTA TGG ATG 438
Met Leu Gly Trp Leu Met Pro Ile Leu Val Ile Leu Gly Leu Trp Met
125 130 135

TTT ATG GCG AAC CGC ATG CAA AAA AAT ATG GGT GGG GGT ATT TTT GGC 486
Phe Met Ala Asn Arg Met Gln Lys Asn Met Gly Gly Gly Ile Phe Gly
140 145 150

ATG GGG AGC GCG AAA AAA CTC ATT AAC GCT GAA AAA CCC AAT GTG CGT 534
Met Gly Ser Ala Lys Lys Leu Ile Asn Ala Glu Lys Pro Asn Val Arg
155 160 165

TTT AAT GAC ATG GCA GGC AAT GAA GAA GCC AAA GAA GAA GTG GTA GAA 582
Phe Asn Asp Met Ala Gly Asn Glu Glu Ala Lys Glu Glu Val Val Glu
170 175 180

ATC GTA GAT TTC TTA AAA TAC CCT GAA CGA TAC GCC AAT TTA GGG GCT 630
Ile Val Asp Phe Leu Lys Tyr Pro Glu Arg Tyr Ala Asn Leu Gly Ala
185 190 195 200

AAA ATC CCT AAA GGC GTG TTA TTA GTA GGG CCT CCA GGA ACC GGT AAA 678
Lys Ile Pro Lys Gly Val Leu Leu Val Gly Pro Pro Gly Thr Gly Lys
205 210 215

ACC CTT TTA GCC AAA GCG GTA GCC GGC GAA CGC ATG TGC CGT TTT TCT 726
Thr Leu Leu Ala Lys Ala Val Ala Gly Glu Arg Met Cys Arg Phe Ser
220 225 230

CTA TGG GAG GGA GCA GTT TCA TTG AAA TGT TTG TGG GCT TAGGGGCAAG CA 777
Leu Trp Glu Gly Ala Val Ser Leu Lys Cys Leu Trp Ala
235 240 245

GGGTTAGGGA TTTATTTGAA ACCGCTAAAA AACAAGC 814

(2) INFORMATION FOR SEQ ID NO:968: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966: 



Met Leu Tyr Ala Ser Lys Thr Ser Leu Phe Leu Gln Ile Lys Gly Lys
1 5 10 15

Phe Met Leu Arg Ile Leu Ile Pro Leu Leu Ile Ile Val Trp Val Leu
20 25 30

Trp Arg Leu Phe Leu Arg Gln Lys Pro Pro Lys Asp Asn His Ser Tyr
35 40 45

Thr Gln Gln Thr Pro Lys Glu Leu Glu Asp His Met Ile Val Cys Ser
50 55 60

Lys Cys Gln Thr Tyr Val Ser Ser Lys Asp Ala Ile Tyr Ser Gly Ala
65 70 75 80

Val Ala Tyr Cys Ser Glu Thr Cys Leu Lys Asp Lys Arg
85 90



(2) INFORMATION FOR SEQ ID NO: 967: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31... 765 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967: 

TAAAAAACAA CCAATAAATT AAGGATTATA ATG AAA CCA ACG AAC GAA CCT AAA 54 

Met Lys Pro Thr Asn Glu Pro Lys 
1 5 

AAA CCT TTT TTT CAA AGT CCC ATT ATC CTT GCG GTT CTT GGA GGG ATT 102 
Lys Pro Phe Phe Gln Ser Pro Ile Ile Leu Ala Val Leu Gly Gly Ile
10 15 20

TTA CTC ATC TTT TTT CTA CGC TCT TTC AAT TCT GAT GGC AGT TTT TCG 150
Leu Leu Ile Phe Phe Leu Arg Ser Phe Asn Ser Asp Gly Ser Phe Ser
25 30 35 40

GAC AAT TTC TTA GCT TCT AGC ACT AAA AAT GTG AGC TAC CAT GAA ATC 198
Asp Asn Phe Leu Ala Ser Ser Thr Lys Asn Val Ser Tyr His Glu Ile
45 50 55

AAA CAG CTC ATC AGC AAT AAT GAA GTG GAA AAT GTG AGT ATC GGT CAA 246
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(2) INFORMATION FOR SEQ ID NO: 965: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 359 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 46... 324
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965: 

TAAAGGCGAG CAGTTAAT^G ATGAAATCGC TTGTAAAGAC ACTGA ATG CTT TAT GCA 57 

Met Leu Tyr Ala 
1 

TCA AAA ACG AGT TTA TTT TTA CAA ATC AAA GGA AAG TTT ATG TTA AGA 105 
Ser Lys Thr Ser Leu Phe Leu Gln Ile Lys Gly Lys Phe Met Leu Arg
5 10 15 20

ATT TTA ATC CCC TTG CTC ATT ATT GTG TGG GTT TTA TGG CGT TTG TTT 153
Ile Leu Ile Pro Leu Leu Ile Ile Val Trp Val Leu Trp Arg Leu Phe
25 30 35

TTG AGG CAA AAA CCC CCT AAA GAC AAC CAC TCT TAC ACG CAA CAA ACC 201
Leu Arg Gln Lys Pro Pro Lys Asp Asn His Ser Tyr Thr Gln Gln Thr
40 45 50

CCT AAA GAA TTA GAA GAT CAC ATG ATT GTA TGC TCT AAA TGC CAA ACC 249
Pro Lys Glu Leu Glu Asp His Met Ile Val Cys Ser Lys Cys Gln Thr
55 60 65

TAT GTC TCT AGC AAA GAC GCT ATT TAT AGC GGG GCG GTG GCG TAT TGC 297
Tyr Val Ser Ser Lys Asp Ala Ile Tyr Ser Gly Ala Val Ala Tyr Cys
70 75 80

AGT GAA ACC TGT TTG AAG GAT AAG AGG TAAATATGCT TATTTTAGGA CACCCTT 351
Ser Glu Thr Cys Leu Lys Asp Lys Arg
85 90

TAATCCCT 359

(2) INFORMATION FOR SEQ ID NO: 966: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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GAA GAC GCT AAA GTG ACT GCA GAA AAT GAG ATT AAA AGC ATT CAG GAT 34 3 

Glu Asp Ala Lys Val Thr Ala Glu Asn Glu Ile Lys Ser Ile Gln Asp
90 95 100

TTG ATG CAA GAT TAC CAA AAA AGC TTA GAA ACC AAC ACA ATC CCT AAC 391
Leu Met Gln Asp Tyr Gln Lys Ser Leu Glu Thr Asn Thr Ile Pro Asn
105 110 115

CAT TTA AAC GAA GAA GTT TCC AAT GAA GAA GCC TTA AAC AAA GAA GTT 439
His Leu Asn Glu Glu Val Ser Asn Glu Glu Ala Leu Asn Lys Glu Val
120 125 130

TCA AGC GAT GAA TCC CCT AAA GAA GTC CAA TTA GCA ACC GAT AAC AAC 487
Ser Ser Asp Glu Ser Pro Lys Glu Val Gln Leu Ala Thr Asp Asn Asn
135 140 145

ACC AAA GAA CAC GAC AAA GAA AAA GAG AAT GTT TGAAGATTTA AAACCGCATT 540
Thr Lys Glu His Asp Lys Glu Lys Glu Asn Val
150 155 160

TACAGGAATT AAGAAAGCGT TTGATGGT 568

(2) INFORMATION FOR SEQ ID NO: 964:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964:

Met Phe Gly Met Gly Phe Phe Glu Ile Leu Val Val Leu Val Val Ala
1 5 10 15

Ile Ile Phe Leu Gly Pro Glu Lys Phe Pro Gln Ala Val Val Asp Val
20 25 30

Val Lys Phe Phe Arg Ala Val Lys Lys Thr Leu Asn Asp Ala Lys Asp
35 40 45

Thr Leu Asp Lys Glu Ile Asn Ile Glu Glu Ile Lys Lys Glu Thr Leu
50 55 60

Glu Tyr Gln Lys Leu Phe Glu Asn Lys Val Glu Ser Leu Lys Gly Val
65 70 75 80

Lys Ile Glu Glu Leu Glu Asp Ala Lys Val Thr Ala Glu Asn Glu Ile
85 90 95

Lys Ser Ile Gln Asp Leu Met Gln Asp Tyr Gln Lys Ser Leu Glu Thr
100 105 110

Asn Thr Ile Pro Asn His Leu Asn Glu Glu Val Ser Asn Glu Glu Ala
115 120 125

Leu Asn Lys Glu Val Ser Ser Asp Glu Ser Pro Lys Glu Val Gln Leu
130 135 140

Ala Thr Asp Asn Asn Thr Lys Glu His Asp Lys Glu Lys Glu Asn Val
145 150 155 160
180 185 190

Ile Ala Gln Lys Ile Thr Gln Lys Ile Lys Ile Pro Thr Ile Gly Ile
195 200 205

Gly Ser Gly Lys Asp Cys Asp Gly Gln Ile Leu Val Trp Ser Asp Met
210 215 220

Leu Gly Phe Phe Asp Ser Phe Lys Pro Lys Phe Val Arg Glu Tyr Leu
225 230 235 240

Lys Gly Lys Glu Leu Ile Gln Asn Ala Ile Lys Gln Tyr Ala Asp Asp
245 250 255

Val Lys Lys Gly Asn Phe Pro Asn Glu Leu Glu Ser Tyr His
260 265 270



(2) INFORMATION FOR SEQ ID NO: 963:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 568 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 41...520
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963:



AATAACGATA AAATTTTAAA GGGTGTAAAA GTAGATTGTT ATG TTT GGC ATG GGC 55
Met Phe Gly Met Gly
1 5

TTT TTT GAA ATC CTT GTG GTG TTG GTT GTA GCG ATT ATT TTT TTA GGG 103
Phe Phe Glu Ile Leu Val Val Leu Val Val Ala Ile Ile Phe Leu Gly
10 15 20

CCA GAA AAA TTC CCC CAG GCT GTC GTG GAT GTG GTG AAG TTT TTT CGC 151
Pro Glu Lys Phe Pro Gln Ala Val Val Asp Val Val Lys Phe Phe Arg
25 30 35

GCG GTT AAA AAA ACG CTC AAT GAC GCT AAG GAC ACT TTA GAT AAA GAA 199
Ala Val Lys Lys Thr Leu Asn Asp Ala Lys Asp Thr Leu Asp Lys Glu
40 45 50

ATC AAT ATT GAA GAA ATC AAA AAA GAA ACC CTA GAG TAT CAA AAG CTC 247
Ile Asn Ile Glu Glu Ile Lys Lys Glu Thr Leu Glu Tyr Gln Lys Leu
55 60 65

TTT GAA AAC AAA GTG GAG AGT CTT AAG GGC GTT AAG ATT GAA GAA TTA 295
Phe Glu Asn Lys Val Glu Ser Leu Lys Gly Val Lys Ile Glu Glu Leu
70 75 80 85
Pro Thr Ile Gly Ile Gly Ser Gly Lys Asp Cys Asp Gly Gln Ile Leu
205 210 215

GTG TGG AGC GAT ATG TTA GGC TTT TTT GAT AGC TTT AAG CCT AAA TTC 722
Val Trp Ser Asp Met Leu Gly Phe Phe Asp Ser Phe Lys Pro Lys Phe
220 225 230 235

GTG CGA GAA TAC CTT AAG GGG AAA GAA TTG ATT CAA AAC GCT ATC AAA 770
Val Arg Glu Tyr Leu Lys Gly Lys Glu Leu Ile Gln Asn Ala Ile Lys
240 245 250

CAA TAC GCT GAT GAT GTG AAA AAG GGA AAC TTC CCT AAC GAA TTA GAA 818
Gln Tyr Ala Asp Asp Val Lys Lys Gly Asn Phe Pro Asn Glu Leu Glu
255 260 265

AGT TAT CAT TAATGAAAGA ACGGATAGTC AATTTAGAAA CTTTGGATTT TGAAATT 874
Ser Tyr His
270



(2) INFORMATION FOR SEQ ID NO: 962: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962: 



Met Ser Met Gln Thr Ala Pro Ile Lys Lys Ile Thr Leu Asn His Leu
1 5 10 15

Gln Ala Lys Lys Asn Gln Glu Lys Ile Ile Ala Ile Thr Ala Tyr Asp
20 25 30

Ala Leu Phe Ala Gln Ile Phe Asp Pro Leu Val Asp Val Ile Leu Val
35 40 45

Gly Asp Ser Leu Asn Met Ser Phe Phe Asn Gln Asn Asp Thr Leu Ser
50 55 60

Ala Ser Val Glu Met Met Leu Tyr His Thr Lys Ala Val Cys Ala Gly
65 70 75 80

Ala Lys Thr Pro Phe Ile Ile Thr Asp Met Pro Phe Gly Ser Tyr Lys
85 90 95

Asp Glu Lys Thr Ala Leu Lys Asn Ala Ile Arg Val Tyr Lys Glu Thr
100 105 110

Gln Ala Ser Ala Ile Lys Leu Glu Gly Gly Lys Glu Lys Ala Lys Leu
115 120 125

Val Lys Thr Leu Thr Asn Glu Gly Val Ile Val Val Gly His Ile Gly
130 135 140

Leu Met Pro Gln Phe Val Arg Leu Asp Gly Gly Tyr Lys Ile Lys Gly
145 150 155 160

Lys Asn Glu Glu Gln Gln Lys Lys Leu Leu Glu Asp Ala Leu Ser Leu
165 170 175

Glu Glu Ala Gly Val Gly Leu Leu Val Leu Glu Gly Ile Thr Thr Pro
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961:

AACACTTAGG ATTTTTA ATG AGC ATG CAA ACC GCC CCA ATT AAA AAG ATC 50
Met Ser Met Gln Thr Ala Pro Ile Lys Lys Ile
1 5 10

ACT CTC AAC CAC CTC CAA GCT AAA AAA AAT CAA GAA AAA ATC ATC GCT 98
Thr Leu Asn His Leu Gln Ala Lys Lys Asn Gln Glu Lys Ile Ile Ala
15 20 25

ATT ACC GCT TAT GAT GCG CTG TTC GCT CAA ATA TTT GAT CCG CTA GTG 146
Ile Thr Ala Tyr Asp Ala Leu Phe Ala Gln Ile Phe Asp Pro Leu Val
30 35 40

GAT GTG ATT TTA GTG GGC GAT AGT TTG AAT ATG AGT TTT TTC AAT CAA 194
Asp Val Ile Leu Val Gly Asp Ser Leu Asn Met Ser Phe Phe Asn Gln
45 50 55

AAC GAC ACT TTA AGC GCG AGT GTG GAA ATG ATG CTC TAT CAC ACC AAA 242
Asn Asp Thr Leu Ser Ala Ser Val Glu Met Met Leu Tyr His Thr Lys
60 65 70 75

GCC GTG TGC GCG GGC GCT AAG ACT CCT TTT ATC ATC ACA GAC ATG CCT 290
Ala Val Cys Ala Gly Ala Lys Thr Pro Phe Ile Ile Thr Asp Met Pro
80 85 90

TTT GGA AGC TAT AAA GAT GAA AAA ACA GCC CTA AAA AAC GCC ATT AGG 338
Phe Gly Ser Tyr Lys Asp Glu Lys Thr Ala Leu Lys Asn Ala Ile Arg
95 100 105

GTT TAT AAA GAA ACC CAA GCG AGC GCA ATC AAA TTA GAG GGG GGG AAA 386
Val Tyr Lys Glu Thr Gln Ala Ser Ala Ile Lys Leu Glu Gly Gly Lys
110 115 120

GAA AAA GCG AAA CTG GTT AAA ACG CTC ACT AAT GAG GGC GTT ATT GTG 434
Glu Lys Ala Lys Leu Val Lys Thr Leu Thr Asn Glu Gly Val Ile Val
125 130 135

GTA GGG CAT ATT GGC TTG ATG CCC CAA TTC GTG CGC CTT GAT GGA GGT 482
Val Gly His Ile Gly Leu Met Pro Gln Phe Val Arg Leu Asp Gly Gly
140 145 150 155

TAT AAG ATT AAG GGC AAA AAT GAA GAA CAA CAA AAA AAG CTT TTA GAA 530
Tyr Lys Ile Lys Gly Lys Asn Glu Glu Gln Gln Lys Lys Leu Leu Glu
160 165 170

GAC GCC TTG AGT TTA GAA GAA GCT GGG GTG GGT TTG TTG GTT TTA GAG 578
Asp Ala Leu Ser Leu Glu Glu Ala Gly Val Gly Leu Leu Val Leu Glu
175 180 185

GGT ATA ACC ACC CCT ATC GCT CAA AAA ATC ACG CAA AAA ATC AAA ATC 626
Gly Ile Thr Thr Pro Ile Ala Gln Lys Ile Thr Gln Lys Ile Lys Ile
190 195 200

CCC ACG ATC GGC ATA GGG AGC GGT AAA GAT TGC GAT GGG CAG ATT TTA 674
20 25 30

Asp Glu Lys Ile Asn His Leu Lys Gln Gln Leu Thr Glu Lys Gly Val
35 40 45

Ser Pro Lys Glu Met Asp Lys Asp Lys Phe Glu Glu Glu Tyr Ile Asn
50 55 60

Arg Ser Tyr Pro Lys Ile Ser Ser Lys Lys Lys Glu Lys Leu Leu Lys
65 70 75 80

Ser Phe Ser Ile Ala Asp Asp Lys Ser Gly Val Phe Leu Gly Gly Gly
85 90 95

Tyr Ala Tyr Gly Glu Leu Asn Leu Ser Tyr Gln Gly Glu Met Leu Asp
100 105 110

Arg Tyr Gly Ala Asn Ala Pro Ser Ala Phe Lys Asn Asn Ile Asn Ile
115 120 125

Asn Ala Pro Val Ser Met Ile Ser Ala Lys Phe Gly Tyr Gln Lys Tyr
130 135 140

Phe Val Ser Tyr Phe Gly Thr Arg Phe Tyr Gly Asp Leu Leu Leu Gly
145 150 155 160

Gly Gly Ala Leu Lys Glu Asp Ala Ile Lys Gln Pro Val Gly Ser Phe
165 170 175

Ile Tyr Val Leu Gly Ala Val Asn Thr Asp Leu Leu Phe Asp Met Pro
180 185 190

Leu Asp Phe Lys Thr Lys Lys His Phe Leu Gly Val Tyr Ala Gly Phe
195 200 205

Gly Ile Gly Leu Met Leu Tyr Gln Asp Arg Pro Asn Gln Asn Gly Arg
210 215 220

Asn Leu Val Val Gly Gly Tyr Ser Ser Pro Asn Phe Leu Trp Lys Ser
225 230 235 240

Leu Ile Glu Val Asp Tyr Thr Phe Asn Val Gly Val Ser Leu Thr Leu
245 250 255

Tyr Arg Lys His Arg Leu Glu Ile Gly Thr Lys Leu Pro Ile Ser Tyr
260 265 270

Leu Arg Met Gly Val Glu Glu Gly Ala Ile Tyr Gln Asn Lys Glu Asp
275 280 285

Asp Glu Arg Leu Leu Val Ser Ala Asn Asn Gln Phe Lys Arg Ser Ser
290 295 300

Phe Leu Leu Val Asn Tyr Ala Phe Ile Phe
305 310



(2) INFORMATION FOR SEQ ID NO: 961: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 18...827
(D) OTHER INFORMATION:
Lys Gln Pro Val Gly Ser Phe Ile Tyr Val Leu Gly Ala Val Asn Thr
170 175 180 185

GAT TTA TTG TTT GAT ATG CCT TTA GAT TTT AAA ACT AAA AAG CAT TTT 630
Asp Leu Leu Phe Asp Met Pro Leu Asp Phe Lys Thr Lys Lys His Phe
190 195 200

TTA GGC GTT TAT GCG GGT TTT GGG ATA GGG CTT ATG CTC TAT CAA GAC 678
Leu Gly Val Tyr Ala Gly Phe Gly Ile Gly Leu Met Leu Tyr Gln Asp
205 210 215

AGG CCT AAT CAA AAC GGG AGG AAT TTA GTA GTG GGG GGC TAT TCA AGC 726
Arg Pro Asn Gln Asn Gly Arg Asn Leu Val Val Gly Gly Tyr Ser Ser
220 225 230

CCT AAT TTT TTA TGG AAA TCT TTG ATT GAA GTG GAT TAC ACT TTT AAT 774
Pro Asn Phe Leu Trp Lys Ser Leu Ile Glu Val Asp Tyr Thr Phe Asn
235 240 245

GTG GGC GTG AGT TTA ACG CTT TAT AGG AAA CAC CGT TTA GAG ATT GGC 822
Val Gly Val Ser Leu Thr Leu Tyr Arg Lys His Arg Leu Glu Ile Gly
250 255 260 265

ACA AAA TTG CCG ATT AGC TAT TTG AGA ATG GGA GTG GAA GAG GGA GCG 870
Thr Lys Leu Pro Ile Ser Tyr Leu Arg Met Gly Val Glu Glu Gly Ala
270 275 280

ATT TAT CAA AAT AAA GAA GAT GAT GAG CGT TTG TTG GTT TCG GCT AAC 918
Ile Tyr Gln Asn Lys Glu Asp Asp Glu Arg Leu Leu Val Ser Ala Asn
285 290 295

AAC CAG TTC AAG CGA TCC AGT TTT TTA TTA GTG AAT TAT GCG TTT ATT 966
Asn Gln Phe Lys Arg Ser Ser Phe Leu Leu Val Asn Tyr Ala Phe Ile
300 305 310

TTT TAAGGCTTGA TCTTGGAGTT AAGGTTTAAA ATTTT 1004
Phe



(2) INFORMATION FOR SEQ ID NO: 960: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 960:

Met Leu Lys Phe Lys Tyr Gly Leu Ile Tyr Ile Ala Leu Ile Leu Gly
1 5 10 15

Leu Gln Ala Thr Asp Tyr Asp Asn Leu Glu Glu Glu Asn Gln Gln Leu
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 28...969
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959:

TTACAACTAT TTATTGTAAA GGCTAAA ATG TTG AAA TTT AAA TAT GGT TTG ATT 54
Met Leu Lys Phe Lys Tyr Gly Leu Ile
1 5

TAT ATC GCG CTC ATA CTA GGA CTT CAA GCG ACA GAT TAT GAC AAT TTA 102
Tyr Ile Ala Leu Ile Leu Gly Leu Gln Ala Thr Asp Tyr Asp Asn Leu
10 15 20 25

GAA GAA GAA AAC CAA CAA TTA GAT GAA AAA ATA AAC CAT TTA AAG CAA 150
Glu Glu Glu Asn Gln Gln Leu Asp Glu Lys Ile Asn His Leu Lys Gln
30 35 40

CAG CTC ACC GAA AAA GGG GTT TCG CCC AAA GAG ATG GAT AAG GAT AAG 198
Gln Leu Thr Glu Lys Gly Val Ser Pro Lys Glu Met Asp Lys Asp Lys
45 50 55

TTT GAA GAA GAA TAC ATC AAT CGA TCT TAT CCT AAA ATT TCT TCC AAG 246
Phe Glu Glu Glu Tyr Ile Asn Arg Ser Tyr Pro Lys Ile Ser Ser Lys
60 65 70

AAA AAA GAG AAA TTG CTC AAA TCT TTT TCC ATA GCC GAT GAT AAG AGT 294
Lys Lys Glu Lys Leu Leu Lys Ser Phe Ser Ile Ala Asp Asp Lys Ser
75 80 85

GGG GTT TTT TTA GGG GGT GGG TAT GCT TAT GGG GAA CTT AAC TTG TCT 342
Gly Val Phe Leu Gly Gly Gly Tyr Ala Tyr Gly Glu Leu Asn Leu Ser
90 95 100 105

TAT CAA GGG GAA ATG TTA GAC AGA TAC GGC GCG AAT GCC CCT AGC GCG 390
Tyr Gln Gly Glu Met Leu Asp Arg Tyr Gly Ala Asn Ala Pro Ser Ala
110 115 120

TTT AAA AAC AAT ATC AAT ATT AAC GCT CCT GTT TCT ATG ATT AGC GCT 438
Phe Lys Asn Asn Ile Asn Ile Asn Ala Pro Val Ser Met Ile Ser Ala
125 130 135

AAA TTT GGG TAT CAA AAA TAC TTT GTG TCT TAT TTT GGG ACA CGA TTT 486
Lys Phe Gly Tyr Gln Lys Tyr Phe Val Ser Tyr Phe Gly Thr Arg Phe
140 145 150

TAT GGG GAT TTA TTG CTT GGG GGT GGG GCA TTA AAA GAG GAT GCA ATC 534
Tyr Gly Asp Leu Leu Leu Gly Gly Gly Ala Leu Lys Glu Asp Ala Ile
155 160 165

AAG CAG CCT GTA GGC TCG TTT ATT TAT GTT TTA GGG GCT GTC AAT ACC 582
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958:

Met Gly Leu Glu Asp Lys Cys Leu Leu Tyr Glu Ile Lys Ala Asn Asp
1 5 10 15

Trp Ser Tyr Ala Asn Phe Phe Asn Gly Asn Lys Ala Ser Phe Lys Gln
20 25 30

Glu Val Cys Val Asp Thr Ile Lys Pro Ser Ile Thr Ile Leu Ser Arg
35 40 45

Ser Pro Ser Ile Ala Tyr Gly Gly Ser Ala Ile Val Val Phe Glu Ala
50 55 60

Leu Asp Lys Asn Leu Ser Gln Ala Phe Val Arg Val Lys Lys Lys Asp
65 70 75 80

Phe Glu Ala Phe Arg Leu Leu Glu Phe Lys Gln Arg Asn Val Phe Ile
85 90 95

Ala Leu Val Pro Trp Ser Tyr Lys Asn Lys Asp Phe Lys Ala Phe Ile
100 105 110

Val Ala Lys Asp Lys Ala Tyr Asn Phe Asn Thr Ala Pro Leu Leu Phe
115 120 125

Lys Arg Lys Ile His Arg Leu Arg Glu Lys Asp Ile Asp Leu Ser Ala
130 135 140

Leu Lys Asp Lys Ile Ala Lys Gln Glu Lys Phe Gln Asn Asp Thr Glu
145 150 155 160

Gln Ala Leu Leu Glu Arg Phe Ser Asn Ala Arg Pro Lys Asp Leu Glu
165 170 175

Lys Ile Gln Lys Ile Ala Leu Glu Gln Gly Asp Phe Tyr Lys Asp Phe
180 185 190

Ser His Phe Gln Ala Leu Lys Pro Leu Asn Gly Pro Phe Lys Met Ala
195 200 205

Ser Asn Phe Leu Glu Asn Arg Arg Ile Leu Lys Asn Asn Gln Val Leu
210 215 220

Phe Lys Phe Leu His Leu Gly Val Asp Leu Ile Pro Gly Lys Asp Leu
225 230 235 240

Ser Leu Ala Phe Asp Leu Ser Val Lys Arg Val Phe Lys Gly Glu Phe
245 250 255

Asp Phe Tyr Gly Asn Ser Leu Ile His Cys Tyr Gly Leu Gly Leu Cys
260 265 270

Val Phe Leu Ala His Leu Lys Asp Asp Lys Ser Val Gly Ser Ser Gly
275 280 285

Leu Lys Leu Gly Ser Gly Leu His Leu Gly Met Leu Leu Gln Gly Val
290 295 300

Phe Val Arg Pro Asn Glu Trp Leu Asn Glu Gln Trp Ile Lys Thr Asn
305 310 315 320

Ile Thr Ala Pro Ile Glu Gln Ala Lys Arg Leu Leu Met Lys Gly
325 330 335



(2) INFORMATION FOR SEQ ID NO: 959: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
GAA CAA GCT TTA TTA GAA AGA TTT TCC AAT GCG CGC CCA AAA GAT TTA 588
Glu Gln Ala Leu Leu Glu Arg Phe Ser Asn Ala Arg Pro Lys Asp Leu
160 165 170 175

GAA AAA ATC CAA AAG ATC GCT TTA GAG CAA GGG GAT TTT TAT AAG GAT 636
Glu Lys Ile Gln Lys Ile Ala Leu Glu Gln Gly Asp Phe Tyr Lys Asp
180 185 190

TTT TCT CAT TTT CAA GCG CTA AAA CCC TTG AAC GGG CCT TTT AAA ATG 684
Phe Ser His Phe Gln Ala Leu Lys Pro Leu Asn Gly Pro Phe Lys Met
195 200 205

GCA AGC AAT TTT TTA GAA AAT CGG CGT ATC TTA AAG AAT AAT CAG GTG 732
Ala Ser Asn Phe Leu Glu Asn Arg Arg Ile Leu Lys Asn Asn Gln Val
210 215 220

TTG TTT AAA TTC TTG CAT TTA GGG GTG GAT TTG ATA CCT GGC AAG GAT 780
Leu Phe Lys Phe Leu His Leu Gly Val Asp Leu Ile Pro Gly Lys Asp
225 230 235

TTA TCT TTA GCG TTT GAT TTG TCT GTG AAG AGG GTT TTT AAG GGG GAG 828
Leu Ser Leu Ala Phe Asp Leu Ser Val Lys Arg Val Phe Lys Gly Glu
240 245 250 255

TTC GAT TTT TAT GGT AAT AGT TTA ATC CAT TGC TAT GGG TTA GGT TTG 876
Phe Asp Phe Tyr Gly Asn Ser Leu Ile His Cys Tyr Gly Leu Gly Leu
260 265 270

TGC GTT TTT TTA GCC CAT TTA AAA GAT GAT AAA AGC GTG GGG AGT AGT 924
Cys Val Phe Leu Ala His Leu Lys Asp Asp Lys Ser Val Gly Ser Ser
275 280 285

GGT TTG AAA TTA GGG AGC GGG TTG CAT TTA GGG ATG CTT TTG CAA GGG 972
Gly Leu Lys Leu Gly Ser Gly Leu His Leu Gly Met Leu Leu Gln Gly
290 295 300

GTT TTT GTC CGG CCC AAT GAA TGG CTT AAT GAG CAA TGG ATA AAA ACC 1020
Val Phe Val Arg Pro Asn Glu Trp Leu Asn Glu Gln Trp Ile Lys Thr
305 310 315

AAT ATC ACC GCC CCC ATA GAG CAA GCC AAA CGG CTT TTA ATG AAA GGA T 1069
Asn Ile Thr Ala Pro Ile Glu Gln Ala Lys Arg Leu Leu Met Lys Gly
320 325 330 335

AGTCATGTTA AAAACGAATC AAAAAAATGT GCATGCGTTT GAAATTGAAA AG 1121

(2) INFORMATION FOR SEQ ID NO: 958: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 64...1068
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957: 

AAAGAAAATC TGGTATTGGA TAAACCCAAG TCTTTAGAAG TGCCTTTGAC TAGGCCCGAA 60

ATC ATG GGG CTA GAA GAC AAG TGC CTT TTA TAT GAA ATT AAA GCT AAT 108
Met Gly Leu Glu Asp Lys Cys Leu Leu Tyr Glu Ile Lys Ala Asn
1 5 10 15

GAT TGG AGT TAT GCT AAT TTT TTC AAT GGC AAT AAA GCG TCT TTC AAA 156
Asp Trp Ser Tyr Ala Asn Phe Phe Asn Gly Asn Lys Ala Ser Phe Lys
20 25 30

CAA GAA GTG TGT GTT GAT ACG ATA AAA CCC TCA ATC ACT ATT TTA TCT 204
Gln Glu Val Cys Val Asp Thr Ile Lys Pro Ser Ile Thr Ile Leu Ser
35 40 45

CGA TCC CCA AGC ATC GCT TAT GGG GGG AGC GCG ATA GTC GTT TTT GAA 252
Arg Ser Pro Ser Ile Ala Tyr Gly Gly Ser Ala Ile Val Val Phe Glu
50 55 60

GCT TTG GAT AAG AAT TTG TCT CAA GCG TTT GTG CGC GTC AAA AAA AAG 300
Ala Leu Asp Lys Asn Leu Ser Gln Ala Phe Val Arg Val Lys Lys Lys
65 70 75

GAT TTT GAA GCT TTC AGG CTT TTA GAA TTC AAA CAG CGT AAT GTT TTT 348
Asp Phe Glu Ala Phe Arg Leu Leu Glu Phe Lys Gln Arg Asn Val Phe
80 85 90 95

ATC GCT CTA GTG CCT TGG TCT TAT AAA AAT AAG GAT TTT AAG GCG TTC 396
Ile Ala Leu Val Pro Trp Ser Tyr Lys Asn Lys Asp Phe Lys Ala Phe
100 105 110

ATT GTC GCT AAA GAT AAA GCC TAT AAC TTT AAT ACC GCC CCT TTA TTG 444
Ile Val Ala Lys Asp Lys Ala Tyr Asn Phe Asn Thr Ala Pro Leu Leu
115 120 125

TTC AAG CGA AAA ATC CAT CGT TTG AGG GAA AAA GAT ATA GAC TTA AGC 492
Phe Lys Arg Lys Ile His Arg Leu Arg Glu Lys Asp Ile Asp Leu Ser
130 135 140

GCC TTA AAA GAT AAG ATT GCA AAG CAA GAA AAA TTT CAA AAC GAC ACT 540
Ala Leu Lys Asp Lys Ile Ala Lys Gln Glu Lys Phe Gln Asn Asp Thr
145 150 155
TCT CTA AAA GAG GTT GAA CCG CTT TTG GTA AAT AAA AAT ATT AAA ATA 627
Ser Leu Lys Glu Val Glu Pro Leu Leu Val Asn Lys Asn Ile Lys Ile
190 195 200

ATC ACT AAA AAT GAC GAT ATA CTA GAC ATA AAG GAA GTA TTA TGAAACAAA 678
Ile Thr Lys Asn Asp Asp Ile Leu Asp Ile Lys Glu Val Leu
205 210 215

CAACCATTAA CCACTCTGTG GAATTAGTAG GGATAGGCTT GCACAAG 725

(2) INFORMATION FOR SEQ ID NO: 956: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956:

Met Asp Lys Asn Gln Tyr His Arg Pro His Arg Ala Ser Gln Thr Ala
1 5 10 15

Phe Asn Glu Arg Ile Val Met Leu Lys Thr Asn Gln Lys Asn Val His
20 25 30

Ala Phe Glu Ile Glu Lys Gln Glu Pro Glu Ala Val Ile Gly Phe Leu
35 40 45

Glu Lys Asn His Ala Leu Leu Gln Tyr Phe Leu Ile Ile Phe Lys Tyr
50 55 60

Asp Ile Glu Ser Glu Val Lys Ala Val Leu Arg Lys His Gln Leu Leu
65 70 75 80

Phe Leu Glu Thr Asn Arg Val Leu Asn Gly Arg His Ile Lys Thr Met
85 90 95

Pro Leu Lys Asp Glu Thr Asp His Pro Lys Pro Asn His Ser Lys Thr
100 105 110

Glu Pro Lys Thr Thr Ile Tyr Glu Arg His Ile Arg Ser Gly Glu Glu
115 120 125

Ile Tyr Ser Thr Asn His Leu Ile Phe Leu Gly Asn Ile His Asn Gly
130 135 140

Ala Lys Ile Ile Ser Glu Gly Cys Val Ser Val Tyr Gly Val Cys Glu
145 150 155 160

Gly Ala Ile Val Cys Phe Gly Glu Cys Leu Ile Leu Lys Glu Val Lys
165 170 175

Ser Ala Gln Ile Val Phe Gln Asn Lys Ile Leu Ser Leu Lys Glu Val
180 185 190

Glu Pro Leu Leu Val Asn Lys Asn Ile Lys Ile Ile Thr Lys Asn Asp
195 200 205

Asp Ile Leu Asp Ile Lys Glu Val Leu
210 215

(2) INFORMATION FOR SEQ ID NO: 957: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1121 base pairs 
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 19...669
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955:

TGAATGGCTT AATGAGCA ATG GAT AAA AAC CAA TAT CAC CGC CCC CAT AGA 51
Met Asp Lys Asn Gln Tyr His Arg Pro His Arg
1 5 10

GCA AGC CAA ACG GCT TTT AAT GAA AGG ATA GTC ATG TTA AAA ACG AAT 99
Ala Ser Gln Thr Ala Phe Asn Glu Arg Ile Val Met Leu Lys Thr Asn
15 20 25

CAA AAA AAT GTG CAT GCG TTT GAA ATT GAA AAG CAA GAG CCT GAA GCG 147
Gln Lys Asn Val His Ala Phe Glu Ile Glu Lys Gln Glu Pro Glu Ala
30 35 40

GTC ATA GGG TTT TTA GAA AAA AAC CAT GCC CTT TTG CAA TAT TTT CTT 195
Val Ile Gly Phe Leu Glu Lys Asn His Ala Leu Leu Gln Tyr Phe Leu
45 50 55

ATT ATT TTT AAA TAC GAT ATT GAA TCA GAA GTC AAA GCC GTT TTG CGC 243
Ile Ile Phe Lys Tyr Asp Ile Glu Ser Glu Val Lys Ala Val Leu Arg
60 65 70 75

AAA CAC CAG CTT TTG TTT TTA GAA ACG AAT CGC GTT TTA AAC GGA CGC 291
Lys His Gln Leu Leu Phe Leu Glu Thr Asn Arg Val Leu Asn Gly Arg
80 85 90

CAT ATC AAA ACC ATG CCT TTA AAA GAC GAA ACC GAT CAT CCA AAA CCC 339
His Ile Lys Thr Met Pro Leu Lys Asp Glu Thr Asp His Pro Lys Pro
95 100 105

AAT CAT TCT AAA ACA GAA CCT AAA ACA ACG ATT TAT GAG CGC CAT ATC 387
Asn His Ser Lys Thr Glu Pro Lys Thr Thr Ile Tyr Glu Arg His Ile
110 115 120

AGG AGT GGG GAA GAG ATT TAT AGC ACT AAT CAC CTT ATT TTT TTG GGT 435
Arg Ser Gly Glu Glu Ile Tyr Ser Thr Asn His Leu Ile Phe Leu Gly
125 130 135

AAT ATC CAT AAT GGA GCC AAG ATT ATT TCA GAG GGC TGT GTG TCT GTT 483
Asn Ile His Asn Gly Ala Lys Ile Ile Ser Glu Gly Cys Val Ser Val
140 145 150 155

TAT GGG GTT TGC GAA GGG GCG ATT GTG TGC TTT GGA GAG TGT TTG ATC 531
Tyr Gly Val Cys Glu Gly Ala Ile Val Cys Phe Gly Glu Cys Leu Ile
160 165 170

TTA AAA GAA GTC AAG AGC GCT CAA ATC GTT TTT CAA AAC AAA ATT TTG 579
Leu Lys Glu Val Lys Ser Ala Gln Ile Val Phe Gln Asn Lys Ile Leu
175 180 185
325 330 335

Glu Gly Thr Met Ser Tyr Pro Asp Tyr Gly Arg Trp Trp Arg Met Ile
340 345 350

Glu Glu Tyr Arg Val Asp Lys Phe Tyr Thr Ser Pro Thr Ala Ile Arg
355 360 365

Met Leu His Ala Lys Gly Glu Asn Glu Pro Ser Lys Tyr Asn Leu Glu
370 375 380

Ser Leu Lys Val Leu Gly Thr Val Gly Glu Pro Ile Asn Pro Thr Ala
385 390 395 400

Trp Lys Trp Phe Tyr Glu Lys Ile Gly Asn Ser Lys Cys Ser Ile Val
405 410 415

Asp Thr Trp Trp Gln Thr Glu Thr Gly Gly His Ile Ile Ser Pro Leu
420 425 430

Pro Gly Ala Thr Pro Ile Arg Ala Ser Cys Ala Thr Leu Pro Leu Pro
435 440 445

Gly Ile His Ala Glu Val Leu Asn Glu Asp Gly Thr Lys Thr Lys Pro
450 455 460

Gly Glu Gln Gly Phe Leu Cys Ile Thr Lys Pro Trp Pro Ser Met Ile
465 470 475 480

Arg Asn Ile Trp Gly Asp Glu Lys Arg Tyr Ile Asp Ser Tyr Phe Ser
485 490 495

Gln Ile Lys Leu Asn Gly Glu Tyr Val Tyr Leu Ser Gly Asp Gly Ala
500 505 510

Ile Val Asp Glu Asn Gly Tyr Ile Thr Ile Ile Gly Arg Thr Asp Asp
515 520 525

Ile Val Asn Val Ser Gly His Arg Ile Gly Thr Ala Glu Val Glu Ser
530 535 540

Ala Ile Ser Lys His Glu Met Val Ala Glu Cys Ala Val Val Gly Ile
545 550 555 560

Pro Asp Ala Ile Lys Gly Glu Gly Leu Phe Ala Phe Val Val Leu Cys
565 570 575

Asp Gly Ala Lys Cys Asn Leu Gly Glu Ser Leu Glu Leu Leu Lys Glu
580 585 590

Met Asn His Ile Leu Ser Ile Glu Ile Gly Lys Ile Ala Lys Leu Asp
595 600 605

Asn Val Met Tyr Val Pro Gly Leu Pro Lys Thr Arg Ser Gly Lys Ile
610 615 620

Met Arg Arg Leu Leu Lys Ser Ile Ala Lys Lys Glu Pro Ile Thr Gln
625 630 635 640

Asp Leu Ser Thr Leu Glu Asp Val Asn Val Val Lys Glu Ile Met Ser
645 650 655

Ile Ala Gln Met Glu Glu
660



(2) INFORMATION FOR SEQ ID NO: 955: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
AAAATCTAAA AAATGCTTTT TAGCGTTTTT TAGCCAAATA ATAAG 2070



(2) INFORMATION FOR SEQ ID NO: 954: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954: 



Met Gln Leu Asp Glu Asp Leu Glu Phe Ala Lys Lys Ile Phe Asn Pro
1 5 10 15

Asn Arg Ala Phe Ala Lys Gln Ala Arg Ile Lys Asn Met Cys Glu Tyr
20 25 30

Lys Asp Leu Val His Glu Ala Asn Glu Asp Tyr Glu His Phe Trp Gly
35 40 45

Asp Leu Ala Lys Gln Lys Leu Thr Trp Phe Lys Pro Phe Asp Lys Val
50 55 60

Leu Asn Ser Asp Asn Ala Pro Phe Phe Lys Trp Phe Glu Asn Gly Lys
65 70 75 80

Ile Asn Val Ser Tyr Asn Cys Ile Asp Arg His Leu Lys Asp Lys Lys
85 90 95

Asn Lys Val Ala Ile Ile Phe Glu Gly Glu Met Gly Asp Tyr Asn Val
100 105 110

Ile Thr Tyr Arg Lys Leu His Ser Glu Val Asn Lys Thr Ala Asn Leu
115 120 125

Leu Lys Asn Glu Phe Asn Val Lys Lys Gly Asp Arg Val Ile Ile Tyr
130 135 140

Met Pro Met Ile Val Glu Ser Val Tyr Met Met Leu Ala Cys Thr Arg
145 150 155 160

Ile Gly Ala Ile His Ser Ile Val Phe Ala Gly Phe Ser Pro Glu Ala
165 170 175

Leu Arg Asp Arg Ile Asn Asp Ala Gln Ala Lys Leu Val Ile Thr Ala
180 185 190

Asp Gly Thr Phe Arg Lys Gly Lys Pro Tyr Met Leu Lys Pro Ala Leu
195 200 205

Asp Lys Ala Leu Glu Asn Asn Ala Cys Pro Ser Val Glu Lys Ala Leu
210 215 220

Ile Val Ile Arg Asn Ala Lys Glu Ile Asp Tyr Val Arg Gly Arg Asp
225 230 235 240

Phe Val Tyr Asn Glu Met Val Asn Tyr Gln Ser Asp Lys Cys Glu Pro
245 250 255

Glu Met Met Asp Ser Glu Asp Pro Leu Phe Leu Leu Tyr Thr Ser Gly
260 265 270

Ser Thr Gly Lys Pro Lys Gly Val Gln His Ser Ser Ala Gly Tyr Leu
275 280 285

Leu Trp Ala Gln Met Thr Met Glu Trp Val Phe Asp Ile Arg Asp Asn
290 295 300

Asp Asn Phe Trp Cys Thr Ala Asp Ile Gly Trp Ile Thr Gly His Thr
305 310 315 320

Tyr Val Val Tyr Gly Pro Leu Ala Cys Gly Ala Thr Thr Leu Ile Leu
AGG GCC AGT TGC GCG ACT TTA CCT TTG CCT GGA ATC CAT GCG GAA GTT 1400
Arg Ala Ser Cys Ala Thr Leu Pro Leu Pro Gly Ile His Ala Glu Val
440 445 450

TTA AAC GAA GAC GGC ACT AAA ACA AAG CCT GGA GAG CAA GGG TTT TTA 1448
Leu Asn Glu Asp Gly Thr Lys Thr Lys Pro Gly Glu Gln Gly Phe Leu
455 460 465 470

TGC ATC ACT AAG CCA TGG CCT TCT ATG ATA AGA AAC ATT TGG GGC GAT 1496
Cys Ile Thr Lys Pro Trp Pro Ser Met Ile Arg Asn Ile Trp Gly Asp
475 480 485

GAA AAA CGA TAC ATT GAT AGC TAT TTT TCT CAG ATC AAG TTG AAT GGG 1544
Glu Lys Arg Tyr Ile Asp Ser Tyr Phe Ser Gln Ile Lys Leu Asn Gly
490 495 500

GAA TAT GTC TAC CTC TCT GGA GAT GGC GCT ATC GTG GAT GAA AAC GGA 1592
Glu Tyr Val Tyr Leu Ser Gly Asp Gly Ala Ile Val Asp Glu Asn Gly
505 510 515

TAC ATT ACT ATT ATT GGG CGC ACA GAT GAT ATT GTG AAT GTG AGT GGG 1640
Tyr Ile Thr Ile Ile Gly Arg Thr Asp Asp Ile Val Asn Val Ser Gly
520 525 530

CAT AGG ATT GGC ACG GCT GAA GTG GAG AGC GCT ATT TCC AAG CAT GAA 1688
His Arg Ile Gly Thr Ala Glu Val Glu Ser Ala Ile Ser Lys His Glu
535 540 545 550

ATG GTG GCT GAA TGC GCG GTG GTG GGT ATC CCT GAT GCG ATT AAA GGA 1736
Met Val Ala Glu Cys Ala Val Val Gly Ile Pro Asp Ala Ile Lys Gly
555 560 565

GAG GGC TTG TTT GCG TTT GTG GTG CTG TGC GAT GGG GCT AAA TGC AAT 1784
Glu Gly Leu Phe Ala Phe Val Val Leu Cys Asp Gly Ala Lys Cys Asn
570 575 580

CTT GGC GAG AGT TTA GAA TTG CTA AAA GAA ATG AAC CAT ATC TTA TCC 1832
Leu Gly Glu Ser Leu Glu Leu Leu Lys Glu Met Asn His Ile Leu Ser
585 590 595

ATT GAG ATT GGA AAG ATC GCG AAA TTA GAC AAT GTC ATG TAT GTG CCA 1880
Ile Glu Ile Gly Lys Ile Ala Lys Leu Asp Asn Val Met Tyr Val Pro
600 605 610

GGT TTG CCT AAA ACC AGG AGC GGG AAA ATC ATG AGA AGG CTT TTG AAA 1928
Gly Leu Pro Lys Thr Arg Ser Gly Lys Ile Met Arg Arg Leu Leu Lys
615 620 625 630

TCC ATC GCC AAA AAA GAG CCT ATC ACT CAA GAT TTA AGC ACG CTA GAA 1976
Ser Ile Ala Lys Lys Glu Pro Ile Thr Gln Asp Leu Ser Thr Leu Glu
635 640 645

GAT GTG AAT GTG GTT AAA GAA ATA ATG AGC ATC GCT CAA ATG GAG GAG T 2025
Asp Val Asn Val Val Lys Glu Ile Met Ser Ile Ala Gln Met Glu Glu
650 655 660
AAC GCC TGC CCT AGC GTG GAA AAA GCG CTC ATT GTG ATA CGA AAC GCC 728
Asn Ala Cys Pro Ser Val Glu Lys Ala Leu Ile Val Ile Arg Asn Ala
215 220 225 230

AAA GAG ATT GAC TAT GTG AGA GGG CGC GAT TTT GTC TAT AAT GAA ATG 776
Lys Glu Ile Asp Tyr Val Arg Gly Arg Asp Phe Val Tyr Asn Glu Met
235 240 245

GTC AAT TAC CAA TCC GAC AAA TGC GAA CCT GAA ATG ATG GAC TCT GAA 824
Val Asn Tyr Gln Ser Asp Lys Cys Glu Pro Glu Met Met Asp Ser Glu
250 255 260

GAT CCT TTA TTC TTG CTC TAT ACA AGC GGA TCA ACC GGA AAG CCT AAA 872
Asp Pro Leu Phe Leu Leu Tyr Thr Ser Gly Ser Thr Gly Lys Pro Lys
265 270 275

GGC GTT CAA CAC AGC AGT GCG GGG TAT TTG TTA TGG GCG CAA ATG ACG 920
Gly Val Gln His Ser Ser Ala Gly Tyr Leu Leu Trp Ala Gln Met Thr
280 285 290

ATG GAG TGG GTT TTT GAT ATT AGA GAT AAC GAT AAT TTT TGG TGC ACC 968
Met Glu Trp Val Phe Asp Ile Arg Asp Asn Asp Asn Phe Trp Cys Thr
295 300 305 310

GCC GAT ATT GGC TGG ATC ACA GGG CAC ACT TAT GTG GTT TAT GGA CCT 1016
Ala Asp Ile Gly Trp Ile Thr Gly His Thr Tyr Val Val Tyr Gly Pro
315 320 325

TTA GCT TGT GGG GCG ACG ACT TTG ATA CTA GAA GGC ACG ATG TCT TAT 1064
Leu Ala Cys Gly Ala Thr Thr Leu Ile Leu Glu Gly Thr Met Ser Tyr
330 335 340

CCG GAT TAT GGG AGA TGG TGG AGG ATG ATA GAA GAA TAC CGT GTG GAT 1112
Pro Asp Tyr Gly Arg Trp Trp Arg Met Ile Glu Glu Tyr Arg Val Asp
345 350 355

AAA TTC TAC ACT TCC CCT ACC GCT ATA AGA ATG TTG CAT GCC AAA GGT 1160
Lys Phe Tyr Thr Ser Pro Thr Ala Ile Arg Met Leu His Ala Lys Gly
360 365 370

GAA AAC GAA CCC TCA AAG TAT AAT TTA GAG TCG CTC AAA GTT TTA GGA 1208
Glu Asn Glu Pro Ser Lys Tyr Asn Leu Glu Ser Leu Lys Val Leu Gly
375 380 385 390

ACG GTG GGA GAG CCC ATT AAC CCT ACA GCA TGG AAA TGG TTT TAT GAA 1256
Thr Val Gly Glu Pro Ile Asn Pro Thr Ala Trp Lys Trp Phe Tyr Glu
395 400 405

AAA ATC GGC AAC TCA AAA TGC AGC ATC GTG GAT ACT TGG TGG CAG ACA 1304
Lys Ile Gly Asn Ser Lys Cys Ser Ile Val Asp Thr Trp Trp Gln Thr
410 415 420

GAA ACA GGC GGG CAC ATC ATC AGC CCT TTA CCG GGA GCT ACG CCT ATA 1352
Glu Thr Gly Gly His Ile Ile Ser Pro Leu Pro Gly Ala Thr Pro Ile
425 430 435
ATAAAAAATG CGCTTAAAAC CATGAAAAAG GAGATGCG ATG CAA TTA GAC GAA GAT 56
Met Gln Leu Asp Glu Asp
1 5

TTA GAA TTC GCT AAA AAA ATC TTT AAC CCT AAC AGA GCG TTT GCC AAG 104
Leu Glu Phe Ala Lys Lys Ile Phe Asn Pro Asn Arg Ala Phe Ala Lys
10 15 20

CAA GCC AGG ATT AAA AAC ATG TGC GAA TAT AAA GAT TTA GTG CAT GAA 152
Gln Ala Arg Ile Lys Asn Met Cys Glu Tyr Lys Asp Leu Val His Glu
25 30 35

GCC AAT GAA GAT TAT GAA CAT TTT TGG GGC GAT TTA GCC AAG CAG AAA 200
Ala Asn Glu Asp Tyr Glu His Phe Trp Gly Asp Leu Ala Lys Gln Lys
40 45 50

CTC ACA TGG TTT AAA CCT TTT GAT AAG GTT TTA AAC AGC GAT AAC GCC 248
Leu Thr Trp Phe Lys Pro Phe Asp Lys Val Leu Asn Ser Asp Asn Ala
55 60 65 70

CCT TTT TTC AAA TGG TTT GAA AAC GGC AAA ATC AAT GTT TCT TAC AAT 296
Pro Phe Phe Lys Trp Phe Glu Asn Gly Lys Ile Asn Val Ser Tyr Asn
75 80 85

TGC ATA GAC AGG CAT TTA AAA GAC AAA AAA AAT AAA GTG GCG ATC ATT 344
Cys Ile Asp Arg His Leu Lys Asp Lys Lys Asn Lys Val Ala Ile Ile
90 95 100

TTT GAA GGG GAA ATG GGG GAT TAT AAT GTC ATC ACT TAC AGA AAA CTC 392
Phe Glu Gly Glu Met Gly Asp Tyr Asn Val Ile Thr Tyr Arg Lys Leu
105 110 115

CAC TCT GAA GTC AAT AAA ACA GCC AAC CTT TTA AAA AAC GAA TTC AAT 440
His Ser Glu Val Asn Lys Thr Ala Asn Leu Leu Lys Asn Glu Phe Asn
120 125 130

GTC AAA AAA GGC GAT AGG GTC ATT ATC TAT ATG CCC ATG ATT GTA GAA 488
Val Lys Lys Gly Asp Arg Val Ile Ile Tyr Met Pro Met Ile Val Glu
135 140 145 150

AGC GTT TAT ATG ATG CTC GCA TGC ACT AGG ATT GGA GCG ATC CAT AGC 536
Ser Val Tyr Met Met Leu Ala Cys Thr Arg Ile Gly Ala Ile His Ser
155 160 165

ATC GTT TTT GCT GGG TTT AGC CCT GAA GCC TTA AGG GAT AGG ATC AAC 584
Ile Val Phe Ala Gly Phe Ser Pro Glu Ala Leu Arg Asp Arg Ile Asn
170 175 180

GAC GCT CAA GCT AAA TTA GTT ATC ACA GCG GAT GGG ACT TTT AGA AAA 632
Asp Ala Gln Ala Lys Leu Val Ile Thr Ala Asp Gly Thr Phe Arg Lys
185 190 195

GGC AAA CCT TAC ATG CTC AAG CCA GCC CTT GAC AAG GCT CTA GAA AAT 680
Gly Lys Pro Tyr Met Leu Lys Pro Ala Leu Asp Lys Ala Leu Glu Asn
200 205 210
65 70 75 80
Glu His Arg Leu Gln Asn Lys Asn Ile Gln Ile Gln Leu Leu Asp His
85 90 95

His Ile Ser Gly Lys Glu Val Ala Glu Ser Phe His Trp Tyr Phe Leu
100 105 110

Asp Ile Asn Arg Cys Ala Thr Lys Ile Val Tyr Glu Phe Leu Lys Lys
115 120 125

His Tyr Ala Ile Leu Glu Pro Lys Asn Thr Thr Trp Leu Glu Pro Leu
130 135 140

Val Glu Met Val Asn Ser Val Asp Ile Trp Asp Thr Gln Gly Tyr Gly
145 150 155 160

Phe Glu Leu Gly Lys Val Cys Met Arg Met Ile Asn Gln Ser Ser Glu
165 170 175

Leu Asn Arg Phe Met Phe Asp Asp Glu Asn Arg Asn Tyr Lys Leu Lys
180 185 190

Leu Leu Glu Glu Val Lys Asn Tyr Leu Phe Leu Glu Asn Ala Pro Val
195 200 205

Ala Tyr Asp Asn Asp Leu Phe Lys Leu Lys Lys Ile Ala Leu Gly Gly
210 215 220

Asp Pro Asp Ala Glu Thr Met Asp Asn Ile Ser Ser Asn Ala Gln Thr
225 230 235 240

His Leu Leu Ser Leu Lys Lys His Asp Cys Ser Val Tyr Tyr Gln Asp
245 250 255

Lys Lys Gly Phe Leu Ser Tyr Ser Met Gly Gly Ile Ser Val Leu Ala
260 265 270

Asn Leu Phe Leu Thr Gln Asn Pro Asp Phe Asp Phe Tyr Met Asp Val
275 280 285

Asn Ala Lys Gly Asn Val Ser Leu Arg Ala Asn Gly Asn Cys Asp Val
290 295 300

Cys Glu Leu Ser Gln Met Cys Phe Asn Gly Gly Gly His Arg Asn Ala
305 310 315 320

Ser Gly Gly Lys Ile Asp Gly Phe Arg Glu Ser Phe Asn Tyr Arg Asp
325 330 335

Ile Lys Glu Gln Ile Glu Glu Ile Phe Asn Asn Ala
340 345

(2) INFORMATION FOR SEQ ID NO: 953:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2070 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...2024
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953:
TTA GGG GGC GAC CCT GAT GCA GAA ACG ATG GAC AAT ATC TCT TCA AAC 723
Leu Gly Gly Asp Pro Asp Ala Glu Thr Met Asp Asn Ile Ser Ser Asn
225 230 235

GCG CAA ACG CAT TTG CTC TCT TTA AAA AAG CAT GAT TGC AGC GTT TAT 771
Ala Gln Thr His Leu Leu Ser Leu Lys Lys His Asp Cys Ser Val Tyr
240 245 250

TAC CAG GAT AAA AAA GGG TTT TTA AGT TAT TCT ATG GGG GGC ATT AGC 819
Tyr Gln Asp Lys Lys Gly Phe Leu Ser Tyr Ser Met Gly Gly Ile Ser
255 260 265

GTG TTG GCT AAC CTT TTT TTA ACG CAA AAT CCG GAT TTT GAT TTT TAT 867
Val Leu Ala Asn Leu Phe Leu Thr Gln Asn Pro Asp Phe Asp Phe Tyr
270 275 280 285

ATG GAT GTG AAC GCT AAA GGG AAT GTG AGC TTA AGG GCG AAT GGG AAT 915
Met Asp Val Asn Ala Lys Gly Asn Val Ser Leu Arg Ala Asn Gly Asn
290 295 300

TGC GAT GTG TGC GAA CTC AGT CAA ATG TGT TTT AAT GGG GGT GGG CAT 963
Cys Asp Val Cys Glu Leu Ser Gln Met Cys Phe Asn Gly Gly Gly His
305 310 315

AGG AAT GCG AGC GGA GGC AAG ATT GAT GGT TTT AGG GAG AGT TTC AAT 1011
Arg Asn Ala Ser Gly Gly Lys Ile Asp Gly Phe Arg Glu Ser Phe Asn
320 325 330

TAT AGG GAT ATT AAA GAA CAA ATT GAA GAA ATC TTC AAC AAC GCT TAAAA 1061
Tyr Arg Asp Ile Lys Glu Gln Ile Glu Glu Ile Phe Asn Asn Ala
335 340 345

CTAAGCTGTT TAGAAAAAAC TAACAAAAAC TGAAAAGAGT TTAAAAGCTC 1111

(2) INFORMATION FOR SEQ ID NO: 952:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 348 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952:

Met Met Gln Val Tyr His Leu Ser His Ile Asp Leu Asp Gly Tyr Ala
1 5 10 15

Cys Gln Leu Val Ser Lys Gln Phe Phe Lys Asn Ile Gln Cys Tyr Asn
20 25 30

Ala Asn Tyr Gly Arg Glu Val Ser Ala Arg Ile Tyr Glu Ile Leu Asn
35 40 45

Ala Ile Ala Gln Ser Lys Glu Ser Glu Phe Leu Ile Leu Val Ser Asp
50 55 60

Leu Asn Leu Asn Leu Asn Glu Ala Glu Tyr Leu Gln Asp Lys Ile Gln
Met Met Gln Val Tyr His Leu Ser His Ile Asp Leu Asp
1 5 10

GGC TAT GCA TGC CAG CTT GTT TCA AAA CAA TTT TTT AAA AAT ATC CAA 99
Gly Tyr Ala Cys Gln Leu Val Ser Lys Gln Phe Phe Lys Asn Ile Gln
15 20 25

TGC TAT AAC GCT AAT TAC GGG CGT GAA GTC TCA GCG AGA ATT TAT GAG 147
Cys Tyr Asn Ala Asn Tyr Gly Arg Glu Val Ser Ala Arg Ile Tyr Glu
30 35 40 45

ATT TTA AAC GCA ATC GCT CAG TCT AAA GAG AGT GAA TTC CTT ATT TTG 195
Ile Leu Asn Ala Ile Ala Gln Ser Lys Glu Ser Glu Phe Leu Ile Leu
50 55 60

GTT AGC GAT TTG AAT CTG AAT TTG AAT GAA GCA GAG TAT TTG CAG GAT 243
Val Ser Asp Leu Asn Leu Asn Leu Asn Glu Ala Glu Tyr Leu Gln Asp
65 70 75

AAG ATC CAA GAA CAC CGC TTG CAA AAT AAA AAC ATT CAA ATC CAG CTT 291
Lys Ile Gln Glu His Arg Leu Gln Asn Lys Asn Ile Gln Ile Gln Leu
80 85 90

TTA GAT CAC CAT ATC AGC GGT AAG GAA GTG GCT GAG AGT TTC CAT TGG 339
Leu Asp His His Ile Ser Gly Lys Glu Val Ala Glu Ser Phe His Trp
95 100 105

TAT TTT TTA GAC ATT AAC CGT TGC GCG ACT AAA ATC GTG TAT GAA TTT 387
Tyr Phe Leu Asp Ile Asn Arg Cys Ala Thr Lys Ile Val Tyr Glu Phe
110 115 120 125

TTG AAA AAG CAT TAC GCT ATT TTA GAG CCA AAA AAC ACA ACA TGG CTA 435
Leu Lys Lys His Tyr Ala Ile Leu Glu Pro Lys Asn Thr Thr Trp Leu
130 135 140

GAG CCT TTA GTG GAA ATG GTC AAT TCT GTG GAT ATT TGG GAC ACG CAA 483
Glu Pro Leu Val Glu Met Val Asn Ser Val Asp Ile Trp Asp Thr Gln
145 150 155

GGT TAT GGC TTT GAA TTA GGC AAG GTG TGC ATG CGC ATG ATT AAC CAA 531
Gly Tyr Gly Phe Glu Leu Gly Lys Val Cys Met Arg Met Ile Asn Gln
160 165 170

AGC TCT GAA TTG AAT CGT TTC ATG TTT GAT GAT GAA AAC CGC AAC TAT 579
Ser Ser Glu Leu Asn Arg Phe Met Phe Asp Asp Glu Asn Arg Asn Tyr
175 180 185

AAA TTA AAG CTT TTA GAA GAA GTT AAA AAC TAT TTG TTT TTA GAA AAT 627
Lys Leu Lys Leu Leu Glu Glu Val Lys Asn Tyr Leu Phe Leu Glu Asn
190 195 200 205

GCC CCT GTA GCC TAT GAT AAC GAT TTG TTC AAA CTC AAA AAA ATC GCT 675
Ala Pro Val Ala Tyr Asp Asn Asp Leu Phe Lys Leu Lys Lys Ile Ala
210 215 220
Glu Gly Leu Val Gly Lys Ala Pro Ser Val Gly Leu Glu Lys Glu Ile
20 25 30

Ser Ser Ser Asp Glu Ser Phe Leu Lys Leu Ile Ser Thr Pro Tyr Ala
35 40 45

Arg Val Val Ile Ser Ala Ile Glu Lys Glu Glu Ser Ser Ile Glu Leu
50 55 60

Leu Ala Pro Val Val Leu Val Thr Ser Leu Ser Asp Leu Met Leu Gly
65 70 75 80

Gly Glu Gly Ala Ser Lys Glu Glu Met Asp Asn Asp Asp Leu Asp Ala
85 90 95

Phe Lys Glu Met Ala Ser Asn Ile Phe Gly Ala Ile Ala Thr Ser Leu
100 105 110

Lys Ser Gln Glu Leu Leu Pro Lys Leu Asn Phe Thr Thr Ile Asn Ala
115 120 125

Glu Ile Ala Lys Glu Leu Pro Lys Lys Glu Asp Tyr Ala Lys Ala Met
130 135 140

Val Phe Ser Phe Lys Met Glu Ala Ile Lys Glu Ser Gln Ile Ile Leu
145 150 155 160

Leu Thr Thr Ala Ala Phe Glu Gly Gln Phe Glu Lys Thr His Lys Glu
165 170 175

Glu Lys Glu Glu Thr Thr Glu Gly Val Ala Glu Glu Val Lys Thr His
180 185 190

Asp Ala Ser Leu Glu Asn Ile Glu Ile Arg Asn Ile Ser Met Leu Leu
195 200 205

Asp Val Lys Leu Asn Val Lys Val Arg Ile Gly Gln Lys Lys Met Ile
210 215 220

Leu Lys Asp Val Val Ser Met Asp Ile Gly Ser Val Val Glu Leu Asp
225 230 235 240

Gln Leu Val Asn Asp Pro Leu Glu Ile Leu Val Asp Asp Lys Val Ile
245 250 255

Ala Lys Gly Glu Val Val Ile Val Asp Gly Asn Phe Gly Ile Gln Ile
260 265 270

Thr Asp Ile Gly Thr Lys Lys Glu Arg Leu Glu Gln Leu Lys His
275 280 285

(2) INFORMATION FOR SEQ ID NO: 951:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1111 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 13...1056
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951:

TAAAAAAGGA GA ATG ATG CAA GTT TAC CAC CTT TCA CAC ATT GAT TTA GAC 51
TAC GCT AAA GCG ATG GTG TTT TCT TTT AAA ATG GAA GCC ATC AAA GAA 483
Tyr Ala Lys Ala Met Val Phe Ser Phe Lys Met Glu Ala Ile Lys Glu
140 145 150 155

AGC CAA ATC ATT TTA TTG ACT ACG GCG GCT TTT GAG GGC CAA TTT GAA 531
Ser Gln Ile Ile Leu Leu Thr Thr Ala Ala Phe Glu Gly Gln Phe Glu
160 165 170

AAA ACG CAT AAA GAA GAA AAA GAA GAA ACG ACA GAG GGC GTT GCT GAA 579
Lys Thr His Lys Glu Glu Lys Glu Glu Thr Thr Glu Gly Val Ala Glu
175 180 185

GAG GTT AAA ACC CAT GAT GCG TCT TTA GAA AAC ATA GAA ATC CGC AAT 627
Glu Val Lys Thr His Asp Ala Ser Leu Glu Asn Ile Glu Ile Arg Asn
190 195 200

ATC AGC ATG CTT TTA GAC GTG AAA TTG AAC GTT AAG GTG CGC ATC GGG 675
Ile Ser Met Leu Leu Asp Val Lys Leu Asn Val Lys Val Arg Ile Gly
205 210 215

CAA AAA AAA ATG ATT TTA AAA GAC GTG GTC TCT ATG GAT ATA GGG AGC 723
Gln Lys Lys Met Ile Leu Lys Asp Val Val Ser Met Asp Ile Gly Ser
220 225 230 235

GTG GTA GAG CTG GAT CAA TTG GTG AAT GAC CCT TTG GAA ATT CTT GTA 771
Val Val Glu Leu Asp Gln Leu Val Asn Asp Pro Leu Glu Ile Leu Val
240 245 250

GAT GAC AAG GTG ATC GCT AAG GGC GAA GTG GTG ATT GTG GAT GGG AAT 819
Asp Asp Lys Val Ile Ala Lys Gly Glu Val Val Ile Val Asp Gly Asn
255 260 265

TTT GGC ATT CAA ATC ACG GAT ATT GGC ACT AAA AAA GAA CGC TTA GAA 867
Phe Gly Ile Gln Ile Thr Asp Ile Gly Thr Lys Lys Glu Arg Leu Glu
270 275 280

CAA TTG AAA CAT TAAATCTTTT TATCATAAAA AGGAAAGGGA TATG 913
Gln Leu Lys His
285



(2) INFORMATION FOR SEQ ID NO: 950:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 287 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950:

Met Gln Asp Phe Ile Lys Ile Phe Ile Gln Glu Val Val Ser Thr Leu
1 5 10 15






WO 98/43478



PCT/US98/06371 



(2) INFORMATION FOR SEQ ID NO: 949:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 913 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 19...879
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949:

ATAGAAGAAG AGTGAGAA ATG CAA GAT TTT ATT AAG ATT TTT ATT CAA GAG 51
Met Gln Asp Phe Ile Lys Ile Phe Ile Gln Glu
1 5 10

GTT GTC TCT ACT TTA GAA GGG TTA GTG GGT AAG GCT CCA AGC GTG GGA 99
Val Val Ser Thr Leu Glu Gly Leu Val Gly Lys Ala Pro Ser Val Gly
15 20 25

TTA GAA AAA GAA ATT TCT AGT AGC GAC GAA TCT TTT TTG AAA TTA ATC 147
Leu Glu Lys Glu Ile Ser Ser Ser Asp Glu Ser Phe Leu Lys Leu Ile
30 35 40

AGC ACG CCT TAT GCA AGA GTT GTG ATA AGC GCG ATT GAA AAA GAA GAG 195
Ser Thr Pro Tyr Ala Arg Val Val Ile Ser Ala Ile Glu Lys Glu Glu
45 50 55

AGC TCT ATT GAA TTA CTG GCT CCG GTA GTT TTA GTT ACC TCT TTA AGC 243
Ser Ser Ile Glu Leu Leu Ala Pro Val Val Leu Val Thr Ser Leu Ser
60 65 70 75

GAT TTG ATG CTA GGA GGT GAG GGA GCG AGT AAG GAA GAA ATG GAT AAT 291
Asp Leu Met Leu Gly Gly Glu Gly Ala Ser Lys Glu Glu Met Asp Asn
80 85 90

GAC GAT TTA GAC GCT TTT AAA GAA ATG GCT TCT AAT ATT TTT GGC GCG 339
Asp Asp Leu Asp Ala Phe Lys Glu Met Ala Ser Asn Ile Phe Gly Ala
95 100 105

ATC GCT ACA AGC TTG AAG TCT CAA GAA TTG CTC CCT AAA CTC AAT TTC 387
Ile Ala Thr Ser Leu Lys Ser Gln Glu Leu Leu Pro Lys Leu Asn Phe
110 115 120

ACC ACT ATA AAC GCT GAA ATC GCT AAA GAG CTT CCT AAA AAA GAA GAT 435
Thr Thr Ile Asn Ala Glu Ile Ala Lys Glu Leu Pro Lys Lys Glu Asp
125 130 135
115 120 125

GTC AAT GCG TTA GGC AAG TAT AAA ACC GTT TCT CAA GTC GTG GCG ATT 493
Val Asn Ala Leu Gly Lys Tyr Lys Thr Val Ser Gln Val Val Ala Ile
130 135 140

GGT GCT TTA TTG GCT GAT GTA ACT TAC TCT TAT GCG CTT GTG GCT ATA 541
Gly Ala Leu Leu Ala Asp Val Thr Tyr Ser Tyr Ala Leu Val Ala Ile
145 150 155 160

GCG GTT TTT TTA ACC CTT TAT TCG GGG ATA GAT TAC ACC ATT AAA TAT 589
Ala Val Phe Leu Thr Leu Tyr Ser Gly Ile Asp Tyr Thr Ile Lys Tyr
165 170 175

TAT AAA TCT TAATATTTTA AAAGAAGTTT TTAGCGTTCT TT 630
Tyr Lys Ser



(2) INFORMATION FOR SEQ ID NO: 948:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 179 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948:

Met Lys Val Leu Lys Leu Leu Pro Asn Phe Leu Thr Ile Leu Arg Ile
1 5 10 15

Val Leu Ser Leu Phe Leu Leu Phe Leu Leu Leu Asn Thr Arg Thr Tyr
20 25 30

Phe Ser Phe Leu Thr Pro Phe Gln Thr Asn Met Ile Ser Ser Leu Val
35 40 45

Phe Leu Phe Ala Ala Leu Thr Asp Leu Leu Asp Gly Tyr Ile Ala Arg
50 55 60

Ser Tyr Lys Ala Lys Ser Arg Phe Gly Glu Ile Phe Asp Pro Leu Ala
65 70 75 80

Asp Lys Ile Leu Ile Leu Ser Ala Phe Leu Gly Leu Val Tyr Leu Asp
85 90 95

Arg Val Asn Ala Trp Ile Pro Phe Val Ile Leu Gly Arg Glu Phe Phe
100 105 110

Ile Ser Gly Leu Arg Val Leu Ala Ala Asn Glu Lys Lys Asp Ile Pro
115 120 125

Val Asn Ala Leu Gly Lys Tyr Lys Thr Val Ser Gln Val Val Ala Ile
130 135 140

Gly Ala Leu Leu Ala Asp Val Thr Tyr Ser Tyr Ala Leu Val Ala Ile
145 150 155 160

Ala Val Phe Leu Thr Leu Tyr Ser Gly Ile Asp Tyr Thr Ile Lys Tyr
165 170 175

Tyr Lys Ser
Leu Thr Thr Ala Val Val Leu Met Cys Val Ala Leu Gly Phe Leu Ala
180 185 190

Arg Ala Phe Leu Asn Phe Ser Gln Val Phe Leu Lys Ala
195 200 205

(2) INFORMATION FOR SEQ ID NO: 947:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 630 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 62...598
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947:

GTGTGGCGCT TGGTTTTTTG GCGCGCGCGT TTTTGAATTT TTCACAAGTC TTTTTGAAAG 60
C ATG AAA GTT TTA AAA CTC CTG CCT AAT TTT TTA ACA ATT TTA CGC ATT 109
Met Lys Val Leu Lys Leu Leu Pro Asn Phe Leu Thr Ile Leu Arg Ile
1 5 10 15

GTC TTA TCC TTA TTT TTA TTA TTT TTA TTG TTA AAC ACG CGC ACT TAT 157
Val Leu Ser Leu Phe Leu Leu Phe Leu Leu Leu Asn Thr Arg Thr Tyr
20 25 30

TTT AGT TTT TTA ACC CCC TTT CAA ACC AAT ATG ATC TCT TCA TTG GTT 205
Phe Ser Phe Leu Thr Pro Phe Gln Thr Asn Met Ile Ser Ser Leu Val
35 40 45

TTT TTG TTT GCC GCG CTC ACG GAT TTA TTG GAC GGC TAC ATC GCT AGA 253
Phe Leu Phe Ala Ala Leu Thr Asp Leu Leu Asp Gly Tyr Ile Ala Arg
50 55 60

AGC TAT AAA GCC AAA TCG CGC TTT GGG GAA ATC TTT GAT CCT TTA GCG 301
Ser Tyr Lys Ala Lys Ser Arg Phe Gly Glu Ile Phe Asp Pro Leu Ala
65 70 75 80

GAT AAA ATC CTT ATT TTG AGC GCG TTT TTA GGG TTA GTT TAT TTG GAT 349
Asp Lys Ile Leu Ile Leu Ser Ala Phe Leu Gly Leu Val Tyr Leu Asp
85 90 95

CGT GTG AAT GCG TGG ATC CCG TTT GTG ATT TTA GGG CGT GAA TTT TTT 397
Arg Val Asn Ala Trp Ile Pro Phe Val Ile Leu Gly Arg Glu Phe Phe
100 105 110

ATT TCA GGG CTT AGA GTC TTA GCC GCT AAT GAG AAA AAG GAT ATT CCT 445
Ile Ser Gly Leu Arg Val Leu Ala Ala Asn Glu Lys Lys Asp Ile Pro
Ser Ile Phe Tyr Leu His Lys Leu Tyr Arg Glu Leu Ser Leu Tyr Thr
130 135 140

CAA GAA AGG TTT TTT TTA AGC GGG TTT AGG TTG TTG CTT TTT AGT TTC 481
Gln Glu Arg Phe Phe Leu Ser Gly Phe Arg Leu Leu Leu Phe Ser Phe
145 150 155 160

ATG CTG GCT CTT TTA GGG ATT TTA GTG CAA GCG TTA GTT ATC ATT TTT 529
Met Leu Ala Leu Leu Gly Ile Leu Val Gln Ala Leu Val Ile Ile Phe
165 170 175

TTA ACG ACC GCT GTG GTT TTA ATG TGT GTG GCG CTT GGT TTT TTG GCG 577
Leu Thr Thr Ala Val Val Leu Met Cys Val Ala Leu Gly Phe Leu Ala
180 185 190

CGC GCG TTT TTG AAT TTT TCA CAA GTC TTT TTG AAA GCA TGAAAGTTTT AA 628
Arg Ala Phe Leu Asn Phe Ser Gln Val Phe Leu Lys Ala
195 200 205

AACTCCTGCC TAATTT 644

(2) INFORMATION FOR SEQ ID NO: 946:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 205 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 946:

Arg Tyr Phe Leu Gln Asn Ile Ile His Ile His Gln Asn Lys Glu Leu
1 5 10 15

Gln Phe Ile Lys Lys Cys Leu Leu Gly Tyr Phe Phe Ala Pro Leu Cys
20 25 30

Gly Ala Ile Leu Leu Val Leu Phe Ile Val Ser Ser Gly Ala Lys Ser
35 40 45

Phe Gln Ile Ser Asn Leu Phe Asn Asn Gln Leu Ala Tyr Ile Val Leu
50 55 60

Leu Ser Leu Phe Leu Cys Ala Leu Gly Phe Ile Ala Gly Ala Ile Gly
65 70 75 80

Phe Tyr Arg Leu Ser Lys Ile Thr Arg His Leu Ser Phe Phe Glu Asn
85 90 95

Phe Ala Phe Ser Phe Leu Ala Val Ile Leu Cys Ala Ile Leu Ser Tyr
100 105 110

Leu Val Pro Asn Ala Ser Asn Ala Leu Ser Leu Ile Gly Asn Gly Val
115 120 125

Ser Ile Phe Tyr Leu His Lys Leu Tyr Arg Glu Leu Ser Leu Tyr Thr
130 135 140

Gln Glu Arg Phe Phe Leu Ser Gly Phe Arg Leu Leu Leu Phe Ser Phe
145 150 155 160

Met Leu Ala Leu Leu Gly Ile Leu Val Gln Ala Leu Val Ile Ile Phe
165 170 175
Val Asp Gly Gly Thr Thr Phe Lys
145 150 

(2) INFORMATION FOR SEQ ID NO: 945:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 644 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 2...616
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945:

A AGA TAT TTC TTG CAA AAC ATT ATC CAC ATC CAC CAA AAC AAA GAG TTG 49
Arg Tyr Phe Leu Gln Asn Ile Ile His Ile His Gln Asn Lys Glu Leu
1 5 10 15

CAA TTC ATT AAA AAA TGC TTG TTG GGC TAT TTT TTC GCC CCT TTG TGT 97
Gln Phe Ile Lys Lys Cys Leu Leu Gly Tyr Phe Phe Ala Pro Leu Cys
20 25 30

GGG GCT ATT CTG TTA GTG CTT TTT ATT GTT TCA AGC GGG GCA AAA TCG 145
Gly Ala Ile Leu Leu Val Leu Phe Ile Val Ser Ser Gly Ala Lys Ser
35 40 45

TTT CAA ATT TCT AAT CTC TTT AAC AAT CAA CTA GCC TAT ATC GTT TTG 193
Phe Gln Ile Ser Asn Leu Phe Asn Asn Gln Leu Ala Tyr Ile Val Leu
50 55 60

TTG TCT CTT TTT TTG TGC GCG CTT GGG TTT ATT GCC GGA GCG ATT GGT 241
Leu Ser Leu Phe Leu Cys Ala Leu Gly Phe Ile Ala Gly Ala Ile Gly
65 70 75 80

TTT TAT AGG CTT TCT AAA ATC ACA CGC CAT CTG AGT TTT TTT GAA AAT 289
Phe Tyr Arg Leu Ser Lys Ile Thr Arg His Leu Ser Phe Phe Glu Asn
85 90 95

TTC GCT TTC AGT TTT TTA GCG GTG ATT TTA TGC GCT ATT TTA AGC TAT 337
Phe Ala Phe Ser Phe Leu Ala Val Ile Leu Cys Ala Ile Leu Ser Tyr
100 105 110

CTT GTC CCT AAC GCC AGT AAC GCT CTT TCG CTA ATC GGT AAT GGC GTT 385
Leu Val Pro Asn Ala Ser Asn Ala Leu Ser Leu Ile Gly Asn Gly Val
115 120 125

TCT ATT TTT TAT TTG CAC AAA CTC TAT AGA GAA TTG AGC CTT TAC ACG 433
Asn Ala Val Glu Thr Met Val Lys Tyr Ala Ala Val Asp Leu Gly Glu
65 70 75



TTT AAC ATT AGA GTG AAT GCG GTT AGT GGC GGG CCT ATT GAT ACG GAC 350
Phe Asn Ile Arg Val Asn Ala Val Ser Gly Gly Pro Ile Asp Thr Asp
80 85 90

GCT TTG AAA GCC TTC CCT GAT TAT GTG GAG ATT AAA GAA AAA GTA GAA 398
Ala Leu Lys Ala Phe Pro Asp Tyr Val Glu Ile Lys Glu Lys Val Glu
95 100 105

GAG CAA TCG CCC CTA AAA CGC ATG GGC AAT CCT AAC GAT CTA GCC GGA 446
Glu Gln Ser Pro Leu Lys Arg Met Gly Asn Pro Asn Asp Leu Ala Gly
110 115 120

GCG GCT TAT TTT TTA TGC GAT GAG ACC CAA AGC GGT TGG CTT ACA GGG 494
Ala Ala Tyr Phe Leu Cys Asp Glu Thr Gln Ser Gly Trp Leu Thr Gly
125 130 135 140

CAA ACG ATC GTT GTA GAT GGC GGG ACT ACT TTT AAA TAAAGATATT TCTTGC 546
Gln Thr Ile Val Val Asp Gly Gly Thr Thr Phe Lys
145 150

(2) INFORMATION FOR SEQ ID NO: 944:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944:



Met Arg Leu Lys Pro Lys Gly Leu Asn Asn Ile Tyr Thr Ala Thr Val
1 5 10 15

Leu Ala Phe Val Val Gly Ala Gln Glu Ala Ala Lys Arg Met Gln Lys
20 25 30

Ile Gly Gly Gly Ala Ile Val Ser Leu Ser Ser Thr Gly Asn Leu Val
35 40 45

Tyr Met Pro Asn Tyr Ala Gly His Gly Asn Ser Lys Asn Ala Val Glu
50 55 60

Thr Met Val Lys Tyr Ala Ala Val Asp Leu Gly Glu Phe Asn Ile Arg
65 70 75 80

Val Asn Ala Val Ser Gly Gly Pro Ile Asp Thr Asp Ala Leu Lys Ala
85 90 95

Phe Pro Asp Tyr Val Glu Ile Lys Glu Lys Val Glu Glu Gln Ser Pro
100 105 110

Leu Lys Arg Met Gly Asn Pro Asn Asp Leu Ala Gly Ala Ala Tyr Phe
115 120 125

Leu Cys Asp Glu Thr Gln Ser Gly Trp Leu Thr Gly Gln Thr Ile Val
130 135 140
130 135 140

Phe Ser Leu Lys Asn Ile Ala Phe Ala Lys His Lys Asp Asn Leu Pro
145 150 155 160

Lys His Leu Asn Ser Lys Asp Leu Lys Ser Phe Ile Tyr Thr Leu Ile
165 170 175

Asn Tyr Arg Thr Arg Ser Ser Tyr Glu Lys Arg Asn Lys Cys Ile Leu
180 185 190

Leu Leu Ile Ile Leu Gly Gly Leu Arg Lys Ser Glu Val Phe Asn Leu
195 200 205

Glu Leu Arg Asn Ile Val Leu Glu Lys Glu His Tyr Ile Leu Leu Ile
210 215 220

Lys Gly Lys Asn Asn Lys Glu Arg Lys Ala Phe Ile Lys Ile Ala Gln
225 230 235 240

Thr Asp Ile Asp Thr Leu Ala Pro Leu Ile Arg Ile Leu Leu Glu Ser
245 250 255

Ile Ala Lys Asn Leu Leu Ser His
260

(2) INFORMATION FOR SEQ ID NO: 943:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 546 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 75...530
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943:

TTGACAGAGT GGATTTTTTT ATTTCTAACG CTATTATTTA TGGGCGTTCT GTCGTGGGGG 60

GATTTGCACC GTTT ATG CGA TTA AAA CCT AAG GGG TTA AAC AAC ATT TAC 110
Met Arg Leu Lys Pro Lys Gly Leu Asn Asn Ile Tyr
1 5 10

ACA GCC ACC GTG TTA GCG TTC GTC GTA GGG GCT CAA GAA GCG GCA AAA 158
Thr Ala Thr Val Leu Ala Phe Val Val Gly Ala Gln Glu Ala Ala Lys
15 20 25

CGC ATG CAA AAA ATA GGC GGT GGG GCG ATC GTG AGC TTA AGT TCT ACC 206
Arg Met Gln Lys Ile Gly Gly Gly Ala Ile Val Ser Leu Ser Ser Thr
30 35 40

GGG AAT CTA GTT TAT ATG CCT AAT TAC GCC GGG CAT GGC AAT TCC AAA 254
Gly Asn Leu Val Tyr Met Pro Asn Tyr Ala Gly His Gly Asn Ser Lys
45 50 55 60

AAC GCG GTA GAA ACC ATG GTC AAA TAC GCT GCC GTG GAT TTA GGC GAA 302
AGA ACT AGA AGC AGT TAT GAA AAG AGA AAT AAG TGT ATT TTG CTC TTG 633
Arg Thr Arg Ser Ser Tyr Glu Lys Arg Asn Lys Cys Ile Leu Leu Leu
180 185 190

ATT ATT TTG GGT GGT TTG AGA AAA TCT GAG GTT TTT AAT TTA GAA TTG 681
Ile Ile Leu Gly Gly Leu Arg Lys Ser Glu Val Phe Asn Leu Glu Leu
195 200 205 210

AGA AAT ATT GTT TTA GAG AAA GAG CAT TAT ATC TTG CTT ATA AAA GGC 729
Arg Asn Ile Val Leu Glu Lys Glu His Tyr Ile Leu Leu Ile Lys Gly
215 220 225

AAA AAC AAT AAA GAG CGA AAA GCG TTC ATT AAA ATC GCT CAA ACA GAT 777
Lys Asn Asn Lys Glu Arg Lys Ala Phe Ile Lys Ile Ala Gln Thr Asp
230 235 240

ATT GAC ACA CTC GCA CCG CTT ATC CGT ATC CTT TTG GAA AGT ATT GCT 825
Ile Asp Thr Leu Ala Pro Leu Ile Arg Ile Leu Leu Glu Ser Ile Ala
245 250 255

AAA AAT CTT TTA TCC CAC TAGCGCGAAA AACTCCGTCC TTTAGGGCGG AGATGTAA 881
Lys Asn Leu Leu Ser His
260

GCGTTTAG 889

(2) INFORMATION FOR SEQ ID NO: 942:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 264 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942:

Met Lys Lys Ser Asn Asp Asn Asn Ala Leu Ala Arg Ser Gln Arg Glu
1 5 10 15

Leu Phe Val Gly Ile Arg Asp Phe Ile Val Phe Lys Phe Lys Arg Met
20 25 30

Val Val Phe Asn Gly Val Arg Asp Phe Thr Lys Met Arg Phe Leu Ser
35 40 45

Ile Glu Leu Glu Lys Cys Glu Asn Ile Lys Asp Leu Glu Lys Leu Cys
50 55 60

His Thr Ile Tyr Asn Gln Gly Thr Lys His Ile Leu Met Met Arg Val
65 70 75 80

Leu Phe Leu Phe Phe Asp Tyr Phe Cys Lys His Leu Lys Val Lys Arg
85 90 95

Leu Arg Leu Leu Asn Glu Glu Met Leu Val Asn Phe Leu Phe Glu Leu
100 105 110

Ala Lys Gln Arg Lys Ile Asn Ser Met Ala Lys Tyr Val Met Tyr Ile
115 120 125

Arg Gln Phe Phe Asp Tyr Leu Asp Arg Thr Lys His Tyr Glu Phe Tyr
(B) LOCATION: 52...843
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941: 

ATTACATTCT TTTTGATTTC TATTGAAAAA TTTAATATTA AGAGGACTTT T ATG AAA 57
Met Lys
1

AAA TCA AAT GAC AAT AAC GCA CTC GCC AGA AGT CAA AGG GAG TTG TTT 105
Lys Ser Asn Asp Asn Asn Ala Leu Ala Arg Ser Gln Arg Glu Leu Phe
5 10 15

GTA GGG ATT AGG GAT TTT ATT GTT TTT AAA TTT AAG CGT ATG GTT GTT 153
Val Gly Ile Arg Asp Phe Ile Val Phe Lys Phe Lys Arg Met Val Val
20 25 30

TTT AAC GGA GTA AGG GAT TTT ACA AAA ATG AGA TTT TTG TCC ATA GAA 201
Phe Asn Gly Val Arg Asp Phe Thr Lys Met Arg Phe Leu Ser Ile Glu
35 40 45 50

TTA GAA AAA TGC GAA AAT ATT AAA GAT TTG GAA AAA TTA TGT CAT ACA 249
Leu Glu Lys Cys Glu Asn Ile Lys Asp Leu Glu Lys Leu Cys His Thr
55 60 65

ATT TAT AAT CAA GGC ACA AAG CAT ATT TTG ATG ATG CGT GTA TTG TTT 297
Ile Tyr Asn Gln Gly Thr Lys His Ile Leu Met Met Arg Val Leu Phe
70 75 80

TTA TTC TTT GAC TAT TTC TGC AAG CAT TTG AAA GTT AAG CGA TTG AGA 345
Leu Phe Phe Asp Tyr Phe Cys Lys His Leu Lys Val Lys Arg Leu Arg
85 90 95

CTA CTC AAT GAA GAA ATG CTT GTG AAT TTT TTA TTT GAG TTA GCT AAA 393
Leu Leu Asn Glu Glu Met Leu Val Asn Phe Leu Phe Glu Leu Ala Lys
100 105 110

CAA AGA AAA ATT AAT TCA ATG GCA AAA TAT GTG ATG TAT ATT AGG CAA 441
Gln Arg Lys Ile Asn Ser Met Ala Lys Tyr Val Met Tyr Ile Arg Gln
115 120 125 130

TTT TTT GAT TAC TTG GAT AGG ACT AAA CAT TAT GAA TTT TAT TTT AGT 489
Phe Phe Asp Tyr Leu Asp Arg Thr Lys His Tyr Glu Phe Tyr Phe Ser
135 140 145

CTT AAA AAT ATA GCC TTT GCT AAA CAC AAG GAT AAT TTG CCT AAG CAT 537
Leu Lys Asn Ile Ala Phe Ala Lys His Lys Asp Asn Leu Pro Lys His
150 155 160

CTA AAT TCA AAA GAT TTA AAA TCT TTT ATA TAT ACT CTT ATA AAC TAT 585
Leu Asn Ser Lys Asp Leu Lys Ser Phe Ile Tyr Thr Leu Ile Asn Tyr
165 170 175
CAA ATC ATC TCT TTT TTA GAC TAC AAA CAA CAG CAT AAT GGG AAA TTG 489
Gln Ile Ile Ser Phe Leu Asp Tyr Lys Gln Gln His Asn Gly Lys Leu
135 140 145

TTA GTG AAA GTT TCC CCC ACA ATA TAC GAG TAAAAGTTGG CATTGTTGTG GGA 542
Leu Val Lys Val Ser Pro Thr Ile Tyr Glu
150 155

ATATCAAGCA CAAGC 557

(2) INFORMATION FOR SEQ ID NO: 940:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 157 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940:

Met Lys Glu Leu Leu Gly Ile Glu Ile Asp Glu Glu Leu Asp Thr Lys
1 5 10 15

Arg Leu Ile Pro Thr Tyr Ser Lys Leu Tyr Ser Leu Lys Lys Tyr Ser
20 25 30

Lys Lys Phe Lys Arg Leu Gln Arg Lys Gln Ser Arg Arg Val Leu Lys
35 40 45

Ser Lys Gln Asn Lys Thr Lys Leu Gly Gly Asn Phe Tyr Lys Thr Gln
50 55 60

Lys Lys Leu Asn Gln Ala Phe Asp Lys Ser Ser His Gln Lys Thr Asp
65 70 75 80

Arg Tyr His Lys Ile Thr Ser Glu Leu Ser Lys Gln Phe Glu Leu Ile
85 90 95

Val Val Glu Asp Leu Gln Val Lys Asn Met Thr Lys Arg Ala Lys Leu
100 105 110

Lys Asn Val Lys Gln Lys Ser Gly Leu Asn Gln Ser Ile Leu Asn Ala
115 120 125

Ser Phe Tyr Gln Ile Ile Ser Phe Leu Asp Tyr Lys Gln Gln His Asn
130 135 140

Gly Lys Leu Leu Val Lys Val Ser Pro Thr Ile Tyr Glu
145 150 155

(2) INFORMATION FOR SEQ ID NO: 941: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 




(2) INFORMATION FOR SEQ ID NO: 939:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 557 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 49...519
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 939:

ATAAACAACC ATGACAAACT AACGGACTTT AAGCAATACC AAACAGAC ATG AAA GAA 57
Met Lys Glu
1

TTA CTA GGG ATA GAA ATA GAT GAA GAG CTG GAT ACT AAA CGA CTT ATC 105
Leu Leu Gly Ile Glu Ile Asp Glu Glu Leu Asp Thr Lys Arg Leu Ile
5 10 15

CCT ACT TAT TCC AAA TTG TAT TCT TTA AAA AAA TAC TCT AAA AAA TTT 153
Pro Thr Tyr Ser Lys Leu Tyr Ser Leu Lys Lys Tyr Ser Lys Lys Phe
20 25 30 35

AAA AGA TTA CAA AGA AAA CAA AGC CGT AGG GTG TTA AAG TCT AAA CAA 201
Lys Arg Leu Gln Arg Lys Gln Ser Arg Arg Val Leu Lys Ser Lys Gln
40 45 50

AAC AAA ACC AAA TTA GGA GGT AAT TTT TAC AAA ACC CAA AAG AAA TTA 249
Asn Lys Thr Lys Leu Gly Gly Asn Phe Tyr Lys Thr Gln Lys Lys Leu
55 60 65

AAC CAA GCC TTT GAC AAG TCT AGT CAT CAA AAA ACA GAC AGA TAC CAT 297
Asn Gln Ala Phe Asp Lys Ser Ser His Gln Lys Thr Asp Arg Tyr His
70 75 80

AAA ATC ACA AGC GAA CTT TCA AAG CAA TTT GAA TTG ATA GTA GTT GAA 345
Lys Ile Thr Ser Glu Leu Ser Lys Gln Phe Glu Leu Ile Val Val Glu
85 90 95

GAT TTG CAA GTA AAA AAC ATG ACT AAA AGA GCT AAA CTC AAA AAT GTT 393
Asp Leu Gln Val Lys Asn Met Thr Lys Arg Ala Lys Leu Lys Asn Val
100 105 110 115

AAA CAA AAG AGT GGG CTT AAT CAA TCT ATT TTA AAC GCT TCA TTC TAT 441
Lys Gln Lys Ser Gly Leu Asn Gln Ser Ile Leu Asn Ala Ser Phe Tyr
120 125 130
TGG GTA GTG CAA CTT CTC AAA TAGAAAGTTT GAAAAAAAGA GAAAATGCCC TATT 874
Trp Val Val Gln Leu Leu Lys
270

TGATCATTTA 884

(2) INFORMATION FOR SEQ ID NO: 938:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 273 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938:

Met Ala Leu Glu Lys Ser Tyr Ser Lys Asn Phe Glu Ser Asp Glu Leu
1 5 10 15

Phe Asp Tyr Glu Ile Ile Lys Pro Lys Lys Thr Leu Lys Ile Gln Tyr
20 25 30

Thr Tyr Ala Lys Arg Tyr Tyr Lys Glu Val Glu Lys Phe Ala Lys Asn
35 40 45

Leu Thr Gln Leu Thr Gln Glu Glu Phe Met Arg Leu Arg Glu Pro Gln
50 55 60

Lys Gln Val Val Ile Lys Asn Ile Gly Asn Met Thr Arg Leu His Ser
65 70 75 80

Lys Arg Ala Met Asp Tyr Ile Ala Lys His Gly Glu Leu Val Arg Asp
85 90 95

Glu Phe Phe Asn Glu Val Asn Tyr Asn Asp Ile Ala Glu Gln Trp Asn
100 105 110

Glu Gln Phe Glu Lys Leu Leu Glu Asn Lys Ser Arg Val Lys Asn Cys
115 120 125

Ala Leu His Leu Val Phe Ser Ile Asp Glu Asn Cys Asn Glu Lys Asn
130 135 140

Leu Lys Ala Leu Glu Leu Ser Val Tyr Gln Thr Leu Thr Asn Thr Leu
145 150 155 160

Gly Tyr Asp Tyr Pro Phe Ile Met Lys Leu His Thr His Gln Asn Asn
165 170 175

Pro His Ala His Val Ile Ile Asn Lys Thr Asn Lys Ile Thr Asn Lys
180 185 190

Gln Leu Cys Phe Asn Ser Lys Asp Ser Cys Lys Glu Phe Tyr His Thr
195 200 205

Leu Arg Glu Thr Phe Lys Asp Tyr Leu Phe Ala Asn Ser Lys Gly Glu
210 215 220

Leu Gln Tyr Ser Asn Thr Pro Asn Ile Tyr Lys Ala Ile Lys Asp Ile
225 230 235 240

Glu Thr Glu Leu Asp Ala Leu Glu Asn Arg Leu Glu Thr Ile Arg Val
245 250 255

Leu Gly Met Lys Thr Ile Phe Ile Lys Phe Trp Val Val Gln Leu Leu
260 265 270

Lys
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GAA AAG TTT GCT AAA AAT TTA ACC CAA CTG ACA CAA GAA GAA TTC ATG 195 
Glu Lys Phe Ala Lys Asn Leu Thr Gln Leu Thr Gln Glu Glu Phe Met
45 50 55

CGT TTA AGA GAG CCA CAA AAA CAA GTG GTC ATC AAA AAC ATA GGC AAT 243
Arg Leu Arg Glu Pro Gln Lys Gln Val Val Ile Lys Asn Ile Gly Asn
60 65 70

ATG ACA CGC CTG CAT TCA AAA AGA GCG ATG GAT TAT ATC GCT AAA CAT 291
Met Thr Arg Leu His Ser Lys Arg Ala Met Asp Tyr Ile Ala Lys His
75 80 85 90

GGT GAG CTA GTG AGA GAT GAA TTT TTT AAT GAA GTT AAT TAT AAT GAC 339
Gly Glu Leu Val Arg Asp Glu Phe Phe Asn Glu Val Asn Tyr Asn Asp
95 100 105

ATA GCA GAG CAA TGG AAT GAG CAA TTT GAA AAA TTA TTA GAA AAT AAG 387
Ile Ala Glu Gln Trp Asn Glu Gln Phe Glu Lys Leu Leu Glu Asn Lys
110 115 120

AGC CGT GTT AAA AAT TGC GCT TTA CAT CTA GTG TTT AGC ATT GAT GAA 435
Ser Arg Val Lys Asn Cys Ala Leu His Leu Val Phe Ser Ile Asp Glu
125 130 135

AAT TGT AAT GAA AAA AAT TTA AAA GCT TTG GAA TTA AGC GTG TAT CAA 483
Asn Cys Asn Glu Lys Asn Leu Lys Ala Leu Glu Leu Ser Val Tyr Gln
140 145 150

ACA CTC ACT AAC ACG CTA GGT TAT GAT TAT CCT TTT ATA ATG AAA CTC 531
Thr Leu Thr Asn Thr Leu Gly Tyr Asp Tyr Pro Phe Ile Met Lys Leu
155 160 165 170

CAT ACA CAC CAA AAC AAT CCG CAT GCG CAT GTG ATT ATC AAC AAA ACT 579
His Thr His Gln Asn Asn Pro His Ala His Val Ile Ile Asn Lys Thr
175 180 185

AAC AAA ATT ACC AAT AAG CAA CTA TGC TTT AAT TCT AAA GAC AGC TGT 627
Asn Lys Ile Thr Asn Lys Gln Leu Cys Phe Asn Ser Lys Asp Ser Cys
190 195 200

AAA GAG TTT TAC CAC ACA CTA AGA GAA ACA TTT AAA GAT TAT TTA TTT 675
Lys Glu Phe Tyr His Thr Leu Arg Glu Thr Phe Lys Asp Tyr Leu Phe
205 210 215

GCT AAC TCA AAA GGC GAA TTG CAA TAT TCT AAC ACG CCT AAT ATT TAT 723
Ala Asn Ser Lys Gly Glu Leu Gln Tyr Ser Asn Thr Pro Asn Ile Tyr
220 225 230

AAG GCG ATT AAA GAC ATA GAA ACA GAG CTA GAT GCA CTA GAA AAC AGG 771
Lys Ala Ile Lys Asp Ile Glu Thr Glu Leu Asp Ala Leu Glu Asn Arg
235 240 245 250

CTA GAA ACA ATA AGA GTT TTA GGC ATG AAA ACT ATT TTT ATA AAG TTT 819
Leu Glu Thr Ile Arg Val Leu Gly Met Lys Thr Ile Phe Ile Lys Phe
255 260 265
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420 425 430 

Leu Glu Gln Asp Leu Lys Gln Lys Glu Glu Glu Phe Glu Arg Ala Lys
435 440 445

Glu His Tyr Val Lys Ser Ser Lys His Tyr Arg Glu Thr Ser Leu Ser
450 455 460

Pro Lys Glu Lys Gln Gly Phe Leu Lys Gln Ile Lys Gln Phe Ser Lys
465 470 475 480

Ile Ser Lys Asp Ile Leu Tyr Thr Cys Asn Glu Ile Ile Gly Ala Asn
485 490 495

Arg Phe Leu Thr His Tyr Asp Asn Leu Asn Leu Glu Lys Val Leu Glu
500 505 510

His Ala Lys Asp Thr Lys Leu Glu Gln Lys Glu Ile Gln Ala Ile Thr
515 520 525

Lys Glu Pro Asn Asn Asp Glu Pro Trp Ile Glu Phe Gly Lys Lys Glu
530 535 540

Gln Ala Arg Ala Lys Ala His Tyr Gln Ala Met Leu Glu Lys Glu Lys
545 550 555 560

Ala Lys Glu Leu Ala Lys Gln Gln Ala Asn Thr Leu His Ser Asn Glu
565 570 575

Leu Asp Asp Asp Pro Lys Ala His Ala Gly Leu Lys Gln Asn Asp Asn
580 585 590

Thr Asn Phe Lys Gly Arg Asn Arg
595 600

(2) INFORMATION FOR SEQ ID NO: 937:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 884 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 22...840
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937:

GGAGAATAAA ATTACAATAA A ATG GCG TTA GAA AAA AGT TAT AGT AAA AAC 51
Met Ala Leu Glu Lys Ser Tyr Ser Lys Asn
1 5 10

TTT GAA AGC GAT GAG CTT TTT GAT TAT GAG ATC ATC AAG CCC AAA AAG 99
Phe Glu Ser Asp Glu Leu Phe Asp Tyr Glu Ile Ile Lys Pro Lys Lys
15 20 25

ACG CTT AAG ATA CAA TAC ACT TAT GCT AAA CGC TAC TAT AAA GAA GTA 147
Thr Leu Lys Ile Gln Tyr Thr Tyr Ala Lys Arg Tyr Tyr Lys Glu Val
30 35 40
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 6: 

Met Lys Arg Ser His Leu Glu Asn Ala Leu Asn Tyr Ala Leu Glu Asn
1 5 10 15

Ser Glu Thr Ala Tyr Asn Glu Met Phe Leu Glu Cys Asp Lys Gln Phe
20 25 30

Ile Leu Glu Ser Trp Leu Asn Asp Phe Asp Leu Thr Lys Asp Tyr Asn
35 40 45

Glu Thr Met His Leu Val Phe Ser Ile Lys Asp Lys Pro Asp Glu Glu
50 55 60

Thr Met Gln Gly Leu Leu His Ser Thr Trp Glu Ser Leu Lys Ile Arg
65 70 75 80

Leu Pro Glu Tyr Lys Phe Ala Leu Val Pro His Ala His Gln Asp His
85 90 95

Ala His Ile His Cys Phe Ile Asn Lys Thr Asn Gln Leu Thr Arg Arg
100 105 110

Arg Leu Arg Phe Lys Gly His Glu Asp Cys Lys Glu Phe Phe Asn Glu
115 120 125

Leu Arg Ser Glu Phe Ala Tyr Arg Leu Asn Asp His Leu Leu Ser Glu
130 135 140

Glu Tyr Leu Tyr Val Asn Glu Pro Lys Leu Lys Glu Leu Asp Asn Ile
145 150 155 160

Lys Gln Gln Leu Gln Asp Leu Glu Lys Glu Glu Lys Ala Leu Glu Gln
165 170 175

Ile Lys Ser Pro Gln Asp Glu Trp Asp Leu Asn Lys Ala Leu Gln Ser
180 185 190

Glu Tyr Leu Gln Glu Leu Lys Tyr Lys Asn Lys Ala Lys Ala Leu Asp
195 200 205

Ile Gln Asn Asn His Ser Thr Pro Leu Lys Gln Lys Ile Ser Glu Phe
210 215 220

Lys Ile Ala Leu Phe Asn His Lys Asp Thr Ser Asp Asp Glu Lys Glu
225 230 235 240

Gln Leu Asp Ile Asp Arg Ile Asp Lys Arg Lys Pro Val Ser Glu His
245 250 255

Leu Lys Asn Thr Asn Lys His Glu Leu Tyr Glu Leu Leu Gly Phe Tyr
260 265 270

Gln Lys Glu Leu Asp Lys Lys Gln Asn His Ser Ala Phe Lys Asn Phe
275 280 285

Ala Ile Leu Asn Gly Leu Asp Arg Asp Phe Glu Arg Glu Thr Asn Gly
290 295 300

Tyr Ser Val Leu Lys Lys Lys Glu Met Leu Leu Asn Lys Leu Glu His
305 310 315 320

Leu Asp Lys Arg Leu Leu Asp Lys Asn Ser His Leu Leu Leu Ala Gln
325 330 335

Leu Arg Asn Glu Val Lys Thr Lys Gln Asn Ile Gln Tyr Asn Thr Leu
340 345 350

Thr Asn Pro Ile Leu Leu Ala Lys Ala Leu Glu Leu Ser Lys Asp Lys
355 360 365

Arg Pro Thr Leu Lys Thr Phe Lys Asn Ala Tyr Phe Ser Ala Arg Lys
370 375 380

Tyr Gln Phe Met Leu Glu Ser Phe Lys Thr Lys Gln Asn Asp Pro Thr
385 390 395 400

Tyr Lys Leu Asn Asp Asn Thr Tyr Glu Leu Val Ser Lys Gln Leu Gln
405 410 415

Asp Tyr Gln Asn Thr Met Leu Leu Leu Ala Lys Glu Arg Leu Leu Phe
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Leu Phe Leu Glu Gin Asp Leu Lys Gin Lys Glu Glu Glu Phe Glu Arg 
435 440 445 

GCC AAA GAA CAT TAT GTG AAA TCT TCA AAA CAT TAT AGA GAA ACT TCA 1452
Ala Lys Glu His Tyr Val Lys Ser Ser Lys His Tyr Arg Glu Thr Ser
450 455 460

TTG TCT CCA AAA GAA AAA CAA GGC TTT CTC AAA CAA ATT AAA CAA TTT 1500
Leu Ser Pro Lys Glu Lys Gln Gly Phe Leu Lys Gln Ile Lys Gln Phe
465 470 475

TCT AAA ATT TCT AAG GAT ATT CTC TAT ACT TGT AAT GAG ATC ATA GGA 1548
Ser Lys Ile Ser Lys Asp Ile Leu Tyr Thr Cys Asn Glu Ile Ile Gly
480 485 490

GCT AAT AGG TTT TTA ACC CAC TAT GAC AAC CTA AAC CTT GAA AAA GTC 1596
Ala Asn Arg Phe Leu Thr His Tyr Asp Asn Leu Asn Leu Glu Lys Val
495 500 505 510

CTA GAA CAC GCT AAA GAT ACT AAG CTA GAG CAA AAA GAA ATT CAA GCT 1644
Leu Glu His Ala Lys Asp Thr Lys Leu Glu Gln Lys Glu Ile Gln Ala
515 520 525

ATC ACA AAA GAG CCT AAT AAC GAT GAG CCT TGG ATT GAG TTT GGT AAA 1692
Ile Thr Lys Glu Pro Asn Asn Asp Glu Pro Trp Ile Glu Phe Gly Lys
530 535 540

AAA GAA CAA GCT AGA GCT AAA GCA CAC TAT CAA GCT ATG CTA GAA AAA 1740
Lys Glu Gln Ala Arg Ala Lys Ala His Tyr Gln Ala Met Leu Glu Lys
545 550 555

GAA AAA GCT AAA GAA TTA GCT AAA CAA CAA GCT AAC ACC TTG CAC TCT 1788
Glu Lys Ala Lys Glu Leu Ala Lys Gln Gln Ala Asn Thr Leu His Ser
560 565 570

AAT GAG CTT GAT GAT GAC CCT AAA GCT CAT GCT GGA TTA AAA CAA AAT 1836
Asn Glu Leu Asp Asp Asp Pro Lys Ala His Ala Gly Leu Lys Gln Asn
575 580 585 590

GAC AAC ACA AAC TTT AAA GGG CGT AAT AGA TAATGCTCTC AAGCGATGAT TGC 1889
Asp Asn Thr Asn Phe Lys Gly Arg Asn Arg
595 600

CTTTAATGTT CTTAATAAAG AATATACCCT TTGAAAGGGG TTTAT 1934

(2) INFORMATION FOR SEQ ID NO: 936:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 600 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
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210 215 220 

GAA TTT AAA ATC GCT CTG TTT AAT CAC AAA GAC ACA AGC GAT GAT GAA 780
Glu Phe Lys Ile Ala Leu Phe Asn His Lys Asp Thr Ser Asp Asp Glu
225 230 235

AAA GAA CAG CTA GAT ATT GAC AGG ATA GAT AAG AGA AAA CCA GTA AGC 828
Lys Glu Gln Leu Asp Ile Asp Arg Ile Asp Lys Arg Lys Pro Val Ser
240 245 250

GAA CAC TTA AAA AAC ACT AAC AAA CAC GAG CTA TAC GAA CTC TTA GGC 876
Glu His Leu Lys Asn Thr Asn Lys His Glu Leu Tyr Glu Leu Leu Gly
255 260 265 270

TTT TAT CAA AAA GAA TTA GAT AAA AAA CAA AAC CAT TCA GCC TTT AAG 924
Phe Tyr Gln Lys Glu Leu Asp Lys Lys Gln Asn His Ser Ala Phe Lys
275 280 285

AAT TTT GCT ATT CTC AAT GGT TTA GAC AGA GAC TTT GAA AGA GAG ACT 972
Asn Phe Ala Ile Leu Asn Gly Leu Asp Arg Asp Phe Glu Arg Glu Thr
290 295 300

AAT GGC TAT TCT GTT TTA AAG AAA AAA GAA ATG CTT TTA AAT AAG CTT 1020
Asn Gly Tyr Ser Val Leu Lys Lys Lys Glu Met Leu Leu Asn Lys Leu
305 310 315

GAA CAC CTA GAC AAA CGC CTT TTA GAT AAA AAC TCA CAC TTA CTA TTA 1068
Glu His Leu Asp Lys Arg Leu Leu Asp Lys Asn Ser His Leu Leu Leu
320 325 330

GCC CAG CTA AGA AAT GAA GTT AAA ACC AAG CAA AAC ATC CAA TAC AAC 1116
Ala Gln Leu Arg Asn Glu Val Lys Thr Lys Gln Asn Ile Gln Tyr Asn
335 340 345 350

ACT CTA ACT AAT CCT ATT CTT TTA GCC AAA GCC TTA GAA CTT TCT AAA 1164
Thr Leu Thr Asn Pro Ile Leu Leu Ala Lys Ala Leu Glu Leu Ser Lys
355 360 365

GAT AAA CGC CCC ACT CTC AAA ACT TTT AAA AAC GCT TAT TTT AGT GCT 1212
Asp Lys Arg Pro Thr Leu Lys Thr Phe Lys Asn Ala Tyr Phe Ser Ala
370 375 380

AGA AAA TAT CAA TTC ATG CTA GAG AGC TTT AAA ACT AAG CAA AAT GAC 1260
Arg Lys Tyr Gln Phe Met Leu Glu Ser Phe Lys Thr Lys Gln Asn Asp
385 390 395

CCC ACT TAC AAG CTT AAT GAT AAC ACT TAT GAG CTA GTG AGT AAG CAA 1308
Pro Thr Tyr Lys Leu Asn Asp Asn Thr Tyr Glu Leu Val Ser Lys Gln
400 405 410

CTA CAA GAC TAT CAA AAC ACC ATG CTT TTA TTA GCC AAA GAG AGA TTA 1356
Leu Gln Asp Tyr Gln Asn Thr Met Leu Leu Leu Ala Lys Glu Arg Leu
415 420 425 430

CTT TTT TTA GAA CAA GAT TTA AAA CAA AAA GAA GAA GAG TTT GAA AGA 1404
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AAACATAGGG CAAATCTAGT TGGCACAAAA ACAGCTAGTC CTGTGCTTAT TAAAAACATA 60 

GGGCAA ATG AAA CGC TCC CAC TTA GAA AAT GCC CTA AAT TAT GCT TTA 108
Met Lys Arg Ser His Leu Glu Asn Ala Leu Asn Tyr Ala Leu
1 5 10

GAA AAT AGC GAA ACA GCT TAC AAT GAA ATG TTT TTA GAA TGC GAT AAG 156
Glu Asn Ser Glu Thr Ala Tyr Asn Glu Met Phe Leu Glu Cys Asp Lys
15 20 25 30

CAA TTC ATC TTA GAG AGT TGG CTC AAT GAC TTT GAT TTG ACT AAA GAT 204
Gln Phe Ile Leu Glu Ser Trp Leu Asn Asp Phe Asp Leu Thr Lys Asp
35 40 45

TAT AAC GAG ACT ATG CAC TTA GTT TTT TCT ATC AAA GAT AAG CCA GAT 252
Tyr Asn Glu Thr Met His Leu Val Phe Ser Ile Lys Asp Lys Pro Asp
50 55 60

GAA GAG ACA ATG CAA GGG CTT TTA CAT TCT ACT TGG GAG AGC TTA AAA 300
Glu Glu Thr Met Gln Gly Leu Leu His Ser Thr Trp Glu Ser Leu Lys
65 70 75

ATA AGA TTG CCT GAA TAC AAG TTT GCC CTT GTG CCA CAC GCT CAT CAA 348
Ile Arg Leu Pro Glu Tyr Lys Phe Ala Leu Val Pro His Ala His Gln
80 85 90

GAC CAT GCC CAT ATC CAT TGT TTT ATC AAT AAG ACT AAT CAG CTC ACA 396
Asp His Ala His Ile His Cys Phe Ile Asn Lys Thr Asn Gln Leu Thr
95 100 105 110

CGA AGA AGA CTG CGT TTT AAG GGG CAT GAA GAT TGT AAA GAA TTT TTT 444
Arg Arg Arg Leu Arg Phe Lys Gly His Glu Asp Cys Lys Glu Phe Phe
115 120 125

AAT GAA TTA AGA AGT GAG TTT GCT TAT AGG TTG AAT GAC CAC TTA TTG 492
Asn Glu Leu Arg Ser Glu Phe Ala Tyr Arg Leu Asn Asp His Leu Leu
130 135 140

AGC GAA GAA TAC TTG TAT GTC AAT GAG CCA AAA CTT AAA GAG CTA GAC 540
Ser Glu Glu Tyr Leu Tyr Val Asn Glu Pro Lys Leu Lys Glu Leu Asp
145 150 155

AAT ATC AAA CAA CAA TTA CAA GAC TTG GAA AAA GAA GAA AAA GCC TTA 588
Asn Ile Lys Gln Gln Leu Gln Asp Leu Glu Lys Glu Glu Lys Ala Leu
160 165 170

GAA CAA ATC AAA TCC CCA CAA GAT GAG TGG GAC TTA AAC AAG GCT TTA 636
Glu Gln Ile Lys Ser Pro Gln Asp Glu Trp Asp Leu Asn Lys Ala Leu
175 180 185 190

CAA AGC GAG TAT TTA CAA GAA CTC AAA TAT AAA AAC AAA GCA AAA GCC 684
Gln Ser Glu Tyr Leu Gln Glu Leu Lys Tyr Lys Asn Lys Ala Lys Ala
195 200 205

CTA GAC ATT CAA AAT AAC CAC AGC ACC CCT TTA AAA CAA AAG ATT TCT 732
Leu Asp Ile Gln Asn Asn His Ser Thr Pro Leu Lys Gln Lys Ile Ser
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Gly Thr lie Gin Thr Glu Ala Leu Glu Gin Glu lie Glu Trp Phe Ser 
50 55 60 

CAA GAC TAT CAC ATT CCT ATG GAG AAT ATT TTA CAT GTC CTT TCT ATC 299
Gln Asp Tyr His Ile Pro Met Glu Asn Ile Leu His Val Leu Ser Ile
65 70 75

AAT CCC TAT CAA TGAAAAGAGC CTTAGTTTTA TCAAAAACAA CTTTCAAGCT 351
Asn Pro Tyr Gln
80



(2) INFORMATION FOR SEQ ID NO: 934:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934:



Met Ser Phe Ala Pro Met Leu Leu Ala Thr Ile Asn Asn Ser Ile Gly
1 5 10 15

Asn Lys Asp Lys His Val Ser Leu Glu Tyr Leu Ile Gly Leu Phe Met
20 25 30

Asp Lys Lys Thr Thr Asn Leu Ser Asn Thr Asp Lys Tyr Ile Ile Gly
35 40 45

Thr Ile Gln Thr Glu Ala Leu Glu Gln Glu Ile Glu Trp Phe Ser Gln
50 55 60

Asp Tyr His Ile Pro Met Glu Asn Ile Leu His Val Leu Ser Ile Asn
65 70 75 80

Pro Tyr Gln



(2) INFORMATION FOR SEQ ID NO: 935:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1934 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 67...1866
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935:
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65 

70 75 80

Thr Asp Pro Thr Ala Asn His Asn Lys Gln Gly Ile His Tyr Cys Thr
85 90 95

Glu Asn Lys Lys Thr Gly Lys Cys Asp Pro Ile Asp Asn Val Phe Arg
100 105 110

Thr Thr Arg Leu Asp Asn Glu Leu Glu Gln Glu Ile Gln Thr Leu Thr
115 120 125

Leu Asp Leu Thr Lys Ala Pro Asn Lys Asp Ala Gln Ser Gln Ala Tyr
130 135 140

Ala Asn Phe Asn Gln Arg Ile Lys Leu Leu Thr Leu Lys Tyr Leu Lys
145 150 155 160

Glu Ile Thr Asn Gln Met Leu Phe Leu Asn Gln Thr Met Ala Met Gln
165 170 175

Ser Glu Ile Met Ala Asp Asp Tyr Phe Arg Gln Asn Asn Asp Gly Phe
180 185 190

Gly Lys Glu Glu Asn His Ile Asp Lys Gln Leu Thr Gln Lys Arg Ile
195 200 205

Asn Glu Arg Glu Arg Ala Arg Ile Tyr Phe Gln Asn Pro Asn Val Lys
210 215 220

Phe Asp Gln Phe Gly Phe Pro Ile Phe Ser Ile Trp Asp
225 230 235



(2) INFORMATION FOR SEQ ID NO: 933:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 63...311
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933:

TCAAACCAAA ACCAACACAA AATTTGCTAA ACTACAATCA AATCAATTTA GGGAGGATAA 60

AA ATG TCA TTT GCC CCT ATG TTA TTA GCT ACA ATC AAT AAC TCT ATT 107
Met Ser Phe Ala Pro Met Leu Leu Ala Thr Ile Asn Asn Ser Ile
1 5 10 15

GGC AAT AAA GAT AAG CAT GTG AGT TTA GAG TAT CTT ATA GGG CTT TTT 155
Gly Asn Lys Asp Lys His Val Ser Leu Glu Tyr Leu Ile Gly Leu Phe
20 25 30

ATG GAT AAA AAA ACA ACT AAT CTA AGC AAT ACT GAC AAG TAT ATT ATA 203
Met Asp Lys Lys Thr Thr Asn Leu Ser Asn Thr Asp Lys Tyr Ile Ile
35 40 45

GGC ACA ATT CAA ACA GAG GCA CTA GAG CAA GAA ATA GAA TGG TTT TCA 251
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CGC TTA GAT AAC GAA TTA GAA CAA GAA ATC CAA ACG CTC ACA CTT GAT 441 
Arg Leu Asp Asn Glu Leu Glu Gln Glu Ile Gln Thr Leu Thr Leu Asp
115 120 125 130

TTA ACC AAA GCC CCC AAT AAA GAC GCT CAA AGC CAA GCC TAC GCA AAT 489
Leu Thr Lys Ala Pro Asn Lys Asp Ala Gln Ser Gln Ala Tyr Ala Asn
135 140 145

TTC AAT CAA AGG ATT AAA TTA CTT ACT CTA AAA TAT TTA AAA GAA ATT 537
Phe Asn Gln Arg Ile Lys Leu Leu Thr Leu Lys Tyr Leu Lys Glu Ile
150 155 160

ACC AAT CAA ATG CTC TTT TTA AAT CAA ACA ATG GCA ATG CAA AGC GAG 585
Thr Asn Gln Met Leu Phe Leu Asn Gln Thr Met Ala Met Gln Ser Glu
165 170 175

ATT ATG GCA GAT GAT TAT TTT AGG CAA AAT AAT GAT GGC TTT GGG AAA 633
Ile Met Ala Asp Asp Tyr Phe Arg Gln Asn Asn Asp Gly Phe Gly Lys
180 185 190

GAA GAA AAC CAT ATA GAC AAA CAA TTA ACG CAA AAA AGA ATA AAC GAA 681
Glu Glu Asn His Ile Asp Lys Gln Leu Thr Gln Lys Arg Ile Asn Glu
195 200 205 210

AGA GAA AGA GCC AGA ATA TAC TTT CAA AAC CCT AAT GTT AAA TTT GAC 729
Arg Glu Arg Ala Arg Ile Tyr Phe Gln Asn Pro Asn Val Lys Phe Asp
215 220 225

CAA TTT GGT TTT CCC ATT TTT AGT ATA TGG GAT TAAGGGTTTA GTGATGAGAG 782
Gln Phe Gly Phe Pro Ile Phe Ser Ile Trp Asp
230 235

ATAGAATAAG TATTTTTTTT CCAAACTATT CCTATTTTAG TGGTAGTGTT G 833

(2) INFORMATION FOR SEQ ID NO: 932:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 237 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932:

Met Gln Ser Leu Ala Gly Gly Leu Ser Gly Arg Ala Trp Gly Glu Met
1 5 10 15

Leu Cys Lys Met Val Asn Asp Ser Asn Tyr Glu Ser Glu Gln Ala Leu
20 25 30

Leu Ala Thr Gly Asn Ser Ser Glu Glu Gln Lys Arg Arg Phe Leu Leu
35 40 45

Arg Val Lys Lys Lys Val Asn Asp Asn Arg Gln Leu Lys Lys Lys Leu
50 55 60

Asp Pro Phe Leu Lys Arg Leu Asp Val Leu Gln Thr Glu Phe Gly Val
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260 265 270 

Pro Lys 



(2) INFORMATION FOR SEQ ID NO: 931:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 833 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 52...762
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931:

TAGCGGTTGG GTAATTCACT CCAATCTCTA AGTGGCAGTG GCTATGAAAA C ATG CAA 57
Met Gln
1

TCA CTT GCT GGG GGA TTG AGT GGT AGA GCG TGG GGA GAA ATG TTG TGT 105
Ser Leu Ala Gly Gly Leu Ser Gly Arg Ala Trp Gly Glu Met Leu Cys
5 10 15

AAA ATG GTA AAC GAT AGT AAT TAT GAA AGC GAG CAA GCT CTT TTA GCA 153
Lys Met Val Asn Asp Ser Asn Tyr Glu Ser Glu Gln Ala Leu Leu Ala
20 25 30

ACA GGC AAT AGC TCA GAA GAG CAA AAA CGA AGA TTT TTG CTT AGA GTA 201
Thr Gly Asn Ser Ser Glu Glu Gln Lys Arg Arg Phe Leu Leu Arg Val
35 40 45 50

AAG AAA AAG GTT AAT GAT AAT AGG CAG TTA AAA AAG AAA CTT GAC CCA 249
Lys Lys Lys Val Asn Asp Asn Arg Gln Leu Lys Lys Lys Leu Asp Pro
55 60 65

TTT CTA AAA AGA CTT GAT GTC CTA CAA ACT GAG TTT GGT GTA ACT GAC 297
Phe Leu Lys Arg Leu Asp Val Leu Gln Thr Glu Phe Gly Val Thr Asp
70 75 80

CCT ACA GCT AAC CAT AAT AAG CAA GGG ATA CAT TAT TGC ACA GAA AAT 345
Pro Thr Ala Asn His Asn Lys Gln Gly Ile His Tyr Cys Thr Glu Asn
85 90 95

AAA AAG ACA GGT AAA TGC GAC CCT ATT GAT AAT GTA TTT AGG ACA ACT 393
Lys Lys Thr Gly Lys Cys Asp Pro Ile Asp Asn Val Phe Arg Thr Thr
100 105 110
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CTC ATT GAA CGC ATC AAA AAC GCC CTA GAC GCT AAC CAC ATT GAA ATC 824 
Leu Ile Glu Arg Ile Lys Asn Ala Leu Asp Ala Asn His Ile Glu Ile
245 250 255

CCT TTC AAC AAG CTA GAT ATT GCT ATT AAA AAT CAA GAC TCT CCT AAA T 873
Pro Phe Asn Lys Leu Asp Ile Ala Ile Lys Asn Gln Asp Ser Pro Lys
260 265 270

GATTGGTGTG AGATGTATTG ATTGTAG 900

(2) INFORMATION FOR SEQ ID NO: 930:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 274 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930:

Met Asp Glu Ile Lys Thr Leu Leu Val Asp Phe Phe Pro Gln Ala Lys
1 5 10 15

His Phe Gly Ile Ile Leu Ile Lys Ala Ile Val Val Phe Cys Ile Gly
20 25 30

Phe Tyr Phe Ser Phe Phe Leu Gln Lys Lys Thr Met Lys Phe Leu Ser
35 40 45

Lys Lys Asp Glu Ile Leu Ala Asn Phe Val Ala Gln Val Thr Phe Ile
50 55 60

Leu Ile Leu Ile Ile Thr Thr Ile Ile Ala Leu Ser Thr Leu Gly Val
65 70 75 80

Gln Thr Thr Ser Ile Ile Thr Val Leu Gly Thr Val Gly Ile Ala Val
85 90 95

Ala Leu Ala Leu Lys Asp Tyr Leu Ser Ser Ile Ala Gly Gly Ile Ile
100 105 110

Leu Ile Ile Leu His Pro Phe Lys Lys Gly Asp Ile Ile Glu Ile Ser
115 120 125

Gly Leu Glu Gly Lys Val Glu Ala Leu Asn Phe Phe Asn Thr Ser Leu
130 135 140

Arg Leu His Asp Gly Arg Leu Ala Val Leu Pro Asn Arg Ser Val Ala
145 150 155 160

Asn Ser Asn Ile Ile Asn Ser Asn Asn Thr Ala Cys Arg Arg Ile Glu
165 170 175

Trp Val Cys Gly Val Gly Tyr Gly Ser Asp Ile Glu Leu Val His Lys
180 185 190

Thr Ile Lys Asp Val Ile Asp Ala Met Glu Lys Ile Asp Lys Asn Met
195 200 205

Pro Thr Phe Ile Gly Ile Thr Asp Phe Gly Gln Ser Ser Leu Asn Phe
210 215 220

Thr Ile Arg Val Trp Ala Lys Ile Glu Asp Gly Ile Phe Asn Val Arg
225 230 235 240

Ser Glu Leu Ile Glu Arg Ile Lys Asn Ala Leu Asp Ala Asn His Ile
245 250 255

Glu Ile Pro Phe Asn Lys Leu Asp Ile Ala Ile Lys Asn Gln Asp Ser
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Gly lie lie Leu lie Lys Ala He Val Val Phe Cys He Gly Phe Tyr 
20 25 30 

TTT TCA TTT TTC TTA CAA AAA AAA ACC ATG AAA TTT TTA TCC AAA AAG 200
Phe Ser Phe Phe Leu Gln Lys Lys Thr Met Lys Phe Leu Ser Lys Lys
35 40 45 50

GAT GAG ATT TTA GCG AAT TTT GTC GCA CAG GTT ACT TTT ATC TTA ATC 248
Asp Glu Ile Leu Ala Asn Phe Val Ala Gln Val Thr Phe Ile Leu Ile
55 60 65

CTT ATC ATC ACC ACA ATC ATT GCG CTC AGC ACG CTA GGC GTG CAA ACC 296
Leu Ile Ile Thr Thr Ile Ile Ala Leu Ser Thr Leu Gly Val Gln Thr
70 75 80

ACC TCT ATT ATC ACT GTT TTA GGA ACG GTA GGG ATT GCT GTG GCG TTG 344
Thr Ser Ile Ile Thr Val Leu Gly Thr Val Gly Ile Ala Val Ala Leu
85 90 95

GCT TTA AAA GAT TAT CTT TCA AGC ATT GCT GGA GGG ATA ATC CTT ATT 392
Ala Leu Lys Asp Tyr Leu Ser Ser Ile Ala Gly Gly Ile Ile Leu Ile
100 105 110

ATC TTG CAC CCT TTC AAA AAA GGA GAC ATC ATT GAA ATC TCT GGC CTA 440
Ile Leu His Pro Phe Lys Lys Gly Asp Ile Ile Glu Ile Ser Gly Leu
115 120 125 130

GAG GGC AAA GTA GAA GCG CTT AAT TTT TTT AAC ACT TCT TTA CGC TTG 488
Glu Gly Lys Val Glu Ala Leu Asn Phe Phe Asn Thr Ser Leu Arg Leu
135 140 145

CAT GAC GGA CGC TTG GCG GTT TTA CCC AAT AGA AGT GTC GCT AAT TCT 536
His Asp Gly Arg Leu Ala Val Leu Pro Asn Arg Ser Val Ala Asn Ser
150 155 160

AAT ATT ATC AAT AGC AAT AAC ACG GCG TGT CGG CGC ATT GAA TGG GTT 584
Asn Ile Ile Asn Ser Asn Asn Thr Ala Cys Arg Arg Ile Glu Trp Val
165 170 175

TGT GGG GTA GGG TAT GGG AGC GAT ATT GAA CTG GTG CAT AAG ACT ATA 632
Cys Gly Val Gly Tyr Gly Ser Asp Ile Glu Leu Val His Lys Thr Ile
180 185 190

AAA GAT GTT ATT GAT GCA ATG GAA AAA ATT GAT AAA AAC ATG CCC ACT 680
Lys Asp Val Ile Asp Ala Met Glu Lys Ile Asp Lys Asn Met Pro Thr
195 200 205 210

TTT ATT GGG ATC ACG GAT TTT GGA CAA AGT TCG CTG AAT TTC ACC ATT 728
Phe Ile Gly Ile Thr Asp Phe Gly Gln Ser Ser Leu Asn Phe Thr Ile
215 220 225

AGG GTT TGG GCA AAG ATT GAA GAC GGA ATC TTT AAT GTG CGC AGC GAA 776
Arg Val Trp Ala Lys Ile Glu Asp Gly Ile Phe Asn Val Arg Ser Glu
230 235 240
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5 70 75 

CTCATAC 311 
(2) INFORMATION FOR SEQ ID NO: 928:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928:

Met Val Val Ser Met Asn Cys Ile Gly Ser Lys Tyr Lys Leu Ile Ala
1 5 10 15

Phe Ile Gln Glu Asn Ile His Ala Val Val Gly Gln Pro Phe Gly Cys
20 25 30

Asp Phe Leu Arg Ser Val Arg Trp Asp Gly Tyr Arg Gly Val Cys Val
35 40 45

Lys Trp Ser Leu Gly Ser Thr Leu Lys Asn Ile Phe Ser Leu Asp Ser
50 55 60

Val Leu Lys Ala Asn Gln Val Ile Pro Lys Asp Ala
65 70 75

(2) INFORMATION FOR SEQ ID NO: 929:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...872
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 929:

AAAATTAAGT TGTTTGATCG CTTTTAAACG ATTTTTAAAA GGAAAAATTT ATG GAT 56
Met Asp
1

GAA ATT AAA ACG CTG TTA GTG GAT TTT TTT CCG CAG GCA AAG CAT TTT 104
Glu Ile Lys Thr Leu Leu Val Asp Phe Phe Pro Gln Ala Lys His Phe
5 10 15

GGG ATA ATC TTA ATC AAG GCT ATT GTT GTC TTT TGT ATA GGT TTT TAT 152
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926: 

Met Ile Val Ser Phe Ile Ala Val Pro Cys Tyr Tyr Val Leu Leu Ala
1 5 10 15

Met Glu Tyr Gln Ile Ala Tyr Glu His Pro Gly Glu Leu Ile Ser Thr
20 25 30

Ile Gly Phe Val Ala Leu Ala Val Leu Val Tyr Tyr Leu Trp Gly Lys
35 40 45

Trp Glu Lys Leu Leu Trp Gly Ala Pro Ser Asn Gln Glu Gln Gln Leu
50 55 60

Ser Asn Gln Gly Asn Gln Asn Gln
65 70

(2) INFORMATION FOR SEQ ID NO: 927:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 311 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 60...287
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927:



TAGGAAAAAG ACTATCATGC GAGCTATCCA AATTAGATCC GATCAAAAAC TACCCTTTA A 60
Me

TGG TTG TAT CAA TGA ACT GCA TCG GCT CTA AAT ACA AAC TCA TTG CCT 108
t Val Val Ser Met Asn Cys Ile Gly Ser Lys Tyr Lys Leu Ile Ala Ph
1 5 10 15

TTA TTC AAG AAA ATA TCC ATG CGG TTG TGG GGC AAC CTT TTG GGT GTG 156
e Ile Gln Glu Asn Ile His Ala Val Val Gly Gln Pro Phe Gly Cys As
20 25 30

ATT TTT TGC GAT CTG TTC GCT GGG ACG GGT ATC GTG GGG TGT GCG TAA 204
p Phe Leu Arg Ser Val Arg Trp Asp Gly Tyr Arg Gly Val Cys Val Ly
35 40 45

AGT GGT CTC TAG GTT CAA CAC TAA AAA ACA TTT TTT CAT TAG ACA GCG 252
s Trp Ser Leu Gly Ser Thr Leu Lys Asn Ile Phe Ser Leu Asp Ser Va
50 55 60 6

TGT TAA AAG CCA ATC AAG TTA TCC CTA AAG ATG CT TAACATGTTA AAATAAT 304
l Leu Lys Ala Asn Gln Val Ile Pro Lys Asp Ala
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180 185 190 

Leu Gln Ala Phe Phe Ser Leu Thr Asn Tyr
195 200 

(2) INFORMATION FOR SEQ ID NO: 925:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 310 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 65...280
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925:

AATTCAAGAA GGCAAAAAAT TATCTTTTTT CCTCGAATCA ATCATTAGGT TATTTTTTGG 60
TTTT ATG ATA GTT TCT TTT ATT GCC GTT CCA TGC TAC TAT GTT TTA TTG 109
Met Ile Val Ser Phe Ile Ala Val Pro Cys Tyr Tyr Val Leu Leu
1 5 10 15

GCG ATG GAA TAC CAA ATA GCC TAT GAA CAC CCA GGA GAA TTA ATA AGC 157
Ala Met Glu Tyr Gln Ile Ala Tyr Glu His Pro Gly Glu Leu Ile Ser
20 25 30

ACG ATT GGT TTT GTT GCG TTA GCA GTG CTT GTG TAT TAC TTA TGG GGT 205
Thr Ile Gly Phe Val Ala Leu Ala Val Leu Val Tyr Tyr Leu Trp Gly
35 40 45

AAA TGG GAG AAG TTG CTA TGG GGC GCA CCT TCC AAT CAA GAG CAA CAA 253
Lys Trp Glu Lys Leu Leu Trp Gly Ala Pro Ser Asn Gln Glu Gln Gln
50 55 60

CTC TCC AAT CAA GGC AAC CAA AAT CAA TGATTGTGAT TGATCGCTAG GTCAATC 307
Leu Ser Asn Gln Gly Asn Gln Asn Gln
65 70

TGA 310

(2) INFORMATION FOR SEQ ID NO: 926: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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140 145 150 

ACC CTT ATG CAG TAT TCG CCC ATT TAC TCC ATA ATC AAA CTT CTT GCT 533
Thr Leu Met Gln Tyr Ser Pro Ile Tyr Ser Ile Ile Lys Leu Leu Ala
155 160 165

CAT TTT AAT ATA GAG ATC ACT TCT AAG ATT ATC ATT TCT CTT GTT TGG 581
His Phe Asn Ile Glu Ile Thr Ser Lys Ile Ile Ile Ser Leu Val Trp
170 175 180

GTG TGT ATA GGG CTG TAT TTT TTG TTA TTG CAA GCG TTT TTT AGT CTT 629
Val Cys Ile Gly Leu Tyr Phe Leu Leu Leu Gln Ala Phe Phe Ser Leu
185 190 195

ACA AAT TAT TAGTTGCAGA AAATTCAAGA AGGCAAAAAA TTATCTTTTT TCCTCGAAT 687
Thr Asn Tyr
200

CAATCATTAG GTTATTTTTT GGTTTTATGA TAG 720

(2) INFORMATION FOR SEQ ID NO: 924:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 202 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924:

Met Asn Val Leu Ile Arg Leu Cys Phe Ile Phe Leu Ile Gly Phe Phe
1 5 10 15

Gly Ala Asn Lys Thr Leu Asn Ala Thr Ala Ile Leu Ser Leu Asp Phe
20 25 30

Gly Ser Phe Ser Met Pro Ile Thr Ala Asn Phe Ser Asp Gly Ala Leu
35 40 45

Asn Val Phe Lys Trp Phe Glu Lys His Pro Ser Val Gly Val Lys Val
50 55 60

Gly Arg Leu Ala Asn Gln Asp Asp Thr Ile Phe Thr Leu Val Phe Ile
65 70 75 80

Val Ile Val Val Ala Ile Ile Ala Leu Ile Ala Ile Phe Ile Arg Ser
85 90 95

Ile Leu Leu Asn Thr Ile Phe Val Gly Ser Leu Ile Gly Ser Leu Trp
100 105 110

Leu Tyr Met Val Gly Phe Tyr Tyr Phe Tyr Gly Val Pro Phe Leu Ser
115 120 125

Tyr Leu Ser Gly Cys Tyr Glu Ser Phe Ser Phe Ser Ala Cys Tyr Pro
130 135 140

His Ser Leu Gln Leu Leu Pro Thr Leu Met Gln Tyr Ser Pro Ile Tyr
145 150 155 160

Ser Ile Ile Lys Leu Leu Ala His Phe Asn Ile Glu Ile Thr Ser Lys
165 170 175

Ile Ile Ile Ser Leu Val Trp Val Cys Ile Gly Leu Tyr Phe Leu Leu






WO 98/43478 PCT/US98/06371

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 720 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 33...638
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923:

AAGCTTATAA AATCATCAAA AAGAGTGCTG AA ATG AAT GTT TTA ATC AGA TTG 53
Met Asn Val Leu Ile Arg Leu
1 5

TGC TTT ATT TTT TTG ATT GGG TTT TTT GGC GCG AAT AAA ACC CTA AAC 101
Cys Phe Ile Phe Leu Ile Gly Phe Phe Gly Ala Asn Lys Thr Leu Asn
10 15 20

GCA ACA GCC ATT CTT TCT CTT GAC TTT GGC TCT TTT TCC ATG CCA ATC 149
Ala Thr Ala Ile Leu Ser Leu Asp Phe Gly Ser Phe Ser Met Pro Ile
25 30 35

ACT GCC AAT TTC TCA GAT GGT GCG TTA AAT GTA TTC AAA TGG TTT GAA 197
Thr Ala Asn Phe Ser Asp Gly Ala Leu Asn Val Phe Lys Trp Phe Glu
40 45 50 55

AAA CAC CCA TCA GTG GGT GTT AAA GTT GGT CGG CTT GCA AAT CAA GAC 245
Lys His Pro Ser Val Gly Val Lys Val Gly Arg Leu Ala Asn Gln Asp
60 65 70

GAC ACT ATC TTT ACT CTA GTT TTC ATT GTG ATA GTT GTC GCA ATA ATT 293
Asp Thr Ile Phe Thr Leu Val Phe Ile Val Ile Val Val Ala Ile Ile
75 80 85

GCC CTT ATC GCT ATT TTT ATA AGG AGT ATA TTA CTA AAC ACA ATT TTT 341
Ala Leu Ile Ala Ile Phe Ile Arg Ser Ile Leu Leu Asn Thr Ile Phe
90 95 100

GTA GGA TCG CTC ATA GGA TCC TTA TGG TTG TAT ATG GTA GGG TTT TAT 389
Val Gly Ser Leu Ile Gly Ser Leu Trp Leu Tyr Met Val Gly Phe Tyr
105 110 115

TAT TTT TAT GGT GTT CCC TTT TTG AGT TAT TTG AGC GGT TGT TAT GAA 437
Tyr Phe Tyr Gly Val Pro Phe Leu Ser Tyr Leu Ser Gly Cys Tyr Glu
120 125 130 135

TCG TTT TCT TTC TCC GCA TGC TAT CCT CAT AGT TTG CAG CTA CTC CCC 485
Ser Phe Ser Phe Ser Ala Cys Tyr Pro His Ser Leu Gln Leu Leu Pro
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Gin Thr Asp Thr Phe lie Ser Phe lie Pro Lys Lys Glu Trp Ser Val 

610 615 620 

Lys Thr Lys Asp Glu Leu Leu Glu Lys Ile Met Asp Ser Leu Lys Asp
625 630 635 640

Phe Lys Gly Ile Asn Phe Ser Phe Thr Gln Pro Ile Glu Met Arg Ile
645 650 655

Ser Glu Met Leu Thr Gly Val Arg Gly Asp Leu Ala Val Lys Ile Phe
660 665 670

Gly Asp Gly Ile Ser Glu Leu Asn Glu Leu Ser Phe Gln Ile Ala Gln
675 680 685

Ala Leu Lys Gly Ile Lys Gly Ser Ser Glu Val Leu Thr Thr Leu Asn
690 695 700

Glu Gly Val Asn Tyr Leu Tyr Val Thr Pro Asn Lys Glu Ser Met Ala
705 710 715 720

Asp Val Gly Ile Thr Ser Asp Glu Phe Ser Lys Phe Leu Lys Ser Ala
725 730 735

Leu Glu Gly Leu Val Val Asp Val Ile Pro Thr Gly Ile Ser Arg Thr
740 745 750

Pro Val Met Ile Arg Gln Glu Ser Asp Phe Ala Ser Ser Ile Thr Lys
755 760 765

Ile Lys Ser Leu Ala Leu Thr Ser Lys Tyr Gly Val Leu Val Pro Ile
770 775 780

Thr Ser Ile Ala Lys Ile Glu Glu Val Asp Gly Pro Val Ser Val Val
785 790 795 800

Arg Glu Asn Ser Met Arg Met Ser Val Val Arg Ser Asn Val Val Gly
805 810 815

Arg Asp Leu Lys Ser Phe Val Glu Glu Ala Lys Lys Val Ile Ala Gln
820 825 830

Asn Ile Lys Leu Pro Pro Ser Tyr Tyr Ile Thr Tyr Gly Gly Gln Phe
835 840 845

Glu Asn Gln Gln Arg Ala Asn Lys Arg Leu Ser Thr Val Ile Pro Leu
850 855 860

Ser Ile Leu Ala Ile Phe Phe Ile Leu Phe Phe Thr Phe Lys Ser Ile
865 870 875 880

Pro Leu Ala Leu Leu Ile Leu Leu Asn Ile Pro Phe Ala Val Thr Gly
885 890 895

Gly Leu Ile Ala Leu Phe Ala Val Gly Glu Tyr Ile Ser Val Pro Ala
900 905 910

Ser Val Gly Phe Ile Ala Leu Phe Gly Ile Ala Val Leu Asn Gly Val
915 920 925

Val Met Ile Gly Tyr Phe Lys Glu Leu Leu Leu Gln Gly Lys Ser Val
930 935 940

Glu Glu Cys Val Leu Leu Gly Ala Lys Arg Arg Leu Arg Pro Val Leu
945 950 955 960

Met Thr Ala Cys Ile Ala Gly Leu Gly Leu Leu Pro Leu Leu Phe Ser
965 970 975

His Ser Val Gly Ser Glu Val Gln Lys Pro Leu Ala Ile Val Val Leu
980 985 990

Gly Gly Leu Val Thr Ser Ser Ala Leu Thr Leu Leu Leu Leu Pro Pro
995 1000 1005

Met Phe Met Leu Ile Ala Lys Lys Ile Lys Ile Val
1010 1015 1020

(2) INFORMATION FOR SEQ ID NO: 923: 
165 170 175

Ser Ile Gly Gly Phe Ser Arg Ala Phe Val Ile Val Pro Asp Phe Asn
180 185 190

Asp Met Ala Arg Leu Gly Val Ser Ile Ser Asp Leu Glu Ser Ala Val
195 200 205

Arg Val Asn Leu Arg Asn Ser Gly Ala Gly Arg Val Asp Arg Asp Gly
210 215 220

Glu Thr Phe Leu Val Lys Ile Gln Thr Ala Ser Leu Ser Leu Glu Asp
225 230 235 240

Ile Gly Lys Ile Thr Val Ser Thr Asn Leu Gly His Leu His Ile Lys
245 250 255

Asp Phe Ala Lys Val Ile Ser Gln Ser Arg Thr Arg Leu Gly Phe Val
260 265 270

Thr Lys Asp Gly Val Gly Glu Thr Thr Glu Gly Leu Val Leu Ser Leu
275 280 285

Lys Asp Ala Asn Thr Lys Glu Ile Ile Thr Gln Val Tyr Gln Lys Leu
290 295 300

Glu Glu Leu Lys Pro Phe Leu Pro Asn Gly Val Ser Ile Asn Val Phe
305 310 315 320

Tyr Asp Arg Ser Glu Phe Thr Gln Lys Ala Ile Ala Thr Val Ser Lys
325 330 335

Thr Leu Ile Glu Ala Val Val Leu Ile Ile Ile Thr Leu Phe Leu Phe
340 345 350

Leu Gly Asn Leu Arg Ala Ser Val Ala Val Gly Val Ile Leu Pro Leu
355 360 365

Ser Leu Ser Val Ala Phe Ile Phe Ile Lys Phe Ser Asp Leu Thr Leu
370 375 380

Asn Leu Met Ser Leu Gly Gly Leu Val Ile Ala Ile Gly Met Leu Ile
385 390 395 400

Asp Ser Ala Val Val Val Val Glu Asn Ala Phe Glu Lys Leu Ser Ala
405 410 415

Asn Thr Lys Thr Thr Lys Leu His Ala Ile Tyr Arg Ser Cys Lys Glu
420 425 430

Ile Ala Val Ser Val Val Ser Gly Val Val Ile Ile Ile Val Phe Phe
435 440 445

Val Pro Ile Leu Thr Leu Gln Gly Leu Glu Gly Lys Met Phe Arg Pro
450 455 460

Leu Ala Gln Ser Ile Val Tyr Ala Leu Leu Gly Thr Leu Val Leu Ser
465 470 475 480

Ile Thr Ile Ile Pro Val Val Ser Ser Leu Val Leu Lys Ala Thr Pro
485 490 495

His Ser Glu Thr Phe Leu Thr Arg Phe Leu Asn Arg Ile Tyr Ala Pro
500 505 510

Leu Leu Glu Phe Phe Val His Asn Pro Lys Lys Val Ile Leu Gly Ala
515 520 525

Phe Val Phe Leu Ile Ala Ser Leu Ser Leu Phe Pro Phe Val Gly Lys
530 535 540

Asn Phe Met Pro Val Leu Asp Glu Gly Asp Val Val Leu Ser Val Glu
545 550 555 560

Thr Thr Pro Ser Ile Ser Leu Asp Gln Ser Arg Asp Leu Met Leu Asn
565 570 575

Ile Glu Ser Ala Ile Lys Lys His Val Lys Glu Val Lys Ser Ile Val
580 585 590

Ala Arg Thr Gly Ser Asp Glu Leu Gly Leu Asp Leu Gly Gly Leu Asn
595 600 605
GAA GAA TGC GTT TTA TTG GGC GCT AAA AGG CGT TTG AGA CCG GTT TTA 2940
Glu Glu Cys Val Leu Leu Gly Ala Lys Arg Arg Leu Arg Pro Val Leu
945 950 955 960

ATG ACC GCT TGC ATT GCC GGT TTG GGT TTG CTC CCT TTA TTA TTT TCT 2988
Met Thr Ala Cys Ile Ala Gly Leu Gly Leu Leu Pro Leu Leu Phe Ser
965 970 975

CAT AGC GTG GGA TCA GAA GTC CAA AAA CCT TTA GCG ATC GTG GTG CTT 3036
His Ser Val Gly Ser Glu Val Gln Lys Pro Leu Ala Ile Val Val Leu
980 985 990

GGA GGC TTG GTT ACC TCA AGC GCT CTA ACC TTA CTC CTA CTG CCG CCA 3084
Gly Gly Leu Val Thr Ser Ser Ala Leu Thr Leu Leu Leu Leu Pro Pro
995 1000 1005

ATG TTT ATG CTC ATC GCT AAA AAG ATT AAA ATC GTT TGAGTTAAAG GATTTC 3136
Met Phe Met Leu Ile Ala Lys Lys Ile Lys Ile Val
1010 1015 1020

ACATGCTCGC TTTAGAAATT TATATTGATA TTTGTTTGAA AGA 3179

(2) INFORMATION FOR SEQ ID NO: 922: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922:

Met Met Leu Ala Ser Ile Ile Glu Phe Ser Leu Arg Gln Arg Val Ile
1 5 10 15

Val Ile Val Gly Ala Ile Leu Ile Leu Phe Phe Gly Thr Tyr Ser Phe
20 25 30

Ile Asn Thr Pro Val Asp Ala Phe Pro Asp Ile Ser Pro Thr Gln Val
35 40 45

Lys Ile Ile Leu Lys Leu Pro Gly Ser Ser Pro Glu Glu Met Glu Asn
50 55 60

Asn Ile Val Arg Pro Leu Glu Leu Glu Leu Leu Gly Leu Lys Gly Gln
65 70 75 80

Lys Ser Leu Arg Ser Val Ser Lys Tyr Ser Ile Ser Asp Ile Thr Ile
85 90 95

Asp Phe Asp Asp Ser Val Asp Ile Tyr Leu Ala Arg Asn Ile Val Asn
100 105 110

Glu Arg Leu Ser Ser Val Met Lys Asp Leu Pro Val Gly Val Glu Gly
115 120 125

Gly Met Ala Pro Ile Val Thr Pro Leu Ser Asp Ile Phe Met Phe Thr
130 135 140

Ile Asp Gly Asn Ile Thr Glu Ile Glu Lys Arg Gln Leu Leu Asp Phe
145 150 155 160

Val Ile Arg Pro Gln Leu Arg Met Ile Ser Gly Val Ala Asp Val Asn
GAT GTG GGG ATC ACT AGC GAT GAA TTT TCC AAG TTT TTA AAA TCC GCT 2268
Asp Val Gly Ile Thr Ser Asp Glu Phe Ser Lys Phe Leu Lys Ser Ala
725 730 735

TTA GAG GGC TTG GTT GTA GAT GTG ATC CCT ACA GGG ATT TCA CGC ACG 2 316 

Leu Glu Gly Leu Val Val Asp Val lie Pro Thr Gly lie Ser Arg Thr 
740 745 750 

CCA GTG ATG ATC CGC CAA GAG AGC GAT TTT GCA AGC TCT ATC ACT AAA 2 3 64 

Pro Val Met lie Arg Gin Glu Ser Asp Phe Ala Ser Ser lie Thr Lys 
755 760 765 

ATC AAA AGT TTA GCC TTG ACT TCA AAA TAT GGC GTT TTA GTG CCT ATC 2 412 

lie Lys Ser Leu Ala Leu Thr Ser Lys Tyr Gly Val Leu Val Pro lie 
770 775 780 

ACT TCT ATC GCC AAA ATT GAA GAA GTG GAT GGC CCT GTT TCT GTT GTG 24 6 0 

Thr Ser lie Ala Lys lie Glu Glu Val Asp Gly Pro Val Ser Val Val 
785 790 795 800 

CGT GAA AAT TCA ATG CGC ATG AGC GTG GTT CGC AGT AAT GTG GTG GGG 2 50 8 

Arg Glu Asn Ser Met Arg Met Ser Val Val Arg Ser Asn Val Val Gly 
805 810 815 

CGC GAT TTG AAA TCT TTT GTA GAA GAG GCT AAA AAA GTG ATC GCT CAA 255 6 

Arg Asp Leu Lys Ser Phe Val Glu Glu Ala Lys Lys Val lie Ala Gin 
820 825 830 

AAC ATC AAA CTC CCT CCC AGC TAC TAT ATC ACT TAT GGG GGG CAG TTT 2 604 

Asn lie Lys Leu Pro Pro Ser Tyr Tyr lie Thr Tyr Gly Gly Gin Phe 
835 840 845 

GAA AAC CAG CAA CGG GCC AAT AAA AGG CTC TCC ACC GTT ATC CCT TTA 2 65 2 

Glu Asn Gin Gin Arg Ala Asn Lys Arg Leu Ser Thr Val lie Pro Leu 
850 855 860 

AGC ATC TTA GCG ATT TTT TTC ATT CTT TTT TTC ACT TTT AAA AGC ATT 2 700 

Ser lie Leu Ala lie Phe Phe lie Leu Phe Phe Thr Phe Lys Ser lie 
865 870 875 880 

CCT TTA GCC TTG CTC ATT CTT TTG AAT ATC CCT TTT GCG GTT ACC GGA 2 74 8 

Pro Leu Ala Leu Leu lie Leu Leu Asn lie Pro Phe Ala Val Thr Gly 
885 890 895 

GGC CTT ATT GCG TTG TTT GCG GTC GGG GAG TAT ATT TCA GTG CCA GCG 2 7 96 

Gly Leu lie Ala Leu Phe Ala Val Gly Glu Tyr lie Ser Val Pro Ala 
900 905 910 

AGC GTG GGC TTT ATC GCT CTT TTT GGG ATT GCG GTT TTA AAT GGC GTG 2 844 
Ser Val Gly Phe lie Ala Leu Phe Gly lie Ala Val Leu Asn Gly Val 
915 920 925 

GTG ATG ATA GGC TAT TTT AAA GAG CTT CTC TTG CAA GGG AAA AGC GTA 2 8 92 

Val Met lie Gly Tyr Phe Lys Glu Leu Leu Leu Gin Gly Lys Ser Val 
930- 935 940 
CAT AGC GAA ACC TTT TTA ACG AGG TTT TTA AAC AGA ATC TAC GCC CCT 1596
His Ser Glu Thr Phe Leu Thr Arg Phe Leu Asn Arg Ile Tyr Ala Pro
500 505 510

TTA TTG GAA TTT TTT GTG CAT AAC CCT AAA AAA GTG ATT TTA GGA GCG 164 4 

Leu Leu Glu Phe Phe Val His Asn Pro Lys Lys Val He Leu Gly Ala 
515 520 525 

TTT GTT TTT TTA ATC GCA AGC CTT TCT TTA TTC CCT TTT GTG GGG AAG 16 92 

Phe Val Phe Leu He Ala Ser Leu Ser Leu Phe Pro Phe Val Gly Lys 
530 535 540 

AAT TTC ATG CCC GTT TTA GAT GAG GGC GAT GTG GTT TTG AGC GTG GAA 17 4 0 

Asn Phe Met Pro Val Leu Asp Glu Gly Asp Val Val Leu Ser Val Glu 
545 550 555 560 

ACC ACC CCT TCT ATT TCT TTA GAT CAA TCT AGG GAT CTC ATG CTA AAC 17 8 8 

Thr Thr Pro Ser He Ser Leu Asp Gin Ser Arg Asp Leu Met Leu Asn 
565 570 575 

ATT GAG AGC GCG ATT AAA AAG CAT GTC AAG GAA GTT AAA AGC ATT GTC 183 6 

He Glu Ser Ala He Lys Lys His Val Lys Glu Val Lys Ser He Val 
580 585 590 

GCG CGC ACA GGG AGC GAT GAA TTG GGG CTG GAT TTA GGA GGT TTG AAT 18 84 

Ala Arg Thr Gly Ser Asp Glu Leu Gly Leu Asp Leu Gly Gly Leu Asn 
595 600 605 

CAA ACC GAT ACT TTT ATT TCT TTT ATT CCT AAA AAA GAA TGG AGC GTT 1932 
Gin Thr Asp Thr Phe He Ser Phe He Pro Lys Lys Glu Trp Ser Val 
610 615 620 

AAA ACC AAA GAT GAA TTA TTA GAA AAA ATC ATG GAT TCT TTA AAA GAC 198 0 

Lys Thr Lys Asp Glu Leu Leu Glu Lys He Met Asp Ser Leu Lys Asp 
625 630 635 640 

TTT AAG GGG ATT AAC TTT TCT TTC ACC CAA CCC ATT GAA ATG AGA ATT 2 02 8 

Phe Lys Gly He Asn Phe Ser Phe Thr Gin Pro He Glu Met Arg He 
645 650 655 

TCT GAA ATG CTG ACA GGG GTT AGG GGG GAT TTA GCG GTT AAG ATT TTT 2 076 

Ser Glu Met Leu Thr Gly Val Arg Gly Asp Leu Ala Val Lys He Phe 
660 665 670 

GGA GAT GGT ATT AGC GAA TTG AAT GAA TTG AGT TTT CAA ATC GCG CAA 212 4 

Gly Asp Gly He Ser Glu Leu Asn Glu Leu Ser Phe Gin He Ala Gin 
675 680 685 

GCT CTA AAA GGG ATT AAA GGA TCT AGT GAA GTT TTA ACC ACG CTT AAT 2172 
Ala Leu Lys Gly He Lys Gly Ser Ser Glu Val Leu Thr Thr Leu Asn 
690 695 700 

GAG GGC GTG AAT TAT TTG TAT GTA ACC CCT AAT AAA GAA TCG ATG GCG 22 20 

Glu Gly Val Asn Tyr Leu Tyr Val Thr Pro Asn Lys Glu Ser Met Ala 
705 . 710 715 720 
ACT AAA GAT GGC GTG GGC GAG ACC ACA GAA GGC TTG GTG CTT TCT TTA 924
Thr Lys Asp Gly Val Gly Glu Thr Thr Glu Gly Leu Val Leu Ser Leu 
275 280 285 

AAA GAC GCT AAC ACC AAA GAA ATC ATC ACT CAA GTG TAT CAA AAA CTA 97 2 

Lys Asp Ala Asn Thr Lys Glu- lie lie Thr Gin Val Tyr Gin Lys Leu 
290 295 300 

GAA GAA TTA AAA CCC TTT TTA CCG AAT GGC GTG TCC ATT AAT GTT TTT 102 0 

Glu Glu Leu Lys Pro Phe Leu Pro Asn Gly Val Ser lie Asn Val Phe 
305 310 315 320 

TAT GAT CGC TCA GAA TTT ACG CAA AAA GCC ATT GCC ACC GTT TCT AAA 106 8 

Tyr Asp Arg Ser Glu Phe Thr Gin Lys Ala lie Ala Thr Val Ser Lys 
325 330 335 

ACC CTC ATT GAA GCC GTT GTT TTA ATC ATC ATC ACG CTC TTT TTA TTT 1116 
Thr Leu lie Glu Ala Val Val Leu lie lie lie Thr Leu Phe Leu Phe 
340 345 350 

TTA GGG AAT TTG AGG GCG AGC GTG GCT GTG GGG GTG ATT TTA CCT TTA 1164 
Leu Gly Asn Leu Arg Ala Ser Val Ala Val Gly Val lie Leu Pro Leu 
355 360 365 

AGC TTG TCC GTG GCG TTT ATT TTT ATC AAG TTT AGC GAT CTG ACT TTA 1212 
Ser Leu Ser Val Ala Phe lie Phe lie Lys Phe Ser Asp Leu Thr Leu 
370 375 380 

AAT TTG ATG AGT TTA GGG GGA TTG GTT ATC GCT ATA GGC ATG CTC ATT 12 6 0 

Asn Leu Met Ser Leu Gly Gly Leu Val lie Ala lie Gly Met Leu lie 
385 390 395 400 

GAC TCA GCC GTG GTG GTG GTG GAA AAC GCT TTT GAA AAA TTA AGC GCT 13 0 8 

Asp Ser Ala Val Val Val Val Glu Asn Ala Phe Glu Lys Leu Ser Ala 
405 410 415 

AAC ACT AAA ACC ACT AAA CTC CAT GCA ATC TAT CGT TCG TGT AAA GAA 13 5 6 

Asn Thr Lys Thr Thr Lys Leu His Ala lie Tyr Arg Ser Cys Lys Glu 
420 425 430 

ATC GCT GTT TCA GTG GTG AGC GGG GTG GTG ATC ATC ATT GTG TTT TTT 14 04 

lie Ala Val Ser Val Val Ser Gly Val Val lie lie lie Val Phe Phe 
435 440 445 

GTG CCG ATT TTA ACC TTA CAG GGG TTA GAG GGT AAG ATG TTT AGG CCT 14 52 

Val Pro lie Leu Thr Leu Gin Gly Leu Glu Gly Lys Met Phe Arg Pro 
450 455 460 

TTA GCG CAA AGC ATT GTG TAT GCG CTT TTA GGC ACT TTA GTT CTA TCT 15 0 0 

Leu Ala Gin Ser lie Val Tyr Ala Leu Leu Gly Thr Leu Val Leu Ser 
465 470 475 480 

ATT ACA ATC ATT CCT GTA GTC AGC TCT CTT GTC TTA AAA GCC ACG CCC 154 8 

lie Thr lie lie Pro Val Val Ser Ser Leu Val Leu Lys Ala Thr Pro 
485 490 495 
Lys Ile Ile Leu Lys Leu Pro Gly Ser Ser Pro Glu Glu Met Glu Asn
50 55 60 

AAC ATC GTG CGC CCT TTA GAA TTG GAG CTT TTA GGC TTG AAA GGG CAA 300 
Asn lie Val Arg Pro Leu Glu Leu Glu Leu Leu Gly Leu Lys Gly Gin 
65 70 75 80 

AAA TCT TTA AGG AGT GTT TCA AAA TAT TCT ATT TCA GAT ATT ACG ATA 34 8 

Lys Ser Leu Arg Ser Val Ser Lys Tyr Ser lie Ser Asp lie Thr lie 
85 90 95 

GAT TTT GAT GAC AGC GTG GAT ATT TAT TTA GCG AGG AAT ATT GTC AAT 396
Asp Phe Asp Asp Ser Val Asp Ile Tyr Leu Ala Arg Asn Ile Val Asn
100 105 110

GAG CGC TTG AGC AGC GTG ATG AAA GAT TTA CCC GTG GGG GTT GAG GGG 444 
Glu Arg Leu Ser Ser Val Met Lys Asp Leu Pro Val Gly Val Glu Gly 
115 120 125 

GGC ATG GCG CCC ATT GTT ACG CCG CTA TCA GAT ATC TTT ATG TTC ACT 4 92 

Gly Met Ala Pro lie Val Thr Pro Leu Ser Asp lie Phe Met Phe Thr 
130 135 140 

ATT GAT GGC AAT ATC ACT GAG ATA GAA AAA CGA CAG CTT TTA GAT TTT 54 0 

lie Asp Gly Asn lie Thr Glu lie Glu Lys Arg Gin Leu Leu Asp Phe 
145 150 155 160 

GTG ATC CGC CCA CAA TTA AGA ATG ATT AGC GGC GTA GCA GAT GTC AAT 58 8 

Val lie Arg Pro Gin Leu Arg Met lie Ser Gly Val Ala Asp Val Asn 
165 170 175 

TCC ATT GGA GGC TTT AGC AGA GCG TTT GTG ATC GTG CCG GAT TTT AAT 63 6 

Ser lie Gly Gly Phe Ser Arg Ala Phe Val lie Val Pro Asp Phe Asn 
180 185 190 

GAC ATG GCA AGG CTT GGG GTG AGT ATT TCT GAT TTA GAA TCG GCT GTG 6 84 

Asp Met Ala Arg Leu Gly Val Ser lie Ser Asp Leu Glu Ser Ala Val 
195 200 205 

AGA GTG AAT TTA AGA AAC AGC GGA GCG GGG CGC GTG GAT AGA GAT GGC 73 2 

Arg Val Asn Leu Arg Asn Ser Gly Ala Gly Arg Val Asp Arg Asp Gly 
210 215 220 

GAA ACC TTT TTA GTC AAA ATC CAA ACC GCT TCT TTG AGT TTA GAA GAC 7 80 

Glu Thr Phe Leu Val Lys lie Gin Thr Ala Ser Leu Ser Leu Glu Asp 
225 230 235 240 

ATT GGC AAA ATC ACC GTT TCC ACT AAT TTA GGG CAT TTG CAC ATT AAG 82 8 

lie Gly Lys lie Thr Val Ser Thr Asn Leu Gly His Leu His lie Lys 
245 250 255 

GAT TTT GCG AAA GTC ATC AGC CAG TCT CGC ACC CGT TTG GGG TTT GTT 87 6 

Asp Phe Ala Lys Val lie Ser Gin Ser Arg Thr Arg Leu Gly Phe Val 
260 265 270 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 920:

Met Ile Leu Gly Asn Asn Val Ser Lys Ser Phe Lys Arg Asp Phe Asn
1 5 10 15

Pro Cys Lys Ile Ser Ala Asn Gly Leu Val Thr Gly Phe Leu Glu Lys
20 25 30

Ile Thr Leu Lys Ala Leu Ile Pro Asn Ser Ile Pro Cys Ser Lys Phe
35 40 45

Thr Asp Arg Ser Leu Glu Ser Leu Phe Arg Leu Ser Asn Ser Ala Trp
50 55 60

Lys Ser Phe Glu Lys Glu Arg Leu Ile Phe Ser Ile Ser Asn Leu Ala
65 70 75 80

Phe Leu Ile Lys Leu Cys Cys Met Met Leu Ser Ile Ser Leu Leu Asn
85 90 95

Ser Leu Gly Leu Leu Ile Lys Glu Ala Lys
100 105

(2) INFORMATION FOR SEQ ID NO: 921: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3179 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 61... 3120 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921: 

CAGCGTGGCA GTGGGGTCAT TGATAGGGTT AAAAGGCATG ATCAACAATT TAGGGGAGGA 60

ATG ATG CTC GCT TCC ATT ATT GAA TTT TCC TTA CGC CAA AGA GTG ATC 108
Met Met Leu Ala Ser Ile Ile Glu Phe Ser Leu Arg Gln Arg Val Ile
1 5 10 15

GTG ATT GTT GGT GCG ATT CTT ATT TTA TTT TTT GGG ACT TAT AGT TTT 156
Val Ile Val Gly Ala Ile Leu Ile Leu Phe Phe Gly Thr Tyr Ser Phe
20 25 30

ATC AAC ACT CCA GTG GAC GCT TTC CCG GAT ATT TCG CCC ACT CAA GTT 204
Ile Asn Thr Pro Val Asp Ala Phe Pro Asp Ile Ser Pro Thr Gln Val
35 40 45

AAA ATC ATT TTA AAA CTC CCC GGC TCT AGC CCT GAA GAA ATG GAA AAC 252
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 360 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31... 348 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919: 

TCTTTTTCTT CATTCCTAAA GAATGAAGCG ATG ATA TTA GGC AAC AAT GTC AGT 54
Met Ile Leu Gly Asn Asn Val Ser
1 5

AAA TCT TTT AAA AGA GAT TTT AAC CCT TGC AAA ATC AGC GCG AAT GGC 102
Lys Ser Phe Lys Arg Asp Phe Asn Pro Cys Lys Ile Ser Ala Asn Gly
10 15 20

CTT GTA ACC GGA TTC TTG GAA AAA ATC ACG CTT AAA GCG TTG ATC CCT 150
Leu Val Thr Gly Phe Leu Glu Lys Ile Thr Leu Lys Ala Leu Ile Pro
25 30 35 40

AAT TCA ATC CCA TGC TCT AAA TTC ACA GAC AGA TCG CTA GAA AGC TTG 198
Asn Ser Ile Pro Cys Ser Lys Phe Thr Asp Arg Ser Leu Glu Ser Leu
45 50 55

TTC AGG CTT TCA AAT TCC GCA TGG AAA TCT TTT GAA AAA GAA AGG TTG 246
Phe Arg Leu Ser Asn Ser Ala Trp Lys Ser Phe Glu Lys Glu Arg Leu
60 65 70

ATC TTT TCA ATC TCT AAT TTA GCG TTT TTG ATC AAG CTT TGT TGC ATG 294
Ile Phe Ser Ile Ser Asn Leu Ala Phe Leu Ile Lys Leu Cys Cys Met
75 80 85

ATG CTT TCT ATT TCG CTA TTG AAC TCG TTA GGC TTG TTG ATT AAA GAG 342
Met Leu Ser Ile Ser Leu Leu Asn Ser Leu Gly Leu Leu Ile Lys Glu
90 95 100

GCT AAA TAGGATTTTT GA 360
Ala Lys
105



(2) INFORMATION FOR SEQ ID NO: 920:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
130 135 140

Gly Phe Asp Ser Pro Ile Ser Ser His Thr His Ala Ile Leu Glu Tyr
145 150 155 160

Leu Glu Arg Gly Val His Phe Val Ile Leu Thr Ser Val Glu Glu Gly
165 170 175

Asn Leu Thr Lys Arg Met Val Arg Glu Leu Lys Asn Leu Leu Glu Phe
180 185 190

Asp Lys Gly Leu Ser Phe Ile Leu Ser Lys Thr Asn Leu Arg Thr Pro
195 200 205

Ser Gln Val Gly Glu Ile Ser His Tyr Ile Gln Asp Gln Ile Gln Asp
210 215 220

His Leu Asp Leu Thr Thr His Leu Ile His Ser Asn Lys Asp Asn Asn
225 230 235 240

Ala Leu Leu Glu Val Ala Asp Lys Ile Asp Ala Glu Lys Leu Phe Ser
245 250 255

Ala Leu Tyr Leu Lys Arg Leu Lys Phe Leu Asn Ser Lys Leu Gln Asn
260 265 270

Ser Leu Lys Ser Val Met Glu Ser Phe Asp Tyr Ser Lys Glu Lys Ala
275 280 285

Leu Glu Glu Ile Gln Ala Leu Asp Leu Gly Val Lys Asp Ile Glu Lys
290 295 300

Thr Tyr Glu Lys Leu Arg Ala Asn Leu Glu Glu Glu Tyr Ser Ser Val
305 310 315 320

Ala Val Gly Ser Val Val Lys Lys Val Val Glu Glu Val Arg Asp Gln
325 330 335

Lys Ser Tyr Leu Ala Ser Leu Ile Asn Lys Pro Asn Glu Phe Asn Ser
340 345 350

Glu Ile Glu Ser Ile Met Gln Gln Ser Leu Ile Lys Asn Ala Lys Leu
355 360 365

Glu Ile Glu Lys Ile Asn Leu Ser Phe Ser Lys Asp Phe His Ala Glu
370 375 380

Phe Glu Ser Leu Asn Lys Leu Ser Ser Asp Leu Ser Val Asn Leu Glu
385 390 395 400

His Gly Ile Glu Leu Gly Ile Asn Ala Leu Ser Val Ile Phe Ser Lys
405 410 415

Asn Pro Val Thr Arg Pro Phe Ala Leu Ile Leu Gln Gly Leu Lys Ser
420 425 430

Leu Leu Lys Asp Leu Leu Thr Leu Leu Pro Asn Ile Ile Ala Ser Phe
435 440 445

Phe Arg Asn Glu Glu Lys Glu Arg Ala Lys Leu Glu Asn Leu Ile Glu
450 455 460

Val Arg Val Ile Pro Glu Ile Gln Tyr Lys Leu Lys Lys Val Leu Pro
465 470 475 480

Gly Leu Phe Asn Glu Ala Leu Gln Asn Ser Leu Lys Ser Leu Lys Asp
485 490 495

Arg Cys Glu Leu Glu Ile Thr His Lys Lys Gln Glu Ile Ala Leu Ala
500 505 510

Gln Lys Glu Lys Glu Lys His Leu Asn Asp Leu Glu Asp Gln Lys Gln
515 520 525

Ile Leu Glu Asn Lys Ile Asn Ala Leu Ser Asp Leu Glu Gln Gln Tyr
530 535 540

Leu Lys Asp Gln Gln
545
AAA TTA GAA AAT CTG ATT GAA GTC AGA GTG ATT CCA GAA ATC CAA TAC 1446
Lys Leu Glu Asn Leu Ile Glu Val Arg Val Ile Pro Glu Ile Gln Tyr
460 465 470

AAG CTT AAA AAA GTT TTA CCG GGA TTG TTT AAT GAA GCT TTG CAA AAT 1494
Lys Leu Lys Lys Val Leu Pro Gly Leu Phe Asn Glu Ala Leu Gln Asn
475 480 485

TCC CTA AAA TCT CTA AAA GAT CGG TGC GAG CTA GAA ATC ACG CAT AAA 1542
Ser Leu Lys Ser Leu Lys Asp Arg Cys Glu Leu Glu Ile Thr His Lys
490 495 500 505

AAA CAA GAA ATC GCG CTC GCT CAA AAG GAA AAA GAA AAA CAC CTA AAC 1590
Lys Gln Glu Ile Ala Leu Ala Gln Lys Glu Lys Glu Lys His Leu Asn
510 515 520

GAT TTA GAA GAT CAA AAA CAA ATC TTA GAA AAT AAG ATC AAC GCT TTA 1638
Asp Leu Glu Asp Gln Lys Gln Ile Leu Glu Asn Lys Ile Asn Ala Leu
525 530 535

AGC GAT TTA GAA CAA CAA TAT TTA AAG GAT CAA CAA TGAACGAGCA AGAACT 1690
Ser Asp Leu Glu Gln Gln Tyr Leu Lys Asp Gln Gln
540 545

CATTCAAAAA AGCGCTTTAA TTGAAAAA 1718

(2) INFORMATION FOR SEQ ID NO: 918: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918: 

Met Met Val Leu Arg Thr Gln Thr Asn Phe Val Glu Phe Leu Glu Gln
1 5 10 15

Val Leu Glu Val Leu Lys Glu Val Glu Ile Asp Lys Thr Glu Cys Ser
20 25 30

Thr Leu Leu Ala Ser Val Gln Lys Gln Gln Leu Val Ile Pro Val Val
35 40 45

Gly Asn Phe Ser Ala Gly Lys Ser Thr Leu Leu Asn Arg Phe Leu Gly
50 55 60

Ser Ser Val Leu Pro Thr Gly Ile Thr Pro Glu Thr Ser Leu Ala Thr
65 70 75 80

Glu Leu His Tyr Ser Ala Lys Glu Arg Ile Glu Ala Phe Ser Asn Asn
85 90 95

Asp Glu Lys Thr Glu Ser Phe Glu Leu Asn Glu Gln Ser Phe Glu Ala
100 105 110

Ile Lys Glu Asn Ala Thr Lys Tyr Ser Tyr Leu Lys Val Tyr Leu Asn
115 120 125

Asn Glu Ala Leu Lys Asn Ser Ala Pro Leu Val Phe Val Asp Met Pro
CAT TCC AAT AAA GAC AAT AAC GCC CTT TTA GAG GTA GCG GAT AAA ATA 774
His Ser Asn Lys Asp Asn Asn Ala Leu Leu Glu Val Ala Asp Lys Ile
235 240 245



GAC GCT GAA AAG CTT TTT AGC GCT TTG TAT TTG AAA CGA TTG AAG TTT 82 2 

Asp Ala Glu Lys Leu Phe Ser Ala Leu Tyr Leu Lys Arg Leu Lys Phe 
250 255 260 265 



TTA AAT TCT AAG TTA CAA AAT AGC CTA AAA AGC GTG ATG GAA AGC TTT 870 
Leu Asn Ser Lys Leu Gin Asn Ser Leu Lys Ser Val Met Glu Ser Phe 
270 275 280 



GAT TAT TCT AAA GAA AAG GCT TTA GAA GAA ATA CAA GCT TTG GAT TTG 918 
Asp Tyr Ser Lys Glu Lys Ala Leu Glu Glu lie Gin Ala Leu Asp Leu 
285 290 295 



GGC GTT AAA GAC ATT GAA AAA ACC TAT GAA AAA TTA AGG GCT AAT TTA 96 6 

Gly Val Lys Asp lie Glu Lys Thr Tyr Glu Lys Leu Arg Ala Asn Leu 
300 305 310 



GAA GAA GAA TAT TCT AGC GTG GCT GTG GGA TCG GTG GTT AAA AAA GTA 1014 
Glu Glu Glu Tyr Ser Ser Val Ala Val Gly Ser Val Val Lys Lys Val 
315 320 325 



GTA GAA GAG GTT AGG GAT CAA AAA TCC TAT TTA GCC TCT TTA ATC AAC 106 2 

Val Glu Glu Val Arg Asp Gin Lys Ser Tyr Leu Ala Ser Leu lie Asn 
330 335 340 345 



AAG CCT AAC GAG TTC AAT AGC GAA ATA GAA AGC ATC ATG CAA CAA AGC 1110 
Lys Pro Asn Glu Phe Asn Ser Glu lie Glu Ser lie Met Gin Gin Ser 
350 355 360 



TTG ATC AAA AAC GCT AAA TTA GAG ATT GAA AAG ATC AAC CTT TCT TTT 115 8 

Leu lie Lys Asn Ala Lys Leu Glu lie Glu Lys lie Asn Leu Ser Phe 
365 370 375 



TCA AAA GAT TTC CAT GCG GAA TTT GAA AGC CTG AAC AAG CTT TCT AGC 12 0 6 

Ser Lys Asp Phe His Ala Glu Phe Glu Ser Leu Asn Lys Leu Ser Ser 
380 385 390 



GAT CTG TCT GTG AAT TTA GAG CAT GGG ATT GAA TTA GGG ATC AAC GCT 12 5 4 

Asp Leu Ser Val Asn Leu Glu His Gly lie Glu Leu Gly lie Asn Ala 
395 400 405 



TTA AGC GTG ATT TTT TCC AAG AAT CCG GTT ACA AGG CCA TTC GCG CTG 13 0 2 

Leu Ser Val lie Phe Ser Lys Asn Pro Val Thr Arg Pro Phe Ala Leu 
410 415 420 425 



ATT TTG CAA GGG TTA AAA TCT CTT TTA AAA GAT TTA CTG ACA TTG TTG 13 5 0 

lie Leu Gin Gly Leu Lys Ser Leu Leu Lys Asp Leu Leu Thr Leu Leu 
430 435 440 



CCT AAT ATC ATC GCT TCA TTC TTT AGG AAT GAA GAA AAA GAG CGG GCG 13 9 8 

Pro Asn lie lie Ala Ser Phe Phe Arg Asn Glu Glu Lys Glu Arg Ala 
445 450 455 
Phe Val Glu Phe Leu Glu Gln Val Leu Glu Val Leu Lys Glu Val Glu
10 15 20 25 

ATC GAT AAA ACA GAA TGC TCC ACG CTT TTA GCA AGC GTT CAA AAA CAA 15 0 

lie Asp Lys Thr Glu Cys Ser Thr Leu Leu Ala Ser Val Gin Lys Gin 
30 35 40 

CAG CTA GTG ATA CCC GTT GTG GGG AAT TTT AGC GCA GGG AAA AGC ACG 198
Gln Leu Val Ile Pro Val Val Gly Asn Phe Ser Ala Gly Lys Ser Thr
45 50 55

CTA TTA AAC CGC TTT TTA GGC AGC AGC GTT TTG CCT ACC GGT ATC ACG 24 6 

Leu Leu Asn Arg Phe Leu Gly Ser Ser Val Leu Pro Thr Gly lie Thr 
60 65 70 

CCA GAG ACT TCT TTA GCC ACT GAG TTG CAC TAT AGC GCT AAG GAA CGC 2 94 

Pro Glu Thr Ser Leu Ala Thr Glu Leu His Tyr Ser Ala Lys Glu Arg 
75 80 85 

ATA GAG GCT TTT TCA AAC AAT GAT GAA AAA ACA GAG AGT TTT GAA CTG 3 42 

lie Glu Ala Phe Ser Asn Asn Asp Glu Lys Thr Glu Ser Phe Glu Leu 
90 95 100 105 

AAT GAG CAA AGT TTT GAA GCG ATT AAA GAG AAT GCC ACG AAG TAT TCC 3 90 

Asn Glu Gin Ser Phe Glu Ala lie Lys Glu Asn Ala Thr Lys Tyr Ser 
110 115 120 

TAC CTT AAG GTT TAT TTG AAT AAT GAA GCT TTG AAA AAC AGC GCT CCT 438
Tyr Leu Lys Val Tyr Leu Asn Asn Glu Ala Leu Lys Asn Ser Ala Pro
125 130 135

TTA GTG TTT GTG GAT ATG CCA GGC TTT GAT AGC CCC ATT TCA AGC CAC 4 86 

Leu Val Phe Val Asp Met Pro Gly Phe Asp Ser Pro lie Ser Ser His 
140 145 150 

ACC CAT GCC ATT TTG GAA TAT TTA GAA AGG GGC GTG CAT TTT GTC ATT 5 34 

Thr His Ala lie Leu Glu Tyr Leu Glu Arg Gly Val His Phe Val lie 
155 160 165 

CTC ACA AGC GTA GAA GAG GGC AAT CTC ACT AAA CGC ATG GTT AGG GAG 5 82 

Leu Thr Ser Val Glu Glu Gly Asn Leu Thr Lys Arg Met Val Arg Glu 
170 175 180 185 

TTA AAA AAC CTT TTA GAG TTT GAC AAA GGC CTT AGC TTT ATT TTG AGT 63 0 

Leu Lys Asn Leu Leu Glu Phe Asp Lys Gly Leu Ser Phe lie Leu Ser 
190 195 200 

AAA ACG AAT TTA AGA ACG CCT TCG CAA GTG GGA GAA ATC TCT CAC TAC 678
Lys Thr Asn Leu Arg Thr Pro Ser Gln Val Gly Glu Ile Ser His Tyr
205 210 215

ATT CAA GAT CAA ATC CAG GAT CAC CTT GAT TTG ACA ACG CAC CTC ATC 72 6 

lie Gin Asp Gin lie Gin Asp His Leu Asp Leu Thr Thr His Leu lie 
220 225 230 
Ala Lys Gln Gly Tyr Thr Gly Gly Asp Lys Glu Ala Ser Lys Glu Ser
210 215 220

Leu Lys His Ala Lys His Ile Leu Trp Leu Ile Ser Cys Glu Ser Gly
225 230 235 240

Glu Ile His Glu Asp Asp Leu Glu Tyr Leu Gln Glu Leu Tyr Glu Glu
245 250 255

Gly Lys Gln Val Phe Ile Val Leu Ser Arg Ala Asp Arg Arg Thr Lys
260 265 270

Arg Gln Leu Glu Glu Val Val Ile Lys Ile Lys Glu Thr Leu Lys Asp
275 280 285

Asn Gly Ile Glu Phe Leu Gly Ile Gly Ala Tyr Ser Ser Thr Arg Tyr
290 295 300

Gln Glu Tyr Lys Glu Phe Ser Glu Lys Ser Lys Val Phe Asn Ser Leu
305 310 315 320

Glu Glu Phe Leu Met Lys Leu Asn Gln Arg Ser Glu Lys Gln Asn Glu
325 330 335

Ile Leu Gly Tyr Leu Tyr Glu Val His Ser Met Tyr Glu Lys Ala Ile
340 345 350

Glu Gln Asp Ala Asn Gln Phe Lys Arg Tyr Gln Ser Glu Leu His Ser
355 360 365

Val Arg Leu Asp Leu Met Gln Lys Gly Phe Asp Asp Phe Ser Asp Lys
370 375 380

Ile Phe Arg Arg Ile Glu Asn Leu Glu Lys Glu Phe Ser Glu Gln Glu
385 390 395 400

Arg Ser Lys Arg Glu Ser Leu Ala Arg Leu Asn Glu Val Ile Asp Leu
405 410 415

Phe Lys Glu Gly Ile Asp Lys Val Phe Asp Arg Val Ser Ala Phe Thr
420 425 430

Trp Glu Lys Tyr Lys Glu Gln Asn Asp Asp Glu Glu Asp Asp Asp
435 440 445



(2) INFORMATION FOR SEQ ID NO: 917: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28... 1674 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917: 

ATTTTAGTTT TTAATTTTAA AGGATTG ATG ATG GTT TTA CGC ACA CAG ACA AAT 54 

Met Met Val Leu Arg Thr Gln Thr Asn
1 5 

TTT GTG GAG TTT TTA GAA CAG GTT TTA GAA GTT TTA AAA GAA GTG GAG 102 
Asn Leu Glu Lys Glu Phe Ser Glu Gln Glu Arg Ser Lys Arg Glu Ser
395 400 405

TTA GCG CGA TTG AAT GAA GTG ATT GAC TTG TTT AAA GAA GGT ATT GAT 1364
Leu Ala Arg Leu Asn Glu Val Ile Asp Leu Phe Lys Glu Gly Ile Asp
410 415 420

AAG GTT TTT GAT CGC GTG AGC GCT TTC ACT TGG GAA AAA TAC AAA GAA 1412
Lys Val Phe Asp Arg Val Ser Ala Phe Thr Trp Glu Lys Tyr Lys Glu
425 430 435

CAA AAT GAC GAT GAA GAG GAC GAT GAT TGAAGAAAAC TACAAAGAAG AGCGTTA 1466
Gln Asn Asp Asp Glu Glu Asp Asp Asp
440 445

CACCGAAAGG GTGAATCAAG GCGG 1490

(2) INFORMATION FOR SEQ ID NO: 916:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 447 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916: 

Met Ser Val Asn Phe Phe Lys Gly Ile Phe Asn Asp Asn Ser Arg Ala
1 5 10 15

Glu Asn His Gln Asp Asn His Gln Asn Asn His Gln Val Gly Leu Lys
20 25 30

Glu Arg Tyr Asp Leu Ile Ala Arg Ile Leu Asn Ala Arg Ile Glu Asn
35 40 45

Glu Gly Leu Glu Glu Tyr Gln Ser Val Leu Asp Asn Glu Phe Leu Glu
50 55 60

Phe Ala Ser Gly Val Asp Ser Leu Lys Glu Lys Glu Ile Ala Leu Leu
65 70 75 80

Thr Leu Gln Glu Ile Gln Lys Glu Leu Gln Leu Val Ala Ser Tyr Pro
85 90 95

Ser Leu Phe Gln Lys Thr Ile Val Ala Val Gly Gly Gly Phe Ser Ala
100 105 110

Gly Lys Ser Thr Phe Leu Asn Asn Leu Leu Gly Leu Lys Leu Lys Leu
115 120 125

Pro Glu Asp Met Asn Pro Thr Thr Ala Ile Pro Thr Tyr Cys Leu Lys
130 135 140

Gly Lys Arg Glu Val Leu Met Gly Phe Ser Gln Asn Gly Gly Met Val
145 150 155 160

Glu Leu Pro His Leu Ala Phe Asp His Gln Phe Leu Asn Ser Leu Gly
165 170 175

Phe Asn Leu Lys Glu Ile Met Pro Phe Met Leu Leu Ser Ala Pro Ser
180 185 190

Val Pro Phe Glu Phe Leu Cys Phe Ile Asp Thr Pro Gly Phe Asn Ser
195 200 205
170 175 180

ATG CCT TTC ATG CTT TTA AGC GCT CCT AGC GTG CCT TTT GAA TTT TTA 692
Met Pro Phe Met Leu Leu Ser Ala Pro Ser Val Pro Phe Glu Phe Leu
185 190 195

TGC TTC ATA GAC ACG CCT GGT TTT AAC TCC GCC AAG CAA GGC TAT ACG 740
Cys Phe Ile Asp Thr Pro Gly Phe Asn Ser Ala Lys Gln Gly Tyr Thr
200 205 210

GGT GGG GAT AAA GAA GCC TCT AAA GAA TCC CTA AAA CAC GCC AAA CAC 788 
Gly Gly Asp Lys Glu Ala Ser Lys Glu Ser Leu Lys His Ala Lys His 
215 220 225 230 

ATT CTG TGG CTC ATT AGT TGC GAG AGT GGG GAG ATT CAC GAA GAT GAT 836 
lie Leu Trp Leu lie Ser Cys Glu Ser Gly Glu lie His Glu Asp Asp 
235 240 245 

TTA GAA TAT TTG CAA GAA TTA TAC GAA GAA GGC AAG CAG GTT TTT ATC 8 84 

Leu Glu Tyr Leu Gin Glu Leu Tyr Glu Glu Gly Lys Gin Val Phe lie 
250 255 260 

GTA TTG AGT AGG GCT GAT AGG CGC ACA AAA AGG CAA TTA GAA GAA GTC 93 2 

Val Leu Ser Arg Ala Asp Arg Arg Thr Lys Arg Gin Leu Glu Glu Val 
265 270 275 

GTT ATT AAA ATT AAA GAG ACT TTA AAA GAT AAT GGC ATT GAA TTT TTA 98 0 

Val lie Lys lie Lys Glu Thr Leu Lys Asp Asn Gly lie Glu Phe Leu 
280 285 290 

GGG ATT GGT GCT TAT AGT TCT ACA AGG TAT CAA GAA TAT AAA GAA TTC 102 8 

Gly lie Gly Ala Tyr Ser Ser Thr Arg Tyr Gin Glu Tyr Lys Glu Phe 
295 300 305 310 

AGC GAA AAA AGC AAA GTT TTT AAC TCG CTT GAG GAA TTT CTA ATG AAG 107 6 

Ser Glu Lys Ser Lys Val Phe Asn Ser Leu Glu Glu Phe Leu Met Lys 
315 320 325 

TTA AAT CAA AGG AGC GAG AAA CAA AAC GAA ATT TTA GGA TAT TTA TAC 1124 
Leu Asn Gin Arg Ser Glu Lys Gin Asn Glu lie Leu Gly Tyr Leu Tyr 
330 335 340 

GAG GTG CAT TCC ATG TAT GAA AAG GCT ATT GAG CAA GAC GCT AAC CAA 1172 
Glu Val His Ser Met Tyr Glu Lys Ala lie Glu Gin Asp Ala Asn Gin 
345 350 355 

TTC AAA CGC TAC CAA AGC GAA TTG CAT TCT GTT AGA TTG GAT TTG ATG 12 2 0 

Phe Lys Arg Tyr Gin Ser Glu Leu His Ser Val Arg Leu Asp Leu Met 
360 365 370 

CAA AAA GGC TTT GAT GAT TTT AGC GAT AAA ATT TTT AGA AGA ATT GAG 12 6 8 

Gin Lys Gly Phe Asp Asp Phe Ser Asp Lys lie Phe Arg Arg lie Glu 
375 380 385 390 

AAT TTA GAA AAA GAA TTT TCC GAG CAA GAG CGA TCC AAA AGA GAG AGT 1316
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 99...1439
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915:

CTTAAAGAAA ACATGCAAAA TTTGCAAAAT CAAGTTCAAA ACAAAGAGCA ATCCATCGCT 60

CAATTAGATG CACAAATCCA AGCTTTAAAG GGGATTCA ATG AGC GTT AAT TTT TTT 116
Met Ser Val Asn Phe Phe
1 5

AAG GGC ATT TTT AAT GAC AAT AGC AGG GCT GAA AAC CAC CAA GAC AAC 164
Lys Gly Ile Phe Asn Asp Asn Ser Arg Ala Glu Asn His Gln Asp Asn
10 15 20

CAC CAA AAC AAC CAT CAA GTG GGC TTA AAA GAG CGT TAC GAT TTG ATC 212
His Gln Asn Asn His Gln Val Gly Leu Lys Glu Arg Tyr Asp Leu Ile
25 30 35

GCT CGT ATT TTA AAC GCC AGA ATT GAA AAT GAA GGG CTA GAA GAA TAT 260 
Ala Arg lie Leu Asn Ala Arg lie Glu Asn Glu Gly Leu Glu Glu Tyr 
40 45 50 

CAG AGC GTC TTG GAT AAC GAG TTT TTA GAG TTC GCT AGC GGC GTG GAT 3 08 

Gin Ser Val Leu Asp Asn Glu Phe Leu Glu Phe Ala Ser Gly Val Asp 
55 60 65 70 

TCG CTC AAA GAA AAG GAA ATA GCG TTA CTG ACG CTC CAA GAA ATC CAA 3 56 

Ser Leu Lys Glu Lys Glu lie Ala Leu Leu Thr Leu Gin Glu lie Gin 
75 80 85 

AAA GAA TTG CAA TTG GTA GCG AGC TAC CCT AGT TTG TTC CAA AAA ACC 4 04 

Lys Glu Leu Gin Leu Val Ala Ser Tyr Pro Ser Leu Phe Gin Lys Thr 
90 95 100 

ATC GTT GCG GTG GGG GGA GGG TTT AGC GCG GGC AAA TCC ACT TTT TTA 4 52 

lie Val Ala Val Gly Gly Gly Phe Ser Ala Gly Lys Ser Thr Phe Leu 
105 110 115 

AAC AAC TTG TTG GGC TTG AAA TTA AAA CTC CCT GAA GAC ATG AAT CCC 50 0 

Asn Asn Leu Leu Gly Leu Lys Leu Lys Leu Pro Glu Asp Met Asn Pro 
120 125 130 

ACC ACA GCT ATC CCC ACT TAT TGC TTA AAG GGT AAA AGA GAA GTT TTA 54 8 

Thr Thr Ala lie Pro Thr Tyr Cys Leu Lys Gly Lys Arg Glu Val Leu 
135 140 145 150 

ATG GGG TTT TCT CAA AAT GGG GGC ATG GTG GAA TTG CCA CAT CTC GCT 5 96 

Met Gly Phe Ser Gin Asn Gly Gly Met Val Glu Leu Pro His Leu Ala 
155 160 165 

TTT GAC CAT CAG TTT TTA AAC TCC CTT GGC TTT AAT TTG AAA GAG ATC 644 
Phe Asp His Gin Phe Leu Asn Ser Leu Gly Phe Asn Leu Lys Glu lie 
Ser Lys Val Asp Ala Ile Thr Trp Leu
205 210 



(2) INFORMATION FOR SEQ ID NO: 914: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914: 

Met Lys Phe Leu Asp Gln Glu Lys Arg Arg Gln Leu Leu Asn Glu Arg
1 5 10 15

His Ser Cys Lys Met Phe Asp Ser His Tyr Glu Phe Ser Ser Thr Glu
20 25 30

Leu Glu Glu Ile Ala Glu Ile Ala Arg Leu Ser Pro Ser Ser Tyr Asn
35 40 45

Thr Gln Pro Trp His Phe Val Met Val Thr Asp Lys Asp Leu Lys Lys
50 55 60

Gln Ile Ala Ala His Ser Tyr Phe Asn Glu Glu Met Ile Lys Ser Ala
65 70 75 80

Ser Ala Leu Met Val Val Cys Ser Leu Arg Pro Ser Glu Leu Leu Pro
85 90 95

His Gly His Tyr Met Gln Asn Leu Tyr Pro Glu Ser Tyr Lys Val Arg
100 105 110

Val Ile Pro Ser Phe Ala Gln Met Leu Gly Val Arg Phe Asn His Ser
115 120 125

Met Gln Arg Leu Glu Ser Tyr Ile Leu Glu Gln Cys Tyr Ile Ala Val
130 135 140

Gly Gln Ile Cys Met Gly Val Ser Leu Met Gly Leu Asp Ser Cys Ile
145 150 155 160

Ile Gly Gly Phe Asp Pro Leu Lys Val Gly Glu Val Leu Glu Glu Arg
165 170 175

Ile Asn Lys Pro Lys Ile Ala Cys Leu Ile Ala Leu Gly Lys Arg Val
180 185 190

Ala Glu Ala Ser Gln Lys Ser Arg Lys Ser Lys Val Asp Ala Ile Thr
195 200 205

Trp Leu
210

(2) INFORMATION FOR SEQ ID NO: 915: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1490 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913:

AAATTCTAAA AAAATAAAGG AAAATCA ATG AAA TTT TTG GAT CAA GAA AAA AGA 54
Met Lys Phe Leu Asp Gln Glu Lys Arg
1 5

AGA CAA TTA TTA AAC GAG CGC CAT TCT TGC AAG ATG TTT GAT AGC CAT 102
Arg Gln Leu Leu Asn Glu Arg His Ser Cys Lys Met Phe Asp Ser His
10 15 20 25

TAT GAG TTT TCT AGC ACA GAA TTA GAA GAA ATC GCT GAA ATC GCC AGG 150
Tyr Glu Phe Ser Ser Thr Glu Leu Glu Glu Ile Ala Glu Ile Ala Arg
30 35 40

CTA TCG CCA AGC TCT TAC AAC ACG CAG CCA TGG CAT TTT GTG ATG GTT 198
Leu Ser Pro Ser Ser Tyr Asn Thr Gln Pro Trp His Phe Val Met Val
45 50 55

ACT GAT AAG GAT TTA AAA AAA CAA ATT GCA GCG CAC AGC TAT TTC AAT 246
Thr Asp Lys Asp Leu Lys Lys Gln Ile Ala Ala His Ser Tyr Phe Asn
60 65 70

GAA GAG ATG ATT AAA AGC GCT TCA GCG TTA ATG GTG GTA TGC TCT TTA 294
Glu Glu Met Ile Lys Ser Ala Ser Ala Leu Met Val Val Cys Ser Leu
75 80 85

AGA CCC AGC GAG TTG TTA CCA CAC GGC CAC TAC ATG CAA AAT CTC TAT 342
Arg Pro Ser Glu Leu Leu Pro His Gly His Tyr Met Gln Asn Leu Tyr
90 95 100 105

CCG GAG TCT TAT AAA GTT AGA GTG ATC CCC TCT TTT GCT CAA ATG CTT 390
Pro Glu Ser Tyr Lys Val Arg Val Ile Pro Ser Phe Ala Gln Met Leu
110 115 120

GGC GTG AGA TTC AAC CAC AGC ATG CAA AGA TTA GAA AGC TAT ATT TTA 438
Gly Val Arg Phe Asn His Ser Met Gln Arg Leu Glu Ser Tyr Ile Leu
125 130 135

GAG CAA TGC TAT ATC GCT GTG GGG CAA ATT TGC ATG GGC GTG AGC TTA 486
Glu Gln Cys Tyr Ile Ala Val Gly Gln Ile Cys Met Gly Val Ser Leu
140 145 150

ATG GGA TTG GAT AGT TGC ATT ATT GGA GGC TTT GAT CCT TTA AAG GTG 534
Met Gly Leu Asp Ser Cys Ile Ile Gly Gly Phe Asp Pro Leu Lys Val
155 160 165

GGC GAA GTT TTA GAA GAG CGT ATC AAT AAG CCT AAA ATC GCA TGC TTG 582
Gly Glu Val Leu Glu Glu Arg Ile Asn Lys Pro Lys Ile Ala Cys Leu
170 175 180 185

ATC GCT TTG GGC AAG AGG GTG GCA GAA GCG AGT CAA AAA TCA AGA AAA 630
Ile Ala Leu Gly Lys Arg Val Ala Glu Ala Ser Gln Lys Ser Arg Lys
190 195 200

TCA AAA GTT GAT GCG ATT ACT TGG TTG TGATTAAACA AAATCAAAAA CTTT 681 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912:

Met Lys Phe Lys Phe Leu Asn Met Asp Asn Glu Ser Gly Phe Ile Leu
1 5 10 15

Ile Glu Lys Glu Leu Lys Arg Leu Asn Ile Leu Ala Gln Val Lys Glu
20 25 30

Asp Cys Ile Glu Leu Lys Gly Glu Asn Thr Glu Gln Ala Arg Ile Tyr
35 40 45

Leu Lys Thr Leu Phe Asn Ser Asn Ile Val Glu Leu Asp Asp His Gln
50 55 60

Lys Ser Ala Asn Ala Leu Ile Glu Arg Leu Lys Ser Leu Asp Leu Lys
65 70 75 80

Ile Ala Val Ala Glu Ser Cys Ser Gly Gly Leu Leu Ser His Ala Phe
85 90 95

Thr Ser Ile Ser Gly Ala Ser Ala Val Phe Met Gly Gly Ile Val Cys
100 105 110

Tyr Asn Glu Glu Val Lys Arg Glu Leu Leu Lys Val Asn Ala Thr Thr
115 120 125

Leu Lys Val Phe Gly Val Tyr Ser Glu Glu Cys Val Lys Glu Met Leu
130 135 140

Leu Gly Val Phe Leu Asn Phe Lys Val Asp Leu Ala Leu Ala Met Ser
145 150 155 160

Gly Val Ala Gly Pro Asn Gly Gly Asn Lys Ala Asn Pro Val Gly Thr
165 170 175

Ile Tyr Ile Gly Ala Gln Lys Leu Gly Ser Gln Ala Leu Ile Asp Arg
180 185 190

Cys Phe Phe Glu Gly Asn Arg Glu Ser Ile Gln Asn Lys Ser Val Glu
195 200 205

His Ala Leu Asn Met Leu Ala Arg Met Leu
210 215

(2) INFORMATION FOR SEQ ID NO: 913: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28... 657 
(D) OTHER INFORMATION: 
CGA TTA AAC ATT CTC GCT CAA GTC AAA GAA GAT TGC ATT GAA TTA AAA 149
Arg Leu Asn Ile Leu Ala Gln Val Lys Glu Asp Cys Ile Glu Leu Lys
25 30 35

GGC GAA AAC ACA GAA CAA GCG AGA ATT TAT CTT AAA ACG CTT TTT AAC 197
Gly Glu Asn Thr Glu Gln Ala Arg Ile Tyr Leu Lys Thr Leu Phe Asn
40 45 50

TCC AAT ATT GTA GAA TTA GAC GAT CAT CAA AAA AGT GCA AAC GCT TTA 245
Ser Asn Ile Val Glu Leu Asp Asp His Gln Lys Ser Ala Asn Ala Leu
55 60 65 70

ATA GAG CGC TTG AAA TCT TTA GAT TTA AAA ATT GCG GTG GCT GAA AGC 293
Ile Glu Arg Leu Lys Ser Leu Asp Leu Lys Ile Ala Val Ala Glu Ser
75 80 85

TGC TCT GGG GGG CTA TTA TCG CAT GCA TTC ACT TCC ATT AGC GGG GCT 341
Cys Ser Gly Gly Leu Leu Ser His Ala Phe Thr Ser Ile Ser Gly Ala
90 95 100

TCA GCG GTT TTT ATG GGG GGT ATT GTG TGC TAC AAT GAA GAG GTT AAG 389
Ser Ala Val Phe Met Gly Gly Ile Val Cys Tyr Asn Glu Glu Val Lys
105 110 115

CGC GAA TTA TTG AAG GTC AAT GCC ACG ACT TTA AAA GTC TTT GGG GTT 437
Arg Glu Leu Leu Lys Val Asn Ala Thr Thr Leu Lys Val Phe Gly Val
120 125 130

TAT AGC GAA GAA TGC GTG AAA GAA ATG CTA CTA GGC GTG TTT TTG AAT 485
Tyr Ser Glu Glu Cys Val Lys Glu Met Leu Leu Gly Val Phe Leu Asn
135 140 145 150

TTT AAA GTG GAT TTA GCG CTT GCG ATG AGT GGG GTG GCT GGC CCT AAT 533
Phe Lys Val Asp Leu Ala Leu Ala Met Ser Gly Val Ala Gly Pro Asn
155 160 165

GGG GGG AAC AAG GCT AAT CCT GTA GGC ACG ATT TAC ATT GGC GCG CAA 581
Gly Gly Asn Lys Ala Asn Pro Val Gly Thr Ile Tyr Ile Gly Ala Gln
170 175 180

AAG TTA GGA TCT CAA GCT TTA ATC GAT CGC TGT TTT TTT GAA GGG AAC 629
Lys Leu Gly Ser Gln Ala Leu Ile Asp Arg Cys Phe Phe Glu Gly Asn
185 190 195

AGA GAA AGC ATT CAA AAT AAA AGC GTA GAG CAT GCC TTA AAC ATG CTC 677
Arg Glu Ser Ile Gln Asn Lys Ser Val Glu His Ala Leu Asn Met Leu
200 205 210

GCT AGA ATG CTA TAAAACTACC TTAACGCACA AACGCTACCA AATTCTTTTT GAGCG 734
Ala Arg Met Leu
215

ACCTTAGCGA TGTAAGCGAT TT 756

(2) INFORMATION FOR SEQ ID NO: 912: 
85 90 95

CCGGATAAGG CTCTGTAA 367

(2) INFORMATION FOR SEQ ID NO: 910: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910:

Met Ser Phe Arg Val Ser Asn Thr Thr Pro Gly His Glu Ser Gly Ala
1 5 10 15

Phe Leu Asn Ala Glu Ile Ser Pro Ala Phe Pro Lys Glu Val Pro Phe
20 25 30

Ala Pro Ser Phe Phe Ser Ile Thr Gln Thr Leu Cys Pro Asn Leu Cys
35 40 45

Ile Glu Tyr Ala Gln Glu Ser Pro Thr Ile Pro Pro Pro Met Thr Thr
50 55 60

Thr Ser Phe Phe Met Leu Ile Val Pro Leu Ile Asn Tyr Leu Met Ala
65 70 75 80

Ile Ala Ser Ile Ser Thr Lys Ala Ser Leu Gly Ser Leu Ala Thr
85 90 95

(2) INFORMATION FOR SEQ ID NO: 911: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 36... 689 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911:

GCGCACGCTG AATTTAGGGT TTTGAAAGGT TAAAA ATG AAA TTT AAA TTT TTG 53
Met Lys Phe Lys Phe Leu
1 5

AAT ATG GAT AAT GAA AGC GGT TTT ATT TTG ATT GAA AAA GAA TTG AAA 101
Asn Met Asp Asn Glu Ser Gly Phe Ile Leu Ile Glu Lys Glu Leu Lys
10 15 20 
385 390 395 400

Val Gly Ala Met Ala Lys Ile Asp Phe Val Trp Asn Leu Ala Asp Leu
405 410 415

Ser Asn Gly Leu Met Ala Ile Pro Asn Leu Ile Ala Leu Ile Leu Leu
420 425 430

His Lys Val Val Tyr Ser Glu Thr Arg Trp Tyr Phe Ser Lys His Ser
435 440 445

Asn Lys
450

(2) INFORMATION FOR SEQ ID NO: 909:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 333 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909: 

TTTAAATTAA GCCCGAAATG GAATTTTAAA GGGGCTTGGT TTTTGAGC ATG AGC TTC 57
Met Ser Phe
1

AGG GTG TCT AAC ACC ACA CCA GGG CAT GAG AGT GGG GCT TTT TTA AAC 105
Arg Val Ser Asn Thr Thr Pro Gly His Glu Ser Gly Ala Phe Leu Asn
5 10 15

GCA GAA ATA AGC CCA GCA TTC CCA AAA GAA GTG CCG TTT GCG CCA TCG 153
Ala Glu Ile Ser Pro Ala Phe Pro Lys Glu Val Pro Phe Ala Pro Ser
20 25 30 35

TTT TTT TCT ATC ACG CAG ACC TTA TGC CCT AAC TTG TGC ATA GAA TAC 201
Phe Phe Ser Ile Thr Gln Thr Leu Cys Pro Asn Leu Cys Ile Glu Tyr
40 45 50

GCA CAA GAA AGC CCT ACA ATC CCA CCG CCT ATG ACC ACG ACC TCT TTT 249
Ala Gln Glu Ser Pro Thr Ile Pro Pro Pro Met Thr Thr Thr Ser Phe
55 60 65

TTC ATG CTG ATA GTC CCT TTA ATA AAT TAC TTA ATG GCT ATC GCT TCA 297
Phe Met Leu Ile Val Pro Leu Ile Asn Tyr Leu Met Ala Ile Ala Ser
70 75 80

ATT TCT ACT AAA GCG TCT TTA GGC AGT TTA GCC ACT TGAAAGGTCG CTCTGG 349
Ile Ser Thr Lys Ala Ser Leu Gly Ser Leu Ala Thr
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908:

Met Glu Thr Ile Asp Ser Val Val Arg Leu Leu Ser Asn Leu Val Trp
1 5 10 15

Gly Ile Pro Met Gln Ile Leu Leu Val Gly Thr Gly Leu Phe Leu Thr
20 25 30

Phe Tyr Leu Arg Gly Leu Gln Phe Ser Lys Ile Phe Tyr Ala Ile Lys
35 40 45

Ile Leu Phe Asp Lys Glu Ser Gln Ser Lys Gly Asp Ile Ser Gln Phe
50 55 60

Ser Ala Leu Met Leu Ser Leu Gly Ala Thr Val Gly Ile Gly Ser Ile
65 70 75 80

Val Gly Val Ala Thr Ala Ile Ser Ile Ala Gly Pro Gly Ala Val Phe
85 90 95

Trp Met Trp Val Thr Gly Leu Val Gly Met Ala Thr Lys Tyr Ser Glu
100 105 110

Gly Ile Leu Ala Val Lys Tyr Arg Glu Lys Gly Ala Phe Gly Tyr Asn
115 120 125

Gly Gly Pro Met Tyr Tyr Ile Lys Asn Gly Leu Asn Met Pro Lys Leu
130 135 140

Ala Met Ala Phe Ala Ile Phe Thr Ile Ile Ala Ser Ile Gly Thr Gly
145 150 155 160

Asn Met Thr Gln Ser Asn Ala Val Ser Ser Ile Leu Ser Glu Gln Ala
165 170 175

Asn Leu Pro Asn Trp Val Ser Gly Leu Leu Leu Thr Leu Leu Thr Ala
180 185 190

Phe Ile Val Ile Gly Gly Ile Lys Ser Ile Gly Lys Phe Thr Ser Tyr
195 200 205

Leu Ala Pro Val Met Val Leu Leu Tyr Leu Ile Ala Ile Ile Tyr Ile
210 215 220

Ile Val Ser His Phe Asp Leu Ala Leu Gln Ala Ile Lys Leu Ile Phe
225 230 235 240

Glu Glu Ala Phe Asn Pro Lys Pro Val Val Gly Gly Ala Ser Gly Ala
245 250 255

Leu Ile Ala Thr Met Ile Lys Thr Gly Val Ala Arg Gly Leu Tyr Ser
260 265 270

Asn Glu Ala Gly Leu Gly Ser Ser Ala Ile Ile Ala Ala Ser Ala Gln
275 280 285

Thr Arg His Pro Val Arg Gln Ala Leu Val Ser Met Leu Gln Thr Phe
290 295 300

Ile Val Thr Leu Ile Val Cys Ser Ala Thr Ala Ser Val Ile Leu Met
305 310 315 320

Ala Pro Glu Tyr Asn Thr Leu Leu Pro Asn Gly Glu Lys Leu Ser Ala
325 330 335

Asn Leu Leu Thr Leu Lys Ser Thr Glu Tyr Phe Leu Gly Ser Leu Gly
340 345 350

Thr Val Val Ile Phe Thr Thr Met Ile Phe Phe Ala Tyr Ser Thr Ile
355 360 365

Ile Gly Trp Ala Tyr Tyr Gly Glu Lys Cys Thr Glu Tyr Ala Phe Gly
370 375 380

Glu Lys Lys Val Lys Tyr Tyr Arg Leu Ile Phe Leu Ala Ser Val Met
Ala Arg Gly Leu Tyr Ser Asn Glu Ala Gly Leu Gly Ser Ser Ala Ile
270 275 280

ATT GCC GCG AGC GCT CAA ACA CGC CAC CCG GTG CGC CAA GCC TTA GTG 916
Ile Ala Ala Ser Ala Gln Thr Arg His Pro Val Arg Gln Ala Leu Val
285 290 295

TCC ATG CTC CAA ACT TTT ATT GTA ACC TTA ATA GTG TGT TCG GCA ACA 964
Ser Met Leu Gln Thr Phe Ile Val Thr Leu Ile Val Cys Ser Ala Thr
300 305 310

GCG AGC GTG ATT TTA ATG GCT CCA GAA TAC AAC ACC TTG CTC CCT AAT 1012
Ala Ser Val Ile Leu Met Ala Pro Glu Tyr Asn Thr Leu Leu Pro Asn
315 320 325 330

GGG GAA AAA TTA AGC GCT AAT TTG CTC ACT CTA AAA AGC ACG GAG TAT 1060
Gly Glu Lys Leu Ser Ala Asn Leu Leu Thr Leu Lys Ser Thr Glu Tyr
335 340 345

TTT CTA GGC TCA TTA GGG ACG GTG GTG ATT TTT ACA ACC ATG ATC TTT 1108
Phe Leu Gly Ser Leu Gly Thr Val Val Ile Phe Thr Thr Met Ile Phe
350 355 360

TTT GCC TAC TCT ACG ATC ATT GGT TGG GCT TAT TAT GGG GAA AAA TGC 1156
Phe Ala Tyr Ser Thr Ile Ile Gly Trp Ala Tyr Tyr Gly Glu Lys Cys
365 370 375

ACT GAA TAC GCC TTT GGT GAA AAA AAA GTG AAA TAT TAC CGC TTG ATC 1204
Thr Glu Tyr Ala Phe Gly Glu Lys Lys Val Lys Tyr Tyr Arg Leu Ile
380 385 390

TTT TTA GCG AGT GTG ATG GTG GGG GCT ATG GCC AAA ATT GAT TTT GTG 1252
Phe Leu Ala Ser Val Met Val Gly Ala Met Ala Lys Ile Asp Phe Val
395 400 405 410

TGG AAT TTA GCG GAT CTT TCT AAC GGG CTT ATG GCT ATC CCT AAT TTA 1300
Trp Asn Leu Ala Asp Leu Ser Asn Gly Leu Met Ala Ile Pro Asn Leu
415 420 425

ATC GCT TTG ATT TTA TTG CAT AAA GTG GTT TAT TCT GAA ACT CGT TGG 1348
Ile Ala Leu Ile Leu Leu His Lys Val Val Tyr Ser Glu Thr Arg Trp
430 435 440

TAT TTT AGC AAG CAT TCT AAC AAG TAAAATGGCA TGTTAAAAAG GGCGAGTTTT 1402
Tyr Phe Ser Lys His Ser Asn Lys
445 450

GTAGAAGTGG ATACCGCTTC T 1423

(2) INFORMATION FOR SEQ ID NO: 908: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
45 50 55

GGC GAC ATT TCA CAA TTT TCC GCT CTC ATG CTC TCT TTG GGG GCG ACT 244
Gly Asp Ile Ser Gln Phe Ser Ala Leu Met Leu Ser Leu Gly Ala Thr
60 65 70

GTA GGC ATT GGG AGT ATC GTA GGC GTA GCG ACC GCT ATT AGC ATC GCA 292
Val Gly Ile Gly Ser Ile Val Gly Val Ala Thr Ala Ile Ser Ile Ala
75 80 85 90

GGG CCA GGA GCG GTG TTT TGG ATG TGG GTT ACT GGG CTT GTT GGC ATG 340
Gly Pro Gly Ala Val Phe Trp Met Trp Val Thr Gly Leu Val Gly Met
95 100 105

GCG ACT AAG TAT TCT GAG GGG ATT TTA GCG GTG AAA TAC CGG GAA AAA 388
Ala Thr Lys Tyr Ser Glu Gly Ile Leu Ala Val Lys Tyr Arg Glu Lys
110 115 120

GGG GCG TTT GGA TAC AAC GGA GGG CCC ATG TAT TAC ATC AAA AAC GGT 436
Gly Ala Phe Gly Tyr Asn Gly Gly Pro Met Tyr Tyr Ile Lys Asn Gly
125 130 135

CTT AAC ATG CCC AAA CTC GCC ATG GCG TTT GCG ATT TTT ACG ATT ATT 484
Leu Asn Met Pro Lys Leu Ala Met Ala Phe Ala Ile Phe Thr Ile Ile
140 145 150

GCA AGC ATT GGC ACC GGT AAC ATG ACG CAA TCT AAT GCG GTT TCT TCC 532
Ala Ser Ile Gly Thr Gly Asn Met Thr Gln Ser Asn Ala Val Ser Ser
155 160 165 170

ATT TTA AGC GAA CAA GCG AAC CTG CCT AAT TGG GTT TCA GGT TTA TTG 580
Ile Leu Ser Glu Gln Ala Asn Leu Pro Asn Trp Val Ser Gly Leu Leu
175 180 185

CTC ACT CTT TTA ACC GCT TTC ATT GTC ATA GGG GGG ATC AAA TCC ATT 628
Leu Thr Leu Leu Thr Ala Phe Ile Val Ile Gly Gly Ile Lys Ser Ile
190 195 200

GGT AAA TTC ACT TCT TAC TTA GCT CCT GTT ATG GTG CTT TTA TAT TTG 676
Gly Lys Phe Thr Ser Tyr Leu Ala Pro Val Met Val Leu Leu Tyr Leu
205 210 215

ATC GCT ATT ATT TAT ATT ATT GTT AGC CAT TTT GAT TTA GCC CTT CAA 724
Ile Ala Ile Ile Tyr Ile Ile Val Ser His Phe Asp Leu Ala Leu Gln
220 225 230

GCG ATC AAA CTC ATT TTT GAA GAA GCC TTT AAC CCT AAA CCC GTT GTG 772
Ala Ile Lys Leu Ile Phe Glu Glu Ala Phe Asn Pro Lys Pro Val Val
235 240 245 250

GGC GGA GCG AGC GGC GCG TTG ATA GCG ACG ATG ATA AAA ACG GGC GTG 820
Gly Gly Ala Ser Gly Ala Leu Ile Ala Thr Met Ile Lys Thr Gly Val
255 260 265

GCT AGG GGG TTG TAT TCT AAT GAA GCG GGG TTA GGG AGC TCA GCC ATT 868
85 90 95

Val Ser Leu Ala Thr Lys Lys Arg Gln Ala Ala Phe Asp Lys Ser Leu
100 105 110

Pro Tyr Ser Tyr Ser Gly Thr Ile Met Leu Val Arg Lys Asp Glu Asn
115 120 125

Arg Ile Lys Asp Ile Lys Asp Ile Lys Gly Leu Arg Ala Ala Asn Thr
130 135 140

Leu Ser Ser Thr Tyr Gly Glu Ile Ala Phe Lys Tyr Asp Ala Gln Ile
145 150 155 160

Val Ser Val Asp Ser Met Ala Gln Ala Leu Leu Leu Val Ala Gln Lys
165 170 175

Arg Ala Asp Leu Thr Leu Asn Ser Ser Leu Ala Ile Leu Asn Tyr Leu
180 185 190

Asn Thr His Lys Asp Asn Pro Phe Lys Ile Ala Trp Glu Ser Lys Glu
195 200 205

Lys Asp Gly Gly Ala Ser Phe Val Ile Asn Lys His Gln Glu Lys Ala
210 215 220

Leu Glu Leu Ile Asn Gln Ala Met Gln Arg Leu Ile Asn Lys Gly Val
225 230 235 240

Leu Lys Arg Leu Gly Glu Gln Phe Phe Gly Lys Asp Val Ser Gln Pro
245 250 255



(2) INFORMATION FOR SEQ ID NO: 907: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 23... 1372
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907:

CCTAGTTTAT TAAGGAGTTT TT ATG GAA ACG ATT GAT TCG GTG GTG CGT TTG 52
Met Glu Thr Ile Asp Ser Val Val Arg Leu
1 5 10

TTA TCT AAT TTG GTG TGG GGG ATT CCC ATG CAA ATT TTA TTA GTA GGC 100
Leu Ser Asn Leu Val Trp Gly Ile Pro Met Gln Ile Leu Leu Val Gly
15 20 25

ACC GGC TTG TTT TTA ACC TTT TAT CTT AGG GGT TTG CAA TTC AGT AAG 148
Thr Gly Leu Phe Leu Thr Phe Tyr Leu Arg Gly Leu Gln Phe Ser Lys
30 35 40

ATT TTT TAT GCG ATC AAA ATC CTT TTT GAC AAA GAG TCC CAA TCT AAG 196
Ile Phe Tyr Ala Ile Lys Ile Leu Phe Asp Lys Glu Ser Gln Ser Lys
140 145 150

TTT AAA TAC GAC GCT CAA ATC GTT TCG GTG GAT TCT ATG GCG CAA GCT 532
Phe Lys Tyr Asp Ala Gln Ile Val Ser Val Asp Ser Met Ala Gln Ala
155 160 165

TTG TTG CTG GTG GCG CAA AAA CGA GCC GAT TTG ACC TTA AAT AGT TCT 580
Leu Leu Leu Val Ala Gln Lys Arg Ala Asp Leu Thr Leu Asn Ser Ser
170 175 180 185

TTA GCG ATC TTA AAC TAC CTT AAC ACC CAC AAA GAT AAC CCC TTT AAA 628
Leu Ala Ile Leu Asn Tyr Leu Asn Thr His Lys Asp Asn Pro Phe Lys
190 195 200

ATC GCA TGG GAG TCC AAA GAA AAA GAT GGG GGC GCT TCC TTT GTT ATT 676
Ile Ala Trp Glu Ser Lys Glu Lys Asp Gly Gly Ala Ser Phe Val Ile
205 210 215

AAC AAG CAC CAA GAA AAA GCC TTA GAG CTT ATC AAC CAA GCG ATG CAA 724
Asn Lys His Gln Glu Lys Ala Leu Glu Leu Ile Asn Gln Ala Met Gln
220 225 230

AGA TTG ATC AAC AAA GGG GTT TTA AAA CGC TTA GGC GAA CAA TTT TTT 772
Arg Leu Ile Asn Lys Gly Val Leu Lys Arg Leu Gly Glu Gln Phe Phe
235 240 245

GGA AAA GAT GTC AGC CAG CCC TAATCTGTCT TTGTTTTTTG AATCTTTAGA TTTG 827
Gly Lys Asp Val Ser Gln Pro
250 255

AGCAAGGAGC GTTTGGAAT 846

(2) INFORMATION FOR SEQ ID NO: 906: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906: 

Met Lys Lys Val Leu Phe Leu Xaa Val Ile Ser Phe Phe Gly Gly Phe
1 5 10 15

Leu Asn Ala Ser Ser Leu Tyr Glu Lys Leu Ile Asn Lys Glu Thr Ile
20 25 30

Ser Val Gly Thr Glu Gly Ile Tyr Pro Pro Phe Thr Tyr His Asn Lys
35 40 45

Glu Gly Lys Leu Thr Gly Tyr Asp Val Glu Val Ala Arg Glu Leu Ala
50 55 60

Lys Glu Leu Gly Val Lys Ile Lys Phe His Glu Thr Ser Trp Asp Ile
65 70 75 80

Met Leu Thr Gly Leu Lys Ser Gly Arg Phe Asp Met Val Ala Asn Gln
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 26... 793
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905: 

AAAAAATTCA GGATTAAAAT ATAAA ATG AAA AAA GTT TTA TTT TTG TNG GTA 52
Met Lys Lys Val Leu Phe Leu Xaa Val
1 5

ATA AGC TTT TTT GGG GGT TTT TTG AAC GCT TCT AGC TTG TAT GAA AAA 100
Ile Ser Phe Phe Gly Gly Phe Leu Asn Ala Ser Ser Leu Tyr Glu Lys
10 15 20 25

CTG ATT AAT AAA GAA ACG ATC AGC GTT GGC ACA GAA GGC ATT TAC CCC 148
Leu Ile Asn Lys Glu Thr Ile Ser Val Gly Thr Glu Gly Ile Tyr Pro
30 35 40

CCT TTC ACT TAC CAC AAT AAA GAA GGC AAG CTC ACC GGC TAT GAT GTG 196
Pro Phe Thr Tyr His Asn Lys Glu Gly Lys Leu Thr Gly Tyr Asp Val
45 50 55

GAA GTG GCT AGG GAG TTG GCC AAA GAG CTT GGC GTG AAG ATC AAA TTC 244
Glu Val Ala Arg Glu Leu Ala Lys Glu Leu Gly Val Lys Ile Lys Phe
60 65 70

CAC GAA ACT TCA TGG GAT ATC ATG CTG ACA GGT TTG AAA TCG GGG CGT 292
His Glu Thr Ser Trp Asp Ile Met Leu Thr Gly Leu Lys Ser Gly Arg
75 80 85

TTT GAT ATG GTC GCT AAC CAA GTG AGT TTG GCG ACT AAA AAA CGC CAA 340
Phe Asp Met Val Ala Asn Gln Val Ser Leu Ala Thr Lys Lys Arg Gln
90 95 100 105

GCG GCT TTT GAT AAA AGC TTG CCT TAT AGC TAT TCA GGC ACG ATC ATG 388
Ala Ala Phe Asp Lys Ser Leu Pro Tyr Ser Tyr Ser Gly Thr Ile Met
110 115 120

CTG GTC AGG AAA GAT GAA AAC CGC ATT AAA GAT ATT AAA GAC ATC AAG 436
Leu Val Arg Lys Asp Glu Asn Arg Ile Lys Asp Ile Lys Asp Ile Lys
125 130 135

GGT TTG AGA GCG GCT AAC ACT TTA AGC TCC ACT TAT GGG GAA ATC GCT 484
Gly Leu Arg Ala Ala Asn Thr Leu Ser Ser Thr Tyr Gly Glu Ile Ala
TTA CAG CAT TTC CAC CTC ACT TTC ACT AAA ATC GCT CAA GGC GTG CCT 530
Leu Gln His Phe His Leu Thr Phe Thr Lys Ile Ala Gln Gly Val Pro
155 160 165 170

ACT GGA GTG AAT TTT GAA AAC ATT GAC TCA GTT TCG CTC TCA AGG GCG 578
Thr Gly Val Asn Phe Glu Asn Ile Asp Ser Val Ser Leu Ser Arg Ala
175 180 185

TTT AAT TCA AGG ATC AAA GCA TGAATTTAAA TTTTATGCCC CTATTGCATG CTTA 633
Phe Asn Ser Arg Ile Lys Ala
190

TTVACCATGCG AGCATTGATT TTCATTTCAA TTCTAGTG 671 
(2) INFORMATION FOR SEQ ID NO: 904: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904: 

Met Asn Thr Tyr Lys Asn Ser Leu Asn His Phe Leu Asn Leu Val Asp
1 5 10 15

Cys Leu Glu Lys Ile Pro Asn Val Gly Lys Lys Ser Ala Phe Lys Met
20 25 30

Ala Tyr His Leu Gly Leu Glu Asn Pro Tyr Leu Ala Leu Lys Ile Thr
35 40 45

His Ala Leu Glu Asn Ala Leu Glu Asn Leu Lys Thr Cys Ser Ser Cys
50 55 60

Asn Ala Leu Ser Glu Ser Glu Val Cys Glu Ile Cys Ser Asp Glu Ser
65 70 75 80

Arg Gln Asn Ser Gln Leu Cys Met Val Leu His Pro Arg Asp Val Phe
85 90 95

Ile Leu Glu Asp Leu Lys Asp Phe Leu Gly Arg Tyr Tyr Val Leu Asn
100 105 110

Ser Ile Glu Glu Val Asp Phe Asn Ala Leu Glu Lys Arg Leu Ile Glu
115 120 125

Glu Asn Ile Lys Glu Ile Ile Phe Ala Phe Pro Pro Thr Leu Ala Asn
130 135 140

Asp Ser Leu Met Leu Tyr Ile Glu Asp Lys Leu Gln His Phe His Leu
145 150 155 160

Thr Phe Thr Lys Ile Ala Gln Gly Val Pro Thr Gly Val Asn Phe Glu
165 170 175

Asn Ile Asp Ser Val Ser Leu Ser Arg Ala Phe Asn Ser Arg Ile Lys
180 185 190

Ala



(2) INFORMATION FOR SEQ ID NO: 905: 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 21... 599 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903:

AAGTTTGATA TACTAACAGA ATG AAT ACT TAT AAA AAC AGC TTG AAT CAC 50
Met Asn Thr Tyr Lys Asn Ser Leu Asn His
1 5 10

TTT TTA AAT TTA GTG GAT TGT TTA GAA AAA ATC CCC AAT GTG GGT AAA 98
Phe Leu Asn Leu Val Asp Cys Leu Glu Lys Ile Pro Asn Val Gly Lys
15 20 25

AAG TCC GCC TTT AAA ATG GCG TAT CAT TTG GGT TTA GAA AAC CCC TAT 146
Lys Ser Ala Phe Lys Met Ala Tyr His Leu Gly Leu Glu Asn Pro Tyr
30 35 40

CTG GCG CTA AAA ATC ACG CAC GCT TTA GAG AAC GCC CTA GAA AAC CTT 194
Leu Ala Leu Lys Ile Thr His Ala Leu Glu Asn Ala Leu Glu Asn Leu
45 50 55

AAA ACA TGT TCA TCT TGT AAC GCG CTC AGC GAG AGT GAG GTT TGT GAG 242
Lys Thr Cys Ser Ser Cys Asn Ala Leu Ser Glu Ser Glu Val Cys Glu
60 65 70

ATT TGC TCT GAT GAA AGC CGA CAA AAT TCT CAG CTT TGC ATG GTT TTA 290
Ile Cys Ser Asp Glu Ser Arg Gln Asn Ser Gln Leu Cys Met Val Leu
75 80 85 90

CAC CCA AGA GAT GTG TTT ATT TTA GAA GAT TTA AAG GAT TTT TTA GGG 338
His Pro Arg Asp Val Phe Ile Leu Glu Asp Leu Lys Asp Phe Leu Gly
95 100 105

CGC TAT TAT GTG TTA AAC TCC ATA GAA GAA GTG GAT TTT AAC GCC CTA 386
Arg Tyr Tyr Val Leu Asn Ser Ile Glu Glu Val Asp Phe Asn Ala Leu
110 115 120

GAA AAA CGC CTG ATT GAA GAA AAC ATT AAA GAA ATC ATT TTT GCT TTC 434
Glu Lys Arg Leu Ile Glu Glu Asn Ile Lys Glu Ile Ile Phe Ala Phe
125 130 135

CCT CCC ACT TTA GCT AAT GAT TCT CTA ATG CTT TAT ATT GAA GAC AAA 482
Pro Pro Thr Leu Ala Asn Asp Ser Leu Met Leu Tyr Ile Glu Asp Lys
140 145 150
GCA TAC ACQ CTC ATC AAA CCA AAG ACA ATA GTT GTC ATG CCC AAA GCC 198
Ala Tyr Thr Leu Ile Lys Pro Lys Thr Ile Val Val Met Pro Lys Ala
45 50 55

TGC CAA ATC GCT CCT AAA CCA GCT TTT GCA ATC ACC ATA CCC AAC AAA 246
Cys Gln Ile Ala Pro Lys Pro Ala Phe Ala Ile Thr Ile Pro Asn Lys
60 65 70

GGC ACT AGC GTA ACC CCT GAT AAT GAA GTG AAA GCA AAC AGC ATG AAC 294
Gly Thr Ser Val Thr Pro Asp Asn Glu Val Lys Ala Asn Ser Met Asn
75 80 85

AGA TTC AAT CCG GGT TTA GAT TTA GAA AAC ATC AAA CCA AAA AAC GCC 342
Arg Phe Asn Pro Gly Leu Asp Leu Glu Asn Ile Lys Pro Lys Asn Ala
90 95 100

GCA ATT TCA GCG ATA AAA AAC ACC CAT TTA TAC TGC ACT ACG GCT TGAAA 392
Ala Ile Ser Ala Ile Lys Asn Thr His Leu Tyr Cys Thr Thr Ala
105 110 115

ATTCATTAAA CCTAGTAACG CCCCAATAGT CGCTAATAA 431

(2) INFORMATION FOR SEQ ID NO: 902: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902:

Met Thr Thr Trp Asn Met Gly Leu Pro Lys Asn Lys Leu Met Ser Glu
1 5 10 15

His Thr Thr Thr Thr Ile Lys Ala Met Lys Ser Ile Leu Pro Ile Phe
20 25 30

Ala Arg Ser Phe Leu Val Leu Arg Ala Tyr Thr Leu Ile Lys Pro Lys
35 40 45

Thr Ile Val Val Met Pro Lys Ala Cys Gln Ile Ala Pro Lys Pro Ala
50 55 60

Phe Ala Ile Thr Ile Pro Asn Lys Gly Thr Ser Val Thr Pro Asp Asn
65 70 75 80

Glu Val Lys Ala Asn Ser Met Asn Arg Phe Asn Pro Gly Leu Asp Leu
85 90 95

Glu Asn Ile Lys Pro Lys Asn Ala Ala Ile Ser Ala Ile Lys Asn Thr
100 105 110

His Leu Tyr Cys Thr Thr Ala
115

(2) INFORMATION FOR SEQ ID NO: 903: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 671 base pairs 
Val Ile His Thr Ile Asp His Lys Asn Asp Asn Leu Leu Glu Tyr Gly
180 185 190

Leu Asn Tyr Gln Asn Leu Thr Thr Phe Asp Lys Ala Ile Pro Asn Ser
195 200 205

Glu Leu Val Lys Pro Gly Asp Ala Pro Asp Ala Cys Leu Arg Val Thr
210 215 220

Ser Pro Asn Asp Pro Asn Met Asn Gly Arg Cys Gln Arg Asn Gly Ala
225 230 235 240

Thr Ala Asn Val Ile Gly Val Tyr Ala Gln Ala Asn Tyr Thr Leu His
245 250 255

Pro Met Val Thr Leu Gly Ala Gly Thr Arg Tyr Asp Val Tyr Thr Leu
260 265 270

Val Asp Lys Asp Trp Gln Leu His Ile Thr Gln Gly Phe Ser Pro Ser
275 280 285

Ala Ala Leu Asn Val Ser Pro Leu Glu Asn Leu Asn Phe Arg Leu Ser
290 295 300

Tyr Ala Tyr Val Thr Arg Gly Pro Met Pro Gly Gly Leu Val Trp Met
305 310 315 320

Arg Gln Asp Asn Leu Xaa Leu Gln Pro Gln Phe Lys Ala Arg Asn Trp
325 330 335

Ala Lys Cys Gly Ile Leu Thr Pro Asn Thr Ala Val Ser Ile Leu Ile
340 345 350

Leu Glu Pro Pro Val Leu Ser Asn
355 360



(2) INFORMATION FOR SEQ ID NO: 901: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31... 387 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901: 

AGACTGAATA AAATCGCACT CGCTCCCGCA ATG ACA ACC TGG AAC ATG GGG CTG 54 

Met Thr Thr Trp Asn Met Gly Leu 
1 5 

CCC AAA AAC AAG TTA ATG AGC GAA CAC ACC ACC ACC ACA ATC AAA GCG 102
Pro Lys Asn Lys Leu Met Ser Glu His Thr Thr Thr Thr Ile Lys Ala
10 15 20 

ATG AAG AGC ATT TTA CCC ATA TTC GCT AGA TCG TTT TTA GTC TTA AGG 150
Met Lys Ser Ile Leu Pro Ile Phe Ala Arg Ser Phe Leu Val Leu Arg
25 30 35 40 
Leu Asn Val Ser Pro Leu Glu Asn Leu Asn Phe Arg Leu Ser Tyr Ala
295 300 305

TAT GTA ACC AGA GGC CCT ATG CCT GGA GGT TTG GTG TGG ATG CGT CAA 1016
Tyr Val Thr Arg Gly Pro Met Pro Gly Gly Leu Val Trp Met Arg Gln
310 315 320

GAT AAT TTG CGN CTA CAA CCG CAA TTT AAA GCC AGA AAT TGG GCA AAA 1064
Asp Asn Leu Xaa Leu Gln Pro Gln Phe Lys Ala Arg Asn Trp Ala Lys
325 330 335

TGT GGA ATT TTA ACA CCG AAT ACA GCA GTC AGT ATT TTG ATT TTA GAG 1112
Cys Gly Ile Leu Thr Pro Asn Thr Ala Val Ser Ile Leu Ile Leu Glu
340 345 350

CCG CCG GTT TTG TCC AAT TGATTTCTAA TTACATCAAT CAATTTTCTT CAACGCTT 1168
Pro Pro Val Leu Ser Asn
355 360

TTTGTAACCA ACTTG 1183

(2) INFORMATION FOR SEQ ID NO: 900: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900: 

Met Ala Lys Ile Asn Gly Tyr Leu Ser Glu Arg Asp Ile Leu Thr Leu
1 5 10 15

Ser Tyr Asn Met Thr Arg Asp Asn Ala Asn Arg Pro Leu Arg Ala Asn
20 25 30

Phe Thr Gly Thr Phe Leu Pro Tyr Ser Cys Gly Asp Phe Asn Ala Phe
35 40 45

Pro Asn Glu Lys Asn Pro Ser Asp Cys Leu Phe Glu Asn Asp Ala Ser
50 55 60

Leu Phe Lys Thr Tyr Ser Val Asn Leu Val His Asn Val Ser Leu Asn
65 70 75 80

Tyr Glu Arg Glu Gly Gly Ser Arg Phe Gly Asp Pro Lys Leu Lys Ile
85 90 95

Asn Gly Tyr Thr Ser Ile Arg Asn Val Gln Ile Asp Pro Leu Phe Lys
100 105 110

Pro Asn Asp Ile Ala Ala Ser Ile Pro Phe Thr Pro Asn Pro Lys Leu
115 120 125

Gly Glu Glu Asn Glu Cys Val Ala Gln Gly Gly Ile Tyr Asp Ala Leu
130 135 140

Lys Gln Thr Cys Ser Ile Thr Phe Lys Ser Leu Gly Gly Gly Ser Val
145 150 155 160

Val Ala Asn Lys Asn Leu Phe Ile Ile Asn Ser Gly Phe Asn Ala Asn
165 170 175
70 75 80

AGA GAA GGG GGG AGC CGT TTT GGT GAT CCT AAA TTA AAA ATC AAT GGC 344
Arg Glu Gly Gly Ser Arg Phe Gly Asp Pro Lys Leu Lys Ile Asn Gly
85 90 95

TAT ACA AGC ATT AGG AAT GTC CAA ATT GAT CCG CTT TTT AAG CCT AAC 392
Tyr Thr Ser Ile Arg Asn Val Gln Ile Asp Pro Leu Phe Lys Pro Asn
100 105 110

GAC ATA GCG GCT ACT ATT CCT TTC ACC CCA AAC CCA AAA CTT GGC GAA 440
Asp Ile Ala Ala Ser Ile Pro Phe Thr Pro Asn Pro Lys Leu Gly Glu
115 120 125 130

GAG AAT GAA TGC GTG GCG CAA GGG GGC ATT TAT GAC GCT CTT AAA CAA 488
Glu Asn Glu Cys Val Ala Gln Gly Gly Ile Tyr Asp Ala Leu Lys Gln
135 140 145

ACT TGC TCC ATC ACT TTT AAA AGC CTT GGA GGG GGT TCT GTG GTG GCT 536
Thr Cys Ser Ile Thr Phe Lys Ser Leu Gly Gly Gly Ser Val Val Ala
150 155 160

AAT AAA AAT TTA TTC ATC ATC AAT TCT GGG TTT AAT GCG AAC GTG ATC 584
Asn Lys Asn Leu Phe Ile Ile Asn Ser Gly Phe Asn Ala Asn Val Ile
165 170 175

CAC ACC ATA GAC CAT AAG AAT GAC AAC CTT TTG GAA TAC GGG TTG AAT 632
His Thr Ile Asp His Lys Asn Asp Asn Leu Leu Glu Tyr Gly Leu Asn
180 185 190

TAC CAA AAC TTA ACC ACT TTT GAT AAA GCG ATC CCT AAT AGC GAA TTA 680
Tyr Gln Asn Leu Thr Thr Phe Asp Lys Ala Ile Pro Asn Ser Glu Leu
195 200 205 210

GTC AAA CCC GGC GAT GCC CCT GAC GCA TGC TTA AGG GTT ACA AGC CCC 728
Val Lys Pro Gly Asp Ala Pro Asp Ala Cys Leu Arg Val Thr Ser Pro
215 220 225

AAT GAT CCC AAC ATG AAC GGG CGT TGC CAA CGA AAT GGC GCT ACG GCG 776
Asn Asp Pro Asn Met Asn Gly Arg Cys Gln Arg Asn Gly Ala Thr Ala
230 235 240

AAT GTG ATT GGG GTG TAT GCG CAA GCG AAT TAC ACC TTG CAT CCT ATG 824
Asn Val Ile Gly Val Tyr Ala Gln Ala Asn Tyr Thr Leu His Pro Met
245 250 255

GTA ACT TTA GGG GCA GGG ACT CGT TAT GAT GTC TAT ACT TTA GTG GAT 872
Val Thr Leu Gly Ala Gly Thr Arg Tyr Asp Val Tyr Thr Leu Val Asp
260 265 270

AAA GAC TGG CAA TTG CAC ATA ACC CAA GGG TTT AGC CCT AGC GCG GCT 920
Lys Asp Trp Gln Leu His Ile Thr Gln Gly Phe Ser Pro Ser Ala Ala
275 280 285 290

TTA AAT GTC TCG CCT TTA GAA AAT TTG AAT TTC AGG CTT TCT TAT GCG 968
260 265 270

Ile Ser Pro Lys Lys Arg Ile Tyr Val Asn Ala Lys Thr His Gln Lys
275 280 285

Asn Pro Phe Phe Ile His Gln Cys Pro Asn Tyr Asp Gly Ser Ile Leu
290 295 300

Ala Leu Phe Pro Tyr Asn Gln Asn Leu Asp Leu Gln Asn Leu Cys Asp
305 310 315 320

Lys Leu Asn Ala Ile Asn Trp Gln Glu Leu Gly Phe Val Cys Gly Gly
325 330 335

Arg Phe Leu Phe Ser Gln Arg Ser Leu Glu Asn Ala Leu Leu Pro Lys
340 345 350

Asp Phe Leu Asn Leu Gly
355



(2) INFORMATION FOR SEQ ID NO: 899:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1183 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...1130
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899:



CCTAAAGCGG AGAATAAAGT TACAGAAGTC CTAGCGAGCA AAACAATGTG ATG GCT 56
Met Ala
1

AAG ATC AAT GGT TAT TTG AGC GAA AGG GAT ATT TTA ACG CTC AGT TAT 104
Lys Ile Asn Gly Tyr Leu Ser Glu Arg Asp Ile Leu Thr Leu Ser Tyr
5 10 15

AAC ATG ACC AGA GAC AAC GCT AAC CGC CCT TTA AGA GCG AAT TTT ACA 152
Asn Met Thr Arg Asp Asn Ala Asn Arg Pro Leu Arg Ala Asn Phe Thr
20 25 30

GGC ACT TTT TTA CCC TAT TCT TGC GGT GAT TTT AAC GCT TTC CCT AAC 200
Gly Thr Phe Leu Pro Tyr Ser Cys Gly Asp Phe Asn Ala Phe Pro Asn
35 40 45 50

GAG AAA AAC CCT AGC GAT TGT TTG TTT GAA AAC GAC GCT AGT TTG TTT 248
Glu Lys Asn Pro Ser Asp Cys Leu Phe Glu Asn Asp Ala Ser Leu Phe
55 60 65

AAA ACT TAT AGC GTC AAT TTA GTG CAT AAT GTG AGT TTG AAT TAT GAA 296
Lys Thr Tyr Ser Val Asn Leu Val His Asn Val Ser Leu Asn Tyr Glu
TGG CAA GAA TTA GGC TTT GTG TGC GGC GGG CGT TTT TTG TTT TCG CAG 1063
Trp Gln Glu Leu Gly Phe Val Cys Gly Gly Arg Phe Leu Phe Ser Gln
330 335 340

GGC TCT TTA GAA AAC GCG CTT TTG CCT AAA GAC TTT TTA AAT CTA GGA T 1112
Arg Ser Leu Glu Asn Ala Leu Leu Pro Lys Asp Phe Leu Asn Leu Gly
345 350 355

AAAACTTGTT AGAAACTTTG CAATTAAACC CTGAGCAGC 1151

(2) INFORMATION FOR SEQ ID NO: 898:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 358 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898:

Met Leu Thr Leu Lys His Asn His Gly Ser Val Leu Glu Pro Ser Ala
1 5 10 15

Gly Asp Gly Ser Phe Leu Lys Arg Leu Lys Lys Ala Val Arg Ile Glu
20 25 30

Ile Asp Pro Lys Ile Cys Pro Lys Asn Ala Leu Cys Met Asp Phe Phe
35 40 45

Asp Tyr Pro Leu Glu Asn Gln Phe Asp Thr Ile Ile Gly Asn Pro Pro
50 55 60

Tyr Val Lys His Lys Asp Ile Ala Pro Ser Thr Lys Glu Lys Leu His
65 70 75 80

Tyr Ser Leu Phe Asp Glu Arg Ser Asn Leu Tyr Leu Phe Phe Ile Glu
85 90 95

Lys Ala Ile Lys His Leu Lys Pro Lys Gly Glu Leu Ile Phe Ile Thr
100 105 110

Pro Arg Asp Phe Leu Lys Ser Thr Ser Ser Val Lys Leu Asn Glu Trp
115 120 125

Ile Tyr Lys Glu Gly Thr Ile Thr His Phe Phe Glu Leu Gly Asp Gln
130 135 140

Lys Val Phe Pro Asn Ala Met Pro Asn Cys Val Ile Phe Arg Phe Cys
145 150 155 160

Lys Gly Asn Phe Ser Arg Ile Thr Asn Asp Gly Leu Gln Phe Leu Cys
165 170 175

Lys Lys Gly Ile Leu Tyr Phe Leu Asn Gln Ser Tyr Thr Gln Lys Leu
180 185 190

Ser Glu Val Phe Lys Val Lys Val Gly Ala Val Ser Gly Cys Asp Lys
195 200 205

Ile Phe Lys Asn Glu Lys Tyr Gly Asn Leu Glu Phe Val Thr Ser Ile
210 215 220

Thr Lys Arg Thr Asn Ala Leu Glu Lys Met Val Phe Val Asn Glu Pro
225 230 235 240

Asn Asp Tyr Leu Leu Gln His Lys Asp Ser Leu Met Gln Arg Lys Ile
245 250 255

Lys Lys Phe Asn Glu Asn Asn Trp Phe Glu Trp Gly Arg Met His His
AAA CCT AAA GGC GAA TTG ATT TTC ATC ACC CCA AGG GAT TTT TTA AAA 391
Lys Pro Lys Gly Glu Leu Ile Phe Ile Thr Pro Arg Asp Phe Leu Lys
105 110 115

TCC ACT TCT AGC GTG AAA TTA AAC GAA TGG ATT TAT AAA GAA GGC ACG 439
Ser Thr Ser Ser Val Lys Leu Asn Glu Trp Ile Tyr Lys Glu Gly Thr
120 125 130

ATA ACG CAT TTT TTT GAA CTG GGC GAT CAA AAG GTT TTC CCA AAC GCC 487
Ile Thr His Phe Phe Glu Leu Gly Asp Gln Lys Val Phe Pro Asn Ala
135 140 145 150

ATG CCT AAT TGC GTG ATT TTT CGT TTT TGT AAG GGT AAT TTC AGT AGA 535
Met Pro Asn Cys Val Ile Phe Arg Phe Cys Lys Gly Asn Phe Ser Arg
155 160 165

ATC ACC AAC GAT GGT TTG CAA TTT TTG TGC AAA AAA GGC ATT TTG TAT 583
Ile Thr Asn Asp Gly Leu Gln Phe Leu Cys Lys Lys Gly Ile Leu Tyr
170 175 180

TTC CTC AAC CAA TCT TAG ACG CAA AAA TTA AGC GAG GTT TTT AAG GTT 631
Phe Leu Asn Gln Ser Tyr Thr Gln Lys Leu Ser Glu Val Phe Lys Val
185 190 195

AAA GTG GGG GCA GTG AGC GGG TGC GAT AAG ATT TTT AAA AAT GAA AAA 679
Lys Val Gly Ala Val Ser Gly Cys Asp Lys Ile Phe Lys Asn Glu Lys
200 205 210

TAG GGG AAT TTA GAA TTT GTC ACC TCA ATC ACG AAA AGA ACC AAT GCT 727
Tyr Gly Asn Leu Glu Phe Val Thr Ser Ile Thr Lys Arg Thr Asn Ala
215 220 225 230

TTA GAA AAA ATG GTT TTT GTC AAT GAG CCT AAT GAT TAT TTA CTC CAG 775
Leu Glu Lys Met Val Phe Val Asn Glu Pro Asn Asp Tyr Leu Leu Gln
235 240 245

CAT AAA GAC AGC TTA ATG CAA AGA AAG ATT AAA AAA TTC AAT GAA AAT 823
His Lys Asp Ser Leu Met Gln Arg Lys Ile Lys Lys Phe Asn Glu Asn
250 255 260

AAC TGG TTT GAG TGG GGG AGA ATG CAT CAC ATA TCC CCT AAA AAA CGC 871
Asn Trp Phe Glu Trp Gly Arg Met His His Ile Ser Pro Lys Lys Arg
265 270 275

ATT TAT GTC AAC GCC AAA ACG CAC CAA AAA AAC CCC TTT TTT ATC CAC 919
Ile Tyr Val Asn Ala Lys Thr His Gln Lys Asn Pro Phe Phe Ile His
280 285 290

CAA TGC CCT AAT TAT GAC GGC TCT ATT TTA GCG CTA TTC CCT TAT AAC 967
Gln Cys Pro Asn Tyr Asp Gly Ser Ile Leu Ala Leu Phe Pro Tyr Asn
295 300 305 310

CAA AAC CTG GAC TTA CAA AAT CTC TGC GAC AAA CTC AAC GCT ATC AAC 1015
Gln Asn Leu Asp Leu Gln Asn Leu Cys Asp Lys Leu Asn Ala Ile Asn
315 320 325
Glu Ala Leu Thr Ile Ala Gln Lys Leu Asp Pro Gln Leu Glu Ile Asp
65 70 75 80

Gly Pro Leu Gln Phe Asp Ala Ser Ile Asp Lys Ser Val Ala Lys Lys
85 90 95

Lys Cys Leu Thr Ala Lys Trp Leu Gly Lys Leu Ala Phe Leu Phe Ser
100 105 110

Arg Ile



(2) INFORMATION FOR SEQ ID NO: 897:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1151 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 38...1111
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897:

AGGGCAGGTT TTCACCCCTA AAAAGATAGT GGATTTC ATG CTC ACT CTC AAA CAC 55
Met Leu Thr Leu Lys His
1 5

AAT CAT GGG AGT GTT TTA GAA CCG AGT GCT GGC GAT GGG AGT TTT TTA 103
Asn His Gly Ser Val Leu Glu Pro Ser Ala Gly Asp Gly Ser Phe Leu
10 15 20

AAG CGC TTA AAA AAG GCC GTA AGG ATT GAA ATC GAT CCT AAA ATC TGC 151
Lys Arg Leu Lys Lys Ala Val Arg Ile Glu Ile Asp Pro Lys Ile Cys
25 30 35

CCT AAA AAT GCC CTT TGC ATG GAC TTT TTT GAC TAC CCT TTA GAA AAT 199
Pro Lys Asn Ala Leu Cys Met Asp Phe Phe Asp Tyr Pro Leu Glu Asn
40 45 50

CAA TTT GAC ACC ATT ATT GGT AAC CCG CCC TAT GTC AAG CAC AAG GAT 247
Gln Phe Asp Thr Ile Ile Gly Asn Pro Pro Tyr Val Lys His Lys Asp
55 60 65 70

ATT GCG CCA AGC ACC AAA GAA AAA CTC CAT TAC AGC CTT TTT GAT GAA 295
Ile Ala Pro Ser Thr Lys Glu Lys Leu His Tyr Ser Leu Phe Asp Glu
75 80 85

AGG AGT AAT CTC TAC TTG TTT TTC ATA GAA AAA GCG ATC AAG CAT TTA 343
Arg Ser Asn Leu Tyr Leu Phe Phe Ile Glu Lys Ala Ile Lys His Leu
90 95 100
TGAATCACAG CTGAGACCAT TAGACCCGCT TTACAAATCA TCAAAACTAA ACCCGGCGTG 60

AGCCTGGTTT CAAGCGTGTT TTTA ATG TGT TTA GAG ACT CAA GTG CTA GTC 111
Met Cys Leu Asp Thr Gln Val Leu Val
1 5

TTT GGG GAT TGC GCG ATT ATC CCT AAC CCT AGC CCT AAA GAA TTA GCC 159
Phe Gly Asp Cys Ala Ile Ile Pro Asn Pro Ser Pro Lys Glu Leu Ala
10 15 20 25

GAG ATC GCT ACC ACT TCC GCA CAA ACC GCC AAG CAA TTC AAT ATT GCG 207
Glu Ile Ala Thr Thr Ser Ala Gln Thr Ala Lys Gln Phe Asn Ile Ala
30 35 40

CCT AAA GTG GCC TTG CTT TCT TAT GCG ACA GGC GAT TCC GCT CAA GGC 255
Pro Lys Val Ala Leu Leu Ser Tyr Ala Thr Gly Asp Ser Ala Gln Gly
45 50 55

GAA ATG ATA GAC AAA ATC AAC GAA GCT TTA ACA ATC GCT CAA AAG TTG 303
Glu Met Ile Asp Lys Ile Asn Glu Ala Leu Thr Ile Ala Gln Lys Leu
60 65 70

GAT CCC CAA TTA GAA ATT GAT GGC CCC TTA CAA TTT GAC GCT TCC ATT 351
Asp Pro Gln Leu Glu Ile Asp Gly Pro Leu Gln Phe Asp Ala Ser Ile
75 80 85

GAT AAA AGC GTA GCC AAG AAA AAA TGC CTA ACA GCC AAG TGG CTG GGC 399
Asp Lys Ser Val Ala Lys Lys Lys Cys Leu Thr Ala Lys Trp Leu Gly
90 95 100 105

AAG CTA GCG TTT TTA TTT TCC CGG ATT TAAACGCTGG GAACATCGCT TATAAAG 453
Lys Leu Ala Phe Leu Phe Ser Arg Ile
110

CGGTGCAACG GAGCGCTAAA GCCGTGGCGA TAG 486

(2) INFORMATION FOR SEQ ID NO: 896:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 114 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896:

Met Cys Leu Asp Thr Gln Val Leu Val Phe Gly Asp Cys Ala Ile Ile
1 5 10 15

Pro Asn Pro Ser Pro Lys Glu Leu Ala Glu Ile Ala Thr Thr Ser Ala
20 25 30

Gln Thr Ala Lys Gln Phe Asn Ile Ala Pro Lys Val Ala Leu Leu Ser
35 40 45

Tyr Ala Thr Gly Asp Ser Ala Gln Gly Glu Met Ile Asp Lys Ile Asn
50 55 60
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 208 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894:

Met Pro Gly Pro Lys Pro Gly Ala Leu Pro Leu Gly Asp Thr Pro Lys
1 5 10 15

Thr Lys Glu Ser Ile Ile Gln Lys Leu Phe Lys Lys Val Lys Leu Lys
20 25 30

Arg Tyr Asn Phe Ile Met Glu Asn Gly Phe Asp Pro Ile Ile Tyr Lys
35 40 45

Arg Tyr Leu Lys Lys Lys Glu Thr Phe Leu Leu Phe Lys Lys Ile Ala
50 55 60

Gln Ala Ser Ala Phe Lys Asn Leu Lys Leu Gln Leu Lys Arg Arg Glu
65 70 75 80

Ile Ile Asn Arg Tyr Val Ser Gln Ala Leu Gly Asp Leu Lys Lys Gly
85 90 95

Phe Arg Tyr Ala Lys Val Glu His Gln Ile Leu Lys Ile Tyr Phe Thr
100 105 110

His Pro Ser Tyr Leu Lys Ala Phe Lys Ile Glu Glu Ala Tyr Tyr Thr
115 120 125

Asn His Leu Lys Ala His Leu Lys Glu Thr Gln Lys Thr Leu Lys Ala
130 135 140

Leu Asp Tyr Pro Phe Asp Phe Lys Thr Ile Gln Ala Ser Val Lys Lys
145 150 155 160

Arg Ala Tyr Gln Lys Pro Val Val Lys Lys Glu Lys Pro Pro Lys Ser
165 170 175

Val Asn Val Asn Cys Glu Gly Leu Ser Asp Phe Thr Lys Lys Gln Phe
180 185 190

Leu Lys Leu Lys Arg Ala Cys Asn Asp Asn Thr Leu Arg Thr Pro Pro
195 200 205

(2) INFORMATION FOR SEQ ID NO: 895:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 486 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 85...426
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895:
GGT GCC TTA CCG CTT GGC GAC ACC CCA AAA ACT AAA GAA AGC ATT ATA 104
Gly Ala Leu Pro Leu Gly Asp Thr Pro Lys Thr Lys Glu Ser Ile Ile
10 15 20

CAA AAG CTT TTT AAA AAA GTC AAG CTA AAA CGC TAT AAT TTT ATC ATG 152
Gln Lys Leu Phe Lys Lys Val Lys Leu Lys Arg Tyr Asn Phe Ile Met
25 30 35

GAA AAT GGA TTT GAC CCC ATC ATT TAT AAA CGC TAT TTG AAA AAG AAA 200
Glu Asn Gly Phe Asp Pro Ile Ile Tyr Lys Arg Tyr Leu Lys Lys Lys
40 45 50

GAA ACC TTT TTG CTG TTT AAA AAA ATC GCT CAA GCG TCT GCG TTT AAA 248
Glu Thr Phe Leu Leu Phe Lys Lys Ile Ala Gln Ala Ser Ala Phe Lys
55 60 65 70

AAT TTA AAA CTC CAA CTC AAA CGA AGA GAA ATA ATC AAC CGC TAT GTT 296
Asn Leu Lys Leu Gln Leu Lys Arg Arg Glu Ile Ile Asn Arg Tyr Val
75 80 85

TCT CAA GCT TTG GGG GAT TTA AAA AAA GGG TTT AGA TAC GCT AAA GTA 344
Ser Gln Ala Leu Gly Asp Leu Lys Lys Gly Phe Arg Tyr Ala Lys Val
90 95 100

GAA CAC CAA ATC CTA AAA ATC TAT TTC ACG CAC CCT AGC TAT TTG AAA 392
Glu His Gln Ile Leu Lys Ile Tyr Phe Thr His Pro Ser Tyr Leu Lys
105 110 115

GCC TTT AAA ATA GAA GAA GCC TAT TAC ACC AAC CAC CTG AAA GCC CAT 440
Ala Phe Lys Ile Glu Glu Ala Tyr Tyr Thr Asn His Leu Lys Ala His
120 125 130

TTA AAA GAA ACG CAA AAA ACC CTA AAA GCC CTA GAT TAC CCC TTT GAT 488
Leu Lys Glu Thr Gln Lys Thr Leu Lys Ala Leu Asp Tyr Pro Phe Asp
135 140 145 150

TTT AAG ACT ATC CAA GCG AGC GTG AAA AAA AGG GCT TAT CAA AAA CCA 536
Phe Lys Thr Ile Gln Ala Ser Val Lys Lys Arg Ala Tyr Gln Lys Pro
155 160 165

GTT GTT AAA AAA GAA AAA CCC CCT AAA AGC GTG AAT GTC AAT TGC GAA 584
Val Val Lys Lys Glu Lys Pro Pro Lys Ser Val Asn Val Asn Cys Glu
170 175 180

GGT TTG AGC GAT TTC ACT AAA AAG CAA TTT TTA AAG CTC AAA CGC GCT 632
Gly Leu Ser Asp Phe Thr Lys Lys Gln Phe Leu Lys Leu Lys Arg Ala
185 190 195

TGT AAC GAT AAT ACG CTG CGC ACG CCC CCT TGAGAGCTGA CCATGCAACT GCC 685
Cys Asn Asp Asn Thr Leu Arg Thr Pro Pro
200 205

GATCGGGTTT TGCGGGGTGC AAGTTT 711

(2) INFORMATION FOR SEQ ID NO: 894:
Ile Thr Gln Ala Lys Ala Ile
105

ACGATTAATC TGA 410
(2) INFORMATION FOR SEQ ID NO: 892:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 109 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892:

Met Leu Asp Ser Thr Thr Ala Ile Ala Ile Ala Thr Pro Ser Ile Ala
1 5 10 15

Pro Leu Glu Ile Pro Ser Lys Lys Gly Ser Thr Lys Gly Phe Leu Lys
20 25 30

Ile Val Cys Ile Thr Thr Pro Leu Pro Asp Lys Leu Ala Pro Thr Arg
35 40 45

Ser Ala Lys Ile Thr Arg Gly Ser Arg Ile Ser Lys Ile Ile Ile Leu
50 55 60

Lys Glu Leu Ser Ser Ser Leu Cys Lys Lys Trp Phe Phe Thr Leu Arg
65 70 75 80

His Thr Ser Cys His Ser Ser Lys Leu Lys Asp Ser Pro Pro Asn Asn
85 90 95

Ser Thr Thr Thr Ala Lys Ile Thr Gln Ala Lys Ala Ile
100 105

(2) INFORMATION FOR SEQ ID NO: 893:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 711 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...662
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893:

AGGT^AAAATG GCTGGGGTGC AAGGATTCGA ACCTCGGA ATG CCA GGA CCA AAA CCT 56
Met Pro Gly Pro Lys Pro
1 5
Leu Ala Asp Val Val Ala Lys Thr Ile Thr Pro Tyr Asp Leu Pro Val
290 295 300

Gly Ile Ala Thr Ser Val Leu Gly Ala Pro Phe Phe Leu Trp Leu Leu
305 310 315 320

Phe Arg Thr Arg Gly Val
325

(2) INFORMATION FOR SEQ ID NO: 891:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 410 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 37...363
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891:

CTAGCTAAAA TCGCCCCAAA AAACGCCAAA ATCGCA ATG TTA GAC TCC ACT ACC 54
Met Leu Asp Ser Thr Thr
1 5

GCT ATC GCC ATC GCC ACG CCT AGC ATC GCC CCG CTA GAA ATC CCT AGT 102
Ala Ile Ala Ile Ala Thr Pro Ser Ile Ala Pro Leu Glu Ile Pro Ser
10 15 20

AAA AAG GGA TCC ACT AAG GGG TTT CTA AAA ATC GTT TGC ATC ACC ACC 150
Lys Lys Gly Ser Thr Lys Gly Phe Leu Lys Ile Val Cys Ile Thr Thr
25 30 35

CCA CTC CCA GAC AAA CTC GCT CCC ACC AGG AGC GCT AAA ATC ACT CGT 198
Pro Leu Pro Asp Lys Leu Ala Pro Thr Arg Ser Ala Lys Ile Thr Arg
40 45 50

GGT AGT CGT ATT TCT AAA ATA ATA ATA CTT AAA GAG CTC AGT TCT TCA 246
Gly Ser Arg Ile Ser Lys Ile Ile Ile Leu Lys Glu Leu Ser Ser Ser
55 60 65 70

TTG TGC AAA AAG TGG TTT TTC ACA TTA AGG CAC ACT TCT TGC CAT TCT 294
Leu Cys Lys Lys Trp Phe Phe Thr Leu Arg His Thr Ser Cys His Ser
75 80 85

TCC AAG CTC AAG GAC TCC CCT CCA AAC AAC AGC ACC ACC ACC GCT AAA 342
Ser Lys Leu Lys Asp Ser Pro Pro Asn Asn Ser Thr Thr Thr Ala Lys
90 95 100

ATC ACG CAA GCT AAG GCG ATA TGATAGGTTT TAAGCATCAC GCTTCCTTTT AAGA 397
TTA GGA GCG CCT TTC TTC TTG TGG CTT TTG TTT AGA ACT AGG GGG GTG T 1017
Leu Gly Ala Pro Phe Phe Leu Trp Leu Leu Phe Arg Thr Arg Gly Val
315 320 325

GATGGTTTTA GAAGTTAAAA ACCTGTCCTT TAAATATTCT CAAAAACTCA TTTTGGATAA 1077
ATT 1080

(2) INFORMATION FOR SEQ ID NO: 890:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 326 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890:

Met Leu Lys Thr Tyr His Ile Ala Leu Ala Cys Val Ile Leu Ala Val
1 5 10 15

Val Val Leu Leu Phe Gly Gly Glu Ser Leu Ser Leu Glu Glu Trp Gln
20 25 30

Glu Val Cys Leu Asn Val Lys Asn His Phe Leu His Asn Glu Glu Leu
35 40 45

Ser Ser Leu Ser Ile Ile Ile Leu Glu Ile Arg Leu Pro Arg Val Ile
50 55 60

Leu Ala Leu Leu Val Gly Ala Ser Leu Ser Gly Ser Gly Val Val Met
65 70 75 80

Gln Thr Ile Phe Arg Asn Pro Leu Val Asp Pro Phe Leu Leu Gly Ile
85 90 95

Ser Ser Gly Ala Met Leu Gly Val Ala Met Ala Ile Ala Val Val Glu
100 105 110

Ser Asn Ile Ala Ile Leu Ala Phe Phe Gly Ala Ile Leu Ala Ser Leu
115 120 125

Ala Val Leu Ala Met Asn Arg Val Leu Gly Asn Ser Val Leu Ser Leu
130 135 140

Val Leu Ser Gly Val Val Leu Ser Ala Phe Leu Ser Ala Leu Ala Gly
145 150 155 160

Ala Ile Lys Phe Phe Val Ile Pro Gln Lys Ala Gln Ala Ile Val Val
165 170 175

Trp Leu Leu Gly Ser Leu Ser Leu Ser Ser Tyr Lys Asp Cys Leu Ile
180 185 190

Ala Phe Ile Gly Leu Ser Leu Gly Phe Ile Pro Leu Phe Leu Leu Arg
195 200 205

Trp Arg Ile Asn Leu Leu Ser Leu Ser Asp Ala Gln Ser Leu Ser Leu
210 215 220

Gly Ile Asn Pro Val Leu Leu Arg Ser Leu Cys Leu Val Cys Val Ser
225 230 235 240

Val Ala Ser Ala Leu Ala Val Ser Val Ser Gly Thr Ile Gly Trp Ile
245 250 255

Gly Leu Val Ile Pro His Val Ala Arg Leu Phe Phe Gly Ala Asn Leu
260 265 270

Gln Lys Leu Leu Leu Ser Ser Leu Leu Met Gly Ala Phe Phe Leu Leu
275 280 285
CCC TTA GTG GAT CCC TTT TTA ^TA GGG ATT TCT AGC GGG GCG ATG CTA 344
Pro Leu Val Asp Pro Phe Leu Leu Gly Ile Ser Ser Gly Ala Met Leu
90 95 100

GGC GTG GCG ATG GCG ATA GCG GTA GTG GAG TCT AAC ATT GCG ATT TTG 392
Gly Val Ala Met Ala Ile Ala Val Val Glu Ser Asn Ile Ala Ile Leu
105 110 115

GCG TTT TTT GGG GCG ATT TTA GCT AGC CTT GCT GTT TTG GCG ATG AAT 440
Ala Phe Phe Gly Ala Ile Leu Ala Ser Leu Ala Val Leu Ala Met Asn
120 125 130

AGG GTT TTG GGT AAT TCC GTC CTT TCG TTG GTG CTT TCA GGG GTG GTG 488
Arg Val Leu Gly Asn Ser Val Leu Ser Leu Val Leu Ser Gly Val Val
135 140 145 150

TTG AGC GCG TTT TTA AGC GCC TTA GCC GGA GCG ATA AAA TTC TTT GTG 536
Leu Ser Ala Phe Leu Ser Ala Leu Ala Gly Ala Ile Lys Phe Phe Val
155 160 165

ATC CCC CAA AAA GCG C7^ GCG ATT GTC GTG TGG CTT TTA GGG AGC TTG 584
Ile Pro Gln Lys Ala Gln Ala Ile Val Val Trp Leu Leu Gly Ser Leu
170 175 180

TCG TTG AGC AGT TAT AAG GAT TGC TTG ATC GCT TTC ATA GGG CTA TCT 632
Ser Leu Ser Ser Tyr Lys Asp Cys Leu Ile Ala Phe Ile Gly Leu Ser
185 190 195

TTA GGC TTT ATC CCG CTT TTT TTG TTA AGG TGG CGC ATC AAT TTA TTG 680
Leu Gly Phe Ile Pro Leu Phe Leu Leu Arg Trp Arg Ile Asn Leu Leu
200 205 210

AGC TTG AGC GAT GCG CAA AGT TTG AGC TTG GGG ATT AAC CCG GTG CTG 728
Ser Leu Ser Asp Ala Gln Ser Leu Ser Leu Gly Ile Asn Pro Val Leu
215 220 225 230

TTG CGA TCG CTT TGT TTG GTG TGC GTG AGC GTT GCG AGC GCT TTA GCG 776
Leu Arg Ser Leu Cys Leu Val Cys Val Ser Val Ala Ser Ala Leu Ala
235 240 245

GTG AGC GTG TCC GGC ACG ATT GGC TGG ATT GGG TTA GTC ATT CCG CAT 824
Val Ser Val Ser Gly Thr Ile Gly Trp Ile Gly Leu Val Ile Pro His
250 255 260

GTG GCT AGG TTG TTT TTT GGG GCG AAT TTG CAA AAA CTG CTT TTA AGT 872
Val Ala Arg Leu Phe Phe Gly Ala Asn Leu Gln Lys Leu Leu Leu Ser
265 270 275

TCT TTG TTA ATG GGA GCG TTT TTC TTG CTT CTA GCG GAT GTG GTG GCT 920
Ser Leu Leu Met Gly Ala Phe Phe Leu Leu Leu Ala Asp Val Val Ala
280 285 290

AAA ACC ATT ACC CCC TAT GAT TTA CCG GTA GGC ATT GCG ACA AGC GTT 968
Lys Thr Ile Thr Pro Tyr Asp Leu Pro Val Gly Ile Ala Thr Ser Val
295 300 305 310
65 70 75 80

Gly Leu Ile Phe Cys Ile Val Leu Phe Met Trp Cys Leu Leu Val Ala
85 90 95

Leu Asn Pro Leu Trp Leu Ala Lys Leu Leu Ala Tyr Gly Phe Asp Glu
100 105 110

Glu Thr Leu Lys Leu Cys Ala Pro Ile Val Ala Ile Asn Phe Trp Xaa
115 120 125

Leu Leu Leu Val Phe Ile Thr Thr Phe Leu Gly Ala Leu Leu Gln Xaa
130 135 140

Gln Thr Gln Leu Phe Cys Gln Arg Leu
145 150

(2) INFORMATION FOR SEQ ID NO: 889:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1080 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 39...1016
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889:

TTCTTTGTTC AGATTAATCG TTCTTAAAAG GAAGCGTG ATG CTT AAA ACC TAT CAT 56
Met Leu Lys Thr Tyr His
1 5

ATC GCC TTA GCT TGC GTG ATT TTA GCG GTG GTG GTG CTG TTG TTT GGA 104
Ile Ala Leu Ala Cys Val Ile Leu Ala Val Val Val Leu Leu Phe Gly
10 15 20

GGG GAG TCC TTG AGC TTG GAA GAA TGG CAA GAA GTG TGC CTT AAT GTG 152
Gly Glu Ser Leu Ser Leu Glu Glu Trp Gln Glu Val Cys Leu Asn Val
25 30 35

AAA AAC CAC TTT TTG CAC AAT GAA GAA CTG AGC TCT TTA AGT ATT ATT 200
Lys Asn His Phe Leu His Asn Glu Glu Leu Ser Ser Leu Ser Ile Ile
40 45 50

ATT TTA GAA ATA CGA CTA CCA CGA GTG ATT TTA GCG CTC CTG GTG GGA 248
Ile Leu Glu Ile Arg Leu Pro Arg Val Ile Leu Ala Leu Leu Val Gly
55 60 65 70

GCG AGT TTG TCT GGG AGT GGG GTG GTG ATG CAA ACG ATT TTT AGA AAC 296
Ala Ser Leu Ser Gly Ser Gly Val Val Met Gln Thr Ile Phe Arg Asn
75 80 85
ATG ATG GCT AAT ATT CTA GGG GCT GGG GTG TAT AGC GAT ATT TTC TTT 150
Met Met Ala Asn Ile Leu Gly Ala Gly Val Tyr Ser Asp Ile Phe Phe
30 35 40

GTG GCT TTC AAA TTG CCT AAT TTA TTC AGG CGT ATT TTT GCG GAG GGC 198
Val Ala Phe Lys Leu Pro Asn Leu Phe Arg Arg Ile Phe Ala Glu Gly
45 50 55

TCT TTT TCA CAA AGC TTT TTA CCG AGC TTC ATA CGA AGT TCT ATT AAA 246
Ser Phe Ser Gln Ser Phe Leu Pro Ser Phe Ile Arg Ser Ser Ile Lys
60 65 70

GGG AGC TTT GCG AGT TTG GTA GGG CTT ATT TTT TGT ATC GTT TTA TTC 294
Gly Ser Phe Ala Ser Leu Val Gly Leu Ile Phe Cys Ile Val Leu Phe
75 80 85

ATG TGG TGC TTA TTG GTG GCG TTA AAT CCC TTA TGG CTA GCT AAA CTC 342
Met Trp Cys Leu Leu Val Ala Leu Asn Pro Leu Trp Leu Ala Lys Leu
90 95 100 105

CTA GCT TAC GGC TTT GAT GAA GAA ACG CTC AAA TTA TGC GCC CCT ATT 390
Leu Ala Tyr Gly Phe Asp Glu Glu Thr Leu Lys Leu Cys Ala Pro Ile
110 115 120

GTA GCG ATC AAT TTT TGG NAT CTT TTA TTG GTG TTT ATC ACC ACC TTT 438
Val Ala Ile Asn Phe Trp Xaa Leu Leu Leu Val Phe Ile Thr Thr Phe
125 130 135

TTA GGC GCG CTT TTA CAA NTA CAA ACA CAG CTT TTT TGC CAG CGC TTA T 487
Leu Gly Ala Leu Leu Gln Xaa Gln Thr Gln Leu Phe Cys Gln Arg Leu
140 145 150

AGCGCAAGCT TACTCAATGT ATGCATGATT TTAGCCCTTT TGATTTCTAA AGA 540

(2) INFORMATION FOR SEQ ID NO: 888:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 153 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888:

Met Leu Lys Lys Ile Phe Leu Thr Asn Ser Leu Gly Ile Leu Cys Ser
1 5 10 15

Arg Ile Phe Gly Phe Leu Arg Asp Leu Met Met Ala Asn Ile Leu Gly
20 25 30

Ala Gly Val Tyr Ser Asp Ile Phe Phe Val Ala Phe Lys Leu Pro Asn
35 40 45

Leu Phe Arg Arg Ile Phe Ala Glu Gly Ser Phe Ser Gln Ser Phe Leu
50 55 60

Pro Ser Phe Ile Arg Ser Ser Ile Lys Gly Ser Phe Ala Ser Leu Val
(A) LENGTH: 140 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886:



Met Lys Thr Ser Ala Lys Val Leu Leu Thr Leu Leu Ile Val Ile Ser
1 5 10 15

Leu Gly Lys Gly Leu Asn Ser Leu Ile Ser Ala Trp Arg Gly Lys Asp
20 25 30

Asp Ala Ile Pro Ile Glu Thr Arg Leu His Lys Asn Lys Leu Thr Ile
35 40 45

Ile Ser Lys Thr Asp Ser Ile Glu Ile Gln Asp Ile Gln Phe Asn Arg
50 55 60

Glu Asn Cys Ser His Thr Tyr Thr Ser Lys Asp Leu Glu Lys Ile Gln
65 70 75 80

Lys Asp Leu Glu Glu Leu Glu Glu Gly Val Pro Glu Leu Phe Glu Glu
85 90 95

Leu Glu Arg Asp Glu Glu Ser Ile Ala Lys Asn Lys Lys Thr Ile Gln
100 105 110

Glu Tyr Gln Asn Lys Ile Ala Asn Phe Gln Lys Tyr Tyr Lys Asp Ile
115 120 125

Lys Asp Ile Asp Asp Tyr Ser Ala Leu Met Ala Gln
130 135 140



(2) INFORMATION FOR SEQ ID NO: 887:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 540 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 28...486
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887:

GAAAATTAGG TAATAAATAC AACCAGT ATG CTA AAA AAA ATA TTT TTA ACC AAC 54
Met Leu Lys Lys Ile Phe Leu Thr Asn
1 5

AGC TTA GGG ATT TTA TGC TCT AGG ATT TTT GGC TTT TTA CGG GAT TTG 102
Ser Leu Gly Ile Leu Cys Ser Arg Ile Phe Gly Phe Leu Arg Asp Leu
10 15 20 25
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 22...441
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885:

ATATTGAAAG ATAATCAAAA A ATG AAG ACA AGC GCT AAA GTA TTA TTG ACT 51
Met Lys Thr Ser Ala Lys Val Leu Leu Thr
1 5 10

TTA TTG ATT GTA ATA TCA TTA GGT AAG GGA TTA AAT AGT CTC ATA TCA 99
Leu Leu Ile Val Ile Ser Leu Gly Lys Gly Leu Asn Ser Leu Ile Ser
15 20 25

GCT TGG CGT GGC AAA GAT GAT GCG ATC CCC ATT GAA ACA AGA CTC CAT 147
Ala Trp Arg Gly Lys Asp Asp Ala Ile Pro Ile Glu Thr Arg Leu His
30 35 40

AAA AAC AAA CTG ACA ATC ATT TCT AAA ACA GAC AGC ATA GAA ATC CAA 195
Lys Asn Lys Leu Thr Ile Ile Ser Lys Thr Asp Ser Ile Glu Ile Gln
45 50 55

GAC ATT CAG TTT AAT AGA GAG AAT TGT TCT CAC ACT TAT ACT AGT AAG 243
Asp Ile Gln Phe Asn Arg Glu Asn Cys Ser His Thr Tyr Thr Ser Lys
60 65 70

GAT TTG GAA AAA ATT CAA AAA GAT TTA GAA GAG CTT GAA GAA GGA GTG 291
Asp Leu Glu Lys Ile Gln Lys Asp Leu Glu Glu Leu Glu Glu Gly Val
75 80 85 90

CCT GAA TTG TTC GAG GAG CTT GAG CGT GAT GAA GAG TCC ATC GCT AAA 339
Pro Glu Leu Phe Glu Glu Leu Glu Arg Asp Glu Glu Ser Ile Ala Lys
95 100 105

AAT AAA AAA ACG ATC CAA GAG TAT CAA AAT AAA ATT GCT AAT TTT CAA 387
Asn Lys Lys Thr Ile Gln Glu Tyr Gln Asn Lys Ile Ala Asn Phe Gln
110 115 120

AAA TAC TAT AAA GAT ATA AAA GAT ATT GAC GAT TAT TCG GCG TTA ATG 435
Lys Tyr Tyr Lys Asp Ile Lys Asp Ile Asp Asp Tyr Ser Ala Leu Met
125 130 135

GCT CAA TGAACATAAA GATTCTTATA CTTGGGATAA TGATCTTGAT 481
Ala Gln
140



(2) INFORMATION FOR SEQ ID NO: 886: 
(i) SEQUENCE CHARACTERISTICS:
TTAGATTCTT AAGCTTGAGC GATAACCTTT AAAAAGCGTT ATGGGGTGGT GTTGCAAAAC 838

CC 840



(2) INFORMATION FOR SEQ ID NO: 884:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 244 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884:



Met Lys Lys Ala Gly Phe Leu Phe Leu Ala Val Met Ala Ile Val Val
1 5 10 15

Met Ser Leu Asn Ala Lys Asp Pro Asn Val Leu Arg Lys Ile Val Phe
20 25 30

Glu Lys Cys Leu Pro Asn Tyr Glu Lys Asn Gln Asn Pro Ser Pro Cys
35 40 45

Ile Glu Val Lys Pro Asp Ala Gly Tyr Val Val Leu Lys Asp Ile Asn
50 55 60

Gly Pro Leu Gln Tyr Leu Leu Met Pro Thr Thr His Ile Ser Gly Ile
65 70 75 80

Glu Ser Pro Leu Leu Leu Asp Pro Ser Thr Pro Asn Phe Phe Tyr Leu
85 90 95

Ser Trp Gln Ala Arg Asp Phe Met Ser Lys Lys Tyr Gly Gln Pro Ile
100 105 110

Pro Asp Tyr Ala Ile Ser Leu Thr Ile Asn Ser Ser Lys Gly Arg Ser
115 120 125

Gln Asn His Phe His Ile His Ile Ser Cys Ile Ser Leu Glu Ala Arg
130 135 140

Lys Gln Leu Asp Asn Asn Leu Lys Lys Ile Asn Ser Arg Trp Ser Pro
145 150 155 160

Leu Pro Gly Gly Leu Asn Gly His Lys Tyr Leu Ala Arg Arg Val Thr
165 170 175

Glu Ser Glu Leu Val Gln Lys Ser Pro Phe Val Met Leu Asn Lys Glu
180 185 190

Val Pro Asn Ala Tyr Lys Arg Met Gly Asp Tyr Gly Leu Ala Val Val
195 200 205

Gln Gln Ser Asp Asn Ser Phe Val Leu Leu Ala Thr Gln Phe Asn Pro
210 215 220

Leu Thr Leu Asn Arg Ala Ser Ala Glu Glu Ile Gln Asp His Glu Cys
225 230 235 240

Ala Ile Leu His



(2) INFORMATION FOR SEQ ID NO: 885:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 481 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
Asp Pro Asn Val Leu Arg Lys Ile Val Phe Glu Lys Cys Leu Pro Asn
25 30 35

TAT GAG AAA AAT CAG AAT CCT TCG CCA TGC ATA GAA GTC AAA CCC GAT 199
Tyr Glu Lys Asn Gln Asn Pro Ser Pro Cys Ile Glu Val Lys Pro Asp
40 45 50

GCC GGC TAT GTG GTT TTA AAA GAT ATT AAC GGC CCG TTG CAA TAT TTG 247
Ala Gly Tyr Val Val Leu Lys Asp Ile Asn Gly Pro Leu Gln Tyr Leu
55 60 65 70

TTG ATG CCA ACA ACT CAC ATT AGC GGT ATT GAA AGC CCT TTG TTA CTT 295
Leu Met Pro Thr Thr His Ile Ser Gly Ile Glu Ser Pro Leu Leu Leu
75 80 85

GAT CCT TCT ACG CCT AAC TTT TTT TAT TTA TCC TGG CAA GCG CGT GAT 343
Asp Pro Ser Thr Pro Asn Phe Phe Tyr Leu Ser Trp Gln Ala Arg Asp
90 95 100

TTT ATG AGT AAA AAA TAC GGC CAA CCC ATT CCT GAT TAT GCG ATT TCT 391
Phe Met Ser Lys Lys Tyr Gly Gln Pro Ile Pro Asp Tyr Ala Ile Ser
105 110 115

TTG ACG ATT AAC TCT AGC AAA GGG CGA TCG CAA AAC CAT TTT CAT ATC 439
Leu Thr Ile Asn Ser Ser Lys Gly Arg Ser Gln Asn His Phe His Ile
120 125 130

CAT ATC TCT TGC ATT AGT CTT GAA GCA CGC AAA CAG CTG GAT AAT AAC 487
His Ile Ser Cys Ile Ser Leu Glu Ala Arg Lys Gln Leu Asp Asn Asn
135 140 145 150

CTA AAA AAA ATC AAC AGC CGT TGG TCG CCA TTA CCG GGC GGT TTG AAT 535
Leu Lys Lys Ile Asn Ser Arg Trp Ser Pro Leu Pro Gly Gly Leu Asn
155 160 165

GGG CAT AAA TAC TTG GCG CGT CGG GTA ACA GAG AGC GAG TTA GTG CAA 583
Gly His Lys Tyr Leu Ala Arg Arg Val Thr Glu Ser Glu Leu Val Gln
170 175 180

AAA AGC CCG TTT GTC ATG CTT AAT AAA GAA GTG CCT AAT GCG TAC AAA 631
Lys Ser Pro Phe Val Met Leu Asn Lys Glu Val Pro Asn Ala Tyr Lys
185 190 195

CGC ATG GGG GAC TAT GGC TTA GCG GTG GTG CAA CAA AGC GAT AAC TCC 679
Arg Met Gly Asp Tyr Gly Leu Ala Val Val Gln Gln Ser Asp Asn Ser
200 205 210

TTT GTC TTA TTA GCG ACA CAA TTT AAC CCA TTG ACT TTA AAT CGC GCT 727
Phe Val Leu Leu Ala Thr Gln Phe Asn Pro Leu Thr Leu Asn Arg Ala
215 220 225 230

TCA GCC GAA GAG ATT CAA GAT CAT GAA TGC GCG ATT TTG CAC TAAAGCGAG 778
Ser Ala Glu Glu Ile Gln Asp His Glu Cys Ala Ile Leu His
235 240
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:882: 



Met His Glu Tyr Ser Val Val Ser Ser Leu Ile Ala Leu Cys Glu Glu
1 5 10 15

His Ala Lys Lys Asn Gln Ala His Lys Ile Glu Arg Val Val Val Gly
20 25 30

Ile Gly Glu Arg Ser Ala Met Asp Lys Ser Leu Phe Val Ser Ala Phe
35 40 45

Glu Thr Phe Arg Glu Glu Ser Leu Val Cys Lys Asp Ala Ile Leu Asp
50 55 60

Ile Val Asp Glu Lys Val Glu Leu Glu Cys Lys Asp Cys Ser His Val
65 70 75 80

Phe Lys Pro Asn Ala Leu Asp Tyr Gly Val Cys Glu Lys Cys His Ser
85 90 95

Lys Asn Val Ile Ile Thr Gln Gly Asn Glu Met Arg Leu Leu Ser Leu
100 105 110

Glu Met Leu Ala Glu
115



(2) INFORMATION FOR SEQ ID NO:883: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 38...769
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883: 

TTGCAAAAAC TCTCATTAAA AACAAGGAGC AAAAAAG ATG AAA AAG GCG GGC TTT 55
Met Lys Lys Ala Gly Phe
1 5

CTT TTT TTA GCG GTA ATG GCT ATC GTT GTT ATG AGT TTA AAC GCT AAA 103
Leu Phe Leu Ala Val Met Ala Ile Val Val Met Ser Leu Asn Ala Lys
10 15 20

GAT CCG AAT GTG TTG CGT AAG ATT GTT TTT GAG AAA TGT CTG CCT AAT 151

(2) INFORMATION FOR SEQ ID NO: 881:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 410 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16...366
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881: 

AGATTAAGGT TTAGT ATG CAT GAA TAC TCG GTC GTT TCT TCT TTA ATC GCT 51
Met His Glu Tyr Ser Val Val Ser Ser Leu Ile Ala
1 5 10

CTT TGC GAA GAG CAT GCG AAG AAA AAT CAA GCC CAT AAG ATT GAA AGA 99
Leu Cys Glu Glu His Ala Lys Lys Asn Gln Ala His Lys Ile Glu Arg
15 20 25

GTC GTG GTC GGT ATT GGT GAA AGA AGT GCT ATG GAT AAG AGC TTG TTT 147
Val Val Val Gly Ile Gly Glu Arg Ser Ala Met Asp Lys Ser Leu Phe
30 35 40

GTG AGT GCG TTT GAG ACT TTT AGA GAA GAA TCT TTG GTG TGT AAA GAC 195
Val Ser Ala Phe Glu Thr Phe Arg Glu Glu Ser Leu Val Cys Lys Asp
45 50 55 60

GCT ATT TTA GAC ATT GTA GAT GAA AAG GTT GAA TTA GAA TGC AAG GAT 243
Ala Ile Leu Asp Ile Val Asp Glu Lys Val Glu Leu Glu Cys Lys Asp
65 70 75

TGT TCG CAT GTT TTT AAG CCT AAC GCG CTA GAT TAT GGG GTG TGT GAG 291
Cys Ser His Val Phe Lys Pro Asn Ala Leu Asp Tyr Gly Val Cys Glu
80 85 90

AAA TGC CAC AGC AAG AAT GTT ATT ATC ACT CAA GGC AAT GAA ATG CGT 339
Lys Cys His Ser Lys Asn Val Ile Ile Thr Gln Gly Asn Glu Met Arg
95 100 105

TTG TTG TCT TTA GAA ATG TTA GCG GAA TAACCGATGC AAGAAGAATT GAACGCT 393
Leu Leu Ser Leu Glu Met Leu Ala Glu
110 115

TACCAGCAAG AAATTGA 410

(2) INFORMATION FOR SEQ ID NO: 882:
(2) INFORMATION FOR SEQ ID NO: 880:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880:

Met Asn Lys Pro Phe Leu Ile Leu Leu Ile Ala Leu Ile Val Phe Ser
1 5 10 15

Gly Cys Asn Met Arg Lys Tyr Phe Lys Pro Ala Lys His Gln Ile Lys
20 25 30

Gly Glu Ala Tyr Phe Pro Asn His Leu Gln Glu Ser Ile Val Ser Ser
35 40 45

Asn Arg Tyr Gly Ala Ile Leu Lys Asn Gly Ala Val Ile Gly Asp Lys
50 55 60

Gly Leu Thr Gln Leu Arg Ile Gly Lys Asn Phe Asn Tyr Glu Ser Ser
65 70 75 80

Phe Leu Asn Glu Ser Gln Gly Phe Phe Ile Leu Ala Gln Asp Cys Leu
85 90 95

Asn Lys Ile Asp Lys Lys Thr Asn Lys Ser Lys Val Ala Lys Thr Glu
100 105 110

Glu Thr Glu Leu Lys Leu Lys Gly Val Glu Ala Glu Val Gln Asp Lys
115 120 125

Val Cys His Gln Val Glu Leu Ile Ser Asn Asn Pro Asn Ala Ser Gln
130 135 140

Gln Ser Ile Val Ile Pro Leu Glu Thr Phe Ala Leu Ser Ala Ser Val
145 150 155 160

Lys Gly Asn Leu Leu Ala Val Val Leu Ala Asp Asn Ser Ala Asn Leu
165 170 175

Tyr Asp Ile Thr Ser Gln Lys Leu Leu Phe Ser Glu Lys Gly Ser Pro
180 185 190

Ser Thr Thr Ile Asn Ser Leu Met Ala Met Pro Ile Phe Met Asp Thr
195 200 205

Val Val Val Phe Pro Met Leu Asp Gly Arg Leu Leu Val Val Asp Tyr
210 215 220

Val His Gly Asn Pro Thr Pro Ile Arg Asn Ile Val Ile Ser Ser Asp
225 230 235 240

Lys Phe Phe Asn Asn Ile Thr Tyr Leu Ile Val Asp Gly Asn Asn Met
245 250 255

Ile Ala Ser Thr Gly Lys Arg Ile Leu Ser Val Val Ser Gly Gln Glu
260 265 270

Phe Asn Tyr Asp Gly Asp Ile Val Asp Leu Leu Tyr Asp Lys Gly Thr
275 280 285

Leu Tyr Val Leu Thr Leu Asp Gly Gln Ile Leu Gln Met Asp Lys Ser
290 295 300

Leu Arg Glu Leu Asn Ser Val Lys Leu Pro Xaa Xaa Ala Gln His Asp
305 310 315 320

Cys Ile Lys Pro

105 110 115

GGC GTT GAA GCG GAA GTC CAA GAT AAA GTC TGT CAT CAA GTG GAA TTG 438
Gly Val Glu Ala Glu Val Gln Asp Lys Val Cys His Gln Val Glu Leu
120 125 130 135

ATT AGC AAT AAC CCT AAC GCC AGC CAA CAA TCT ATC GTT ATT CCT TTG 486
Ile Ser Asn Asn Pro Asn Ala Ser Gln Gln Ser Ile Val Ile Pro Leu
140 145 150

GAG ACT TTT GCC TTG AGC GCA AGC GTT AAA GGG AAT CTT TTA GCG GTG 534
Glu Thr Phe Ala Leu Ser Ala Ser Val Lys Gly Asn Leu Leu Ala Val
155 160 165

GTG TTA GCG GAC AAT TCA GCG AAC TTA TAC GAC ATC ACT TCT CAA AAA 582
Val Leu Ala Asp Asn Ser Ala Asn Leu Tyr Asp Ile Thr Ser Gln Lys
170 175 180

TTG CTT TTT AGT GAG AAA GGT TCC CCA AGC ACC ACG ATC AAT TCT TTA 630
Leu Leu Phe Ser Glu Lys Gly Ser Pro Ser Thr Thr Ile Asn Ser Leu
185 190 195

ATG GCG ATG CCT ATT TTT ATG GAT ACG GTC GTG GTG TTC CCC ATG CTA 678
Met Ala Met Pro Ile Phe Met Asp Thr Val Val Val Phe Pro Met Leu
200 205 210 215

GAT GGG CGC TTG TTG GTC GTG GAT TAT GTG CAC GGA AAC CCT ACG CCT 726
Asp Gly Arg Leu Leu Val Val Asp Tyr Val His Gly Asn Pro Thr Pro
220 225 230

ATT AGA AAC ATT GTT ATC AGC AGC GAT AAG TTT TTT AAC AAT ATC ACC 774
Ile Arg Asn Ile Val Ile Ser Ser Asp Lys Phe Phe Asn Asn Ile Thr
235 240 245

TAC CTT ATC GTA GAT GGC AAT AAC ATG ATC GCT TCT ACA GGG AAA AGG 822
Tyr Leu Ile Val Asp Gly Asn Asn Met Ile Ala Ser Thr Gly Lys Arg
250 255 260

ATA CTC TCA GTA GTG AGC GGT CAA GAG TTC AAC TAT GAT GGG GAT ATT 870
Ile Leu Ser Val Val Ser Gly Gln Glu Phe Asn Tyr Asp Gly Asp Ile
265 270 275

GTG GAT TTG CTT TAT GAT AAG GGG ACT TTA TAT GTG CTC ACG CTA GAC 918
Val Asp Leu Leu Tyr Asp Lys Gly Thr Leu Tyr Val Leu Thr Leu Asp
280 285 290 295

GGG CAG ATT TTG CAA ATG GAT AAG AGT TTG AGG GAA TTA AAC AGC GTG 966
Gly Gln Ile Leu Gln Met Asp Lys Ser Leu Arg Glu Leu Asn Ser Val
300 305 310

AAA CTG CCT NTC NTC GCT CAA CAC GAT TGT ATT AAA CCA TAATAAATTG TA 1017
Lys Leu Pro Xaa Xaa Ala Gln His Asp Cys Ile Lys Pro
315 320

TTCTTTAGAA AAACGAGGGT ATGTGATAGA 1047

195 200 205

Arg Glu His Gln Glu Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys
210 215 220

Ala Leu Glu Met Asn Lys Lys
225 230

(2) INFORMATION FOR SEQ ID NO: 879:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 34...1005
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879:

AGAAAGAAAC CATTCAAGGA ACGCATTGAT TTG ATG AAT AAA CCA TTT TTA ATC 54
Met Asn Lys Pro Phe Leu Ile
1 5

TTA CTC ATA GCC CTA ATT GTC TTT AGC GGC TGT AAC ATG AGA AAA TAT 102
Leu Leu Ile Ala Leu Ile Val Phe Ser Gly Cys Asn Met Arg Lys Tyr
10 15 20

TTC AAA CCC GCT AAA CAC CAA ATT AAA GGC GAA GCG TAT TTC CCT AAC 150
Phe Lys Pro Ala Lys His Gln Ile Lys Gly Glu Ala Tyr Phe Pro Asn
25 30 35

CAT TTG CAA GAA AGT ATC GTT TCG TCT AAT CGT TAT GGA GCC ATT TTG 198
His Leu Gln Glu Ser Ile Val Ser Ser Asn Arg Tyr Gly Ala Ile Leu
40 45 50 55

AAA AAT GGA GCG GTT ATA GGC GAT AAA GGT TTA ACG CAG CTA AGA ATC 246
Lys Asn Gly Ala Val Ile Gly Asp Lys Gly Leu Thr Gln Leu Arg Ile
60 65 70

GGT AAG AAC TTC AAT TAC GAA AGC AGT TTT TTA AAT GAG AGT CAA GGG 294
Gly Lys Asn Phe Asn Tyr Glu Ser Ser Phe Leu Asn Glu Ser Gln Gly
75 80 85

TTT TTT ATT CTT GCG CAA GAT TGT TTG AAC AAG ATT GAT AAA AAA ACA 342
Phe Phe Ile Leu Ala Gln Asp Cys Leu Asn Lys Ile Asp Lys Lys Thr
90 95 100

AAC AAA AGC AAG GTG GCT AAG ACT GAA GAA ACG GAA TTG AAA TTA AAG 390

Asn Lys Ser Lys Val Ala Lys Thr Glu Glu Thr Glu Leu Lys Leu Lys

CCA AAA CCA AAT TCT AAA GAA GAA AAA CGC CGC TTG AAA GAA GAA AAG 576
Pro Lys Pro Asn Ser Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys
180 185 190

AAA AAA GCC AAA GCC GAA CAA AGA GCG AGA GAA TTT GAA CAA AGA GCG 624
Lys Lys Ala Lys Ala Glu Gln Arg Ala Arg Glu Phe Glu Gln Arg Ala
195 200 205

AGA GAG CAT CAA GAA AGA GAT GAA AAA GAG CTT GAA GAA AGA AGA AAA 672
Arg Glu His Gln Glu Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys
210 215 220

GCT TTA GAA ATG AAT AAG AAG TAGGCCTATG CCAGCTAGGC AATCTTTTAC AGAT 727
Ala Leu Glu Met Asn Lys Lys
225 230

TTGAAAAA 735

(2) INFORMATION FOR SEQ ID NO: 878:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 231 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878:



Asn Cys Xaa Xaa Ser Leu Asn Thr Ile Val Leu Asn His Asn Lys Leu
1 5 10 15

Tyr Ser Leu Glu Lys Arg Gly Tyr Val Ile Glu Val Asp Leu Asn Asp
20 25 30

Phe Asp Ser Tyr Asn Val Tyr Lys Thr Pro Thr Ile Gly Ser Phe Lys
35 40 45

Phe Phe Ser Ser Asn Arg Leu Asp Lys Gly Val Phe Tyr Asp Lys Asn
50 55 60

Arg Val Tyr Tyr Asp Arg Tyr Tyr Leu Asp Tyr Asn Asp Phe Lys Pro
65 70 75 80

Lys Leu Tyr Pro Val Val Glu Lys Ser Ala Ser Lys Lys Ser Gln Lys
85 90 95

Gly Glu Lys Gly Asn Ala Pro Ile Tyr Leu Gln Glu Arg His Lys Ala
100 105 110

Lys Glu Asn Lys Gln Pro Leu Glu Glu Asn Lys Val Lys Pro Arg Asn
115 120 125

Ser Gly Phe Glu Glu Glu Glu Val Lys Thr Arg Arg Pro Glu Pro Ile
130 135 140

Arg Asp Gln Asn Asn Ala Thr Gln Gln Gly Glu Thr Lys Asn Asn Glu
145 150 155 160

Ser Lys Asn Ala Pro Val Leu Lys Glu Asn Ala Ala Lys Lys Glu Val
165 170 175

Pro Lys Pro Asn Ser Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys
180 185 190

Lys Lys Ala Lys Ala Glu Gln Arg Ala Arg Glu Phe Glu Gln Arg Ala

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...693
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877: 

AAC TGC CTN TCN TCG CTC AAC ACG ATT GTA TTA AAC CAT AAT AAA TTG 48
Asn Cys Xaa Xaa Ser Leu Asn Thr Ile Val Leu Asn His Asn Lys Leu
1 5 10 15

TAT TCT TTA GAA AAA CGA GGG TAT GTG ATA GAG GTG GAT TTA AAT GAT 96
Tyr Ser Leu Glu Lys Arg Gly Tyr Val Ile Glu Val Asp Leu Asn Asp
20 25 30

TTT GAT TCG TAT AAT GTC TAT AAA ACG CCA ACT ATA GGC ACT TTT AAG 144
Phe Asp Ser Tyr Asn Val Tyr Lys Thr Pro Thr Ile Gly Ser Phe Lys
35 40 45

TTT TTT TCA TCT AAT CGT TTG GAT AAA GGG GTG TTT TAT GAT AAA AAT 192
Phe Phe Ser Ser Asn Arg Leu Asp Lys Gly Val Phe Tyr Asp Lys Asn
50 55 60

CGG GTG TAT TAC GAT CGC TAC TAT TTA GAT TAT AAC GAT TTT AAA CCA 240
Arg Val Tyr Tyr Asp Arg Tyr Tyr Leu Asp Tyr Asn Asp Phe Lys Pro
65 70 75 80

AAA CTT TAT CCC GTT GTG GAA AAA TCG GCA TCT AAA AAA TCT CAA AAA 288
Lys Leu Tyr Pro Val Val Glu Lys Ser Ala Ser Lys Lys Ser Gln Lys
85 90 95

GGC GAA AAA GGG AAC GCT CCT ATT TAT TTG CAA GAA AGG CAT AAA GCT 336
Gly Glu Lys Gly Asn Ala Pro Ile Tyr Leu Gln Glu Arg His Lys Ala
100 105 110

AAA GAA AAT AAA CAG CCT TTA GAA GAA AAC AAA GTT AAA CCA AGA AAT 384
Lys Glu Asn Lys Gln Pro Leu Glu Glu Asn Lys Val Lys Pro Arg Asn
115 120 125

AGC GGG TTT GAA GAA GAA GAG GTT AAA ACC AGA AGG CCT GAG CCT ATT 432
Ser Gly Phe Glu Glu Glu Glu Val Lys Thr Arg Arg Pro Glu Pro Ile
130 135 140

AGG GAT CAA AAT AAC GCC ACC CAA CAA GGC GAA ACA AAA AAC AAT GAA 480
Arg Asp Gln Asn Asn Ala Thr Gln Gln Gly Glu Thr Lys Asn Asn Glu
145 150 155 160

AGT AAA AAC GCT CCT GTC TTA AAA GAA AAC GCC GCT AAA AAA GAA GTG 528
Ser Lys Asn Ala Pro Val Leu Lys Glu Asn Ala Ala Lys Lys Glu Val
165 170 175

ACAGAGACGG CATTATCAAT ATTGA 801
(2) INFORMATION FOR SEQ ID NO: 876: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876: 



Met Gln Met Met His Asn Leu Ser Phe Leu Gly Met Phe Leu Ala Ala
1 5 10 15

Leu Ser Met Ser Leu Gly His Cys Val Gly Met Cys Gly Gly Ile Val
20 25 30

Ser Ala Phe Ser Gln Ile Arg Phe Ser Lys Val Thr Ser Phe Ser Tyr
35 40 45

Gln Leu Thr Cys His Ala Leu Tyr Asn Val Gly Arg Ile Ser Thr Tyr
50 55 60

Met Leu Leu Gly Ala Ile Ala Ala Ser Leu Gly His Ser Leu Ser Val
65 70 75 80

Ser Met Gly Phe Arg Gly Val Leu Phe Ile Ser Met Gly Ile Ile Leu
85 90 95

Ile Cys Leu Ala Leu Leu Gly Ala Arg Met Glu Lys Leu Ser Phe Gln
100 105 110

Ile Pro Phe Ile Ser Phe Leu Met Lys Lys Thr Leu Gln Ser Gln Asn
115 120 125

Ile Leu Gly Leu Tyr Phe Leu Gly Val Leu Asn Gly Phe Leu Pro Cys
130 135 140

Met Met Val Tyr Ser Phe Leu Ala Ser Val Ile Leu Ser His Ser Ala
145 150 155 160

Phe Met Gly Ala Met Leu Gly Leu Ser Phe Gly Leu Gly Thr Ser Met
165 170 175

Pro Leu Phe Leu Met Gly Ile Phe Leu Ser Lys Ile Ser Val Ser Tyr
180 185 190

Arg Lys Phe Phe Asn Leu Leu Ser Lys Ile Leu Met Gly Val Phe Gly
195 200 205

Leu Tyr Ile Leu Tyr Met Gly Ile Met Leu Ile Asn His Lys Met Pro
210 215 220

His Ala Met His His Gln Asn Asn Thr Thr Gln His Asp His Lys Gly
225 230 235 240

Val His Ser His Glu His
245



(2) INFORMATION FOR SEQ ID NO:877: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

Cys Gly Gly Ile Val Ser Ala Phe Ser Gln Ile Arg Phe Ser Lys Val
30 35 40

ACA AGC TTT TCT TAC CAG CTC ACT TGC CAT GCC CTT TAT AAT GTA GGG 195
Thr Ser Phe Ser Tyr Gln Leu Thr Cys His Ala Leu Tyr Asn Val Gly
45 50 55

AGG ATC AGC ACT TAC ATG CTT TTA GGG GCT ATA GCG GCA AGT TTG GGG 243
Arg Ile Ser Thr Tyr Met Leu Leu Gly Ala Ile Ala Ala Ser Leu Gly
60 65 70 75

CAT AGT CTT AGC GTG AGC ATG GGT TTT AGG GGT GTT TTA TTC ATT AGC 291
His Ser Leu Ser Val Ser Met Gly Phe Arg Gly Val Leu Phe Ile Ser
80 85 90

ATG GGG ATT ATT TTG ATC TGT TTA GCG TTG CTA GGG GCA AGA ATG GAA 339
Met Gly Ile Ile Leu Ile Cys Leu Ala Leu Leu Gly Ala Arg Met Glu
95 100 105

AAA TTA AGC TTT CAA ATC CCT TTT ATT TCT TTT TTG ATG AAA AAA ACC 387
Lys Leu Ser Phe Gln Ile Pro Phe Ile Ser Phe Leu Met Lys Lys Thr
110 115 120

TTG CAA TCT CAA AAC ATT CTA GGG CTG TAT TTC TTA GGC GTG TTG AAC 435
Leu Gln Ser Gln Asn Ile Leu Gly Leu Tyr Phe Leu Gly Val Leu Asn
125 130 135

GGG TTT TTA CCT TGC ATG ATG GTG TAT TCG TTT TTA GCG AGC GTG ATT 483
Gly Phe Leu Pro Cys Met Met Val Tyr Ser Phe Leu Ala Ser Val Ile
140 145 150 155

CTC AGT CAT AGC GCG TTT ATG GGA GCG ATG CTA GGC CTT TCT TTT GGG 531
Leu Ser His Ser Ala Phe Met Gly Ala Met Leu Gly Leu Ser Phe Gly
160 165 170

CTT GGC ACC AGC ATG CCG TTG TTT TTA ATG GGG ATT TTT TTA AGC AAA 579
Leu Gly Thr Ser Met Pro Leu Phe Leu Met Gly Ile Phe Leu Ser Lys
175 180 185

ATT TCC GTT TCT TAC AGG AAA TTT TTC AAT CTT TTG TCT AAA ATT TTA 627
Ile Ser Val Ser Tyr Arg Lys Phe Phe Asn Leu Leu Ser Lys Ile Leu
190 195 200

ATG GGG GTT TTT GGG CTT TAT ATC CTT TAT ATG GGG ATC ATG CTC ATT 675
Met Gly Val Phe Gly Leu Tyr Ile Leu Tyr Met Gly Ile Met Leu Ile
205 210 215

AAC CAC AAA ATG CCT CAT GCC ATG CAT CAT CAA AAC AAC ACC ACT CAG 723
Asn His Lys Met Pro His Ala Met His His Gln Asn Asn Thr Thr Gln
220 225 230 235

CAT GAT CAT AAA GGA GTG CAT TCG CAT GAA CAC TAACAAAGCC CTTTTTTTGG 776

His Asp His Lys Gly Val His Ser His Glu His
240 245

Asn Gln Val Ser Leu Lys Val Met Pro Lys Asp Lys Ile Ala Leu Ile
210 215 220

Gly Pro Asn Gly Val Gly Lys Ser Thr Leu Cys Lys Ile Leu Val Glu
225 230 235 240

Glu Leu Lys Pro Asp Lys Gly Val Val Lys Trp Gly Ala Thr Val Ser
245 250 255

Lys Gly Tyr Phe Pro Gln Asn Val Ser Glu Glu Ile Ser Gly Glu Glu
260 265 270

Thr Leu Tyr Gln Trp Leu Phe Asn Phe Asn Lys Lys Ile Glu Ser Ala
275 280 285

Glu Val Arg Asn Ala Leu Gly Arg Met Leu Phe Asn Gly Glu Glu Gln
290 295 300

Glu Lys Cys Val Asn Ala Leu Ser Gly Gly Glu Lys His Arg Met Val
305 310 315 320

Leu Ser Lys Leu Met Leu Glu Gly Gly Asn Phe Leu Val Leu Asp Glu
325 330 335

Pro Thr Asn His Leu Asp Leu Glu Ala Ile Ile Ala Leu Gly Glu Ala
340 345 350

Leu Phe Lys Phe Asp Gly Ala Leu Ile Cys Val Ser His Asp Arg Glu
355 360 365

Leu Ile Asp Ala Tyr Ala Asn Arg Ile Ile Glu Leu Val Pro Ser Pro
370 375 380

Lys Gly Ala Ser Ile Ile Asp Phe Lys Gly Ser Tyr Glu Glu Tyr Leu
385 390 395 400

Ala Ser Lys Lys



(2) INFORMATION FOR SEQ ID NO: 875: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19...756
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875: 

AAAAGCAGGG ATACTAGA ATG CAA ATG ATG CAC AAT TTG AGT TTT TTG GGC 51
Met Gln Met Met His Asn Leu Ser Phe Leu Gly
1 5 10

ATG TTT TTA GCC GCT TTG AGC ATG TCT TTA GGG CAT TGT GTG GGC ATG 99
Met Phe Leu Ala Ala Leu Ser Met Ser Leu Gly His Cys Val Gly Met
15 20 25

TGT GGG GGG ATT GTG AGC GCG TTC AGT CAA ATA AGA TTT TCT AAA GTT 147

Asp Leu Glu Ala Ile Ile Ala Leu Gly Glu Ala Leu Phe Lys Phe Asp
345 350 355

GGG GCG CTG ATT TGC GTA AGC CAT GAC AGA GAG CTC ATT GAT GCG TAT 1160
Gly Ala Leu Ile Cys Val Ser His Asp Arg Glu Leu Ile Asp Ala Tyr
360 365 370

GCT AAT AGG ATC ATT GAA TTA GTC CCA AGC CCT AAA GGC GCT TCA ATC 1208
Ala Asn Arg Ile Ile Glu Leu Val Pro Ser Pro Lys Gly Ala Ser Ile
375 380 385

ATT GAT TTT AAA GGC AGT TAT GAA GAG TAT TTG GCG AGC AAA AAA TGAAA 1258
Ile Asp Phe Lys Gly Ser Tyr Glu Glu Tyr Leu Ala Ser Lys Lys
390 395 400

CCGCAAGACA TTGAAATCGT TC7UUVGCGTT TTAGAGATTA CAGGACC 1305

(2) INFORMATION FOR SEQ ID NO:874: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874:

Met Tyr Glu Cys Glu Val Ala Ile Glu Lys Ile Leu Glu Asp Leu Gly
1 5 10 15

Ile Pro Ser Ser Lys His Asn Asp Leu Met Lys Thr Leu Pro Ser Ser
20 25 30

Asp Lys Phe Lys Ile Leu Leu Ala Gln Val Leu Phe Pro Lys Pro Asp
35 40 45

Ile Leu Leu Leu Asp Glu Pro Thr Asn Asn Leu Asp Leu Asn Ala Ile
50 55 60

Glu Trp Leu Glu Asn Asn Leu Lys Arg His Glu Gly Thr Met Val Val
65 70 75 80

Ile Ser His Asp Arg His Phe Leu Asn Ala Val Cys Thr His Ile Leu
85 90 95

Asp Leu Asp Phe His Ser Val Arg Glu Phe Ser Gly Asn Tyr Asp Asp
100 105 110

Trp Tyr Ile Ala Ser Thr Leu Ile Ala Lys Gln Gln Glu Ala Glu Arg
115 120 125

Asn Lys Lys Leu Lys Glu Lys Glu Glu Leu Glu Lys Phe Ile Ala Arg
130 135 140

Phe Xaa Xaa Xaa Ala Ser Lys Ala Lys Gln Ala Thr Ser Arg Gln Lys
145 150 155 160

Gln Leu Asp Lys Leu Asp Ile Gln Ser Leu Ala Val Ser Ser Arg Arg
165 170 175

Asp Pro Ser Ile Ile Phe Lys Pro Lys Arg Thr Ile Gly Asn Glu Ala
180 185 190

Leu Glu Cys Glu Asn Ile Ser Lys Ser Tyr Asp Asp Gln Ile Val Leu
195 200 205

120 125 130

GAA AAA GAA GAG CTA GAA AAA TTC ATC GCG CGC TTT ARN NNN NAC GCT 488
Glu Lys Glu Glu Leu Glu Lys Phe Ile Ala Arg Phe Xaa Xaa Xaa Ala
135 140 145

TCT AAA GCC AAG CAA GCC ACC AGC CGC CAA AAA CAA CTG GAT AAA TTA 536
Ser Lys Ala Lys Gln Ala Thr Ser Arg Gln Lys Gln Leu Asp Lys Leu
150 155 160 165

GAC ATT CAA AGT TTA GCG GTA TCT AGC AGG AGG GAT CCT AGC ATT ATT 584
Asp Ile Gln Ser Leu Ala Val Ser Ser Arg Arg Asp Pro Ser Ile Ile
170 175 180

TTT AAA CCC AAA CGC ACC ATT GGT AAT GAA GCC TTA GAG TGC GAA AAC 632
Phe Lys Pro Lys Arg Thr Ile Gly Asn Glu Ala Leu Glu Cys Glu Asn
185 190 195

ATC TCT AAA AGT TAT GAC GAC CAA ATC GTT TTA AAT CAA GTG AGC TTG 680
Ile Ser Lys Ser Tyr Asp Asp Gln Ile Val Leu Asn Gln Val Ser Leu
200 205 210

AAA GTG ATG CCT AAA GAC AAG ATC GCC CTC ATA GGG CCA AAC GGC GTG 728
Lys Val Met Pro Lys Asp Lys Ile Ala Leu Ile Gly Pro Asn Gly Val
215 220 225

GGT AAA TCC ACG CTT TGT AAA ATT CTA GTA GAA GAA TTA AAG CCG GAT 776
Gly Lys Ser Thr Leu Cys Lys Ile Leu Val Glu Glu Leu Lys Pro Asp
230 235 240 245

AAG GGC GTG GTG AAA TGG GGG GCG ACG GTT TCA AAA GGC TAT TTC CCT 824
Lys Gly Val Val Lys Trp Gly Ala Thr Val Ser Lys Gly Tyr Phe Pro
250 255 260

CAA AAC GTG AGC GAA GAA ATT AGC GGG GAA GAG ACC TTG TAT CAA TGG 872
Gln Asn Val Ser Glu Glu Ile Ser Gly Glu Glu Thr Leu Tyr Gln Trp
265 270 275

CTC TTT AAC TTC AAT AAA AAG ATT GAA AGC GCT GAG GTT AGG AAC GCT 920
Leu Phe Asn Phe Asn Lys Lys Ile Glu Ser Ala Glu Val Arg Asn Ala
280 285 290

TTA GGG AGG ATG CTG TTT AAT GGC GAA GAG CAA GAA AAA TGC GTG AAC 968
Leu Gly Arg Met Leu Phe Asn Gly Glu Glu Gln Glu Lys Cys Val Asn
295 300 305

GCT TTA AGT GGG GGC GAA AAA CAC CGA ATG GTT TTA TCC AAG CTC ATG 1016
Ala Leu Ser Gly Gly Glu Lys His Arg Met Val Leu Ser Lys Leu Met
310 315 320 325

CTA GAG GGG GGG AAT TTT TTA GTC TTA GAT GAG CCA ACC AAC CAT TTG 1064
Leu Glu Gly Gly Asn Phe Leu Val Leu Asp Glu Pro Thr Asn His Leu
330 335 340

GAT TTA GAA GCG ATT ATC GCT TTA GGC GAA GCG CTC TTT AAA TTT GAT 1112

(2) INFORMATION FOR SEQ ID NO:873:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 42...1253
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873: 

GATTAGGGGA GTTAGAAACC ATTTGCGTGG AAGAAGATCC C ATG TAT GAA TGC GAA 56
Met Tyr Glu Cys Glu
1 5

GTG GCG ATT GAA AAA ATC CTA GAA GAT TTA GGC ATT CCT AGC TCT AAA 104
Val Ala Ile Glu Lys Ile Leu Glu Asp Leu Gly Ile Pro Ser Ser Lys
10 15 20

CAC AAC GAT TTG ATG AAA ACC CTG CCA AGC AGC GAT AAA TTT AAA ATC 152
His Asn Asp Leu Met Lys Thr Leu Pro Ser Ser Asp Lys Phe Lys Ile
25 30 35

CTT CTC GCT CAA GTC TTG TTC CCT AAA CCG GAT ATT TTG CTT TTA GAT 200
Leu Leu Ala Gln Val Leu Phe Pro Lys Pro Asp Ile Leu Leu Leu Asp
40 45 50

GAG CCG ACC AAC AAC CTG GAT TTA AAC GCC ATT GAA TGG CTA GAA AAC 248
Glu Pro Thr Asn Asn Leu Asp Leu Asn Ala Ile Glu Trp Leu Glu Asn
55 60 65

AAC CTC AAA CGC CAT GAA GGC ACG ATG GTC GTC ATT AGC CAT GAC AGG 296
Asn Leu Lys Arg His Glu Gly Thr Met Val Val Ile Ser His Asp Arg
70 75 80 85

CAT TTT TTA AAT GCG GTA TGC ACG CAT ATT TTG GAT TTG GAT TTC CAC 344
His Phe Leu Asn Ala Val Cys Thr His Ile Leu Asp Leu Asp Phe His
90 95 100

AGC GTG CGC GAA TTT AGC GGG AAT TAT GAC GAT TGG TAT ATC GCT TCC 392
Ser Val Arg Glu Phe Ser Gly Asn Tyr Asp Asp Trp Tyr Ile Ala Ser
105 110 115

ACT CTG ATC GCT AAA CAG CAA GAG GCC GAA CGC AAT AAA AAA CTC AAA 440

Thr Leu Ile Ala Lys Gln Gln Glu Ala Glu Arg Asn Lys Lys Leu Lys

TAC CGA GAA AAA CTA CTG ACC TTC AAA CCT CTC CAA AAC AAG GAA TAACA 976
Tyr Arg Glu Lys Leu Leu Thr Phe Lys Pro Leu Gln Asn Lys Glu
285 290 295

TGAGTTACGA AACGATCGCA GAAAGCAATG AAAG 1010

(2) INFORMATION FOR SEQ ID NO: 872: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872: 

Met Phe Cys Phe Glu Asn Leu Asn Ile Gln Asn Xaa Ile Lys Ser Lys
1 5 10 15

Ser Phe Gly Gly Ile Val Lys Ser Ile Ser Met Asn Asp Leu Gln Gln
20 25 30

Ile Thr Ile Pro Ile Pro Pro Leu Glu Ile Gln Gln Glu Ile Val Lys
35 40 45

Ile Leu Asp Ala Phe Thr Glu Leu Asn Thr Glu Leu Asn Thr Glu Leu
50 55 60

Lys Ala Arg Lys Lys Gln Tyr Glu Tyr Tyr Gln Asn Met Leu Leu Asp
65 70 75 80

Phe Asn Asp Ile Asn Gln Asn His Lys Asp Ala Lys Ile Lys Thr Tyr
85 90 95

Pro Lys Arg Leu Lys Thr Leu Leu His Thr Leu Ala Pro Lys Gly Val
100 105 110

Glu Phe Arg Lys Leu Gly Glu Val Cys Glu Ser Thr Asn Lys Lys Thr
115 120 125

Leu Lys Ile Ser Glu Val Ser Glu Val Lys Asn Lys Gly Met Tyr Pro
130 135 140

Val Ile Asn Ser Gly Arg Asp Leu Tyr Gly Tyr Tyr His Asp Phe Asn
145 150 155 160

Asn Asp Gly Glu Asn Ile Thr Ile Ala Ser Arg Gly Glu Tyr Ala Gly
165 170 175

Phe Ile Asn Tyr Phe Asn Glu Lys Phe Phe Ala Gly Gly Leu Cys Tyr
180 185 190

Pro Tyr Lys Val Lys Asp Thr Asn Glu Leu Leu Thr Lys Phe Leu Tyr
195 200 205

Phe Tyr Leu Lys Thr Asn Glu Ile Gln Ile Met Glu Asn Leu Val Phe
210 215 220

Arg Gly Ser Ile Pro Ala Leu Asn Lys Ala Asp Ile Glu Thr Leu Thr
225 230 235 240

Ile Pro Ile Pro Pro Leu Glu Ile Gln Gln Glu Ile Val Lys Ile Leu
245 250 255

Asp Gln Phe Ser Ala Leu Thr Thr Asp Leu Leu Ala Gly Ile Pro Ala
260 265 270

Glu Ile Lys Ala Arg Lys Lys Gln Tyr Glu Tyr Tyr Arg Glu Lys Leu
275 280 285

Leu Thr Phe Lys Pro Leu Gln Asn Lys Glu

Leu Asn Thr Glu Leu Lys Ala Arg Lys Lys Gln Tyr Glu Tyr Tyr Gln
60 65 70 75

AAC ATG CTT TTA GAC TTT AAC GAT ATT AAT CAA AAC CAC AAA GAC GCC 350
Asn Met Leu Leu Asp Phe Asn Asp Ile Asn Gln Asn His Lys Asp Ala
80 85 90

AAA ATA AAA ACC TAC CCT AAA CGC TTG AAA ACC TTA CTC CAC ACT TTA 398
Lys Ile Lys Thr Tyr Pro Lys Arg Leu Lys Thr Leu Leu His Thr Leu
95 100 105

GCG CCT AAG GGG GTG GAG TTT AGG AAA TTG GGG GAG GTG TGT GAA AGC 446
Ala Pro Lys Gly Val Glu Phe Arg Lys Leu Gly Glu Val Cys Glu Ser
110 115 120

ACA AAT AAA AAA ACA CTC AAA ATA AGC GAA GTA AGT GAA GTA AAA AAT 494
Thr Asn Lys Lys Thr Leu Lys Ile Ser Glu Val Ser Glu Val Lys Asn
125 130 135

AAG GGA ATG TAT CCA GTG ATA AAT TCA GGG AGG GAT TTG TAT GGT TAT 542
Lys Gly Met Tyr Pro Val Ile Asn Ser Gly Arg Asp Leu Tyr Gly Tyr
140 145 150 155

TAC CAT GAT TTT AAC AAT GAT GGA GAA AAT ATA ACT ATT GCA TCT AGG 590
Tyr His Asp Phe Asn Asn Asp Gly Glu Asn Ile Thr Ile Ala Ser Arg
160 165 170

GGA GAA TAT GCA GGA TTT ATA AAC TAT TTC AAT GAA AAA TTT TTT GCA 638
Gly Glu Tyr Ala Gly Phe Ile Asn Tyr Phe Asn Glu Lys Phe Phe Ala
175 180 185

GGG GGT CTA TGT TAT CCC TAT AAA GTT AAA GAC ACT AAC GAG CTT TTA 686
Gly Gly Leu Cys Tyr Pro Tyr Lys Val Lys Asp Thr Asn Glu Leu Leu
190 195 200

ACA AAA TTT TTA TAC TTT TAT CTC AAA ACT AAT GAA ATC CAA ATT ATG 734
Thr Lys Phe Leu Tyr Phe Tyr Leu Lys Thr Asn Glu Ile Gln Ile Met
205 210 215

GAG AAC CTT GTT TTT CGT GGC AGT ATC CCC GCA CTC AAT AAA GCA GAT 782
Glu Asn Leu Val Phe Arg Gly Ser Ile Pro Ala Leu Asn Lys Ala Asp
220 225 230 235

ATT GAA ACT TTA ACA ATC CCC ATC CCA CCT CTA GAG ATC CAA CAA GAG 830
Ile Glu Thr Leu Thr Ile Pro Ile Pro Pro Leu Glu Ile Gln Gln Glu
240 245 250

ATC GTT AAG ATT TTG GAT CAA TTT TCA GCC CTA ACC ACC GAT TTA TTA 878
Ile Val Lys Ile Leu Asp Gln Phe Ser Ala Leu Thr Thr Asp Leu Leu
255 260 265

GCC GGT ATC CCC GCT GAA ATA AAA GCC CGA AAA AAG CAA TAC GAA TAT 926

Ala Gly Ile Pro Ala Glu Ile Lys Ala Arg Lys Lys Gln Tyr Glu Tyr
270 275 280

145 150 155 160

Ala Ala Gln Lys Tyr Ser Val Ile Ala Val Met Thr Gly Lys Thr Asp
165 170 175

Tyr Val Ser Asp Gly Lys Lys Val Leu Ser Ile Thr Gly Gly Ser Glu
180 185 190

Tyr Leu Ala Leu Ile Thr Gly Ala Gly Cys Leu His Ala Ala Ala Cys
195 200 205

Ala Ser Phe Leu Ser Leu Lys Lys Asp Pro Leu Asp Ser Met Ala Gln
210 215 220

Leu Cys Ala Leu Tyr Lys Gln Ala Ala Phe Asn Ala Gln Lys Lys Val
225 230 235 240

Leu Glu Asn Asn Gly Ser Asn Gly Ser Phe Leu Phe Tyr Phe Leu Asp
245 250 255

Ala Leu Ser Leu Pro Ile Glu Leu Glu Asn Ser Leu Ile Lys Glu Glu
260 265 270

Trp



(2) INFORMATION FOR SEQ ID NO: 871: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 78...971
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871: 

ATCCAAATAA TTGGGCGATT AAAGAGGGAA TTTATTCAAT CAAACCAAAT AAAAAAATAG 60

TATTTCCAAG ATTTTTA ATG TTT TGC TTT GAA AAT TTG AAT ATT CAA AAT 110
Met Phe Cys Phe Glu Asn Leu Asn Ile Gln Asn
1 5 10

GMT ATA AAA AGT AAA AGT TTT GGA GGA ATA GTT AAA AGT ATA TCA ATG 158
Xaa Ile Lys Ser Lys Ser Phe Gly Gly Ile Val Lys Ser Ile Ser Met
15 20 25

AAC GAT TTA CAA CAA ATA ACC ATC CCC ATC CCA CCC CTA GAG ATC CAA 206
Asn Asp Leu Gln Gln Ile Thr Ile Pro Ile Pro Pro Leu Glu Ile Gln
30 35 40

CAA GAG ATC GTT AAG ATT TTG GAC GCT TTC ACA GAA TTA AAC ACA GAA 254
Gln Glu Ile Val Lys Ile Leu Asp Ala Phe Thr Glu Leu Asn Thr Glu
45 50 55

TTA AAC ACA GAA TTA AAA GCG CGC AAA AAG CAA TAT GAG TAT TAC CAA 302

180 185 190

TTA GCG CTC ATT ACT GGG GCT GGG TGT TTG CAT GCC GCA GCA TGC GCG 684
Leu Ala Leu Ile Thr Gly Ala Gly Cys Leu His Ala Ala Ala Cys Ala
195 200 205

AGC TTT TTA AGT TTG AAA AAA GAC CCC TTA GAT TCT ATG GCG CAA CTT 732
Ser Phe Leu Ser Leu Lys Lys Asp Pro Leu Asp Ser Met Ala Gln Leu
210 215 220 225

TGC GCG CTC TAT AAA CAA GCC GCT TTT AAC GCG CAA AAA AAG GTG TTG 780
Cys Ala Leu Tyr Lys Gln Ala Ala Phe Asn Ala Gln Lys Lys Val Leu
230 235 240

GAA AAT AAC GGC TCT AAT GGT TCG TTC TTG TTT TAT TTT TTA GAT GCT 828
Glu Asn Asn Gly Ser Asn Gly Ser Phe Leu Phe Tyr Phe Leu Asp Ala
245 250 255

CTA AGC TTG CCC ATA GAG TTA GAA AAC AGC CTT ATT AAG GAA GAG TGG T 877
Leu Ser Leu Pro Ile Glu Leu Glu Asn Ser Leu Ile Lys Glu Glu Trp
260 265 270

GAAAATTTAC CCGCAAGTTT TAAGCATTGC TGGCAGCGAT AG 919

(2) INFORMATION FOR SEQ ID NO: 870:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 273 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870: 

Met Asp Phe Cys Lys Ile Lys Glu Ile Leu Arg Arg Leu Val Val Leu

1 5 10 15

Lys Glu Leu Arg Gln Lys Arg Pro Leu Val His Asn Ile Thr Asn Tyr

20 25 30

Val Ala Ala Gln Phe Val Ala Asn Gly Leu Leu Ala Leu Gly Ala Ser

35 40 45

Pro Leu Met Ser Asp Ala Ile Asp Glu Met Arg Asp Leu Ala Lys Ile

50 55 60

Ser Asp Ala Leu Ala Ile Asn Ile Gly Thr Leu Asn Asp Arg Ala Ile
65 70 75 80

Leu Cys Ala Lys Glu Ala Ile Lys His Tyr Lys Ala Leu Asn Lys Pro

85 90 95

Ile Val Leu Asp Pro Val Gly Cys Ser Ala Ser Ala Leu Arg His Asp

100 105 110

Thr Ser Leu Glu Leu Leu Lys Ser Gly Gly Ile Ser Ala Leu Arg Gly

115 120 125

Asn Ala Ala Glu Leu Gly Ser Leu Val Gly Ile Ser Cys Glu Ser Lys

130 135 140

Gly Leu Asp Ser Asn Asp Ala Ala Thr Pro Val Glu Ile Ile Lys Leu
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869:

TAATGAAAAA TAGTTCATGA ACGCTTTTGC ATTAAGGCTC AAAAAAAGCG CCGTTTA ATG 60 

Met 
1 

GAT TTT TGT AAA ATA AAA GAA ATT TTA AGG AGG CTT GTG GTG TTG AAA 108

Asp Phe Cys Lys Ile Lys Glu Ile Leu Arg Arg Leu Val Val Leu Lys
5 10 15

GAA TTA CGC CAA AAA CGC CCT TTA GTG CAT AAT ATC ACC AAT TAT GTG 156

Glu Leu Arg Gln Lys Arg Pro Leu Val His Asn Ile Thr Asn Tyr Val
20 25 30

GCG GCG CAA TTT GTG GCT AAT GGT TTG TTA GCT TTA GGG GCA TCG CCT 204

Ala Ala Gln Phe Val Ala Asn Gly Leu Leu Ala Leu Gly Ala Ser Pro

35 40 45

TTA ATG AGC GAT GCG ATT GAT GAA ATG CGA GAT TTA GCG AAA ATT TCT 252

Leu Met Ser Asp Ala Ile Asp Glu Met Arg Asp Leu Ala Lys Ile Ser
50 55 60 65

GAC GCG CTC GCT ATC AAT ATT GGC ACC CTT AAT GAT CGC GCT ATT TTA 300

Asp Ala Leu Ala Ile Asn Ile Gly Thr Leu Asn Asp Arg Ala Ile Leu
70 75 80

TGC GCT AAA GAG GCT ATC AAG CAT TAC AAG GCT TTG AAC AAA CCC ATT 348

Cys Ala Lys Glu Ala Ile Lys His Tyr Lys Ala Leu Asn Lys Pro Ile
85 90 95 

GTG TTA GAT CCT GTG GGG TGT TCA GCG AGC GCT TTG CGT CAT GAC ACC 396

Val Leu Asp Pro Val Gly Cys Ser Ala Ser Ala Leu Arg His Asp Thr
100 105 110

AGT TTA GAG CTT TTG AAA AGT GGT GGG ATT AGC GCG CTT AGG GGT AAT 444

Ser Leu Glu Leu Leu Lys Ser Gly Gly Ile Ser Ala Leu Arg Gly Asn

115 120 125

GCT GCA GAA TTA GGC TCT TTA GTG GGG ATT TCT TGC GAA AGT AAG GGG 492

Ala Ala Glu Leu Gly Ser Leu Val Gly Ile Ser Cys Glu Ser Lys Gly
130 135 140 145

CTA GAC TCT AAT GAT GCC GCC ACG CCT GTA GAA ATA ATC AAA TTA GCG 540

Leu Asp Ser Asn Asp Ala Ala Thr Pro Val Glu Ile Ile Lys Leu Ala
150 155 160

GCT CAA AAA TAT TCT GTG ATA GCG GTA ATG ACG GGT AAA ACA GAT TAC 588

Ala Gln Lys Tyr Ser Val Ile Ala Val Met Thr Gly Lys Thr Asp Tyr
165 170 175

GTG AGC GAT GGG AAA AAG GTT TTG AGT ATT ACT GGG GGG AGC GAG TAT 636

Val Ser Asp Gly Lys Lys Val Leu Ser Ile Thr Gly Gly Ser Glu Tyr
Lys Ile Glu Lys Leu Pro Ile Val Asp Lys Asp Asn Val Leu Lys Gly

180 185 190 

Leu Ile Thr Ile Lys Asp Ile Gln Lys Arg Ile Glu Tyr Pro Glu Ala

195 200 205

Asn Lys Asp Asp Phe Gly Arg Leu Arg Val Gly Ala Ala Ile Gly Val

210 215 220

Gly Gln Leu Asp Arg Ala Glu Met Leu Val Lys Ala Gly Val Asp Ala
225 230 235 240

Leu Val Leu Asp Ser Ala His Gly His Ser Ala Asn Ile Leu His Thr

245 250 255

Leu Glu Glu Ile Lys Lys Ser Leu Val Val Asp Val Ile Val Gly Asn

260 265 270

Val Val Thr Lys Glu Ala Thr Ser Asp Leu Ile Ser Ala Gly Ala Asp

275 280 285

Ala Ile Lys Val Gly Ile Gly Pro Gly Ser Ile Cys Thr Thr Arg Ile

290 295 300

Val Ala Gly Val Gly Met Pro Gln Val Ser Ala Ile Asp Asn Cys Val
305 310 315 320

Glu Val Ala Ser Lys Phe Asp Ile Pro Val Ile Ala Asp Gly Gly Ile

325 330 335 

Arg Tyr Ser Gly Asp Val Ala Lys Ala Leu Ala Leu Gly Ala Ser Ser 

340 345 350 

Val Met Ile Gly Ser Leu Leu Ala Gly Thr Glu Glu Ser Pro Gly Asp

355 360 365

Phe Met Ile Tyr Gln Gly Arg Gln Tyr Lys Ser Tyr Arg Gly Met Gly

370 375 380

Ser Ile Gly Ala Met Thr Lys Gly Ser Ser Asp Arg Tyr Phe Gln Glu
385 390 395 400

Gly Val Ala Ser Glu Lys Leu Val Pro Glu Gly Ile Glu Gly Arg Val

405 410 415

Pro Tyr Arg Gly Lys Val Ser Asp Met Ile Phe Gln Leu Val Gly Gly

420 425 430

Val Arg Ser Ser Met Gly Tyr Gln Gly Ala Lys Asn Ile Leu Glu Leu

435 440 445

Tyr Gln Asn Ala Glu Phe Val Glu Ile Thr Ser Ala Gly Leu Lys Glu

450 455 460

Ser His Val His Gly Val Asp Ile Thr Lys Glu Ala Pro Asn Tyr Tyr
465 470 475 480 

Gly 



(2) INFORMATION FOR SEQ ID NO: 869:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 58...876
(D) OTHER INFORMATION: 
Pro Glu Gly Ile Glu Gly Arg Val Pro Tyr Arg Gly Lys Val Ser Asp
410 415 420 

ATG ATT TTC CAA TTA GTA GGG GGC GTG CGC TCT TCT ATG GGG TAT CAG 1350

Met Ile Phe Gln Leu Val Gly Gly Val Arg Ser Ser Met Gly Tyr Gln
425 430 435 440

GGG GCG AAA AAT ATT TTG GAA TTG TAT CAA AAC GCT GAA TTT GTA GAA 1398

Gly Ala Lys Asn Ile Leu Glu Leu Tyr Gln Asn Ala Glu Phe Val Glu
445 450 455

ATC ACT AGC GCG GGG TTA AAA GAA AGC CAT GTG CAT GGC GTG GAT ATT 1446

Ile Thr Ser Ala Gly Leu Lys Glu Ser His Val His Gly Val Asp Ile
460 465 470

ACT AAA GAA GCC CCT AAT TAT TAT GGG TGAATTGTAA AAGAAAACAA GACAAAT 1500
Thr Lys Glu Ala Pro Asn Tyr Tyr Gly
475 480

CGTTAAAAAA CTCGTTAAAA AGCTTGGTTT AATGAGTTTT TAAAACTTAA TTGCTACA 1558

(2) INFORMATION FOR SEQ ID NO: 868:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 481 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868: 

Met Arg Ile Leu Gln Arg Ala Leu Thr Phe Glu Asp Val Leu Met Val

1 5 10 15

Pro Arg Lys Ser Ser Val Leu Pro Lys Asp Val Ser Leu Lys Ser Arg

20 25 30

Leu Thr Lys Asn Ile Arg Leu Asn Ile Pro Phe Ile Ser Ala Ala Met

35 40 45

Asp Thr Val Thr Glu His Lys Thr Ala Ile Ala Met Ala Arg Leu Gly

50 55 60

Gly Ile Gly Ile Val His Lys Asn Met Asp Ile Gln Thr Gln Val Lys
65 70 75 80

Glu Ile Thr Lys Val Lys Lys Ser Glu Ser Gly Val Ile Asn Asp Pro

85 90 95

Ile Phe Ile His Ala His Arg Thr Leu Ala Asp Ala Lys Val Ile Thr

100 105 110

Asp Asn Tyr Lys Ile Ser Gly Val Pro Val Val Asp Asp Lys Gly Leu

115 120 125

Leu Ile Gly Ile Leu Thr Asn Arg Asp Val Arg Phe Glu Thr Asp Leu

130 135 140

Ser Lys Lys Val Gly Asp Val Met Thr Lys Met Pro Leu Val Thr Ala
145 150 155 160

His Val Gly Ile Ser Leu Asp Glu Ala Ser Asp Leu Met His Lys His
165 170 175 
185 190 195 200

AAA CGC ATT GAA TAC CCT GAG GCC AAT AAA GAT GAT TTT GGG AGG TTG 678

Lys Arg Ile Glu Tyr Pro Glu Ala Asn Lys Asp Asp Phe Gly Arg Leu
205 210 215

AGA GTG GGG GCG GCT ATT GGA GTG GGG CAG TTG GAT AGG GCT GAG ATG 726

Arg Val Gly Ala Ala Ile Gly Val Gly Gln Leu Asp Arg Ala Glu Met
220 225 230

TTA GTT AAA GCG GGG GTG GAT GCA CTG GTG CTA GAG AGC GGA CAT GGG 774

Leu Val Lys Ala Gly Val Asp Ala Leu Val Leu Asp Ser Ala His Gly
235 240 245

CAT TCA GCC AAT ATC TTA CAC ACT TTA GAA GAG ATT AAA AAA AGC TTG 822

His Ser Ala Asn Ile Leu His Thr Leu Glu Glu Ile Lys Lys Ser Leu
250 255 260

GTA GTG GAT GTG ATT GTG GGG AAT GTG GTT ACT AAA GAA GCC ACA AGC 870

Val Val Asp Val Ile Val Gly Asn Val Val Thr Lys Glu Ala Thr Ser
265 270 275 280 

GAT TTG ATT AGC GCG GGA GCA GAC GCT ATT AAA GTG GGT ATT GGG CCA 918 
Asp Leu Ile Ser Ala Gly Ala Asp Ala Ile Lys Val Gly Ile Gly Pro
285 290 295

GGA AGC ATT TGC ACC ACT AGG ATT GTG GCT GGG GTG GGA ATG CCC CAA 966

Gly Ser Ile Cys Thr Thr Arg Ile Val Ala Gly Val Gly Met Pro Gln
300 305 310

GTG AGC GCG ATT GAT AAT TGC GTA GAA GTG GCG TCT AAA TTT GAT ATT 1014
Val Ser Ala Ile Asp Asn Cys Val Glu Val Ala Ser Lys Phe Asp Ile
315 320 325

CCT GTG ATT GCA GAT GGA GGG ATC CGC TAT TCA GGC GAT GTG GCT AAG 1062
Pro Val Ile Ala Asp Gly Gly Ile Arg Tyr Ser Gly Asp Val Ala Lys
330 335 340

GCT TTG GCT TTG GGG GCA TCA AGC GTG ATG ATA GGC TCT TTA CTC GCT 1110
Ala Leu Ala Leu Gly Ala Ser Ser Val Met Ile Gly Ser Leu Leu Ala
345 350 355 360 

GGC ACA GAA GAA TCT CCT GGG GAT TTT ATG ATC TAT CAA GGG AGG CAA 1158

Gly Thr Glu Glu Ser Pro Gly Asp Phe Met Ile Tyr Gln Gly Arg Gln
365 370 375

TAT AAA AGC TAT AGG GGC ATG GGC AGC ATT GGG GCT ATG ACT AAA GGG 1206

Tyr Lys Ser Tyr Arg Gly Met Gly Ser Ile Gly Ala Met Thr Lys Gly
380 385 390

AGC TCT GAT AGG TAT TTT CAA GAG GGC GTA GCG AGT GAA AAG TTA GTC 1254

Ser Ser Asp Arg Tyr Phe Gln Glu Gly Val Ala Ser Glu Lys Leu Val
395 400 405

CCA GAA GGC ATT GAA GGG CGT GTG CCT TAT CGT GGT AAG GTT TCG GAT 1302
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867:

TTAGATTTAA AATTAGATTA AGGATAGAAA ATG AGA ATT TTA CAA AGG GCT TTG 54

Met Arg Ile Leu Gln Arg Ala Leu
1 5

ACT TTT GAA GAT GTG TTG ATG GTG CCT AGA AAG TCT AGC GTT TTA CCT 102
Thr Phe Glu Asp Val Leu Met Val Pro Arg Lys Ser Ser Val Leu Pro
10 15 20

AAA GAT GTG AGC TTA AAG TCT CGC TTA ACT AAA AAC ATT CGT TTG AAT 150

Lys Asp Val Ser Leu Lys Ser Arg Leu Thr Lys Asn Ile Arg Leu Asn
25 30 35 40

ATC CCC TTT ATC AGT GCG GCT ATG GAT ACG GTT ACA GAG CAT AAA ACC 198

Ile Pro Phe Ile Ser Ala Ala Met Asp Thr Val Thr Glu His Lys Thr
45 50 55

GCT ATC GCT ATG GCG CGC CTT GGG GGT ATT GGC ATC GTG CAT AAA AAC 246

Ala Ile Ala Met Ala Arg Leu Gly Gly Ile Gly Ile Val His Lys Asn
60 65 70 

ATG GAT ATT CAA ACG CAA GTT AAA GAA ATC ACT AAG GTT AAA AAA AGC 294

Met Asp Ile Gln Thr Gln Val Lys Glu Ile Thr Lys Val Lys Lys Ser
75 80 85

GAG AGC GGG GTG ATT AAT GAT CCT ATT TTT ATC CAT GCG CAC AGG ACG 342

Glu Ser Gly Val Ile Asn Asp Pro Ile Phe Ile His Ala His Arg Thr
90 95 100

CTA GCG GAC GCT AAA GTC ATA ACG GAT AAT TAC AAG ATT TCA GGC GTG 390

Leu Ala Asp Ala Lys Val Ile Thr Asp Asn Tyr Lys Ile Ser Gly Val
105 110 115 120

CCT GTG GTA GAT GAT AAG GGG TTG TTG ATT GGG ATT TTA ACC AAC AGA 438

Pro Val Val Asp Asp Lys Gly Leu Leu Ile Gly Ile Leu Thr Asn Arg
125 130 135 

GAT GTG CGC TTT GAA ACC GAT TTG AGT AAA AAA GTG GGC GAT GTG ATG 486

Asp Val Arg Phe Glu Thr Asp Leu Ser Lys Lys Val Gly Asp Val Met
140 145 150

ACT AAA ATG CCT TTA GTT ACC GCT CAT GTG GGT ATC AGT TTG GAT GAA 534

Thr Lys Met Pro Leu Val Thr Ala His Val Gly Ile Ser Leu Asp Glu
155 160 165

GCG AGC GAT TTG ATG CAC AAG CAT AAG ATT GAA AAA TTG CCC ATT GTG 582
Ala Ser Asp Leu Met His Lys His Lys Ile Glu Lys Leu Pro Ile Val
170 175 180

GAT AAA GAT AAT GTC TTA AAA GGC TTG ATC ACG ATC AAA GAT ATT CAA 630

Asp Lys Asp Asn Val Leu Lys Gly Leu Ile Thr Ile Lys Asp Ile Gln
AAA CCT AAA GGT TTT GGC TTT GTA GAA ATG CAA GAA GAG AGC GTT AGT 240

Lys Pro Lys Gly Phe Gly Phe Val Glu Met Gln Glu Glu Ser Val Ser
65 70 75 80

GAA GCG ATC GCT AAA TTA GAC AAT ACG GAT TTT ATG GGC AGA ACG ATT 288

Glu Ala Ile Ala Lys Leu Asp Asn Thr Asp Phe Met Gly Arg Thr Ile
85 90 95

AGG GTA ACC GAA GCT AAT CCT AAA AAG TCT TAGTAACATT AGAAAATAAT TTT 341
Arg Val Thr Glu Ala Asn Pro Lys Lys Ser
100 105

CTAATGCGCT T 352

(2) INFORMATION FOR SEQ ID NO: 866: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866: 

Lys Cys Tyr Phe Phe Ile Thr Phe Ser Tyr Ser Tyr Gly Tyr Val Val

1 5 10 15

Ile Phe Leu Pro Glu Asn Phe Ile Leu Arg Asn Ile Tyr Val Gly Asn

20 25 30

Leu Val Tyr Ser Ala Thr Ser Glu Gln Val Lys Glu Leu Phe Ser Gln

35 40 45

Phe Gly Lys Val Phe Asn Val Lys Leu Ile Tyr Asp Arg Glu Thr Lys

50 55 60

Lys Pro Lys Gly Phe Gly Phe Val Glu Met Gln Glu Glu Ser Val Ser
65 70 75 80

Glu Ala Ile Ala Lys Leu Asp Asn Thr Asp Phe Met Gly Arg Thr Ile

85 90 95 

Arg Val Thr Glu Ala Asn Pro Lys Lys Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 867: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31... 1473 
(D) OTHER INFORMATION: 
1 5 10 15

Lys Leu Glu Thr Leu Leu Lys Leu Lys Lys Leu Lys Val Phe Met Pro 

20 25 30 

Leu Ser Leu Gln Glu Asn Leu Leu Phe Ile Phe Ile Lys Asp Ser Lys

35 40 45

Leu Leu Phe Ala Phe Lys Asp Ile Trp Ala Ser Lys Glu Phe Asn Gln

50 55 60

Arg Phe Ala Lys Glu Ile Ser His Phe Leu Asn Thr Gln Gly His Ala
65 70 75 80

Tyr Gly Phe Asp Gly Leu Asn Gly Leu Glu Ile Leu Gly Tyr Val Pro

85 90 95

Lys Asp Ala Leu Lys Lys Ser Asn Phe Tyr Ala Pro Ile Lys Lys Gln

100 105 110

Ala Arg Phe Phe Arg Pro Ser Ala Leu Gly Leu Phe His Asn Pro Ile

115 120 125

Lys Asp Ala Arg Leu His Glu Cys Phe Glu Lys Ala Arg Ala Leu Ile

130 135 140

His Tyr Gln Arg Ser Phe Phe Glu Glu
145 150 

(2) INFORMATION FOR SEQ ID NO: 865: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...318
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865: 

AAG TGT TAC TTT TTT ATA ACT TTT TCA TAT TCT TAT GGG TAT GTT GTC 48

Lys Cys Tyr Phe Phe Ile Thr Phe Ser Tyr Ser Tyr Gly Tyr Val Val

1 5 10 15

ATT TTT TTA CCG GAG AAT TTT ATC TTG AGA AAC ATT TAT GTA GGG AAT 96

Ile Phe Leu Pro Glu Asn Phe Ile Leu Arg Asn Ile Tyr Val Gly Asn
20 25 30

TTG GTT TAT AGC GCT ACC AGT GAG CAA GTC AAG GAG CTT TTC AGT CAA 144

Leu Val Tyr Ser Ala Thr Ser Glu Gln Val Lys Glu Leu Phe Ser Gln
35 40 45

TTT GGC AAA GTT TTT AAT GTC AAG CTG ATT TAT GAC AGA GAA ACG AAG 192
Phe Gly Lys Val Phe Asn Val Lys Leu Ile Tyr Asp Arg Glu Thr Lys
50 55 60 
Met Glu Gln Asn Ile Phe Ser Leu Leu Ile Gln Lys
1 5 10

AAG TCT TAT AAA AAG CTT GAA ACC CTT TTG AAA CTC AAA AAG CTT AAG 99

Lys Ser Tyr Lys Lys Leu Glu Thr Leu Leu Lys Leu Lys Lys Leu Lys
15 20 25

GTT TTT ATG CCT TTA AGT TTA CAA GAA AAT TTG CTT TTT ATC TTC ATA 147

Val Phe Met Pro Leu Ser Leu Gln Glu Asn Leu Leu Phe Ile Phe Ile
30 35 40

AAA GAC TCT AAA TTG CTT TTT GCG TTT AAA GAC ATT TGG GCT TCT AAA 195
Lys Asp Ser Lys Leu Leu Phe Ala Phe Lys Asp Ile Trp Ala Ser Lys
45 50 55 60

GAA TTT AAC CAA CGA TTC GCT AAA GAA ATC AGC CAT TTT TTA AAC ACG 243

Glu Phe Asn Gln Arg Phe Ala Lys Glu Ile Ser His Phe Leu Asn Thr
65 70 75

CAA GGG CAT GCT TAT GGG TTT GAC GGG TTG AAT GGG TTA GAA ATT TTA 291

Gln Gly His Ala Tyr Gly Phe Asp Gly Leu Asn Gly Leu Glu Ile Leu
80 85 90 

GGT TAT GTG CCT AAA GAC GCG CTA AAA AAA TCC AAT TTT TAT GCC CCC 339

Gly Tyr Val Pro Lys Asp Ala Leu Lys Lys Ser Asn Phe Tyr Ala Pro
95 100 105

ATT AAA AAA CAA GCC CGT TTT TTT CGC CCT AGT GCT TTA GGG TTG TTC 387

Ile Lys Lys Gln Ala Arg Phe Phe Arg Pro Ser Ala Leu Gly Leu Phe
110 115 120

CAT AAC CCC ATT AAA GAC GCT CGT TTG CAT GAA TGT TTT GAA AAA GCG 435
His Asn Pro Ile Lys Asp Ala Arg Leu His Glu Cys Phe Glu Lys Ala
125 130 135 140

CGC GCT TTG ATC CAC TAC CAA CGA AGT TTT TTT GAG GAA TGAATGGCTG AT 486
Arg Ala Leu Ile His Tyr Gln Arg Ser Phe Phe Glu Glu
145 150

TTATTGTCCA GTTTAAAAAA CCTTCCTAAC AGCAGTGGCG TGTATCAATA TTTTGATAAA 546

AAC 549

(2) INFORMATION FOR SEQ ID NO: 864: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864: 
Met Glu Gln Asn Ile Phe Ser Leu Leu Ile Gln Lys Lys Ser Tyr Lys
Met Ala Lys Lys Asn Xaa Pro Thr Glu Cys Pro Ala Gly Glu Lys Trp

1 5 10 15

Ala Val Pro Tyr Ala Asp Phe Leu Ser Leu Leu Leu Ala Leu Phe Ile

20 25 30

Ala Leu Tyr Ala Ile Ser Ala Val Asn Lys Ser Lys Val Glu Ala Leu

35 40 45

Lys Thr Glu Phe Ile Lys Ile Phe Asn Tyr Ala Pro Lys Pro Glu Ala

50 55 60

Met Gln Pro Val Val Val Ile Pro Pro Asp Ser Gly Lys Glu Glu Glu
65 70 75 80

Gln Met Ala Ser Glu Ser Ser Lys Pro Ala Ser Gln Asn Thr Glu Thr

85 90 95 

Lys Ala Thr Ile Ala Arg Lys Gly Glu Gly Ser Val Leu Glu Gln Ile

100 105 110

Asp Gln Gly Ser Ile Leu Lys Leu Pro Ser Asn Leu Leu Phe Glu Asn

115 120 125

Ala Thr Ser Asp Ala Ile Asn Gln Asp Met Met Leu Tyr Ile Glu Arg

130 135 140

Ile Ala Lys Ile Ile Gln Lys Leu Pro Lys Arg Val His Ile Asn Val
145 150 155 160 

Arg Gly Phe Thr Asp Asp Thr Pro Leu Val Lys Thr Arg Phe Lys Ser 

165 170 175 

His Tyr Glu Leu Ala Ala Asn Arg Ala Tyr Arg Val Met Lys Val Leu 

180 185 190 

Ile Gln Tyr Gly Val Asn Pro Asn Gln Leu Ser Phe Ser Ser Tyr Gly

195 200 205

Ser Thr Asn Pro Ile Ala Pro Asn Asp Ser Leu Glu Asn Arg Met Lys

210 215 220

Asn Asn Arg Val Glu Ile Phe Phe Ser Thr Asp Ala Asn Asp Leu Ser
225 230 235 240

Lys Ile His Ser Ile Leu Asp Asn Glu Phe Asn Pro His Lys Gln Gln
245 250 255 

Glu 



(2) INFORMATION FOR SEQ ID NO:863: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 16... 474 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863: 
TGTTAAGATC AGTTT ATG GAA CAA AAT ATT TTC TCC TTA CTC ATT CAA AAA 51 
95 100 105

GAG CAA ATT GAT CAA GGC TCT ATC TTA AAG CTC CCC TCT AAT TTG CTG 387

Glu Gln Ile Asp Gln Gly Ser Ile Leu Lys Leu Pro Ser Asn Leu Leu
110 115 120 125

TTT GAA AAC GCT ACT TCA GAG GCT ATC AAT CAA GAC ATG ATG CTT TAT 435
Phe Glu Asn Ala Thr Ser Asp Ala Ile Asn Gln Asp Met Met Leu Tyr
130 135 140

ATT GAA CGG ATC GCT AAA ATC ATT CAA AAA CTC CCT AAA AGG GTG CAT 483

Ile Glu Arg Ile Ala Lys Ile Ile Gln Lys Leu Pro Lys Arg Val His
145 150 155

ATT AAT GTG AGA GGC TTT ACG GAT GAT ACG CCT TTA GTT AAA ACC CGT 531
Ile Asn Val Arg Gly Phe Thr Asp Asp Thr Pro Leu Val Lys Thr Arg
160 165 170

TTT AAA AGC CAT TAT GAA TTA GCC GCC AAT CGC GCT TAT AGG GTG ATG 579

Phe Lys Ser His Tyr Glu Leu Ala Ala Asn Arg Ala Tyr Arg Val Met 
175 180 185 

AAA GTC CTT ATA CAA TAC GGC GTA AAT CCT AAC CAA TTG TCT TTT TCT 627

Lys Val Leu Ile Gln Tyr Gly Val Asn Pro Asn Gln Leu Ser Phe Ser
190 195 200 205

TCT TAC GGC TCT ACC AAC CCT ATC GCG CCT AAC GAC TCC CTA GAG AAC 675

Ser Tyr Gly Ser Thr Asn Pro Ile Ala Pro Asn Asp Ser Leu Glu Asn
210 215 220

AGA ATG AAA AAC AAT CGT GTG GAA ATC TTT TTT TCA ACC GAT GCG AAC 723

Arg Met Lys Asn Asn Arg Val Glu Ile Phe Phe Ser Thr Asp Ala Asn
225 230 235

GAT TTG AGT AAA ATT CAT TCT ATT TTA GAT AAT GAG TTC AAT CCC CAC 771
Asp Leu Ser Lys Ile His Ser Ile Leu Asp Asn Glu Phe Asn Pro His
240 245 250

AAA CAG CAA GAA TGAATCGCAT GAATAAAAAT TATCTTT 810
Lys Gln Gln Glu
255 



(2) INFORMATION FOR SEQ ID NO: 862: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862:
145 150 155 160

Lys Ala Lys Ser Lys Asp Ile Ile Lys Glu Lys Thr Val Leu Leu Glu

165 170 175

Gly Ile Leu Gly Ile Ala Asn Gly Glu Asn Pro Arg Asp Leu Glu Asn

180 185 190

Lys Leu Leu Asn Tyr Ile Ala Pro Gly Glu Pro Lys Lys Ser Gln Phe
195 200 205 

Glu Gly 
210 

(2) INFORMATION FOR SEQ ID NO: 861:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 13... 783 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861: 

TGAGGGCTAA AG ATG GCT AAG AAA AAC AMA CCC ACC GAA TGC CCC GCC GGT 51 
Met Ala Lys Lys Asn Xaa Pro Thr Glu Cys Pro Ala Gly 
1 5 10

GAA AAA TGG GCG GTT CCT TAT GCG GAC TTT TTG TCG TTG TTG CTC GCG 99 
Glu Lys Trp Ala Val Pro Tyr Ala Asp Phe Leu Ser Leu Leu Leu Ala 
15 20 25 

CTT TTT ATC GCT CTT TAT GCC ATT TCA GCG GTC AAC AAA TCC AAA GTG 147

Leu Phe Ile Ala Leu Tyr Ala Ile Ser Ala Val Asn Lys Ser Lys Val
30 35 40 45

GAA GCC TTA AAA ACC GAA TTT ATT AAG ATT TTT AAT TAC GCT CCC AAG 195
Glu Ala Leu Lys Thr Glu Phe Ile Lys Ile Phe Asn Tyr Ala Pro Lys
50 55 60

CCA GAG GCG ATG CAG CCG GTT GTA GTG ATC CCG CCT GAT TCA GGG AAA 243

Pro Glu Ala Met Gln Pro Val Val Val Ile Pro Pro Asp Ser Gly Lys
65 70 75

GAA GAA GAA CAA ATG GCG AGC GAA AGC TCC AAA CCG GCT TCG CAA AAT 291

Glu Glu Glu Gln Met Ala Ser Glu Ser Ser Lys Pro Ala Ser Gln Asn
80 85 90

ACC GAA ACA AAA GCC ACT ATC GCT CGC AAA GGC GAA GGC AGT GTT TTA 339

Thr Glu Thr Lys Ala Thr Ile Ala Arg Lys Gly Glu Gly Ser Val Leu
125 130 135

GCT TTT ACG GCT ACT GTT ACA GGG ATT ATG TGT TCT TAT GCG ATT TTT 484

Ala Phe Thr Ala Thr Val Thr Gly Ile Met Cys Ser Tyr Ala Ile Phe
140 145 150

GGC CCT TTT GGG CAT AAG CTC AAA GCT AAG TCT AAA GAC ATT ATC AAA 532

Gly Pro Phe Gly His Lys Leu Lys Ala Lys Ser Lys Asp Ile Ile Lys
155 160 165

GAA AAA ACC GTT CTT TTA GAG GGG ATT TTA GGC ATC GCT AAT GGG GAA 580
Glu Lys Thr Val Leu Leu Glu Gly Ile Leu Gly Ile Ala Asn Gly Glu
170 175 180 185

AAC CCA AGG GAT TTA GAA AAC AAA CTC TTA AAC TAC ATC GCT CCC GGT 628

Asn Pro Arg Asp Leu Glu Asn Lys Leu Leu Asn Tyr Ile Ala Pro Gly
190 195 200

GAA CCT AAA AAA TCT CAA TTT GAG GGC TAAAGATGGC TAAGAAAAAC ANACCCA 682
Glu Pro Lys Lys Ser Gln Phe Glu Gly
205 210 

CCGAATGCCC CGC 695 

(2) INFORMATION FOR SEQ ID NO: 860:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860: 

Met Thr Gly Thr His Ala Arg Tyr Val Lys Ala Ala Tyr Lys Glu Ile

1 5 10 15

Lys Ile Val Phe Leu Asn Pro Lys Ile Asn Leu Asn Glu Thr Ile Lys

20 25 30 

Asn Leu Val Glu Leu Ala Thr Leu Ala Arg Lys Asp Gly Val Leu Ser 

35 40 45 

Leu Glu Gly Arg Val Ala Gln Ile Glu Asp Asp Phe Thr Arg Asn Gly

50 55 60

Leu Ser Met Ile Ile Asp Gly Lys Asp Leu Lys Ser Val Lys Glu Ser
65 70 75 80

Leu Glu Ile Ser Ile Glu Glu Met Glu Glu Tyr Tyr His Gly Ala Ala

85 90 95 

His Tyr Trp Glu Thr Ala Gly Glu Thr Ala Pro Thr Met Gly Leu Val 

100 105 110 

Gly Ala Val Met Gly Leu Met Leu Ala Leu Gln Lys Leu Asp Asn Pro

115 120 125

Ala Glu Met Ala Ala Gly Ile Ala Gly Ala Phe Thr Ala Thr Val Thr

130 135 140

Gly Ile Met Cys Ser Tyr Ala Ile Phe Gly Pro Phe Gly His Lys Leu

195 200 

(2) INFORMATION FOR SEQ ID NO: 859:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 26... 655 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859:

CGTGCCTACT TCGCTTTTTG CCGCC ATG ACA GGC ACA CAT GCG CGT TAC GTG 52

Met Thr Gly Thr His Ala Arg Tyr Val

1 5

AAA GCC GCT TAC AAA GAG ATA AAA ATT GTT TTT TTA AAC CCT AAA ATC 100

Lys Ala Ala Tyr Lys Glu Ile Lys Ile Val Phe Leu Asn Pro Lys Ile
10 15 20 25

AAT TTA AAC GAA ACC ATC AAA AAT TTA GTG GAA TTA GCC ACT CTG GCT 148

Asn Leu Asn Glu Thr Ile Lys Asn Leu Val Glu Leu Ala Thr Leu Ala
30 35 40

AGA AAA GAT GGG GTG TTG AGT TTA GAG GGG CGA GTG GCG CAA ATT GAA 196
Arg Lys Asp Gly Val Leu Ser Leu Glu Gly Arg Val Ala Gln Ile Glu
45 50 55

GAC GAT TTC ACC CGT AAT GGC TTG TCT ATG ATC ATA GAT GGC AAG GAT 244

Asp Asp Phe Thr Arg Asn Gly Leu Ser Met Ile Ile Asp Gly Lys Asp
60 65 70 

TTA AAA TCC GTT AAG GAA AGC TTA GAA ATC AGC ATT GAA GAA ATG GAA 292

Leu Lys Ser Val Lys Glu Ser Leu Glu Ile Ser Ile Glu Glu Met Glu
75 80 85

GAG TAT TAC CAC GGC GCC GCT CAT TAT TGG GAG ACG GCC GGT GAG ACC 340

Glu Tyr Tyr His Gly Ala Ala His Tyr Trp Glu Thr Ala Gly Glu Thr
90 95 100 105

GCT CCT ACT ATG GGG TTA GTG GGG GCG GTT ATG GGG CTT ATG TTA GCC 388

Ala Pro Thr Met Gly Leu Val Gly Ala Val Met Gly Leu Met Leu Ala
110 115 120

TTG CAA AAA CTA GAC AAC CCG GCT GAA ATG GCA GCA GGG ATC GCT GGG 436
Leu Gln Lys Leu Asp Asn Pro Ala Glu Met Ala Ala Gly Ile Ala Gly
GAA ACA AGT GGG TTT TTA GGG ATT TAT GAA AAG TGT TTT CAA GCT TTA 486

Glu Thr Ser Gly Phe Leu Gly Ile Tyr Glu Lys Cys Phe Gln Ala Leu
140 145 150

GAA AGG TTA TTG AAA CGC TTT AAT CCA AAA AAT CTT TTA GTG GTT TTT 534

Glu Arg Leu Leu Lys Arg Phe Asn Pro Lys Asn Leu Leu Val Val Phe
155 160 165

GAG CAT GAA AGC ATG CAT GAA ATG CCT AAA AGT CTT GTA ACT TTA GCT 582

Glu His Glu Ser Met His Glu Met Pro Lys Ser Leu Val Thr Leu Ala
170 175 180

ATA ATC AAA CAA AAA AAA TTT GGA AAA ACC ACT TTA ACT TAT TTT CAA T 631
Ile Ile Lys Gln Lys Lys Phe Gly Lys Thr Thr Leu Thr Tyr Phe Gln
185 190 195 200

AGGAATAGGC ATGGCAGAAG AACAAGAA 659

(2) INFORMATION FOR SEQ ID NO: 858:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 200 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858: 

Met Pro Asn His Gln Pro Val Lys Lys Phe Lys Ile Ile Gly Gly Ala

1 5 10 15

Cys Lys Gly Leu Gly Leu Asn Leu Pro Asn Ile Ser Ser Thr Arg Pro

20 25 30

Thr Lys Ala Ile Val Arg Glu Ser Phe Phe Asn Thr Leu Gln Ala Glu

35 40 45

Ile Asn Gly Ala His Phe Ile Glu Val Phe Ser Gly Ser Ala Ser Met

50 55 60 

Gly Leu Glu Ala Leu Ser Arg Gly Ala Lys Ser Ala Val Phe Phe Glu 
65 70 75 80 

Gln Asn Lys Ser Ala Tyr Lys Thr Leu Leu Glu Asn Ile Ser Leu Phe

85 90 95

Lys Asn Arg Leu Lys Lys Glu Met Glu Ile Gln Thr Phe Leu Asp Asp

100 105 110

Ala Phe Lys Leu Leu Pro Thr Leu Cys Leu Lys Asn Gly Val Leu Asn

115 120 125

Ile Ile Tyr Leu Asp Pro Pro Phe Glu Thr Ser Gly Phe Leu Gly Ile

130 135 140

Tyr Glu Lys Cys Phe Gln Ala Leu Glu Arg Leu Leu Lys Arg Phe Asn
145 150 155 160 

Pro Lys Asn Leu Leu Val Val Phe Glu His Glu Ser Met His Glu Met 

165 170 175 

Pro Lys Ser Leu Val Thr Leu Ala Ile Ile Lys Gln Lys Lys Phe Gly

180 185 190

Lys Thr Thr Leu Thr Tyr Phe Gln
(2) INFORMATION FOR SEQ ID NO: 857:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31... 630 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857: 

GCAATTACTA GAAGAATACA CCCACAAATT ATG CCA AAT CAT CAG CCA GTA AAA 54 

Met Pro Asn His Gln Pro Val Lys
1 5

AAA TTT AAG ATT ATT GGG GGG GCT TGT AAG GGA TTA GGC TTG AAT TTG 102
Lys Phe Lys Ile Ile Gly Gly Ala Cys Lys Gly Leu Gly Leu Asn Leu
10 15 20

CCT AAC ATT TCT AGC ACG CGC CCC ACC AAA GCG ATC GTA AGA GAG TCG 150

Pro Asn Ile Ser Ser Thr Arg Pro Thr Lys Ala Ile Val Arg Glu Ser
25 30 35 40

TTT TTT AAC ACC TTG CAA GCA GAA ATT AAT GGA GCG CAT TTT ATA GAA 198

Phe Phe Asn Thr Leu Gln Ala Glu Ile Asn Gly Ala His Phe Ile Glu
45 50 55

GTG TTT TCA GGC AGC GCT TCT ATG GGT TTA GAG GCT TTG AGT AGG GGG 246

Val Phe Ser Gly Ser Ala Ser Met Gly Leu Glu Ala Leu Ser Arg Gly 
60 65 70 

GCT AAA AGT GCG GTG TTT TTT GAA CAA AAC AAA AGC GCT TAT AAG ACG 294

Ala Lys Ser Ala Val Phe Phe Glu Gln Asn Lys Ser Ala Tyr Lys Thr
75 80 85

CTT TTA GAA AAT ATT TCC CTT TTT AAA AAC CGC TTG AAA AAA GAA ATG 342

Leu Leu Glu Asn Ile Ser Leu Phe Lys Asn Arg Leu Lys Lys Glu Met
90 95 100

GAA ATT CAA ACC TTT TTA GAT GAC GCT TTC AAG CTT TTG CCC ACG CTG 390

Glu Ile Gln Thr Phe Leu Asp Asp Ala Phe Lys Leu Leu Pro Thr Leu
105 110 115 120

TGT TTA AAA AAT GGC GTT TTG AAT ATT ATT TAT TTG GAT CCT CCT TTT 438

Cys Leu Lys Asn Gly Val Leu Asn Ile Ile Tyr Leu Asp Pro Pro Phe
125 130 135 
110 115 120

GGT ATT AGG AAT AAG AGT TTG ATT ATT AAT CTC CCT GGT AAG CCA AAA 435

Gly Ile Arg Asn Lys Ser Leu Ile Ile Asn Leu Pro Gly Lys Pro Lys
125 130 135

AGT ATT AGA GAA TGC TTA GAG GCG GTT TTT CCA GCG ATT CCT TAT TGC 483
Ser Ile Arg Glu Cys Leu Glu Ala Val Phe Pro Ala Ile Pro Tyr Cys
140 145 150

GTG GAT TTG ATT TTA GGG AAT TAT ATG CAA GTG AAT GAA AAA AAC ATT 531
Val Asp Leu Ile Leu Gly Asn Tyr Met Gln Val Asn Glu Lys Asn Ile
155 160 165

CAA GCG TTT CGC CCC AAA CAA TAGGATAAAA AATGCCGCTC ACTCATTTGA ATGA 586
Gln Ala Phe Arg Pro Lys Gln
170 175 

AGAAAATCAG CCTAAAATG 605 
(2) INFORMATION FOR SEQ ID NO: 856:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 176 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856: 

Met Gln Thr Ile His Ile Gly Val Leu Ser Ala Ser Asp Arg Ala Ser

1 5 10 15

Lys Gly Ile Tyr Glu Asp Leu Ser Gly Lys Ala Ile Gln Glu Val Leu

20 25 30

Ser Glu Tyr Leu Leu Asn Pro Leu Glu Phe Tyr Tyr Glu Ile Val Ala

35 40 45

Asp Glu Arg Asp Leu Ile Glu Lys Ser Leu Ile Lys Met Cys Asp Glu

50 55 60

Tyr Gln Cys Asp Leu Val Val Thr Thr Gly Gly Thr Gly Pro Ala Leu
65 70 75 80

Arg Asp Ile Thr Pro Glu Ala Thr Glu Lys Val Cys Gln Lys Met Leu

85 90 95 

Pro Gly Phe Gly Glu Leu Met Arg Met Thr Ser Leu Lys Tyr Val Pro 

100 105 110 

Thr Ala Ile Leu Ser Arg Gln Ser Ala Gly Ile Arg Asn Lys Ser Leu

115 120 125

Ile Ile Asn Leu Pro Gly Lys Pro Lys Ser Ile Arg Glu Cys Leu Glu

130 135 140

Ala Val Phe Pro Ala Ile Pro Tyr Cys Val Asp Leu Ile Leu Gly Asn
145 150 155 160

Tyr Met Gln Val Asn Glu Lys Asn Ile Gln Ala Phe Arg Pro Lys Gln
165 170 175 
115 120 125

Val Lys Ala Ile Asp Lys Ser Met Thr Ile Ser Gly Val Met Leu Glu

130 135 140 

His Lys Ser Gly Gly Lys Ser Gly Asp Tyr Asn Ala Lys Lys 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 855: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 25...552
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855:

CTTTTAGCTT AAAAAGGAGT TCAA ATG CAA ACG ATT CAT ATA GGC GTT TTG 51 

Met Gln Thr Ile His Ile Gly Val Leu
1 5

AGC GCG AGC GAT AGA GCG TCA AAA GGG ATT TAT GAA GAT TTA AGC GGT 99

Ser Ala Ser Asp Arg Ala Ser Lys Gly Ile Tyr Glu Asp Leu Ser Gly
10 15 20 25

AAG GCG ATA CAA GAA GTG TTG AGC GAA TAC TTG CTC AAT CCT TTA GAA 147

Lys Ala Ile Gln Glu Val Leu Ser Glu Tyr Leu Leu Asn Pro Leu Glu
30 35 40

TTT TAT TAC GAA ATT GTC GCT GAT GAA AGG GAT TTA ATT GAA AAA TCA 195
Phe Tyr Tyr Glu Ile Val Ala Asp Glu Arg Asp Leu Ile Glu Lys Ser
45 50 55 

CTG ATT AAA ATG TGC GAT GAA TAC CAA TGC GAT CTA GTC GTT ACT ACA 243

Leu Ile Lys Met Cys Asp Glu Tyr Gln Cys Asp Leu Val Val Thr Thr
60 65 70

GGA GGC ACA GGC CCT GCT TTA AGA GAT ATA ACC CCA GAA GCC ACA GAA 291
Gly Gly Thr Gly Pro Ala Leu Arg Asp Ile Thr Pro Glu Ala Thr Glu
75 80 85

AAA GTG TGC CAA AAA ATG CTT CCT GGT TTT GGA GAG CTT ATG CGA ATG 339

Lys Val Cys Gln Lys Met Leu Pro Gly Phe Gly Glu Leu Met Arg Met
90 95 100 105

ACT AGT TTA AAA TAT GTG CCT ACA GCG ATC CTG TCG CGC CAG AGC GCT 387

Thr Ser Leu Lys Tyr Val Pro Thr Ala Ile Leu Ser Arg Gln Ser Ala




WO 98/43478



PCT/US98/06371 



60 65 70 

TGC CAT CCA ATC ATG CTC AAT GGG GTG GAT ATT GAT ATT TTA GAA GAA 291
Cys His Pro Ile Met Leu Asn Gly Val Asp Ile Asp Ile Leu Glu Glu
75 80 85 90

AAA GAG ACT TGT AGT TTT AAA CTC TAT GCG AGA GTC AAA ACT CAA GCT 339
Lys Glu Thr Cys Ser Phe Lys Leu Tyr Ala Arg Val Lys Thr Gln Ala
95 100 105

AAA ACG GGC GTA GAA ATG GAA GCG CTA ATG AGT GTG AGC ATA GGG CTT 387
Lys Thr Gly Val Glu Met Glu Ala Leu Met Ser Val Ser Ile Gly Leu
110 115 120

TTA ACC ATT TAT GAC ATG GTG AAA GCC ATT GAC AAG AGC ATG ACA ATT 435
Leu Thr Ile Tyr Asp Met Val Lys Ala Ile Asp Lys Ser Met Thr Ile
125 130 135

AGC GGT GTG ATG TTG GAG CAT AAA AGT GGA GGC AAA AGT GGG GAT TAT 483
Ser Gly Val Met Leu Glu His Lys Ser Gly Gly Lys Ser Gly Asp Tyr
140 145 150

AAC GCT AAA AAA TAGAAAAAGA CCAATAATCT AAAGATGTTA GGGTAAAATA ACATT 540
Asn Ala Lys Lys
155

TTGACAACAA AAGCGTGTTG GTTG 564
(2) INFORMATION FOR SEQ ID NO: 854:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 158 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854:

Met Pro Leu Thr His Leu Asn Glu Glu Asn Gln Pro Lys Met Val Asp
1 5 10 15

Ile Gly Asp Lys Glu Thr Thr Glu Arg Ile Ala Leu Ala Ser Gly Arg
20 25 30

Ile Ser Met Asn Lys Glu Ala Tyr Asp Ala Ile Ile Asn His Cys Val
35 40 45

Lys Lys Gly Pro Val Leu Gln Thr Ala Ile Ile Ala Gly Ile Met Gly
50 55 60

Ala Lys Lys Thr Ser Glu Leu Ile Pro Met Cys His Pro Ile Met Leu
65 70 75 80

Asn Gly Val Asp Ile Asp Ile Leu Glu Glu Lys Glu Thr Cys Ser Phe
85 90 95

Lys Leu Tyr Ala Arg Val Lys Thr Gln Ala Lys Thr Gly Val Glu Met
100 105 110

Glu Ala Leu Met Ser Val Ser Ile Gly Leu Leu Thr Ile Tyr Asp Met
35 40 45

Ala Ile Gln Val Gly Leu Pro Leu Arg Met Leu Ile Ile Asn Leu Pro
50 55 60

Gln Glu Asp Gly Val Gln His Lys Glu Asp Cys Leu Glu Ile Ile Asn
65 70 75 80

Pro Lys Phe Ile Glu Thr Gly Gly Ser Met Met Tyr Arg Glu Gly Cys
85 90 95

Leu Ser Val Pro Gly Phe Tyr Glu Glu Val Glu Arg Phe Glu Lys Val
100 105 110

Lys Ile Glu Tyr Gln Asn Arg Phe Ala Glu Val Lys Val Leu Glu Ala
115 120 125

Ser Glu Leu Leu Ala Val Ala Ile Gln His Glu Ile Asp His Leu Asn
130 135 140

Gly Val Leu Phe Val Asp Lys Leu Ser Ile Leu Lys Arg Lys Lys Phe
145 150 155 160

Glu Lys Glu Leu Lys Glu Leu Gln Lys Lys Gln Lys His Glu
165 170

(2) INFORMATION FOR SEQ ID NO: 853:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 564 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 22...495
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853:

CCCCAAACAA TAGGATAAAA A ATG CCG CTC ACT CAT TTG AAT GAA GAA AAT 51
Met Pro Leu Thr His Leu Asn Glu Glu Asn
1 5 10

CAG CCT AAA ATG GTG GAT ATA GGG GAT AAA GAA ACC ACT GAA AGA ATC 99
Gln Pro Lys Met Val Asp Ile Gly Asp Lys Glu Thr Thr Glu Arg Ile
15 20 25

GCT TTA GCA AGC GGT CGT ATC AGC ATG AAT AAA GAG GCT TAT GAC GCT 147
Ala Leu Ala Ser Gly Arg Ile Ser Met Asn Lys Glu Ala Tyr Asp Ala
30 35 40

ATT ATC AAT CAT TGC GTC AAA AAG GGT CCG GTG TTA CAG ACT GCT ATT 195
Ile Ile Asn His Cys Val Lys Lys Gly Pro Val Leu Gln Thr Ala Ile
45 50 55

ATT GCT GGA ATT ATG GGG GCT AAA AAG ACA AGC GAG CTC ATT CCC ATG 243
Ile Ala Gly Ile Met Gly Ala Lys Lys Thr Ser Glu Leu Ile Pro Met
AGT GAG GGG ATA GGG TTA GCC GCT ATT CAA GTG GGT TTG CCT TTA AGA 196
Ser Glu Gly Ile Gly Leu Ala Ala Ile Gln Val Gly Leu Pro Leu Arg
45 50 55

ATG CTC ATC ATC AAC CTC CCG CAA GAA GAC GGC GTG CAA CAC AAA GAA 244
Met Leu Ile Ile Asn Leu Pro Gln Glu Asp Gly Val Gln His Lys Glu
60 65 70

GAC TGC TTG GAA ATC ATT AAC CCT AAG TTT ATA GAA ACT GGG GGA TCA 292
Asp Cys Leu Glu Ile Ile Asn Pro Lys Phe Ile Glu Thr Gly Gly Ser
75 80 85

ATG ATG TAT AGA GAA GGG TGC TTG TCT GTG CCG GGA TTT TAC GAA GAA 340
Met Met Tyr Arg Glu Gly Cys Leu Ser Val Pro Gly Phe Tyr Glu Glu
90 95 100 105

GTG GAG CGT TTT GAA AAG GTT AAG ATA GAG TAT CAA AAC CGC TTC GCT 388
Val Glu Arg Phe Glu Lys Val Lys Ile Glu Tyr Gln Asn Arg Phe Ala
110 115 120

GAA GTG AAA GTT TTA GAA GCG AGC GAG CTT TTA GCG GTA GCC ATT CAG 436
Glu Val Lys Val Leu Glu Ala Ser Glu Leu Leu Ala Val Ala Ile Gln
125 130 135

CAT GAG ATC GAT CAC CTC AAT GGC GTG TTA TTC GTG GAT AAA TTA TCC 484
His Glu Ile Asp His Leu Asn Gly Val Leu Phe Val Asp Lys Leu Ser
140 145 150

ATT TTG AAG CGT AAG AAA TTT GAA AAA GAA CTC AAA GAG CTG CAA AAA 532
Ile Leu Lys Arg Lys Lys Phe Glu Lys Glu Leu Lys Glu Leu Gln Lys
155 160 165

AAA CAA AAA CAC GAG TAACAACCAT GATTAACACG ATGTTTTGCG CGACCATGCA A 588
Lys Gln Lys His Glu
170

AGGGGAGTGG CGGAAAT 605

(2) INFORMATION FOR SEQ ID NO: 852: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852: 

Met Ala Leu Leu Glu Ile Ile His Tyr Pro Ser Lys Ile Leu Arg Thr
1 5 10 15

Ile Ser Lys Glu Val Val Ser Phe Asp Ser Lys Leu His Gln Gln Leu
20 25 30

Asp Asp Met His Glu Thr Met Ile Ala Ser Glu Gly Ile Gly Leu Ala
325 330 335

Ser Arg Val His Ser Lys Ile Glu Tyr Glu Thr Ser Phe Leu Phe Val
340 345 350

Gly Ala Gln Asn Pro Cys Leu Cys Gly Asn Leu Leu Ser Ala Thr Lys
355 360 365

Ala Cys Arg Cys Gln Asp Arg Glu Ile Thr Gln Tyr Lys Asn Arg Leu
370 375 380

Ser Glu Pro Phe Leu Asp Arg Ile Asp Leu Phe Val Gln Met Glu Glu
385 390 395 400

Gly Asn Tyr Lys Asp Thr Pro Ser His Ser Trp Thr Ser Lys Glu Met
405 410 415

His Glu Leu Val Leu Leu Ala Phe Lys Gln Gln Lys Leu Arg Lys Gln
420 425 430

Ser Val Phe Asn Gly Lys Leu Asn Glu Glu Gln Ile Glu Arg Phe Cys
435 440 445

Pro Leu Asn Ala Glu Ala Lys Lys Leu Leu Glu Gln Ala Val Glu Arg
450 455 460

Phe Asn Leu Ser Met Arg Ser Ile Asn Lys Val Lys Lys Val Ala Arg
465 470 475 480

Thr Ile Ala Asp Leu Asn Ala Cys Glu Asp Ile Glu Lys Ser His Met
485 490 495

Leu Lys Ala Leu Ser Phe Arg Lys Ile Ser
500 505



(2) INFORMATION FOR SEQ ID NO: 851:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 26...547
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851:



TTACAGAAAA ACGTGAAGTG ATTGC ATG GCG TTA TTA GAG ATT ATC CAT TAC 52
Met Ala Leu Leu Glu Ile Ile His Tyr
1 5

CCT TCT AAA ATC TTA AGA ACG ATT TCT AAA GAG GTC GTT TCT TTT GAT 100
Pro Ser Lys Ile Leu Arg Thr Ile Ser Lys Glu Val Val Ser Phe Asp
10 15 20 25

TCA AAA CTC CAC CAA CAG CTA GAT GAC ATG CAT GAG ACT ATG ATC GCT 148
Ser Lys Leu His Gln Gln Leu Asp Asp Met His Glu Thr Met Ile Ala
30 35 40
CAAGAATACC ACATTCATAA TTTGGATTGC CC 1631



(2) INFORMATION FOR SEQ ID NO: 850:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 506 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850:



Met Ile Asn Thr Met Phe Cys Ala Thr Met Gln Arg Gly Val Ala Glu
1 5 10 15

Ile Val Ala Val Glu Ala Thr Phe Thr Arg Ala Leu Pro Ala Phe Val
20 25 30

Ile Ser Gly Leu Ala Asn Ser Ser Ile Gln Glu Ala Lys Gln Arg Val
35 40 45

Gln Ser Ala Leu Gln Asn Asn Asp Phe Thr Phe Pro Pro Leu Lys Ile
50 55 60

Thr Ile Asn Leu Ser Pro Ser Asp Leu Pro Lys Ser Gly Ser His Phe
65 70 75 80

Asp Leu Pro Ile Ala Leu Leu Ile Ala Leu Gln Lys Gln Glu Leu Ala
85 90 95

Phe Lys Glu Trp Phe Ala Phe Gly Glu Leu Gly Leu Asp Gly Lys Ile
100 105 110

Lys Pro Asn Pro Asn Ile Phe Pro Met Leu Leu Asp Ile Ala Ile Lys
115 120 125

His Pro His Ala Lys Ile Ile Ala Pro Lys Ala Asn Glu Glu Leu Phe
130 135 140

Ser Leu Ile Pro Asn Leu Gln Cys Phe Phe Val Gly His Phe Lys Glu
145 150 155 160

Ala Leu Glu Ile Leu Gln Asn Pro Glu Thr Lys Ala Asp Thr His Thr
165 170 175

Lys Lys Leu Pro Phe Lys Thr Ile Glu Leu Asn Asp Lys Glu Tyr Tyr
180 185 190

Phe Ser Asp Ala Tyr Ala Leu Asp Phe Lys Glu Val Lys Gly Gln Ala
195 200 205

Val Ala Lys Glu Ala Ala Leu Ile Ala Ser Ala Gly Phe His Asn Leu
210 215 220

Ile Leu Glu Gly Ser Pro Gly Cys Gly Lys Ser Met Ile Ile Asn Arg
225 230 235 240

Met Arg Tyr Ile Leu Pro Pro Leu Ser Leu Asn Glu Ile Leu Glu Ala
245 250 255

Thr Lys Leu Arg Ile Leu Ser Glu Gln Asp Ser Ala Tyr Tyr Pro Leu
260 265 270

Arg Ser Phe Arg Asn Pro His Gln Ser Ala Ser Lys Ser Ser Ile Leu
275 280 285

Gly Ser Ser Ser Leu Arg Glu Pro Lys Pro Gly Glu Ile Ala Leu Ala
290 295 300

His Asn Gly Met Leu Phe Phe Asp Glu Leu Pro His Phe Lys Lys Asp
305 310 315 320

Ile Leu Glu Ala Leu Arg Glu Pro Leu Glu Asn Asn Lys Leu Val Val
AGC TCT CTA AGA GAG CCA AAA CCT GGC GAA ATC GCG CTA GCG CAT AAC 969
Ser Ser Leu Arg Glu Pro Lys Pro Gly Glu Ile Ala Leu Ala His Asn
295 300 305

GGC ATG CTT TTT TTT GAT GAA TTG CCT CAT TTT AAA AAG GAT ATT TTG 1017
Gly Met Leu Phe Phe Asp Glu Leu Pro His Phe Lys Lys Asp Ile Leu
310 315 320

GAA GCT TTA AGA GAG CCT TTA GAA AAC AAT AAA TTG GTG GTT TCA AGA 1065
Glu Ala Leu Arg Glu Pro Leu Glu Asn Asn Lys Leu Val Val Ser Arg
325 330 335

GTG CAT AGC AAA ATT GAA TAC GAA ACC TCT TTT TTA TTT GTA GGG GCT 1113
Val His Ser Lys Ile Glu Tyr Glu Thr Ser Phe Leu Phe Val Gly Ala
340 345 350

CAA AAC CCT TGC TTG TGC GGG AAT TTA CTC AGC GCG ACC AAA GCA TGC 1161
Gln Asn Pro Cys Leu Cys Gly Asn Leu Leu Ser Ala Thr Lys Ala Cys
355 360 365 370

CGT TGC CAA GAC AGA GAA ATC ACG CAG TAT AAA AAC CGC TTG AGC GAG 1209
Arg Cys Gln Asp Arg Glu Ile Thr Gln Tyr Lys Asn Arg Leu Ser Glu
375 380 385

CCT TTT TTG GAT AGG ATT GAT TTG TTT GTG CAA ATG GAA GAG GGG AAT 1257
Pro Phe Leu Asp Arg Ile Asp Leu Phe Val Gln Met Glu Glu Gly Asn
390 395 400

TAT AAA GAC ACG CCG TCG CAT TCT TGG ACT TCA AAA GAG ATG CAT GAA 1305
Tyr Lys Asp Thr Pro Ser His Ser Trp Thr Ser Lys Glu Met His Glu
405 410 415

TTG GTG TTA TTA GCT TTC AAG CAG CAA AAG TTA AGG AAA CAG AGC GTT 1353
Leu Val Leu Leu Ala Phe Lys Gln Gln Lys Leu Arg Lys Gln Ser Val
420 425 430

TTT AAT GGT AAG CTT AAT GAA GAG CAG ATA GAA CGA TTT TGC CCC TTA 1401
Phe Asn Gly Lys Leu Asn Glu Glu Gln Ile Glu Arg Phe Cys Pro Leu
435 440 445 450

AAC GCT GAA GCA AAA AAG TTG TTG GAG CAG GCG GTT GAA AGG TTT AAT 1449
Asn Ala Glu Ala Lys Lys Leu Leu Glu Gln Ala Val Glu Arg Phe Asn
455 460 465

CTC TCC ATG CGC TCT ATT AAT AAG GTC AAA AAA GTC GCT AGG ACG ATT 1497
Leu Ser Met Arg Ser Ile Asn Lys Val Lys Lys Val Ala Arg Thr Ile
470 475 480

GCG GAT TTA AAC GCT TGC GAG GAT ATA GAA AAA TCT CAC ATG CTT AAA 1545
Ala Asp Leu Asn Ala Cys Glu Asp Ile Glu Lys Ser His Met Leu Lys
485 490 495

GCG CTG AGT TTT AGA AAG ATT TCT TAAAAGGATT TTTATAAGGG AGAAAAAATG 1599
Ala Leu Ser Phe Arg Lys Ile Ser
500 505
AAC CTT TCC CCC TCA GAT TTG CCT AAA TCC GGG AGT CAT TTT GAT TTG 297
Asn Leu Ser Pro Ser Asp Leu Pro Lys Ser Gly Ser His Phe Asp Leu
70 75 80

CCT ATC GCT CTT TTA ATC GCT TTG CAA AAA CAA GAG TTG GCT TTT AAA 345
Pro Ile Ala Leu Leu Ile Ala Leu Gln Lys Gln Glu Leu Ala Phe Lys
85 90 95

GAG TGG TTT GCT TTT GGG GAG TTA GGG CTT GAT GGC AAG ATC AAA CCC 393
Glu Trp Phe Ala Phe Gly Glu Leu Gly Leu Asp Gly Lys Ile Lys Pro
100 105 110

AAT CCT AAC ATT TTC CCC ATG CTT TTA GAC ATT GCC ATT AAA CAC CCC 441
Asn Pro Asn Ile Phe Pro Met Leu Leu Asp Ile Ala Ile Lys His Pro
115 120 125 130

CAT GCT AAG ATC ATT GCG CCT AAG GCC AAT GAA GAG CTT TTT TCG CTT 489
His Ala Lys Ile Ile Ala Pro Lys Ala Asn Glu Glu Leu Phe Ser Leu
135 140 145

ATC CCT AAT TTG CAA TGC TTT TTT GTG GGG CAT TTT AAA GAA GCG TTA 537
Ile Pro Asn Leu Gln Cys Phe Phe Val Gly His Phe Lys Glu Ala Leu
150 155 160

GAA ATC TTG CAA AAC CCT GAA ACC AAA GCA GAC ACC CAC ACG AAA AAA 585
Glu Ile Leu Gln Asn Pro Glu Thr Lys Ala Asp Thr His Thr Lys Lys
165 170 175

CTA CCC TTT AAA ACG ATA GAA TTG AAC GAT AAA GAG TAT TAT TTT TCA 633
Leu Pro Phe Lys Thr Ile Glu Leu Asn Asp Lys Glu Tyr Tyr Phe Ser
180 185 190

GAC GCC TAT GCC TTA GAT TTT AAA GAA GTT AAG GGG CAA GCT GTC GCT 681
Asp Ala Tyr Ala Leu Asp Phe Lys Glu Val Lys Gly Gln Ala Val Ala
195 200 205 210

AAA GAG GCC GCT TTG ATC GCT AGC GCT GGG TTT CAT AAC TTG ATT TTA 729
Lys Glu Ala Ala Leu Ile Ala Ser Ala Gly Phe His Asn Leu Ile Leu
215 220 225

GAG GGA AGT CCA GGG TGT GGG AAA AGC ATG ATC ATT AAT CGC ATG CGT 777
Glu Gly Ser Pro Gly Cys Gly Lys Ser Met Ile Ile Asn Arg Met Arg
230 235 240

TAT ATC TTG CCT CCA TTA AGC CTG AAT GAA ATC CTA GAA GCG ACA AAA 825
Tyr Ile Leu Pro Pro Leu Ser Leu Asn Glu Ile Leu Glu Ala Thr Lys
245 250 255

TTA CGC ATT TTA AGC GAG CAA GAC AGT GCC TAT TAC CCC TTA AGG AGT 873
Leu Arg Ile Leu Ser Glu Gln Asp Ser Ala Tyr Tyr Pro Leu Arg Ser
260 265 270

TTT AGA AAC CCT CAC CAA AGC GCT TCA AAA TCC AGT ATT TTA GGC TCA 921
Phe Arg Asn Pro His Gln Ser Ala Ser Lys Ser Ser Ile Leu Gly Ser
275 280 285 290
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848:

Val Ala Val Lys Lys Ile Val Val Ser Trp Cys Val Ala Leu Ala Phe
1 5 10 15

Leu Ser Ala Asp Ser Ala Gln Ala Asn Lys Ala Ile Ser Asn Ala Asp
20 25 30

Leu Ile Lys Glu Ile Arg Asp Leu Lys Lys Ile Ile Ser Ala Gln Asn
35 40 45

Thr Glu Ile Asn Asn Leu Arg Lys Val Gln Glu Val Leu Ser Gly Gln
50 55 60

Leu Gly Asp Met Arg Lys Asp Ile Leu Ser Thr Arg Asp Tyr Cys Ile
65 70 75 80

Ser Leu Arg Pro Tyr Ile Tyr Asn Trp Arg
85 90

(2) INFORMATION FOR SEQ ID NO: 849:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1631 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 52...1569
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849:

TGAAAAAGAA CTCAAAGAGC TGCAAAAAAA ACAAAAACAC GAGTAACAAC C ATG ATT 57
Met Ile
1

AAC ACG ATG TTT TGC GCG ACC ATG CAA AGG GGA GTG GCG GAA ATC GTG 105
Asn Thr Met Phe Cys Ala Thr Met Gln Arg Gly Val Ala Glu Ile Val
5 10 15

GCT GTG GAA GCG ACT TTC ACA AGG GCT TTG CCG GCG TTT GTG ATT TCA 153
Ala Val Glu Ala Thr Phe Thr Arg Ala Leu Pro Ala Phe Val Ile Ser
20 25 30

GGG TTA GCT AAT AGC TCT ATC CAA GAA GCC AAA CAG CGG GTT CAA TCG 201
Gly Leu Ala Asn Ser Ser Ile Gln Glu Ala Lys Gln Arg Val Gln Ser
35 40 45 50

GCT TTA CAA AAT AAC GAT TTC ACT TTC CCG CCT TTA AAA ATC ACC ATC 249
Ala Leu Gln Asn Asn Asp Phe Thr Phe Pro Pro Leu Lys Ile Thr Ile
55 60 65
Leu Asp Asp Ala Tyr Glu Ser
770 775 

(2) INFORMATION FOR SEQ ID NO: 847:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 310 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 10...279
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847:

AAAAGGAGA GTG GCG GTG AAA AAA ATC GTT GTG AGT TGG TGT GTG GCG TTG 51
Val Ala Val Lys Lys Ile Val Val Ser Trp Cys Val Ala Leu
1 5 10

GCT TTT TTA AGC GCG GAT TCA GCA CAA GCC AAT AAA GCG ATC AGT AAT 99
Ala Phe Leu Ser Ala Asp Ser Ala Gln Ala Asn Lys Ala Ile Ser Asn
15 20 25 30

GCG GAT TTG ATT AAA GAG ATA AGG GAT TTA AAA AAA ATC ATC AGC GCG 147
Ala Asp Leu Ile Lys Glu Ile Arg Asp Leu Lys Lys Ile Ile Ser Ala
35 40 45

CAA AAC ACT GAG ATT AAC AAC TTA AGA AAA GTG CAA GAA GTG TTG TCT 195
Gln Asn Thr Glu Ile Asn Asn Leu Arg Lys Val Gln Glu Val Leu Ser
50 55 60

GGG CAA TTA GGG GAC ATG CGT AAG GAT ATA TTA AGC ACT AGA GAT TAT 243
Gly Gln Leu Gly Asp Met Arg Lys Asp Ile Leu Ser Thr Arg Asp Tyr
65 70 75

TGC ATT AGC TTA AGG CCT TAT ATC TAT AAT TGG CGC TAGGGGATAA TCCAAA 295
Cys Ile Ser Leu Arg Pro Tyr Ile Tyr Asn Trp Arg
80 85 90

AAATGAAAGC ATGCG 310

(2) INFORMATION FOR SEQ ID NO: 848:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 90 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
325 330 335

Ile His Thr Thr Ile Phe Asp Glu Ser Ser Val Tyr Glu Val Gln Ile
340 345 350

Arg Thr Phe Asp Met His Met Gly Ala Glu Tyr Gly Asn Ser Ala His
355 360 365

Trp Lys Tyr Lys Ala Gly Gly Val Asp His Glu Asp His His Glu Gly
370 375 380

Met Arg Trp Leu Gln Asn Phe Lys Tyr His Asp Ser Asp Leu Lys Asn
385 390 395 400

Asp Pro Lys Glu Phe Tyr Glu Leu Ala Lys Asn Asp Leu Tyr Arg Glu
405 410 415

Asp Ile Val Val Phe Ser Pro His Gly Asp Thr Tyr Thr Leu Pro Val
420 425 430

Gly Ala Ile Ala Leu Asp Phe Ala Tyr Met Val His Ser Asp Leu Gly
435 440 445

Asp Lys Ala Thr Asp Ala Tyr Ile Asn Ser Lys Lys Ala Leu Leu Asn
450 455 460

Gln Glu Leu Arg Ser Gly Asp Val Val Lys Ile Ile Lys Gly Asp Lys
465 470 475 480

Ile Ile Pro Arg Phe Ile Trp Met Asp Gln Leu Lys Thr Ser Lys Ala
485 490 495

Lys Asn His Leu Arg Ile Gln Arg Arg Asn Arg Leu Lys Glu Val Asp
500 505 510

Thr Lys Ser Met Ile Asn Ile Leu Ala Thr Phe Phe Gly Arg Ser Val
515 520 525

Phe Glu Asp Met Asp Leu Lys Asp Tyr Lys Asn Phe Glu Glu Arg Leu
530 535 540

Thr Asp Cys Gly Val Glu Thr Thr Leu Thr Glu Ala Met Lys Ser Phe
545 550 555 560

Glu Asn Leu Ala Lys Leu Thr Glu Glu Ile Glu Asn Lys Val Phe Ser
565 570 575

Leu Lys Glu Asp Ala Ile Leu Glu Tyr Gln Glu Met Ser Leu Trp Thr
580 585 590

Arg Gly Leu Arg Tyr Leu Gly Phe Lys Thr Asn Val Leu Asn Phe Leu
595 600 605

Ala Pro Asn Arg Gln Trp Gln Cys Lys Glu Leu Glu His Phe Ser Val
610 615 620

Cys Ser Ser Asn Ala Leu Glu Ile Lys Gln Val Leu Leu Asn Asp Cys
625 630 635 640

Cys Tyr Pro Lys Tyr Gly Asp Glu Ile Ile Ala Ile Val Thr Asp Leu
645 650 655

Lys Asp Pro Lys Ala Ile Ala His His Lys Phe Cys Lys Lys Ala Met
660 665 670

Ala Glu Val Asp Ala Lys Val Pro Met Val Tyr Ile Glu Trp His Lys
675 680 685

Arg Asp Arg Thr Ile Tyr Lys Met Met Phe Tyr Leu Gly Glu Lys Lys
690 695 700

Ser Val Leu Ala Gly Leu Leu Thr Phe Leu Asn Arg Asn Glu Cys Asn
705 710 715 720

Ile Val Gly Val Ser Tyr Leu Gly Tyr Lys Asp Lys Tyr Ser Ser His
725 730 735

Cys Glu Val Ser Phe Glu Ile Ala Thr Asp Lys Ala Asp Trp Ile Arg
740 745 750

Ala Leu Ile Asn Arg Lys Tyr Gln Asp Arg Ile Val Glu Leu Ser Ser
755 760 765
AACATGGAAC AAA 2373

(2) INFORMATION FOR SEQ ID NO: 846:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 775 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846:

Met Asn Glu Ile Asp Lys Ser Val Asp Ile Gly Phe Leu Arg Ile Leu
1 5 10 15

Asp Val Ile Lys Lys Val Thr Thr Pro Lys Gly Gly Ile Glu Ile Leu
20 25 30

Arg Thr Leu Ile Asp Phe Thr Pro Lys Ile Glu Asn Ala Leu Asn Leu
35 40 45

Ala Ala Lys Ser His Lys Gly Gln Tyr Arg Lys Ser Gly Glu Pro Tyr
50 55 60

Ile Val His Pro Ile Cys Val Ala Ser Leu Val Ala Phe Cys Gly Gly
65 70 75 80

Asp Glu Ala Met Val Cys Ala Ala Leu Leu His Asp Val Val Glu Asp
85 90 95

Thr Pro Cys Lys Ile Glu Thr Ile Glu Gln Glu Phe Gly Gln Asp Val
100 105 110

Ala Asn Leu Val Asp Ala Leu Thr Lys Ile Thr Glu Ile Arg Lys Glu
115 120 125

Glu Leu Gly Val Ser Ser Gln Asp Pro Arg Met Val Val Ser Ala Leu
130 135 140

Thr Phe Arg Lys Ile Leu Ile Ser Ala Ile Gln Asp Pro Arg Ala Leu
145 150 155 160

Val Val Lys Ile Ser Asp Arg Leu His Asn Met Leu Thr Leu Asp Ala
165 170 175

Leu Pro His Asp Lys Gln Val Arg Ile Ser Lys Glu Thr Leu Ala Val
180 185 190

Tyr Ala Pro Ile Ala Ser Arg Leu Gly Met Ser Ser Ile Lys Asn Glu
195 200 205

Leu Glu Asp Lys Ser Phe Tyr Tyr Ile Tyr Pro Glu Glu Tyr Lys Asn
210 215 220

Ile Lys Glu Tyr Leu His Lys Asn Lys Gln Ser Leu Leu Leu Lys Leu
225 230 235 240

Asn Ala Phe Ala Ser Lys Leu Glu Lys Lys Leu Phe Asp Ser Gly Phe
245 250 255

Ser His Ser Asp Phe Lys Leu Val Thr Arg Val Lys Arg Pro Tyr Ser
260 265 270

Ile Tyr Leu Lys Met Gln Arg Lys Gly Ala Val Asn Ile Asp Glu Ile
275 280 285

Leu Asp Leu Leu Ala Ile Arg Ile Leu Leu Lys Asn Pro Ile Asp Cys
290 295 300

Tyr Lys Val Leu Gly Ile Ile His Leu Asn Phe Lys Pro Ile Val Ser
305 310 315 320

Arg Phe Lys Asp Tyr Ile Ala Leu Pro Lys Glu Asn Gly Tyr Lys Thr
AAA AGT TTT GAA AAT TTA GCC AAA CTC ACT GAA GAA ATT GAA AAT AAG 1731
Lys Ser Phe Glu Asn Leu Ala Lys Leu Thr Glu Glu Ile Glu Asn Lys
560 565 570

GTG TTT TCT TTA AAA GAA GAT GCG ATT TTA GAA TAC CAA GAG ATG AGT 1779
Val Phe Ser Leu Lys Glu Asp Ala Ile Leu Glu Tyr Gln Glu Met Ser
575 580 585

TTA TGG ACT CGA GGT TTA AGG TAT TTG GGC TTT AAA ACC AAT GTC TTG 1827
Leu Trp Thr Arg Gly Leu Arg Tyr Leu Gly Phe Lys Thr Asn Val Leu
590 595 600 605

AAT TTT TTA GCC CCC AAT CGG CAG TGG CAG TGT AAG GAA TTA GAA CAT 1875
Asn Phe Leu Ala Pro Asn Arg Gln Trp Gln Cys Lys Glu Leu Glu His
610 615 620

TTT AGC GTT TGT TCA AGC AAC GCT TTA GAA ATC AAA CAG GTG TTG TTG 1923
Phe Ser Val Cys Ser Ser Asn Ala Leu Glu Ile Lys Gln Val Leu Leu
625 630 635

AAT GAT TGT TGT TAC CCT AAA TAT GGC GAT GAA ATC ATT GCG ATT GTA 1971
Asn Asp Cys Cys Tyr Pro Lys Tyr Gly Asp Glu Ile Ile Ala Ile Val
640 645 650

ACG GAT TTA AAA GAT CCA AAA GCG ATT GCG CAC CAT AAA TTT TGC AAA 2019
Thr Asp Leu Lys Asp Pro Lys Ala Ile Ala His His Lys Phe Cys Lys
655 660 665

AAA GCG ATG GCG GAA GTA GAT GCT AAA GTG CCT ATG GTT TAT ATA GAA 2067
Lys Ala Met Ala Glu Val Asp Ala Lys Val Pro Met Val Tyr Ile Glu
670 675 680 685

TGG CAC AAG CGG GAT CGA ACG ATT TAT AAA ATG ATG TTT TAT TTG GGC 2115
Trp His Lys Arg Asp Arg Thr Ile Tyr Lys Met Met Phe Tyr Leu Gly
690 695 700

GAA AAA AAG TCG GTT TTA GCG GGT TTA TTA ACT TTT TTA AAC AGG AAT 2163
Glu Lys Lys Ser Val Leu Ala Gly Leu Leu Thr Phe Leu Asn Arg Asn
705 710 715

GAA TGC AAC ATT GTG GGC GTG TCT TAT TTG GGC TAT AAA GAC AAG TAT 2211
Glu Cys Asn Ile Val Gly Val Ser Tyr Leu Gly Tyr Lys Asp Lys Tyr
720 725 730

TCT AGC CAT TGT GAA GTG AGT TTT GAA ATA GCC ACA GAT AAG GCG GAT 2259
Ser Ser His Cys Glu Val Ser Phe Glu Ile Ala Thr Asp Lys Ala Asp
735 740 745

TGG ATC AGA GCC TTA ATC AAT CGC AAA TAT CAG GAT AGG ATT GTA GAA 2307
Trp Ile Arg Ala Leu Ile Asn Arg Lys Tyr Gln Asp Arg Ile Val Glu
750 755 760 765

TTA TCC AGT CTG GAT GAC GCT TAT GAA TCA TAATAAGCCC TAATTAAGGA ATG 2360
Leu Ser Ser Leu Asp Asp Ala Tyr Glu Ser
770 775
TAT AAG ACG ATA CAC ACG ACC ATT TTT GAT GAA TCT TCT GTT TAT GAA 1059
Tyr Lys Thr Ile His Thr Thr Ile Phe Asp Glu Ser Ser Val Tyr Glu
335 340 345

GTG CAG ATC CGC ACC TTT GAT ATG CAT ATG GGG GCG GAG TAT GGT AAT 1107
Val Gln Ile Arg Thr Phe Asp Met His Met Gly Ala Glu Tyr Gly Asn
350 355 360 365

TCA GCC CAC TGG AAG TAT AAA GCC GGG GGC GTG GAT CAT GAA GAT CAT 1155
Ser Ala His Trp Lys Tyr Lys Ala Gly Gly Val Asp His Glu Asp His
370 375 380

CAT GAG GGC ATG AGA TGG TTG CAA AAT TTT AAA TAC CAT GAC AGC GAT 1203
His Glu Gly Met Arg Trp Leu Gln Asn Phe Lys Tyr His Asp Ser Asp
385 390 395

TTG AAA AAC GAC CCT AAG GAA TTT TAC GAA CTC GCT AAG AAC GAT TTG 1251
Leu Lys Asn Asp Pro Lys Glu Phe Tyr Glu Leu Ala Lys Asn Asp Leu
400 405 410

TAT CGT GAA GAT ATT GTC GTT TTT TCG CCT CAT GGG GAC ACT TAC ACT 1299
Tyr Arg Glu Asp Ile Val Val Phe Ser Pro His Gly Asp Thr Tyr Thr
415 420 425

TTA CCG GTG GGT GCG ATC GCT TTA GAT TTT GCT TAC ATG GTG CAT AGC 1347
Leu Pro Val Gly Ala Ile Ala Leu Asp Phe Ala Tyr Met Val His Ser
430 435 440 445

GAT TTG GGC GAT AAA GCC ACG GAC GCT TAT ATC AAT AGT AAA AAA GCC 1395
Asp Leu Gly Asp Lys Ala Thr Asp Ala Tyr Ile Asn Ser Lys Lys Ala
450 455 460

TTA CTC AAT CAG GAA TTA AGA AGT GGG GAT GTG GTT AAA ATC ATT AAA 1443
Leu Leu Asn Gln Glu Leu Arg Ser Gly Asp Val Val Lys Ile Ile Lys
465 470 475

GGC GAT AAA ATA ATA CCT CGT TTC ATT TGG ATG GAT CAG CTT AAA ACT 1491
Gly Asp Lys Ile Ile Pro Arg Phe Ile Trp Met Asp Gln Leu Lys Thr
480 485 490

TCT AAG GCT AAA AAC CAT TTG CGC ATC CAA AGA AGA AAC CGC TTG AAA 1539
Ser Lys Ala Lys Asn His Leu Arg Ile Gln Arg Arg Asn Arg Leu Lys
495 500 505

GAG GTT GAC ACT AAG AGC ATG ATC AAT ATC TTA GCG ACT TTT TTT GGG 1587
Glu Val Asp Thr Lys Ser Met Ile Asn Ile Leu Ala Thr Phe Phe Gly
510 515 520 525

CGC TCT GTT TTT GAA GAC ATG GAT TTA AAG GAT TAT AAA AAC TTT GAA 1635
Arg Ser Val Phe Glu Asp Met Asp Leu Lys Asp Tyr Lys Asn Phe Glu
530 535 540

GAA AGA TTA ACA GAT TGC GGG GTG GAG ACC ACC TTA ACA GAA GCG ATG 1683
Glu Arg Leu Thr Asp Cys Gly Val Glu Thr Thr Leu Thr Glu Ala Met
545 550 555
CAA GAT GTG GCT AAT TTA GTG GAT GCG CTC ACT AAA ATC ACT GAA ATC 387
Gln Asp Val Ala Asn Leu Val Asp Ala Leu Thr Lys Ile Thr Glu Ile
110 115 120 125

AGG AAA GAA GAA TTA GGC GTG AGC TCT CAA GAT CCC AGA ATG GTG GTT 435
Arg Lys Glu Glu Leu Gly Val Ser Ser Gln Asp Pro Arg Met Val Val
130 135 140

TCA GCG CTC ACT TTC AGA AAG ATT TTA ATT AGC GCG ATA CAA GAT CCA 483
Ser Ala Leu Thr Phe Arg Lys Ile Leu Ile Ser Ala Ile Gln Asp Pro
145 150 155

AGA GCC TTA GTG GTA AAG ATT AGC GAC AGG TTG CAC AAC ATG CTC ACC 531
Arg Ala Leu Val Val Lys Ile Ser Asp Arg Leu His Asn Met Leu Thr
160 165 170

TTA GAC GCC TTG CCT CAT GAC AAG CAA GTG CGT ATT TCT AAA GAG ACT 579
Leu Asp Ala Leu Pro His Asp Lys Gln Val Arg Ile Ser Lys Glu Thr
175 180 185

CTA GCG GTG TAT GCC CCT ATA GCG AGC CGA TTG GGC ATG TCT TCA ATC 627
Leu Ala Val Tyr Ala Pro Ile Ala Ser Arg Leu Gly Met Ser Ser Ile
190 195 200 205

AAA AAT GAA TTA GAA GAC AAG AGC TTT TAT TAT ATT TAT CCA GAA GAG 675
Lys Asn Glu Leu Glu Asp Lys Ser Phe Tyr Tyr Ile Tyr Pro Glu Glu
210 215 220

TAT AAA AAT ATC AAG GAA TAT TTG CAC AAA AAC AAG CAG TCT TTA CTC 723
Tyr Lys Asn Ile Lys Glu Tyr Leu His Lys Asn Lys Gln Ser Leu Leu
225 230 235

TTA AAG CTC AAC GCT TTT GCG AGC AAG TTA GAA AAA AAA CTT TTT GAT 771
Leu Lys Leu Asn Ala Phe Ala Ser Lys Leu Glu Lys Lys Leu Phe Asp
240 245 250

AGT GGG TTT AGC CAT TCG GAT TTT AAA CTC GTT ACA AGG GTG AAA CGC 819
Ser Gly Phe Ser His Ser Asp Phe Lys Leu Val Thr Arg Val Lys Arg
255 260 265

CCT TAT TCT ATC TAT CTT AAG ATG CAA CGA AAG GGC GCG GTT AAT ATT 867
Pro Tyr Ser Ile Tyr Leu Lys Met Gln Arg Lys Gly Ala Val Asn Ile
270 275 280 285

GAT GAA ATT TTG GAC TTG TTA GCC ATT AGG ATT TTA TTG AAA AAC CCG 915
Asp Glu Ile Leu Asp Leu Leu Ala Ile Arg Ile Leu Leu Lys Asn Pro
290 295 300

ATT GAT TGC TAT AAA GTT TTA GGG ATT ATC CAT TTG AAT TTC AAA CCC 963
Ile Asp Cys Tyr Lys Val Leu Gly Ile Ile His Leu Asn Phe Lys Pro
305 310 315

ATT GTC TCT CGT TTT AAA GAT TAC ATC GCT TTG CCC AAA GAA AAT GGC 1011
Ile Val Ser Arg Phe Lys Asp Tyr Ile Ala Leu Pro Lys Glu Asn Gly
320 325 330
Val Ala Pro Cys Asn Arg Gly Glu Glu Ala Lys Lys Val Gly Tyr Cys
305 310 315 320

Ile Ala Asp Gly Leu Gly Arg Ser Tyr Leu Gly Asn Arg Glu Glu Gly
325 330 335

Leu Tyr Phe Thr Gly Ala Asn Gly Tyr Arg Val Asp Lys Ile Ile Ser
340 345 350

Val His Glu Leu Ile Lys Glu Leu Thr Glu Gly
355 360

(2) INFORMATION FOR SEQ ID NO: 845:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2373 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 13...2337
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845:

TAGACAGGAT AG ATG AAC GAA ATT GAT AAA TCC GTT GAT ATC GGA TTC TTA 51
Met Asn Glu Ile Asp Lys Ser Val Asp Ile Gly Phe Leu
1 5 10

CGG ATT CTT GAT GTT ATT AAA AAA GTT ACG ACC CCA AAG GGT GGC ATT 99
Arg Ile Leu Asp Val Ile Lys Lys Val Thr Thr Pro Lys Gly Gly Ile
15 20 25

GAA ATC TTA AGG ACT TTA ATT GAT TTT ACG CCC AAA ATT GAA AAC GCC 147
Glu Ile Leu Arg Thr Leu Ile Asp Phe Thr Pro Lys Ile Glu Asn Ala
30 35 40 45

CTG AAT TTA GCG GCC AAA AGC CAT AAG GGG CAA TAC AGA AAA AGC GGC 195
Leu Asn Leu Ala Ala Lys Ser His Lys Gly Gln Tyr Arg Lys Ser Gly
50 55 60

GAG CCT TAT ATT GTC CAT CCT ATT TGC GTG GCA AGC TTG GTA GCG TTT 243
Glu Pro Tyr Ile Val His Pro Ile Cys Val Ala Ser Leu Val Ala Phe
65 70 75

TGT GGG GGC GAT GAG GCG ATG GTG TGT GCT GCG CTT TTG CAT GAT GTG 291
Cys Gly Gly Asp Glu Ala Met Val Cys Ala Ala Leu Leu His Asp Val
80 85 90

GTG GAA GAC ACG CCT TGT AAG ATT GAA ACG ATT GAG CAA GAA TTT GGG 339
Val Glu Asp Thr Pro Cys Lys Ile Glu Thr Ile Glu Gln Glu Phe Gly
95 100 105
GAG CTT ACA GAG GGT TAATTTGTAG TGCTTGTGAG GTTAGGGGTT GTTGCA 1161
Glu Leu Thr Glu Gly 
360 



(2) INFORMATION FOR SEQ ID NO: 844:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 363 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844:



Met Val Ser Thr Leu Lys Pro Leu Lys Ile Gly Lys His Thr Ile Lys
1 5 10 15

Phe Pro Ile Phe Gln Gly Gly Met Gly Val Gly Ile Ser Trp Asp Glu
20 25 30

Leu Ala Gly Asn Val Ala Lys Glu Gly Ala Leu Gly Val Ile Ser Ala
35 40 45

Val Gly Thr Gly Tyr Tyr Lys Asn Met Arg Phe Val Glu Arg Ile Val
50 55 60

Ala Lys Lys Pro Phe Glu Ala Leu Asn Phe Tyr Ser Lys Lys Ala Leu
65 70 75 80

Asn Glu Ile Phe Ala Asn Ala Arg Lys Ile Cys Gly Asn Asn Pro Leu
85 90 95

Gly Ala Asn Ile Leu Tyr Ala Ile Asn Asp Tyr Gly Arg Val Leu Arg
100 105 110

Asp Ser Cys Glu Ala Gly Ala Asn Ile Ile Ile Thr Gly Ala Gly Leu
115 120 125

Pro Thr Asn Met Pro Glu Phe Ala Lys Asp Phe Ser Asp Val Ala Leu
130 135 140

Ile Pro Ile Ile Ser Ser Ala Lys Ala Leu Lys Ile Leu Cys Lys Arg
145 150 155 160

Trp Ser Asp Arg Tyr Lys Arg Ile Pro Asp Ala Phe Ile Val Glu Gly
165 170 175

Pro Leu Ser Gly Gly His Gln Gly Phe Lys Tyr Glu Asp Cys Phe Lys
180 185 190

Glu Glu Phe Arg Leu Glu Asn Leu Val Pro Lys Val Val Glu Ala Ser
195 200 205

Lys Glu Trp Gly Asn Ile Pro Ile Ile Ala Ala Gly Gly Ile Trp Asp
210 215 220

Arg Lys Asp Ile Asp Thr Met Leu Ser Leu Gly Ala Ser Gly Val Gln
225 230 235 240

Met Ala Thr Arg Phe Leu Gly Thr Lys Glu Cys Asp Ala Lys Val Tyr
245 250 255

Ala Asp Leu Leu Pro Thr Leu Lys Lys Glu Asp Ile Leu Leu Ile Lys
260 265 270

Ser Pro Val Gly Tyr Pro Ala Arg Ala Ile Asn Thr Gly Val Ile Lys
275 280 285

Arg Ile Glu Glu Gly Asn Ala Pro Lys Ile Ala Cys Val Ser Asn Cys
290 295 300
TTC GCT AAG GAT TTT AGC GAT GTG GCG CTC ATC CCT ATT ATT TCT TCA 486
Phe Ala Lys Asp Phe Ser Asp Val Ala Leu Ile Pro Ile Ile Ser Ser
135 140 145 150

GCG AAG GCT TTA AAA ATC CTT TGT AAA AGA TGG AGC GAT CGC TAT AAA 534
Ala Lys Ala Leu Lys Ile Leu Cys Lys Arg Trp Ser Asp Arg Tyr Lys
155 160 165

AGA ATC CCG GAC GCG TTC ATT GTG GAA GGG CCT TTG AGT GGG GGG CAT 582
Arg Ile Pro Asp Ala Phe Ile Val Glu Gly Pro Leu Ser Gly Gly His
170 175 180

CAG GGC TTT AAA TAC GAA GAT TGT TTC AAA GAA GAA TTC CGA TTA GAA 630
Gln Gly Phe Lys Tyr Glu Asp Cys Phe Lys Glu Glu Phe Arg Leu Glu
185 190 195

AAC TTA GTG CCT AAA GTC GTG GAA GCT TCT AAA GAA TGG GGG AAT ATC 678
Asn Leu Val Pro Lys Val Val Glu Ala Ser Lys Glu Trp Gly Asn Ile
200 205 210

CCT ATC ATC GCC GCT GGG GGG ATT TGG GAT AGG AAG GAT ATA GAC ACC 726
Pro Ile Ile Ala Ala Gly Gly Ile Trp Asp Arg Lys Asp Ile Asp Thr
215 220 225 230

ATG TTA AGT CTT GGA GCG AGT GGG GTG CAG ATG GCG ACT CGT TTT TTA 774
Met Leu Ser Leu Gly Ala Ser Gly Val Gln Met Ala Thr Arg Phe Leu
235 240 245

GGC ACG AAA GAA TGC GAC GCT AAA GTG TAT GCC GAT CTT TTG CCC ACG 822
Gly Thr Lys Glu Cys Asp Ala Lys Val Tyr Ala Asp Leu Leu Pro Thr
250 255 260

CTC AAA AAA GAA GAT ATT TTA CTC ATT AAA TCG CCT GTA GGT TAT CCG 870
Leu Lys Lys Glu Asp Ile Leu Leu Ile Lys Ser Pro Val Gly Tyr Pro
265 270 275

GCT AGG GCT ATT AAT ACG GGA GTG ATC AAG CGC ATT GAA GAG GGT AAC 918
Ala Arg Ala Ile Asn Thr Gly Val Ile Lys Arg Ile Glu Glu Gly Asn
280 285 290

GCG CCC AAA ATC GCA TGC GTG AGC AAT TGT GTA GCG CCT TGC AAC AGG 966
Ala Pro Lys Ile Ala Cys Val Ser Asn Cys Val Ala Pro Cys Asn Arg
295 300 305 310

GGT GAA GAG GCT AAA AAG GTG GGC TAT TGT ATC GCT GAT GGT TTG GGG 1014
Gly Glu Glu Ala Lys Lys Val Gly Tyr Cys Ile Ala Asp Gly Leu Gly
315 320 325

CGC AGT TAT TTA GGG AAC AGA GAA GAG GGG CTT TAT TTT ACC GGG GCT 1062
Arg Ser Tyr Leu Gly Asn Arg Glu Glu Gly Leu Tyr Phe Thr Gly Ala
330 335 340

AAT GGC TAT AGA GTG GAT AAG ATT ATC AGC GTG CAT GAA TTG ATT AAA 1110
Asn Gly Tyr Arg Val Asp Lys Ile Ile Ser Val His Glu Leu Ile Lys
345 350 355
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(2) INFORMATION FOR SEQ ID NO:843: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37... 1125 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843:

TGAAATTAAA TATTAACTAA TATTAAGGAA AGAGCT ATG GTA TCA ACA CTC AAA 54
Met Val Ser Thr Leu Lys
1 5

CCG CTA AAA ATC GGT AAG CAC ACC ATA AAA TTC CCT ATC TTT CAA GGG 102
Pro Leu Lys Ile Gly Lys His Thr Ile Lys Phe Pro Ile Phe Gln Gly
10 15 20

GGA ATG GGT GTG GGG ATT AGC TGG GAT GAA CTA GCT GGA AAT GTT GCC 150
Gly Met Gly Val Gly Ile Ser Trp Asp Glu Leu Ala Gly Asn Val Ala
25 30 35

AAA GAA GGG GCT TTA GGA GTG ATT TCA GCC GTA GGG ACT GGT TAT TAT 198
Lys Glu Gly Ala Leu Gly Val Ile Ser Ala Val Gly Thr Gly Tyr Tyr
40 45 50

AAA AAC ATG CGT TTT GTA GAA AGG ATT GTG GCT AAA AAA CCC TTT GAA 246
Lys Asn Met Arg Phe Val Glu Arg Ile Val Ala Lys Lys Pro Phe Glu
55 60 65 70

GCC TTG AAT TTT TAC TCC AAA AAA GCG TTG AAT GAG ATT TTT GCA AAC 294
Ala Leu Asn Phe Tyr Ser Lys Lys Ala Leu Asn Glu Ile Phe Ala Asn
75 80 85

GCT AGG AAG ATT TGC GGG AAC AAC CCT TTA GGA GCG AAT ATT TTA TAC 342
Ala Arg Lys Ile Cys Gly Asn Asn Pro Leu Gly Ala Asn Ile Leu Tyr
90 95 100

GCT ATC AAT GAC TAT GGC CGT GTT TTA AGG GAC TCT TGT GAA GCG GGA 390
Ala Ile Asn Asp Tyr Gly Arg Val Leu Arg Asp Ser Cys Glu Ala Gly
105 110 115

GCG AAT ATC ATT ATT ACA GGG GCT GGT TTG CCC ACC AAC ATG CCT GAA 438
Ala Asn Ile Ile Ile Thr Gly Ala Gly Leu Pro Thr Asn Met Pro Glu
120 125 130
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842: 

Met Val Phe Asn Gln Glu Val Lys Lys Phe Lys Glu Val Ser Leu Lys
1 5 10 15

Asn Phe Lys Ser Tyr Leu Glu Leu Glu Ala Ile Leu Thr Ile Pro Lys
20 25 30

Lys His Tyr Gln Phe Ser Lys Gln Ser Phe Ile Thr Ile Ala Gln Phe
35 40 45

Ser Pro Lys Leu Val Arg Val Val Ile Gly Tyr Ala Pro Lys Met Thr
50 55 60

Tyr Glu Val Lys Ile Leu Lys Asp Lys Leu Tyr Val Ser Ile Val Glu
65 70 75 80

Lys Lys Pro Leu Ile Arg His Gln Met Ala Leu Lys Pro Pro Lys His
85 90 95

His Ala Leu Lys His Thr Thr Pro Lys Pro Ala His Lys Pro Ile Lys
100 105 110

Lys Glu Ala Lys Lys Val Lys Glu Lys Thr Pro Thr Lys His Ala His
115 120 125

Ser Lys His Thr His Ser Pro Leu Asn Glu Arg Ser Thr Lys Lys Glu
130 135 140

Ile Pro Lys Lys Glu Ile Pro Lys Lys Glu Ala Glu Asn Glu Ser Lys
145 150 155 160

Asn Gln Val Phe Ile Ala Glu Lys Asn Asp Thr Phe Ile Lys Thr Lys
165 170 175

Arg Lys Lys His Lys Lys Ile Val Leu Asp Ala Gly His Gly Gly Lys
180 185 190

Asp Cys Gly Ala Met Ser Ala Asn Leu Val Cys Glu Lys Asp Ile Val
195 200 205

Leu Glu Val Val Lys Phe Leu His Lys Glu Leu Lys Lys Arg Asp Tyr
210 215 220

Ser Val Leu Leu Thr Arg Asp Lys Asp Ile Tyr Ile Asp Leu Val Ala
225 230 235 240

Arg Thr Glu Leu Ala Asn Lys Lys Ser Ala Asp Leu Phe Ile Ser Val
245 250 255

His Ala Asn Ser Ile Pro Lys His Ser Thr Ser Asn Ala His Gly Ile
260 265 270

Glu Thr Tyr Phe Leu Ser Thr Ala Arg Ser Glu Arg Ala Arg Lys Val
275 280 285

Ala Glu Gln Glu Asn Lys Asp Asp Val Asn Leu Met Asp Tyr Phe Ser
290 295 300

Lys Ser Leu Phe Leu Asn Ser Leu Asn Thr Gln Arg Leu Ile Val Ser
305 310 315 320

Asn Lys Leu Ala Ile Asp Val Gln Tyr Gly Met Leu Gln Ser Val Arg
325 330 335

Lys Asn Tyr Pro Asp Val Val Asp Gly Gly Val Arg Glu Gly Pro Phe
340 345 350

Trp Val Leu Ala Gly Ala Leu Met Pro Ser Ile Leu Ile Glu Ile Gly
355 360 365

Tyr Asn Ser His Ala Ile Glu Ser Lys Arg Ile Gln Ser Lys Pro Tyr
370 375 380

Gln Lys Ile Leu Ala Lys Gly Ile Ala Asp Gly Ile Asp Ser Phe Phe
385 390 395 400

Ser Lys Asn Asp
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GTLi CiCT CGC ACG GAA TTA GCC AAT AAA AAA AGC GCG GAT TTA TTC ATC 831 
Val Ala Arg Thr Glu Leu Ala Asn Lys Lys Ser Ala Asp Leu Phe Ile
240 245 250

TCA GTG CAT GCC AAT TCC ATC CCC AAA CAT TCC ACT TCT AAC GCT CAT 879
Ser Val His Ala Asn Ser Ile Pro Lys His Ser Thr Ser Asn Ala His
255 260 265 270

GGT ATA GAG ACT TAT TTT TTA TCC ACC GCA AGG AGC GAA AGG GCT AGG 927
Gly Ile Glu Thr Tyr Phe Leu Ser Thr Ala Arg Ser Glu Arg Ala Arg
275 280 285

AAA GTG GCT GAG CAA GAA AAT AAA GAC GAT GTG AAT TTA ATG GAT TAT 975
Lys Val Ala Glu Gln Glu Asn Lys Asp Asp Val Asn Leu Met Asp Tyr
290 295 300

TTT TCT AAA AGT TTG TTT TTA AAT TCA TTG AAC ACG CAG CGA TTG ATC 1023
Phe Ser Lys Ser Leu Phe Leu Asn Ser Leu Asn Thr Gln Arg Leu Ile
305 310 315

GTC TCT AAC AAA TTA GCG ATT GAT GTG CAA TAC GGC ATG CTC CAA AGC 1071
Val Ser Asn Lys Leu Ala Ile Asp Val Gln Tyr Gly Met Leu Gln Ser
320 325 330

GTC CGC AAA AAT TAC CCT GAT GTG GTG GAT GGA GGC GTG AGA GAG GGG 1119
Val Arg Lys Asn Tyr Pro Asp Val Val Asp Gly Gly Val Arg Glu Gly
335 340 345 350

CCT TTT TGG GTG TTG GCC GGG GCT TTA ATG CCT TCA ATC TTA ATA GAA 1167
Pro Phe Trp Val Leu Ala Gly Ala Leu Met Pro Ser Ile Leu Ile Glu
355 360 365

ATT GGT TAT AAT TCC CAT GCG ATA GAA TCT AAA CGC ATC CAA AGC AAA 1215
Ile Gly Tyr Asn Ser His Ala Ile Glu Ser Lys Arg Ile Gln Ser Lys
370 375 380

CCG TAT CAA AAG ATC TTG GCT AAG GGC ATT GCT GAT GGC ATT GAT AGT 1263
Pro Tyr Gln Lys Ile Leu Ala Lys Gly Ile Ala Asp Gly Ile Asp Ser
385 390 395

TTC TTC AGC AAG AAT GAT TAGGCAATGA TTAGGTTGTA GATGAATTTT TATCAAAA 1319
Phe Phe Ser Lys Asn Asp
400

AATATACACT CATAAAGTC 1338

(2) INFORMATION FOR SEQ ID NO: 842: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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Leu Lys Asn Phe Lys Ser Tyr Leu Glu Leu Glu Ala lie Leu Thr lie 
15 20 25 30 

CCT AAA AAG CAT TAC CAA TTC TCC AAG CAA TCG TTC ATC ACG ATC GCG 207
Pro Lys Lys His Tyr Gln Phe Ser Lys Gln Ser Phe Ile Thr Ile Ala
35 40 45

CAA TTC AGC CCT AAG TTA GTG CGA GTG GTT ATC GGC TAT GCT CCT AAG 255
Gln Phe Ser Pro Lys Leu Val Arg Val Val Ile Gly Tyr Ala Pro Lys
50 55 60

ATG ACT TAT GAA GTT AAA ATC CTT AAA GAC AAG CTT TAT GTT TCT ATC 303
Met Thr Tyr Glu Val Lys Ile Leu Lys Asp Lys Leu Tyr Val Ser Ile
65 70 75

GTG GAG AAA AAG CCC TTA ATT AGG CAT CAA ATG GCG TTA AAA CCA CCC 351
Val Glu Lys Lys Pro Leu Ile Arg His Gln Met Ala Leu Lys Pro Pro
80 85 90

AAA CAC CAT GCA CTC AAA CAC ACA ACG CCA AAA CCC GCC CAT AAG CCC 399
Lys His His Ala Leu Lys His Thr Thr Pro Lys Pro Ala His Lys Pro
95 100 105 110

ATT AAA AAA GAG GCT AAA AAG GTT AAA GAA AAA ACG CCA ACT AAA CAT 447
Ile Lys Lys Glu Ala Lys Lys Val Lys Glu Lys Thr Pro Thr Lys His
115 120 125

GCG CAT TCA AAA CAC ACG CAT TCC CCA TTG AAC GAA AGG AGC ACT AAA 495
Ala His Ser Lys His Thr His Ser Pro Leu Asn Glu Arg Ser Thr Lys
130 135 140

AAA GAA ATT CCT AAA AAA GAA ATT CCT AAA AAA GAA GCG GAA AAT GAG 543
Lys Glu Ile Pro Lys Lys Glu Ile Pro Lys Lys Glu Ala Glu Asn Glu
145 150 155

AGC AAG AAC CAA GTC TTT ATA GCA GAA AAA AAT GAT ACT TTC ATC AAA 591
Ser Lys Asn Gln Val Phe Ile Ala Glu Lys Asn Asp Thr Phe Ile Lys
160 165 170

ACC AAA CGC AAA AAA CAC AAA AAG ATC GTT TTA GAC GCT GGG CAT GGG 639
Thr Lys Arg Lys Lys His Lys Lys Ile Val Leu Asp Ala Gly His Gly
175 180 185 190

GGG AAA GAT TGC GGG GCG ATG AGC GCG AAT TTG GTG TGT GAA AAA GAC 687
Gly Lys Asp Cys Gly Ala Met Ser Ala Asn Leu Val Cys Glu Lys Asp
195 200 205

ATT GTT TTA GAA GTG GTG AAG TTT TTA CAC AAA GAG CTT AAA AAA AGA 735
Ile Val Leu Glu Val Val Lys Phe Leu His Lys Glu Leu Lys Lys Arg
210 215 220

GAT TAT AGC GTT TTA TTG ACA AGG GAT AAG GAT ATT TAT ATT GAT TTA 783
Asp Tyr Ser Val Leu Leu Thr Arg Asp Lys Asp Ile Tyr Ile Asp Leu
225 230 235
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50 55 60

Cys Gly Arg Val Ala Phe Leu Lys Asp Ile Val Ser Asn Ser Pro Asn
65 70 75 80

Glu Thr Ile Gln Ser Phe Asp Gly Asp Leu Glu Val Ala Met His Leu
85 90 95

Glu Lys Ile Gly Ile Glu Cys Tyr Lys Ile Phe Ile Asp Tyr Gly Ser
100 105 110

Gln Lys Ile Asp Asp Asn Glu Leu Ser Cys Arg Leu Leu His Thr Gly
115 120 125

Thr Lys Ile Leu Gly Thr Lys Ala Met Ala Val Val Gly Gln Thr Phe
130 135 140

Ile Pro Ile Pro Gly Val Gly Ala Ile Ile Gly Asn Phe Val Gly Ala
145 150 155 160

Leu Leu Ser Lys Thr Leu Cys Glu Asn Leu Arg Asp Val Leu Lys Glu
165 170 175

Ala Lys Leu Ala Arg Gln Arg Arg Ile Glu Ile Glu Lys Glu Cys Arg
180 185 190

Glu Ser Ile Arg Leu Leu Glu Ile Tyr Arg Asn Gln Phe Lys Glu Val
195 200 205

Phe Glu Arg Tyr Phe His Gly Asn Val Lys Phe Phe Asn Glu Asn Phe
210 215 220

Asn Asn Leu Glu Arg Ala Leu Tyr Ala Gly Asp Ala Asp Leu Ala Ile
225 230 235 240

Gly Val Asn Asn Glu Ile Gln Glu Arg Leu Gly Gln Lys Pro Leu Phe
245 250 255

Asn Asn Thr Gln Glu Phe Leu Glu Leu Met Asn Asn Gly Gly Lys Ile
260 265 270

Glu Ile



(2) INFORMATION FOR SEQ ID NO: 841: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70... 1281 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:841:

CTTCATTACG CTTACGCTAC AACCCTTAAG ATCACCAATG TTGTGCCTTT TGGCTCTAGC 60

AGCGTTAAA ATG GTG TTC AAT CAA GAG GTT AAA AAA TTC AAA GAA GTT TCG 111
Met Val Phe Asn Gln Glu Val Lys Lys Phe Lys Glu Val Ser
1 5 10

CTC AAA AAT TTC AAG AGT TAT TTG GAA TTA GAA GCC ATT TTA ACC ATT 159
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140 145 150 

ATA ATT GGA AAT TTT GTG GGT GCA TTA CTG AGC AAA ACT CTC TGT GAA 533
Ile Ile Gly Asn Phe Val Gly Ala Leu Leu Ser Lys Thr Leu Cys Glu
155 160 165

AAT TTG CGA GAT GTT TTA AAA GAG GCT AAA TTG GCG CGC CAA AGG CGT 581
Asn Leu Arg Asp Val Leu Lys Glu Ala Lys Leu Ala Arg Gln Arg Arg
170 175 180

ATA GAG ATT GAA AAA GAA TGC CGT GAA AGT ATT AGG CTG TTA GAG ATC 629
Ile Glu Ile Glu Lys Glu Cys Arg Glu Ser Ile Arg Leu Leu Glu Ile
185 190 195 200

TAT CGC AAT CAA TTT AAG GAA GTG TTT GAG CGG TAT TTT CAT GGG AAT 677
Tyr Arg Asn Gln Phe Lys Glu Val Phe Glu Arg Tyr Phe His Gly Asn
205 210 215

GTA AAA TTC TTT AAT GAG AAT TTT AAT AAT CTT GAG AGG GCG CTT TAT 725
Val Lys Phe Phe Asn Glu Asn Phe Asn Asn Leu Glu Arg Ala Leu Tyr
220 225 230

GCA GGA GAT GCG GAT TTG GCC ATA GGA GTC AAT AAT GAG ATT CAA GAA 773
Ala Gly Asp Ala Asp Leu Ala Ile Gly Val Asn Asn Glu Ile Gln Glu
235 240 245

AGA CTA GGT CAA AAA CCC TTG TTT AAT AAT ACC CAA GAA TTT TTG GAA 821
Arg Leu Gly Gln Lys Pro Leu Phe Asn Asn Thr Gln Glu Phe Leu Glu
250 255 260

CTC ATG AAT AAT GGT GGA AAA ATA GAA ATT TAAAGGAGAA ATCATGGAAG AAC 874
Leu Met Asn Asn Gly Gly Lys Ile Glu Ile
265 270

AAAAGGATAT GGGTCAAAGT 894

(2) INFORMATION FOR SEQ ID NO: 840:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 274 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840:

Met Leu Glu Asn Met Gln Asp Ile Ser Leu Gln Ser Ser His Glu Val
1 5 10 15

Gly Val Asp Ile Thr Glu Ser Lys Met Leu Thr Lys Phe Ala Ser Ser
20 25 30

Leu Leu Met Asn Leu Tyr Glu Tyr Ile Gly Asn Gly Lys Asp Pro Lys
35 40 45

Glu Ala Ser Asp His Ala Met Arg Asp Ala Lys Asp Val Val Leu Ser
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 30... 851 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839:

TATTAACACC GTTAAAGAGG GGGTAATTC ATG TTA GAA AAC ATG CAA GAT ATT 53
Met Leu Glu Asn Met Gln Asp Ile
1 5

TCA TTG CAA AGC TCT CAT GAA GTA GGA GTG GAT ATT ACA GAG AGC AAA 101
Ser Leu Gln Ser Ser His Glu Val Gly Val Asp Ile Thr Glu Ser Lys
10 15 20

ATG CTT ACA AAA TTT GCA TCC TCG TTA TTA ATG AAT TTA TAT GAA TAT 149
Met Leu Thr Lys Phe Ala Ser Ser Leu Leu Met Asn Leu Tyr Glu Tyr
25 30 35 40

ATT GGA AAT GGC AAG GAT CCC AAA GAA GCG TCC GAT CAT GCC ATG AGG 197
Ile Gly Asn Gly Lys Asp Pro Lys Glu Ala Ser Asp His Ala Met Arg
45 50 55

GAT GCA AAG GAT GTG GTG CTT AGT TGT GGT AGA GTA GCC TTT CTT AAA 245
Asp Ala Lys Asp Val Val Leu Ser Cys Gly Arg Val Ala Phe Leu Lys
60 65 70

GAC ATA GTT TCA AAT AGT CCA AAC GAA ACA ATC CAA AGT TTT GAT GGA 293
Asp Ile Val Ser Asn Ser Pro Asn Glu Thr Ile Gln Ser Phe Asp Gly
75 80 85

GAC TTA GAA GTT GCG ATG CAT TTA GAA AAA ATT GGC ATA GAA TGT TAT 341
Asp Leu Glu Val Ala Met His Leu Glu Lys Ile Gly Ile Glu Cys Tyr
90 95 100

AAG ATA TTT ATT GAC TAT GGT TCT CAA AAG ATC GAT GAT AAT GAG CTT 389
Lys Ile Phe Ile Asp Tyr Gly Ser Gln Lys Ile Asp Asp Asn Glu Leu
105 110 115 120

TCT TGT CGT TTG TTA CAC ACT GGC ACG AAA ATT TTA GGC ACA AAA GCT 437
Ser Cys Arg Leu Leu His Thr Gly Thr Lys Ile Leu Gly Thr Lys Ala
125 130 135

ATG GCA GTT GTT GGT CAA ACA TTC ATC CCC ATT CCT GGA GTT GGA GCG 485
Met Ala Val Val Gly Gln Thr Phe Ile Pro Ile Pro Gly Val Gly Ala
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Xaa Tyr Gly Val Lys Lys Lys Val Asp Ala Asp lie Leu Ser Glu Glu 

20 25 30 

Thr Asn Glu Tyr Ile Lys Tyr Ile Asn Glu Gly Asn Asp Leu Leu Glu
35 40 45

Glu Ala Glu Glu Val Ile Lys Ala Val Ala Ser Asp Cys Glu Phe Ala
50 55 60

Leu Ala Arg Phe Glu Glu Lys Arg Cys Tyr Ile Arg Asn His Val Ile
65 70 75 80

Ser Glu Phe Leu His His Phe Asn Gln Leu Glu Gly Phe Glu Leu Thr
85 90 95

Asn Lys Lys Asp Ser Met Glu Asn Ile Gln Leu Asp Val Ser Asn Thr
100 105 110

Leu Lys Ile Ile Asp Lys Asn Leu Lys Met Ser Ser Phe Asp Thr Leu
115 120 125

Gly Ala Val Gly Asn Val Val Gly Gly Phe Ser Met Gly Phe Gly Leu
130 135 140

Ala Ala Gly Gly Ile Val Gly Ser Val Gly Leu Leu Ala Gly Pro Thr
145 150 155 160

Leu Ala Ile Phe Gly Ala Leu Arg Ala Ala Glu Met Glu Lys Lys Leu
165 170 175

Glu Asp Ala Lys Ala Tyr Cys Ser Gln Val Glu Ala Ala Val Lys Lys
180 185 190

Ala Asp Ala Met Ile Asp Asn Leu Gln Ala Val Arg Lys Met Ala Asp
195 200 205

Leu Phe Thr Arg Gln Ile Thr Lys Phe Asp Ala Leu Phe Phe Ser Leu
210 215 220

Ala Gln Glu Ala Ile Ala Thr Met Lys Lys His Asn Tyr Asp Phe Ser
225 230 235 240

His Tyr Asn Gln Lys Glu Gln Asp Gln Leu Ala Thr Ala Ser Ser Thr
245 250 255

Leu Lys Thr Leu Gly Ala Phe Leu Lys Val Pro Ile Met Asp Lys His
260 265 270

Gln Lys Leu Asn Glu Ala Thr Gln Ser Lys Leu Glu Phe Met Gln Arg
275 280 285

Glu Met Ser Ser Leu Glu Ala Lys His Tyr Asp Ser Val Lys Ile Lys
290 295 300

Phe Gly Leu Val Arg Arg Leu Phe Glu Phe Phe Arg Ser Leu Trp Gly
305 310 315 320

Lys Asn Gly Arg Ile Gln Arg Ala Lys Thr Thr Pro Asp Arg Phe Pro
325 330 335

Cys Thr Ser Cys Gly Leu Cys Cys Lys Asn Ile Ala Gly Ile Ile Glu
340 345 350

Leu Ile Gly Phe Asp Ala Gly Asn Gly Val Cys Lys Phe Leu Asp Leu
355 360 365

Glu Thr Asn Leu Cys Lys Ile Tyr Glu Ser Arg Pro Leu Ile Cys Arg
370 375 380

Ile Asp Glu Ala His Lys Lys Leu Tyr Pro His Ile Pro Leu Lys Glu
385 390 395 400

Phe Tyr Ala Lys Asn Ala Glu Val Cys Asn Ala Leu Gln Glu Ala Asn
405 410 415

His Met Asp Lys Ser Phe Arg Val Ile Leu Lys Lys
420 425

(2) INFORMATION FOR SEQ ID NO: 839:
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CTA GAG TTT ATG CAA AGO GAG ATG AGT AGC CTA GAA GCT AAG CAT TAT 915 
Leu Glu Phe Met Gln Arg Glu Met Ser Ser Leu Glu Ala Lys His Tyr
285 290 295

GAT TCA GTT AAA ATC AAA TTT GGA TTG GTA CGC AGA TTA TTT GAA TTT 963
Asp Ser Val Lys Ile Lys Phe Gly Leu Val Arg Arg Leu Phe Glu Phe
300 305 310

TTT AGA TCG CTT TGG GGA AAA AAT GGA AGA ATC CAA AGA GCG AAA ACA 1011
Phe Arg Ser Leu Trp Gly Lys Asn Gly Arg Ile Gln Arg Ala Lys Thr
315 320 325 330

ACT CCT GAT CGC TTC CCT TGC ACC TCT TGC GGG CTT TGC TGC AAG AAT 1059
Thr Pro Asp Arg Phe Pro Cys Thr Ser Cys Gly Leu Cys Cys Lys Asn
335 340 345

ATC GCC GGG ATT ATT GAG CTT ATT GGG TTT GAT GCT GGC AAT GGG GTG 1107
Ile Ala Gly Ile Ile Glu Leu Ile Gly Phe Asp Ala Gly Asn Gly Val
350 355 360

TGC AAA TTT TTG GAT TTA GAA ACC AAT CTG TGC AAG ATT TAT GAA TCG 1155
Cys Lys Phe Leu Asp Leu Glu Thr Asn Leu Cys Lys Ile Tyr Glu Ser
365 370 375

CGC CCG TTA ATT TGC AGG ATT GAT GAA GCG CAC AAA AAG CTT TAT CCC 1203
Arg Pro Leu Ile Cys Arg Ile Asp Glu Ala His Lys Lys Leu Tyr Pro
380 385 390

CAC ATC CCG CTT AAG GAG TTT TAT GCC AAA AAC GCA GAG GTT TGT AAC 1251
His Ile Pro Leu Lys Glu Phe Tyr Ala Lys Asn Ala Glu Val Cys Asn
395 400 405 410

GCT TTG CAA GAA GCA AAC CAT ATG GAT AAG AGC TTT AGG GTT ATT CTT 1299
Ala Leu Gln Glu Ala Asn His Met Asp Lys Ser Phe Arg Val Ile Leu
415 420 425

AAG AAA TAATTTAGAA TTTATTGTCC CA 1327
Lys Lys

(2) INFORMATION FOR SEQ ID NO: 838:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 428 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838:

Met Pro Leu Pro Phe Ile Ile Ala Ala Gly Val Ala Leu Val Ala Ala
1 5 10 15
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TCT GAT TGT GAG TTT GOT CTT GCG AGA TTT GAA GAG AAA AGG TGC TAT 24 3 

Ser Asp Cys Glu Phe Ala Leu Ala Arg Phe Glu Glu Lys Arg Cys Tyr 
60 65 70 

ATT AGA AAT CAT GTA ATT TCA GAA TTT TTG CAC CAT TTT AAT CAA TTA 291
Ile Arg Asn His Val Ile Ser Glu Phe Leu His His Phe Asn Gln Leu
75 80 85 90

GAA GGA TTC GAG CTT ACC AAC AAA AAA GAT AGC ATG GAA AAT ATC CAA 339
Glu Gly Phe Glu Leu Thr Asn Lys Lys Asp Ser Met Glu Asn Ile Gln
95 100 105

CTC GAT GTA TCA AAT ACA CTA AAA ATT ATT GAT AAA AAT CTC AAG ATG 387
Leu Asp Val Ser Asn Thr Leu Lys Ile Ile Asp Lys Asn Leu Lys Met
110 115 120

AGC TCT TTT GAC ACC CTT GGT GCC GTT GGA AAT GTT GTG GGA GGT TTT 435
Ser Ser Phe Asp Thr Leu Gly Ala Val Gly Asn Val Val Gly Gly Phe
125 130 135

TCT ATG GGA TTT GGT TTG GCT GCT GGA GGT ATA GTT GGA AGT GTA GGG 483
Ser Met Gly Phe Gly Leu Ala Ala Gly Gly Ile Val Gly Ser Val Gly
140 145 150

CTT TTA GCC GGA CCC ACA CTC GCT ATT TTT GGA GCT TTG AGA GCT GCT 531
Leu Leu Ala Gly Pro Thr Leu Ala Ile Phe Gly Ala Leu Arg Ala Ala
155 160 165 170

GAA ATG GAA AAA AAA TTA GAA GAT GCT AAG GCT TAT TGC TCT CAA GTT 579
Glu Met Glu Lys Lys Leu Glu Asp Ala Lys Ala Tyr Cys Ser Gln Val
175 180 185

GAA GCA GCC GTC AAA AAA GCC GAT GCG ATG ATT GAT AAT CTT CAA GCC 627
Glu Ala Ala Val Lys Lys Ala Asp Ala Met Ile Asp Asn Leu Gln Ala
190 195 200

GTT AGG AAA ATG GCA GAT CTT TTC ACT AGG CAG ATC ACA AAA TTT GAC 675
Val Arg Lys Met Ala Asp Leu Phe Thr Arg Gln Ile Thr Lys Phe Asp
205 210 215

GCA CTG TTT TTC TCG CTT GCT CAA GAG GCA ATC GCC ACG ATG AAA AAG 723
Ala Leu Phe Phe Ser Leu Ala Gln Glu Ala Ile Ala Thr Met Lys Lys
220 225 230

CAC AAC TAC GAT TTT TCG CAT TAC AAT CAA AAA GAA CAA GAT CAG CTA 771
His Asn Tyr Asp Phe Ser His Tyr Asn Gln Lys Glu Gln Asp Gln Leu
235 240 245 250

GCT ACT GCT TCT TCA ACC CTT AAA ACT TTG GGT GCT TTT TTG AAA GTG 819
Ala Thr Ala Ser Ser Thr Leu Lys Thr Leu Gly Ala Phe Leu Lys Val
255 260 265

CCT ATC ATG GAC AAA CAC CAA AAG CTC AAT GAA GCT ACA CAA AGT AAG 867
Pro Ile Met Asp Lys His Gln Lys Leu Asn Glu Ala Thr Gln Ser Lys
270 275 280
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Gly Gin Asn Leu Gly Ala Asn Lys Pro Lys lie Ala Thr Glu Tyr Ala 

245 250 255 

His Leu Ile Leu Lys Ile Ser Met Gly Leu Met Gly Val Leu Gly Ile
260 265 270

Val Leu Val Leu Phe Ala Lys Glu Phe Ala Ser Leu Phe Ser Gln Asp
275 280 285

Glu Glu Val Leu Glu Val Ala Arg Ser Tyr Leu Ile Ala Val Gly Leu
290 295 300

Ser Gln Ala Pro Leu Ile Gly Tyr Phe Val Leu Asp Gly Val Phe Arg
305 310 315 320

Gly Ala Gly Ile Ser Lys Val Ser Leu Tyr Ile Asn Thr Leu Ser Leu
325 330 335

Trp Gly Leu Arg Ile Met Pro Ile Tyr Leu Leu Leu Ile His His Phe
340 345 350

Lys Val Glu Phe Ile Phe Val Val Ile Ala Ser Glu Thr Phe Leu Arg
355 360 365

Ser Phe Ile Tyr Tyr Lys Val Phe Ser Lys Gly Ile Trp Lys Arg Cys
370 375 380

Gly Lys Lys Ala
385

(2) INFORMATION FOR SEQ ID NO: 837:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22... 1305 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837: 



TAAACATCTA AAGGATAAAA C ATG CCA TTG CCA TTT ATT ATA GCA GCT GGA 51
Met Pro Leu Pro Phe Ile Ile Ala Ala Gly
1 5 10

GTG GCC TTA GTG GCC GCA GNA TAC GGA GTT AAA AAA AAA GTT GAT GCA 99
Val Ala Leu Val Ala Ala Xaa Tyr Gly Val Lys Lys Lys Val Asp Ala
15 20 25

GAC ATT CTC ACT GAA GAG ACC AAT GAA TAT ATT AAG TAT ATC AAT GAA 147
Asp Ile Leu Ser Glu Glu Thr Asn Glu Tyr Ile Lys Tyr Ile Asn Glu
30 35 40

GGC AAT GAC TTG CTA GAG GAA GCA GAA GAA GTT ATT AAA GCT GTG GCT 195
Gly Asn Asp Leu Leu Glu Glu Ala Glu Glu Val Ile Lys Ala Val Ala
45 50 55
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CAT CAT TTT AAG GTG GAA TTT ATT TTT GTA GTG ATC GCA TCA GAA ACT 1167 
His His Phe Lys Val Glu Phe Ile Phe Val Val Ile Ala Ser Glu Thr
350 355 360 365

TTT TTG CGC TCA TTC ATC TAT TAT AAA GTT TTT TCT AAA GGC ATT TGG 1215
Phe Leu Arg Ser Phe Ile Tyr Tyr Lys Val Phe Ser Lys Gly Ile Trp
370 375 380

AAA AGG TGC GGG AAA AAG GCT TGATTATTGC TTGAGCGTAG CGGT 1260
Lys Arg Cys Gly Lys Lys Ala
385

(2) INFORMATION FOR SEQ ID NO: 836:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836:

Met Leu Phe Tyr Gly Ile Asn Thr Ile Leu Tyr Thr Gly Thr Asn Ala
1 5 10 15

Ile Leu Ser Arg Leu Val Gly Ala Arg Asp Phe Thr Gln Ile Asn His
20 25 30

Ala Phe Ser Ser Ile Phe Ile Gly Ala Phe Met Ile Cys Leu Gly Val
35 40 45

Leu Phe Val Ser Tyr Phe Leu Ile Glu Pro Phe Leu Asn Trp Met Gln
50 55 60

Leu Gln Asp Pro Ser Arg Gln Leu Thr Gln Asp Tyr Leu Glu Val Leu
65 70 75 80

Val Val Ala Leu Pro Ser Ile Phe Leu Lys Asn Ile Leu Val Ser Ala
85 90 95

Leu Ala Ser Phe Ser Asp Thr Leu Thr Pro Phe Ile Val Lys Ile Ile
100 105 110

Met Val Ile Ala Cys Ile Phe Leu Asn Gln Ala Leu Ile Phe Gly Asp
115 120 125

Phe Gly Phe Lys Glu Met Gly Ile Val Gly Ser Ala Leu Ala Asn Val
130 135 140

Val Val Ser Tyr Leu Glu Leu Leu Ala Leu Gly Val Trp Ile Gln Ile
145 150 155 160

Lys Lys Ile Pro Leu Lys Phe Lys Ile Thr Phe His Phe Ser Phe Leu
165 170 175

Lys Thr Met Phe Arg Val Gly Trp Pro Ala Gly Phe Glu Arg Leu Leu
180 185 190

Ser Leu Phe Ser Leu Ile Leu Leu Ser Lys Phe Val Ala Ser Tyr Gly
195 200 205

Asp Lys Val Leu Ala Gly Met Gln Ile Gly Ile Arg Val Glu Thr Phe
210 215 220

Ser Phe Met Pro Gly Phe Gly Phe Met Ile Ala Ala Met Val Leu Thr
225 230 235 240
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Phe Gly Asp Phe Gly Phe Lys Glu Met Gly lie Val Gly Ser Ala Leu 
130 135 140 

GCG AAT GTG GTT GTC TCT TAT TTG GAA TTA CTC GCA CTT GGC GTT TGG 543
Ala Asn Val Val Val Ser Tyr Leu Glu Leu Leu Ala Leu Gly Val Trp
145 150 155

ATA CAA ATC AAA AAA ATC CCT TTA AAA TTC AAA ATA ACC TTT CAT TTT 591
Ile Gln Ile Lys Lys Ile Pro Leu Lys Phe Lys Ile Thr Phe His Phe
160 165 170

TCT TTT TTA AAA ACC ATG TTT AGA GTG GGT TGG CCA GCC GGG TTT GAG 639
Ser Phe Leu Lys Thr Met Phe Arg Val Gly Trp Pro Ala Gly Phe Glu
175 180 185

CGC TTA TTG AGT TTA TTT TCT TTA ATC CTC TTA TCC AAA TTT GTA GCG 687
Arg Leu Leu Ser Leu Phe Ser Leu Ile Leu Leu Ser Lys Phe Val Ala
190 195 200 205

AGC TAT GGG GAT AAA GTG TTA GCG GGC ATG CAA ATA GGC ATT AGG GTT 735
Ser Tyr Gly Asp Lys Val Leu Ala Gly Met Gln Ile Gly Ile Arg Val
210 215 220

GAA ACC TTT TCG TTC ATG CCC GGA TTT GGG TTT ATG ATC GCA GCG ATG 783
Glu Thr Phe Ser Phe Met Pro Gly Phe Gly Phe Met Ile Ala Ala Met
225 230 235

GTT TTA ACA GGG CAA AAT TTA GGG GCA AAC AAG CCA AAG ATC GCC ACA 831
Val Leu Thr Gly Gln Asn Leu Gly Ala Asn Lys Pro Lys Ile Ala Thr
240 245 250

GAA TAC GCG CAT TTG ATT TTA AAA ATC TCT ATG GGT TTA ATG GGG GTT 879
Glu Tyr Ala His Leu Ile Leu Lys Ile Ser Met Gly Leu Met Gly Val
255 260 265

TTA GGG ATT GTT TTA GTC TTA TTC GCT AAA GAA TTT GCG AGC CTT TTT 927
Leu Gly Ile Val Leu Val Leu Phe Ala Lys Glu Phe Ala Ser Leu Phe
270 275 280 285

TCT CAA GAT GAA GAA GTC TTG GAA GTG GCG CGA TCT TAT TTG ATC GCT 975
Ser Gln Asp Glu Glu Val Leu Glu Val Ala Arg Ser Tyr Leu Ile Ala
290 295 300

GTG GGC CTC TCT CAA GCC CCC TTA ATT GGG TAT TTT GTG CTA GAT GGA 1023
Val Gly Leu Ser Gln Ala Pro Leu Ile Gly Tyr Phe Val Leu Asp Gly
305 310 315

GTT TTT AGA GGG GCT GGC ATT TCT AAA GTC TCA CTG TAT ATT AAC ACC 1071
Val Phe Arg Gly Ala Gly Ile Ser Lys Val Ser Leu Tyr Ile Asn Thr
320 325 330

CTA AGC TTA TGG GGG TTA AGG ATC ATG CCC ATT TAC TTG CTT TTA ATT 1119
Leu Ser Leu Trp Gly Leu Arg Ile Met Pro Ile Tyr Leu Leu Leu Ile
335 340 345
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210 215 220 

(2) INFORMATION FOR SEQ ID NO: 835:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1260 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 73...1236
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835:

CTCTCGGTTT TTTTTGTGGG TAAGATTTCG CACCATCACA TTGTGGCTTT AGGGGTGGGC 60

TTGCAATTTT TG ATG CTT TTT TAT GGC ATC AAC ACG ATT TTA TAC ACC GGC 111
Met Leu Phe Tyr Gly Ile Asn Thr Ile Leu Tyr Thr Gly
1 5 10

ACT AAC GCC ATT CTT TCT AGG CTT GTG GGG GCT AGG GAT TTT ACT CAA 159
Thr Asn Ala Ile Leu Ser Arg Leu Val Gly Ala Arg Asp Phe Thr Gln
15 20 25

ATC AAC CAC GCT TTT TCC AGT ATT TTC ATA GGG GCT TTT ATG ATC TGT 207
Ile Asn His Ala Phe Ser Ser Ile Phe Ile Gly Ala Phe Met Ile Cys
30 35 40 45

TTG GGC GTG CTG TTT GTT TCT TAT TTT TTG ATT GAG CCT TTT TTA AAT 255
Leu Gly Val Leu Phe Val Ser Tyr Phe Leu Ile Glu Pro Phe Leu Asn
50 55 60

TGG ATG CAA TTA CAA GAT CCT TCG CGC CAA TTG ACG CAA GAT TAT TTA 303
Trp Met Gln Leu Gln Asp Pro Ser Arg Gln Leu Thr Gln Asp Tyr Leu
65 70 75

GAA GTC TTA GTT GTA GCG CTA CCG AGT ATT TTT TTA AAA AAT ATT TTA 351
Glu Val Leu Val Val Ala Leu Pro Ser Ile Phe Leu Lys Asn Ile Leu
80 85 90

GTT TCA GCG CTC GCT AGT TTT TCA GAC ACC CTA ACC CCC TTT ATT GTC 399
Val Ser Ala Leu Ala Ser Phe Ser Asp Thr Leu Thr Pro Phe Ile Val
95 100 105

AAA ATC ATC ATG GTC ATT GCA TGC ATT TTT TTG AAT CAA GCC TTG ATT 447
Lys Ile Ile Met Val Ile Ala Cys Ile Phe Leu Asn Gln Ala Leu Ile
110 115 120 125

TTT GGG GAT TTT GGT TTT AAA GAA ATG GGG ATT GTA GGC TCT GCT TTA 495
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165 170 175 

CGT TTT AAA GGG GAT TTT AAA GTG GTT TTT AGC CCT GAA ATT CCG CAT 576
Arg Phe Lys Gly Asp Phe Lys Val Val Phe Ser Pro Glu Ile Pro His
180 185 190

TGC ATA GAG CTT GGG AGT TTT AAT GCG GTT ACG GCG AGT TTT GGT TTG 624
Cys Ile Glu Leu Gly Ser Phe Asn Ala Val Thr Ala Ser Phe Gly Leu
195 200 205

CAA ATA GCG AGT GAA GTC GTG CAA GAG ATT ATC AAC GAT AAA AGG AAG T 673
Gln Ile Ala Ser Glu Val Val Gln Asp Ile Ile Asn Asp Lys Arg Lys
210 215 220

GAGATGAAAG ATTACGAAGA CGAATTGG 701

(2) INFORMATION FOR SEQ ID NO: 834:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 224 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834:

Arg Leu Ser Glu Pro
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...672
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:833: 

AGG TTG TCT GAA CCC ATA GAT AGA TTC ACG CGC ATA AGG TGG TTG TTT 48
Arg Leu Ser Glu Pro Ile Asp Arg Phe Thr Arg Ile Arg Trp Leu Phe
1 5 10 15

AAA AAC GAT TTT GAA AAA ATC CGC CAA CAA AGG GTT TTA ATC TGT GGC 96
Lys Asn Asp Phe Glu Lys Ile Arg Gln Gln Arg Val Leu Ile Cys Gly
20 25 30

GTG GGG GGC GTT GGG GGC TTT GCG CTA GAC GCT TTG TAT CGT GTG GGG 144
Val Gly Gly Val Gly Gly Phe Ala Leu Asp Ala Leu Tyr Arg Val Gly
35 40 45

ATA GGG CAA ATC ACT ATC ATT GAT AAA GAC GTG TTT GAT GTT ACC AAT 192
Ile Gly Gln Ile Thr Ile Ile Asp Lys Asp Val Phe Asp Val Thr Asn
50 55 60

CAA AAC CGC CAG ATT GGC TCA GAA AGG ATA GGA GAA TCT AAA GTG TTG 240
Gln Asn Arg Gln Ile Gly Ser Glu Arg Ile Gly Glu Ser Lys Val Leu
65 70 75 80

GTG TTG CAA GAT CTC TAT AAG GGC ATT CAA GCT TTG AAC TTG CAT ATA 288
Val Leu Gln Asp Leu Tyr Lys Gly Ile Gln Ala Leu Asn Leu His Ile
85 90 95

GAT GAA GCG TTT TTA AAT TCA TTT AAT TTT AGA GAT TAT GAT TAC ATT 336
Asp Glu Ala Phe Leu Asn Ser Phe Asn Phe Arg Asp Tyr Asp Tyr Ile
100 105 110

TTA GAT TGC ATG GAC GAT TTG CCT ATT AAA ACA AGC TTA GCG ATA AAA 384
Leu Asp Cys Met Asp Asp Leu Pro Ile Lys Thr Ser Leu Ala Ile Lys
115 120 125

TGC CAG AAT TTC GCT TAC GGA AAA TTT ATC AGC TCT ATG GGG AGT GCG 432
Cys Gln Asn Phe Ala Tyr Gly Lys Phe Ile Ser Ser Met Gly Ser Ala
130 135 140

AAA CGC TTG AAC CCT AAA CAC ATC CAA GTG GGG AGC GTG TGG GAA AGC 480
Lys Arg Leu Asn Pro Lys His Ile Gln Val Gly Ser Val Trp Glu Ser
145 150 155 160

TAT GGC GAT AAA TTC GGG CGT AAA TTT AGG GAT TTT TTA AAA AAA CGC 528
Tyr Gly Asp Lys Phe Gly Arg Lys Phe Arg Asp Phe Leu Lys Lys Arg
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GCGATATTGC TAAAGATATC CAA 90 0 

(2) INFORMATION FOR SEQ ID NO: 832:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832: 

Met Gly Lys Phe Met Asn Thr Leu Lys Lys His Leu Ala Phe Ile Ile
1 5 10 15

Pro Leu Val Ala Leu Leu Phe Ser Leu Glu Cys Val Leu Phe Ile Asn
20 25 30

Gln Ala Ile Glu Gln Lys Glu Lys Lys Leu Ile Glu Asp Tyr Ser Val
35 40 45

Val Leu Ala Ser Thr Gln Lys Leu Asn Leu Glu Leu Leu Arg Gln Asn
50 55 60

Phe Ser Glu Ile Ile Ala Leu Lys Glu Ile Asp Pro Asn Tyr Ser Leu
65 70 75 80

Glu Pro Leu Gln Lys Thr Leu Gly Ile Asp Gly Leu Lys Glu Leu Arg
85 90 95

Lys Asn Leu Pro Phe Phe Tyr Ser Leu Gln Leu Ser Thr Phe Pro Thr
100 105 110

Gln Glu Arg Leu Glu Asn Ile Lys Glu Lys Leu Leu Lys Ile Pro Gly
115 120 125

Val Gln Lys Val Glu Val Phe Ala Lys Thr Tyr Met Gln Val Tyr Asp
130 135 140

Leu Leu Ser Phe Ile Lys Thr Ala Val Tyr Ile Phe Ala Leu Val Val
145 150 155 160

Phe Val Leu Ser Val Leu Leu Met Phe Lys Gln Val Arg Ile Trp Ile
165 170 175

Tyr Gln Tyr His Glu Arg Leu Glu Ile Met Asp Leu Leu Gly Ala Ser
180 185 190

Val Ser Phe Lys Asn Gly Phe Leu Tyr Lys Ile Ala Leu Met Asp Ser
195 200 205

Val Ile Ala Ser Phe Leu Ala Pro Met Leu Met Leu Tyr Thr Thr Ser
210 215 220

Gln Lys Gly Phe Glu Lys Thr Met Asp Thr Leu Gly Ile Ile Gly Gly
225 230 235 240

Ala Phe Val Leu Asn His Phe Leu Trp Gly Leu Leu Phe Ser Leu Val
245 250 255

Val Ser Phe Val Ser Val Leu Leu Val Ala Trp Arg Thr Arg His Val
260 265 270

(2) INFORMATION FOR SEQ ID NO: 833:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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AAT TTT AGC GAA ATC ATA GCG TTA AAA GAA ATT GAT CCT AAT TAT TCT 2 42 

Asn Phe Ser Glu Ile Ile Ala Leu Lys Glu Ile Asp Pro Asn Tyr Ser
65 70 75

TTA GAA CCT CTT CAA AAA ACC TTA GGC ATA GAT GGG CTT AAG GAA TTA 290
Leu Glu Pro Leu Gln Lys Thr Leu Gly Ile Asp Gly Leu Lys Glu Leu
80 85 90 95

AGA AAA AAT TTG CCC TTT TTT TAT TCT TTA CAA CTT TCC ACA TTC CCC 338
Arg Lys Asn Leu Pro Phe Phe Tyr Ser Leu Gln Leu Ser Thr Phe Pro
100 105 110

ACT CAA GAG CGT TTA GAA AAC ATT AAA GAA AAA TTG CTC AAA ATC CCT 386
Thr Gln Glu Arg Leu Glu Asn Ile Lys Glu Lys Leu Leu Lys Ile Pro
115 120 125

GGC GTT CAA AAA GTT GAA GTC TTT GCC AAA ACT TAC ATG CAA GTG TAT 434
Gly Val Gln Lys Val Glu Val Phe Ala Lys Thr Tyr Met Gln Val Tyr
130 135 140

GAT CTC TTG AGT TTT ATT AAA ACA GCG GTC TAT ATC TTT GCG TTA GTG 482
Asp Leu Leu Ser Phe Ile Lys Thr Ala Val Tyr Ile Phe Ala Leu Val
145 150 155

GTC TTT GTT TTA TCG GTT TTA TTG ATG TTT AAA CAA GTC CGC ATC TGG 530
Val Phe Val Leu Ser Val Leu Leu Met Phe Lys Gln Val Arg Ile Trp
160 165 170 175

ATC TAT CAA TAC CAT GAG AGA TTA GAG ATC ATG GAT TTA TTA GGG GCT 578
Ile Tyr Gln Tyr His Glu Arg Leu Glu Ile Met Asp Leu Leu Gly Ala
180 185 190

TCG GTG TCT TTT AAA AAC GGG TTT TTG TAT AAA ATA GCT TTA ATG GAT 626
Ser Val Ser Phe Lys Asn Gly Phe Leu Tyr Lys Ile Ala Leu Met Asp
195 200 205

TCT GTA ATC GCT AGT TTT TTA GCC CCC ATG CTC ATG CTC TAT ACC ACT 674
Ser Val Ile Ala Ser Phe Leu Ala Pro Met Leu Met Leu Tyr Thr Thr
210 215 220

TCG CAA AAA GGT TTT GAA AAA ACG ATG GAT ACT TTG GGT ATT ATA GGA 722
Ser Gln Lys Gly Phe Glu Lys Thr Met Asp Thr Leu Gly Ile Ile Gly
225 230 235

GGC GCG TTT GTT TTA AAC CAT TTT TTA TGG GGA CTG CTT TTT AGC CTT 770
Gly Ala Phe Val Leu Asn His Phe Leu Trp Gly Leu Leu Phe Ser Leu
240 245 250 255

GTG GTC TCA TTT GTT TCT GTT TTA CTT GTA GCT TGG AGG ACT AGG CAT 818
Val Val Ser Phe Val Ser Val Leu Leu Val Ala Trp Arg Thr Arg His
260 265 270

GTA TAAATTAGGG GTGTTTTTGT TAGCCACCTT ACTATCAGCT AACACGCAAA AAGTGA 877
Val
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(2) INFORMATION FOR SEQ ID NO: 831: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 6...821
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831: 

AGAAG ATG GGG AAG TTT ATG AAT ACT CTT AAA AAG CAT TTA GCC TTT ATC 50
Met Gly Lys Phe Met Asn Thr Leu Lys Lys His Leu Ala Phe Ile
1 5 10 15

ATT CCC CTA GTA GCG TTA TTG TTT AGC TTG GAG TGC GTG TTA TTT ATC 98
Ile Pro Leu Val Ala Leu Leu Phe Ser Leu Glu Cys Val Leu Phe Ile
20 25 30

AAT CAA GCG ATC GAA CAG AAA GAA AAA AAA TTG ATT GAA GAT TAT TCG 146
Asn Gln Ala Ile Glu Gln Lys Glu Lys Lys Leu Ile Glu Asp Tyr Ser
35 40 45

GTC GTG TTG GCC AGC ACG CAA AAA TTA AAC TTG GAA TTG TTG CGT CAA 194
Val Val Leu Ala Ser Thr Gln Lys Leu Asn Leu Glu Leu Leu Arg Gln
50 55 60
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55 60 » 65 

ATA CCC TTT AGC GAT GCA CTC GCT ATC GCG CCC ATT CAA GCA GGA ATG 295
Ile Pro Phe Ser Asp Ala Leu Ala Ile Ala Pro Ile Gln Ala Gly Met
70 75 80

ATC TAC AAA ATG AAT GAC GCT TTT GGA ATG GAT TTG GAT AAA TCT GTA 343
Ile Tyr Lys Met Asn Asp Ala Phe Gly Met Asp Leu Asp Lys Ser Val
85 90 95

GCC GCA TCA TTA ATC ACC GGA TTG TTA GGC GTA ACC GCT GTC GCG CAA 391
Ala Ala Ser Leu Ile Thr Gly Leu Leu Gly Val Thr Ala Val Ala Gln
100 105 110 115

GTG GGG AGA ACG CTT GTT AAT GGT TTC CTT AAA TTC ATT CCT GTT GTG 439
Val Gly Arg Thr Leu Val Asn Gly Phe Leu Lys Phe Ile Pro Val Val
120 125 130

GGG AGT GTT GCA GGG GGC ACA ACC GCT GTA ATT ATC ACA GAA GGC ATT 487
Gly Ser Val Ala Gly Gly Thr Thr Ala Val Ile Ile Thr Glu Gly Ile
135 140 145

GGG TTT GCG TAT TTG AAA GTG CTA GAA AAG TGC TTT AAT GAT GAG ACG 535
Gly Phe Ala Tyr Leu Lys Val Leu Glu Lys Cys Phe Asn Asp Glu Thr
150 155 160

GGC GAA GTC AAT TTG CCT GAT GAA GTT GGC ATG ATA ACT TCT CTC TTT 583
Gly Glu Val Asn Leu Pro Asp Glu Val Gly Met Ile Thr Ser Leu Phe
165 170 175

AAG GAG AAT TAT CTC AAC TTG GAT ACA ATC AAG AAA TTA ACA CAA TAAGA 633
Lys Glu Asn Tyr Leu Asn Leu Asp Thr Ile Lys Lys Leu Thr Gln
180 185 190

TTAGGGGTTA TGAAAAACGC ATGGCATTAG ACAAAAGGAT TTGGATGCAT TTT 686

(2) INFORMATION FOR SEQ ID NO: 830:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830:

Met Glu Val Pro Ile Glu Gly Leu Glu Glu Leu Val Asp Glu Thr Lys
1 5 10 15

Lys Cys Leu Ile Glu Ala Lys Lys Asn Lys Gln Asn His Phe Leu Leu
20 25 30

Ile Gln Lys Ala Asn Ile Gln Ala Arg Lys Gln Ala Met Ile Asp Glu
35 40 45

Ser Lys Thr Ile Ile His Val Ala Ser Gly Ala Ala Gly Ala Ala Gly
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165 170 175 

Cys Phe Ser Tyr Val Glu Glu Pro Asn Lys Gln Glu Phe Leu Asp Phe
180 185 190

Lys Gln Lys Tyr Leu Asp Phe Ser Arg Asn Lys Ala Pro Lys Ala Asn
195 200 205

Leu Ser Asn Ala Leu Glu Glu Gln Leu Lys Glu Asn Phe Lys Lys Leu
210 215 220

Tyr Asn Asp Leu Phe Asp Gly Ala Ile Ile Arg Cys Asp Phe Phe Val
225 230 235 240

Ile Lys Asn Glu Val Tyr Leu Asn Glu Ile Asn Pro Ile Pro Gly Ser
245 250 255

Leu Ala Asn Tyr Leu Phe Asp Asp Phe Lys Thr Thr Leu Glu Asn Leu
260 265 270

Ala Gln Ser Leu Pro Lys Thr Pro Lys Ile Gln Ile Lys Asn Ser Tyr
275 280 285

Leu Leu Gln Ile Gln Lys Asn Lys
290 295

(2) INFORMATION FOR SEQ ID NO: 829:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 47... 628 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829:

CAAAGCCTAT GCGAGAGTCA ATTCCGTTGC ATTTTCATTT AGAGGC ATG GAA GTC 55
Met Glu Val
1

CCC ATT GAA GGT TTA GAA GAA TTG GTA GAT GAA ACG AAA AAA TGC TTG 103
Pro Ile Glu Gly Leu Glu Glu Leu Val Asp Glu Thr Lys Lys Cys Leu
5 10 15

ATA GAA GCT AAG AAA AAC AAA CAA AAC CAT TTC TTG CTG ATT CAA AAA 151
Ile Glu Ala Lys Lys Asn Lys Gln Asn His Phe Leu Leu Ile Gln Lys
20 25 30 35

GCT AAC ATC CAA GCA AGA AAA CAA GCC ATG ATA GAT GAA AGT AAA ACC 199
Ala Asn Ile Gln Ala Arg Lys Gln Ala Met Ile Asp Glu Ser Lys Thr
40 45 50

ATT ATC CAT GTT GCA TCA GGA GCG GCT GGA GCG GCC GGG CTT ATC CCC 247
Ile Ile His Val Ala Ser Gly Ala Ala Gly Ala Ala Gly Leu Ile Pro
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AAA CTC TAT AAC GAT TTG TTT GAT GGC GCG ATC ATT COT TGC GAT TTT 7 83 

Lys Leu Tyr Asn Asp Leu Phe Asp Gly Ala Ile Ile Arg Cys Asp Phe
225 230 235

TTT GTC ATA AAA AAT GAA GTG TAT CTT AAT GAG ATC AAC CCC ATT CCT 831
Phe Val Ile Lys Asn Glu Val Tyr Leu Asn Glu Ile Asn Pro Ile Pro
240 245 250

GGC AGT TTG GCC AAT TAT TTG TTT GAT GAT TTT AAA ACA ACG CTA GAA 879
Gly Ser Leu Ala Asn Tyr Leu Phe Asp Asp Phe Lys Thr Thr Leu Glu
255 260 265 270

AAT TTA GCG CAA TCA TTA CCC AAA ACC CCT AAG ATC CAA ATC AAA AAC 927
Asn Leu Ala Gln Ser Leu Pro Lys Thr Pro Lys Ile Gln Ile Lys Asn
275 280 285

TCT TAT TTG TTG CAA ATC CAA AAG AAT AAG TAATGGCCAA ACGCAGTATC GCT 980
Ser Tyr Leu Leu Gln Ile Gln Lys Asn Lys
290 295

TATTTGGATA GCGTTTTTGA CATTTCCTAC ACTTTTATAG A 1021

(2) INFORMATION FOR SEQ ID NO: 828:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 amino acids

(B) TYPE : amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828:

Met His Ser Lys Tyr Phe Ala Gln Ile Lys Glu Lys Lys Leu Pro Pro
1 5 10 15

Leu Ile Leu Thr His Asn Gly Leu Leu Lys Asn Ser Phe Leu Gly Ala
20 25 30

Lys Ile Ile Glu Leu Pro Leu Val Ile Asn Leu Val His Gly Gly Asp
35 40 45

Gly Glu Asp Gly Lys Leu Ala Ser Leu Leu Glu Phe Tyr Arg Ile Ala
50 55 60

Phe Ile Gly Pro Arg Ile Glu Ala Ser Val Leu Ser Tyr Asn Lys Tyr
65 70 75 80

Leu Thr Lys Leu Tyr Ala Lys Asp Leu Gly Val Lys Thr Leu Asp His
85 90 95

Val Leu Leu Asn Glu Lys Asn Arg Ala Asn Ala Leu Asp Leu Met Asn
100 105 110

Phe Asn Phe Pro Phe Ile Ile Lys Pro Asn Asn Ala Gly Ser Ser Leu
115 120 125

Gly Val Asn Val Val Lys Glu Glu Lys Glu Leu Val Tyr Ala Leu Asp
130 135 140

Gly Ala Phe Glu Tyr Ser Lys Glu Val Leu Ile Glu Pro Phe Ile Gln
145 150 155 160

Gly Val Lys Glu Tyr Asn Leu Ala Gly Cys Lys Ile Lys Lys Asp Phe
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GAATCCAAC ATG CAT TCA AAA TAG TTC GOT CAA ATC AAA GAA AAA AAA TTA 111 
Met His Ser Lys Tyr Phe Ala Gln Ile Lys Glu Lys Lys Leu
1 5 10

CCT CCC CTA ATC CTC ACA CAC AAT GGC TTG CTT AAA AAC TCA TTT TTA 159
Pro Pro Leu Ile Leu Thr His Asn Gly Leu Leu Lys Asn Ser Phe Leu
15 20 25 30

GGT GCT AAG ATT ATA GAA TTG CCT TTA GTG ATC AAT CTC GTG CAT GGG 207
Gly Ala Lys Ile Ile Glu Leu Pro Leu Val Ile Asn Leu Val His Gly
35 40 45

GGC GAT GGC GAA GAT GGG AAA TTA GCG AGC TTG TTA GAA TTT TAT CGT 255
Gly Asp Gly Glu Asp Gly Lys Leu Ala Ser Leu Leu Glu Phe Tyr Arg
50 55 60

ATC GCT TTT ATA GGC CCT AGG ATT GAA GCG AGC GTG CTG AGT TAT AAC 303
Ile Ala Phe Ile Gly Pro Arg Ile Glu Ala Ser Val Leu Ser Tyr Asn
65 70 75

AAA TAT TTA ACC AAG CTT TAC GCC AAA GAC TTA GGG GTA AAG ACT TTA 351
Lys Tyr Leu Thr Lys Leu Tyr Ala Lys Asp Leu Gly Val Lys Thr Leu
80 85 90

GAT CAT GTT CTT TTG AAT GAA AAA AAC CGC GCT AAC GCC TTG GAT TTG 399
Asp His Val Leu Leu Asn Glu Lys Asn Arg Ala Asn Ala Leu Asp Leu
95 100 105 110

ATG AAC TTT AAT TTC CCT TTC ATA ATC AAG CCT AAT AAC GCC GGA AGC 447
Met Asn Phe Asn Phe Pro Phe Ile Ile Lys Pro Asn Asn Ala Gly Ser
115 120 125

TCT TTA GGG GTG AAT GTT GTG AAA GAA GAA AAA GAA TTG GTT TAC GCT 495
Ser Leu Gly Val Asn Val Val Lys Glu Glu Lys Glu Leu Val Tyr Ala
130 135 140

TTA GAC GGT GCG TTT GAA TAT TCT AAA GAG GTC TTG ATA GAG CCT TTC 543
Leu Asp Gly Ala Phe Glu Tyr Ser Lys Glu Val Leu Ile Glu Pro Phe
145 150 155

ATT CAG GGA GTG AAA GAA TAC AAT TTG GCC GGT TGC AAG ATC AAA AAG 591
Ile Gln Gly Val Lys Glu Tyr Asn Leu Ala Gly Cys Lys Ile Lys Lys
160 165 170

GAT TTT TGT TTT TCC TAT GTG GAA GAG CCT AAC AAA CAG GAA TTT TTA 639
Asp Phe Cys Phe Ser Tyr Val Glu Glu Pro Asn Lys Gln Glu Phe Leu
175 180 185 190

GAT TTC AAA CAA AAA TAT TTG GAT TTT TCA CGC AAT AAA GCC CCT AAA 687
Asp Phe Lys Gln Lys Tyr Leu Asp Phe Ser Arg Asn Lys Ala Pro Lys
195 200 205

GCG AAT CTT TCT AAC GCC CTA GAA GAG CAA TTA AAA GAA AAT TTT AAA 735
Ala Asn Leu Ser Asn Ala Leu Glu Glu Gln Leu Lys Glu Asn Phe Lys
210 215 220
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Asp Lys Glu Val Lys Gly Gly Leu Gly Val Val Ala Asp Asp Ala Leu 
40 45 50 

GCG GGT GTT TTA GCC GGA TTG AGC GCG TTA TTA GTC ATC CAT ATT TTA 249
Ala Gly Val Leu Ala Gly Leu Ser Ala Leu Leu Val Ile His Ile Leu
55 60 65

GGA TTT TTT AAC ATT AAA CTT TAATTTTAAG AAAAT 285
Gly Phe Phe Asn Ile Lys Leu
70 75



(2) INFORMATION FOR SEQ ID NO: 826: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826: 



Met Trp Leu Ala Met Ala Ile Ser Gly Leu Ser Leu Ala Gly Val Ile
1 5 10 15

Leu Ser Phe Ile Phe Phe Arg Ile Tyr Asp Ile Thr Lys Pro Ser Leu
20 25 30

Ile Gly Lys Ile Asp Lys Glu Val Lys Gly Gly Leu Gly Val Val Ala
35 40 45

Asp Asp Ala Leu Ala Gly Val Leu Ala Gly Leu Ser Ala Leu Leu Val
50 55 60

Ile His Ile Leu Gly Phe Phe Asn Ile Lys Leu
65 70 75



(2) INFORMATION FOR SEQ ID NO: 827: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70... 957 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827:

AAAGATAGGA TTAAATATTT TATTTTTTTA GATGAAAACC ATCATTTTTA TTTGATTGAA 60
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Asn Val Gly Phe Tyr Phe Asn Leu Lys Ser Glu Leu Lys Asn Gln Arg
210 215 220

Asn Asn Thr Thr Ser Tyr Thr Ala Pro Ile Leu His Thr Leu Gly Leu
225 230 235 240

Gln Arg Tyr Phe Glu Leu Val Gln Asn Leu Gly Gly Phe Glu Ala Leu
245 250 255

Tyr Arg Glu Thr Lys Lys Ala Ala Leu Ala Thr Gln Lys Ala Val Leu
260 265 270

Ala Leu Gly Leu Lys Ile Phe Pro Lys Ser Pro Ser Leu Ser Met Thr
275 280 285

Thr Ile Val Asn Glu His Ala Lys Glu Leu Arg Asn Leu Leu Lys Glu
290 295 300

Lys Tyr Gln Val Gln Phe Ala Gly Gly Gln Glu Pro Tyr Lys Asp Ala
305 310 315 320

Leu Ile Arg Ile Asn His Met Gly Ile Ile Pro Val Tyr Lys Ser Ala
325 330 335

Tyr Ala Leu Asn Ala Leu Glu Leu Ala Leu Asn Asp Leu Asp Leu Arg
340 345 350

Glu Phe Asp Gly Val Ala Asn Ala Thr Phe Leu Lys Gln Tyr Tyr Gly
355 360 365

Ile



(2) INFORMATION FOR SEQ ID NO: 825:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 46... 270 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:825: 

AAGAGGCATG ACAGCTCTTA CATTGTGATA GACGAATTAG TGGGC ATG TGG TTG GCG 57 

Met Trp Leu Ala 
1 

ATG GCG ATT AGC GGG TTA TCG TTA GCG GGT GTG ATC TTG AGT TTT ATC 105 
Met Ala Ile Ser Gly Leu Ser Leu Ala Gly Val Ile Leu Ser Phe Ile
5 10 15 20

TTT TTT AGG ATC TAT GAT ATT ACT AAA CCC TCA CTC ATT GGC AAG ATA 153
Phe Phe Arg Ile Tyr Asp Ile Thr Lys Pro Ser Leu Ile Gly Lys Ile
25 30 35

GAT AAA GAA GTT AAA GGG GGC TTA GGG GTT GTG GCT GAT GAC GCT TTA 201 
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Glu Pro Tyr Lys Asp Ala Leu lie Arg lie Asn His Met Gly lie lie 
315 320 325 330 

CCT GTT TAT AAA AGC GCT TAC GCT TTA AAC GCC CTA GAG TTA GCC CTA 1061
Pro Val Tyr Lys Ser Ala Tyr Ala Leu Asn Ala Leu Glu Leu Ala Leu
335 340 345

AAC GAC TTG GAT TTA AGG GAA TTT GAT GGC GTG GCG AAC GCA ACT TTT 1109
Asn Asp Leu Asp Leu Arg Glu Phe Asp Gly Val Ala Asn Ala Thr Phe
350 355 360

TTA AAG CAA TAT TAT GGA ATT TAAGGATCAC AATGCATTAT TCTTATGAAA CCTT 1164
Leu Lys Gln Tyr Tyr Gly Ile
365

TTTAAA 1170
(2) INFORMATION FOR SEQ ID NO: 824: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824:

Met Leu Leu Phe Thr Pro Gly Pro Val Ala Ile Asn Glu Glu Met Arg
1 5 10 15

Thr Ser Phe Ser Gln Pro Met Pro His His Arg Thr Lys Asp Phe Glu
20 25 30

Lys Ile Phe Gln Ser Val Arg Glu Asn Leu Lys Lys Met Thr Gly Leu
35 40 45

Glu Glu Val Leu Leu Leu Ser Ser Ser Gly Thr Gly Ala Met Glu Ala
50 55 60

Ser Val Ile Ser Leu Cys Gln Lys Glu Leu Leu Phe Val Asn Ala Gly
65 70 75 80

Lys Phe Gly Glu Arg Phe Gly Lys Ile Ala Lys Ala His Ser Ile Lys
85 90 95

Ala His Glu Leu Val Tyr Glu Trp Asp Thr Pro Ala Gln Val Asp Glu
100 105 110

Ile Leu Ser Val Leu Lys Ala Asn Pro Asn Ile Asp Ala Phe Cys Ile
115 120 125

Gln Ala Cys Glu Ser Ser Gly Gly Leu Arg His Pro Val Glu Lys Ile
130 135 140

Ala Gln Ala Ile Lys Glu Thr Asn Pro Asn Val Phe Val Ile Val Asp
145 150 155 160

Ala Ile Thr Ala Leu Gly Val Glu Pro Leu Glu Ile Thr His Val Asp
165 170 175

Ala Leu Ile Gly Gly Ser Gln Lys Ala Phe Met Leu Pro Pro Ala Met
180 185 190

Ser Leu Val Ala Leu Ser Gln Asn Ala Ile Glu Arg Ile Glu Glu Arg
195 200 205






95 100 105 

CCA GCT CAA GTA GAT GAA ATA TTA AGC GTT CTT AAA GCC AAC CCT AAC 389
Pro Ala Gln Val Asp Glu Ile Leu Ser Val Leu Lys Ala Asn Pro Asn
110 115 120

ATT GAT GCG TTT TGC ATT CAA GCA TGC GAG TCT AGT GGG GGG TTA CGA 437
Ile Asp Ala Phe Cys Ile Gln Ala Cys Glu Ser Ser Gly Gly Leu Arg
125 130 135

CAC CCT GTG GAA AAA ATC GCT CAA GCG ATC AAA GAA ACT AAC CCG AAT 485
His Pro Val Glu Lys Ile Ala Gln Ala Ile Lys Glu Thr Asn Pro Asn
140 145 150

GTT TTT GTA ATT GTA GAT GCT ATC ACC GCT TTA GGG GTT GAG CCT TTA 533
Val Phe Val Ile Val Asp Ala Ile Thr Ala Leu Gly Val Glu Pro Leu
155 160 165 170

GAA ATA ACG CAT GTT GAT GCG CTC ATT GGA GGG AGT CAA AAA GCG TTC 581
Glu Ile Thr His Val Asp Ala Leu Ile Gly Gly Ser Gln Lys Ala Phe
175 180 185

ATG CTG CCT CCT GCG ATG AGC CTA GTC GCA TTG AGC CAG AAT GCA ATT 629
Met Leu Pro Pro Ala Met Ser Leu Val Ala Leu Ser Gln Asn Ala Ile
190 195 200

GAG CGT ATA GAA GAA CGC AAT GTG GGG TTT TAT TTC AAT TTA AAG AGC 677
Glu Arg Ile Glu Glu Arg Asn Val Gly Phe Tyr Phe Asn Leu Lys Ser
205 210 215

GAA TTG AAA AAC CAA AGG AAT AAC ACC ACA AGC TAC ACC GCT CCT ATT 725
Glu Leu Lys Asn Gln Arg Asn Asn Thr Thr Ser Tyr Thr Ala Pro Ile
220 225 230

TTA CAC ACT TTA GGG TTG CAA CGC TAT TTT GAA TTG GTG CAA AAT TTA 773
Leu His Thr Leu Gly Leu Gln Arg Tyr Phe Glu Leu Val Gln Asn Leu
235 240 245 250

GGG GGC TTT GAA GCG CTC TAT AGA GAG ACT AAA AAA GCC GCT TTG GCC 821
Gly Gly Phe Glu Ala Leu Tyr Arg Glu Thr Lys Lys Ala Ala Leu Ala
255 260 265

ACT CAA AAA GCC GTT TTA GCT TTA GGT TTA AAG ATT TTC CCT AAA AGC 869
Thr Gln Lys Ala Val Leu Ala Leu Gly Leu Lys Ile Phe Pro Lys Ser
270 275 280

CCA AGC TTG AGC ATG ACA ACG ATT GTT AAT GAG CAT GCC AAA GAA TTG 917
Pro Ser Leu Ser Met Thr Thr Ile Val Asn Glu His Ala Lys Glu Leu
285 290 295

AGA AAC CTT TTA AAA GAA AAA TAC CAG GTG CAA TTT GCG GGC GGT CAA 965
Arg Asn Leu Leu Lys Glu Lys Tyr Gln Val Gln Phe Ala Gly Gly Gln
300 305 310

GAG CCT TAT AAA GAT GCG CTC ATT CGT ATC AAC CAC ATG GGG ATC ATT 1013 
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260 265 270 

Asn Leu His Ala Tyr Ala Lys Gly Glu Met Gln Ala Ser Ala Phe Arg
275 280 285

Asn Cys Val Asn Thr Leu Thr Glu Ile Lys Ser Met Arg Glu Ser Ile
290 295 300

Glu Glu Phe Phe Asn Gln Glu Met Leu Gln Ser Glu Val Pro Leu Trp
305 310 315 320

Val Glu Leu Asn Gln Lys Ser Val
325



(2) INFORMATION FOR SEQ ID NO:823: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 24... 1130 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823: 

TCAAAAATTA AAGGTTAATT TTA ATG TTG CTT TTC ACT CCA GGC CCT GTA GCC 53
Met Leu Leu Phe Thr Pro Gly Pro Val Ala
1 5 10

ATT AAT GAA GAG ATG CGC ACA AGC TTT TCT CAG CCA ATG CCC CAC CAC 101
Ile Asn Glu Glu Met Arg Thr Ser Phe Ser Gln Pro Met Pro His His
15 20 25

CGC ACT AAA GAT TTT GAA AAG ATT TTC CAA AGC GTG CGA GAA AAT TTG 149
Arg Thr Lys Asp Phe Glu Lys Ile Phe Gln Ser Val Arg Glu Asn Leu
30 35 40

AAA AAA ATG ACC GGT TTA GAA GAG GTT TTG CTT CTA AGC AGC AGC GGG 197 
Lys Lys Met Thr Gly Leu Glu Glu Val Leu Leu Leu Ser Ser Ser Gly 
45 50 55 

ACA GGG GCT ATG GAA GCG AGC GTG ATT TCC TTG TGT CAA AAA GAG TTG 245
Thr Gly Ala Met Glu Ala Ser Val Ile Ser Leu Cys Gln Lys Glu Leu
60 65 70

CTT TTT GTT AAT GCG GGC AAG TTT GGC GAA AGG TTT GGC AAG ATC GCT 293
Leu Phe Val Asn Ala Gly Lys Phe Gly Glu Arg Phe Gly Lys Ile Ala
75 80 85 90

AAA GCC CAT TCT ATC AAA GCC CAT GAA TTA GTC TAT GAA TGG GAC ACA 341
Lys Ala His Ser Ile Lys Ala His Glu Leu Val Tyr Glu Trp Asp Thr
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AGC ATG CGA GAG AGC ATA GAG GAA TTT TTT AAT CAA GAA ATG TTG CAA 963 
Ser Met Arg Glu Ser Ile Glu Glu Phe Phe Asn Gln Glu Met Leu Gln
300 305 310

AGT GAA GTG CCG TTA TGG GTA GAA TTG AAT CAA AAA AGC GTT TGAAAGCGC 1014
Ser Glu Val Pro Leu Trp Val Glu Leu Asn Gln Lys Ser Val
315 320 325

TTGTTTTTTT AGCCAGCTTG GGGG 1038

(2) INFORMATION FOR SEQ ID NO:822: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822: 

Met Asp Phe Lys Asn Lys Lys Trp Leu Phe Leu Ala Pro Leu Ala Gly
1 5 10 15

Tyr Thr Asp Leu Pro Phe Arg Ser Val Val Lys Lys Phe Gly Val Asp
20 25 30

Val Thr Thr Ser Glu Met Val Ser Ser His Ser Leu Val Tyr Ala Phe
35 40 45

Asp Lys Thr Ser Lys Met Leu Glu Lys Ser Pro Leu Glu Asp His Phe
50 55 60

Met Ala Gln Ile Ser Gly Ser Lys Glu Ser Val Val Lys Glu Ala Val
65 70 75 80

Glu Lys Ile Asn Ala Leu Glu His Val Asn Gly Ile Asp Phe Asn Cys
85 90 95

Gly Cys Pro Ala Pro Lys Val Ala Asn His Gly Asn Gly Ser Gly Leu
100 105 110

Leu Lys Asp Leu Asn His Leu Val Lys Leu Leu Lys Thr Ile Arg Glu
115 120 125

Asn Thr Ser Lys Lys Ile Thr Ser Val Lys Val Arg Leu Gly Phe Glu
130 135 140

Lys Lys Ile Pro Lys Glu Ile Ala His Ala Leu Asn Asp Ala Pro Val
145 150 155 160

Asp Tyr Val Val Val His Gly Arg Thr Arg Ser Asp Lys Tyr Gln Lys
165 170 175

Asp Lys Ile Asp Tyr Glu Ser Ile Ala Leu Met Lys Lys Ile Leu Lys
180 185 190

Lys Pro Val Ile Ala Asn Gly Glu Ile Asp Ser Val Lys Lys Ala Phe
195 200 205

Glu Val Leu Gln Ile Thr Gln Ala Asp Gly Leu Met Ile Gly Arg Ala
210 215 220

Ala Leu Arg Ala Pro Trp Ile Phe Trp Gln Ile Arg Asn Asn Thr Thr
225 230 235 240

Lys Leu Pro Ala Val Val Lys Lys Asp Leu Val Leu Glu His Phe Asp
245 250 255

Lys Met Val Glu Phe Tyr Gly Asp Met Gly Val Ile Met Phe Arg Lys
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GTA GTC AAA GAA GCG GTG GAG AAA ATC AAC GCT TTA GAG CAT GTG AAT 2 91 

Val Val Lys Glu Ala Val Glu Lys Ile Asn Ala Leu Glu His Val Asn
75 80 85 90

GGG ATT GAT TTT AAT TGC GGT TGT CCC GCT CCT AAA GTG GCT AAT CAT 339
Gly Ile Asp Phe Asn Cys Gly Cys Pro Ala Pro Lys Val Ala Asn His
95 100 105

GGT AAT GGT AGT GGG TTA TTG AAG GAT TTA AAC CAC TTA GTG AAG CTT 387
Gly Asn Gly Ser Gly Leu Leu Lys Asp Leu Asn His Leu Val Lys Leu
110 115 120

TTA AAA ACC ATC AGA GAA AAC ACT AGT AAA AAA ATC ACA AGC GTG AAA 435
Leu Lys Thr Ile Arg Glu Asn Thr Ser Lys Lys Ile Thr Ser Val Lys
125 130 135

GTG CGT TTA GGC TTT GAA AAG AAA ATC CCT AAA GAA ATC GCT CAT GCC 483
Val Arg Leu Gly Phe Glu Lys Lys Ile Pro Lys Glu Ile Ala His Ala
140 145 150

CTA AAT GAC GCA CCG GTG GAT TAT GTG GTG GTG CAT GGG AGG ACA CGA 531
Leu Asn Asp Ala Pro Val Asp Tyr Val Val Val His Gly Arg Thr Arg
155 160 165 170

AGC GAT AAA TAC CAA AAA GAC AAA ATA GAT TAC GAA AGC ATC GCT TTA 579
Ser Asp Lys Tyr Gln Lys Asp Lys Ile Asp Tyr Glu Ser Ile Ala Leu
175 180 185

ATG AAA AAG ATT TTA AAA AAG CCG GTG ATA GCC AAT GGC GAA ATT GAC 627
Met Lys Lys Ile Leu Lys Lys Pro Val Ile Ala Asn Gly Glu Ile Asp
190 195 200

AGC GTG AAA AAG GCT TTT GAA GTT TTA CAA ATC ACT CAA GCG GAT GGG 675
Ser Val Lys Lys Ala Phe Glu Val Leu Gln Ile Thr Gln Ala Asp Gly
205 210 215

CTA ATG ATA GGG CGA GCG GCC TTA AGA GCC CCA TGG ATA TTT TGG CAA 723
Leu Met Ile Gly Arg Ala Ala Leu Arg Ala Pro Trp Ile Phe Trp Gln
220 225 230

ATC AGA AAC AAC ACC ACA AAA TTA CCC GCA GTC GTG AAA AAA GAC CTG 771
Ile Arg Asn Asn Thr Thr Lys Leu Pro Ala Val Val Lys Lys Asp Leu
235 240 245 250

GTT TTA GAA CAT TTT GAT AAA ATG GTG GAG TTT TAT GGG GAT ATG GGG 819
Val Leu Glu His Phe Asp Lys Met Val Glu Phe Tyr Gly Asp Met Gly
255 260 265

GTA ATC ATG TTT AGG AAA AAT TTG CAT GCT TAC GCT AAG GGC GAA ATG 867
Val Ile Met Phe Arg Lys Asn Leu His Ala Tyr Ala Lys Gly Glu Met
270 275 280

CAA GCG AGC GCG TTT CGT AAC TGC GTC AAT ACC CTT ACA GAA ATA AAG 915
Gln Ala Ser Ala Phe Arg Asn Cys Val Asn Thr Leu Thr Glu Ile Lys
285 290 295
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65 70 75 . 80 

Lys Ile Leu Arg Lys His Pro Lys Gln Ile Asn Tyr Leu Ala Pro Ser
85 90 95

Val Ala Tyr Cys Leu Thr Ile Leu Ala Gly Thr Gly His Thr Val Phe
100 105 110

Ser Leu Ile Pro Val Ile Val Glu Val Ser Gln Ser Gln Asn Ile Lys
115 120 125

Pro Lys Ala Pro Leu Ser Leu Ala Val Val Ser Ser Gln Val Ala Ile
130 135 140

Thr Ala Ser Pro Val Ser Ala Ala Xaa Gly Val Tyr Glu Arg His Phe
145 150 155 160

Arg Ala Phe Arg Ser Lys Leu Leu Asp Pro Phe Asn Gly Leu Asp Pro
165 170 175

Tyr Asp Phe Phe Ser Met His Ala His Gly Ile Tyr Tyr Gly Phe Tyr
180 185 190

(2) INFORMATION FOR SEQ ID NO: 821: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22...1005
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821: 

AAAACTTTGA TATGAGAATA A ATG GAC TTT AAA AAT AAA AAA TGG CTT TTT 51
Met Asp Phe Lys Asn Lys Lys Trp Leu Phe
1 5 10

CTA GCC CCT TTA GCA GGC TAT ACG GAT TTG CCT TTC AGG AGC GTG GTG 99
Leu Ala Pro Leu Ala Gly Tyr Thr Asp Leu Pro Phe Arg Ser Val Val
15 20 25

AAA AAA TTT GGC GTG GAT GTT ACC ACG AGC GAA ATG GTG AGC TCG CAT 147
Lys Lys Phe Gly Val Asp Val Thr Thr Ser Glu Met Val Ser Ser His
30 35 40

TCG TTG GTG TAT GCG TTT GAT AAA ACT TCT AAA ATG TTG GAA AAA TCC 195
Ser Leu Val Tyr Ala Phe Asp Lys Thr Ser Lys Met Leu Glu Lys Ser
45 50 55

CCT TTA GAA GAT CAT TTC ATG GCG CAA ATT TCA GGC TCT AAA GAA AGC 243
Pro Leu Glu Asp His Phe Met Ala Gln Ile Ser Gly Ser Lys Glu Ser
60 65 70
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GGC TTG GAT TAC TTA GTC AAA ATC GCT GAA AAA ATT TTA AGG AAA CAC 2 94 

Gly Leu Asp Tyr Leu Val Lys Ile Ala Glu Lys Ile Leu Arg Lys His
75 80 85

CCC AAG CAA ATC AAT TAC CTT GCG CCA AGC GTG GCG TAT TGT TTA ACG 342
Pro Lys Gln Ile Asn Tyr Leu Ala Pro Ser Val Ala Tyr Cys Leu Thr
90 95 100

ATA CTA GCG GGC ACC GGG CAT ACG GTT TTT TCC TTG ATC CCG GTG ATT 390
Ile Leu Ala Gly Thr Gly His Thr Val Phe Ser Leu Ile Pro Val Ile
105 110 115

GTG GAA GTG AGC CAG AGC CAA AAC ATC AAG CCT AAA GCG CCT TTA AGC 438
Val Glu Val Ser Gln Ser Gln Asn Ile Lys Pro Lys Ala Pro Leu Ser
120 125 130

TTA GCG GTA GTC TCT AGT CAA GTC GCT ATT ACT GCA AGC CCG GTG AGC 486
Leu Ala Val Val Ser Ser Gln Val Ala Ile Thr Ala Ser Pro Val Ser
135 140 145 150

GCA GCG GTN GGT GTT TAT GAG CGG CAT TTT AGA GCC TTT AGG AGC AAA 534
Ala Ala Xaa Gly Val Tyr Glu Arg His Phe Arg Ala Phe Arg Ser Lys
155 160 165

TTA CTT GAC CCT TTT AAT GGT TTG GAT CCC TAC GAC TTT TTT AGC ATG 582
Leu Leu Asp Pro Phe Asn Gly Leu Asp Pro Tyr Asp Phe Phe Ser Met
170 175 180

CAT GCT CAC GGC ATT TAT TAT GGG TTT TAC TGATTTGAAA TTAGACAGCG ATC 635
His Ala His Gly Ile Tyr Tyr Gly Phe Tyr
185 190

CGCATTATTT AGAGCGCTTG AAAGCGGGCA AAATCTCGCC CCC 678

(2) INFORMATION FOR SEQ ID NO: 820: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820: 

Met Val Asp Ala Phe Phe Gln Ile Ala Val Leu Leu Phe Ser Leu Phe
1 5 10 15

Leu Gly Ala Arg Leu Gly Gly Leu Gly Val Gly Tyr Ala Gly Gly Leu
20 25 30

Gly Val Leu Ile Leu Cys Leu Phe Leu Gly Leu Asn Pro Gly Lys Ile
35 40 45

Pro Phe Asp Val Ile Leu Ile Ile Met Ala Val Ile Ser Ala Ile Ser
50 55 60

Ala Met Gln Lys Ala Gly Gly Leu Asp Tyr Leu Val Lys Ile Ala Glu
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Ser Glu Phe Ser Leu Ser Gin Leu Lys Thr Pro Leu Pro Lys Val Asp 

245 250 255 

Ile Ile Tyr Thr His Ala Gly Met Thr Pro Asp Leu Phe Gln Ala Ser
260 265 270

Leu Asn Ser His Ala Lys Gly Val Val Ile Ala Gly Val Gly Asn Gly
275 280 285

Asn Val Ser Ala Gly Phe Leu Lys Ala Met Gln Glu Ala Ser Gln Met
290 295 300

Gly Val Val Ile Val Arg Ser Ser Arg Val Asn Ser Gly Glu Ile Thr
305 310 315 320

Ser Gly Glu Ile Asp Asp Lys Ala Phe Ile Thr Ser Asp Asn Leu Asn
325 330 335

Pro Gln Lys Ala Arg Val Leu Leu Gln Leu Ala Leu Thr Lys Thr Asn
340 345 350

Asn Lys Glu Lys Ile Gln Glu Met Phe Glu Glu Tyr
355 360

(2) INFORMATION FOR SEQ ID NO: 819: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
(B) LOCATION: 37...612
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819: 



AGTAAAACAT CGTTAAAATA AATTTAAAAA GGGTTA ATG GTG GAT GCC TTT TTC 54 

Met Val Asp Ala Phe Phe 
1 5 



CAA ATT GCA GTG TTA CTT TTT TCG CTT TTT TTA GGG GCA AGG CTA GGG 102
Gln Ile Ala Val Leu Leu Phe Ser Leu Phe Leu Gly Ala Arg Leu Gly
10 15 20

GGC TTG GGA GTG GGC TAT GCG GGG GGC TTG GGC GTG CTT ATT TTA TGC 150
Gly Leu Gly Val Gly Tyr Ala Gly Gly Leu Gly Val Leu Ile Leu Cys
25 30 35

TTA TTT TTG GGG CTA AAT CCG GGC AAA ATC CCT TTT GAT GTG ATT TTA 198
Leu Phe Leu Gly Leu Asn Pro Gly Lys Ile Pro Phe Asp Val Ile Leu
40 45 50

ATC ATC ATG GCA GTC ATT AGC GCT ATT AGC GCG ATG CAA AAA GCG GGG 246
Ile Ile Met Ala Val Ile Ser Ala Ile Ser Ala Met Gln Lys Ala Gly
55 60 65 70
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Asp Lys Ala Phe lie Thr Ser Asp Asn Leu Asn Pro Gin Lys Ala Arg 
330 335 340 

GTG CTT TTA CAA CTC GCT TTA ACT AAA ACA AAT AAT AAA GAA AAA ATC 1111
Val Leu Leu Gln Leu Ala Leu Thr Lys Thr Asn Asn Lys Glu Lys Ile
345 350 355

CAA GAA ATG TTT GAA GAG TAT TGAAAGATTC TCTTAAATCA CCCAATTATC AAAG 1166
Gln Glu Met Phe Glu Glu Tyr
360

ATAATTGGGT GATTTGGTTT ATTTTGTTTT 1196
(2) INFORMATION FOR SEQ ID NO: 818: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818: 

Met Phe Tyr Tyr Thr Ile Lys Ser Phe Asn Phe Lys Arg Trp Ser Ile
1 5 10 15

Met Arg Ile Phe Leu Lys Leu Leu Ile Leu Leu Phe Cys Leu Lys Gly
20 25 30

Gln Val Met Ala Gln Asn Leu Pro Thr Ile Ala Leu Leu Ala Thr Gly
35 40 45

Gly Thr Ile Ala Gly Ser Gly Ala Ser Ala Ser Leu Gly Ser Tyr Lys
50 55 60

Ser Gly Glu Leu Gly Ile Lys Glu Leu Leu Lys Ala Ile Pro Ser Leu
65 70 75 80

Asn Arg Leu Ala Arg Ile Gln Gly Glu Gln Ile Ser Asn Ile Gly Ser
85 90 95

Gln Asp Met Asn Glu Glu Val Trp Phe Lys Leu Ala Lys Arg Ala Gln
100 105 110

Glu Leu Leu Asp Asp Ser Arg Ile Gln Gly Val Val Ile Thr His Gly
115 120 125

Thr Asp Thr Leu Glu Glu Ser Ala Tyr Phe Leu Asn Leu Val Leu Arg
130 135 140

Ser Thr Lys Pro Val Val Leu Val Gly Ala Met Arg Asn Ala Ala Ser
145 150 155 160

Leu Ser Ala Asp Gly Ala Leu Asn Leu Tyr Asn Ala Val Ser Val Ala
165 170 175

Leu Asn Glu Lys Ser Ala Asn Lys Gly Val Leu Val Val Met Asp Asp
180 185 190

Asn Ile Phe Ser Ala Arg Glu Val Ile Lys Thr His Thr Thr His Thr
195 200 205

Ser Thr Phe Lys Ala Leu Asn Ser Gly Ala Ile Gly Ser Val Tyr Tyr
210 215 220

Gly Lys Thr Arg Tyr Tyr Met Gln Pro Leu Arg Lys His Thr Thr Glu
225 230 235 240
105 110 115

AGC CGT ATT CAA GGC GTG GTC ATC ACG CAT GGC ACG GAG ACT TTA GAA 439
Ser Arg Ile Gln Gly Val Val Ile Thr His Gly Thr Asp Thr Leu Glu
120 125 130

GAG AGC GCG TAT TTT TTA AAC TTA GTT TTA CGC TCC ACA AAA CCG GTC 487
Glu Ser Ala Tyr Phe Leu Asn Leu Val Leu Arg Ser Thr Lys Pro Val
135 140 145

GTG CTG GTG GGA GCG ATG CGT AAT GCT GCT TCT TTG AGC GCG GAT GGG 535
Val Leu Val Gly Ala Met Arg Asn Ala Ala Ser Leu Ser Ala Asp Gly
150 155 160 165

GCT TTG AAT TTA TAT AAT GCT GTG AGC GTA GCG CTC AAT GAA AAA ACT 583
Ala Leu Asn Leu Tyr Asn Ala Val Ser Val Ala Leu Asn Glu Lys Ser
170 175 180

GCG AAT AAA GGC GTG TTA GTG GTG ATG GAC GAT AAT ATT TTT AGC GCT 631
Ala Asn Lys Gly Val Leu Val Val Met Asp Asp Asn Ile Phe Ser Ala
185 190 195

AGA GAA GTG ATT AAA ACG CAC ACC ACC CAC ACT TCC ACC TTT AAA GCC 679
Arg Glu Val Ile Lys Thr His Thr Thr His Thr Ser Thr Phe Lys Ala
200 205 210

TTA AAT AGC GGC GCG ATA GGG AGC GTG TAT TAT GGC AAA ACG CGC TAT 727
Leu Asn Ser Gly Ala Ile Gly Ser Val Tyr Tyr Gly Lys Thr Arg Tyr
215 220 225

TAC ATG CAG CCT TTG AGA AAA CAC ACC ACA GAG AGC GAA TTT TCC CTT 775
Tyr Met Gln Pro Leu Arg Lys His Thr Thr Glu Ser Glu Phe Ser Leu
230 235 240 245

TCA CAA CTC AAA ACC CCC CTG CCT AAA GTG GAT ATT ATT TAC ACG CAT 823
Ser Gln Leu Lys Thr Pro Leu Pro Lys Val Asp Ile Ile Tyr Thr His
250 255 260

GCT GGC ATG ACC CCT GAT TTA TTC CAA GCG AGC CTA AAC TCG CAT GCA 871
Ala Gly Met Thr Pro Asp Leu Phe Gln Ala Ser Leu Asn Ser His Ala
265 270 275

AAA GGC GTT GTG ATA GCC GGG GTG GGT AAT GGG AAT GTG AGC GCT GGG 919
Lys Gly Val Val Ile Ala Gly Val Gly Asn Gly Asn Val Ser Ala Gly
280 285 290

TTT TTA AAA GCG ATG CAA GAA GCG AGC CAA ATG GGG GTG GTT ATT GTT 967
Phe Leu Lys Ala Met Gln Glu Ala Ser Gln Met Gly Val Val Ile Val
295 300 305

CGT TCT AGC AGG GTA AAT AGC GGT GAG ATT ACT TCA GGC GAG ATT GAT 1015
Arg Ser Ser Arg Val Asn Ser Gly Glu Ile Thr Ser Gly Glu Ile Asp
310 315 320 325

GAC AAG GCC TTC ATC ACA AGC GAC AAT TTA AAC CCC CAA AAA GCT AGG 1063 
130 135 140

Leu Pro Ser Asp Pro Ala Glu Ile Arg Ala Glu Phe Leu Ser Val Leu
145 150 155 160 

Ser Val 



(2) INFORMATION FOR SEQ ID NO: 817: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 41... 1132 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817: 

AAGGCATCCA CCATTAACCC TTTTTAAATT TATTTTAACG ATG TTT TAC TAT ACT 55
Met Phe Tyr Tyr Thr
1 5

ATA AAA TCT TTT AAT TTC AAA AGG TGG TCC ATA ATG AGA ATA TTT TTG 103
Ile Lys Ser Phe Asn Phe Lys Arg Trp Ser Ile Met Arg Ile Phe Leu
10 15 20

AAA TTG TTG ATT CTT TTA TTT TGT TTG AAG GGG CAG GTT ATG GCT CAA 151
Lys Leu Leu Ile Leu Leu Phe Cys Leu Lys Gly Gln Val Met Ala Gln
25 30 35

AAT TTA CCC ACC ATT GCT TTA CTG GCG ACA GGG GGG ACG ATT GCA GGG 199
Asn Leu Pro Thr Ile Ala Leu Leu Ala Thr Gly Gly Thr Ile Ala Gly
40 45 50

AGT GGT GCG AGC GCG AGT TTG GGT AGT TAT AAG AGT GGT GAG TTG GGC 247
Ser Gly Ala Ser Ala Ser Leu Gly Ser Tyr Lys Ser Gly Glu Leu Gly
55 60 65

ATC AAA GAG CTT TTG AAG GCT ATC CCT AGT CTT AAC AGA CTC GCT CGC 295
Ile Lys Glu Leu Leu Lys Ala Ile Pro Ser Leu Asn Arg Leu Ala Arg
70 75 80 85

ATT CAA GGG GAG CAG ATT TCT AAC ATC GGC TCA CAA GAC ATG AAT GAA 343
Ile Gln Gly Glu Gln Ile Ser Asn Ile Gly Ser Gln Asp Met Asn Glu
90 95 100

GAG GTA TGG TTC AAG CTC GCC AAA CGT GCC CAA GAA TTG CTA GAT GAT 391
Glu Val Trp Phe Lys Leu Ala Lys Arg Ala Gln Glu Leu Leu Asp Asp
TTT TAT CCT TTT AAA TTA AAT TTA TCT CAC TTA GGA GAG CAA TGC TCG 294
Phe Tyr Pro Phe Lys Leu Asn Leu Ser His Leu Gly Glu Gln Cys Ser
75 80 85

TCT TTT TTC TTA ACA GCC CTA CAC CAA ACT TTT CTC GTA TCG CCG CTG 342
Ser Phe Phe Leu Thr Ala Leu His Gln Thr Phe Leu Val Ser Pro Leu
90 95 100

CAA ACG CTC ACA TTA AGT CCT TTT GCC TTG ATT TCT TCA TCG CTC AAA 390
Gln Thr Leu Thr Leu Ser Pro Phe Ala Leu Ile Ser Ser Ser Leu Lys
105 110 115

CCT TTG GTT TTT TCT TCT AAT TCT TTA CGC ACT TCT TCA CGC ATT TTT 438
Pro Leu Val Phe Ser Ser Asn Ser Leu Arg Thr Ser Ser Arg Ile Phe
120 125 130

TTG AAA TCC TTC GCT CAT TTT GGA AAG ATT CTT CCT AGC GAT CCG GCT 486
Leu Lys Ser Phe Ala His Phe Gly Lys Ile Leu Pro Ser Asp Pro Ala
135 140 145 150

GAA ATT CGC GCG GAA TTT CTT AGC GTC CTC AGC GTT TAAAGTTTTA AGGCGT 538
Glu Ile Arg Ala Glu Phe Leu Ser Val Leu Ser Val
155 160

TTAGACACTT CCATGCGATA AT 560

(2) INFORMATION FOR SEQ ID NO: 816: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816: 

Met Gln Leu Leu Met Glu Arg Leu Ser Leu Val Phe Arg His Leu Phe
1 5 10 15

His Ala Leu Phe Gln Leu Phe Cys Val Ile His Asn Ile Leu Arg Xaa
20 25 30

Thr Thr Pro Ala Ile Leu Ala Lys Ser Ser Ser Leu Val Leu Leu Lys
35 40 45

Ser Leu Ala Phe Lys Ala Thr Asn Ser Asn Ala Leu Thr Glu Ser Ile
50 55 60

Phe Asn Ala Phe Phe Ile Phe Tyr Pro Phe Lys Leu Asn Leu Ser His
65 70 75 80

Leu Gly Glu Gln Cys Ser Ser Phe Phe Leu Thr Ala Leu His Gln Thr
85 90 95

Phe Leu Val Ser Pro Leu Gln Thr Leu Thr Leu Ser Pro Phe Ala Leu
100 105 110

Ile Ser Ser Ser Leu Lys Pro Leu Val Phe Ser Ser Asn Ser Leu Arg
115 120 125

Thr Ser Ser Arg Ile Phe Leu Lys Ser Phe Ala His Phe Gly Lys Ile
165 170 175

Ala Ala Asn Tyr Trp Lys Glu Gln Ile Ile Glu Ala Lys Gly Pro Asp
180 185 190

Val Cys Thr Pro Thr Tyr Cys Asn Pro Asn Ala Pro Tyr Ser Thr Lys
195 200 205

Thr Ser Thr Val Ala Phe Gln Val Trp Leu Asn Phe Gly Val Arg Ala
210 215 220

Asn Ile Tyr Lys His Asn Gly Val Glu Phe Gly Val Arg Val Pro Leu
225 230 235 240

Leu Ile Asn Lys Phe Leu Ser Ala Gly Pro Asn Ala Thr Asn Leu Tyr
245 250 255

Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu Gly Tyr Asn Tyr Thr
260 265 270

Phe



(2) INFORMATION FOR SEQ ID NO: 815:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 37... 522 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815: 

CACGGTCATT TTGTCTGTGT TTTTAGTCGC GTATTC ATG CAA CTG CTT ATG GAA 54 

Met Gln Leu Leu Met Glu
1 5

CGC CTT TCT CTT GTC TTC AGG CAT CTT TTC CAT GCG CTT TTT CAA CTC 102
Arg Leu Ser Leu Val Phe Arg His Leu Phe His Ala Leu Phe Gln Leu
10 15 20

TTT TGT GTA ATC CAC AAT ATC CTG CGG AGK ACA ACG CCA GCC ATT TTA 150
Phe Cys Val Ile His Asn Ile Leu Arg Xaa Thr Thr Pro Ala Ile Leu
25 30 35

GCC AAA TCT TCA TCG CTT GTT TTG CTG AAA TCT TTG GCG TTC AAA GCC 198
Ala Lys Ser Ser Ser Leu Val Leu Leu Lys Ser Leu Ala Phe Lys Ala
40 45 50

ACA AAT AGC AAC GCG CTA ACA GAA AGT ATT TTC AAC GCT TTT TTC ATT 246
Thr Asn Ser Asn Ala Leu Thr Glu Ser Ile Phe Asn Ala Phe Phe Ile
55 60 65 70
CCT TAT AGC ACC AAA ACT TCA ACC GTC GCT TTT CAG GTA TGG TTG AAT 675
Pro Tyr Ser Thr Lys Thr Ser Thr Val Ala Phe Gln Val Trp Leu Asn
205 210 215

TTT GGG GTG AGA GCC AAT ATT TAC AAG CAT AAT GGC GTA GAG TTT GGC 723
Phe Gly Val Arg Ala Asn Ile Tyr Lys His Asn Gly Val Glu Phe Gly
220 225 230 235

GTG AGA GTG CCG CTA CTC ATC AAC AAG TTT TTG AGT GCG GGT CCT AAC 771
Val Arg Val Pro Leu Leu Ile Asn Lys Phe Leu Ser Ala Gly Pro Asn
240 245 250

GCT ACT AAT CTT TAT TAC CAT TTG AAA CGG GAT TAT TCG CTT TAT TTA 819
Ala Thr Asn Leu Tyr Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu
255 260 265

GGG TAT AAC TAC ACT TTT TAAACCCTTT AAAAGGGTGT CTTTAAGCCC TTTTTAGT 875
Gly Tyr Asn Tyr Thr Phe
270

CCTTATAAAA AGG 888
(2) INFORMATION FOR SEQ ID NO: 814: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814: 

Met Glu Phe Met Lys Lys Phe Val Ala Leu Gly Leu Leu Ser Ala Val
1 5 10 15

Leu Ser Ser Ser Leu Leu Ala Glu Gly Asp Gly Val Tyr Ile Gly Thr
20 25 30

Asn Tyr Gln Leu Gly Gln Ala Arg Leu Asn Ser Asn Ile Tyr Asn Thr
35 40 45

Gly Asp Cys Thr Gly Ser Val Val Gly Cys Pro Pro Gly Leu Thr Ala
50 55 60

Asn Lys His Asn Pro Gly Gly Thr Asn Ile Asn Trp His Ala Lys Tyr
65 70 75 80

Ala Asn Gly Ala Leu Asn Gly Leu Gly Leu Asn Val Gly Tyr Lys Lys
85 90 95

Phe Phe Gln Phe Lys Ser Phe Asp Met Thr Ser Lys Trp Phe Gly Phe
100 105 110

Arg Val Tyr Gly Leu Phe Asp Tyr Gly His Ala Thr Leu Gly Lys Gln
115 120 125

Val Tyr Ala Pro Asn Lys Ile Gln Leu Asp Met Val Ser Trp Gly Val
130 135 140

Gly Ser Asp Leu Leu Ala Asp Ile Ile Asp Asn Asp Asn Ala Ser Phe
145 150 155 160

Gly Ile Phe Gly Gly Val Ala Ile Gly Gly Asn Thr Trp Lys Ser Ser
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813:

AGATAGGAAT GTAAAGGA ATG GAA TTT ATG AAA AAG TTT GTA GCT TTA GGG 51
Met Glu Phe Met Lys Lys Phe Val Ala Leu Gly
1 5 10

CTT CTA TCC GCA GTT TTA AGC TCT TCG TTG TTA GCC GAA GGT GAT GGT 99
Leu Leu Ser Ala Val Leu Ser Ser Ser Leu Leu Ala Glu Gly Asp Gly
15 20 25

GTT TAT ATA GGG ACT AAT TAT CAG CTT GGA CAA GCC CGT TTG AAT AGT 147
Val Tyr Ile Gly Thr Asn Tyr Gln Leu Gly Gln Ala Arg Leu Asn Ser
30 35 40

AAT ATT TAT AAT ACA GGG GAT TGC ACA GGG AGT GTT GTA GGT TGC CCC 195
Asn Ile Tyr Asn Thr Gly Asp Cys Thr Gly Ser Val Val Gly Cys Pro
45 50 55

CCA GGT CTT ACC GCT AAT AAG CAT AAT CCA GGA GGC ACC AAT ATC AAT 243
Pro Gly Leu Thr Ala Asn Lys His Asn Pro Gly Gly Thr Asn Ile Asn
60 65 70 75

TGG CAT GCT AAA TAC GCT AAT GGG GCT TTG AAT GGT CTT GGG TTG AAT 291
Trp His Ala Lys Tyr Ala Asn Gly Ala Leu Asn Gly Leu Gly Leu Asn
80 85 90

GTG GGT TAT AAG AAG TTC TTC CAG TTC AAG TCT TTT GAT ATG ACA AGC 339
Val Gly Tyr Lys Lys Phe Phe Gln Phe Lys Ser Phe Asp Met Thr Ser
95 100 105

AAG TGG TTT GGT TTT AGA GTG TAT GGG CTT TTT GAT TAT GGG CAT GCC 387
Lys Trp Phe Gly Phe Arg Val Tyr Gly Leu Phe Asp Tyr Gly His Ala
110 115 120

ACT TTA GGC AAG CAA GTT TAT GCA CCT AAT AAA ATC CAG TTG GAT ATG 435
Thr Leu Gly Lys Gln Val Tyr Ala Pro Asn Lys Ile Gln Leu Asp Met
125 130 135

GTC TCT TGG GGT GTG GGG AGC GAT TTG TTA GCT GAT ATT ATT GAT AAC 483
Val Ser Trp Gly Val Gly Ser Asp Leu Leu Ala Asp Ile Ile Asp Asn
140 145 150 155

GAT AAC GCT TCT TTT GGT ATT TTT GGT GGG GTC GCT ATC GGC GGT AAC 531
Asp Asn Ala Ser Phe Gly Ile Phe Gly Gly Val Ala Ile Gly Gly Asn
160 165 170

ACT TGG AAA AGC TCA GCG GCA AAC TAT TGG AAA GAG CAA ATC ATT GAA 579
Thr Trp Lys Ser Ser Ala Ala Asn Tyr Trp Lys Glu Gln Ile Ile Glu
175 180 185

GCT AAG GGT CCT GAT GTT TGT ACC CCT ACT TAT TGT AAC CCT AAC GCT 627
Ala Lys Gly Pro Asp Val Cys Thr Pro Thr Tyr Cys Asn Pro Asn Ala
190 195 200
450 455 460

Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Asn Asn Lys Glu Pro Glu
465 470 475 480

Lys His Asp Phe Ser Val Trp Asn Ile Thr Lys Lys Arg Gln Asn Glu
485 490 495

Trp Ser Pro Ala Leu Asn Ile Gly Tyr Lys Pro Met Glu Asn Trp Ile
500 505 510

Trp Tyr Ala Asn Tyr Arg Arg Ser Phe Ile Pro Pro Gln His Thr Met
515 520 525

Leu Gly Ile Thr Arg Thr Asn Tyr Asn Gln Ile Phe Asn Glu Ile Glu
530 535 540

Val Gly Gln Arg Tyr Ser Tyr Lys Asn Leu Leu Ser Phe Asn Thr Asn
545 550 555 560

Tyr Phe Val Ile Phe Ala Lys Arg Tyr Tyr Ala Gly Gly Tyr Ser Pro
565 570 575

Gln Pro Ile Asn Ala Arg Ser Gln Gly Val Glu Leu Glu Leu Tyr Tyr
580 585 590

Ala Pro Ile Arg Gly Leu Gln Phe His Val Ala Tyr Thr Tyr Ile Asp
595 600 605

Ala Arg Ile Thr Ser Asn Ala Asp Asp Ile Ala Tyr Tyr Phe Thr Gly
610 615 620

Ile Val Asn Lys Pro Phe Asp Ile Lys Gly Lys Arg Leu Pro Tyr Val
625 630 635 640

Ser Pro Asn Gln Phe Ile Phe Asp Met Met Tyr Thr Tyr Lys His Thr
645 650 655

Thr Phe Gly Ile Ser Ser Tyr Phe Tyr Ser Arg Ala Tyr Ser Ser Met
660 665 670

Leu Asn Gln Ala Lys Ser Gln Thr Val Cys Leu Pro Leu Asn Pro Glu
675 680 685

Tyr Thr Gly Gly Leu Glu Tyr Gly Cys Asn Ser Val Gly Leu Leu Pro
690 695 700

Leu Tyr Phe Val Leu Asn Val Gln Val Ser Ser Val Leu Trp Gln Ser
705 710 715 720

Gly Arg His Lys Ile Thr Gly Ser Leu Gln Ile Asn Asn Leu Phe Asn
725 730 735

Met Lys Tyr Tyr Phe Arg Gly Ile Gly Thr Ser Pro Thr Gly Arg Glu
740 745 750

Pro Ala Pro Gly Arg Ser Ile Thr Ala Tyr Leu Asn Tyr Glu Phe
755 760 765

(2) INFORMATION FOR SEQ ID NO: 813: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19... 837 
(D) OTHER INFORMATION: 
Ala Val Lys Thr His Asn Leu Glu Arg Val Glu Ala Ser Gly Val Ala
20 25 30

Asn Asp Lys Glu Ala Pro Leu Ser Trp Arg Ser Lys Glu Val Arg Asn
35 40 45

Tyr Met Gly Ser Arg Thr Val Ile Ser Asn Lys Gln Leu Thr Lys Ser
50 55 60

Ala Asn Gln Ser Ile Glu Glu Ala Leu Gln Asn Val Pro Gly Val His
65 70 75 80

Ile Arg Asn Ala Thr Gly Ile Gly Ala Val Pro Ser Phe Ser Val Arg
85 90 95

Gly Phe Gly Gly Gly Ser Ser Gly His Ser Asn Thr Ala Met Val Leu
100 105 110

Val Asn Gly Ile Pro Ile Tyr Val Ala Pro Tyr Val Asp Ile Ser Ile
115 120 125

Pro Ile Phe Pro Val Thr Phe Gln Ser Val Asp Arg Ile Ser Val Thr
130 135 140

Lys Gly Gly Glu Ser Val Arg Tyr Gly Pro Asn Val Phe Gly Gly Val
145 150 155 160

Ile Asn Val Ile Thr Lys Gly Ile Pro Thr Lys Trp Glu Ser Gln Val
165 170 175

Ser Glu Arg Ala Thr Phe Trp Gly Lys Ser Glu Asn Gly Gly Phe Phe
180 185 190

Asn Gln Asn Ser Lys Asn Leu Asp Lys Ser Leu Ala Asn Asn Met Leu
195 200 205

Phe Asp Thr Tyr Leu Arg Thr Gly Gly Met Met Asn Lys His Phe Gly
210 215 220

Ile Gln Ala Gln Ala Asn Trp Leu Lys Gly Gln Gly Phe Arg Tyr Asn
225 230 235 240

Ser Pro Thr Asn Ile Gln Asn Tyr Met Leu Asp Ser Leu Tyr Gln Ile
245 250 255

Asn Asp Ser Asn Lys Ile Thr Ala Phe Phe Gln Tyr Tyr Asn Tyr Phe
260 265 270

Met Ala Asp Pro Gly Ser Leu Gly Ile Glu Ala Tyr Asn Gln Asn Arg
275 280 285

Phe Gln Asn Asn Arg Pro Asn Asn Asn Lys Ser Gly Arg Ala Lys Xaa
290 295 300

Trp Gly Ala Val Tyr Gln Asn Phe Phe Gly Asp Thr Asp Lys Ile Gly
305 310 315 320

Gly Asp Phe Thr Phe Ser Tyr Tyr Gly His Asp Met Ser Arg Asp Phe
325 330 335

Gln Phe Asp Ser Asn Phe Leu Asn Val Asn Thr Asn Pro Lys Leu Gly
340 345 350

Pro Val Tyr Thr Asp Gln Asn Tyr Pro Gly Phe Phe Ile Phe Asp His
355 360 365

Leu Arg Arg Tyr Ile Met Asn Ala Phe Glu Pro Asn Leu Asn Leu Val
370 375 380

Val Asn Thr Asn Lys Val Lys Gln Thr Phe Asn Val Gly Met Arg Phe
385 390 395 400

Met Thr Met Asp Met Tyr Phe Arg Leu Asp Gln Ser Thr Cys Glu Lys
405 410 415

Thr Asp Ile Phe Asn Gly Val Cys Arg Met Pro Pro Phe Val Leu Ser
420 425 430

Lys Lys Pro Ser Asn Asn Gln Asn Leu Phe Asn Asn Tyr Thr Ala Val
435 440 445

Trp Leu Ser Asp Lys Ile Glu Leu Phe Asp Ser Lys Leu Val Ile Thr

TTT ACA GGC ATT GTC AAT AAA CCC TTT GAC ATT AAA GGG AAG CGT TTG 1923
Phe Thr Gly Ile Val Asn Lys Pro Phe Asp Ile Lys Gly Lys Arg Leu
625 630 635

car TAT GTG AGT CCT AAC CAA TTC ATA TTT GAC ATG ATG TAT ACT TAC 1971
Pro Tyr Val Ser Pro Asn Gln Phe Ile Phe Asp Met Met Tyr Thr Tyr
640 645 650

AAG CAC ACG ACT TTT GGT ATT AGC AGC TAT TTT TAT AGC CGT GCT TAT 2019
Lys His Thr Thr Phe Gly Ile Ser Ser Tyr Phe Tyr Ser Arg Ala Tyr
655 660 665

AGT TCT ATG CTC AAT CAG GCC AAA AGC CAA ACC GTG TGC CTG CCC TTA 2067
Ser Ser Met Leu Asn Gln Ala Lys Ser Gln Thr Val Cys Leu Pro Leu
670 675 680 685

AAC CCA GAA TAC ACA GGG GGG CTA GAG TAT GGT TGT AAT TCA GTA GGG 2115
Asn Pro Glu Tyr Thr Gly Gly Leu Glu Tyr Gly Cys Asn Ser Val Gly
690 695 700

TTA TTG CCC TTG TAT TTT GTG TTG AAC GTT CAA GTA AGC TCG GTT TTA 2163
Leu Leu Pro Leu Tyr Phe Val Leu Asn Val Gln Val Ser Ser Val Leu
705 710 715

TGG CAA AGC GGT AGG CAT AAA ATC ACA GGG AGT TTG CAA ATC AAT AAT 2211
Trp Gln Ser Gly Arg His Lys Ile Thr Gly Ser Leu Gln Ile Asn Asn
720 725 730

CTT TTT AAC ATG AAG TAT TAT TTT AGG GGA ATT GGC ACA AGC CCT ACA 2259
Leu Phe Asn Met Lys Tyr Tyr Phe Arg Gly Ile Gly Thr Ser Pro Thr
735 740 745

GGA AGA GAG CCC GCA CCA GGG CGA TCC ATT ACA GCG TAT TTG AAT TAT 2307
Gly Arg Glu Pro Ala Pro Gly Arg Ser Ile Thr Ala Tyr Leu Asn Tyr
750 755 760 765

GAG TTT TAAACTAGCT TCAAGCTTTT ATCGCTTGAA TGCTTGTTA 2352
Glu Phe



(2) INFORMATION FOR SEQ ID NO: 812:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 767 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:812: 

Met Lys Arg Ile Leu Val Ser Leu Ala Val Leu Ser His Ser Ala His
1 5 10 15
ATG CGT TTT ATG ACA ATG GAT ATG TAT TTC AGA TTG GAT CAA AGC ACA 1251
Met Arg Phe Met Thr Met Asp Met Tyr Phe Arg Leu Asp Gln Ser Thr
400 405 410

TGC GAA AAA AGO GAT ATT TTT AAT GGG GTG TGC CGC ATG CCT CCT TTT 1299
Cys Glu Lys Thr Asp Ile Phe Asn Gly Val Cys Arg Met Pro Pro Phe
415 420 425

GTT CTT TCT AAA AAA CCC AGC AAC AAT CAA AAC CTG TTT AAC AAC TAT 1347
Val Leu Ser Lys Lys Pro Ser Asn Asn Gln Asn Leu Phe Asn Asn Tyr
430 435 440 445

ACA GCG GTA TGG TTG AGC GAT AAA ATA GAG CTT TTT GAT TCT AAA TTG 1395
Thr Ala Val Trp Leu Ser Asp Lys Ile Glu Leu Phe Asp Ser Lys Leu
450 455 460

GTG ATA ACT CCA GGG CTT AGA TAC ACT TTT TTG AAC TAT AAC AAC AAA 1443
Val Ile Thr Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Asn Asn Lys
465 470 475

GAG CCA GAA AAG CAT GAT TTT TCT GTG TGG AAT ATT ACA AAA AAG CGT 1491
Glu Pro Glu Lys His Asp Phe Ser Val Trp Asn Ile Thr Lys Lys Arg
480 485 490

CAA AAC GAA TGG AGT CCC GCC CTT AAC ATT GGC TAT AAA CCT ATG GAA 1539
Gln Asn Glu Trp Ser Pro Ala Leu Asn Ile Gly Tyr Lys Pro Met Glu
495 500 505

AAT TGG ATA TGG TAT GCG AAC TAC CGC CGC AGT TTT ATC CCC CCA CAA 1587
Asn Trp Ile Trp Tyr Ala Asn Tyr Arg Arg Ser Phe Ile Pro Pro Gln
510 515 520 525

CAT ACA ATG CTA GGC ATT ACT AGG ACT AAT TAC AAC CAA ATT TTT AAT 1635
His Thr Met Leu Gly Ile Thr Arg Thr Asn Tyr Asn Gln Ile Phe Asn
530 535 540

GAA ATT GAA GTG GGG CAA CGC TAT AGT TAT AAA AAT CTA TTG AGC TTT 1683
Glu Ile Glu Val Gly Gln Arg Tyr Ser Tyr Lys Asn Leu Leu Ser Phe
545 550 555

AAC ACG AAT TAT TTT GTG ATT TTT GCC AAG CGT TAC TAT GCG GGA GGC 1731
Asn Thr Asn Tyr Phe Val Ile Phe Ala Lys Arg Tyr Tyr Ala Gly Gly
560 565 570

TAT AGC CCA CAG CCT ATT AAC GCT AGG AGT CAA GGG GTA GAA TTG GAA 1779
Tyr Ser Pro Gln Pro Ile Asn Ala Arg Ser Gln Gly Val Glu Leu Glu
575 580 585

TTG TAT TAC GCG CCG ATT AGG GGT TTG CAA TTC CAT GTG GCT TAC ACC 1827
Leu Tyr Tyr Ala Pro Ile Arg Gly Leu Gln Phe His Val Ala Tyr Thr
590 595 600 605

TAT ATT GAT GCA CGC ATC ACT TCT AAC GCT GAT GAT ATT GCT TAT TAT 1875
Tyr Ile Asp Ala Arg Ile Thr Ser Asn Ala Asp Asp Ile Ala Tyr Tyr
610 615 620
AGT CAG GTG AGC GAG AGG GCC ACT TTT TGG GGC AAA TCT GAA AAT GGG 579
Ser Gln Val Ser Glu Arg Ala Thr Phe Trp Gly Lys Ser Glu Asn Gly
175 180 185

GGC TTT TTC AAT CAA AAT TCT AAA AAC CTT GAC AAA AGC TTA GCC AAT 627
Gly Phe Phe Asn Gln Asn Ser Lys Asn Leu Asp Lys Ser Leu Ala Asn
190 195 200 205

AAC ATG CTT TTT GAC ACT TAC TTA AGA ACA GGG GGC ATG ATG AAT AAG 675
Asn Met Leu Phe Asp Thr Tyr Leu Arg Thr Gly Gly Met Met Asn Lys
210 215 220

CAT TTT GGA ATC CAA GCT CAA GCC AAC TGG CTT AAA GGG CAA GGG TTT 723
His Phe Gly Ile Gln Ala Gln Ala Asn Trp Leu Lys Gly Gln Gly Phe
225 230 235

AGA TAC AAC AGC CCT ACG AAC ATT CAA AAC TAC ATG CTA GAT TCC TTG 771
Arg Tyr Asn Ser Pro Thr Asn Ile Gln Asn Tyr Met Leu Asp Ser Leu
240 245 250

TAT CAA ATT AAT GAT AGT AAT AAG ATC ACT GCT TTT TTC CAA TAC TAT 819
Tyr Gln Ile Asn Asp Ser Asn Lys Ile Thr Ala Phe Phe Gln Tyr Tyr
255 260 265

AAT TAT TTT ATG GCA GAC CCC GGA TCT TTA GGC ATA GAA GCG TAT AAT 867
Asn Tyr Phe Met Ala Asp Pro Gly Ser Leu Gly Ile Glu Ala Tyr Asn
270 275 280 285

CAA AAT CGT TTT CAA AAC AAC CGC CCT AAT AAC AAT AAA AGC GGG AGA 915
Gln Asn Arg Phe Gln Asn Asn Arg Pro Asn Asn Asn Lys Ser Gly Arg
290 295 300

GCG AAG CGR TGG GGA GCT GTG TAT CAA AAC TTT TTT GGG GAT ACG GAC 963
Ala Lys Xaa Trp Gly Ala Val Tyr Gln Asn Phe Phe Gly Asp Thr Asp
305 310 315

AAA ATA GGT GGG GAT TTC ACT TTT AGT TAC TAT GGG CAT GAC ATG TCA 1011
Lys Ile Gly Gly Asp Phe Thr Phe Ser Tyr Tyr Gly His Asp Met Ser
320 325 330

AGG GAT TTT CAA TTT GAT TCT AAT TTT TTG AAT GTC AAT ACC AAT CCT 1059
Arg Asp Phe Gln Phe Asp Ser Asn Phe Leu Asn Val Asn Thr Asn Pro
335 340 345

AAA TTA GGC CCT GTT TAT ACC GAT CAA AAT TAT CCA GGA TTT TTT ATT 1107
Lys Leu Gly Pro Val Tyr Thr Asp Gln Asn Tyr Pro Gly Phe Phe Ile
350 355 360 365

TTT GAT CAT TTA AGG CGT TAC ATA ATG AAC GCT TTT GAG CCT AAT TTG 1155
Phe Asp His Leu Arg Arg Tyr Ile Met Asn Ala Phe Glu Pro Asn Leu
370 375 380

AAC TTA GTT GTC AAT ACC AAT AAA GTT AAG CAA ACT TTT AAT GTG GGC 1203
Asn Leu Val Val Asn Thr Asn Lys Val Lys Gln Thr Phe Asn Val Gly
385 390 395




(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 13...2313
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811: 

AAAAGGTGGT AA ATG AAA AGA ATT TTA GTC TCT TTG GCT GTT TTG ACT CAT 51
Met Lys Arg Ile Leu Val Ser Leu Ala Val Leu Ser His
1 5 10

AGC GCG CAT GCT GTC AAA ACT CAT AAT TTG GAA AGG GTG GAA GCT TCA 99
Ser Ala His Ala Val Lys Thr His Asn Leu Glu Arg Val Glu Ala Ser
15 20 25

GGG GTG GCT AAC GAT AAG GAA GCG CCT TTA AGC TGG AGG AGC AAG GAA 147
Gly Val Ala Asn Asp Lys Glu Ala Pro Leu Ser Trp Arg Ser Lys Glu
30 35 40 45

GTG AGA AAC TAT ATG GGA TCT CGC ACG GTG ATT TCT AAC AAG CAA CTC 195
Val Arg Asn Tyr Met Gly Ser Arg Thr Val Ile Ser Asn Lys Gln Leu
50 55 60

ACT AAA AGC GCC AAT CAG AGC ATT GAA GAA GCT TTG CAA AAT GTG CCA 243
Thr Lys Ser Ala Asn Gln Ser Ile Glu Glu Ala Leu Gln Asn Val Pro
65 70 75

GGC GTG CAT ATT AGA AAC GCT ACG GGT ATT GGA GCT GTG CCT AGC TTT 291
Gly Val His Ile Arg Asn Ala Thr Gly Ile Gly Ala Val Pro Ser Phe
80 85 90

TCT GTT AGG GGC TTT GGT GGG GGA AGT TCA GGG CAT TCC AAT ACG GCT 339
Ser Val Arg Gly Phe Gly Gly Gly Ser Ser Gly His Ser Asn Thr Ala
95 100 105

ATG GTT TTA GTC AAT GGG ATC CCT ATT TAT GTT GCG CCC TAT GTT GAT 387
Met Val Leu Val Asn Gly Ile Pro Ile Tyr Val Ala Pro Tyr Val Asp
110 115 120 125

ATT AGC ATT CCT ATT TTC CCT GTA ACC TTT CAA TCT GTA GAT AGA ATC 435
Ile Ser Ile Pro Ile Phe Pro Val Thr Phe Gln Ser Val Asp Arg Ile
130 135 140

AGC GTA ACC AAG GGT GGG GAG AGC GTG CGT TAT GGC CCT AAT GTT TTT 483
Ser Val Thr Lys Gly Gly Glu Ser Val Arg Tyr Gly Pro Asn Val Phe
145 150 155

GGC GGT GTG ATT AAT GTG ATC ACT AAG GGC ATT CCT ACC AAG TGG GAG 531
Gly Gly Val Ile Asn Val Ile Thr Lys Gly Ile Pro Thr Lys Trp Glu
160 165 170
ATG ATG CTC CCG CCT GTA ATG ATT TCT CTG CCT TTT AAA ATT TTA GTG 506
Met Met Leu Pro Pro Val Met Ile Ser Leu Pro Phe Lys Ile Leu Val
110 115 120 125

TTT ATT CTG GTA GAT GGG TTT AAT TTA TTG ACC GAA AAT TTA GTG GCG 554
Phe Ile Leu Val Asp Gly Phe Asn Leu Leu Thr Glu Asn Leu Val Ala
130 135 140

AGT TTT AAA ATG GTT TGATATTAAC AAGCATTCAA GCGATAAAAG CTTGAAGCTA G 610
Ser Phe Lys Met Val
145

TTTAAAACTC ATAATTCAAA 630

(2) INFORMATION FOR SEQ ID NO: 810: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810: 

Met Glu Pro Ser Leu Lys Lys Ala Tyr Asp Thr Gly Ile Lys Pro Tyr
1 5 10 15

Met Asp Lys Lys Ile Ser Tyr Thr Glu Ala Phe Glu Lys Ser Ala Leu
20 25 30

Pro Phe Lys Glu Phe Met Leu Lys Asn Thr Arg Glu Lys Asp Leu Ala
35 40 45

Leu Phe Phe Arg Ile Arg Asn Leu Pro Asn Pro Lys Thr Pro Asp Glu
50 55 60

Val Ser Leu Ser Val Leu Ile Pro Ala Phe Met Ile Ser Glu Leu Lys
65 70 75 80

Thr Ala Phe Gln Ile Gly Phe Leu Leu Tyr Leu Pro Phe Leu Val Ile
85 90 95

Asp Met Val Ile Ser Ser Ile Leu Met Ala Met Gly Met Met Met Leu
100 105 110

Pro Pro Val Met Ile Ser Leu Pro Phe Lys Ile Leu Val Phe Ile Leu
115 120 125

Val Asp Gly Phe Asn Leu Leu Thr Glu Asn Leu Val Ala Ser Phe Lys
130 135 140

Met Val
145

(2) INFORMATION FOR SEQ ID NO: 811: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 






65 70 75 80 

Val Asp Arg Glu Pro Gln Asn Lys Glu Pro Gln Ala Pro Arg Glu Pro
85 90 95

Ile Arg Asp Leu Tyr Asp Tyr Gly Tyr Ser Phe Gly His Ala Trp
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 809:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 132... 569 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809: 

GATTTTAGTG ATGACGAGTT TCACCCGTTT GATCGTGGTG TTTTCTTTTT TAAGGACCGC 60

TTTGGGCACG CAACAAACCC CCCCCCACTC AAATTTTAGT CTCGCTCTCT TTGATATTGA 120

CTTTTTTCAT C ATG GAA CCT AGC CTA AAA AAG GCC TAT GAT ACA GGG ATT 170
Met Glu Pro Ser Leu Lys Lys Ala Tyr Asp Thr Gly Ile
1 5 10

AAG CCT TAT ATG GAT AAA AAG ATT TCT TAC ACC GAA GCG TTT GAA AAA 218
Lys Pro Tyr Met Asp Lys Lys Ile Ser Tyr Thr Glu Ala Phe Glu Lys
15 20 25

AGC GCT CTG CCC TTC AAG GAA TTC ATG CTT AAA AAC ACA CGA GAA AAG 266
Ser Ala Leu Pro Phe Lys Glu Phe Met Leu Lys Asn Thr Arg Glu Lys
30 35 40 45

GAT CTA GCC CTT TTT TTT AGG ATT AGA AAC CTC CCT AAC CCT AAA ACC 314
Asp Leu Ala Leu Phe Phe Arg Ile Arg Asn Leu Pro Asn Pro Lys Thr
50 55 60

CCT GAT GAG GTG AGT TTG AGC GTT TTG ATC CCG GCA TTT ATG ATA AGC 362
Pro Asp Glu Val Ser Leu Ser Val Leu Ile Pro Ala Phe Met Ile Ser
65 70 75

GAG TTG AAA ACA GCG TTT CAA ATC GGC TTT TTA CTC TAC TTG CCT TTT 410
Glu Leu Lys Thr Ala Phe Gln Ile Gly Phe Leu Leu Tyr Leu Pro Phe
80 85 90

TTG GTG ATT GAT ATG GTG ATC AGC TCT ATT TTA ATG GCG ATG GGC ATG 458
Leu Val Ile Asp Met Val Ile Ser Ser Ile Leu Met Ala Met Gly Met
95 100 105






GTTGTGGTAT AAATATTCTT ATCAAGGTGT GCCAAACATG CCTTGAATCT CAATTTTTGA 60

ATCTCAATTT T ATG AAA GGA TTT GTT ATG AGT GGA TTA AGA ACA TTT AGT 110
Met Lys Gly Phe Val Met Ser Gly Leu Arg Thr Phe Ser
1 5 10

TGT GTA GTG GTT TTA TGC GGT GCA ATG GCT AAT GTG GCT ATA GCT AGT 158
Cys Val Val Val Leu Cys Gly Ala Met Ala Asn Val Ala Ile Ala Ser
15 20 25

CCT AAA ATA GAG GCA AGG GGT GAA TTA GGC AAA TTT ATA GGG GGT GGT 206
Pro Lys Ile Glu Ala Arg Gly Glu Leu Gly Lys Phe Ile Gly Gly Gly
30 35 40 45

GTT GGG GGT TTT GTT GGT GAT AAA ATG GGC GGA TTT GTT GGT GGT GCA 254
Val Gly Gly Phe Val Gly Asp Lys Met Gly Gly Phe Val Gly Gly Ala
50 55 60

ATA GGA GGA TAT ATT GGG TCT GAA ATA GGC GAT AGG GTA GAA GAT TAT 302
Ile Gly Gly Tyr Ile Gly Ser Glu Ile Gly Asp Arg Val Glu Asp Tyr
65 70 75

ATC CGT GGT GTT GAT AGA GAG CCA CAA AAC AAA GAA CCA CAA GCC CCA 350
Ile Arg Gly Val Asp Arg Glu Pro Gln Asn Lys Glu Pro Gln Ala Pro
80 85 90

AGA GAA CCT ATC CGT GAT CTT TAT GAT TAC GGC TAT AGT TTT GGG CAT 398
Arg Glu Pro Ile Arg Asp Leu Tyr Asp Tyr Gly Tyr Ser Phe Gly His
95 100 105

GCT TGG TGATCTTAAA TGAGTCAAGA 424
Ala Trp
110



(2) INFORMATION FOR SEQ ID NO: 808: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808: 



Met Lys Gly Phe Val Met Ser Gly Leu Arg Thr Phe Ser Cys Val Val
1 5 10 15

Val Leu Cys Gly Ala Met Ala Asn Val Ala Ile Ala Ser Pro Lys Ile
20 25 30

Glu Ala Arg Gly Glu Leu Gly Lys Phe Ile Gly Gly Gly Val Gly Gly
35 40 45

Phe Val Gly Asp Lys Met Gly Gly Phe Val Gly Gly Ala Ile Gly Gly
50 55 60

Tyr Ile Gly Ser Glu Ile Gly Asp Arg Val Glu Asp Tyr Ile Arg Gly

(2) INFORMATION FOR SEQ ID NO: 806:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 806:



Met Ser Gln Gly Asp Gly Val Glu Gly Asn Asn Met Asp Thr Thr Lys
1 5 10 15

Glu Asn Leu Asn Gly Ser Lys Glu Arg Leu Ser Asp Trp Glu Tyr Arg
20 25 30

Trp Ala Met Ala Leu Val Tyr Gly Gly Cys Ile Ser Ile Thr Thr Arg
35 40 45

Ile Phe Tyr Asp Ile Asn Gly Ser Ala Ser Asp Pro Leu Phe Asp Pro
50 55 60

Lys Tyr Ser Tyr Tyr Val Trp Leu Val Ala Leu Ile Ala Ala Leu Leu
65 70 75 80

Ser Asn Leu Leu Phe Asn Pro Lys Gly Arg Ser Val Gly Tyr Leu Met
85 90 95

Ile Glu Thr Trp Gln Gly Phe Pro Lys Phe Phe Lys Ala Ile Phe Lys
100 105 110

Ala Arg Phe Phe Gly Ala Phe Tyr Asp Ala Val Leu Gly Ser Arg Leu
115 120 125

Arg Asp Phe Tyr Val Met Leu Leu Thr Met Pro Phe Ile Ala Ala Ile
130 135 140

His Glu Val Ser Ala Tyr Cys Gly His Pro Ser Asn Leu Leu Val Glu
145 150 155 160

Gly Leu Val Ile Leu Gly Phe Gln Gly Phe Leu Lys Leu Cys Ala Lys
165 170 175

Trp Gly Trp



(2) INFORMATION FOR SEQ ID NO: 807:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 72... 404 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805:

TTTGGGCATG CTTGGTGATC TTAA ATG AGT CAA GGT GAT GGG GTG GAA GGA 51
Met Ser Gln Gly Asp Gly Val Glu Gly
1 5

AAT AAT ATG GAT ACT ACG AAA GAG AAC TTG AAT GGC TCA AAA GAG CGT 99
Asn Asn Met Asp Thr Thr Lys Glu Asn Leu Asn Gly Ser Lys Glu Arg
10 15 20 25

TTG AGC GAT TGG GAA TAT CGA TGG GCA ATG GCT CTA GTC TAT GGA GGA 147
Leu Ser Asp Trp Glu Tyr Arg Trp Ala Met Ala Leu Val Tyr Gly Gly
30 35 40

TGT ATC TCC ATA ACC ACT AGG ATT TTT TAT GAC ATA AAT GGT TCA GCT 195
Cys Ile Ser Ile Thr Thr Arg Ile Phe Tyr Asp Ile Asn Gly Ser Ala
45 50 55

AGC GAT CCG CTT TTT GAC CCT AAA TAC AGC TAT TAT GTG TGG TTA GTG 243
Ser Asp Pro Leu Phe Asp Pro Lys Tyr Ser Tyr Tyr Val Trp Leu Val
60 65 70

GCT CTA ATA GCG GCT TTG TTG TCT AAT CTC TTG TTT AAT CCT AAA GGC 291
Ala Leu Ile Ala Ala Leu Leu Ser Asn Leu Leu Phe Asn Pro Lys Gly
75 80 85

AGG TCG GTA GGT TAT TTA ATG ATT GAA ACT TGG CAA GGG TTC CCC AAG 339
Arg Ser Val Gly Tyr Leu Met Ile Glu Thr Trp Gln Gly Phe Pro Lys
90 95 100 105

TTT TTT AAA GCC ATT TTT AAG GCT AGG TTT TTT GGT GCG TTT TAT GAC 387
Phe Phe Lys Ala Ile Phe Lys Ala Arg Phe Phe Gly Ala Phe Tyr Asp
110 115 120

GCT GTG TTA GGA TCA AGG CTA AGG GAT TTT TAT GTG ATG CTT TTA ACG 435
Ala Val Leu Gly Ser Arg Leu Arg Asp Phe Tyr Val Met Leu Leu Thr
125 130 135

ATG CCC TTT ATT GCC GCT ATC CAT GAG GTT TCG GCG TAT TGT GGG CAT 483
Met Pro Phe Ile Ala Ala Ile His Glu Val Ser Ala Tyr Cys Gly His
140 145 150

CCT AGC AAT CTC CTT GTA GAG GGT TTG GTC ATT TTG GGG TTT CAA GGT 531
Pro Ser Asn Leu Leu Val Glu Gly Leu Val Ile Leu Gly Phe Gln Gly
155 160 165

TTT CTT AAG CTT TGC GCT AAA TGG GGG TGG TGATTTAACC CAAATGTCAT TAA 584
Phe Leu Lys Leu Cys Ala Lys Trp Gly Trp
170 175

ATGGAGGGGG TATAAAAAAA TTAAAAA 611
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 228 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:804: 

Met Leu Ala Tyr Cys Phe Ile Thr Pro Asp Asp Leu Lys Asn Leu Lys
1 5 10 15

Glu Arg Leu Phe Ile Asp Ile Ile Asn Ala Ile Asn Gln Lys Lys Arg
20 25 30

Val Ala Leu Asp His Ala Gln Ile Asp Asp Ile Gln Tyr Asn Val Leu
35 40 45

Asp Asn Ala Phe Tyr Phe Ile Phe Asp Val Gly Asn Pro Ser Gln Leu
50 55 60

Ala Ile Lys Val Pro Arg Lys Ser Leu Glu Asn Asp Glu Leu Pro Asn
65 70 75 80

Thr Lys Lys Asn Ile Phe Asn Gly Leu Ile Arg Thr Ile Tyr Gly Cys
85 90 95

Ile Asp Asp Glu Asn Ser Phe Leu Leu Glu Asn Asp Lys Thr Ile Lys
100 105 110

Asp Leu Asn Ile Gln Asp Leu Leu Gly Pro Leu Lys Thr Gln Ala Phe
115 120 125

Pro Leu Ser Tyr Ile Ile Thr Asp Ala Ile Asn Gln Lys Glu Gly Val
130 135 140

Ala Leu Asp Tyr Ala Leu Ile Asn Asp Ile Lys Tyr Asn Leu Leu Asp
145 150 155 160

Asn Thr Phe His Phe Ile Phe Asp Val Gly Asn Pro Leu Leu Lys Glu
165 170 175

Ser Ser Gln Phe Ile Ile Glu Val Pro Arg Glu Ala Leu Asp Leu Glu
180 185 190

Asn Val Asp Arg Leu Val Glu Tyr Thr Leu Ser Pro Asn Asn His Ser
195 200 205

Gln Ser Ser Leu Val Tyr His Ile Ser Glu Gly Ser Tyr Ile Ile His
210 215 220

Leu Ile Asp Asp
225

(2) INFORMATION FOR SEQ ID NO: 805:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25...561
(D) OTHER INFORMATION: 
ATC AAC CAA AAA AAG AGA GTC GCG CTC GAT CAT GCT CAA ATA GAT GAC 147
Ile Asn Gln Lys Lys Arg Val Ala Leu Asp His Ala Gln Ile Asp Asp
30 35 40

ATC CAG TAT AAT GTG CTT GAT AAT GCG TTT TAT TTT ATC TTT GAT GTT 195
Ile Gln Tyr Asn Val Leu Asp Asn Ala Phe Tyr Phe Ile Phe Asp Val
45 50 55

GGT AAC CCT TCT CAA TTA GCT ATT AAA GTG CCT AGA AAA TCT TTA GAA 243
Gly Asn Pro Ser Gln Leu Ala Ile Lys Val Pro Arg Lys Ser Leu Glu
60 65 70

AAT GAT GAG TTG CCC AAC ACT AAA AAA AAC ATA TTC AAT GGA TTA ATA 291
Asn Asp Glu Leu Pro Asn Thr Lys Lys Asn Ile Phe Asn Gly Leu Ile
75 80 85 90

AGA ACT ATC TAT GGG TGT ATT GAT GAT GAA AAT TCA TTT TTA TTA GAA 339
Arg Thr Ile Tyr Gly Cys Ile Asp Asp Glu Asn Ser Phe Leu Leu Glu
95 100 105

AAC GAT AAA ACC ATC AAG GAT TTA AAT ATT CAG GAT TTA TTG GGG CCA 387
Asn Asp Lys Thr Ile Lys Asp Leu Asn Ile Gln Asp Leu Leu Gly Pro
110 115 120

TTA AAA ACT CAA GCA TTT CCA TTA TCA TAC ATT ATT ACT GAC GCT ATC 435
Leu Lys Thr Gln Ala Phe Pro Leu Ser Tyr Ile Ile Thr Asp Ala Ile
125 130 135

AAT CAA AAA GAA GGG GTG GCT CTC GAT TAC GCT CTA ATA AAC GAT ATT 483
Asn Gln Lys Glu Gly Val Ala Leu Asp Tyr Ala Leu Ile Asn Asp Ile
140 145 150

AAG TAT AAT TTG CTT GAT AAC ACA TTC CAT TTT ATC TTT GAT GTT GGT 531
Lys Tyr Asn Leu Leu Asp Asn Thr Phe His Phe Ile Phe Asp Val Gly
155 160 165 170

AAT CCT TTG TTG AAA GAG TCA AGT CAA TTT ATT ATT GAA GTG CCT AGA 579
Asn Pro Leu Leu Lys Glu Ser Ser Gln Phe Ile Ile Glu Val Pro Arg
175 180 185

GAG GCG TTG GAT CTA GAG AAT GTT GAT CGG CTT GTT GAA TAT ACG CTG 627
Glu Ala Leu Asp Leu Glu Asn Val Asp Arg Leu Val Glu Tyr Thr Leu
190 195 200

TCT CCT AAT AAT CAT AGT CAA AGT TCT TTA GTG TAT CAT ATT TCT GAA 675
Ser Pro Asn Asn His Ser Gln Ser Ser Leu Val Tyr His Ile Ser Glu
205 210 215

GGC TCT TAT ATC ATT CAC TTA ATA GAT GAC TAAACTTAAA TGAAA 720
Gly Ser Tyr Ile Ile His Leu Ile Asp Asp
220 225



(2) INFORMATION FOR SEQ ID NO: 804: 
Thr Ile Glu Asp Tyr Leu Ile Leu Asp Asn Leu Lys Ser Cys Phe Val
210 215 220

Ile Gln Asn Lys Pro Asn Val Thr Leu Phe Asp Asn Asp Glu Asn Asp
225 230 235 240

Arg Pro Phe Asn Leu Lys Arg Tyr Leu Leu Gly Leu Lys Glu Lys Leu
245 250 255

Gly Phe Glu Pro Thr Gly Ile Phe Tyr Cys Glu Asn Ala Asn Thr His
260 265 270

Lys Ile Glu Leu Ile Gly Asn Asp Ser Asp Phe Arg Glu Val Leu Leu
275 280 285

Glu Phe Ser Glu Asn Ile Pro Lys Ala Pro Asn Glu Leu Pro Gln Phe
290 295 300

Leu Thr Asn Phe Lys Asn Ser Lys Ile Pro Asn Gly Asn Ile Ser Phe
305 310 315 320

Ser Pro Pro Lys Asn Ser Pro Ser Ile Ser Ser Tyr Ala Leu Ser Asp
325 330 335

Lys Ile Lys Arg Glu Val Arg Asp Thr Phe Asp Arg Tyr Leu Trp His
340 345 350

Gly Tyr Ser Lys Ile Pro Gln Glu Lys Arg Ile Ala Lys Ile Lys Glu
355 360 365

Gln Val Lys Glu Glu Ile Lys Leu Asn Pro Ser Phe Arg Asn Tyr Arg
370 375 380

Val Asp Ser Glu Gln Asn Arg Lys Ile Asn Glu Ile Ala Glu Gly Leu
385 390 395 400

Lys Ser Gly Lys Ile Ile Gly Lys Lys Val Ile Ala Asn Ala Phe Asp
405 410 415

Leu Asn Ala Ser Leu Leu Phe Tyr Tyr Ser
420 425

(2) INFORMATION FOR SEQ ID NO:803: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 22...705
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803: 

TTGCTAATGC GTTCGATCTA A ATG CTA GCT TAT TGT TTT ATT ACT CCT GAT 51
Met Leu Ala Tyr Cys Phe Ile Thr Pro Asp
1 5 10

GAT TTA AAG AAT TTA AAG GAA CGA TTA TTT ATA GAT ATT ATC AAT GCT 99
Asp Leu Lys Asn Leu Lys Glu Arg Leu Phe Ile Asp Ile Ile Asn Ala
15 20 25
CCT TCT TTT CGT AAT TAT AGA GTA GAC TCT GAA CAA AAC CGC AAG ATC 1203
Pro Ser Phe Arg Asn Tyr Arg Val Asp Ser Glu Gln Asn Arg Lys Ile
380 385 390

AAT GAA ATT GCT GAG GGT TTA AAA AGT GGT AAG ATA ATT GGT AAA AAG 1251
Asn Glu Ile Ala Glu Gly Leu Lys Ser Gly Lys Ile Ile Gly Lys Lys
395 400 405

GTT ATT GCT AAT GCG TTC GAT CTA AAT GCT AGC TTA TTG TTT TAT TAC 1299
Val Ile Ala Asn Ala Phe Asp Leu Asn Ala Ser Leu Leu Phe Tyr Tyr
410 415 420 425

TCC TGATGATTTA AAGAATTTAA AGGAACGATT ATTTATAGAT ATT 1345
Ser



(2) INFORMATION FOR SEQ ID NO: 802: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:802:



Met Ala Val Arg Phe Gly Ile Ile Phe Ile Ser Asp Ser Ile Asp Asp
1 5 10 15

Tyr Lys Ala Lys Gln Leu Arg Ser Ile Leu Glu Arg Lys Lys Glu Cys
20 25 30

Asn Phe Ile Trp Phe Asn Glu Ser Ser Ala Ile Ile His Asn Thr Pro
35 40 45

Lys Val Phe Glu Gly Glu Ser Phe Phe Asp His Leu Phe Val Ser Ala
50 55 60

Lys Ile Thr Ala Phe Val Val Ser Thr Asn Glu Ser Asp Thr Ile Phe
65 70 75 80

Asn Leu Lys Asn Tyr Leu Leu Val Leu Ala Lys Asn Leu Asn Asn Arg
85 90 95

Asp Ile Trp Tyr Cys Glu Asn Thr Ile Cys Asp Lys Lys Gly Thr Tyr
100 105 110

Asn Ile Glu Ile Glu Leu Val Ser Asn Ala Asn Asp Phe Arg Gly Val
115 120 125

Phe Gly Glu Val Leu Gly Ile Val Lys Asp Thr Phe Gly Asp Leu Leu
130 135 140

Gln Leu Leu Thr Asn Leu Lys Asn Lys Glu Ile Glu Phe Asn Phe His
145 150 155 160

Lys Lys Ile Asn Tyr Gly Leu Pro Phe Gly Ile Ile Phe Ile Ala Ser
165 170 175

Asn Ser Asp Asn Pro Ile Asp Ile Asp Asn Lys Thr Lys Lys Leu Lys
180 185 190

Ser Cys Phe Arg Asp Asp Glu Ser Asn Cys Phe Ile Asp Cys Pro Ile
195 200 205
Glu Ile Glu Phe Asn Phe His Lys Lys Ile Asn Tyr Gly Leu Pro Phe
155 160 165

GGG ATT ATC TTT ATC GCT AGC AAC TCT GAC AAC CCT ATT GAT ATT GAC 579
Gly Ile Ile Phe Ile Ala Ser Asn Ser Asp Asn Pro Ile Asp Ile Asp
170 175 180 185

AAT AAA ACC AAA AAG TTA AAA TCA TGC TTT CGT GAT GAT GAG AGT AAC 627
Asn Lys Thr Lys Lys Leu Lys Ser Cys Phe Arg Asp Asp Glu Ser Asn
190 195 200

TGT TTT ATT GAC TGC CCA ATT ACA ATT GAG GAT TAT TTA ATT TTA GAT 675
Cys Phe Ile Asp Cys Pro Ile Thr Ile Glu Asp Tyr Leu Ile Leu Asp
205 210 215

AAT CTA AAA AGC TGT TTT GTA ATC CAA AAT AAG CCA AAT GTA ACA TTA 723
Asn Leu Lys Ser Cys Phe Val Ile Gln Asn Lys Pro Asn Val Thr Leu
220 225 230

TTT GAT AAC GAC GAG AAC GAT AGA CCA TTC AAT TTA AAG CGA TAC TTG 771
Phe Asp Asn Asp Glu Asn Asp Arg Pro Phe Asn Leu Lys Arg Tyr Leu
235 240 245

TTA GGA TTG AAA GAA AAG TTA GGG TTT GAG CCA ACG GGT ATT TTC TAT 819
Leu Gly Leu Lys Glu Lys Leu Gly Phe Glu Pro Thr Gly Ile Phe Tyr
250 255 260 265

TGC GAA AAC GCA AAC ACA CAC AAA ATT GAA TTG ATT GGT AAT GAT TCT 867
Cys Glu Asn Ala Asn Thr His Lys Ile Glu Leu Ile Gly Asn Asp Ser
270 275 280

GAT TTC AGA GAG GTA TTA CTT GAA TTT TCA GAG AAT ATA CCA AAA GCC 915
Asp Phe Arg Glu Val Leu Leu Glu Phe Ser Glu Asn Ile Pro Lys Ala
285 290 295

CCT AAT GAA CTA CCA CAA TTT CTT ACA AAC TTT AAA AAT TCA AAA ATC 963
Pro Asn Glu Leu Pro Gln Phe Leu Thr Asn Phe Lys Asn Ser Lys Ile
300 305 310

CCC AAT GGA AAC ATT TCA TTT TCG CCA CCA AAA AAT TCT CCA TCA ATT 1011
Pro Asn Gly Asn Ile Ser Phe Ser Pro Pro Lys Asn Ser Pro Ser Ile
315 320 325

TCT TCA TAT GCT TTA TCT GAT AAG ATT AAA AGA GAA GTA AGA GAT ACC 1059
Ser Ser Tyr Ala Leu Ser Asp Lys Ile Lys Arg Glu Val Arg Asp Thr
330 335 340 345

TTT GAT CGC TAT TTG TGG CAT GGT TAT TCT AAA ATT CCA CAG GAG AAA 1107
Phe Asp Arg Tyr Leu Trp His Gly Tyr Ser Lys Ile Pro Gln Glu Lys
350 355 360

AGG ATA GCC AAA ATA AAA GAG CAA GTG AAG GAA GAA ATT AAA CTA AAT 1155
Arg Ile Ala Lys Ile Lys Glu Gln Val Lys Glu Glu Ile Lys Leu Asn
365 370 375
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 25...1302
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801: 

CTAAATGAAT TTAAAAGGAA AAAT ATG GCA GTA AGA TTT GGG ATT ATC TTT 51
Met Ala Val Arg Phe Gly Ile Ile Phe
1 5

ATA TCT GAC TCT ATT GAT GAT TAT AAA GCC AAA CAA TTA AGA TCA ATT 99
Ile Ser Asp Ser Ile Asp Asp Tyr Lys Ala Lys Gln Leu Arg Ser Ile
10 15 20 25

TTA GAA CGC AAG AAA GAG TGT AAT TTT ATA TGG TTT AAT GAA TCA AGT 147
Leu Glu Arg Lys Lys Glu Cys Asn Phe Ile Trp Phe Asn Glu Ser Ser
30 35 40

GCT ATA ATT CAC AAT ACT CCT AAA GTT TTT GAA GGA GAG AGT TTT TTT 195
Ala Ile Ile His Asn Thr Pro Lys Val Phe Glu Gly Glu Ser Phe Phe
45 50 55

GAT CAT CTT TTC GTT AGT GCA AAA ATT ACT GCT TTT GTG GTA TCC ACA 243
Asp His Leu Phe Val Ser Ala Lys Ile Thr Ala Phe Val Val Ser Thr
60 65 70

AAC GAA TCA GAT ACA ATA TTC AAT TTA AAA AAC TAC TTG CTA GTA TTA 291
Asn Glu Ser Asp Thr Ile Phe Asn Leu Lys Asn Tyr Leu Leu Val Leu
75 80 85

GCC AAA AAT CTC AAT AAT AGA GAT ATT TGG TAT TGT GAA AAC ACT ATT 339
Ala Lys Asn Leu Asn Asn Arg Asp Ile Trp Tyr Cys Glu Asn Thr Ile
90 95 100 105

TGC GAT AAA AAA GGC ACT TAT AAT ATA GAA ATA GAA TTA GTG AGC AAT 387
Cys Asp Lys Lys Gly Thr Tyr Asn Ile Glu Ile Glu Leu Val Ser Asn
110 115 120

GCT AAT GAT TTT AGA GGA GTG TTT GGA GAA GTG TTA GGT ATA GTC AAA 435
Ala Asn Asp Phe Arg Gly Val Phe Gly Glu Val Leu Gly Ile Val Lys
125 130 135

GAC ACT TTC GGT GAT TTA CTG CAA CTT CTT ACA AAT TTA AAG AAC AAG 483
Asp Thr Phe Gly Asp Leu Leu Gln Leu Leu Thr Asn Leu Lys Asn Lys
140 145 150

GAA ATT GAA TTT AAT TTT CAT AAA AAA ATT AAT TAC GGA TTG CCT TTT 531
65 70 75 80



Arg Ile Ile Leu Gly His Thr Leu Gly Tyr Ser Ala Pro Ile Thr Leu
85 90 95

Tyr Tyr Glu Trp Leu Phe Asp Asp Trp Ile Asp Pro Glu Lys Ile Met
100 105 110

Gly Asp Arg Phe Val Cys Arg Thr Asn Tyr Leu Glu Ser Phe Phe Thr
115 120 125

Thr Lys Lys His Leu Leu Pro Asp Thr Leu Phe Lys Val Asp Glu Ser
130 135 140

Gly Cys Glu Ser Tyr His Glu Asn Asn Asp Lys Asp Phe Ile Leu Gln
145 150 155 160

Ser Phe Tyr Ile Gln Asn Asp Phe Leu Ser Gln Arg Tyr Glu Lys Asp
165 170 175

Lys Ile Lys Ala Lys Ser Asn Leu Ile Pro Lys Arg Gln Asn Arg Leu
180 185 190

Leu Thr Tyr Gln Phe Asp Leu Ser Leu Glu Cys Asn Ile Ile Phe Glu
195 200 205

Thr Leu Glu Lys Leu Ala Leu Ile Ala Gly Ala Ile Lys Asn Phe Phe
210 215 220

Ile Leu Ile Tyr Ala His Ser Asn Phe Asp Ile Gln Ile Asp Tyr Ile
225 230 235 240

Gln Phe Lys Leu Ser Asn Lys Asp Ile Thr Ala Ile Arg Asn Thr Tyr
245 250 255

Lys Lys Asp Lys Lys Ser Met Glu Ile Asp Leu Tyr Gly Ile Ala Ile
260 265 270

Asn Phe Gln Arg Ile Asp Asn Phe Ser Val Ile Leu Glu Lys Trp Ile
275 280 285

Val Phe Tyr Ile Lys Asp Asn Arg Asp Phe Gln Leu Ala Ser Ile Leu
290 295 300

Asp Ile Ile Asn Lys Lys Asp Pro Ile Ile His Leu Tyr Leu Asp Met
305 310 315 320

Phe Val Leu Ile Ser Met Ile Glu Ser Phe Leu Lys Lys Pro Gln Gln
325 330 335

Thr Lys Leu His Glu Lys Leu Ser Glu Phe Phe Lys Ile Ser Leu Ser
340 345 350

Arg Thr Lys Cys Asp Gln Thr Lys Asn Tyr Phe Asn Asp Lys Cys Gln
355 360 365

Glu Asp Leu Ile Gln Gln Ile Val Asp Cys Arg Asn Ser Leu Ala His
370 375 380

Gly Arg Ser Leu Lys Leu Asp Thr Asn Lys Ala Thr Asp Ile Ser His
385 390 395 400

Ala Phe Ile Asp Phe Lys Gln Ile Val Ile Glu Phe Phe Phe Gly Glu
405 410 415

Ile Gly Leu Ser Asp Phe Ile Thr Asn Asn Phe Gly Phe Leu Asn Lys
420 425 430

Val Lys Leu Arg Asn Pro Pro Lys Thr Glu Lys Ile Thr Glu Pro Asn
435 440 445

Arg 



(2) INFORMATION FOR SEQ ID NO: 801:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1345 base pairs 

(B) TYPE: nucleic acid 
TTA AAG AAA CCA CAA CAA ACA AAA CTC CAT GAA AAA CTC TCT GAA TTT 1061
Leu Lys Lys Pro Gln Gln Thr Lys Leu His Glu Lys Leu Ser Glu Phe
335 340 345

TTT AAA ATT TCA TTA TCT AGG ACA AAA TGC GAT CAA ACG AAA AAT TAT 1109
Phe Lys Ile Ser Leu Ser Arg Thr Lys Cys Asp Gln Thr Lys Asn Tyr
350 355 360

TTT AAT GAT AAA TGT CAA GAA GAT CTA ATC CAA CAG ATT GTT GAC TGC 1157
Phe Asn Asp Lys Cys Gln Glu Asp Leu Ile Gln Gln Ile Val Asp Cys
365 370 375

CGT AAC TCT CTA GCG CAC GGA AGA AGT TTA AAG CTT GAT ACA AAC AAA 1205
Arg Asn Ser Leu Ala His Gly Arg Ser Leu Lys Leu Asp Thr Asn Lys
380 385 390

GCT ACA GAC ATT AGC CAT GCT TTT ATA GAT TTC AAG CAA ATT GTC ATT 1253
Ala Thr Asp Ile Ser His Ala Phe Ile Asp Phe Lys Gln Ile Val Ile
395 400 405 410

GAA TTT TTC TTT GGC GAG ATA GGA TTG AGC GAT TTT ATT ACA AAC AAT 1301
Glu Phe Phe Phe Gly Glu Ile Gly Leu Ser Asp Phe Ile Thr Asn Asn
415 420 425

TTT GGT TTT CTT AAC AAA GTT AAA TTA AGA AAC CCC CCA AAA ACA GAA 1349
Phe Gly Phe Leu Asn Lys Val Lys Leu Arg Asn Pro Pro Lys Thr Glu
430 435 440

AAA ATC ACC GAG CCA AAC CGC TAAAACCCCT TAGAAAATTT AAAATTTTAA GTTT 1404
Lys Ile Thr Glu Pro Asn Arg
445

TAGGGGTGTT TTTCTTAAGA ATTTAGGTTT TTTATA 1440

(2) INFORMATION FOR SEQ ID NO: 800: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:800: 

Val Ile Thr Ser Leu Gly Gly Val Glu Tyr Phe Glu Arg Gln Cys Leu
1 5 10 15

Ala Phe Leu Lys Asn Pro Gln Thr Asn Pro Gln Asn Glu Gln Tyr Ile
20 25 30

Pro Gly Val Phe Ser Tyr Gln Glu Asn Lys Ile Ser Phe Ser Phe Leu
35 40 45

Val Leu Gly Glu Ile Glu Glu Ile His Ser Leu Gln Tyr Gln Thr Leu
50 55 60

Tyr Ile Val Asp Asn Lys Lys Arg Tyr Thr Leu Tyr Lys Leu Tyr Asp
GAT CCA GAA AAA ATT ATG GGC GAT CGT TTT GTT TGT AGG ACA AAT TAT 389
Asp Pro Glu Lys Ile Met Gly Asp Arg Phe Val Cys Arg Thr Asn Tyr
110 115 120

TTA GAA AGT TTT TTT ACG ACC AAG AAG CAT TTG CTA CCT GAT ACA TTA 437
Leu Glu Ser Phe Phe Thr Thr Lys Lys His Leu Leu Pro Asp Thr Leu
125 130 135

TTT AAA GTA GAT GAA AGT GGG TGT GAA AGT TAT CAT GAG AAT AAC GAT 485
Phe Lys Val Asp Glu Ser Gly Cys Glu Ser Tyr His Glu Asn Asn Asp
140 145 150

AAG GAC TTT ATC CTA CAA TCA TTT TAT ATT CAA AAT GAT TTT TTA TCC 533
Lys Asp Phe Ile Leu Gln Ser Phe Tyr Ile Gln Asn Asp Phe Leu Ser
155 160 165 170

CAA AGA TAT GAA AAA GAC AAG ATA AAA GCA AAA TCT AAT TTG ATT CCT 581
Gln Arg Tyr Glu Lys Asp Lys Ile Lys Ala Lys Ser Asn Leu Ile Pro
175 180 185

AAA AGA CAG AAT CGT TTA TTA ACT TAT CAA TTT GAT TTG TCT TTG GAA 629
Lys Arg Gln Asn Arg Leu Leu Thr Tyr Gln Phe Asp Leu Ser Leu Glu
190 195 200

TGC AAT ATA ATT TTT GAA ACC CTT GAA AAA TTA GCA CTT ATT GCT GGA 677
Cys Asn Ile Ile Phe Glu Thr Leu Glu Lys Leu Ala Leu Ile Ala Gly
205 210 215

GCG ATT AAA AAC TTT TTT ATT TTG ATT TAT GCT CAT TCT AAT TTT GAC 725
Ala Ile Lys Asn Phe Phe Ile Leu Ile Tyr Ala His Ser Asn Phe Asp
220 225 230

ATC CAA ATT GAC TAT ATC CAA TTC AAG CTT TCT AAT AAA GAC ATT ACA 773
Ile Gln Ile Asp Tyr Ile Gln Phe Lys Leu Ser Asn Lys Asp Ile Thr
235 240 245 250

GCA ATA AGA AAC ACT TAC AAA AAA GAT AAA AAG TCT ATG GAG ATA GAT 821
Ala Ile Arg Asn Thr Tyr Lys Lys Asp Lys Lys Ser Met Glu Ile Asp
255 260 265

CTT TAT GGG ATT GCT ATA AAT TTC CAA CGG ATA GAC AAT TTT TCT GTA 869
Leu Tyr Gly Ile Ala Ile Asn Phe Gln Arg Ile Asp Asn Phe Ser Val
270 275 280

ATA CTT GAA AAA TGG ATT GTT TTT TAT ATC AAA GAC AAT AGA GAT TTC 917
Ile Leu Glu Lys Trp Ile Val Phe Tyr Ile Lys Asp Asn Arg Asp Phe
285 290 295

CAA CTT GCA AGT ATT TTA GAC ATT ATT AAT AAA AAA GAT CCA ATT ATT 965
Gln Leu Ala Ser Ile Leu Asp Ile Ile Asn Lys Lys Asp Pro Ile Ile
300 305 310

CAC TTG TAT TTG GAC ATG TTT GTA TTG ATT AGC ATG ATT GAA AGT TTT 1013
His Leu Tyr Leu Asp Met Phe Val Leu Ile Ser Met Ile Glu Ser Phe
315 320 325 330
885 890 895

Pro Asn Asp Tyr Ala Gly Gly Lys Tyr Val Phe Glu Leu Leu Cys Arg
900 905 910

Val Ile Thr Leu Ser Val Lys Ser Val Asp Leu Ile Glu Lys Ile Ser
915 920 925

Glu Lys Arg Phe Glu 
930 

(2) INFORMATION FOR SEQ ID NO: 799:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 24...1370
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799:

AAGATCAGCG AAAAGAGGTT TGA GTG ATT ACA TCG CTT GGG GGT GTG GAA TAT 53
Val Ile Thr Ser Leu Gly Gly Val Glu Tyr
1 5 10

TTT GAA AGG CAA TGT CTT GCT TTC TTA AAA AAT CCA CAA ACT AAT CCA 101
Phe Glu Arg Gln Cys Leu Ala Phe Leu Lys Asn Pro Gln Thr Asn Pro
15 20 25

CAA AAT GAG CAA TAC ATT CCA GGA GTG TTT TCG TAT CAA GAA AAC AAA 149
Gln Asn Glu Gln Tyr Ile Pro Gly Val Phe Ser Tyr Gln Glu Asn Lys
30 35 40

ATT TCT TTT TCT TTT TTG GTT TTA GGA GAA ATT GAA GAG ATC CAC TCT 197
Ile Ser Phe Ser Phe Leu Val Leu Gly Glu Ile Glu Glu Ile His Ser
45 50 55

TTG CAA TAC CAA ACG CTC TAT ATT GTG GAT AAC AAA AAA AGA TAC ACT 245
Leu Gln Tyr Gln Thr Leu Tyr Ile Val Asp Asn Lys Lys Arg Tyr Thr
60 65 70

CTT TAC AAG CTT TAT GAT CGC ATT ATT TTG GGT CAT ACT TTA GGG TAT 293
Leu Tyr Lys Leu Tyr Asp Arg Ile Ile Leu Gly His Thr Leu Gly Tyr
75 80 85 90

TCT GCA CCA ATC ACG CTC TAT TAT GAA TGG CTG TTT GAT GAT TGG ATC 341
Ser Ala Pro Ile Thr Leu Tyr Tyr Glu Trp Leu Phe Asp Asp Trp Ile
95 100 105
95 100 105 
Gly Asn Gln Arg Thr Ser Gly Glu Val Ser Lys Lys Glu Gly Gly Lys
450 455 460

Ile Phe Asp Ser Gly Ser Arg Ala Thr Val Ala Ile Ile Phe Phe Val
465 470 475 480

Lys Asp Lys Ser Thr Pro Asp Asn Thr Ile Phe Tyr Tyr Glu Val Glu
485 490 495

Asp Tyr Leu Lys Arg Glu Ala Lys Leu Asn Trp Leu Ala Asn Phe Glu
500 505 510

Asn Leu Asp Phe Val Pro Phe Glu Lys Ile Thr Pro Asn Asp Lys Gly
515 520 525

Asp Trp Ile Asn Gln Arg Asn Asp Ala Phe Glu Lys Leu Ile Pro Leu
530 535 540

Lys Arg Asp Lys Thr Leu Gln Asn Asp Ser Val Phe Asp Ile Asn Ser
545 550 555 560

Leu Gly Val Val Ser Gly Arg Asp Pro Trp Val Tyr Asn Phe Ser Pro
565 570 575

Asn Ile Leu Thr Gln Ser Val Gln Lys Cys Ile Asp Thr Tyr Asn Ala
580 585 590

Asp Leu Lys Arg Phe Asn Ala Arg Phe Arg Glu Ala Phe Lys Gln Arg
595 600 605

Ala Gln Ser Val Lys Ala Gly Asp Leu Tyr Lys Gln Leu Asn Asp Lys
610 615 620

Glu Ile Thr Thr Asp Lys Thr Lys Ile Ala Trp Thr Asp Gly Leu Lys
625 630 635 640

Asn Lys Leu Ile Lys Asn Lys Ser Ala Arg Glu Ser Ser Glu Glu Arg
645 650 655

Val Arg Leu Ala Leu Tyr Arg Pro Phe Asn Lys Gln Trp Leu Tyr Trp
660 665 670

Asp Lys Asp Trp Ile Asn Arg Gln Arg Glu Phe Ser Lys Ile Phe Pro
675 680 685

Asp Lys Asp Ala Gln Asn Val Val Ile Asn Thr Gly Val Gly Asn Gly
690 695 700

Lys Asp Phe Ser Ala Leu Val Ser Asp Phe Ile Ser Asp Tyr Ser Leu
705 710 715 720

Ile Ser Pro Asn Gln Ala Tyr Pro Leu Tyr Tyr Tyr Asp Asp Leu Gly
725 730 735

Asn Arg His Tyr Ala Ile Ser Gly Tyr Cys Leu Asn Leu Phe Arg Arg
740 745 750

His Tyr Gly Asp Asn Leu Ile Ala Glu Glu Glu Ile Phe Tyr Tyr Ile
755 760 765

Tyr Ala Ile Phe His His Lys Gly Tyr Leu Glu Lys Tyr Lys Asn Ser
770 775 780

Leu Ala Lys Glu Ala Pro Arg Ile Ala Leu Ser Glu Asp Phe Lys Glu
785 790 795 800

Leu Ser Val Leu Gly Lys Glu Leu Ala Glu Leu His Leu Asn Tyr Glu
805 810 815

Ser Gly Glu Met His Asp Asn Ile Lys Tyr Thr Thr Leu Met Asn Ala
820 825 830

Glu Ile Glu Gly Tyr Tyr Asp Val Asp Lys Met Thr Lys Lys Gly Asp
835 840 845

Cys Ile Ile Tyr Asn Gln Asn Ile Ala Ile Thr Lys Ile Pro Lys Lys
850 855 860

Ala Phe Asp Tyr Val Ile Asn Gly Lys Ser Ala Ile Asp Trp Val Ile
865 870 875 880

Glu Arg Tyr Gln Lys Thr Met Asp Lys Glu Ser Leu Ile Glu Asn Asn
5 10

Ser Leu Glu Ala Met Met Ala Thr Asn His Asn Asp Glu Lys Thr Leu
20 25 30

Phe Asp Ala Ile Leu Leu Gln Asp Leu Ala Asp Ala Met Tyr Asn Val
35 40 45

Met Pro Thr Lys Leu Gly Asp Arg Asn Tyr Trp Glu Asn Phe Thr Lys
50 55 60

Lys Thr Gly Asn Ile Ala Arg Thr Leu Asn Asn Arg Leu Lys Ile Ile
65 70 75 80

Phe Asp Lys Asn Pro Glu Phe Phe His Gly Phe Leu Asp Ser Leu Arg
85 90 95

Glu Asn Ile His Gln Asn Ile Lys Glu Asp Glu Ala Leu Asp Met Ile
100 105 110

Thr Ser His Ile Ile Thr Lys Pro Ile Phe Asp Ala Leu Phe Gly Asp
115 120 125

Asn Ile Lys Asn Pro Ile Ala Lys Ala Leu Asp Lys Met Val Glu Lys
130 135 140

Leu Ser Thr Leu Gly Leu Glu Gly Glu Thr Lys Asp Leu Lys Asn Leu
145 150 155 160

Tyr Glu Ser Val Lys Thr Glu Ala Leu His Ala Lys Ser Gln Lys Ser
165 170 175

Gln Gln Glu Leu Ile Lys Asn Leu Tyr Asn Thr Phe Phe Lys Glu Ala
180 185 190

Phe Lys Lys Gln Ser Glu Lys Leu Gly Ile Val Tyr Thr Pro Ile Glu
195 200 205

Val Val Asp Phe Ile Leu Arg Ala Thr Asn Gly Ile Leu Lys Lys His
210 215 220

Phe Asn Thr Asp Phe Asn Asp Gln Ser Ile Thr Ile Phe Asp Pro Phe
225 230 235 240

Thr Gly Thr Gly Ser Phe Ile Ala Arg Leu Leu Ser Lys Glu Asn Ala
245 250 255

Leu Ile Ser Asp Glu Ala Leu Lys Glu Lys Phe Gln Lys Asn Leu Phe
260 265 270

Ala Phe Asp Ile Val Leu Leu Ser Tyr Tyr Ile Ala Leu Ile Asn Ile
275 280 285

Thr Gln Ala Ala Gln Asn Arg Asp Gly Ser Leu Asn Asn Phe Lys Asn
290 295 300

Ile Ala Leu Thr Asp Ser Leu Asp Tyr Leu Glu Glu Lys Thr Asn Lys
305 310 315 320

Gly Val Leu Pro Leu Tyr Glu Asp Leu Lys Glu Asn Lys Gly Ile Lys
325 330 335

Asp Thr Leu Ala Asn Gln Asn Ile Arg Val Ile Ile Gly Asn Pro Pro
340 345 350

Tyr Ser Ala Gly Ala Lys Ser Gln Asn Asp Asn Asn Gln Asn Leu Ser
355 360 365

His Pro Lys Leu Glu Lys Leu Val Tyr Glu Lys Tyr Gly Lys Asn Ser
370 375 380

Thr Ser Arg Ser Val Gly Lys Thr Thr Arg Asp Thr Leu Ile Gln Ser
385 390 395 400

Ile Arg Met Ala Ser Asp Val Val Lys Asp Arg Gly Val Ile Gly Phe
405 410 415

Val Val Asn Gly Gly Phe Ile Asp Ser Lys Ser Ala Asp Gly Phe Arg
420 425 430

Lys Cys Val Ala Lys Glu Phe Ser His Leu Tyr Val Leu Asn Leu Arg
435 440 445
TAC AAA AAT TCC CTC GCC AAA GAA GCG CCG CGC ATC GCT TTG AGC GAA 2403
Tyr Lys Asn Ser Leu Ala Lys Glu Ala Pro Arg Ile Ala Leu Ser Glu
785 790 795

GAT TTT AAA GAA CTC TCT GTG CTT GGC AAA GAA TTG GCC GAA TTG CAC 2451
Asp Phe Lys Glu Leu Ser Val Leu Gly Lys Glu Leu Ala Glu Leu His
800 805 810

CTG AAC TAT GAG AGT GGG GAA ATG CAT GAT AAT ATT AAA TAC ACC ACA 2499
Leu Asn Tyr Glu Ser Gly Glu Met His Asp Asn Ile Lys Tyr Thr Thr
815 820 825

CTG ATG AAC GCC GAA ATA GAG GGT TAT TAT GAT GTG GAT AAA ATG ACC 2547
Leu Met Asn Ala Glu Ile Glu Gly Tyr Tyr Asp Val Asp Lys Met Thr
830 835 840

AAA AAA GGG GAT TGC ATC ATC TAT AAC CAA AAC ATC GCT ATC ACT AAG 2595
Lys Lys Gly Asp Cys Ile Ile Tyr Asn Gln Asn Ile Ala Ile Thr Lys
845 850 855 860

ATC CCT AAA AAA GCC TTT GAC TAT GTC ATT AAT GGC AAG AGC GCG ATT 2643
Ile Pro Lys Lys Ala Phe Asp Tyr Val Ile Asn Gly Lys Ser Ala Ile
865 870 875

GAC TGG GTG ATC GAA CGC TAT CAA AAA ACT ATG GAT AAA GAA AGC CTG 2691
Asp Trp Val Ile Glu Arg Tyr Gln Lys Thr Met Asp Lys Glu Ser Leu
880 885 890

ATT GAA AAC AAC CCG AAC GAT TAC GCC GGC GGA AAA TAC GTT TTT GAA 2739
Ile Glu Asn Asn Pro Asn Asp Tyr Ala Gly Gly Lys Tyr Val Phe Glu
895 900 905

CTC CTT TGT AGG GTC ATC ACA CTT TCG GTA AAA AGC GTG GAT TTG ATA 2787
Leu Leu Cys Arg Val Ile Thr Leu Ser Val Lys Ser Val Asp Leu Ile
910 915 920

GAA AAG ATC AGC GAA AAG AGG TTT GAG TGATTACATC GCTTGGGGGT GTGGAAT 2841
Glu Lys Ile Ser Glu Lys Arg Phe Glu
925 930

ATTTTGAAAG GCAATGTCTT GCTTTCTTAA AAAATCCAC 2880

(2) INFORMATION FOR SEQ ID NO: 798:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 933 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798:

Met Thr Gln Ala Trp Leu Met Lys Pro Leu Leu Lys Lys Lys Ser Lys
GAC ATC AAT TCT CTT GGC GTG GTG AGC GGT CGT GAT CCT TGG GTG TAT 1731
Asp Ile Asn Ser Leu Gly Val Val Ser Gly Arg Asp Pro Trp Val Tyr
560 565 570

AAC TTT TCT CCA AAC ATT TTA ACC CAA TCG GTG CAA AAA TGC ATT GAC 1779
Asn Phe Ser Pro Asn Ile Leu Thr Gln Ser Val Gln Lys Cys Ile Asp
575 580 585

ACT TAT AAC GCT GAT TTG AAG CGC TTC AAT GCG CGT TTT AGG GAA GCT 1827
Thr Tyr Asn Ala Asp Leu Lys Arg Phe Asn Ala Arg Phe Arg Glu Ala
590 595 600

TTC AAA CAA CGC GCT CAA AGC GTC AAA GCA GGC GAT CTT TAC AAA CAA 1875
Phe Lys Gln Arg Ala Gln Ser Val Lys Ala Gly Asp Leu Tyr Lys Gln
605 610 615 620

CTT AAT GAT AAA GAA ATC ACC ACC GAT AAA ACG AAA ATC GCT TGG ACT 1923
Leu Asn Asp Lys Glu Ile Thr Thr Asp Lys Thr Lys Ile Ala Trp Thr
625 630 635

GAT GGT TTG AAA AAC AAA CTC ATT AAA AAT AAA TCT GCA AGA GAA AGC 1971
Asp Gly Leu Lys Asn Lys Leu Ile Lys Asn Lys Ser Ala Arg Glu Ser
640 645 650

AGT GAG GAG CGT GTA AGG TTG GCC TTG TAT CGC CCT TTT AAC AAA CAA 2019
Ser Glu Glu Arg Val Arg Leu Ala Leu Tyr Arg Pro Phe Asn Lys Gln
655 660 665

TGG CTT TAT TGG GAT AAG GAT TGG ATA AAC AGG CAA AGA GAA TTT TCA 2067
Trp Leu Tyr Trp Asp Lys Asp Trp Ile Asn Arg Gln Arg Glu Phe Ser
670 675 680

AAA ATT TTC CCG GAT AAA GAC GCT CAG AAT GTG GTG ATT AAT ACC GGT 2115
Lys Ile Phe Pro Asp Lys Asp Ala Gln Asn Val Val Ile Asn Thr Gly
685 690 695 700

GTG GGA AAT GGT AAA GAT TTT AGC GCT TTG GTA AGC GAT TTT ATT TCT 2163
Val Gly Asn Gly Lys Asp Phe Ser Ala Leu Val Ser Asp Phe Ile Ser
705 710 715

GAT TAT AGT TTG ATC TCA CCC AAT CAA GCT TAC CCC TTG TAT TAT TAC 2211
Asp Tyr Ser Leu Ile Ser Pro Asn Gln Ala Tyr Pro Leu Tyr Tyr Tyr
720 725 730

GAT GAT TTG GGG AAT CGC CAT TAC GCC ATC AGC GGC TAT TGC TTA AAC 2259
Asp Asp Leu Gly Asn Arg His Tyr Ala Ile Ser Gly Tyr Cys Leu Asn
735 740 745

CTC TTC AGG AGG CAT TAT GGG GAT AAT CTG ATC GCT GAA GAA GAG ATT 2307
Leu Phe Arg Arg His Tyr Gly Asp Asn Leu Ile Ala Glu Glu Glu Ile
750 755 760

TTT TAT TAC ATT TAT GCG ATT TTC CAC CAT AAA GGC TAT TTA GAA AAA 2355
Phe Tyr Tyr Ile Tyr Ala Ile Phe His His Lys Gly Tyr Leu Glu Lys
765 770 775 780
AAA GGC ATC AAA GAC ACT CTA GCC AAC CAA AAT ATT AGA GTC ATC ATC 1059
Lys Gly Ile Lys Asp Thr Leu Ala Asn Gln Asn Ile Arg Val Ile Ile
335 340 345

GGC AAC CCG CCT TAT TCA GCC GGC GCA AAG AGC CAA AAC GAT AAC AAC 1107
Gly Asn Pro Pro Tyr Ser Ala Gly Ala Lys Ser Gln Asn Asp Asn Asn
350 355 360

CAA AAC CTC TCA CAC CCA AAG CTT GAA AAA TTA GTT TAT GAA AAA TAC 1155
Gln Asn Leu Ser His Pro Lys Leu Glu Lys Leu Val Tyr Glu Lys Tyr
365 370 375 380

GGA AAA AAT TCC ACA TCT AGA AGT GTG GGA AAA ACC ACA CGA GAC ACG 1203
Gly Lys Asn Ser Thr Ser Arg Ser Val Gly Lys Thr Thr Arg Asp Thr
385 390 395

CTC ATT CAA AGC ATC CGC ATG GCG AGC GAT GTT GTT AAA GAT AGG GGG 1251
Leu Ile Gln Ser Ile Arg Met Ala Ser Asp Val Val Lys Asp Arg Gly
400 405 410

GTG ATA GGC TTT GTG GTG AAC GGG GGT TTT ATT GAC TCT AAA AGC GCG 1299
Val Ile Gly Phe Val Val Asn Gly Gly Phe Ile Asp Ser Lys Ser Ala
415 420 425

GAT GGG TTC AGA AAA TGC GTG GCC AAA GAA TTT TCG CAT CTT TAT GTA 1347
Asp Gly Phe Arg Lys Cys Val Ala Lys Glu Phe Ser His Leu Tyr Val
430 435 440

TTG AAT TTG AGA GGC AAT CAG CGC ACT TCT GGG GAA GTG TCA AAA AAA 1395
Leu Asn Leu Arg Gly Asn Gln Arg Thr Ser Gly Glu Val Ser Lys Lys
445 450 455 460

GAG GGA GGG AAA ATC TTT GAT AGC GGA TCG AGG GCG ACG GTA GCG ATT 1443
Glu Gly Gly Lys Ile Phe Asp Ser Gly Ser Arg Ala Thr Val Ala Ile
465 470 475

ATC TTT TTT GTG AAA GAT AAG AGC ACT CCT GAT AAT ACG ATT TTT TAT 1491
Ile Phe Phe Val Lys Asp Lys Ser Thr Pro Asp Asn Thr Ile Phe Tyr
480 485 490

TAT GAA GTG GAA GAT TAC TTG AAA AGA GAA GCC AAA CTC AAC TGG CTC 1539
Tyr Glu Val Glu Asp Tyr Leu Lys Arg Glu Ala Lys Leu Asn Trp Leu
495 500 505

GCC AAT TTT GAA AAT TTG GAT TTT GTG CCT TTT GAG AAA ATC ACC CCG 1587
Ala Asn Phe Glu Asn Leu Asp Phe Val Pro Phe Glu Lys Ile Thr Pro
510 515 520

AAT GAT AAA GGC GAT TGG ATC AAC CAA AGG AAT GAC GCT TTT GAA AAA 1635
Asn Asp Lys Gly Asp Trp Ile Asn Gln Arg Asn Asp Ala Phe Glu Lys
525 530 535 540

CTC ATC CCT TTA AAA AGA GAC AAA ACA CTC CAA AAC GAC AGC GTT TTT 1683
Leu Ile Pro Leu Lys Arg Asp Lys Thr Leu Gln Asn Asp Ser Val Phe
545 550 555
Leu Asp Met Ile Thr Ser His Ile Ile Thr Lys Pro Ile Phe Asp Ala
110 115 120

CTT TTT GGG GAC AAC ATC AAA AAC CCT ATC GCT AAA GCC TTG GAT AAA 435
Leu Phe Gly Asp Asn Ile Lys Asn Pro Ile Ala Lys Ala Leu Asp Lys
125 130 135 140

ATG GTA GAA AAA CTC TCC ACT TTA GGA TTA GAA GGA GAA ACT AAA GAT 483
Met Val Glu Lys Leu Ser Thr Leu Gly Leu Glu Gly Glu Thr Lys Asp
145 150 155

CTG AAA AAC CTC TAT GAA AGC GTG AAA ACC GAA GCC TTG CAC GCC AAA 531
Leu Lys Asn Leu Tyr Glu Ser Val Lys Thr Glu Ala Leu His Ala Lys
160 165 170

AGC CAA AAA AGC CAA CAA GAA CTC ATT AAA AAC CTC TAC AAC ACT TTC 579
Ser Gln Lys Ser Gln Gln Glu Leu Ile Lys Asn Leu Tyr Asn Thr Phe
175 180 185

TTT AAA GAA GCC TTT AAA AAG CAA AGC GAA AAA CTA GGG ATC GTT TAT 627
Phe Lys Glu Ala Phe Lys Lys Gln Ser Glu Lys Leu Gly Ile Val Tyr
190 195 200

ACG CCC ATA GAG GTG GTG GAT TTC ATT TTA AGA GCC ACT AAC GGC ATT 675
Thr Pro Ile Glu Val Val Asp Phe Ile Leu Arg Ala Thr Asn Gly Ile
205 210 215 220

TTG AAA AAG CAT TTC AAC ACG GAT TTT AAC GAT CAA AGC ATC ACG ATT 723
Leu Lys Lys His Phe Asn Thr Asp Phe Asn Asp Gln Ser Ile Thr Ile
225 230 235

TTT GAC CCA TTC ACC GGC ACC GGG AGT TTT ATC GCT CGT TTG CTT TCT 771
Phe Asp Pro Phe Thr Gly Thr Gly Ser Phe Ile Ala Arg Leu Leu Ser
240 245 250

AAA GAA AAC GCG CTC ATT AGC GAT GAA GCC TTA AAA GAG AAG TTT CAA 819
Lys Glu Asn Ala Leu Ile Ser Asp Glu Ala Leu Lys Glu Lys Phe Gln
255 260 265

AAA AAT TTG TTC GCT TTT GAC ATC GTG CTT TTG TCT TAT TAT ATC GCT 867
Lys Asn Leu Phe Ala Phe Asp Ile Val Leu Leu Ser Tyr Tyr Ile Ala
270 275 280

TTA ATC AAT ATC ACC CAA GCC GCG CAA AAT AGG GAT GGC TCG TTA AAC 915
Leu Ile Asn Ile Thr Gln Ala Ala Gln Asn Arg Asp Gly Ser Leu Asn
285 290 295 300

AAT TTC AAA AAC ATC GCG CTC ACG GAC AGC CTG GAT TAT TTA GAA GAA 963
Asn Phe Lys Asn Ile Ala Leu Thr Asp Ser Leu Asp Tyr Leu Glu Glu
305 310 315

AAA ACC AAT AAA GGG GTG CTC CCT TTA TAT GAG GAT TTG AAA GAA AAC 1011
Lys Thr Asn Lys Gly Val Leu Pro Leu Tyr Glu Asp Leu Lys Glu Asn
320 325 330
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lie Trp Lys Val lie Lys Ala Leu Arg Ser His Asp Pro Ser Leu Val 

515 520 525 

Asp Glu Ala Thr Phe Lys Glu Lys Ile Lys Ile Phe Gly Ser Asp Asp

530 535 540

Gly Asn Gln Ser Gln Arg
545 550 

(2) INFORMATION FOR SEQ ID NO: 797:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2880 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...2814
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797:

AAGCCTTAAG AAGCC ATG ACC CAA GCC TGG TTG ATG AAG CCA CTT TTA AAG 51
Met Thr Gln Ala Trp Leu Met Lys Pro Leu Leu Lys
1 5 10

AAA AAA TCA AAA TCT TTG GAA GCG ATG ATG GCA ACC AAT CAC AAC GAT 99
Lys Lys Ser Lys Ser Leu Glu Ala Met Met Ala Thr Asn His Asn Asp
15 20 25

GAA AAA ACC CTT TTT GAC GCT ATC TTA CTG CAA GAT CTA GCG GAC GCT 147
Glu Lys Thr Leu Phe Asp Ala Ile Leu Leu Gln Asp Leu Ala Asp Ala
30 35 40

ATG TAT AAT GTC ATG CCC ACT AAA TTA GGG GAC AGG AAT TAT TGG GAA 195
Met Tyr Asn Val Met Pro Thr Lys Leu Gly Asp Arg Asn Tyr Trp Glu
45 50 55 60

AAT TTC ACT AAA AAA ACG GGC AAC ATC GCA AGG ACC TTG AAC AAC CGC 243
Asn Phe Thr Lys Lys Thr Gly Asn Ile Ala Arg Thr Leu Asn Asn Arg
65 70 75

CTA AAA ATT ATT TTT GAC AAA AAC CCT GAA TTT TTC CAC GGC TTT TTG 291
Leu Lys Ile Ile Phe Asp Lys Asn Pro Glu Phe Phe His Gly Phe Leu
80 85 90

GAT TCC TTA AGG GAA AAT ATC CAT CAA AAC ATT AAA GAA GAT GAA GCC 339
Asp Ser Leu Arg Glu Asn Ile His Gln Asn Ile Lys Glu Asp Glu Ala
95 100 105

TTA GAC ATG ATC ACT TCT CAC ATC ATC ACT AAG CCC ATT TTT GAT GCA 387



WO 98/43478 PCT/US98/06371



65 70 75 80



Met Glu Ala Leu Asp Ser Lys Ile Thr Leu Phe Leu Ala Pro Ser Ile
85 90 95

Ala Leu Leu Ser Gln Thr Phe Arg Glu Tyr Ala Gln Glu Lys Ser Glu
100 105 110

Pro Phe Tyr Ala Ser Ile Val Cys Ser Asp Asp Lys Val Gly Lys Ser
115 120 125

Lys Asp Glu Asp Asn Asp Asp Ile Lys Phe Ser Glu Leu Pro Leu Lys
130 135 140

Pro Ser Thr Arg Leu Glu Asp Ile Leu Ser Val Arg Lys Lys Ala Gln
145 150 155 160

Lys Glu Asn Lys Arg Phe Ile Ile Phe Ser Thr Tyr Gln Ser Ala Leu
165 170 175

Arg Ile Lys Glu Ala Gln Glu Ala Gly Leu Gly Gly Ile Asp Leu Ile
180 185 190

Ile Cys Asp Glu Ala His Arg Thr Val Gly Ala Met Tyr Ser Ser Asn
195 200 205

Glu Arg Asp Asp Lys Asn Ala Phe Thr Leu Cys His Ser Asp Lys Asn
210 215 220

Ile Lys Ala Lys Lys Arg Leu Tyr Met Thr Ala Thr Pro Lys Val Tyr
225 230 235 240

Ser Glu Ser Ser Lys Ala Lys Ala Lys Glu Ser Asp Asn Val Ile Tyr
245 250 255

Ser Met Asp Asp Ala Glu Ile Phe Gly Glu Glu Ile Tyr Thr Leu Asn
260 265 270

Phe Ser Lys Ala Ile Ala Leu Asp Leu Leu Thr Asp Tyr Lys Val Ile
275 280 285

Ile Leu Ala Val Arg Lys Glu Asn Leu Ser Gly Val Thr Asn Ser Val
290 295 300

Asn Lys Lys Ile Ser Gln Leu Lys Ala Glu Gly Thr Lys Leu Asp Lys
305 310 315 320

Lys Leu Ile Asn Asn Glu Phe Val Cys Lys Ile Ile Gly Thr His Lys
325 330 335

Gly Leu Ala Lys Gln Asp Leu Ile Val Leu Asn Glu Lys Asn Lys Glu
340 345 350

Asp His Asn Leu Gln Asn Gln Tyr Asp Thr Ala Pro Ser Gln Arg Ala
355 360 365

Ile Asn Phe Cys Lys Ser Ile Asn Thr Ser Lys Asn Ile Lys Asp Ser
370 375 380

Phe Glu Thr Ile Met Glu Cys Tyr Asp Glu Glu Leu Lys Lys Lys Ser
385 390 395 400

Phe Lys Asn Leu Lys Ile Ser Ile Asp His Ile Asp Gly Thr Met Asn
405 410 415

Cys Lys Asp Arg Leu Glu Lys Leu Glu Glu Leu Asn Gln Phe Glu Pro
420 425 430

Asn Thr Cys Lys Val Leu Ser Asn Ala Arg Cys Leu Ser Glu Gly Val
435 440 445

Asp Val Pro Ala Leu Asp Ser Ile Val Phe Phe Asp Gly Lys Ser Ala
450 455 460

Met Val Asp Ile Ile Gln Ala Val Gly Arg Val Met Arg Lys Ala Lys
465 470 475 480

Arg Lys Lys Arg Gly Tyr Ile Ile Leu Pro Ile Ala Leu Glu Glu Ser
485 490 495

Glu Ile Gln Asn Leu Asp Glu Ala Val Asn Asn Thr Asn Phe Lys Asn
500 505 510
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GAG CCC AAC ACT TGC AAG GTT TTA AGC AAC GCC AGG TGT TTG AGC GAA 14 06 

Glu Pro Asn Thr Cys Lys Val Leu Ser Asn Ala Arg Cys Leu Ser Glu 
435 440 445 

GGG GTG GAT GTC CCA GCG TTA GAT AGC ATC GTC TTT TTT GAT GGC AAA 1454
Gly Val Asp Val Pro Ala Leu Asp Ser Ile Val Phe Phe Asp Gly Lys
450 455 460

AGC GCT ATG GTG GAT ATT ATC CAA GCG GTG GGT AGG GTG ATG CGA AAA 1502
Ser Ala Met Val Asp Ile Ile Gln Ala Val Gly Arg Val Met Arg Lys
465 470 475

GCC AAA CGC AAG AAA AGA GGC TAT ATC ATT TTG CCT ATC GCT TTA GAA 1550
Ala Lys Arg Lys Lys Arg Gly Tyr Ile Ile Leu Pro Ile Ala Leu Glu
480 485 490

GAG AGT GAA ATC CAA AAC CTG GAT GAA GCC GTC AAT AAC ACC AAT TTC 1598
Glu Ser Glu Ile Gln Asn Leu Asp Glu Ala Val Asn Asn Thr Asn Phe
495 500 505 510

AAA AAC ATT TGG AAA GTG ATA AAA GCC TTA AGA AGC CAT GAC CCA AGC 1646
Lys Asn Ile Trp Lys Val Ile Lys Ala Leu Arg Ser His Asp Pro Ser
515 520 525

CTG GTT GAT GAA GCC ACT TTT AAA GAA AAA ATC AAA ATC TTT GGA AGC 1694
Leu Val Asp Glu Ala Thr Phe Lys Glu Lys Ile Lys Ile Phe Gly Ser
530 535 540

GAT GAT GGC AAC CAA TCA CAA CGA TGAAAAAACC CTTTTTGACG CTATCTTACT 1748
Asp Asp Gly Asn Gln Ser Gln Arg
545 550

GCAAGATCTA GCGGACGCTA TGTATAATGT CATGCCCACT AAATTAGGGG AC 1800

(2) INFORMATION FOR SEQ ID NO: 796:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 550 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796:

Met Gly Ile Asp Ile Asp Glu Ile Thr Glu Glu Asp Phe Ile Tyr Ser
1 5 10 15

Arg Ile Asp Trp Glu Lys Phe Asp Pro Thr Lys Thr Gln Asp Glu Ile
20 25 30

Pro Leu Cys Asp Lys Lys Lys Pro Arg Ser His Gln Thr Glu Ala Ile
35 40 45

Asn Ala Thr Lys Glu Tyr Phe Ser Asp Pro Lys Asn Ala Arg Gly Lys
50 55 60

Leu Ile Met Ala Cys Gly Thr Gly Lys Thr Tyr Thr Ser Leu Lys Ile
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AGT AAT GAA AGG GAC GAT AAA AAC GCT TTC ACG CTT TGC CAT AGC GAT 7 34 

Ser Asn Glu Arg Asp Asp Lys Asn Ala Phe Thr Leu Cys His Ser Asp 
210 215 220 

AAA AAT ATC AAA GCG AAA AAA CGC CTG TAT ATG ACC GCC ACG CCT AAA 782
Lys Asn Ile Lys Ala Lys Lys Arg Leu Tyr Met Thr Ala Thr Pro Lys
225 230 235

GTT TAT AGC GAA AGC TCC AAA GCT AAA GCC AAA GAG AGC GAT AAT GTT 830
Val Tyr Ser Glu Ser Ser Lys Ala Lys Ala Lys Glu Ser Asp Asn Val
240 245 250

ATC TAT TCT ATG GAC GAT GCA GAG ATT TTT GGC GAA GAA ATC TAT ACG 878
Ile Tyr Ser Met Asp Asp Ala Glu Ile Phe Gly Glu Glu Ile Tyr Thr
255 260 265 270

CTC AAT TTT TCA AAA GCG ATC GCT TTG GAT CTC TTA ACC GAT TAT AAA 926
Leu Asn Phe Ser Lys Ala Ile Ala Leu Asp Leu Leu Thr Asp Tyr Lys
275 280 285

GTC ATC ATT TTA GCG GTG CGA AAA GAA AAT TTA AGC GGC GTT ACT AAC 974
Val Ile Ile Leu Ala Val Arg Lys Glu Asn Leu Ser Gly Val Thr Asn
290 295 300

AGC GTG AAT AAA AAG ATC AGC CAG CTC AAA GCC GAA GGC ACT AAA TTA 1022
Ser Val Asn Lys Lys Ile Ser Gln Leu Lys Ala Glu Gly Thr Lys Leu
305 310 315

GAT AAA AAG CTC ATC AAT AAC GAA TTT GTT TGT AAG ATC ATC GGC ACT 1070
Asp Lys Lys Leu Ile Asn Asn Glu Phe Val Cys Lys Ile Ile Gly Thr
320 325 330

CAT AAA GGG TTA GCC AAG CAG GAT TTA ATC GTT TTA AAC GAG AAA AAC 1118
His Lys Gly Leu Ala Lys Gln Asp Leu Ile Val Leu Asn Glu Lys Asn
335 340 345 350

AAA GAA GAT CAC AAC TTG CAA AAC CAA TAC GAC ACC GCT CCC TCT CAA 1166
Lys Glu Asp His Asn Leu Gln Asn Gln Tyr Asp Thr Ala Pro Ser Gln
355 360 365

AGA GCC ATA AAC TTT TGT AAA AGC ATT AAC ACG AGC AAG AAC ATT AAA 1214
Arg Ala Ile Asn Phe Cys Lys Ser Ile Asn Thr Ser Lys Asn Ile Lys
370 375 380

GAC TCC TTT GAA ACG ATT ATG GAA TGC TAT GAT GAA GAG TTG AAG AAA 1262
Asp Ser Phe Glu Thr Ile Met Glu Cys Tyr Asp Glu Glu Leu Lys Lys
385 390 395

AAG AGT TTT AAA AAC CTA AAA ATC AGC ATC GAT CAC ATT GAT GGC ACC 1310
Lys Ser Phe Lys Asn Leu Lys Ile Ser Ile Asp His Ile Asp Gly Thr
400 405 410

ATG AAT TGT AAG GAT AGG CTT GAA AAA TTA GAA GAG CTC AAT CAA TTT 1358
Met Asn Cys Lys Asp Arg Leu Glu Lys Leu Glu Glu Leu Asn Gln Phe
415 420 425 430
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:795: 

TCATCATCTC CACTTCTAAT TTAACCTCTA ACGCCCTTAA CGCAATTGAG CAAATCAGAA 60

GCACAGGA ATG GGG ATT GAC ATT GAT GAA ATC ACT GAA GAG GAT TTT ATC 110
Met Gly Ile Asp Ile Asp Glu Ile Thr Glu Glu Asp Phe Ile
1 5 10

TAT TCT CGC ATT GAT TGG GAA AAG TTT GAT CCC ACA AAA ACG CAA GAC 158
Tyr Ser Arg Ile Asp Trp Glu Lys Phe Asp Pro Thr Lys Thr Gln Asp
15 20 25 30

GAA ATC CCC TTA TGC GAT AAG AAA AAG CCG CGC TCG CAT CAA ACA GAA 206
Glu Ile Pro Leu Cys Asp Lys Lys Lys Pro Arg Ser His Gln Thr Glu
35 40 45

GCC ATA AAC GCC ACT AAA GAG TAT TTT TCT GAC CCT AAA AAC GCT AGA 254
Ala Ile Asn Ala Thr Lys Glu Tyr Phe Ser Asp Pro Lys Asn Ala Arg
50 55 60

GGC AAG CTC ATT ATG GCA TGC GGG ACA GGC AAA ACC TAC ACT TCT TTA 302
Gly Lys Leu Ile Met Ala Cys Gly Thr Gly Lys Thr Tyr Thr Ser Leu
65 70 75

AAA ATC ATG GAA GCT TTA GAC TCT AAG ATC ACG CTT TTT CTA GCA CCC 350
Lys Ile Met Glu Ala Leu Asp Ser Lys Ile Thr Leu Phe Leu Ala Pro
80 85 90

AGC ATC GCT TTG CTT TCT CAA ACT TTT AGA GAA TAC GCG CAA GAA AAA 398
Ser Ile Ala Leu Leu Ser Gln Thr Phe Arg Glu Tyr Ala Gln Glu Lys
95 100 105 110

AGT GAG CCG TTT TAC GCT TCT ATC GTG TGC AGC GAT GAT AAA GTC GGG 446
Ser Glu Pro Phe Tyr Ala Ser Ile Val Cys Ser Asp Asp Lys Val Gly
115 120 125

AAA AGT AAA GAC GAA GAC AAT GAT GAT ATT AAA TTT TCT GAG CTC CCT 494
Lys Ser Lys Asp Glu Asp Asn Asp Asp Ile Lys Phe Ser Glu Leu Pro
130 135 140

TTA AAG CCC TCC ACT CGC CTT GAA GAC ATT TTA AGC GTT CGA AAA AAA 542
Leu Lys Pro Ser Thr Arg Leu Glu Asp Ile Leu Ser Val Arg Lys Lys
145 150 155

GCG CAA AAA GAA AAC AAG CGC TTC ATT ATT TTT TCA ACC TAT CAA AGC 590
Ala Gln Lys Glu Asn Lys Arg Phe Ile Ile Phe Ser Thr Tyr Gln Ser
160 165 170

GCG TTG CGT ATT AAA GAA GCG CAA GAA GCG GGT TTG GGC GGA ATC GAT 638
Ala Leu Arg Ile Lys Glu Ala Gln Glu Ala Gly Leu Gly Gly Ile Asp
175 180 185 190

CTT ATT ATT TGC GAT GAA GCC CAC AGA ACG GTA GGG GCT ATG TAT TCT 686
Leu Ile Ile Cys Asp Glu Ala His Arg Thr Val Gly Ala Met Tyr Ser
195 200 205
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TTG GTG TCT ATG ATT TCT TTA GCC CTA AGA AGC CCC ACT TTT AAA GAA 2 45 

Leu Val Ser Met Ile Ser Leu Ala Leu Arg Ser Pro Thr Phe Lys Glu
60 65 70

AAT CCC GCC TTT TGT GTT TGC ATC GTA ATC CCC CCA AAA CCG CAA CAA 293
Asn Pro Ala Phe Cys Val Cys Ile Val Ile Pro Pro Lys Pro Gln Gln
75 80 85

TTG TCC GGC ATT TCT TTA ATT TCA TAATGCAAAT TGAGCAACCT TTTGCATTCT 347
Leu Ser Gly Ile Ser Leu Ile Ser
90 95

TAG 350

(2) INFORMATION FOR SEQ ID NO: 794:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794:

Met Leu Glu Asn Gly Val Lys Lys Ala Phe Tyr Ser Ala Leu Phe Lys
1 5 10 15

Ala Leu Ala Met Ser Ser Lys Gly Cys Lys Asn Gly Gly Val Leu Gly
20 25 30

Ser Ser Lys Asp Phe Lys Ala Leu Phe Asn Cys Ile Trp His Ala Pro
35 40 45

His Ser Ala Leu Lys Ile Ala Ala Leu Val Ser Met Ile Ser Leu Ala
50 55 60

Leu Arg Ser Pro Thr Phe Lys Glu Asn Pro Ala Phe Cys Val Cys Ile
65 70 75 80

Val Ile Pro Pro Lys Pro Gln Gln Leu Ser Gly Ile Ser Leu Ile Ser
85 90 95

(2) INFORMATION FOR SEQ ID NO: 795:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1800 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 69...1718
(D) OTHER INFORMATION:
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Lys Asp Lys Asp Leu Tyr Val Lys Arg Leu Glu Lys lie Thr Pro Lys 

20 25 30 

Ile Tyr Leu Ala Ser Val Phe Leu Glu Lys His Thr Pro Leu Lys Ser
35 40 45

Leu Leu Glu Lys Ile Pro Lys Gly Lys Lys Glu Thr Ile Thr Tyr His
50 55 60

Asn Pro Cys His Ala Lys Lys Thr Leu Asn Ala His Lys Glu Val Arg
65 70 75 80

Asn Leu Leu Asn Leu His Tyr Glu Ile Lys Glu Met Pro Asp Asn Cys
85 90 95

Cys Gly Phe Gly Gly Ile Thr Met Gln Thr Gln Lys Ala Gly Phe Ser
100 105 110

Leu Lys Val Gly Leu Leu Arg Ala Lys Glu Ile Ile Asp Thr Lys Ala
115 120 125

Ala Ile Leu Ser Ala Glu Cys Gly Ala Cys His Met Gln Leu Asn Asn
130 135 140

Ala Leu Lys Ser Leu Asp Asp Pro Asn Thr Pro Pro Phe Leu His Pro
145 150 155 160

Leu Glu Leu Ile Ala Lys Ala Leu Lys Ser Ala Glu
165 170

(2) INFORMATION FOR SEQ ID NO: 793:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 350 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 30...317
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793:

GGCGTTAAAG CTCTGTATTA TATAAAAAG ATG TTG GAG AAT GGG GTT AAA AAG 53
Met Leu Glu Asn Gly Val Lys Lys
1 5

GCT TTT TAT TCA GCG CTT TTT AAG GCT TTA GCG ATG AGT TCT AAA GGG 101
Ala Phe Tyr Ser Ala Leu Phe Lys Ala Leu Ala Met Ser Ser Lys Gly
10 15 20

TGC AAA AAT GGC GGA GTG TTA GGG TCG TCT AAA GAC TTT AAA GCG TTG 149
Cys Lys Asn Gly Gly Val Leu Gly Ser Ser Lys Asp Phe Lys Ala Leu
25 30 35 40

TTT AAT TGC ATA TGG CAT GCC CCG CAT TCA GCG CTC AAA ATT GCA GCT 197
Phe Asn Cys Ile Trp His Ala Pro His Ser Ala Leu Lys Ile Ala Ala
45 50 55
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AAA ATC ACG CCT AAA ATC TAT CTG GCG AGC GTG TTT TTA GAG AAA CAC 14 8 

Lys Ile Thr Pro Lys Ile Tyr Leu Ala Ser Val Phe Leu Glu Lys His
30 35 40

ACC CCT TTA AAA AGT CTT TTA GAA AAA ATC CCT AAG GGA AAA AAA GAG 196
Thr Pro Leu Lys Ser Leu Leu Glu Lys Ile Pro Lys Gly Lys Lys Glu
45 50 55

ACT ATC ACC TAT CAT AAC CCT TGT CAT GCC AAA AAA ACC CTA AAC GCT 244
Thr Ile Thr Tyr His Asn Pro Cys His Ala Lys Lys Thr Leu Asn Ala
60 65 70 75

CAC AAA GAA GTG CGC AAC TTG CTC AAT TTG CAT TAT GAA ATT AAA GAA 292
His Lys Glu Val Arg Asn Leu Leu Asn Leu His Tyr Glu Ile Lys Glu
80 85 90

ATG CCG GAC AAT TGT TGC GGT TTT GGG GGG ATT ACG ATG CAA ACA CAA 340
Met Pro Asp Asn Cys Cys Gly Phe Gly Gly Ile Thr Met Gln Thr Gln
95 100 105

AAG GCG GGA TTT TCT TTA AAA GTG GGG CTT CTT AGG GCT AAA GAA ATC 388
Lys Ala Gly Phe Ser Leu Lys Val Gly Leu Leu Arg Ala Lys Glu Ile
110 115 120

ATA GAC ACC AAA GCT GCA ATT TTG AGC GCT GAA TGC GGG GCA TGC CAT 436
Ile Asp Thr Lys Ala Ala Ile Leu Ser Ala Glu Cys Gly Ala Cys His
125 130 135

ATG CAA TTA AAC AAC GCT TTA AAG TCT TTA GAC GAC CCT AAC ACT CCG 484
Met Gln Leu Asn Asn Ala Leu Lys Ser Leu Asp Asp Pro Asn Thr Pro
140 145 150 155

CCA TTT TTG CAC CCT TTA GAA CTC ATC GCT AAA GCC TTA AAA AGC GCT 532
Pro Phe Leu His Pro Leu Glu Leu Ile Ala Lys Ala Leu Lys Ser Ala
160 165 170

GAA TAAAAAGCCT TTTTAACCCC ATTCTCCAAC ATCTTTTTAT ATAATACAGA GCT 588
Glu



(2) INFORMATION FOR SEQ ID NO: 792:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 172 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792:

Met Arg Tyr Met Leu Ile Asn Asp Tyr Tyr Lys Val Phe Leu Gly Glu
1 5 10 15
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165 170 175 

Asn Tyr Asp Phe Asn His Ala Leu Lys Ser Glu Ile Phe Ala Leu Asp
180 185 190

Glu Glu Gln Glu Glu Ala Gln Lys Thr Ala Ile Gln Asn Ala Ile Lys
195 200 205

Asn His Asp Ser Ile Gly Gly Val Ala Leu Ile Arg Ala Arg Ser Ile
210 215 220

Lys Thr Asn Gln Lys Leu Pro Ile Gly Leu Gly Gln Gly Leu Tyr Ala
225 230 235 240

Lys Leu Asp Ala Lys Ile Ala Glu Ala Met Met Gly Leu Asn Gly Val
245 250 255

Lys Ala Val Glu Ile Gly Lys Gly Val Glu Ser Ser Leu Leu Lys Gly
260 265 270

Ser Glu Tyr Asn Asp Leu Met Asp Gln Lys Gly Phe Leu Ser Asn Arg
275 280 285

Ser Gly Gly Val Leu Gly Gly Met Ser Asn Gly Glu Glu Ile Ile Val
290 295 300

Arg Val His Phe Lys Pro Thr Pro Ser Ile Phe Gln Pro Gln Arg Thr
305 310 315 320

Ile Asp Ile Asn Gly Asn Glu Cys Glu Cys Leu Leu Lys Gly Arg His
325 330 335

Asp Pro Cys Ile Ala Ile Arg Gly Ser Val Val Cys Glu Ser Leu Leu
340 345 350

Ala Leu Val Leu Ala Asp Met Val Leu Leu Asn Leu Thr Ser Lys Ile
355 360 365

Glu Tyr Leu Lys Thr Ile Tyr Asn Glu Asn
370 375

(2) INFORMATION FOR SEQ ID NO: 791:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 588 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 20...535
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791:

TCATTGTGCC TGAAGCCAC ATG CGC TAC ATG CTC ATC AAC GAT TAT TAC AAG 52
Met Arg Tyr Met Leu Ile Asn Asp Tyr Tyr Lys
1 5 10

GTG TTT TTG GGC GAA AAA GAT AAG GAT TTG TAT GTG AAG CGC TTG GAA 100
Val Phe Leu Gly Glu Lys Asp Lys Asp Leu Tyr Val Lys Arg Leu Glu
15 20 25
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TTC AAA CCC ACG CCA AGC ATT TTC CAA OCT CAA CGA ACC ATA GAC ATT 1016 
Phe Lys Pro Thr Pro Ser Ile Phe Gln Pro Gln Arg Thr Ile Asp Ile
310 315 320

AAT GGC AAT GAG TGC GAA TGC TTG TTA AAG GGC AGG CAT GAT CCT TGC 1064
Asn Gly Asn Glu Cys Glu Cys Leu Leu Lys Gly Arg His Asp Pro Cys
325 330 335

ATT GCG ATT AGA GGG AGC GTG GTG TGC GAG AGT TTG TTA GCG TTG GTG 1112
Ile Ala Ile Arg Gly Ser Val Val Cys Glu Ser Leu Leu Ala Leu Val
340 345 350 355

TTG GCT GAT ATG GTA TTA CTC AAT TTG ACT TCA AAA ATA GAG TAT TTA 1160
Leu Ala Asp Met Val Leu Leu Asn Leu Thr Ser Lys Ile Glu Tyr Leu
360 365 370

AAA ACG ATT TAT AAT GAG AAT TAAACGAAAT TGGATACAAT CAGCTTAAAA AGGA 1215
Lys Thr Ile Tyr Asn Glu Asn
375

TA 1217
(2) INFORMATION FOR SEQ ID NO: 790:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 378 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790:

Met Arg Leu Ser Ser Ala Ser Lys Thr Glu Gly Ser Gln Met Asn Thr
1 5 10 15

Leu Gly Arg Phe Leu Arg Leu Thr Thr Phe Gly Glu Ser His Gly Asp
20 25 30

Val Ile Gly Gly Val Leu Asp Gly Met Pro Ser Gly Ile Lys Ile Asp
35 40 45

Tyr Ala Leu Leu Glu Asn Glu Met Lys Arg Arg Gln Gly Gly Arg Asn
50 55 60

Val Phe Ile Thr Pro Arg Lys Glu Asp Asp Lys Val Glu Ile Thr Ser
65 70 75 80

Gly Val Phe Glu Asp Phe Ser Thr Gly Thr Pro Ile Gly Phe Leu Ile
85 90 95

His Asn Gln Arg Ala Arg Ser Lys Asp Tyr Asp Asn Ile Lys Asn Leu
100 105 110

Phe Arg Pro Ser His Ala Asp Phe Thr Tyr Phe His Lys Tyr Gly Ile
115 120 125

Arg Asp Phe Arg Gly Gly Gly Arg Ser Ser Ala Arg Glu Ser Ala Ile
130 135 140

Arg Val Ala Ala Gly Ala Phe Ala Lys Met Leu Leu Arg Glu Ile Gly
145 150 155 160

Ile Val Cys Glu Ser Gly Ile Ile Glu Ile Gly Gly Ile Lys Ala Lys
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GAA GAT TTT AGC ACA GOG ACT CCT ATA GGG TTT TTA ATC CAC AAC CAA 344 
Glu Asp Phe Ser Thr Gly Thr Pro Ile Gly Phe Leu Ile His Asn Gln
85 90 95

AGG GCT AGG AGC AAG GAT TAC GAT AAC ATT AAA AAC CTT TTT AGG CCT 392
Arg Ala Arg Ser Lys Asp Tyr Asp Asn Ile Lys Asn Leu Phe Arg Pro
100 105 110 115

AGC CAT GCG GAT TTC ACT TAT TTT CAT AAA TAC GGC ATT AGG GAT TTT 440
Ser His Ala Asp Phe Thr Tyr Phe His Lys Tyr Gly Ile Arg Asp Phe
120 125 130

AGG GGT GGG GGG AGG AGT TCG GCC AGA GAG AGT GCT ATA AGA GTG GCT 488
Arg Gly Gly Gly Arg Ser Ser Ala Arg Glu Ser Ala Ile Arg Val Ala
135 140 145

GCT GGG GCG TTT GCT AAA ATG CTT TTA AGA GAA ATC GGT ATT GTT TGT 536
Ala Gly Ala Phe Ala Lys Met Leu Leu Arg Glu Ile Gly Ile Val Cys
150 155 160

GAA AGC GGG ATT ATA GAA ATT GGG GGT ATT AAA GCC AAA AAT TAT GAT 584
Glu Ser Gly Ile Ile Glu Ile Gly Gly Ile Lys Ala Lys Asn Tyr Asp
165 170 175

TTT AAT CAC GCC TTA AAA AGC GAG ATT TTT GCC CTA GAT GAA GAA CAA 632
Phe Asn His Ala Leu Lys Ser Glu Ile Phe Ala Leu Asp Glu Glu Gln
180 185 190 195

GAA GAA GCG CAA AAA ACA GCC ATT CAA AAC GCT ATC AAA AAC CAC GAT 680
Glu Glu Ala Gln Lys Thr Ala Ile Gln Asn Ala Ile Lys Asn His Asp
200 205 210

AGC ATA GGG GGT GTG GCT TTG ATT AGA GCG AGG AGC ATA AAA ACC AAT 728
Ser Ile Gly Gly Val Ala Leu Ile Arg Ala Arg Ser Ile Lys Thr Asn
215 220 225

CAA AAG CTC CCC ATT GGC TTA GGT CAA GGG CTA TAC GCT AAA TTA GAC 776
Gln Lys Leu Pro Ile Gly Leu Gly Gln Gly Leu Tyr Ala Lys Leu Asp
230 235 240

GCT AAA ATC GCT GAA GCG ATG ATG GGG CTT AAT GGG GTG AAA GCG GTT 824
Ala Lys Ile Ala Glu Ala Met Met Gly Leu Asn Gly Val Lys Ala Val
245 250 255

GAA ATA GGC AAG GGG GTA GAA AGC TCT TTA TTA AAA GGC TCA GAG TAT 872
Glu Ile Gly Lys Gly Val Glu Ser Ser Leu Leu Lys Gly Ser Glu Tyr
260 265 270 275

AAT GAT TTA ATG GAT CAA AAG GGG TTT TTG AGC AAT CGT AGC GGA GGG 920
Asn Asp Leu Met Asp Gln Lys Gly Phe Leu Ser Asn Arg Ser Gly Gly
280 285 290

GTT TTA GGG GGC ATG AGC AAT GGG GAA GAA ATC ATT GTT AGA GTG CAT 968
Val Leu Gly Gly Met Ser Asn Gly Glu Glu Ile Ile Val Arg Val His
295 300 305
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Arg 


Ala Leu Asn Glu Ala Leu Lys Ile Leu Lys Arg Pro Cys Arg Ile
50 55 60

Thr Leu Tyr Ser Asp Ser Gln Tyr Val Cys Gln Ala Ile Asn Val Trp
65 70 75 80

Leu Ala Asn Trp Gln Lys Lys Asn Phe Ser Lys Val Lys Asn Val Asp
85 90 95

Leu Trp Lys Glu Phe Leu Glu Val Ser Lys Gly His Ser Ile Val Ala
100 105 110

Val Trp Ile Lys Gly His Asn Gly His Ala Glu Asn Glu Arg Cys Asp
115 120 125

Ser Leu Ala Lys Leu Glu Ala Gln Lys Arg Val Lys Thr Thr Thr
130 135 140





(2) INFORMATION FOR SEQ ID NO: 789:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1217 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 48...1181
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789:

ATGTATGCGA CCGCTAAAGG CAAAAGCAAA AAAGAAGCCG AACAGCA ATG CGC TTA 56
Met Arg Leu
1

TCA AGC GCT TCA AAA ACT GAA GGA AGC CAA ATG AAC ACT TTG GGG CGT 104
Ser Ser Ala Ser Lys Thr Glu Gly Ser Gln Met Asn Thr Leu Gly Arg
5 10 15

TTT TTA AGG CTC ACG ACT TTT GGG GAA TCG CAT GGG GAT GTG ATA GGG 152
Phe Leu Arg Leu Thr Thr Phe Gly Glu Ser His Gly Asp Val Ile Gly
20 25 30 35

GGG GTA TTA GAC GGC ATG CCT AGC GGG ATT AAA ATA GAC TAT GCG CTA 200
Gly Val Leu Asp Gly Met Pro Ser Gly Ile Lys Ile Asp Tyr Ala Leu
40 45 50

TTA GAA AAT GAA ATG AAG CGC CGC CAA GGG GGG AGG AAC GTT TTC ATT 248
Leu Glu Asn Glu Met Lys Arg Arg Gln Gly Gly Arg Asn Val Phe Ile
55 60 65

ACG CCA CGA AAA GAA GAC GAT AAA GTG GAA ATA ACA AGC GGG GTT TTT 296
Thr Pro Arg Lys Glu Asp Asp Lys Val Glu Ile Thr Ser Gly Val Phe
70 75 80
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TTA GGC AAT CCC GGG CCA GGC GGT TAT GCG GCG ATT TTA CGC TAT AAA 9 9 

Leu Gly Asn Pro Gly Pro Gly Gly Tyr Ala Ala Ile Leu Arg Tyr Lys
15 20 25

GAT AAA GAA AAA ACC ATC AGT GGG GGC GAA GAA TTC ACC ACG AAT AAC 147
Asp Lys Glu Lys Thr Ile Ser Gly Gly Glu Glu Phe Thr Thr Asn Asn
30 35 40

CGC ATG GAA TTA AGA GCG CTC AAT GAA GCG TTA AAA ATT TTG AAA CGC 195
Arg Met Glu Leu Arg Ala Leu Asn Glu Ala Leu Lys Ile Leu Lys Arg
45 50 55 60

CCA TGC CGT ATC ACG CTT TAT AGC GAT TCG CAA TAC GTG TGC CAA GCG 243
Pro Cys Arg Ile Thr Leu Tyr Ser Asp Ser Gln Tyr Val Cys Gln Ala
65 70 75

ATC AAT GTG TGG CTA GCT AAC TGG CAA AAA AAG AAT TTT TCT AAA GTT 291
Ile Asn Val Trp Leu Ala Asn Trp Gln Lys Lys Asn Phe Ser Lys Val
80 85 90

AAA AAT GTG GAT TTA TGG AAA GAA TTT TTA GAA GTC TCT AAA GGG CAT 339
Lys Asn Val Asp Leu Trp Lys Glu Phe Leu Glu Val Ser Lys Gly His
95 100 105

TCT ATT GTG GCT GTT TGG ATC AAG GGG CAT AAC GGG CAT GCC GAG AAT 387
Ser Ile Val Ala Val Trp Ile Lys Gly His Asn Gly His Ala Glu Asn
110 115 120

GAA CGA TGC GAT AGC CTC GCT AAA TTA GAG GCG CAA AAA CGG GTC AAA 435
Glu Arg Cys Asp Ser Leu Ala Lys Leu Glu Ala Gln Lys Arg Val Lys
125 130 135 140

ACG ACC ACT TAAAGGGAAA AATGATGAAA AACAAACGCT CTCAAAACAG CCC 487
Thr Thr Thr



(2) INFORMATION FOR SEQ ID NO: 788:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 143 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788:

Met Gln Glu Ile Glu Ile Phe Cys Asp Gly Ser Ser Leu Gly Asn Pro
1 5 10 15

Gly Pro Gly Gly Tyr Ala Ala Ile Leu Arg Tyr Lys Asp Lys Glu Lys
20 25 30

Thr Ile Ser Gly Gly Glu Glu Phe Thr Thr Asn Asn Arg Met Glu Leu
35 40 45
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Met Met Pro Phe Glu Ala Val lie Gly Leu Glu Val His Val Gin Leu 

1 5 10 15

Asn Thr Lys Thr Lys Ile Phe Cys Ser Cys Ser Thr Ser Phe Gly Glu
20 25 30

Ser Pro Asn Ser Asn Thr Cys Pro Val Cys Leu Gly Leu Pro Gly Ala
35 40 45

Leu Pro Val Leu Asn Lys Glu Val Val Lys Lys Ala Ile Gln Leu Gly
50 55 60

Thr Ala Ile Glu Ala Asn Ile Asn Gln Tyr Ser Ile Phe Ala Arg Lys
65 70 75 80

Asn Tyr Phe Tyr Pro Asp Leu Pro Lys Ala Tyr Gln Ile Ser Gln Phe
85 90 95

Glu Val Pro Ile Val Ser Asp Gly Lys Leu Glu Ile Asp Thr Lys Glu
100 105 110

Gly Ala Lys Ile Val Arg Ile Glu Arg Ala His Met Glu Glu Asp Ala
115 120 125

Gly Lys Asn Ile His Glu Gly Ser Tyr Ser Leu Val Asp Leu Asn Arg
130 135 140

Ala Cys Thr Pro Leu Leu Glu Ile Val Ser Lys Pro Asp Met Arg Asn
145 150 155 160

Ser Glu Glu Ala Ile Ala Tyr Leu Lys Lys Leu His Ala Ile Val Arg
165 170 175

Phe Ile Gly Ile Ser Asp Ala Asn Met Gln Glu Gly Asn Phe Arg Cys
180 185 190

Asp Ala Asn Val Ser Ile Arg Pro Lys Gly Asp Glu Lys Leu Tyr Thr
195 200 205

Arg Val Glu Ile Lys Asn Leu Asn Ser Phe Arg Phe Ile Ala Lys Ala
210 215 220

Ile Glu Tyr Glu Ile Glu Arg Gln Ser Ala Asp Val Gly Glu Arg Ala
225 230 235 240

Leu



(2) INFORMATION FOR SEQ ID NO: 787:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 487 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...444
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787:

ATGGGAGTGG ATTGA ATG CAA GAA ATT GAA ATT TTT TGC GAT GGC TCT TCT 51
Met Gln Glu Ile Glu Ile Phe Cys Asp Gly Ser Ser
1 5 10
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80 85 90 

ATT TCG CAG TTT GAA GTC CCT ATT GTG AGC GAT GGG AAA TTA GAG ATT 339
Ile Ser Gln Phe Glu Val Pro Ile Val Ser Asp Gly Lys Leu Glu Ile
95 100 105

GAC ACT AAA GAG GGT GCA AAA ATC GTG CGT ATT GAA AGG GCC CAC ATG 387
Asp Thr Lys Glu Gly Ala Lys Ile Val Arg Ile Glu Arg Ala His Met
110 115 120

GAA GAA GAC GCC GGT AAA AAT ATC CAT GAG GGC AGT TAT TCT TTA GTG 435
Glu Glu Asp Ala Gly Lys Asn Ile His Glu Gly Ser Tyr Ser Leu Val
125 130 135 140

GAT TTG AAC CGC GCT TGC ACC CCT TTA TTA GAA ATT GTC AGT AAG CCG 483
Asp Leu Asn Arg Ala Cys Thr Pro Leu Leu Glu Ile Val Ser Lys Pro
145 150 155

GAC ATG CGA AAT AGT GAA GAA GCT ATA GCG TAT TTG AAA AAG CTC CAT 531
Asp Met Arg Asn Ser Glu Glu Ala Ile Ala Tyr Leu Lys Lys Leu His
160 165 170

GCT ATC GTG CGT TTT ATA GGG ATT TCT GAT GCG AAC ATG CAA GAG GGG 579
Ala Ile Val Arg Phe Ile Gly Ile Ser Asp Ala Asn Met Gln Glu Gly
175 180 185

AAT TTC AGG TGC GAT GCG AAC GTG TCC ATT AGA CCC AAA GGC GAT GAA 627
Asn Phe Arg Cys Asp Ala Asn Val Ser Ile Arg Pro Lys Gly Asp Glu
190 195 200

AAG CTT TAT ACG AGA GTA GAG ATT AAA AAT CTA AAT AGC TTT AGA TTC 675
Lys Leu Tyr Thr Arg Val Glu Ile Lys Asn Leu Asn Ser Phe Arg Phe
205 210 215 220

ATT GCT AAA GCG ATT GAA TAC GAG ATA GAG CGC CAA AGC GCG GAC GTG 723
Ile Ala Lys Ala Ile Glu Tyr Glu Ile Glu Arg Gln Ser Ala Asp Val
225 230 235

GGA GAA CGG GCG CTA TAATGAAGAG GTGGTTCAAG AAACGCGCCT TT 770
Gly Glu Arg Ala Leu
240



(2) INFORMATION FOR SEQ ID NO: 786:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 241 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786:
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450 455 460 

Asn Tyr Tyr Leu Asn His Leu Lys Lys Glu Phe Asn His Pro Leu Phe 
465 470 475 480 

Val Ala Tyr Ala Tyr Asn Ala Gly Pro Gly Phe Leu Arg Arg Trp Leu 

485 490 495 

Glu Ser Ser Lys Arg Phe Lys Glu Lys Asn His Phe Glu Pro Trp Leu 

500 505 510 

Ser Met Glu Leu Met Pro Tyr Ser Glu Thr Arg Met Tyr Gly Phe Arg 

515 520 525 

Val Met Leu Asn Tyr Leu Ile Tyr Gln Glu Ile Phe Gly Asn Phe Ile

530 535 540

Pro Ile Asp Gly Phe Leu Glu Gln Thr Leu Asn Ser Lys Asp Lys Pro
545 550 555 560 

(2) INFORMATION FOR SEQ ID NO: 785:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 770 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...738
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785:

TAAAAAGAAG GACAA ATG ATG CCA TTT GAA GCT GTA ATC GGG CTA GAA GTC 51
Met Met Pro Phe Glu Ala Val Ile Gly Leu Glu Val
1 5 10

CAT GTC CAA CTC AAC ACC AAA ACC AAA ATC TTT TGC TCT TGC TCT ACA 99
His Val Gln Leu Asn Thr Lys Thr Lys Ile Phe Cys Ser Cys Ser Thr
15 20 25

AGC TTT GGA GAA TCC CCT AAT TCT AAC ACC TGC CCT GTG TGT TTG GGT 147
Ser Phe Gly Glu Ser Pro Asn Ser Asn Thr Cys Pro Val Cys Leu Gly
30 35 40

TTA CCG GGA GCT TTG CCG GTA TTG AAT AAA GAA GTG GTT AAA AAA GCC 195
Leu Pro Gly Ala Leu Pro Val Leu Asn Lys Glu Val Val Lys Lys Ala
45 50 55 60

ATC CAA TTA GGC ACA GCC ATT GAA GCC AAT ATC AAC CAA TAT TCT ATT 243
Ile Gln Leu Gly Thr Ala Ile Glu Ala Asn Ile Asn Gln Tyr Ser Ile
65 70 75

TTT GCG AGG AAA AAT TAT TTT TAC CCT GAT TTG CCT AAG GCT TAT CAA 291
Phe Ala Arg Lys Asn Tyr Phe Tyr Pro Asp Leu Pro Lys Ala Tyr Gln
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Ser Gin Thr Glu 
Leu Asn Leu Lys Asp Leu Glu Lys Lys Pro Ala Gly
20 25 30

Ile Val Arg Asp Tyr Tyr Leu Trp Arg Tyr Ile Ser Asp Lys Lys Thr
35 40 45

Ser Leu Glu Asn Ala Lys Lys Ala Tyr Glu Leu Thr Gln Asn Lys Asn
50 55 60

Asn Ala Leu Gln Lys Ala Met Gln Glu Lys Gly Ser Asp Asn Ala Glu
65 70 75 80

Lys Asn Pro Asp Val Lys Leu Pro Glu Asp Ile Tyr Cys Lys Gln Thr
85 90 95

Ala Leu Glu Ser Met Leu Glu Thr Thr Asp Thr Phe Gln Ala Ser Cys
100 105 110

Ile Ala Ile Ala Leu Lys Ser Lys Ile Arg Asp Phe Asp Lys Ile Pro
115 120 125

Ile Glu Thr Leu Lys Pro Leu Gln Ile Lys Ile Lys Glu Ala Tyr Pro
130 135 140

Val Leu Tyr Glu Glu Leu Glu Ile Leu Gln Ser Lys His Val Ser Ala
145 150 155 160

Ser Leu Phe Lys Ala Asn Ala Gln Val Phe Ser Ala Leu Phe Asn His
165 170 175

Leu Ser Tyr Glu Lys Lys Leu Gln Ile Phe Glu Lys His Ile Pro Ile
180 185 190

Lys Glu Leu Asn Arg Leu Leu Asp Glu Asn Tyr Pro Ala Phe Asn Arg
195 200 205

Leu Ile Tyr Gln Val Ile Leu Asp Pro Lys Leu Asp His Phe Lys Asp
210 215 220

Ala Leu Thr Lys Ser Asn Ala Thr His Ser Asn Ala Gln Thr Phe Phe
225 230 235 240

Ile Leu Gly Ile Asn Glu Ile Leu Arg Lys Lys Pro Ser Lys Ala Leu
245 250 255

Lys Tyr Phe Glu Arg Ser Glu Ala Val Val Lys Asp Asp Asp Phe Ser
260 265 270

Lys Asp Arg Ala Ile Phe Trp Gln Tyr Leu Val Ser Lys Lys Lys Lys
275 280 285

Thr Leu Glu Arg Leu Ser Gln Ser Pro Ala Leu Asn Leu Tyr Ser Leu
290 295 300

Tyr Ala Ser Arg Lys Leu Lys Thr Thr Pro Ser Tyr Arg Ile Ile Ser
305 310 315 320

Arg Ile Gln Asn Leu Ser Gln Glu Asp Pro Pro Phe Asn Thr Tyr Asp
325 330 335

Pro Phe Ser Trp Gln Ile Phe Lys Glu Lys Thr Leu Ser Leu Lys Asp
340 345 350

Glu Gly Ala Phe Asn Ala Met Leu Lys Ser Leu Tyr Tyr Glu Lys Ser
355 360 365

Ala Pro Glu Leu Thr Tyr Leu Leu Ser Gln Arg Asn Lys Asp Lys Ile
370 375 380

Tyr Tyr Tyr Leu Ser Pro Tyr Glu Gly Ile Ile Glu Trp Gln Asn Thr
385 390 395 400

Asp Glu Lys Ala Met Ala Tyr Ala Ile Ala Arg Gln Glu Ser Phe Leu
405 410 415

Leu Pro Ala Val Ile Ser Arg Ser Phe Ala Leu Gly Leu Met Gln Ile
420 425 430

Met Pro Phe Asn Val Gly Pro Phe Ala Lys Ser Leu Gly Met Asp Asn
435 440 445

Ile Asp Leu Asn Asp Met Phe Asn Pro Asn Ile Ala Leu Lys Phe Gly
Arg Gln Glu Ser Phe Leu Leu Pro Ala Val Ile Ser Arg Ser Phe Ala
415 420 425

CTG GGG CTT ATG CAA ATC ATG CCC TTT AAT GTA GGG CCT TTC GCT AAA 1347
Leu Gly Leu Met Gln Ile Met Pro Phe Asn Val Gly Pro Phe Ala Lys
430 435 440

AGC CTT GGC ATG GAT AAC ATT GAT CTA AAC GAG ATG TTT AAC CCC AAC 1395
Ser Leu Gly Met Asp Asn Ile Asp Leu Asn Asp Met Phe Asn Pro Asn
445 450 455

ATC GCT CTC AAA TTT GGC AAT TAT TAC TTG AAC CAT TTG AAA AAA GAA 1443
Ile Ala Leu Lys Phe Gly Asn Tyr Tyr Leu Asn His Leu Lys Lys Glu
460 465 470

TTC AAC CAC CCC CTT TTT GTC GCC TAC GCT TAT AAC GCT GGG CCT GGG 1491
Phe Asn His Pro Leu Phe Val Ala Tyr Ala Tyr Asn Ala Gly Pro Gly
475 480 485 490

TTT TTA AGG AGG TGG TTA GAA AGT TCC AAA CGA TTT AAA GAA AAA AAT 1539
Phe Leu Arg Arg Trp Leu Glu Ser Ser Lys Arg Phe Lys Glu Lys Asn
495 500 505

CAT TTT GAG CCA TGG CTT AGC ATG GAG CTT ATG CCT TAT AGC GAG ACT 1587
His Phe Glu Pro Trp Leu Ser Met Glu Leu Met Pro Tyr Ser Glu Thr
510 515 520

CGC ATG TAT GGC TTT AGG GTC ATG CTC AAT TAC TTG ATT TAT CAA GAA 1635
Arg Met Tyr Gly Phe Arg Val Met Leu Asn Tyr Leu Ile Tyr Gln Glu
525 530 535

ATT TTT GGG AAT TTC ATC CCT ATT GAT GGA TTT TTA GAA CAA ACT CTT 1683
Ile Phe Gly Asn Phe Ile Pro Ile Asp Gly Phe Leu Glu Gln Thr Leu
540 545 550

AAC TCA AAG GAC AAA CCA TGATTAAAAA ATGCCTTTTT CCTGCTGCGG GCTATGGC 1739
Asn Ser Lys Asp Lys Pro
555 560

A 1740
(2) INFORMATION FOR SEQ ID NO: 784: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784: 

Met Arg Phe Phe Thr Leu Phe Phe Ile Gly Met Leu Gly Val Gly Phe
1 5 10 15
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190 195 200 



TAT CCG GCG TTT AAC CGC TTG ATC TAT CAG GTT ATT TTA GAT CCT AAA 675
Tyr Pro Ala Phe Asn Arg Leu Ile Tyr Gln Val Ile Leu Asp Pro Lys
205 210 215

TTG GAT CAT TTT AAA GAG GCT CTC ACT AAA AGT AAC GCT ACC CAC AGC 723
Leu Asp His Phe Lys Asp Ala Leu Thr Lys Ser Asn Ala Thr His Ser
220 225 230

AAC GCG CAA ACC TTT TTT ATT CTA GGG ATT AAT GAA ATC TTG CGC AAA 771
Asn Ala Gln Thr Phe Phe Ile Leu Gly Ile Asn Glu Ile Leu Arg Lys
235 240 245 250

AAA CCC TCT AAA GCG CTC AAG TAT TTT GAA CGA TCA GAA GCG GTT GTC 819
Lys Pro Ser Lys Ala Leu Lys Tyr Phe Glu Arg Ser Glu Ala Val Val
255 260 265

AAA GAC GAT GAT TTT TCA AAA GAC AGA GCG ATT TTT TGG CAG TAT TTA 867
Lys Asp Asp Asp Phe Ser Lys Asp Arg Ala Ile Phe Trp Gln Tyr Leu
270 275 280

GTT TCT AAA AAG AAA AAA ACT TTA GAA CGC CTT TCA CAA AGC CCA GCT 915
Val Ser Lys Lys Lys Lys Thr Leu Glu Arg Leu Ser Gln Ser Pro Ala
285 290 295

TTA AAT CTC TAT AGT CTT TAT GCG AGC CGC AAA CTC AAA ACC ACG CCC 963
Leu Asn Leu Tyr Ser Leu Tyr Ala Ser Arg Lys Leu Lys Thr Thr Pro
300 305 310

AGT TAC CGC ATC ATT TCA CGC ATC CAG AAT TTA AGC CAA GAA GAT CCT 1011
Ser Tyr Arg Ile Ile Ser Arg Ile Gln Asn Leu Ser Gln Glu Asp Pro
315 320 325 330

CCT TTT AAC ACT TAT GAC CCT TTT TCG TGG CAA ATT TTT AAG GAA AAA 1059
Pro Phe Asn Thr Tyr Asp Pro Phe Ser Trp Gln Ile Phe Lys Glu Lys
335 340 345

ACC TTG AGT TTG AAA GAT GAG GGC GCG TTT AAT GCG ATG CTA AAA AGC 1107
Thr Leu Ser Leu Lys Asp Glu Gly Ala Phe Asn Ala Met Leu Lys Ser
350 355 360

CTG TAT TAT GAA AAA AGC GCT CCT GAA TTG ACC TAT CTT TTA AGC CAA 1155
Leu Tyr Tyr Glu Lys Ser Ala Pro Glu Leu Thr Tyr Leu Leu Ser Gln
365 370 375

CGC AAT AAA GAC AAG ATT TAT TAT TAT TTA TCC CCT TAT GAG GGC ATT 1203
Arg Asn Lys Asp Lys Ile Tyr Tyr Tyr Leu Ser Pro Tyr Glu Gly Ile
380 385 390

ATT GAA TGG CAA AAT ACT GAT GAA AAG GCT ATG GCG TAT GCG ATC GCT 1251
Ile Glu Trp Gln Asn Thr Asp Glu Lys Ala Met Ala Tyr Ala Ile Ala
395 400 405 410

AGG CAA GAA AGC TTT TTG CTC CCG GCA GTC ATT TCG CGC TCG TTC GCT 1299
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783:

TAAAAACCTT TAAAGAAAAT C ATG CGT TTT TTT ACC TTG TTT TTT ATC GGT 51
Met Arg Phe Phe Thr Leu Phe Phe Ile Gly
1 5 10

ATG CTT GGC GTT GGT TTT TCT CAA ACC GAG TTA AAT TTA AAA GAT TTA 99
Met Leu Gly Val Gly Phe Ser Gln Thr Glu Leu Asn Leu Lys Asp Leu
15 20 25

GAA AAA AAG CCC GCC GGG ATC GTT AGG GAT TAT TAT TTG TGG CGT TAT 147
Glu Lys Lys Pro Ala Gly Ile Val Arg Asp Tyr Tyr Leu Trp Arg Tyr
30 35 40

ATT AGC GAT AAA AAA ACC AGT TTA GAA AAC GCT AAA AAA GCC TAT GAA 195
Ile Ser Asp Lys Lys Thr Ser Leu Glu Asn Ala Lys Lys Ala Tyr Glu
45 50 55

TTG ACT CAA AAT AAA AAC AAC GCC CTA CAA AAG GCC ATG CAA GAA AAA 243
Leu Thr Gln Asn Lys Asn Asn Ala Leu Gln Lys Ala Met Gln Glu Lys
60 65 70

GGC TCA GAC AAT GCA GAA AAA AAC CCT GAT GTT AAA TTG CCT GAA GAT 291
Gly Ser Asp Asn Ala Glu Lys Asn Pro Asp Val Lys Leu Pro Glu Asp
75 80 85 90

ATT TAT TGC AAG CAA ACG GCT TTA GAA AGC ATG CTA GAA ACA ACA GAC 339
Ile Tyr Cys Lys Gln Thr Ala Leu Glu Ser Met Leu Glu Thr Thr Asp
95 100 105

ACT TTC CAA GCA AGC TGC ATC GCT ATC GCT TTA AAA TCA AAG ATC AGA 387
Thr Phe Gln Ala Ser Cys Ile Ala Ile Ala Leu Lys Ser Lys Ile Arg
110 115 120

GAT TTT GAT AAA ATC CCT ATT GAA ACC CTT AAG CCC TTA CAA ATT AAA 435
Asp Phe Asp Lys Ile Pro Ile Glu Thr Leu Lys Pro Leu Gln Ile Lys
125 130 135

ATC AAA GAG GCT TAC CCC GTT CTT TAT GAA GAA TTA GAA ATT TTG CAA 483
Ile Lys Glu Ala Tyr Pro Val Leu Tyr Glu Glu Leu Glu Ile Leu Gln
140 145 150

AGT AAG CAT GTG AGC GCT TCT TTG TTT AAG GCT AAC GCG CAA GTG TTT 531
Ser Lys His Val Ser Ala Ser Leu Phe Lys Ala Asn Ala Gln Val Phe
155 160 165 170

AGC GCG CTT TTC AAT CAT TTG AGT TAT GAA AAA AAG CTC CAA ATT TTT 579
Ser Ala Leu Phe Asn His Leu Ser Tyr Glu Lys Lys Leu Gln Ile Phe
175 180 185

GAA AAG CAT ATC CCC ATT AAA GAG TTA AAC CGT CTT TTA GAC GAA AAT 627
Glu Lys His Ile Pro Ile Lys Glu Leu Asn Arg Leu Leu Asp Glu Asn
CCG GTG TTT TAT TTT TTA CGC TCT AGA TTC AAG CTG GTG TTT AAC GGG 250
Pro Val Phe Tyr Phe Leu Arg Ser Arg Phe Lys Leu Val Phe Asn Gly
55 60 65

TTG GAT TTC TCT CCT TTA GTG GTG GTC ATT GTT TTG AAA TTC TTG GAT 298
Leu Asp Phe Ser Pro Leu Val Val Val Ile Val Leu Lys Phe Leu Asp
70 75 80

CTC ACG CTC ATT CAG TGG CTT TTC ATG CTC GCT AAA AAC CTT TAAAGAAAA 349
Leu Thr Leu Ile Gln Trp Leu Phe Met Leu Ala Lys Asn Leu
85 90 95

TCATGCGTTT T 360

(2) INFORMATION FOR SEQ ID NO: 782: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782: 

Met Ile Phe Ser Thr Leu Ile Asn Ala Ile Ala Val Ile Leu Ser Ser
1 5 10 15

Leu Ile Thr Ile Tyr Met Trp Ile Val Ile Ile Tyr Ser Leu Ile Ser
20 25 30

Phe Val Gln Pro Asn Pro Asn Asn Pro Ile Met Gln Ile Leu Ala Arg
35 40 45

Leu Cys Glu Pro Val Phe Tyr Phe Leu Arg Ser Arg Phe Lys Leu Val
50 55 60

Phe Asn Gly Leu Asp Phe Ser Pro Leu Val Val Val Ile Val Leu Lys
65 70 75 80

Phe Leu Asp Leu Thr Leu Ile Gln Trp Leu Phe Met Leu Ala Lys Asn
85 90 95

Leu



(2) INFORMATION FOR SEQ ID NO: 783:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1740 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 22... 1701
(D) OTHER INFORMATION:
225 230 235 240

Ala Lys Glu Cys His Lys Ser Phe Leu Asn Ile His Asn Lys Thr Glu
245 250 255

Leu Lys Phe Leu Leu Lys Asn Tyr Asp Leu Glu Pro Phe Asn Leu Ala
260 265 270

Leu Asp Phe Tyr Ala Leu Lys Asn Pro Lys His Ala Leu Lys Ile Lys
275 280 285

Gly Leu Leu Lys Glu Ile Phe Asp Ser Asn Glu Pro Phe Lys Lys Glu
290 295 300

His Leu Ala Leu Lys Gly Gly Ala Leu Gln Ser Leu Gly Tyr Gln His
305 310 315 320

Gln Lys Ile Gly Glu Ile Leu Asn Ala Cys Leu Asp Leu Val Ile Ala
325 330 335

Asn Pro Lys Asn Asn Ala Leu Glu Trp Leu Ile Glu Trp Val Lys Gly
340 345 350

His Tyr Leu Pro Asn Asp Thr Ile Asn Leu Ser Pro Ile Gly Arg Arg
355 360 365

Asn



(2) INFORMATION FOR SEQ ID NO: 781:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 360 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 50... 340
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781: 

TTTTCCCCTA TATCCAAAGC CATCATCAAG AAGTTTTAAG GCTCAAAGC ATG ATT TTT 58
Met Ile Phe
1

TCC ACT CTT ATT AAT GCG ATA GCG GTG ATT TTA AGC TCG CTC ATT ACG 106
Ser Thr Leu Ile Asn Ala Ile Ala Val Ile Leu Ser Ser Leu Ile Thr
5 10 15

ATT TAT ATG TGG ATA GTA ATC ATT TAT TCG CTT ATC AGT TTC GTG CAG 154
Ile Tyr Met Trp Ile Val Ile Ile Tyr Ser Leu Ile Ser Phe Val Gln
20 25 30 35

CCT AAC CCC AAT AAC CCC ATC ATG CAA ATT CTC GCT CGC TTG TGT GAG 202
Pro Asn Pro Asn Asn Pro Ile Met Gln Ile Leu Ala Arg Leu Cys Glu
40 45 50
GGC GAA ATT TTA AAC GCA TGC TTA GAT TTA GTC ATC GCT AAC CCT AAA 1065
Gly Glu Ile Leu Asn Ala Cys Leu Asp Leu Val Ile Ala Asn Pro Lys
325 330 335

AAT AAC GCT TTA GAA TGG CTG ATT GAA TGG GTT AAG GGT CAT TAT TTA 1113
Asn Asn Ala Leu Glu Trp Leu Ile Glu Trp Val Lys Gly His Tyr Leu
340 345 350 355

CCT AAT GAT ACT ATA AAT CTT TCG CCA ATA GGC AGA AGA AAT TAAAAACAG 1164
Pro Asn Asp Thr Ile Asn Leu Ser Pro Ile Gly Arg Arg Asn
360 365

AGAAAACATG ATAACGATGA ATGCGATTCA ATGGCCT 1201

(2) INFORMATION FOR SEQ ID NO: 780:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 369 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780:

Met Gly Ile Thr Pro Lys Asp Tyr Asp Leu Thr Ser Asn Ala Leu Val
1 5 10 15

Asn Glu Ser Lys Glu Leu Leu Leu Lys Arg His Phe Arg Val Leu Glu
20 25 30

Thr Gly Ile Lys His Gly Thr Ile Thr Ala Leu Lys Asn His Gln Ser
35 40 45

Tyr Glu Ile Thr Thr Phe Arg Ile Glu Lys Gly His Ile Lys His Arg
50 55 60

Lys Pro Lys Glu Leu Val Phe Ser Val His Leu Thr Asp Asp Leu Lys
65 70 75 80

Arg Arg Asp Phe Ser Met Asn Ala Ile Ala Tyr Ser Pro Thr Lys Gly
85 90 95

Leu Ile Asp Pro Phe Lys Gly Gln Asn Ala Ile Glu Asn Gln Met Ile
100 105 110

Glu Cys Val Gly Glu Ala Arg Leu Arg Phe Phe Glu Asp Ala Leu Arg
115 120 125

Ile Leu Arg Ser Leu Arg Phe Ser Ala Thr Leu Gly Phe Lys Ile Ala
130 135 140

Pro Asn Thr Lys Glu Ala Val Phe Ala Cys Lys Asp Leu Leu Lys His
145 150 155 160

Leu Ser Lys Glu Arg Leu Gln Ser Glu Leu Asn Lys Leu Leu Met Gly
165 170 175

Lys Asn Ala Tyr Glu Val Ala Lys Glu Tyr Gln Glu Ile Leu Glu Leu
180 185 190

Val Ile Gln Glu Lys Ile Glu Asn Leu Gly Phe Leu Lys Asn Ala Pro
195 200 205

Phe Asn Leu Glu Leu Arg Leu Leu Gly Phe Phe Lys His Gln Lys Ser
210 215 220

Leu Glu Ser Leu Arg Tyr Pro Lys Lys Thr Ile Val Leu Phe Ser Lys
Pro Phe Lys Gly Gln Asn Ala Ile Glu Asn Gln Met Ile Glu Cys Val
100 105 110 115

GGG GAA GCG CGA TTA AGG TTT TTT GAA GAC GCT TTA AGG ATT TTA AGA 441
Gly Glu Ala Arg Leu Arg Phe Phe Glu Asp Ala Leu Arg Ile Leu Arg
120 125 130

TCG CTG CGA TTC AGT GCA ACT TTA GGC TTT AAG ATA GCG CCA AAC ACC 489
Ser Leu Arg Phe Ser Ala Thr Leu Gly Phe Lys Ile Ala Pro Asn Thr
135 140 145

AAA GAA GCG GTT TTT GCG TGT AAG GAT TTG TTA AAA CAC CTT TCT AAA 537
Lys Glu Ala Val Phe Ala Cys Lys Asp Leu Leu Lys His Leu Ser Lys
150 155 160

GAA CGC TTA CAA AGT GAA TTG AAT AAG CTT CTT ATG GGG AAA AAC GCC 585
Glu Arg Leu Gln Ser Glu Leu Asn Lys Leu Leu Met Gly Lys Asn Ala
165 170 175

TAT GAA GTG GCT AAA GAA TAT CAA GAA ATT TTA GAG TTG GTT ATT CAA 633
Tyr Glu Val Ala Lys Glu Tyr Gln Glu Ile Leu Glu Leu Val Ile Gln
180 185 190 195

GAA AAA ATA GAA AAT TTA GGG TTT TTA AAA AAC GCG CCT TTC AAT CTG 681
Glu Lys Ile Glu Asn Leu Gly Phe Leu Lys Asn Ala Pro Phe Asn Leu
200 205 210

GAA TTA AGA TTG TTA GGG TTT TTT AAG CAT CAA AAA AGT TTA GAA AGT 729
Glu Leu Arg Leu Leu Gly Phe Phe Lys His Gln Lys Ser Leu Glu Ser
215 220 225

TTA CGC TAC CCT AAA AAA ACG ATC GTT TTA TTT TCC AAA GCT AAA GAA 777
Leu Arg Tyr Pro Lys Lys Thr Ile Val Leu Phe Ser Lys Ala Lys Glu
230 235 240

TGC CAT AAA TCT TTT TTA AAT ATT CAT AAC AAA ACA GAG TTA AAA TTT 825
Cys His Lys Ser Phe Leu Asn Ile His Asn Lys Thr Glu Leu Lys Phe
245 250 255

TTA TTG AAA AAC TAC GAT TTA GAG CCT TTT AAT TTG GCT TTA GAT TTT 873
Leu Leu Lys Asn Tyr Asp Leu Glu Pro Phe Asn Leu Ala Leu Asp Phe
260 265 270 275

TAT GCG CTC AAA AAC CCC AAA CAT GCT TTA AAA ATT AAA GGC TTG TTA 921
Tyr Ala Leu Lys Asn Pro Lys His Ala Leu Lys Ile Lys Gly Leu Leu
280 285 290

AAA GAA ATC TTT GAT TCT AAC GAG CCT TTT AAA AAA GAA CAC TTG GCC 969
Lys Glu Ile Phe Asp Ser Asn Glu Pro Phe Lys Lys Glu His Leu Ala
295 300 305

CTT AAG GGC GGT GCG CTT CAA AGC TTG GGT TAC CAG CAC CAA AAA ATC 1017
Leu Lys Gly Gly Ala Leu Gln Ser Leu Gly Tyr Gln His Gln Lys Ile
310 315 320
Gly Glu Arg Lys Ile Leu His Ala Tyr Gly Tyr Gly Cys Asp Lys Cys
195 200 205

Pro Ala Cys Gln Leu Arg Lys Lys Gly Phe Glu Glu Phe Gln Ala Asn
210 215 220

Lys Lys
225

(2) INFORMATION FOR SEQ ID NO: 779:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1201 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 49... 1155
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779:

TCCTCTCATG AGCTTTACTT GGTAGGGGGG TGCGTGCGCG ATTATTTA ATG GGC ATT 57
Met Gly Ile
1

ACC CCA AAA GAT TAC GAT TTA ACC TCA AAC GCT TTA GTC AAT GAA AGC 105
Thr Pro Lys Asp Tyr Asp Leu Thr Ser Asn Ala Leu Val Asn Glu Ser
5 10 15

AAA GAG CTT CTT TTA AAG CGC CAT TTT AGG GTG CTA GAA ACC GGT ATC 153
Lys Glu Leu Leu Leu Lys Arg His Phe Arg Val Leu Glu Thr Gly Ile
20 25 30 35

AAA CAT GGT ACG ATC ACG GCT CTT AAA AAC CAT CAA AGC TAT GAA ATC 201
Lys His Gly Thr Ile Thr Ala Leu Lys Asn His Gln Ser Tyr Glu Ile
40 45 50

ACA ACT TTT AGA ATT GAA AAG GGG CAT ATC AAA CAC CGA AAG CCT AAA 249
Thr Thr Phe Arg Ile Glu Lys Gly His Ile Lys His Arg Lys Pro Lys
55 60 65

GAA TTG GTT TTT AGC GTT CAT TTA ACA GAC GAT TTA AAG CGG CGC GAT 297
Glu Leu Val Phe Ser Val His Leu Thr Asp Asp Leu Lys Arg Arg Asp
70 75 80

TTT AGC ATG AAT GCG ATC GCT TAT AGC CCT ACA AAA GGG CTG ATT GAT 345
Phe Ser Met Asn Ala Ile Ala Tyr Ser Pro Thr Lys Gly Leu Ile Asp
85 90 95

CCT TTT AAA GGG CAG AAT GCG ATT GAA AAT CAA ATG ATT GAA TGC GTG 393
160 165 170

ATG GCT AAA GAT TTG GGC GTC TTG GAT TTA GTC ATC AAA GAA ACG CAC 580
Met Ala Lys Asp Leu Gly Val Leu Asp Leu Val Ile Lys Glu Thr His
175 180 185

ACC TGC TAT CAA GGA GAG CGA AAG ATT TTG CAT GCT TAT GGT TAT GGT 628
Thr Cys Tyr Gln Gly Glu Arg Lys Ile Leu His Ala Tyr Gly Tyr Gly
190 195 200

TGC GAT AAA TGC CCG GCA TGC CAA TTG AGA AAA AAA GGC TTT GAA GAG 676
Cys Asp Lys Cys Pro Ala Cys Gln Leu Arg Lys Lys Gly Phe Glu Glu
205 210 215 220

TTT CAA GCT AAT AAA AAA TAAGGTTTTT TAAAAAACCA A 715
Phe Gln Ala Asn Lys Lys
225



(2) INFORMATION FOR SEQ ID NO: 778:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 226 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778:

Met Glu Gln Lys Ile Cys Val Ile Gly Phe Ser Gly Gly Gln Asp Ser
1 5 10 15

Thr Thr Leu Ala Val Trp Ala Lys Lys Arg Phe Lys Lys Val Cys Leu
20 25 30

Val Gly Phe Asp Tyr Ala Gln Lys His Ser Val Glu Leu Glu Cys Ala
35 40 45

Gln Lys Ile Ala Ser Leu Leu Gln Leu Pro Tyr Glu Ile Ile Pro Leu
50 55 60

Asp Phe Leu Glu Asn Ile Thr Arg Ser Ala Leu Phe Lys Asn Ser Asn
65 70 75 80

Asp Leu Ile Gly His Ser His Ala Gln Asn Lys Asp Leu Pro Asn Ser
85 90 95

Phe Val Pro Asn Arg Asn Ala Ile Phe Ile Thr Leu Leu His Ser Tyr
100 105 110

Ala Gln Lys Leu Gly Ala Ser Asn Ile Ala Leu Gly Val Ser Gln Ala
115 120 125

Asp Phe Ser Gly Tyr Pro Asp Cys Lys Glu Asp Phe Ile Lys Ser Ile
130 135 140

Glu His Ala Leu Asn Leu Gly Ser Asn Thr Ala Ile Lys Ile Leu Thr
145 150 155 160

Pro Leu Met Phe Leu Asn Lys Ala Gln Glu Phe Gln Met Ala Lys Asp
165 170 175

Leu Gly Val Leu Asp Leu Val Ile Lys Glu Thr His Thr Cys Tyr Gln
180 185 190






(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 17... 694
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777: 

TTTTTAAGGA ATTTTG ATG GAA CAA AAA ATT TGC GTG ATC GGT TTT AGC GGC 52
Met Glu Gln Lys Ile Cys Val Ile Gly Phe Ser Gly
1 5 10

GGG CAA GAC AGC ACC ACT TTA GCC GTG TGG GCG AAA AAG CGT TTT AAA 100
Gly Gln Asp Ser Thr Thr Leu Ala Val Trp Ala Lys Lys Arg Phe Lys
15 20 25

AAA GTC TGT TTA GTG GGG TTT GAT TAT GCG CAA AAA CAC TCT GTG GAA 148
Lys Val Cys Leu Val Gly Phe Asp Tyr Ala Gln Lys His Ser Val Glu
30 35 40

TTA GAA TGC GCT CAA AAA ATC GCT TCT CTT TTA CAA CTC CCT TAT GAA 196
Leu Glu Cys Ala Gln Lys Ile Ala Ser Leu Leu Gln Leu Pro Tyr Glu
45 50 55 60

ATC ATC CCA TTA GAT TTT TTA GAA AAT ATC ACC CGC TCT GCG CTT TTT 244
Ile Ile Pro Leu Asp Phe Leu Glu Asn Ile Thr Arg Ser Ala Leu Phe
65 70 75

AAA AAC TCT AAC GAT TTA ATA GGG CAT TCG CAT GCG CAA AAT AAA GAT 292
Lys Asn Ser Asn Asp Leu Ile Gly His Ser His Ala Gln Asn Lys Asp
80 85 90

TTA CCC AAT TCT TTT GTG CCT AAT CGT AAC GCT ATT TTT ATC ACC CTT 340
Leu Pro Asn Ser Phe Val Pro Asn Arg Asn Ala Ile Phe Ile Thr Leu
95 100 105

TTG CAT TCT TAC GCG CAA AAA CTA GGG GCT AGC AAT ATC GCT TTA GGA 388
Leu His Ser Tyr Ala Gln Lys Leu Gly Ala Ser Asn Ile Ala Leu Gly
110 115 120

GTT TCG CAA GCG GAT TTT AGC GGC TAT CCG GAT TGT AAA GAA GAT TTT 436
Val Ser Gln Ala Asp Phe Ser Gly Tyr Pro Asp Cys Lys Glu Asp Phe
125 130 135 140

ATT AAA AGC ATC GAG CAT GCC TTA AAT TTA GGA TCA AAC ACG GCG ATT 484
Ile Lys Ser Ile Glu His Ala Leu Asn Leu Gly Ser Asn Thr Ala Ile
145 150 155

AAA ATC CTA ACG CCT TTA ATG TTT TTG AAT AAA GCG CAA GAA TTT CAA 532
Lys Ile Leu Thr Pro Leu Met Phe Leu Asn Lys Ala Gln Glu Phe Gln
Leu Ile Asp Cys Leu Thr Gln Leu Lys Gly Met Leu Lys Thr Gln Asp
115 120 125

Phe Ile Pro Ile Phe Thr Ser Arg Glu Ala Leu Ser Leu Ser Leu Lys
130 135 140

Asn Pro Ser Pro Ser Val Ile Phe Ser Asp Leu Ser Ser Val Leu Ser
145 150 155 160

Cys Thr Lys Leu Pro Leu Glu Asp Ala Lys Tyr Leu Ala Ser Leu Glu
165 170 175

Lys Pro Ser Ile Lys Ala Pro Leu Lys Ser Val Phe Lys Asp Thr Phe
180 185 190

Lys Asn Asp Glu Ile Ile Ala Gln Leu Pro Tyr Asp Pro Ile Leu Asn
195 200 205

Leu Leu Cys His Ile Leu Gln Asp Glu Gly Ile Glu Phe Val Phe Met
210 215 220

His Glu Ser Arg Ser Cys Glu Ala Leu Leu Tyr Tyr Glu Ala Leu Phe
225 230 235 240

Lys Thr Pro Lys Arg Leu Ile Thr Pro Thr Lys Lys Phe Val Leu Glu
245 250 255

Asn Asn Phe Ser Thr Phe Pro Phe Lys Asp Glu Leu Glu Phe Leu Ser
260 265 270

Ala Thr Pro Asn Ser Ile Val Leu Tyr Leu Ser Phe Lys Arg Pro Thr
275 280 285

Arg Leu Leu Leu His Ala Asn Gly Ser Leu Lys Thr Leu Leu Ser Val
290 295 300

Ser Phe Asp Phe Asn Lys Met Phe Asn Ala Leu Lys Gln Asp Glu Lys
305 310 315 320

Ala Ser Arg Met Leu Gln Asn Tyr Ala Thr Lys Phe Pro Asp Phe Tyr
325 330 335

Ala Arg Ile Val Glu Leu Ser Lys Tyr Asp Leu Gly Gly Ala Asn Leu
340 345 350

Leu Asp Phe Phe Cys Ile Leu Gly Phe Val Leu Gly Tyr Ser Glu Asp
355 360 365

Phe Cys Thr Gln Ser Val Ile Pro Leu Ala Lys Glu Cys Leu Arg Pro
370 375 380

Lys Gly Pro Arg Ile Asp Tyr Lys Ile Leu Lys Asp Asn Ser Leu Lys
385 390 395 400

Met Ala Leu Asn Phe Ser Lys Ile Met His Ser Ala Met Ser Phe Arg
405 410 415

Leu Ala Gly Val Glu Asn Glu Ile Leu Ser Leu Gly Ile Leu Asp Ser
420 425 430

Leu Ala Glu Phe Leu Gly Asn Phe Ile Trp Asp Asn Ala Gln Asn Phe
435 440 445

Ser Val Gln Glu Val Thr Ile Ala Gly Asp Phe Phe Gly Glu Lys Val
450 455 460

Phe Leu Asp Leu Phe Val Arg Tyr Phe Pro Lys Thr Leu Ala Leu Lys
465 470 475 480

Thr His Ala Phe Leu Asp Tyr Glu
485

(2) INFORMATION FOR SEQ ID NO: 777: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
AAA ATC CTT AAA GAC AAT TCT TTG AAA ATG GCT TTA AAC TTT TCA AAG 1252
Lys Ile Leu Lys Asp Asn Ser Leu Lys Met Ala Leu Asn Phe Ser Lys
395 400 405

ATC ATG CAC AGT GCG ATG AGT TTC AGG CTC GCA GGC GTG GAA AAT GAA 1300
Ile Met His Ser Ala Met Ser Phe Arg Leu Ala Gly Val Glu Asn Glu
410 415 420

ATT TTG AGT TTG GGG ATT TTG GAT TCT TTA GCG GAG TTT TTA GGG AAT 1348
Ile Leu Ser Leu Gly Ile Leu Asp Ser Leu Ala Glu Phe Leu Gly Asn
425 430 435

TTC ATT TGG GAT AAC GCG CAA AAT TTT AGC GTT CAA GAA GTA ACG ATC 1396
Phe Ile Trp Asp Asn Ala Gln Asn Phe Ser Val Gln Glu Val Thr Ile
440 445 450 455

GCT GGG GAT TTC TTT GGC GAA AAA GTG TTT TTG GAT TTG TTT GTG CGG 1444
Ala Gly Asp Phe Phe Gly Glu Lys Val Phe Leu Asp Leu Phe Val Arg
460 465 470

TAT TTC CCT AAA ACC CTA GCC CTT AAA ACG CAT GCA TTT TTG GAT TAT 1492
Tyr Phe Pro Lys Thr Leu Ala Leu Lys Thr His Ala Phe Leu Asp Tyr
475 480 485

GAA TAAGGGCTTA AAAGCGGATG TGCATCATCA GCCCGCCGTC CATGTATT 1543
Glu



(2) INFORMATION FOR SEQ ID NO: 776:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 488 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776:



Met Ala Cys Asn Leu Gln Ala Arg Phe Tyr Ser Val Tyr Lys Asp Asn
1 5 10 15

Thr Thr Ser Phe Tyr Leu Gln Ala Ser Ala Glu Thr Thr Leu Glu Phe
20 25 30

Ala Gln Lys Leu Ser Glu Ile Leu Pro Phe Ser Leu Asp Phe Ser Phe
35 40 45

Leu Ser Leu Lys Glu Ile Thr Glu Pro Leu Asp Glu Asn Leu Phe Gln
50 55 60

Thr Ala Ser Leu Ser Lys Pro Leu Phe Met Asn Ala Lys Glu His Gln
65 70 75 80

Asp Phe Leu Asp Lys Asn Ser Ser Leu Tyr Ala Asp Thr Leu Gly Leu
85 90 95

Ile Lys Asn Thr Ala Phe Lys Gly Asp Ile Ile His Ser Pro Lys Glu
100 105 110
GAC GCT AAA TAT TTG GCC AGT TTG GAA AAA CCC TCC ATC AAA GCC CCA 580
Asp Ala Lys Tyr Leu Ala Ser Leu Glu Lys Pro Ser Ile Lys Ala Pro
170 175 180

TTA AAA AGC GTG TTT AAA GAC ACT TTC AAA AAC GAT GAA ATC ATC GCC 628
Leu Lys Ser Val Phe Lys Asp Thr Phe Lys Asn Asp Glu Ile Ile Ala
185 190 195

CAG CTA CCC TAT GAC CCC ATA TTG AAT TTA TTG TGC CAT ATT TTA CAA 676
Gln Leu Pro Tyr Asp Pro Ile Leu Asn Leu Leu Cys His Ile Leu Gln
200 205 210 215

GAT GAG GGG ATA GAA TTT GTT TTT ATG CAT GAA AGC CGT TCT TGT GAA 724
Asp Glu Gly Ile Glu Phe Val Phe Met His Glu Ser Arg Ser Cys Glu
220 225 230

GCG CTT TTG TAT TAT GAA GCG CTT TTT AAA ACC CCT AAA CGC TTG ATC 772
Ala Leu Leu Tyr Tyr Glu Ala Leu Phe Lys Thr Pro Lys Arg Leu Ile
235 240 245

ACA CCC ACT AAA AAA TTC GTG CTA GAA AAT AAT TTT TCT ACC TTT CCC 820
Thr Pro Thr Lys Lys Phe Val Leu Glu Asn Asn Phe Ser Thr Phe Pro
250 255 260

TTT AAA GAT GAA TTA GAG TTT TTA AGC GCA ACC CCC AAT TCT ATC GTT 868
Phe Lys Asp Glu Leu Glu Phe Leu Ser Ala Thr Pro Asn Ser Ile Val
265 270 275

TTG TAT CTC AGT TTC AAG CGC CCT ACA AGG TTG TTA TTG CAT GCT AAT 916
Leu Tyr Leu Ser Phe Lys Arg Pro Thr Arg Leu Leu Leu His Ala Asn
280 285 290 295

GGT TCT TTA AAA ACG CTT TTA AGC GTC AGT TTT GAT TTT AAC AAA ATG 964
Gly Ser Leu Lys Thr Leu Leu Ser Val Ser Phe Asp Phe Asn Lys Met
300 305 310

TTT AAC GCG CTC AAA CAA GAT GAA AAA GCC TCC AGA ATG CTA CAA AAC 1012
Phe Asn Ala Leu Lys Gln Asp Glu Lys Ala Ser Arg Met Leu Gln Asn
315 320 325

TAC GCC ACT AAA TTC CCT GAT TTT TAC GCG CGC ATT GTA GAG CTT TCT 1060
Tyr Ala Thr Lys Phe Pro Asp Phe Tyr Ala Arg Ile Val Glu Leu Ser
330 335 340

AAA TAC GAT CTA GGG GGC GCG AAT TTA TTG GAT TTT TTT TGC ATT TTA 1108
Lys Tyr Asp Leu Gly Gly Ala Asn Leu Leu Asp Phe Phe Cys Ile Leu
345 350 355

GGG TTT GTT TTG GGC TAT AGC GAG GAT TTT TGC ACA CAG AGC GTT ATT 1156
Gly Phe Val Leu Gly Tyr Ser Glu Asp Phe Cys Thr Gln Ser Val Ile
360 365 370 375

CCT TTG GCT AAA GAA TGC TTA CGC CCT AAA GGC CCT AGG ATT GAT TAT 1204
Pro Leu Ala Lys Glu Cys Leu Arg Pro Lys Gly Pro Arg Ile Asp Tyr
380 385 390
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 32... 1495 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775:

AAGCCTGAAT TTTACGCCCC TTTTAGAGCG A ATG GCA TGC AAT TTG CAA GCG 52
Met Ala Cys Asn Leu Gln Ala
1 5

CGT TTT TAT AGC GTT TAT AAG GAT AAT ACC ACT TCT TTC TAC CTC CAA 100
Arg Phe Tyr Ser Val Tyr Lys Asp Asn Thr Thr Ser Phe Tyr Leu Gln
10 15 20

GCG AGC GCT GAA ACC ACT TTA GAG TTC GCG CAA AAA CTC AGC GAA ATT 148
Ala Ser Ala Glu Thr Thr Leu Glu Phe Ala Gln Lys Leu Ser Glu Ile
25 30 35

CTG CCC TTT TCT TTA GAT TTT AGC TTT TTG TCT TTA AAG GAA ATC ACA 196
Leu Pro Phe Ser Leu Asp Phe Ser Phe Leu Ser Leu Lys Glu Ile Thr
40 45 50 55

GAG CCT TTA GAT GAA AAT CTT TTC CAA ACA GCA AGC CTT TCA AAG CCC 244
Glu Pro Leu Asp Glu Asn Leu Phe Gln Thr Ala Ser Leu Ser Lys Pro
60 65 70

CTT TTT ATG AAC GCT AAA GAG CAT CAA GAT TTT TTA GAC AAA AAT TCA 292
Leu Phe Met Asn Ala Lys Glu His Gln Asp Phe Leu Asp Lys Asn Ser
75 80 85

TCT TTG TAT GCC GAT ACT CTG GGC TTG ATT AAA AAC ACC GCT TTT AAG 340
Ser Leu Tyr Ala Asp Thr Leu Gly Leu Ile Lys Asn Thr Ala Phe Lys
90 95 100

GGG GAT ATA ATC CAT AGC CCT AAA GAG CTT ATA GAT TGC TTA ACC CAA 388
Gly Asp Ile Ile His Ser Pro Lys Glu Leu Ile Asp Cys Leu Thr Gln
105 110 115

TTA AAA GGC ATG CTC AAA ACG CAA GAT TTT ATC CCT ATT TTC ACT TCT 436
Leu Lys Gly Met Leu Lys Thr Gln Asp Phe Ile Pro Ile Phe Thr Ser
120 125 130 135

AGA GAG GCG TTA TCC CTT TCT TTA AAA AAT CCC TCT CCA AGC GTT ATT 484
Arg Glu Ala Leu Ser Leu Ser Leu Lys Asn Pro Ser Pro Ser Val Ile
140 145 150

TTT AGC GAT CTT TCT AGC GTT TTG AGC TGC ACT AAA TTG CCT TTA GAG 532
Phe Ser Asp Leu Ser Ser Val Leu Ser Cys Thr Lys Leu Pro Leu Glu
155 160 165






ATG ATT AAT GGC TAT GGT TAT ACC AAA GAA TGAGTCAAAA AATCCTAATT CTA 733
Met Ile Asn Gly Tyr Gly Tyr Thr Lys Glu
215 220

GGTATTGGCA ATATCCTTTT TGGCGATGAA GGGATTGGGG TGCATTTAGC CCACTACCTC 793

AAAAAAAATT TTTCTTT 810

(2) INFORMATION FOR SEQ ID NO: 774: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774: 

Met Asp Lys Met Asn Lys Val Val Leu His Lys Glu Tyr Ser Gly Phe
1 5 10 15

Val Arg Phe Phe His Trp Val Arg Ala Leu Ser Ile Phe Ala Leu Ile
20 25 30

Ala Thr Gly Phe Tyr Ile Ala Tyr Pro Phe Leu Gln Pro Asn Ser Ser
35 40 45

Phe Tyr Lys Gly Val Tyr Leu Leu Gln Ala Tyr Val Arg Ser Phe His
50 55 60

Val Met Phe Gly Phe Leu Leu Ile Ser Ala Leu Ile Phe Arg Thr Tyr
65 70 75 80

Leu Phe Phe Thr Lys Glu Ser Leu Met Glu Arg Lys Ser Phe Ser Gln
85 90 95

Leu Leu Ser Pro Lys Ala Trp Ile Asp Gln Met Lys Ala Tyr Phe Leu
100 105 110

Ile Ser Gly Lys Pro His Thr Lys Gly Ala Tyr Asn Pro Ile Gln Leu
115 120 125

Val Ala Tyr Ser Thr Leu Ile Val Leu Ile Val Leu Met Ser Leu Ser
130 135 140

Gly Met Val Leu Tyr Tyr Asn Val Tyr His Ala Gly Leu Gly Ala Phe
145 150 155 160

Leu Gly Ser Ala Phe Lys Trp Phe Glu Thr Leu Cys Gly Gly Leu Ala
165 170 175

Asn Val Arg Phe Ile His His Leu Ala Thr Trp Gly Phe Ile Leu Phe
180 185 190

Val Pro Val His Val Tyr Met Val Phe Phe His Ser Ile Arg Tyr Glu
195 200 205

Ser Ser Gly Ala Asp Ser Met Ile Asn Gly Tyr Gly Tyr Thr Lys Glu
210 215 220

(2) INFORMATION FOR SEQ ID NO: 775: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1543 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
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TAGAGCCTAA TTTCGCTAAA TTCTAAAAAG GGTTACGC ATG GAT AAA ATG AAT AAG 5 6 

Met Asp Lys Met Asn Lys 
1 5 

GTC GTT TTA CAC AAA GAA TAT TCC GOT TTT GTG CGC TTT TTC CAT TGG 104 
Val Val Leu His Lys Glu Tyr Ser Gly Phe Val Arg Phe Phe His Trp 
10 15 20 

GTT AGG OCT TTG AGT ATT TTC GCT TTA ATC GCT ACA GGG TTT TAC ATC 152 
Val Arg Ala Leu Ser lie Phe Ala Leu lie Ala Thr Gly Phe Tyr lie 
25 30 35 

GCT TAC CCT TTT TTG CAG CCT AAT TCC AGC TTT TAT AAA GGG GTG TAT 2 00 

Ala Tyr Pro Phe Leu Gin Pro Asn Ser Ser Phe Tyr Lys Gly Val Tyr 
40 45 50 

CTT TTA CAA GCT TAT GTG CGT TOT TTT CAT GTC ATG TTT GGG TTT TTG 24 8 

Leu Leu Gin Ala Tyr Val Arg Ser Phe His Val Met Phe Gly Phe Leu 
55 60 65 70 

CTC ATT AGC GCA TTA ATC TTT AGA ACC TAT CTT TTT TTC ACT AAA GAA 2 96 

Leu lie Ser Ala Leu lie Phe Arg Thr Tyr Leu Phe Phe Thr Lys Glu 
75 80 85 

AGC TTG ATG GAA CGC AAG AGT TTT AGC CAA CTT TTA AGC CCA AAA GCC 344 
Ser Leu Met Glu Arg Lys Ser Phe Ser Gin Leu Leu Ser Pro Lys Ala 
90 95 100 

TGG ATT GAT CAG ATG AAA GCG TAT TTT CTT ATC AGC GGC AAA CCC CAC 3 92 

Trp lie Asp Gin Met Lys Ala Tyr Phe Leu lie Ser Gly Lys Pro His 
105 110 115 

ACT AAA GGA GCG TAT AAC CCT ATC CAA CTC GTG GCT TAT TCC ACT TTG 44 0 

Thr Lys Gly Ala Tyr Asn Pro lie Gin Leu Val Ala Tyr Ser Thr Leu 
120 125 130 

ATT GTT TTG ATC GTG TTG ATG AGT TTG AGC GGG ATG GTG CTG TAT TAT 4 88 

lie Val Leu lie Val Leu Met Ser Leu Ser Gly Met Val Leu Tyr Tyr 
135 140 145 150 

AAT GTC TAT CAT GCG GGG CTT GGA GCG TTT TTA GGA AGC GCT TTT AAG 53 6 

Asn Val Tyr His Ala Gly Leu Gly Ala Phe Leu Gly Ser Ala Phe Lys 
155 160 165 

TGG TTT GAA ACG CTT TGT GGA GGG TTA GCG AAT GTT CGT TTC ATC CAC 5 84 

Trp Phe Glu Thr Leu Cys Gly Gly Leu Ala Asn Val Arg Phe lie His 
170 175 180 

CAC TTA GCG ACT TGG GGG TTT ATT TTG TTT GTC CCT GTG CAT GTT TAT 63 2 

His Leu Ala Thr Trp Gly Phe lie Leu Phe Val Pro Val His Val Tyr 
185 190 195 

ATG GTG TTT TTC CAT TCT ATC AGG TAT GAA AGT TCG GGG GCG GAT TCT 6 80 

Met Val Phe Phe His Ser Ile Arg Tyr Glu Ser Ser Gly Ala Asp Ser
200 205 210
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Ala Asp Pro Thr lie Asp Ser lie lie Phe Asp Tyr lie Asn Leu Glu 

100 105 110 

Tyr His Glu Thr Ile Met Val Ala Ser Gly Phe Gln Ala Glu Lys Ser
115 120 125

Leu His Asp Ala Ile Glu Lys His Lys Asn Asn Tyr Ile Leu Met Val
130 135 140

Glu Gly Gly Ile Pro Gln Gly Thr Glu Tyr Phe Leu Thr Gln Gly Pro
145 150 155 160

Asn Ala Glu Thr Gly Ala Glu Glu Cys Arg Lys Ala Ala Gln Tyr Ala
165 170 175

Ala Ala Ile Phe Ala Ile Gly Thr Cys Ser Ser Phe Gly Gly Val Gln
180 185 190

Ala Ala Tyr Pro Asn Pro Ser Asn Ala Gln Pro Leu His Lys Ile Ile
195 200 205

Asp Lys Pro Val Ile Asn Val Pro Gly Cys Pro Pro Ser Glu Lys Asn
210 215 220

Ile Val Gly Asn Val Leu Tyr Tyr Leu Met Phe Gly Ala Leu Pro Lys
225 230 235 240

Leu Asp Ala Tyr Asn Arg Pro Ser Trp Ala Tyr Gly Asn Arg Ile His
245 250 255

Asp Leu Cys Glu Arg Arg Gly His Phe Asp Ala Gly Glu Phe Val Glu
260 265 270

His Phe Gly Asp Glu Asn Ala Lys Arg Gly Phe Cys Leu Tyr Lys Met
275 280 285

Gly Cys Lys Gly Pro Tyr Thr Phe Asn Asn Cys Ser Lys Leu Arg Phe
290 295 300

Asn Ser His Thr Ser Trp Pro Ile Gly Ala Gly His Gly Cys Ile Gly
305 310 315 320

Cys Ser Glu Pro Asn Phe Trp Asp Thr Met Ser Pro Phe Glu Glu Pro
325 330 335

Leu Ala Asn Arg Ser Ile Lys Thr Ala Phe Asp Gly Leu Gly Ala Asp
340 345 350

Lys Val Ala Asp Lys Val Gly Thr Thr Leu Leu Ser Ala Thr Ala Ile
355 360 365

Gly Ile Val Ala His Ala Leu Leu Ser Lys Ala Ile Lys Asn Lys Glu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 773: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 39... 710 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773:
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GAT GAA AAC OCT AAA AGO GGC TTT TGT TTG TAT AAA ATG GGC TGT AAA 92 2 

Asp Glu Asn Ala Lys Arg Gly Phe Cys Leu Tyr Lys Met Gly Cys Lys 
280 285 290 

GGG CCT TAC ACT TTC AAC AAT TGC TCC AAA CTC CGC TTC AAT TCA CAC 970

Gly Pro Tyr Thr Phe Asn Asn Cys Ser Lys Leu Arg Phe Asn Ser His 
295 300 305 

ACC TCT TGG CCC ATA GGT GCA GGG CAT GGG TGT ATA GGG TGT TCT GAG 1018 
Thr Ser Trp Pro Ile Gly Ala Gly His Gly Cys Ile Gly Cys Ser Glu
310 315 320 

CCT AAT TTT TGG GAT ACG ATG AGT CCT TTT GAA GAG CCT TTA GCG AAT 106 6 

Pro Asn Phe Trp Asp Thr Met Ser Pro Phe Glu Glu Pro Leu Ala Asn 
325 330 335 

CGT TCC ATT AAA ACC GCT TTT GAC GGA TTA GGG GCT GAT AAA GTA GCG 1114 
Arg Ser Ile Lys Thr Ala Phe Asp Gly Leu Gly Ala Asp Lys Val Ala
340 345 350 355 

GAT AAA GTA GGC ACG ACT TTG CTG AGC GCA ACC GCT ATT GGC ATT GTT 116 2 

Asp Lys Val Gly Thr Thr Leu Leu Ser Ala Thr Ala Ile Gly Ile Val
360 365 370 

GCG CAT GCG CTC CTT TCT AAA GCG ATC AAA AAC AAA GAG TAAGGGATTA AC 1213 
Ala His Ala Leu Leu Ser Lys Ala Ile Lys Asn Lys Glu
375 380 

ATGTCAAAAA AAATCGTAGT CGATCCTATC ACTAGGATTG AGGGGCATTT AAGGATTGAA 12 7 3 
GTGATCGTAG ATGATGATAA CGTGATCACT G 13 0 4 

(2) INFORMATION FOR SEQ ID NO: 772:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 384 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772: 

Met Phe Tyr Asp Glu Lys Lys Thr Tyr Gln Lys Ile Glu Glu Arg Leu
1 5 10 15

Asp Ile Val Arg Ser Phe Asn Ala His Asn Glu His Lys Asn Leu Gln
20 25 30

Asp Glu Phe Lys Gly Ala Gly Ile Ser Arg Arg Asp Leu Leu Lys Trp
35 40 45

Ala Gly Met Met Ser Thr Ala Leu Ala Leu Pro Ala Ser Phe Ala Pro
50 55 60

Leu Thr Leu Lys Ala Val Glu Val Ala Asn Arg Leu Pro Val Ile Trp
65 70 75 80

Leu His Met Ala Glu Cys Thr Gly Cys Ser Glu Ser Leu Leu Arg Ser
85 90 95




WO 98/43478



PCT/US98/06371 



ATG AGC ACA GCG TTA GCC TTG CCG GCT AGT TTT GCT CCC TTG ACT TTG 250

Met Ser Thr Ala Leu Ala Leu Pro Ala Ser Phe Ala Pro Leu Thr Leu 
55 60 65 

AAG GCG GTG GAA GTG GCT AAC AGA TTG CCC GTG ATT TGG TTG CAC ATG 2 98 

Lys Ala Val Glu Val Ala Asn Arg Leu Pro Val Ile Trp Leu His Met
70 75 80 

GCA GAA TGC ACC GGT TGT AGC GAA AGT TTG TTA AGG AGC GCA GAC CCC 34 6 

Ala Glu Cys Thr Gly Cys Ser Glu Ser Leu Leu Arg Ser Ala Asp Pro 
85 90 95 

ACC ATT GAT AGC ATT ATC TTT GAT TAC ATC AAC CTA GAA TAC CAT GAG 3 94 

Thr Ile Asp Ser Ile Ile Phe Asp Tyr Ile Asn Leu Glu Tyr His Glu
100 105 110 115 

ACC ATC ATG GTA GCG AGC GGT TTT CAA GCT GAA AAA AGC TTG CAT GAC 44 2 

Thr Ile Met Val Ala Ser Gly Phe Gln Ala Glu Lys Ser Leu His Asp
120 125 130 

GCC ATA GAA AAG CAT AAA AAC AAT TAC ATT TTA ATG GTA GAA GGG GGT 4 90 

Ala Ile Glu Lys His Lys Asn Asn Tyr Ile Leu Met Val Glu Gly Gly
135 140 145 

ATC CCC CAA GGC ACG GAA TAC TTC CTC ACT CAA GGC CCA AAC GCT GAA 538

Ile Pro Gln Gly Thr Glu Tyr Phe Leu Thr Gln Gly Pro Asn Ala Glu
150 155 160 

ACG GGA GCT GAA GAG TGT AGG AAA GCC GCT CAA TAC GCA GCC GCT ATT 586
Thr Gly Ala Glu Glu Cys Arg Lys Ala Ala Gln Tyr Ala Ala Ala Ile
165 170 175 

TTT GCC ATA GGC ACA TGC TCA AGT TTT GGG GGC GTT CAA GCG GCT TAC 63 4 

Phe Ala Ile Gly Thr Cys Ser Ser Phe Gly Gly Val Gln Ala Ala Tyr
180 185 190 195 

CCT AAC CCC TCT AAC GCG CAA CCC TTG CAT AAA ATC ATT GAT AAA CCC 6 82 

Pro Asn Pro Ser Asn Ala Gln Pro Leu His Lys Ile Ile Asp Lys Pro
200 205 210 

GTG ATC AAT GTT CCT GGT TGC CCG CCT AGT GAA AAA AAT ATC GTG GGT 73 0 

Val Ile Asn Val Pro Gly Cys Pro Pro Ser Glu Lys Asn Ile Val Gly
215 220 225 

AAT GTG CTT TAT TAC TTG ATG TTT GGG GCT CTC CCT AAA TTG GAT GCG 77 8 

Asn Val Leu Tyr Tyr Leu Met Phe Gly Ala Leu Pro Lys Leu Asp Ala 
230 235 240 

TAT AAC CGC CCC TCT TGG GCT TAT GGG AAC AGG ATC CAT GAT TTG TGC 82 6 

Tyr Asn Arg Pro Ser Trp Ala Tyr Gly Asn Arg Ile His Asp Leu Cys
245 250 255 

GAA AGG AGA GGG CAT TTT GAT GCG GGC GAA TTT GTG GAG CAT TTT GGC 8 74 

Glu Arg Arg Gly His Phe Asp Ala Gly Glu Phe Val Glu His Phe Gly 
260 265 270 275 




Asn Phe Asn Ala Gly Ala Met Gly Val Cys Met Asn Glu Gly Arg Ile
260 265 270

Ser Ser Ser Val Val Val Gly Ala Gly Thr Asp Ile Gly Gly Gly Ala
275 280 285

Ser Val Leu Gly Val Leu Ser Gly Gly Asn Asn Asn Pro Ile Ser Ile
290 295 300

Gly Lys Asn Cys Leu Leu Gly Ala Asn Ser Val Thr Gly Ile Ser Leu
305 310 315 320

Gly Asp Gly Cys Ile Val Asp Ala Gly Val Ala Ile Leu Ala Gly Ser
325 330 335

Val Ile Glu Ile Glu Glu Asn Glu Phe Lys Lys Leu Leu Glu Val Asn
340 345 350

Ser Ala Leu Glu Lys His Ala Asn Asn Leu Tyr Lys Gly Lys Glu Leu
355 360 365

Ser Gly Lys Asn Gly Val His Phe Arg Ser Asn Ser Gln Asn Gly Lys
370 375 380

Leu Ile Ala Phe Arg Ser Val Lys Lys Ile Glu Leu Asn Gln Asn Leu
385 390 395 400

His



(2) INFORMATION FOR SEQ ID NO: 771: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 50... 1201 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771: 

TTATTTTTAT TATGTTAAGA TAATGAAAAT TTCTAATTAA GGAGTGGTC ATG TTC TAC 5 8 

Met Phe Tyr 
1 

GAT GAA AAA AAG ACC TAT CAA AAG ATT GAA GAA CGC CTT GAT ATA GTC 10 6 

Asp Glu Lys Lys Thr Tyr Gln Lys Ile Glu Glu Arg Leu Asp Ile Val
5 10 15 

CGT TCG TTT AAC GCT CAC AAC GAG CAT AAA AAC TTG CAA GAC GAG TTT 154 
Arg Ser Phe Asn Ala His Asn Glu His Lys Asn Leu Gln Asp Glu Phe
20 25 30 35 

AAA GGG GCG GGC ATT TCT AGG CGC GAT TTA TTG AAG TGG GCG GGC ATG 2 02 

Lys Gly Ala Gly Ile Ser Arg Arg Asp Leu Leu Lys Trp Ala Gly Met
40 45 50 
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AAT GGC GTG CAT TTT CGT TCC AAT AGT CAG AAT GGC AAG CTG ATT GCT 1210 
Asn Gly Val His Phe Arg Ser Asn Ser Gln Asn Gly Lys Leu Ile Ala
375 380 385 

TTT AGG AGC GTG AAA AAA ATT GAG TTG AAT CAA AAC CTG CAT TAAGGATTA 12 61 
Phe Arg Ser Val Lys Lys Ile Glu Leu Asn Gln Asn Leu His
390 395 400 

AAAGAATGCT CAAAAAAAGT TTGTTATTGC TTGTTTTTTT AGTCTTACAG CTTAGCGGCG 1321 
CTGAAGAAAA CAATCAAGCC CCAAAAAAC 13 5 0 

(2) INFORMATION FOR SEQ ID NO: 770:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770:

Met Ile Asn Lys Phe Lys Asn Phe Val Ser Asn Tyr Gln Gln Ser Asn
1 5 10 15

His Tyr Lys Glu Pro Leu Gly Phe Gly Ile Ala Arg Val Asp Ile Ala
20 25 30

Pro Ile Ser Lys Lys Ile Leu Cys Ala Thr Tyr Pro Val Leu Asn Trp
35 40 45

Lys Asp Glu Asn Leu Gly Ser Tyr Ala Val Phe Cys Asn Ser Leu Ser
50 55 60

Lys Glu Lys Ile Leu Lys Glu Ser Ala Ser Glu Arg Val Ile Glu Ile
65 70 75 80

Asp Glu Ser Phe Val Leu Lys Ala Leu Asp Phe Tyr Thr Pro Phe Leu
85 90 95

Asn Glu Ala Tyr Ser Asn Lys Met Ala His Lys Asn Ile Gln Val Val
100 105 110

Leu Glu Leu Leu Lys Ala Leu Glu Glu Asn Arg Leu Lys Asn Ser Asp
115 120 125

Gly Glu Ser Leu Tyr Arg Leu Val Ile Leu Tyr Glu Asp Lys Pro Cys
130 135 140

Glu Ser Val Glu Ser Ala Tyr Met Lys Leu Leu Ala Leu Ser Leu Gly
145 150 155 160

Lys Ala Pro Leu Arg Ser Leu Asn Leu Glu Gly Ile Phe Asn Gln Leu
165 170 175

Ser Asn Ala Ala Trp Ser Gly Asn Lys Pro Tyr Glu Leu Glu Trp Leu
180 185 190

Arg Met Asn Glu Val Ala Leu Lys Met Arg Asp His Phe Pro Ser Ile
195 200 205

Asp Phe Ile Asp Lys Phe Pro Arg Tyr Leu Met Gln Leu Ile Pro Glu
210 215 220

Phe Asp Asn Ile Arg Leu Leu Asp Ser Ser Lys Thr Arg Phe Gly Ala
225 230 235 240

Tyr Leu Gly Thr Gly Gly Tyr Thr Gln Met Pro Gly Ala Ser Tyr Val
245 250 255
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GAG AGC GCG TAT ATG AAA CTT TTA GCG CTC TCT TTA GGT AAA GCC CCT 53 8 

Glu Ser Ala Tyr Met Lys Leu Leu Ala Leu Ser Leu Gly Lys Ala Pro 
150 155 160 

TTG AGG AGT TTG AAT TTA GAG GGT ATT TTT AAC CAG CTT TCT AAT GCG 58 6 

Leu Arg Ser Leu Asn Leu Glu Gly Ile Phe Asn Gln Leu Ser Asn Ala
165 170 175 

GCC TGG AGC GGT AAC AAG CCC TAT GAA TTA GAA TGG CTT AGA ATG AAC 63 4 

Ala Trp Ser Gly Asn Lys Pro Tyr Glu Leu Glu Trp Leu Arg Met Asn 
180 185 190 195 

GAA GTG GCT TTA AAA ATG CGA GAC CAT TTC CCT AGC ATT GAT TTC ATA 6 82 

Glu Val Ala Leu Lys Met Arg Asp His Phe Pro Ser Ile Asp Phe Ile
200 205 210 

GAT AAA TTC CCA CGC TAT TTG ATG CAA TTA ATC CCT GAG TTT GAT AAT 73 0 

Asp Lys Phe Pro Arg Tyr Leu Met Gln Leu Ile Pro Glu Phe Asp Asn
215 220 225 

ATC CGC TTA TTG GAT AGC TCA AAA ACG CGC TTT GGG GCG TAT TTA GGG 77 8 

Ile Arg Leu Leu Asp Ser Ser Lys Thr Arg Phe Gly Ala Tyr Leu Gly
230 235 240 

ACT GGA GGT TAT ACC CAA ATG CCT GGG GCT AGT TAT GTG AAT TTT AAC 82 6 

Thr Gly Gly Tyr Thr Gln Met Pro Gly Ala Ser Tyr Val Asn Phe Asn
245 250 255 

GCA GGG GCT ATG GGA GTG TGC ATG AAT GAG GGG CGT ATT TCT TCA TCG 874 
Ala Gly Ala Met Gly Val Cys Met Asn Glu Gly Arg Ile Ser Ser Ser
260 265 270 275 

GTG GTG GTT GGA GCA GGC ACT GAT ATT GGT GGG GGA GCG AGC GTG TTA 922 
Val Val Val Gly Ala Gly Thr Asp Ile Gly Gly Gly Ala Ser Val Leu
280 285 290 

GGC GTT TTA AGT GGA GGG AAT AAC AAC CCC ATT AGC ATC GGG AAA AAT 97 0 

Gly Val Leu Ser Gly Gly Asn Asn Asn Pro Ile Ser Ile Gly Lys Asn
295 300 305 

TGT TTG CTA GGG GCT AAT AGC GTT ACT GGA ATT AGT CTA GGC GAT GGC 1018 
Cys Leu Leu Gly Ala Asn Ser Val Thr Gly Ile Ser Leu Gly Asp Gly
310 315 320 

TGT ATC GTG GAT GCA GGC GTT GCG ATA CTA GCC GGG AGC GTG ATA GAA 106 6 

Cys Ile Val Asp Ala Gly Val Ala Ile Leu Ala Gly Ser Val Ile Glu
325 330 335 

ATT GAA GAA AAT GAG TTT AAA AAG CTT TTA GAA GTG AAT AGC GCT TTA 1114 
Ile Glu Glu Asn Glu Phe Lys Lys Leu Leu Glu Val Asn Ser Ala Leu
340 345 350 355 

GAA AAA CAT GCC AAC AAC CTT TAC AAA GGC AAA GAA CTT TCC GGA AAA 1162 
Glu Lys His Ala Asn Asn Leu Tyr Lys Gly Lys Glu Leu Ser Gly Lys 
360 365 370 
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(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 50... 1252 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769: 



AAGTGGAAAA TTTAGCTAAA GAAAGAGAAA AAAGTTTAAA GGATTAGGC ATG ATC AAT 58
Met Ile Asn
1

AAG TTT AAA AAT TTT GTG AGC AAC TAC CAG CAA TCT AAC CAC TAT AAA 106
Lys Phe Lys Asn Phe Val Ser Asn Tyr Gln Gln Ser Asn His Tyr Lys
5 10 15

GAG CCT TTA GGT TTT GGC ATT GCC AGA GTG GAT ATT GCC CCT ATT TCC 154
Glu Pro Leu Gly Phe Gly Ile Ala Arg Val Asp Ile Ala Pro Ile Ser
20 25 30 35

AAA AAG ATT TTA TGC GCC ACT TAC CCT GTT TTG AAT TGG AAA GAT GAA 202
Lys Lys Ile Leu Cys Ala Thr Tyr Pro Val Leu Asn Trp Lys Asp Glu
40 45 50

AAT TTA GGC TCT TAT GCG GTG TTT TGC AAC TCG CTT TCA AAA GAA AAA 250
Asn Leu Gly Ser Tyr Ala Val Phe Cys Asn Ser Leu Ser Lys Glu Lys
55 60 65

ATC CTA AAA GAG AGC GCG AGC GAG CGC GTT ATT GAG ATT GAT GAA AGT 298
Ile Leu Lys Glu Ser Ala Ser Glu Arg Val Ile Glu Ile Asp Glu Ser
70 75 80

TTT GTG TTA AAA GCG TTG GAT TTT TAT ACG CCC TTT TTG AAT GAA GCC 346
Phe Val Leu Lys Ala Leu Asp Phe Tyr Thr Pro Phe Leu Asn Glu Ala
85 90 95

TAT TCT AAT AAA ATG GCT CAT AAA AAC ATC CAA GTG GTT TTA GAG CTT 394
Tyr Ser Asn Lys Met Ala His Lys Asn Ile Gln Val Val Leu Glu Leu
100 105 110 115

TTA AAG GCT TTA GAA GAA AAT CGT TTG AAA AAT AGC GAT GGG GAG TCT 442
Leu Lys Ala Leu Glu Glu Asn Arg Leu Lys Asn Ser Asp Gly Glu Ser
120 125 130

CTT TAT CGC TTG GTG ATC TTG TAT GAA GAT AAG CCT TGC GAG AGC GTG 490
Leu Tyr Arg Leu Val Ile Leu Tyr Glu Asp Lys Pro Cys Glu Ser Val
135 140 145
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Leu Phe Tyr Asp Gin Leu Asn Thr Arg Lys Cys Leu Lys Ala Phe His 
210 215 220 

AAA TAC AGC AGA CGA TAC GCT CCT TTA AAA ACC CCT AAA GAG GTT TAAAA 722
Lys Tyr Ser Arg Arg Tyr Ala Pro Leu Lys Thr Pro Lys Glu Val 
225 230 235 

AGAGCGGGCT TTATGTTAGA ATAAGTCTTT TTATTCAAAG GAGATTGCAA TGAAT 77 7 

(2) INFORMATION FOR SEQ ID NO: 768:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 239 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768: 

Ser Tyr Pro Pro Pro Pro Asn Pro Thr Gln Glu Thr Gln Gln Asp Phe
1 5 10 15

Ile Ile Glu Ala Gln Gln Asp Leu Ile Ile Glu Thr Gln Gln Asp Pro
20 25 30

Lys Glu Leu Pro Glu Ser Cys Lys Ile Thr Pro Gln Lys Ile Ser Phe
35 40 45

Asn Gln Val Val Phe Lys Lys Ile Lys Arg Lys Leu Asn Arg Phe Ile
50 55 60

Gly Ser Ile Leu Ala Arg Thr Glu Val Tyr Lys Asn Leu Val Ala Lys
65 70 75 80

Tyr Asp Glu Leu Thr Gly Lys Tyr Glu Ser Leu Leu Ala Lys Glu Ala
85 90 95

Asn Ile Lys Glu Thr Phe Trp Glu Arg Arg Ala Asp Ser Glu Lys Glu
100 105 110

Ala Phe Phe Leu Glu His Phe Tyr Leu Thr Ser Val Tyr Val Ala Ser
115 120 125

Thr Ala Gly Tyr Tyr Ile Thr Pro Lys Gly Ala Lys Thr Phe Ile Glu
130 135 140

Ala Thr Glu Arg Phe Lys Ile Ile Glu Pro Val Asp Met Phe Ile Asn
145 150 155 160

Asn Pro Thr Tyr His Asp Val Ala Asn Phe Thr Tyr Leu Pro Cys Pro
165 170 175

Val Ser Leu Asn Lys His Ala Phe Asn Ser Thr Ile Gln Asn Ala Lys
180 185 190

Lys Pro Asp Ile Ser Leu Lys Pro Pro Arg Lys Ser Tyr Phe Asp Asn
195 200 205

Leu Phe Tyr Asp Gln Leu Asn Thr Arg Lys Cys Leu Lys Ala Phe His
210 215 220

Lys Tyr Ser Arg Arg Tyr Ala Pro Leu Lys Thr Pro Lys Glu Val
225 230 235

(2) INFORMATION FOR SEQ ID NO: 769:

(i) SEQUENCE CHARACTERISTICS: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 67: 

ACT TAC CCC CCC CCC CCC AAT CCC ACA CAA GAA ACG CAA CAA GAT TTT 4 8 

Ser Tyr Pro Pro Pro Pro Asn Pro Thr Gln Glu Thr Gln Gln Asp Phe
1 5 10 15

ATT ATT GAA GCA CAA CAA GAT TTG ATT ATT GAA ACG CAA CAA GAC CCC 96 
Ile Ile Glu Ala Gln Gln Asp Leu Ile Ile Glu Thr Gln Gln Asp Pro
20 25 30 

AAA GAA CTA CCT GAG TCT TGC AAA ATA ACG CCC CAA AAA ATC TCT TTT 144 
Lys Glu Leu Pro Glu Ser Cys Lys Ile Thr Pro Gln Lys Ile Ser Phe
35 40 45 

AAC CAA GTG GTT TTT AAA AAA ATT AAA AGA AAA CTC AAC CGC TTC ATT 192
Asn Gln Val Val Phe Lys Lys Ile Lys Arg Lys Leu Asn Arg Phe Ile
50 55 60 

GGA AGC ATT TTA GCT CGG ACA GAA GTG TAT AAG AAT CTC GTG GCA AAA 24 0 

Gly Ser Ile Leu Ala Arg Thr Glu Val Tyr Lys Asn Leu Val Ala Lys
65 70 75 80 

TAC GAT GAA CTC ACA GGA AAA TAC GAA TCA TTA TTG GCA AAA GAG GCA 28 8 

Tyr Asp Glu Leu Thr Gly Lys Tyr Glu Ser Leu Leu Ala Lys Glu Ala 
85 90 95 

AAC ATC AAA GAG ACC TTT TGG GAA AGG CGT GCT GAT AGC GAA AAA GAA 3 36 

Asn Ile Lys Glu Thr Phe Trp Glu Arg Arg Ala Asp Ser Glu Lys Glu
100 105 110 

GCC TTT TTT TTA GAG CAT TTT TAC CTC ACT AGC GTG TAT GTG GCT TCT 3 84 

Ala Phe Phe Leu Glu His Phe Tyr Leu Thr Ser Val Tyr Val Ala Ser 
115 120 125 

ACA GCA GGA TAC TAT ATC ACG CCT AAG GGC GCT AAA ACC TTT ATA GAA 4 32 

Thr Ala Gly Tyr Tyr Ile Thr Pro Lys Gly Ala Lys Thr Phe Ile Glu
130 135 140 

GCC ACG GAG CGT TTT AAA ATC ATA GAG CCG GTG GAT ATG TTC ATA AAC 48 0 

Ala Thr Glu Arg Phe Lys Ile Ile Glu Pro Val Asp Met Phe Ile Asn
145 150 155 160 

AAC CCC ACT TAC CAT GAT GTG GCT AAT TTT ACC TAT TTG CCT TGC CCT 52 8 

Asn Pro Thr Tyr His Asp Val Ala Asn Phe Thr Tyr Leu Pro Cys Pro 
165 170 175 

GTT TCT TTA AAC AAG CAT GCT TTC AAT AGC ACC ATT CAA AAT GCA AAA 57 6 

Val Ser Leu Asn Lys His Ala Phe Asn Ser Thr Ile Gln Asn Ala Lys
180 185 190 

AAG CCT GAC ATT TCA TTA AAA CCC CCT AGA AAA TCC TAT TTT GAT AAT 62 4 

Lys Pro Asp Ile Ser Leu Lys Pro Pro Arg Lys Ser Tyr Phe Asp Asn
195 200 205 

CTT TTT TAT GAT CAA TTA AAC ACT AGA AAG TGC TTA AAA GCC TTT CAC 672 
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{ 2 ) INFORMATION FOR SEQ ID NO : 7 6 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766: 



Met Lys Gln Leu Phe Leu Ile Ile Gly Ala Pro Gly Ser Gly Lys Thr
1 5 10 15

Thr Asp Ala Glu Leu Ile Ala Lys Asn Asn Ser Glu Thr Ile Ala His
20 25 30

Phe Ser Thr Gly Asp Leu Leu Arg Ala Glu Ser Ala Lys Lys Thr Glu
35 40 45

Arg Gly Leu Leu Ile Glu Lys Phe Thr Ser Gln Gly Glu Leu Val Pro
50 55 60

Leu Glu Ile Val Val Glu Thr Ile Leu Ser Ala Ile Lys Ser Ser Gly
65 70 75 80

Lys Gly Ile Ile Leu Ile Asp Gly Tyr Pro Arg Ser Val Glu Gln Met
85 90 95

Gln Ala Leu Asp Lys Glu Leu Asn Ala Gln Asn Glu Val Ile Leu Lys
100 105 110

Ser Val Ile Glu Val Glu Val Ser Glu Asn Thr Ala Lys Glu Arg Val
115 120 125

Leu Gly Arg Ser Arg Gly Ala Asp Asp Asn Glu Lys Val Phe His Asn
130 135 140

Arg Met Arg Val Phe Leu Asp Pro Leu Gly Glu Ile Gln Asn Phe Tyr
145 150 155 160

Lys Asn Lys Lys Val Tyr Lys Ala Ile Asp Gly Glu Arg Ser Ile Glu
165 170 175

Glu Ile Val Gly Glu Met Gln Glu Tyr Ile Leu Ser Phe Gly Asn
180 185 190



(2) INFORMATION FOR SEQ ID NO: 767: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...717 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765: 

ATAATTTAAA AGGATACGAT ATG AAA CAA CTA TTT TTG ATC ATT GGA GCC 5 0 

Met Lys Gln Leu Phe Leu Ile Ile Gly Ala
1 5 10

CCA GGG AGT GGT AAA ACC ACT GAT GCA GAG CTT ATC GCT AAA AAT AAC 9 8 

Pro Gly Ser Gly Lys Thr Thr Asp Ala Glu Leu Ile Ala Lys Asn Asn
15 20 25 

AGC GAA ACA ATC GCT CAT TTT TCT ACC GGG GAT TTA CTC AGG GCT GAG 14 6 

Ser Glu Thr Ile Ala His Phe Ser Thr Gly Asp Leu Leu Arg Ala Glu
30 35 40 

AGC GCT AAA AAG ACC GAG CGA GGC TTA TTG ATT GAA AAA TTC ACT TCT 194 
Ser Ala Lys Lys Thr Glu Arg Gly Leu Leu Ile Glu Lys Phe Thr Ser
45 50 55 

CAA GGC GAA TTA GTG CCT TTA GAA ATT GTG GTA GAA ACG ATC CTT TCA 24 2 

Gln Gly Glu Leu Val Pro Leu Glu Ile Val Val Glu Thr Ile Leu Ser
60 65 70 

GCG ATT AAA AGC TCT GGT AAA GGG ATC ATT TTA ATT GAT GGT TAT CCT 2 90 

Ala Ile Lys Ser Ser Gly Lys Gly Ile Ile Leu Ile Asp Gly Tyr Pro
75 80 85 90 

AGG AGC GTG GAA CAA ATG CAG GCT TTG GAT AAG GAA TTG AAC GCT CAA 33 8 

Arg Ser Val Glu Gln Met Gln Ala Leu Asp Lys Glu Leu Asn Ala Gln
95 100 105 

AAC GAA GTG ATC TTA AAA AGC GTG ATT GAA GTA GAA GTG AGT GAA AAC 3 86 

Asn Glu Val Ile Leu Lys Ser Val Ile Glu Val Glu Val Ser Glu Asn
110 115 120 

ACT GCT AAA GAA AGG GTT TTA GGG CGC TCT AGG GGG GCT GAT GAT AAT 4 34 

Thr Ala Lys Glu Arg Val Leu Gly Arg Ser Arg Gly Ala Asp Asp Asn 
125 130 135 

GAA AAG GTG TTT CAT AAC CGC ATG CGG GTG TTT TTG GAT CCG TTG GGC 4 82 

Glu Lys Val Phe His Asn Arg Met Arg Val Phe Leu Asp Pro Leu Gly 
140 145 150 

GAG ATC CAA AAT TTT TAC AAG AAT AAG AAG GTG TAT AAA GCG ATC GAT 53 0 

Glu Ile Gln Asn Phe Tyr Lys Asn Lys Lys Val Tyr Lys Ala Ile Asp
155 160 165 170 

GGG GAG AGG AGC ATT GAA GAG ATT GTG GGC GAA ATG CAA GAG TAT ATC 57 8 

Gly Glu Arg Ser Ile Glu Glu Ile Val Gly Glu Met Gln Glu Tyr Ile
175 180 185 

TTG TCT TTC GGT AAT TAAAATGCAC TCTCAAGGAG AATAGCTGTG ATTTCTG 63 0 

Leu Ser Phe Gly Asn 
190 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:764: 

Met Asn Ile Phe Lys Ile Arg Glu Ile Ile His Tyr Asp Gly Glu Val
1 5 10 15

Thr Glu Ile Leu Gly Gly Ser Asp Gly Val Met Leu Gly Phe Leu Ser
20 25 30

Val Arg Gly Glu Ser Ile Pro Leu Val Asp Val Lys Arg Trp Leu His
35 40 45

Tyr Asn Ala Asn Asp Pro Ser Arg Asp Leu Lys Glu Cys Ser Val Lys
50 55 60

Asp Asp His Asn Leu Val Ile Val Cys His Phe Ser Asn His Ser Ile
65 70 75 80

Ala Leu Lys Val Leu Lys Ile Glu Arg Ile Ile His Lys Asn Trp Thr
85 90 95

Glu Ile Ser Ala Gly Asp Lys Gln Gly Ile Asn Glu Glu Gly Lys Leu
100 105 110

Ser Ala Ile Thr Arg Phe Asp Glu Glu Arg Val Val Gln Ile Leu Asp
115 120 125

Val Glu Lys Met Ile Ser Asp Val Phe Pro Ser Leu Lys Asp Leu Asp
130 135 140

Asp Leu Thr Leu Arg Cys Ile Glu Ala Ile Gln Ser Gln Lys Leu Ile
145 150 155 160

Leu Ile Ala Glu Asp Ser Leu Ser Ala Leu Lys Thr Leu Glu Lys Ile
165 170 175

Val Gln Thr Leu Glu Leu Arg Tyr Leu Ala Phe Pro Asn Gly Arg Glu
180 185 190

Leu Leu Asp Tyr Leu Tyr Glu Lys Glu His Tyr Gln Gln Val Gly Val
195 200 205

Val Ile Thr Asp Leu Glu Met Pro Asn Ile Ser Gly Phe Glu Val Leu
210 215 220

Lys Thr Ile Lys Ala Asp His Arg Thr Glu His Leu Pro Val Ile Ile
225 230 235 240

Asn Ser Ser Met Ser Ser Asp Ser Asn Arg Gln Leu Ala Gln Ser Leu
245 250 255

Glu Ala Asp Gly Phe Val Val Lys Ser Asn Ile Leu Glu Ile His Glu
260 265 270

Met Leu Lys Lys Thr Leu Ser
275

(2) INFORMATION FOR SEQ ID NO: 765:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 21... 593 
(D) OTHER INFORMATION: 
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CGT TTT GAT GAA GAA CGA GTG GTG CAG ATC TTA GAT GTG GAA AAA ATG 43 9 

Arg Phe Asp Glu Glu Arg Val Val Gln Ile Leu Asp Val Glu Lys Met
120 125 130 



ATT AGC GAT GTT TTC CCT AGC TTG AAA GAT TTA GAC GAT TTG ACT TTG 4 87 

Ile Ser Asp Val Phe Pro Ser Leu Lys Asp Leu Asp Asp Leu Thr Leu
135 140 145 



CGT TGC ATA GAA GCC ATT CAA AGC CAA AAA CTC ATT TTA ATC GCT GAA 53 5 

Arg Cys Ile Glu Ala Ile Gln Ser Gln Lys Leu Ile Leu Ile Ala Glu
150 155 160 



GAC TCC CTA AGC GCT CTT AAA ACC TTA GAA AAG ATC GTT CAA ACT TTA 5 83 

Asp Ser Leu Ser Ala Leu Lys Thr Leu Glu Lys Ile Val Gln Thr Leu
165 170 175 180 



GAA TTG CGT TAT TTA GCT TTT CCA AAC GGG AGG GAA TTG TTG GAT TAT 631 
Glu Leu Arg Tyr Leu Ala Phe Pro Asn Gly Arg Glu Leu Leu Asp Tyr 
185 190 195 



TTG TAT GAA AAA GAA CAT TAC CAA CAA GTT GGC GTG GTC ATT ACG GAT 67 9 

Leu Tyr Glu Lys Glu His Tyr Gln Gln Val Gly Val Val Ile Thr Asp
200 205 210 



TTA GAA ATG CCT AAC ATT TCA GGG TTT GAA GTG TTA AAA ACC ATT AAA 72 7 

Leu Glu Met Pro Asn Ile Ser Gly Phe Glu Val Leu Lys Thr Ile Lys
215 220 225 



GCT GAT CAT AGA ACT GAG CAT CTT CCT GTG ATT ATC AAT TCG TCC ATG 77 5 

Ala Asp His Arg Thr Glu His Leu Pro Val Ile Ile Asn Ser Ser Met
230 235 240 



AGC AGC GAT TCT AAC CGC CAG TTA GCC CAA TCT TTA GAA GCG GAT GGT 82 3 

Ser Ser Asp Ser Asn Arg Gln Leu Ala Gln Ser Leu Glu Ala Asp Gly
245 250 255 260 



TTT GTG GTA AAA TCT AAC ATT CTT GAA ATC CAT GAA ATG CTT AAA AAA 871 
Phe Val Val Lys Ser Asn Ile Leu Glu Ile His Glu Met Leu Lys Lys
265 270 275 



ACG CTT TCA TAAATTTAAT TTTTGTTTTA ATTTAAAGGG ATAAAACATG CGAAGTCAT 92 9 
Thr Leu Ser 



TTTTGCAC 93 7 

(2) INFORMATION FOR SEQ ID NO:764: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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290 295 300 

Ile Asn Ser Arg Leu
305 

(2) INFORMATION FOR SEQ ID NO: 763:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 937 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 44... 880 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763:

GTGCTTTAGA TTAGATGCAG AAAAAGACGC CCAACTTTAT GGC ATG AAT ATT TTT 55 

Met Asn Ile Phe
1 

AAG ATC CGA GAA ATT ATC CAT TAT GAC GGG GAG GTT ACA GAG ATT CTT 10 3 

Lys Ile Arg Glu Ile Ile His Tyr Asp Gly Glu Val Thr Glu Ile Leu
5 10 15 20 

GGG GGG AGC GAT GGC GTG ATG CTC GGG TTT CTT AGC GTT AGG GGC GAG 151 
Gly Gly Ser Asp Gly Val Met Leu Gly Phe Leu Ser Val Arg Gly Glu 
25 30 35 

TCT ATC CCT TTA GTG GAT GTG AAA AGG TGG TTG CAT TAT AAC GCT AAT 199
Ser Ile Pro Leu Val Asp Val Lys Arg Trp Leu His Tyr Asn Ala Asn
40 45 50 

GAT CCG AGC CGT GAT CTA AAA GAA TGC AGC GTT AAA GAT GAC CAT AAT 24 7 

Asp Pro Ser Arg Asp Leu Lys Glu Cys Ser Val Lys Asp Asp His Asn 
55 60 65 

TTG GTG ATT GTG TGC CAT TTT TCT AAC CAT TCC ATC GCT CTA AAG GTT 295
Leu Val Ile Val Cys His Phe Ser Asn His Ser Ile Ala Leu Lys Val
70 75 80 

TTA AAA ATT GAA AGG ATC ATC CAT AAA AAT TGG ACT GAG ATT AGC GCT 34 3 

Leu Lys Ile Glu Arg Ile Ile His Lys Asn Trp Thr Glu Ile Ser Ala
85 90 95 100 

GGG GAC AAA CAA GGC ATT AAT GAA GAG GGT AAG CTT AGC GCT ATC ACT 3 91 

Gly Asp Lys Gln Gly Ile Asn Glu Glu Gly Lys Leu Ser Ala Ile Thr
105 110 115 
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AAA TNG CTT GCG CTT ATA AAT TCC CGG CTT TAGAAAAACA CACCAAAATT GTA 96 8 
Lys Xaa Leu Ala Leu Ile Asn Ser Arg Leu
300 305 



GGAGTCATTA ACCAAGTGGG GCGCASSGGG CGATCACAC 1007 
(2) INFORMATION FOR SEQ ID NO: 762: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762: 



Met Ile Lys Ser Gln Lys Glu Tyr Leu Glu Arg Ile Ala Tyr Leu Asn
1 5 10 15

Thr Leu Ser His His Tyr Tyr Asn Leu Asp Glu Pro Ile Val Ser Asp
20 25 30

Ala Ile Tyr Asp Glu Leu Tyr Gln Glu Leu Lys Ala Tyr Glu Glu Lys
35 40 45

Asn Pro Asn Gly Ile Gln Ala Asn Ser Pro Thr Gln Lys Val Gly Ala
50 55 60

Thr Thr Thr Asn Ser Phe Asn Lys Asn Pro His Leu Met Arg Met Trp
65 70 75 80

Ser Leu Asp Asp Val Phe Asn Gln Ser Glu Leu Gln Ala Trp Leu Gln
85 90 95

Arg Ile Leu Lys Ala Tyr Pro Ser Ala Ser Phe Val Cys Ser Pro Lys
100 105 110

Leu Asp Gly Val Ser Leu Asn Leu Leu Tyr Gln His Gly Lys Leu Val
115 120 125

Lys Ala Thr Thr Arg Gly Asn Gly Leu Glu Gly Glu Leu Val Ser Ala
130 135 140

Asn Ala Lys His Ile Ala Asn Ile Pro His Ala Ile Ala Tyr Asn Gly
145 150 155 160

Glu Ile Glu Ile Arg Gly Glu Val Ile Ile Ser Lys Lys Asp Phe Asp
165 170 175

Ala Leu Asn Gln Glu Arg Leu Asn Ala Asn Glu Pro Leu Phe Ala Asn
180 185 190

Pro Arg Asn Ala Ala Ser Gly Ser Leu Arg Gln Leu Asp Ser Glu Ile
195 200 205

Thr Lys Lys Arg Lys Leu Gln Phe Ile Pro Trp Gly Val Gly Lys His
210 215 220

Ser Leu Asn Phe Leu Ser Phe Lys Glu Cys Leu Asp Phe Ile Val Ser
225 230 235 240

Leu Gly Phe Ser Ala Ile Gln Tyr Leu Ser Leu Asn Lys Asn His Gln
245 250 255

Glu Ile Glu Asp Asn Tyr His Thr Leu Ile Arg Glu Arg Glu Gly Phe
260 265 270

Phe Ala Leu Leu Asp Gly Met Val Ile Val Val Asn Glu Leu Asn Ile
275 280 285

Gln Lys Glu Leu Gly Tyr Thr Gln Lys Ser Pro Lys Xaa Leu Ala Leu
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TTA ATG CGG ATG TGG AGC TTA GAT GAT GTG TTC AAT CAA AGO GAA TTG 2 91 

Leu Met Arg Met Trp Ser Leu Asp Asp Val Phe Asn Gln Ser Glu Leu
80 85 90 

CAA GCG TGG TTG CAA CGC ATT TTA AAA GCC TAT CCT AGT GCT TCG TTC 33 9 

Gln Ala Trp Leu Gln Arg Ile Leu Lys Ala Tyr Pro Ser Ala Ser Phe
95 100 105 

GTG TGT TCG CCC AAA CTT GAT GGG GTT TCG CTC AAT CTT TTG TAT CAA 3 87 

Val Cys Ser Pro Lys Leu Asp Gly Val Ser Leu Asn Leu Leu Tyr Gln
110 115 120 

CAT GGC AAG CTA GTG AAG GCG ACC ACT AGG GGC AAC GGC TTA GAA GGA 435 
His Gly Lys Leu Val Lys Ala Thr Thr Arg Gly Asn Gly Leu Glu Gly 
125 130 135 

GAA TTA GTT AGC GCA AAC GCT AAA CAC ATC GCT AAT ATC CCC CAC GCT 4 83 

Glu Leu Val Ser Ala Asn Ala Lys His Ile Ala Asn Ile Pro His Ala
140 145 150 155 

ATC GCT TAT AAT GGA GAA ATA GAA ATC AGG GGC GAA GTG ATC ATT TCT 531 
Ile Ala Tyr Asn Gly Glu Ile Glu Ile Arg Gly Glu Val Ile Ile Ser
160 165 170 

AAA AAG GAT TTT GAC GCT TTG AAT CAA GAG CGC TTA AAC GCT AAT GAA 57 9 

Lys Lys Asp Phe Asp Ala Leu Asn Gln Glu Arg Leu Asn Ala Asn Glu
175 180 185 

CCC CTA TTC GCT AAC CCC AGA AAC GCC GCA TCA GGG AGT TTG AGG CAA 62 7 

Pro Leu Phe Ala Asn Pro Arg Asn Ala Ala Ser Gly Ser Leu Arg Gln
190 195 200 

CTT GAT AGC GAA ATC ACT AAA AAG CGT AAA TTG CAA TTC ATT CCT TGG 67 5 

Leu Asp Ser Glu Ile Thr Lys Lys Arg Lys Leu Gln Phe Ile Pro Trp
205 210 215 

GGC GTG GGC AAG CAT TCT TTA AAT TTT TTA AGC TTT AAG GAG TGT TTG 72 3 

Gly Val Gly Lys His Ser Leu Asn Phe Leu Ser Phe Lys Glu Cys Leu 
220 225 230 235 

GAT TTT ATC GTC TCG TTA GGT TTT AGC GCC ATT CAA TAC TTA AGC CTA 771 
Asp Phe Ile Val Ser Leu Gly Phe Ser Ala Ile Gln Tyr Leu Ser Leu
240 245 250 

AAC AAA AAC CAC CAA GAA ATA GAA GAC AAT TAC CAC ACC CTA ATT AGA 819 
Asn Lys Asn His Gln Glu Ile Glu Asp Asn Tyr His Thr Leu Ile Arg
255 260 265 

GAA AGG GAG GGC TTT TTT GCC CTT TTA GAC GGC ATG GTG ATC GTT GTG 8 67 

Glu Arg Glu Gly Phe Phe Ala Leu Leu Asp Gly Met Val Ile Val Val
270 275 280 

AAT GAA TTA AAT ATT CAA AAG GAG CTA GGC TAC ACG CAA AAA TCC CCT 915 
Asn Glu Leu Asn lie Gin Lys Glu Leu Gly Tyr Thr Gin Lys Ser Pro 
285 290 295 
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50 55 60

Asn Glu Glu Arg Lys Lys Gly Leu Asn Ile Gln Glu Ala Ile Leu Phe
65 70 75 80

Ala Gly Lys Thr Arg Leu Arg Pro Ile Leu Met Thr Thr Ile Ala Met
85 90 95

Val Cys Gly Met Leu Pro Leu Ala Leu Ala Ser Gly Asp Gly Ala Ala
100 105 110

Met Lys Ser Pro Ile Gly Ile Ala Met Ser Gly Gly Leu Met Ile Ser
115 120 125

Met Val Leu Ser Leu Leu Ile Val Pro Val Phe Tyr Arg Leu Leu Ala
130 135 140

Pro Ile Asp Asp Lys Ile Lys Arg Phe Tyr Gln Asn Gln Lys Thr Leu
145 150 155 160

Glu



(2) INFORMATION FOR SEQ ID NO: 761: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 19...945
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761: 

TAAAAGGTTT TTACAAAC ATG ATA AAA AGC CAA AAA GAA TAT TTA GAA AGA 51
Met Ile Lys Ser Gln Lys Glu Tyr Leu Glu Arg
1 5 10

ATT GCA TAT TTA AAC ACC CTA TCG CAC CAT TAT TAC AAC CTT GAT GAA 99
Ile Ala Tyr Leu Asn Thr Leu Ser His His Tyr Tyr Asn Leu Asp Glu
15 20 25

CCC ATC GTA AGC GAT GCG ATC TAT GAT GAA CTT TAC CAA GAA TTG AAA 147
Pro Ile Val Ser Asp Ala Ile Tyr Asp Glu Leu Tyr Gln Glu Leu Lys
30 35 40

GCT TAT GAA GAA AAA AAC CCT AAT GGC ATT CAA GCT AAT TCC CCT ACC 195
Ala Tyr Glu Glu Lys Asn Pro Asn Gly Ile Gln Ala Asn Ser Pro Thr
45 50 55

CAA AAA GTG GGG GCT ACT ACC ACC AAT TCG TTC AAT AAA AAC CCC CAT 243
Gln Lys Val Gly Ala Thr Thr Thr Asn Ser Phe Asn Lys Asn Pro His
60 65 70 75
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35 40 45 

CTC ATT GGT ATG GTG GGT AAA AAC GCC ACG CTT TTA ATT GAT GTG GCG 253
Leu Ile Gly Met Val Gly Lys Asn Ala Thr Leu Leu Ile Asp Val Ala
50 55 60

AAT GAA GAG CGT AAA AAA GGT TTG AAT ATC CAA GAG GCC ATT TTA TTT 301
Asn Glu Glu Arg Lys Lys Gly Leu Asn Ile Gln Glu Ala Ile Leu Phe
65 70 75 80

GCC GGC AAA ACC CGT CTA AGA CCG ATT TTA ATG ACG ACC ATT GCG ATG 349
Ala Gly Lys Thr Arg Leu Arg Pro Ile Leu Met Thr Thr Ile Ala Met
85 90 95

GTT TGC GGG ATG CTG CCT TTA GCG TTG GCG AGT GGG GAT GGA GCG GCG 397
Val Cys Gly Met Leu Pro Leu Ala Leu Ala Ser Gly Asp Gly Ala Ala
100 105 110

ATG AAA TCC CCT ATA GGG ATT GCG ATG AGT GGG GGC TTG ATG ATT TCT 445
Met Lys Ser Pro Ile Gly Ile Ala Met Ser Gly Gly Leu Met Ile Ser
115 120 125

ATG GTG TTA AGC TTA CTC ATT GTG CCG GTG TTT TAT CGT TTG CTC GCT 493
Met Val Leu Ser Leu Leu Ile Val Pro Val Phe Tyr Arg Leu Leu Ala
130 135 140

CCC ATA GAC GAC AAA ATC AAG CGG TTT TAT CAA AAC CAA AAA ACT TTA 541
Pro Ile Asp Asp Lys Ile Lys Arg Phe Tyr Gln Asn Gln Lys Thr Leu
145 150 155 160

GAA TGAAAAAAAT TGCTTTCATT TTGGCTTTAT GGGTGGGCTT GTTAGGGGCG TTTGAG 600
Glu

CCTAAAAAAA GTCATATTTA TTTTGGGGCT 630

(2) INFORMATION FOR SEQ ID NO:760: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760: 

Met Ile Leu Ala Ala Leu Tyr Glu Ser Ile Leu Glu Pro Phe Ile Ile
1 5 10 15

Met Val Thr Met Pro Leu Ser Phe Ser Gly Ala Phe Phe Ala Leu Gly
20 25 30

Leu Val His Gln Pro Leu Ser Met Phe Ser Met Ile Gly Leu Ile Leu
35 40 45

Leu Ile Gly Met Val Gly Lys Asn Ala Thr Leu Leu Ile Asp Val Ala






Val Thr Glu Gly Ser Val Val Lys Lys Gly Asp Val Leu Leu Leu Leu
50 55 60

Tyr Asn Gln Asp Lys Gln Ala Gln Ser Asp Ser Thr Glu Gln Gln Leu
65 70 75 80

Ile Phe Ala Lys Lys Gln Tyr Gln Arg Tyr Ser Lys Ile Gly Gly Ala
85 90 95

Val Asp Lys Asn Thr Leu Glu Gly Tyr Glu Phe Thr Tyr Arg Arg Leu
100 105 110

Glu Ser Asp Tyr Ala Tyr Ser Ile Ala Val Leu Asn Lys Thr Ile Leu
115 120 125

Arg Ala Pro Phe Asp Gly Val Ile Ala Ser Lys Asn Ile Gln Val Gly
130 135 140

Glu Gly Val Ser Ala Asn Asn Thr Val Leu Leu Arg Leu Val Ser His
145 150 155 160

Ala Arg Lys Leu Val Ile Glu Phe Asp Ser Lys Tyr Ile Asn Ala Val
165 170 175

Lys Val Gly Asp Thr Tyr Thr Tyr Ser Ile Asp Gly Asp Ser Asn Gln
180 185 190

His Glu Ala Lys Ile Thr Lys Ile Tyr Pro Thr Val Asp Glu Asn Thr
195 200 205

Arg Lys Val Ser Ala Glu Ala Leu Leu Ser Lys Pro Met Ala Val Gly
210 215 220

Leu Phe Gly Asp Gly Phe Ile Gln Thr Lys
225 230



(2) INFORMATION FOR SEQ ID NO: 759:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 62...544
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759:

ACGCCAAAGA GAGCAACGGG GAGTTTTTAG TCGCTTTAGC GNAGCGNTTG TGCTGATTTA 60

T ATG ATT TTA GCG GCG TTG TAT GAG TCC ATT TTA GAG CCT TTT ATC ATC 109
Met Ile Leu Ala Ala Leu Tyr Glu Ser Ile Leu Glu Pro Phe Ile Ile
1 5 10 15

ATG GTT ACC ATG CCT TTA AGT TTT TCA GGG GCG TTT TTT GCT CTA GGT 157
Met Val Thr Met Pro Leu Ser Phe Ser Gly Ala Phe Phe Ala Leu Gly
20 25 30

TTA GTC CAT CAG CCT TTG AGC ATG TTC TCT ATG ATA GGC TTG ATT TTG 205
Leu Val His Gln Pro Leu Ser Met Phe Ser Met Ile Gly Leu Ile Leu
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TAT GAG TTC ACT TAC AGG CGC TTG GAG TCT GAT TAC GCT TAT TCT ATT 390
Tyr Glu Phe Thr Tyr Arg Arg Leu Glu Ser Asp Tyr Ala Tyr Ser Ile
105 110 115 120

GCG GTA TTG AAT AAA ACC ATT TTA AGA GCC CCT TTT GAT GGC GTG ATA 438
Ala Val Leu Asn Lys Thr Ile Leu Arg Ala Pro Phe Asp Gly Val Ile
125 130 135

GCG ACT AAA AAC ATT CAA GTG GGC GAA GGG GTG AGC GCG AAT AAC ACG 486
Ala Ser Lys Asn Ile Gln Val Gly Glu Gly Val Ser Ala Asn Asn Thr
140 145 150

GTG TTA TTG AGA TTA GTC AGC CAT GCT AGG AAA TTA GTT ATT GAA TTT 534
Val Leu Leu Arg Leu Val Ser His Ala Arg Lys Leu Val Ile Glu Phe
155 160 165

GAT TCT AAA TAT ATT AAT GCG GTC AAA GTA GGG GAC ACT TAC ACC TAT 582
Asp Ser Lys Tyr Ile Asn Ala Val Lys Val Gly Asp Thr Tyr Thr Tyr
170 175 180

TCT ATA GAC GGG GAT TCT AAT CAG CAT GAA GCT AAA ATC ACT AAG ATT 630
Ser Ile Asp Gly Asp Ser Asn Gln His Glu Ala Lys Ile Thr Lys Ile
185 190 195 200

TAC CCC ACG GTT GAT GAA AAC ACC AGG AAA GTG AGC GCT GAA GCC CTT 678
Tyr Pro Thr Val Asp Glu Asn Thr Arg Lys Val Ser Ala Glu Ala Leu
205 210 215

TTA TCT AAG CCT ATG GCA GTG GGG CTT TTT GGC GAT GGG TTT ATC CAA 726
Leu Ser Lys Pro Met Ala Val Gly Leu Phe Gly Asp Gly Phe Ile Gln
220 225 230

ACG AAA TAATAGGATA TTTTGATGTA TAAAACAGCG ATTA 766
Thr Lys



(2) INFORMATION FOR SEQ ID NO: 758:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 234 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758:

Met Ile Arg Lys Ile Leu Ile Gly Leu Phe Leu Ser Phe Leu Ser Met
1 5 10 15

Glu Ala Gly Glu Lys Val Tyr Ala Ile Phe Asn Val Lys Ala Thr Gln
20 25 30

Asp Ser Lys Leu Thr Leu Asp Ser Thr Gly Ile Val Asp Ser Ile Lys
35 40 45
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290 295 300 

Met Gly Asn Gln Gly His Ile Phe Asn Leu Gly His Gly Met Leu Pro
305 310 315 320

Asp Leu Pro Arg Glu Asn Ala Lys Tyr Leu Val Gln Leu Val His Ala
325 330 335

Lys Thr Arg Arg
340

(2) INFORMATION FOR SEQ ID NO: 757: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 31... 732 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:757: 

TAGACGACTA TGTGCATTAA GGGAATGAAA ATG ATA CGA AAA ATT TTA ATA GGA 54
Met Ile Arg Lys Ile Leu Ile Gly
1 5

CTT TTT TTG AGT TTT TTG AGC ATG GAA GCT GGC GAA AAA GTG TAT GCG 102
Leu Phe Leu Ser Phe Leu Ser Met Glu Ala Gly Glu Lys Val Tyr Ala
10 15 20

ATT TTC AAT GTG AAA GCG ACA CAA GAT TCC ATUV CTC ACC TTA GAC AGC 150
Ile Phe Asn Val Lys Ala Thr Gln Asp Ser Lys Leu Thr Leu Asp Ser
25 30 35 40

ACA GGA ATT GTG GAT AGC ATT AAG GTT ACT GAG GGG AGC GTG GTC AAA 198
Thr Gly Ile Val Asp Ser Ile Lys Val Thr Glu Gly Ser Val Val Lys
45 50 55

AAG GGC GAT GTT TTG TTG CTT TTA TAT AAT CAA GAC AAA CAG GCT CAA 246
Lys Gly Asp Val Leu Leu Leu Leu Tyr Asn Gln Asp Lys Gln Ala Gln
60 65 70

AGC GAT TCC ACC GAA CAA CAA CTC ATT TTC GCT AAA AAG CAA TAC CAA 294
Ser Asp Ser Thr Glu Gln Gln Leu Ile Phe Ala Lys Lys Gln Tyr Gln
75 80 85

CGA TAC AGC AAA ATT GGG GGC GCT GTG GAT AAA AAC ACT CTA GAG GGT 342
Arg Tyr Ser Lys Ile Gly Gly Ala Val Asp Lys Asn Thr Leu Glu Gly
90 95 100
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TAT TTA GTG CAA TTA GTG CAT GCT AAA ACC AGA CGA TAGGGGGATT GATGAA 1065
Tyr Leu Val Gln Leu Val His Ala Lys Thr Arg Arg
330 335 340

TACTATCATA AGATA 1080

(2) INFORMATION FOR SEQ ID NO: 756:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 340 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756:

Met Met Ile Phe Ile Asp Ala Cys Phe Arg Lys Glu Thr Pro Tyr Thr
1 5 10 15

Pro Ile Trp Met Met Arg Gln Ala Gly Arg Tyr Leu Ser Glu Tyr Gln
20 25 30

Glu Ser Arg Lys Lys Ala Gly Ser Phe Leu Glu Leu Cys Lys Asn Ser
35 40 45

Asp Leu Ala Thr Glu Val Thr Leu Gln Pro Val Glu Ile Leu Gly Val
50 55 60

Asp Ala Ala Ile Leu Phe Ser Asp Ile Leu Val Val Pro Leu Glu Met
65 70 75 80

Gly Leu Asn Leu Glu Phe Ile Pro Lys Lys Gly Pro His Phe Leu Glu
85 90 95

Thr Ile Thr Asp Leu Lys Ser Val Glu Ser Leu Lys Val Gly Ala Tyr
100 105 110

Lys Gln Leu Asn Tyr Val Tyr Asp Thr Ile Ser Gln Thr Arg Gln Lys
115 120 125

Leu Ser Arg Glu Lys Ala Leu Ile Gly Phe Cys Gly Ser Pro Trp Thr
130 135 140

Leu Ala Thr Tyr Met Ile Glu Gly Glu Gly Ser Lys Ser Tyr Ala Lys
145 150 155 160

Ser Lys Lys Met Leu Tyr Ser Glu Pro Glu Val Leu Lys Ala Leu Leu
165 170 175

Glu Lys Leu Ser Leu Glu Leu Ile Glu Tyr Leu Ser Leu Gln Ile Gln
180 185 190

Ala Gly Val Asn Ala Val Met Ile Phe Asp Ser Trp Ala Ser Ala Leu
195 200 205

Glu Lys Glu Ala Tyr Leu Lys Phe Ser Trp Asp Tyr Leu Lys Lys Ile
210 215 220

Ser Lys Glu Leu Lys Lys Arg Tyr Ala His Ile Pro Val Ile Leu Phe
225 230 235 240

Pro Lys Gly Ile Gly Ala Tyr Leu Asp Ser Ile Asp Gly Glu Phe Asp
245 250 255

Val Phe Gly Val Asp Trp Gly Thr Pro Leu Thr Ala Ala Lys Lys Ile
260 265 270

Leu Gly Gly Lys Tyr Val Leu Gln Gly Asn Leu Glu Pro Thr Arg Leu
275 280 285

Tyr Asp Lys Asn Ala Leu Glu Glu Gly Val Glu Thr Ile Leu Lys Val
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GAA AGC CTA AAA GTA GGG GCT TAT AAA CAA CTA AAC TAT GTC TAT GAT 389
Glu Ser Leu Lys Val Gly Ala Tyr Lys Gln Leu Asn Tyr Val Tyr Asp
105 110 115 120

ACG ATT TCT CAA ACG CGC CAA AAG CTT TCT AGA GAG AAA GCG TTA ATC 437
Thr Ile Ser Gln Thr Arg Gln Lys Leu Ser Arg Glu Lys Ala Leu Ile
125 130 135

GGT TTT TGC GGA TCG CCT TGG ACT TTA GCG ACT TAC ATG ATA GAA GGC 485
Gly Phe Cys Gly Ser Pro Trp Thr Leu Ala Thr Tyr Met Ile Glu Gly
140 145 150

GAG GGG AGC AAA TCG TAT GCC AAA AGC AAG AAA ATG CTT TAT AGC GAG 533
Glu Gly Ser Lys Ser Tyr Ala Lys Ser Lys Lys Met Leu Tyr Ser Glu
155 160 165

CCT GAA GTT TTA AAA GCG CTT TTA GAA AAA TTA AGC CTT GAA TTG ATA 581
Pro Glu Val Leu Lys Ala Leu Leu Glu Lys Leu Ser Leu Glu Leu Ile
170 175 180

GAG TAT TTG AGC CTT CAA ATC CAA GCA GGG GTC AAT GCA GTG ATG ATC 629
Glu Tyr Leu Ser Leu Gln Ile Gln Ala Gly Val Asn Ala Val Met Ile
185 190 195 200

TTT GAC TCA TGG GCT AGC GCT TTA GAA AAA GAA GCG TAT TTG AAA TTC 677
Phe Asp Ser Trp Ala Ser Ala Leu Glu Lys Glu Ala Tyr Leu Lys Phe
205 210 215

AGT TGG GAT TAT TTG AAA AAA ATC TCT AAA GAG CTT AAA AAA CGC TAT 725
Ser Trp Asp Tyr Leu Lys Lys Ile Ser Lys Glu Leu Lys Lys Arg Tyr
220 225 230

GCG CAT ATC CCA GTT ATC CTT TTC CCT AAA GGG ATT GGC GCT TAT TTG 773
Ala His Ile Pro Val Ile Leu Phe Pro Lys Gly Ile Gly Ala Tyr Leu
235 240 245

GAT AGC ATA GAT GGG GAA TTT GAT GTG TTT GGC GTG GAT TGG GGC ACG 821
Asp Ser Ile Asp Gly Glu Phe Asp Val Phe Gly Val Asp Trp Gly Thr
250 255 260

CCT TTA ACT GCG GCA AAA AAG ATT TTA GGC GGT AAG TAT GTT TTG CAA 869
Pro Leu Thr Ala Ala Lys Lys Ile Leu Gly Gly Lys Tyr Val Leu Gln
265 270 275 280

GGG AAT TTA GAA CCC ACC CGC CTT TAT GAT AAA AAC GCT TTA GAA GAA 917
Gly Asn Leu Glu Pro Thr Arg Leu Tyr Asp Lys Asn Ala Leu Glu Glu
285 290 295

GGG GTT GAA ACG ATT CTA AAA GTC ATG GGC AAT CAA GGG CAT ATT TTT 965
Gly Val Glu Thr Ile Leu Lys Val Met Gly Asn Gln Gly His Ile Phe
300 305 310

AAT TTA GGG CAT GGG ATG TTG CCG GAT TTA CCC AGA GAA AAC GCC AAA 1013
Asn Leu Gly His Gly Met Leu Pro Asp Leu Pro Arg Glu Asn Ala Lys
315 320 325
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:754: 

Met Tyr Leu Gly Leu Asn Pro Lys Arg Thr Thr Lys Ser Ala Thr Ala
1 5 10 15

Pro Ile Lys Ile Met Leu Met Arg Cys Ala Gln Gln Lys Ile Gln Gly
20 25 30

Glu Ser Phe Lys Thr
35

(2) INFORMATION FOR SEQ ID NO: 755: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 30... 1049 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755: 

ACCATAATTA GACAAACCTT TAAGGATTT ATG ATG ATT TTC ATT GAT GCA TGT 53
Met Met Ile Phe Ile Asp Ala Cys
1 5

TTT AGA AAG GAA ACG CCT TAC ACG CCC ATT TGG ATG ATG AGG CAA GCG 101
Phe Arg Lys Glu Thr Pro Tyr Thr Pro Ile Trp Met Met Arg Gln Ala
10 15 20

GGG CGT TAC CTT AGC GAA TAC CAA GAG AGC CGT AAA AAA GCG GGG AGT 149
Gly Arg Tyr Leu Ser Glu Tyr Gln Glu Ser Arg Lys Lys Ala Gly Ser
25 30 35 40

TTC TTG GAA TTG TGT AAA AAT AGC GAT CTA GCC ACA GAA GTT ACC TTA 197
Phe Leu Glu Leu Cys Lys Asn Ser Asp Leu Ala Thr Glu Val Thr Leu
45 50 55

CAG CCG GTA GAG ATT TTA GGC GTG GAT GCG GCT ATT TTG TTT AGC GAT 245
Gln Pro Val Glu Ile Leu Gly Val Asp Ala Ala Ile Leu Phe Ser Asp
60 65 70

ATT TTA GTA GTG CCT TTG GAA ATG GGC TTG AAT TTG GAG TTT ATC CCC 293
Ile Leu Val Val Pro Leu Glu Met Gly Leu Asn Leu Glu Phe Ile Pro
75 80 85

AAA AAG GGG CCG CAT TTT TTA GAG ACG ATT ACG GAT TTA AAA AGC GTG 341
Lys Lys Gly Pro His Phe Leu Glu Thr Ile Thr Asp Leu Lys Ser Val
90 95 100
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Lys Glu Ile Gly Asp Lys Tyr Glu Lys Ile Leu Val Ile Glu Leu Asn
325 330 335

Lys Gly Gln Tyr Leu Glu Glu Ile Glu Arg Ala Met Gln Arg Lys Val
340 345 350

His Phe Leu Gly Gln Ala Asn Gly Arg Thr Ile Ser Pro Lys Gln Ile
355 360 365

Ile Ala Lys Leu Lys Glu Leu
370 375

(2) INFORMATION FOR SEQ ID NO: 753: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 54 . . . 164 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:753: 

TATGATAAAA GCTCTCATAT AACCCGCTAC TAGCCATAAG CAACAGCAAG GCA ATG 56
Met
1

TAT TTA GGC TTA AAC CCA AAA CGC ACC ACC AAA AGC GCC ACA GCC CCT 104
Tyr Leu Gly Leu Asn Pro Lys Arg Thr Thr Lys Ser Ala Thr Ala Pro
5 10 15

ATT AAA ATC ATG TTG ATG CGT TGC GCC CAG CAA AAA ATA CAA GGC GAA 152
Ile Lys Ile Met Leu Met Arg Cys Ala Gln Gln Lys Ile Gln Gly Glu
20 25 30

TCT TTC AAA ACA TAGCCAAAAT AACCTTAAAA AACGCTTT 192
Ser Phe Lys Thr
35



(2) INFORMATION FOR SEQ ID NO: 754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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TTTGCGTGTG GATAAAATAC CCACTTTGTG GTGTTGGGGC TGTGGCGATG GCGTGATTTT 1338



(2) INFORMATION FOR SEQ ID NO: 752: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752: 



Met Arg Glu Ile Ile Ser Asp Gly Asn Glu Leu Val Ala Lys Ala Ala
1 5 10 15

Ile Glu Val Gly Cys Arg Phe Phe Gly Gly Tyr Pro Ile Thr Pro Ser
20 25 30

Ser Asp Ile Met His Ala Met Ser Val Ala Leu Pro Lys Cys Gly Gly
35 40 45

His Phe Ile Gln Met Glu Asp Glu Ile Ser Gly Ile Ser Val Ser Leu
50 55 60

Gly Ala Ser Met Ser Gly Thr Lys Ser Met Thr Ala Ser Ser Gly Pro
65 70 75 80

Gly Ile Ser Leu Lys Val Glu Gln Ile Gly Tyr Ser Phe Met Ala Glu
85 90 95

Ile Pro Leu Val Ile Ala Asp Val Met Arg Ser Gly Pro Ser Thr Gly
100 105 110

Met Pro Thr Arg Val Ala Gln Gly Asp Val Asn Phe Leu Arg His Pro
115 120 125

Ile His Gly Asp Phe Lys Ala Val Ala Leu Ala Pro Ala Asn Leu Glu
130 135 140

Glu Ala Tyr Thr Glu Thr Val Arg Ala Phe Asn Leu Ala Glu Met Leu
145 150 155 160

Met Thr Pro Val Phe Leu Leu Met Asp Glu Thr Val Gly His Met Tyr
165 170 175

Gly Lys Val Gln Ile Pro Asp Leu Glu Glu Val Gln Lys Met Thr Ile
180 185 190

Asn Arg Lys Glu Phe Leu Gly Asp Lys Lys Asp Tyr Lys Pro Tyr Gly
195 200 205

Val Ala Gln Asp Glu Pro Ala Val Leu Asn Pro Phe Phe Lys Gly Tyr
210 215 220

Arg Tyr His Val Ser Gly Leu His His Gly Pro Ile Gly Phe Pro Thr
225 230 235 240

Glu Asp Ala Lys Ile Gly Gly Asp Leu Ile Asp Arg Leu Phe Asn Lys
245 250 255

Ile Glu Ser Lys Gln Asp Ile Ile Asn Glu Asn Glu Glu Met Asp Leu
260 265 270

Glu Gly Ala Glu Ile Val Val Ile Ala Tyr Gly Ser Val Ser Leu Ala
275 280 285

Val Lys Glu Ala Leu Lys Asp Tyr His Lys Glu Ser Lys Gln Lys Val
290 295 300

Gly Phe Phe Arg Pro Lys Thr Leu Trp Pro Ser Pro Ala Lys Arg Leu
305 310 315 320



GAAATCCATT AT 1350
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ATG CTC ATG ACT CCT GTA TTC TTG CTC ATG GAT GAA ACC GTG GGG CAT 648
Met Leu Met Thr Pro Val Phe Leu Leu Met Asp Glu Thr Val Gly His
160 165 170

ATG TAT GGC AAG GTG CAA ATC CCA GAT TTA GAA GAA GTG CAA AAG ATG 696
Met Tyr Gly Lys Val Gln Ile Pro Asp Leu Glu Glu Val Gln Lys Met
175 180 185 190

ACT ATT AAT CGT AAG GAA TTT CTG GGC GAT AAA AAA GAC TAC AAG CCT 744
Thr Ile Asn Arg Lys Glu Phe Leu Gly Asp Lys Lys Asp Tyr Lys Pro
195 200 205

TAT GGG GTC GCA CAA GAC GAG CCG GCT GTT TTG AAC CCT TTC TTT AAA 792
Tyr Gly Val Ala Gln Asp Glu Pro Ala Val Leu Asn Pro Phe Phe Lys
210 215 220

GGT TAT CGC TAC CAT GTT TCA GGC TTG CAC CAT GGG CCT ATT GGC TTT 840
Gly Tyr Arg Tyr His Val Ser Gly Leu His His Gly Pro Ile Gly Phe
225 230 235

CCT ACT GAA GAC GCT AAA ATT GGT GGG GAT TTG ATT GAC AGA TTA TTT 888
Pro Thr Glu Asp Ala Lys Ile Gly Gly Asp Leu Ile Asp Arg Leu Phe
240 245 250

AAT AAG ATT GAA TCC AAG CAA GAC ATT ATC AAC GAA AAT GAG GAA ATG 936
Asn Lys Ile Glu Ser Lys Gln Asp Ile Ile Asn Glu Asn Glu Glu Met
255 260 265 270

GAT TTA GAG GGT GCT GAA ATC GTT GTT ATC GCT TAC GGT TCG GTT TCT 984
Asp Leu Glu Gly Ala Glu Ile Val Val Ile Ala Tyr Gly Ser Val Ser
275 280 285

TTG GCG GTT AAA GAG GCC TTG AAA GAT TAC CAT AAA GAA AGC AAG CAA 1032
Leu Ala Val Lys Glu Ala Leu Lys Asp Tyr His Lys Glu Ser Lys Gln
290 295 300

AAA GTC GGC TTT TTC AGG CCT AAA ACC TTA TGG CCA AGC CCG GCT AAA 1080
Lys Val Gly Phe Phe Arg Pro Lys Thr Leu Trp Pro Ser Pro Ala Lys
305 310 315

CGC TTG AAA GAA ATA GGG GAT AAA TAC GAA AAA ATC CTT GTG ATT GAA 1128
Arg Leu Lys Glu Ile Gly Asp Lys Tyr Glu Lys Ile Leu Val Ile Glu
320 325 330

TTG AAT AAA GGG CAG TAT TTA GAA GAA ATT GAA AGG GCT ATG CAA AGA 1176
Leu Asn Lys Gly Gln Tyr Leu Glu Glu Ile Glu Arg Ala Met Gln Arg
335 340 345 350

AAG GTG CAT TTC TTG GGG CAA GCC AAT GGG CGC ACG ATT TCG CCT AAA 1224
Lys Val His Phe Leu Gly Gln Ala Asn Gly Arg Thr Ile Ser Pro Lys
355 360 365

CAA ATC ATC GCA AAA TTG AAG GAG CTT TAAAATGGCG TTTAATTATG ATGAATA 1278
Gln Ile Ile Ala Lys Leu Lys Glu Leu
370 375
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 127 ... 1251 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751: 

TGTGGCTGAC AGGAAGGATT TCAAATTCGC TAAAGTTTCT AAAGAAGCCC AAGAAAGAAG 60

CGAAAAGGTT AAGGCCAATA AATACATGCT CTTAGAAGAG ACTATTTTAG AAGGGAGAGA 120

CAAATA ATG CGT GAG ATT ATT TCT GAT GGG AAT GAA TTA GTC GCT AAA 168
Met Arg Glu Ile Ile Ser Asp Gly Asn Glu Leu Val Ala Lys
1 5 10

GCG GCG ATT GAA GTG GGG TGT CGG TTT TTT GGG GGC TAT CCT ATC ACG 216
Ala Ala Ile Glu Val Gly Cys Arg Phe Phe Gly Gly Tyr Pro Ile Thr
15 20 25 30

CCA AGT TCG GAT ATT ATG CAT GCG ATG AGC GTG GCT TTA CCC AAA TGC 264
Pro Ser Ser Asp Ile Met His Ala Met Ser Val Ala Leu Pro Lys Cys
35 40 45

GGC GGT CAT TTT ATC CAA ATG GAA GAT GAA ATC AGC GGG ATT AGC GTG 312
Gly Gly His Phe Ile Gln Met Glu Asp Glu Ile Ser Gly Ile Ser Val
50 55 60

TCT TTA GGA GCG AGC ATG AGC GGG ACG AAG TCT ATG ACA GCA AGC TCT 360
Ser Leu Gly Ala Ser Met Ser Gly Thr Lys Ser Met Thr Ala Ser Ser
65 70 75

GGG CCT GGT ATT TCA TTG AAA GTG GAG CAA ATC GGT TAT TCT TTC ATG 408
Gly Pro Gly Ile Ser Leu Lys Val Glu Gln Ile Gly Tyr Ser Phe Met
80 85 90

GCG GAA ATC CCT TTA GTG ATC GCT GAT GTG ATG CGT TCA GGC CCA TCA 456
Ala Glu Ile Pro Leu Val Ile Ala Asp Val Met Arg Ser Gly Pro Ser
95 100 105 110

ACC GGA ATG CCC ACT CGT GTG GCT CAA GGC GAT GTG AAT TTC TTA AGA 504
Thr Gly Met Pro Thr Arg Val Ala Gln Gly Asp Val Asn Phe Leu Arg
115 120 125

CAC CCC ATA CAT GGG GAT TTT AAA GCC GTC GCG CTC GCT CCT GCG AAT 552
His Pro Ile His Gly Asp Phe Lys Ala Val Ala Leu Ala Pro Ala Asn
130 135 140

TTA GAA GAA GCT TAC ACC GAA ACC GTT CGC GCG TTC AAT TTG GCT GAA 600
Leu Glu Glu Ala Tyr Thr Glu Thr Val Arg Ala Phe Asn Leu Ala Glu
145 150 155
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35 40 45 

GTA GCC TAC CCA GAG AGC TGT ATC GGT TGC GTG CAA TGC GAG TTG CAC 250
Val Ala Tyr Pro Glu Ser Cys Ile Gly Cys Val Gln Cys Glu Leu His
50 55 60 65

TGC CCG GAT TTT GCG ATT TAT GTG GCT GAC AGG AAG GAT TTC AAA TTC 298
Cys Pro Asp Phe Ala Ile Tyr Val Ala Asp Arg Lys Asp Phe Lys Phe
70 75 80

GCT AAA GTT TCT AAA GAA GCC CAA GAA AGA AGC GAA AAG GTT AAG GCC 346
Ala Lys Val Ser Lys Glu Ala Gln Glu Arg Ser Glu Lys Val Lys Ala
85 90 95

AAT AAA TAC ATG CTC TTA GAA GAG ACT ATT TTA GAA GGG AGA GAC AAA T 395
Asn Lys Tyr Met Leu Leu Glu Glu Thr Ile Leu Glu Gly Arg Asp Lys
100 105 110

AATGCGTGAG ATTATTTCTG ATGGGAATGA ATTAGTCGCT AAAGCGGCGA TTGAA 450

(2) INFORMATION FOR SEQ ID NO:750: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750: 

Met Ala Lys Met Ser Ala Pro Asp Gly Val Ala Val Trp Val Asn Glu
1 5 10 15

Asp Arg Cys Lys Gly Cys Asp Ile Cys Val Ser Val Cys Pro Ala Gly
20 25 30

Val Leu Gly Met Gly Ile Glu Lys Glu Arg Val Leu Gly Lys Val Ala
35 40 45

Lys Val Ala Tyr Pro Glu Ser Cys Ile Gly Cys Val Gln Cys Glu Leu
50 55 60

His Cys Pro Asp Phe Ala Ile Tyr Val Ala Asp Arg Lys Asp Phe Lys
65 70 75 80

Phe Ala Lys Val Ser Lys Glu Ala Gln Glu Arg Ser Glu Lys Val Lys
85 90 95

Ala Asn Lys Tyr Met Leu Leu Glu Glu Thr Ile Leu Glu Gly Arg Asp
100 105 110

Lys



(2) INFORMATION FOR SEQ ID NO:751: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748:

Met Arg Ser Cys Leu Phe Leu Lys Thr Asn Ser Val Leu Ser Ile Leu
1 5 10 15

Met Gly Asp Lys Pro Ser Leu Lys Thr Thr Glu Gly Phe Ile Lys Val
20 25 30

Ala Leu Ile Thr Glu Phe Phe Lys Ser Asp Gly Thr Asn Ser Leu Gly
35 40 45

Val Lys Leu Ala Leu Ile Glu Ala Leu Ser Ile Leu Lys Leu Ala Asn
50 55 60

Trp Ile Leu Ser Lys Ile His Val Phe Lys Phe Phe Cys Asp Trp Arg
65 70 75 80

Trp Lys Arg Gly Phe Lys Asn Ala Arg Leu Phe Ile Thr Glu Val Leu
85 90 95

Ile Phe Asn Ser Met Val Phe Lys Ser Val Ser Pro Cys Lys
100 105 110

(2) INFORMATION FOR SEQ ID NO: 749:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 56... 394 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749:

GGCAAATTTT GATTGCTAGG GCTTTAAATG CAGCTTTTAG CACAAAGGAG AATGA ATG 58
Met
1

GCT AAA ATG AGC GCT CCA GAT GGG GTT GCC GTT TGG GTG AAT GAA GAC 106
Ala Lys Met Ser Ala Pro Asp Gly Val Ala Val Trp Val Asn Glu Asp
5 10 15

AGG TGT AAG GGT TGT GAT ATT TGC GTA TCG GTA TGC CCT GCT GGG GTT 154
Arg Cys Lys Gly Cys Asp Ile Cys Val Ser Val Cys Pro Ala Gly Val
20 25 30

CTT GGC ATG GGG ATT GAA AAA GAA AGG GTG CTT GGA AAA GTG GCC AAA 202
Leu Gly Met Gly Ile Glu Lys Glu Arg Val Leu Gly Lys Val Ala Lys




(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 46... 375 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747:

GAGCCAGTTA ATTCCATGGT TTCATAAGTG ATTTTTTGGG GCTGT ATG AGG AGC TGT 57
Met Arg Ser Cys
1

TTG TTT TTG AAA ACT AAT TCG GTT TTA TCC ATT TTA ATG GGC GAT AAG 105
Leu Phe Leu Lys Thr Asn Ser Val Leu Ser Ile Leu Met Gly Asp Lys
5 10 15 20

CCA TCA TTA AAA ACG ACT GAA GGC TTC ATC AAA GTG GCT TTA ATT ACA 153
Pro Ser Leu Lys Thr Thr Glu Gly Phe Ile Lys Val Ala Leu Ile Thr
25 30 35

GAA TTT TTT AAA AGC GAT GGG ACA AAC TCG CTA GGA GTG AAA TTG GCT 201
Glu Phe Phe Lys Ser Asp Gly Thr Asn Ser Leu Gly Val Lys Leu Ala
40 45 50

TTG ATT GAA GCG TTA TCA ATC TTA AAG CTA GCG AAT TGG ATC TTA TCA 249
Leu Ile Glu Ala Leu Ser Ile Leu Lys Leu Ala Asn Trp Ile Leu Ser
55 60 65

AAA ATC CAT GTT TTT AAA TTT TTT TGC GAT TGG CGT TGG AAA AGA GGC 297
Lys Ile His Val Phe Lys Phe Phe Cys Asp Trp Arg Trp Lys Arg Gly
70 75 80

TTT AAA AAC GCC AGG CTT TTC ATT ACA GAA GTG TTA ATT TTT AAT TCT 345
Phe Lys Asn Ala Arg Leu Phe Ile Thr Glu Val Leu Ile Phe Asn Ser
85 90 95 100

ATG GTT TTT AAA TCG GTT AGC CCT TGC AAA TAAATTGCAG CGCTGGGTTC GAT 398
Met Val Phe Lys Ser Val Ser Pro Cys Lys
105 110

TAAGGGCTTG ACAATCAAAT TAAACGCCAT TTTCCTAGCT TTGGGTGAAT AG 450

(2) INFORMATION FOR SEQ ID NO: 748: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 






CATCAGATAT TATCCAAGCG CCTTTTAAAA TCTTGCGCCG TATTTTCACA CCTATTGACA 60

TCATCGTGG ATG AAG TCA AAA AAA ACA TTG ATT CAA AAA GGA AGT AAA ATG 111
Met Lys Ser Lys Lys Thr Leu Ile Gln Lys Gly Ser Lys Met
1 5 10

ACG CTC AAT GAA GCC ATT AAA GAC AAA GTT TAT GAA ATC GTA GAA ATC 159
Thr Leu Asn Glu Ala Ile Lys Asp Lys Val Tyr Glu Ile Val Glu Ile
15 20 25 30

GCT AAC TGC GAT GAA GCC CTT AAA AAA CGC TTT CTC TCT TTT GGT ATC 207
Ala Asn Cys Asp Glu Ala Leu Lys Lys Arg Phe Leu Ser Phe Gly Ile
35 40 45

CAT GAA GGG GTT CAA TGC ATT CTT TTG CAT TAT TCC ATG AAA AAA GCC 255
His Glu Gly Val Gln Cys Ile Leu Leu His Tyr Ser Met Lys Lys Ala
50 55 60

ACG CTT TCG GTT AAA ATC AAC CGC ATT CAA GTG GCT TTA AGA TCC CAT 303
Thr Leu Ser Val Lys Ile Asn Arg Ile Gln Val Ala Leu Arg Ser His
65 70 75

GAA GCA CAA TAC CTT GTC ATC AAA GAA AGC GTG TGAAAATGGG TTTAAAACGC 356
Glu Ala Gln Tyr Leu Val Ile Lys Glu Ser Val
80 85

GCTAAACGCT ATAATAA 373

(2) INFORMATION FOR SEQ ID NO: 746:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 89 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746:

Met Lys Ser Lys Lys Thr Leu Ile Gln Lys Gly Ser Lys Met Thr Leu
1 5 10 15

Asn Glu Ala Ile Lys Asp Lys Val Tyr Glu Ile Val Glu Ile Ala Asn
20 25 30

Cys Asp Glu Ala Leu Lys Lys Arg Phe Leu Ser Phe Gly Ile His Glu
35 40 45

Gly Val Gln Cys Ile Leu Leu His Tyr Ser Met Lys Lys Ala Thr Leu
50 55 60

Ser Val Lys Ile Asn Arg Ile Gln Val Ala Leu Arg Ser His Glu Ala
65 70 75 80

Gln Tyr Leu Val Ile Lys Glu Ser Val
85

(2) INFORMATION FOR SEQ ID NO: 747: 
(i) SEQUENCE CHARACTERISTICS: 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744:

Met Lys Ile Leu Ala Leu Leu Ile Ala Ile Ile Gly His Glu Ile Met
1 5 10 15

His Gly Leu Ser Ala Phe Leu Phe Gly Asp Arg Ser Thr Lys Asp Ala
20 25 30

Arg Arg Leu Ser Leu Asn Pro Ile Arg His Leu Asp Met Met Gly Ser
35 40 45

Val Leu Leu Pro Ala Leu Leu Leu Ile Phe Gln Ala Pro Phe Leu Phe
50 55 60

Gly Trp Ala Lys Pro Val Pro Val Asp Met Arg Tyr Ile Val Ser Gln
65 70 75 80

Lys Gly Ser Leu Ala Cys Val Val Val Ser Leu Ala Gly Val Ala Tyr
85 90 95

Asn Phe Thr Leu Ala Val Leu Leu Ala Phe Ile Thr His Trp Ser Phe
100 105 110

Gln Gln Leu Gly Ile Asn Ala Leu Ser Ile Asp Glu Leu Asn Leu Tyr
115 120 125

Gln Leu Ala Leu Val Thr Phe Leu Ile Gln Gly Ile Leu Tyr Asn Leu
130 135 140

Val Leu Gly Val Phe Asn Ser Leu Pro Ile Pro Pro Leu Asp Gly Ser
145 150 155 160

Lys Ala Leu Gly Phe Leu Ala Leu His Phe Lys Ser Ala Phe Leu Leu
165 170 175

Glu Trp Phe Ser Lys Met Glu Arg Tyr Gly Leu Leu Val Val Phe Ile
180 185 190

Phe Leu Phe Ile Pro Pro Leu Ser Glu Phe Phe Ile His Ala Pro Thr
195 200 205

Arg Phe Leu Phe Ser Leu Leu Leu Ser
210 215



(2) INFORMATION FOR SEQ ID NO: 745:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70... 336 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745:
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AGC ACT AAA GAC GCT AGG CGT TTG ACT TTA AAC CCT ATC AGG CAT TTA 206

Ser Thr Lys Asp Ala Arg Arg Leu Ser Leu Asn Pro Ile Arg His Leu
30 35 40

GAC ATG ATG GGT TCG GTG CTT TTA CCG GCT TTA TTA CTC ATT TTT CAA 254
Asp Met Met Gly Ser Val Leu Leu Pro Ala Leu Leu Leu Ile Phe Gln
45 50 55

GCC CCT TTT TTG TTT GGG TGG GCC AAA CCC GTG CCT GTT GAT ATG CGC 302
Ala Pro Phe Leu Phe Gly Trp Ala Lys Pro Val Pro Val Asp Met Arg
60 65 70 75

TAC ATT GTC TCT CAA AAA GGC TCT CTA GCA TGC GTA GTG GTG AGT TTA 350
Tyr Ile Val Ser Gln Lys Gly Ser Leu Ala Cys Val Val Val Ser Leu
80 85 90

GCC GGG GTG GCT TAT AAT TTC ACT CTG GCC GTT CTG CTC GCT TTC ATC 398
Ala Gly Val Ala Tyr Asn Phe Thr Leu Ala Val Leu Leu Ala Phe Ile
95 100 105

ACG CAT TGG AGC TTC CAA CAA CTA GGG ATC AAC GCT TTA AGC ATT GAT 446
Thr His Trp Ser Phe Gln Gln Leu Gly Ile Asn Ala Leu Ser Ile Asp
110 115 120

GAA TTG AAT CTT TAT CAG CTC GCT TTA GTA ACC TTT CTC ATT CAA GGC 494
Glu Leu Asn Leu Tyr Gln Leu Ala Leu Val Thr Phe Leu Ile Gln Gly
125 130 135

ATT CTT TAT AAT CTT GTC TTA GGC GTT TTC AAT AGC CTC CCT ATC CCG 542
Ile Leu Tyr Asn Leu Val Leu Gly Val Phe Asn Ser Leu Pro Ile Pro
140 145 150 155

CCC TTA GAC GGC TCC AAA GCG TTA GGC TTT TTA GCG TTG CAT TTT AAA 590
Pro Leu Asp Gly Ser Lys Ala Leu Gly Phe Leu Ala Leu His Phe Lys
160 165 170

AGT GCG TTT TTA TTG GAA TGG TTT TCT AAA ATG GAA CGC TAC GGC TTG 638
Ser Ala Phe Leu Leu Glu Trp Phe Ser Lys Met Glu Arg Tyr Gly Leu
175 180 185

TTG GTA GTG TTT ATT TTT TTG TTT ATC CCC CCT TTA TCG GAG TTT TTT 686
Leu Val Val Phe Ile Phe Leu Phe Ile Pro Pro Leu Ser Glu Phe Phe
190 195 200

ATC CAT GCG CCC ACA AGA TTT TTA TTT TCT TTA CTC CTC TCT TAATCTTTT 737
Ile His Ala Pro Thr Arg Phe Leu Phe Ser Leu Leu Leu Ser
205 210 215

ATCAAGGAGA GTTTATGAAT AAGCTCTTAA AGTTTTCTCA A 778

(2) INFORMATION FOR SEQ ID NO:744: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid

(2) INFORMATION FOR SEQ ID NO: 742:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742: 

Met Phe Thr Gln Trp Phe Ile Leu Thr Ile Ala Ile Val Phe Ile Leu
1 5 10 15

Tyr Met Gly Val Arg Thr Phe Phe Phe Lys Thr Val Ala Lys Arg Gln
20 25 30

Glu Arg Thr Asn Ala Ser Met Lys Leu Thr Leu Gln Glu Ala Glu Ile
35 40 45

Leu Ile Gln Lys His Gln Leu Gln Leu Gln Arg Ala Leu Gly Asn Ile
50 55 60

Asp Ile Leu Thr Gln Glu Met Ser Ser Leu Lys Thr Glu Leu Lys Ala
65 70 75 80

Leu Lys Gln Arg Asn Ser Glu Tyr Lys Gly Glu Ser Asp Lys Tyr Lys
85 90 95

Asn Arg Ile Lys Glu Leu Glu Gln Lys Ile Glu Ala Leu Leu
100 105 110

(2) INFORMATION FOR SEQ ID NO: 743: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 78... 728 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:743: 

AAAAAAGAAA ACGCAACGCA TTAAGGTTTT TTGTGCAATT TTTTGATTTC TCTTTAGAAA 60

GTTTTATTAC CACCTTA ATG AAA ATC CTA GCC CTT TTA ATC GCT ATC ATA 110
Met Lys Ile Leu Ala Leu Leu Ile Ala Ile Ile
1 5 10

GGG CAT GAG ATC ATG CAT GGC TTG AGC GCG TTT TTA TTT GGG GAT AGG 158
Gly His Glu Ile Met His Gly Leu Ser Ala Phe Leu Phe Gly Asp Arg
15 20 25

165 170 175

Leu Glu Leu Glu
180



(2) INFORMATION FOR SEQ ID NO: 741: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 374 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 28... 357 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741: 



TTTCGCTAAA AAGGATATTT TAACAGA ATG TTT ACC CAA TGG TTT ATT CTC ACT 54
Met Phe Thr Gln Trp Phe Ile Leu Thr
1 5

ATC GCT ATT GTT TTT ATC CTT TAT ATG GGT GTG CGC ACT TTC TTT TTT 102
Ile Ala Ile Val Phe Ile Leu Tyr Met Gly Val Arg Thr Phe Phe Phe
10 15 20 25

AAA ACC GTG GCT AAA CGG CAA GAA CGC ACC AAC GCA TCC ATG AAG CTC 150
Lys Thr Val Ala Lys Arg Gln Glu Arg Thr Asn Ala Ser Met Lys Leu
30 35 40

ACC TTA CAA GAA GCT GAA ATT TTG ATC CAA AAA CAC CAG TTG CAA CTC 198
Thr Leu Gln Glu Ala Glu Ile Leu Ile Gln Lys His Gln Leu Gln Leu
45 50 55

CAA AGG GCT TTG GGC AAT ATT GAT ATT CTC ACC CAA GAA ATG AGC TCG 246
Gln Arg Ala Leu Gly Asn Ile Asp Ile Leu Thr Gln Glu Met Ser Ser
60 65 70

TTA AAA ACA GAA CTA AAA GCC CTT AAA CAG CGC AAC TCT GAA TAC AAA 294
Leu Lys Thr Glu Leu Lys Ala Leu Lys Gln Arg Asn Ser Glu Tyr Lys
75 80 85

GGC GAA TCG GAT AAA TAT AAA AAT CGT ATT AAA GAA TTG GAG CAA AAA 342
Gly Glu Ser Asp Lys Tyr Lys Asn Arg Ile Lys Glu Leu Glu Gln Lys
90 95 100 105

ATA GAA GCT CTC CTT TAAAAACGCT ATAATAA 374
Ile Glu Ala Leu Leu
110

GCT TTT AGG GGA ATT AAG GGG GTA AGG GTG GTG TTG ATT GAT GAT TTA 444
Ala Phe Arg Gly Ile Lys Gly Val Arg Val Val Leu Ile Asp Asp Leu
115 120 125

TTA GCC ACT GGA GGC ACA GCT TTA GCG AGC CTT GAG CTT ATC AAA GCC 492
Leu Ala Thr Gly Gly Thr Ala Leu Ala Ser Leu Glu Leu Ile Lys Ala
130 135 140

CTA CAA GCC GAA TGC ATA GAA GCA TGC TTT TTG ATA GGG TTA AAA GAA 540
Leu Gln Ala Glu Cys Ile Glu Ala Cys Phe Leu Ile Gly Leu Lys Glu
145 150 155 160

TTA CCG GGT ATC CAA CTT TTA GAA GAA CGC GTG AAA ACC TTT TGT TTG 588
Leu Pro Gly Ile Gln Leu Leu Glu Glu Arg Val Lys Thr Phe Cys Leu
165 170 175

TTA GAG TTA GAA TAAGGGTGA 609
Leu Glu Leu Glu
180



(2) INFORMATION FOR SEQ ID NO: 740:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740:

Met Asn Glu Thr Leu Lys Glu Glu Leu Leu Gln Ser Ile Arg Glu Val
1 5 10 15

Lys Asp Tyr Pro Lys Lys Gly Ile Leu Phe Lys Asp Ile Thr Thr Leu
20 25 30

Leu Asn Tyr Pro Lys Leu Phe Asn Lys Leu Ile Asp Thr Leu Lys Lys
35 40 45

Arg Tyr Leu Ala Leu Asn Ile Asp Phe Ile Val Gly Ile Glu Ala Arg
50 55 60

Gly Phe Ile Leu Gly Ser Ala Leu Ala Tyr Ala Leu Gly Val Gly Phe
65 70 75 80

Val Pro Val Arg Lys Lys Gly Lys Leu Pro Ala His Thr Leu Ser Gln
85 90 95

Ser Tyr Ser Leu Glu Tyr Gly Ser Asp Ser Ile Glu Ile His Ser Asp
100 105 110

Ala Phe Arg Gly Ile Lys Gly Val Arg Val Val Leu Ile Asp Asp Leu
115 120 125

Leu Ala Thr Gly Gly Thr Ala Leu Ala Ser Leu Glu Leu Ile Lys Ala
130 135 140

Leu Gln Ala Glu Cys Ile Glu Ala Cys Phe Leu Ile Gly Leu Lys Glu
145 150 155 160

Leu Pro Gly Ile Gln Leu Leu Glu Glu Arg Val Lys Thr Phe Cys Leu

His Ala Leu Gly Trp Leu Lys Lys His Pro Tyr Ala Leu Ile Leu Leu
115 120 125

Leu Val Ser Phe Leu Ala Leu Val Leu Trp Tyr Phe Gln Tyr Tyr Ser
130 135 140

Lys Lys Asn Arg
145

(2) INFORMATION FOR SEQ ID NO: 739:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 609 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 61... 600 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739:

TAAAAACGCT ATAATAAATC AAAATTCTAC AACCAATCCG TTATATTAAA GGAAATCAAA 60

ATG AAT GAA ACG CTC AAA GAA GAA CTT TTA CAA AGC ATC AGA GAA GTG 108
Met Asn Glu Thr Leu Lys Glu Glu Leu Leu Gln Ser Ile Arg Glu Val
1 5 10 15

AAA GAC TAC CCT AAA AAA GGG ATT TTA TTC AAA GAC ATT ACC ACG CTA 156
Lys Asp Tyr Pro Lys Lys Gly Ile Leu Phe Lys Asp Ile Thr Thr Leu
20 25 30

CTC AAC TAC CCT AAA CTC TTT AAC AAA CTC ATT GAC ACG CTC AAA AAA 204
Leu Asn Tyr Pro Lys Leu Phe Asn Lys Leu Ile Asp Thr Leu Lys Lys
35 40 45

CGC TAT CTC GCT CTC AAT ATA GAC TTT ATC GTG GGC ATT GAA GCG AGA 252
Arg Tyr Leu Ala Leu Asn Ile Asp Phe Ile Val Gly Ile Glu Ala Arg
50 55 60

GGG TTT ATT TTA GGC TCT GCT CTC GCT TAT GCG CTT GGG GTG GGT TTT 300
Gly Phe Ile Leu Gly Ser Ala Leu Ala Tyr Ala Leu Gly Val Gly Phe
65 70 75 80

GTG CCT GTG AGG AAA AAG GGC AAA CTC CCC GCA CAC ACC CTA TCT CAA 348
Val Pro Val Arg Lys Lys Gly Lys Leu Pro Ala His Thr Leu Ser Gln
85 90 95

AGC TAC AGC CTA GAA TAC GGG AGC GAC AGC ATA GAA ATC CAC TCC GAC 396
Ser Tyr Ser Leu Glu Tyr Gly Ser Asp Ser Ile Glu Ile His Ser Asp
100 105 110

55 60 65

CGC ACC ATC ATT CCC ATT AGC ATA GGT CTC ACG CGT TAT AGC GCT TTA 297
Arg Thr Ile Ile Pro Ile Ser Ile Gly Leu Thr Arg Tyr Ser Ala Leu
70 75 80

AAA TTC GCT ATC ATC AAT CTC ATT AGC GCG ATG GTG TGG GCG AGC ATT 345
Lys Phe Ala Ile Ile Asn Leu Ile Ser Ala Met Val Trp Ala Ser Ile
85 90 95

ACC ATT ATT CTA GCG TGG TAT TTA GGA GAA GAG TTA TTG CAT GCG TTA 393
Thr Ile Ile Leu Ala Trp Tyr Leu Gly Glu Glu Leu Leu His Ala Leu
100 105 110 115

GGG TGG CTT AAA AAA CAC CCT TAT GCG CTA ATA TTA CTA TTA GTA TCT 441
Gly Trp Leu Lys Lys His Pro Tyr Ala Leu Ile Leu Leu Leu Val Ser
120 125 130

TTC TTG GCG TTA GTG CTG TGG TAT TTC CAA TAC TAT AGT AAG AAA AAC 489
Phe Leu Ala Leu Val Leu Trp Tyr Phe Gln Tyr Tyr Ser Lys Lys Asn
135 140 145

CGC TAGAGTGCAA TACAATTCTT GAAAGATATG AAATTAAAAA AGGAGACTTT ATGTTA 548
Arg

AAAATCAAAT TA 560

(2) INFORMATION FOR SEQ ID NO: 738:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738:

Met His Leu Gly Leu Ala Ile Leu Val Ala Gly Ile Gly Gly Phe Val
1 5 10 15

Gly Asp Gln Ile Tyr Phe Tyr Ile Gly Arg Thr Asn Lys Ala Tyr Ile
20 25 30

Gln Lys Lys Leu Glu Lys Gln Arg Arg Lys Leu Ala Leu Ala His Leu
35 40 45

Leu Leu Gln Lys His Gly Trp Phe Ile Ile Phe Ile Gln Arg Tyr Met
50 55 60

Tyr Gly Met Arg Thr Ile Ile Pro Ile Ser Ile Gly Leu Thr Arg Tyr
65 70 75 80

Ser Ala Leu Lys Phe Ala Ile Ile Asn Leu Ile Ser Ala Met Val Trp
85 90 95

Ala Ser Ile Thr Ile Ile Leu Ala Trp Tyr Leu Gly Glu Glu Leu Leu
100 105 110


WO 98/43478



PCT/US98/06371 



355 360 365 

Thr Gly Ala Cys Val Val Gly Leu Gly Glu Phe Thr Ser Ala Ile Met
370 375 380

Gly His Asn Glu Glu Leu Lys Asn Leu Phe Glu Thr Ser Gly Leu Glu
385 390 395 400

Ser Gly Glu Leu Leu Ala Lys Leu Pro Phe Asn Arg His Leu Lys Lys
405 410 415

Leu Ile Glu Ser Lys Ile Ala Asp Val Cys Asn Ile Ser Ser Ser Arg
420 425 430

Tyr Gly Gly Ala Ile Thr Ala Gly Leu Phe Leu Asn Glu Phe Ile Arg
435 440 445

Asp Glu Phe Lys Asp Lys Trp Leu His Ile Asp Ile Ala Gly Pro Ala

450 455 460 

Tyr Val Glu Lys Glu Trp Asp Val Asn Ser Phe Gly Ala Ser Gly Ala 
465 470 475 480 

Gly Val Arg Ala Cys Thr Ala Phe Val Glu Glu Leu Leu Lys Lys Ala 
485 490 495 

(2) INFORMATION FOR SEQ ID NO:737: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 492 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737:

GGCGAAATCG GGTTAATTTT AGCAGGGATT GCCAGCTATA CCGGTCAT ATG CAT TTA 57
Met His Leu
1

GGG TTA GCC ATT TTA GTC GCA GGG ATT GGG GGC TTT GTG GGG GAT CAG 105
Gly Leu Ala Ile Leu Val Ala Gly Ile Gly Gly Phe Val Gly Asp Gln
5 10 15

ATC TAT TTT TAC ATC GGC CGC ACC AAT AAA GCT TAC ATC CAA AAA AAG 153
Ile Tyr Phe Tyr Ile Gly Arg Thr Asn Lys Ala Tyr Ile Gln Lys Lys
20 25 30 35

CTA GAA AAA CAA CGC CGA AAA CTA GCC CTA GCC CAT TTA TTG TTG CAA 201
Leu Glu Lys Gln Arg Arg Lys Leu Ala Leu Ala His Leu Leu Leu Gln
40 45 50

AAA CAC GGC TGG TTT ATC ATT TTT ATC CAA CGC TAT ATG TAT GGC ATG 249
Lys His Gly Trp Phe Ile Ile Phe Ile Gln Arg Tyr Met Tyr Gly Met

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 496 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736:



Met Leu Lys Ile Lys Leu Glu Lys Thr Thr Phe Glu Asn Ala Lys Ala
1 5 10 15

Glu Cys Ser Leu Val Phe Ile Ile Asn Lys Asp Phe Ser His Ala Trp
20 25 30

Val Lys Asn Lys Glu Leu Leu Glu Thr Phe Lys Tyr Glu Gly Glu Gly
35 40 45

Val Phe Leu Asp Gln Glu Asn Lys Ile Leu Tyr Ala Gly Val Lys Glu
50 55 60

Asp Asp Val His Leu Leu Arg Glu Ser Ala Cys Leu Ala Val Arg Thr
65 70 75 80

Leu Lys Lys Leu Ala Phe Lys Ser Val Lys Val Gly Val Tyr Thr Cys
85 90 95

Gly Ala His Ser Lys Asp Asn Ala Leu Leu Glu Asn Leu Lys Ala Leu
100 105 110

Phe Leu Gly Leu Lys Leu Gly Leu Tyr Glu Tyr Asp Thr Phe Lys Ser
115 120 125

Asn Lys Lys Glu Ser Val Leu Lys Glu Ala Ile Val Ala Leu Glu Leu
130 135 140

His Lys Pro Cys Glu Lys Thr Cys Ala Asn Ser Leu Glu Lys Ser Ala
145 150 155 160

Lys Glu Ala Leu Lys Tyr Ala Glu Ile Met Thr Glu Ser Leu Asn Ile
165 170 175

Val Lys Asp Leu Val Asn Thr Pro Pro Met Ile Gly Thr Pro Val Tyr
180 185 190

Met Ala Glu Val Ala Gln Lys Val Ala Lys Glu Asn His Leu Glu Ile
195 200 205

His Val His Asp Glu Lys Phe Leu Glu Glu Lys Lys Met Asn Ala Phe
210 215 220

Leu Ala Val Asn Lys Ala Ser Leu Ser Val Asn Pro Pro Arg Leu Ile
225 230 235 240

His Leu Val Tyr Lys Pro Lys Lys Ala Lys Lys Lys Ile Ala Leu Val
245 250 255

Gly Lys Gly Leu Thr Tyr Asp Cys Gly Gly Leu Ser Leu Lys Pro Ala
260 265 270

Asp Tyr Met Val Thr Met Lys Ala Asp Lys Gly Gly Gly Ser Ala Val
275 280 285

Ile Gly Leu Leu Asn Ala Leu Ala Lys Leu Gly Val Glu Ala Glu Val
290 295 300

His Gly Ile Ile Gly Ala Thr Glu Asn Met Ile Gly Pro Ala Ala Tyr
305 310 315 320

Lys Pro Asp Asp Ile Leu Ile Ser Lys Glu Gly Lys Ser Ile Glu Val
325 330 335

Arg Asn Thr Asp Ala Glu Gly Arg Leu Val Leu Ala Asp Cys Leu Ser
340 345 350

Tyr Ala Gln Asp Leu Asn Pro Asp Val Ile Val Asp Phe Ala Thr Leu

GTG CAT GGC ATT ATT GGG GCT ACA GAA AAC ATG ATA GGC CCA GCC GCT 1020
Val His Gly Ile Ile Gly Ala Thr Glu Asn Met Ile Gly Pro Ala Ala
305 310 315

TAT AAA CCA GAT GAT ATT TTG ATC TCC AAA GAA GGC AAG AGC ATA GAG 1068
Tyr Lys Pro Asp Asp Ile Leu Ile Ser Lys Glu Gly Lys Ser Ile Glu
320 325 330 335

GTC CGT AAT ACC GAC GCT GAG GGG CGT TTG GTT TTA GCG GAT TGT TTG 1116
Val Arg Asn Thr Asp Ala Glu Gly Arg Leu Val Leu Ala Asp Cys Leu
340 345 350

AGC TAC GCT CAA GAT TTA AAC CCT GAT GTG ATC GTG GAT TTT GCG ACC 1164
Ser Tyr Ala Gln Asp Leu Asn Pro Asp Val Ile Val Asp Phe Ala Thr
355 360 365

CTT ACT GGG GCA TGC GTT GTA GGC TTA GGC GAA TTC ACT TCA GCG ATC 1212
Leu Thr Gly Ala Cys Val Val Gly Leu Gly Glu Phe Thr Ser Ala Ile
370 375 380

ATG GGG CAT AAT GAA GAG TTA AAA AAC CTC TTT GAA ACT TCA GGG TTA 1260
Met Gly His Asn Glu Glu Leu Lys Asn Leu Phe Glu Thr Ser Gly Leu
385 390 395

GAA TCC GGC GAA TTA TTA GCC AAA CTC CCC TTT AAC CGC CAT TTA AAG 1308
Glu Ser Gly Glu Leu Leu Ala Lys Leu Pro Phe Asn Arg His Leu Lys
400 405 410 415

AAA TTG ATT GAA TCT AAA ATC GCT GAT GTG TGC AAT ATT TCT TCT TCA 1356
Lys Leu Ile Glu Ser Lys Ile Ala Asp Val Cys Asn Ile Ser Ser Ser
420 425 430

CGC TAT GGC GGT GCG ATC ACA GCG GGC TTG TTT TTA AAT GAA TTT ATT 1404
Arg Tyr Gly Gly Ala Ile Thr Ala Gly Leu Phe Leu Asn Glu Phe Ile
435 440 445

AGA GAT GAG TTT AAG GAT AAG TGG CTA CAC ATT GAC ATT GCA GGC CCT 1452
Arg Asp Glu Phe Lys Asp Lys Trp Leu His Ile Asp Ile Ala Gly Pro
450 455 460

GCT TAT GTG GAA AAA GAA TGG GAT GTG AAT AGC TTT GGA GCG AGT GGG 1500
Ala Tyr Val Glu Lys Glu Trp Asp Val Asn Ser Phe Gly Ala Ser Gly
465 470 475

GCT GGC GTG AGA GCT TGC ACA GCT TTT GTG GAA GAG CTT TTG AAA AAG 1548
Ala Gly Val Arg Ala Cys Thr Ala Phe Val Glu Glu Leu Leu Lys Lys
480 485 490 495

GCT TGAAATGGGC TTGTCTGTAG GCATTGTGGG TTTGCCTAAT GTGGGCAAAT CCAGCA 1607
Ala

CCTTTA 1613
(2) INFORMATION FOR SEQ ID NO: 736:

Thr Leu Lys Lys Leu Ala Phe Lys Ser Val Lys Val Gly Val Tyr Thr
80 85 90 95

TGT GGT GCA CAT TCT AAA GAT AAC GCG CTT TTA GAA AAC TTG AAA GCG 396
Cys Gly Ala His Ser Lys Asp Asn Ala Leu Leu Glu Asn Leu Lys Ala
100 105 110

CTG TTT TTG GGC TTG AAA TTA GGT TTG TAT GAA TAC GAC ACT TTT AAA 444
Leu Phe Leu Gly Leu Lys Leu Gly Leu Tyr Glu Tyr Asp Thr Phe Lys
115 120 125

TCC AAC AAA AAA GAA AGC GTT TTA AAA GAA GCC ATT GTC GCT TTA GAA 492
Ser Asn Lys Lys Glu Ser Val Leu Lys Glu Ala Ile Val Ala Leu Glu
130 135 140

TTG CAC AAA CCT TGC GAA AAA ACT TGC GCA AAT TCT TTA GAA AAG AGT 540
Leu His Lys Pro Cys Glu Lys Thr Cys Ala Asn Ser Leu Glu Lys Ser
145 150 155

GCT AAA GAA GCG TTA AAA TAC GCT GAA ATC ATG ACA GAA AGC TTG AAT 588
Ala Lys Glu Ala Leu Lys Tyr Ala Glu Ile Met Thr Glu Ser Leu Asn
160 165 170 175

ATC GTT AAA GAT CTA GTC AAT ACC CCC CCT ATG ATT GGC ACT CCG GTT 636
Ile Val Lys Asp Leu Val Asn Thr Pro Pro Met Ile Gly Thr Pro Val
180 185 190

TAT ATG GCT GAA GTG GCG CAA AAA GTG GCT AAA GAA AAC CAT TTA GAA 684
Tyr Met Ala Glu Val Ala Gln Lys Val Ala Lys Glu Asn His Leu Glu
195 200 205

ATC CAT GTT CAT GAT GAA AAA TTT TTA GAA GAA AAG AAA ATG AAC GCC 732
Ile His Val His Asp Glu Lys Phe Leu Glu Glu Lys Lys Met Asn Ala
210 215 220

TTT TTA GCG GTC AAT AAA GCC TCT CTT AGC GTC AAT CCT CCT CGC TTG 780
Phe Leu Ala Val Asn Lys Ala Ser Leu Ser Val Asn Pro Pro Arg Leu
225 230 235

ATC CAT TTA GTC TAT AAG CCT AAA AAA GCG AAG AAA AAA ATC GCT TTA 828
Ile His Leu Val Tyr Lys Pro Lys Lys Ala Lys Lys Lys Ile Ala Leu
240 245 250 255

GTG GGT AAG GGC TTG ACT TAT GAT TGT GGG GGT TTG AGC TTG AAA CCG 876
Val Gly Lys Gly Leu Thr Tyr Asp Cys Gly Gly Leu Ser Leu Lys Pro
260 265 270

GCC GAT TAC ATG GTT ACT ATG AAA GCG GAT AAA GGC GGT GGC TCT GCG 924
Ala Asp Tyr Met Val Thr Met Lys Ala Asp Lys Gly Gly Gly Ser Ala
275 280 285

GTG ATT GGG CTT TTA AAC GCA TTA GCC AAA CTA GGC GTG GAG GCT GAA 972
Val Ile Gly Leu Leu Asn Ala Leu Ala Lys Leu Gly Val Glu Ala Glu
290 295 300

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734:

Met Pro Gly Ile Lys Val Arg Glu Gly Asp Ala Phe Asp Glu Ala Tyr
1 5 10 15

Arg Arg Phe Lys Lys Gln Thr Asp Arg Asn Leu Val Val Thr Glu Cys
20 25 30

Arg Ala Arg Arg Phe Phe Glu Ser Lys Thr Glu Lys Arg Lys Lys Gln
35 40 45

Lys Ile Ser Ala Lys Lys Lys Val Leu Lys Arg Leu Tyr Met Leu Arg
50 55 60

Arg Tyr Glu Ser Arg Leu
65 70



(2) INFORMATION FOR SEQ ID NO: 735:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1613 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
(B) LOCATION: 64...1551
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735: 

TAAGAAAAAC CGCTAGAGTG CAATACAATT CTTGAAAGAT ATGAAATTAA AAAAGGAGAC 60

TTT ATG TTA AAA ATC AAA TTA GAA AAA ACC ACT TTT GAA AAC GCA AAA 108
Met Leu Lys Ile Lys Leu Glu Lys Thr Thr Phe Glu Asn Ala Lys
1 5 10 15

GCT GAA TGC AGT TTA GTT TTT ATT ATC AAT AAG GAT TTT AGC CAC GCT 156
Ala Glu Cys Ser Leu Val Phe Ile Ile Asn Lys Asp Phe Ser His Ala
20 25 30

TGG GTC AAA AAT AAA GAG TTG CTA GAA ACC TTT AAA TAC GAA GGC GAA 204
Trp Val Lys Asn Lys Glu Leu Leu Glu Thr Phe Lys Tyr Glu Gly Glu
35 40 45

GGC GTA TTT TTA GAC CAA GAA AAT AAA ATC CTG TAT GCG GGC GTT AAA 252
Gly Val Phe Leu Asp Gln Glu Asn Lys Ile Leu Tyr Ala Gly Val Lys
50 55 60

GAA GAT GAT GTG CAT TTA TTG AGA GAG AGC GCG TGT TTA GCC GTT CGC 300
Glu Asp Asp Val His Leu Leu Arg Glu Ser Ala Cys Leu Ala Val Arg
65 70 75

ACC CTT AAA AAA CTC GCT TTT AAA AGC GTT AAA GTG GGC GTT TAT ACT 348

(A) LENGTH: 373 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 103...312
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733: 

GAAAATTCTT GACACCCAGT TAATTCGTCT TATATAATAC TATTTCATTT GTAGCCATTG 60

TTAAGCAATG ATTTTAAGCT ATGGAAAAGA GGTGATAGCA GT ATG CCA GGG ATT 114
Met Pro Gly Ile
1

AAG GTT AGA GAA GGC GAT GCG TTT GAT GAA GCT TAT AGG AGA TTC AAA 162
Lys Val Arg Glu Gly Asp Ala Phe Asp Glu Ala Tyr Arg Arg Phe Lys
5 10 15 20

AAG CAA ACC GAT CGC AAT TTA GTG GTA ACA GAA TGC CGT GCT AGA AGG 210
Lys Gln Thr Asp Arg Asn Leu Val Val Thr Glu Cys Arg Ala Arg Arg
25 30 35

TTC TTT GAG TCT AAG ACT GAA AAA CGC AAA AAA CAA AAA ATC AGC GCT 258
Phe Phe Glu Ser Lys Thr Glu Lys Arg Lys Lys Gln Lys Ile Ser Ala
40 45 50

AAA AAG AAG GTT TTG AAG CGT CTT TAT ATG TTA AGG CGT TAT GAG TCA 306
Lys Lys Lys Val Leu Lys Arg Leu Tyr Met Leu Arg Arg Tyr Glu Ser
55 60 65

AGA CTA TAATAGACTT TAAGAAAAAT TTAAAAATTA AGGATTATTG AATAATGCAA TT 364
Arg Leu
70

CACAGGGAA 373

(2) INFORMATION FOR SEQ ID NO:734:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal

210 215 220

GTA GGN TTC TTT TGAGTGATCA CTCTAGTTAC ATCACTGGAG AGACTCTCAA AGTCA 789
Val Xaa Phe Phe 
225 



(2) INFORMATION FOR SEQ ID NO: 732 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:732: 

Met Gln Phe Thr Gly Lys Asn Val Leu Ile Thr Gly Ala Ser Lys Gly
1 5 10 15

Ile Gly Ala Glu Ile Ala Lys Thr Leu Ala Ser Met Gly Leu Lys Val
20 25 30

Trp Ile Asn Tyr Arg Ser Asn Ala Glu Val Ala Asp Ala Leu Lys Asn
35 40 45

Glu Leu Glu Glu Lys Gly Tyr Lys Ala Ala Val Ile Lys Phe Asp Ala
50 55 60

Ala Ser Glu Ser Asp Phe Ile Glu Ala Ile Gln Thr Ile Val Gln Ser
65 70 75 80

Asp Gly Gly Leu Ser Tyr Leu Val Asn Asn Ala Gly Val Val Arg Asp
85 90 95

Lys Leu Ala Ile Lys Met Lys Thr Glu Asp Phe His His Val Ile Asp
100 105 110

Asn Asn Leu Thr Ser Ala Phe Ile Gly Cys Arg Glu Ala Leu Lys Val
115 120 125

Met Ser Lys Ser Arg Phe Gly Ser Val Val Asn Val Ala Ser Ile Ile
130 135 140

Gly Glu Arg Gly Asn Met Gly Gln Thr Asn Tyr Ser Ala Ser Lys Gly
145 150 155 160

Gly Met Ile Ala Met Ser Lys Ser Phe Ala Tyr Glu Gly Ala Leu Arg
165 170 175

Asn Ile Arg Phe Asn Ser Val Thr Pro Gly Phe Ile Glu Thr Asp Met
180 185 190

Asn Ala Asn Leu Lys Asp Glu Leu Lys Ala Asp Tyr Val Lys Asn Ile
195 200 205

Pro Leu Asn Arg Leu Gly Ser Ala Lys Glu Val Ala Glu Ala Val Xaa
210 215 220

Phe Phe
225

ATGGCGGGCT TTATA 804

(2) INFORMATION FOR SEQ ID NO:733:



(i) SEQUENCE CHARACTERISTICS:

AAGGCGTTAT GAGTCAAGAC TATAATAGAC TTTAAGAAAA ATTTAAAAAT TAAGGATTAT 60

TGAATA ATG CAA TTC ACA GGG AAA AAT GTT CTC ATT ACT GGG GCT TCT 108
Met Gln Phe Thr Gly Lys Asn Val Leu Ile Thr Gly Ala Ser
1 5 10

AAA GGC ATT GGG GCT GAA ATC GCC AAA ACT CTC GCT TCT ATG GGG CTG 156
Lys Gly Ile Gly Ala Glu Ile Ala Lys Thr Leu Ala Ser Met Gly Leu
15 20 25 30

AAA GTT TGG ATC AAT TAC CGC AGT AAT GCT GAA GTG GCT GAC GCT TTG 204
Lys Val Trp Ile Asn Tyr Arg Ser Asn Ala Glu Val Ala Asp Ala Leu
35 40 45

AAA AAT GAG CTT GAA GAA AAA GGC TAT AAG GCA GCT GTC ATT AAA TTT 252
Lys Asn Glu Leu Glu Glu Lys Gly Tyr Lys Ala Ala Val Ile Lys Phe
50 55 60

GAT GCG GCT TCT GAA AGC GAT TTT ATT GAA GCG ATA CAA ACC ATC GTC 300
Asp Ala Ala Ser Glu Ser Asp Phe Ile Glu Ala Ile Gln Thr Ile Val
65 70 75

CAA AGC GAT GGG GGG TTG TCT TAC TTG GTG AAT AAC GCC GGT GTG GTG 348
Gln Ser Asp Gly Gly Leu Ser Tyr Leu Val Asn Asn Ala Gly Val Val
80 85 90

CGC GAT AAA TTA GCG ATC AAA ATG AAA ACA GAA GAC TTT CAC CAT GTC 396
Arg Asp Lys Leu Ala Ile Lys Met Lys Thr Glu Asp Phe His His Val
95 100 105 110

ATA GAC AAT AAC CTC ACT TCA GCC TTT ATA GGT TGC CGA GAG GCT TTA 444
Ile Asp Asn Asn Leu Thr Ser Ala Phe Ile Gly Cys Arg Glu Ala Leu
115 120 125

AAG GTG ATG AGC AAG AGT CGT TTT GGG AGC GTG GTC AAT GTC GCT TCT 492
Lys Val Met Ser Lys Ser Arg Phe Gly Ser Val Val Asn Val Ala Ser
130 135 140

ATC ATT GGT GAA AGA GGC AAT ATG GGG CAG ACA AAC TAC TCA GCG AGT 540
Ile Ile Gly Glu Arg Gly Asn Met Gly Gln Thr Asn Tyr Ser Ala Ser
145 150 155

AAG GGG GGA ATG ATT GCA ATG AGC AAG TCC TTT GCT TAT GAG GGA GCT 588
Lys Gly Gly Met Ile Ala Met Ser Lys Ser Phe Ala Tyr Glu Gly Ala
160 165 170

TTA AGG AAT ATT CGT TTC AAC TCT GTA ACG CCC GGT TTT ATA GAA ACC 636
Leu Arg Asn Ile Arg Phe Asn Ser Val Thr Pro Gly Phe Ile Glu Thr
175 180 185 190

GAC ATG AAC GCC AAT TTG AAA GAC GAA CTC AAA GCG GAT TAT GTT AAA 684
Asp Met Asn Ala Asn Leu Lys Asp Glu Leu Lys Ala Asp Tyr Val Lys
195 200 205

AAC ATT CCT TTA AAC AGG CTA GGG TCT GCT AAG GAA GTG GCA GAA GCG 732
Asn Ile Pro Leu Asn Arg Leu Gly Ser Ala Lys Glu Val Ala Glu Ala

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730: 



Met Ala Val Tyr Leu Asp Phe Glu Asn His Ile Lys Glu Ile Gln Asn
1 5 10 15

Glu Ile Glu Leu Ala Leu Ile Arg Gly Asp Glu Asp Ala Lys Glu Ile
20 25 30

Leu Glu Lys Arg Leu Asp Lys Glu Val Lys Ser Ile Tyr Ser Asn Leu
35 40 45

Thr Asp Phe Gln Lys Leu Gln Leu Ala Arg His Pro Asp Arg Pro Tyr
50 55 60

Ala Met Asp Tyr Ile Asp Leu Ile Leu Lys Asp Lys Tyr Glu Val Phe
65 70 75 80

Gly Asp Arg His Tyr Asn Asp Asp Lys Ala Ile Val Cys Phe Val Gly
85 90 95

Lys Ile Asp Asn Val Pro Val Val Val Ile Gly Glu Glu Lys Gly Arg
100 105 110

Gly Thr Lys Asn Lys Leu Leu Arg Asn Phe Gly Met Pro Asn Pro Cys
115 120 125

Gly Tyr Arg Lys Ala Leu Lys Met Ala Lys Phe Ala Glu Lys Phe Asn
130 135 140

Leu Pro Ile Leu Met Leu Val Asp Thr Ala Gly Ala Tyr Pro Gly Ile
145 150 155 160

Gly Ala Glu Glu Arg Gly Gln Ser Glu Ala Ile Ala Lys Asn Leu Gln
165 170 175

Glu Phe Ala Ser Leu Lys Val Pro Thr Ile Ser Val Ile Ile Gly Glu
180 185 190

Gly Gly Ser Gly Gly Ala Leu Arg Leu Gln Trp Leu Thr Asn Trp Leu
195 200 205



(2) INFORMATION FOR SEQ ID NO: 731: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 67...744
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731:

35 40 45

CTC ACT GAT TTT CAA AAA CTC CAA TTA GCA AGA CAC CCT GAG AGA CCC 253
Leu Thr Asp Phe Gln Lys Leu Gln Leu Ala Arg His Pro Asp Arg Pro
50 55 60

TAC GCT ATG GAT TAC ATT GAT CTC ATC TTA AAA GAT AAA TAT GAA GTC 301
Tyr Ala Met Asp Tyr Ile Asp Leu Ile Leu Lys Asp Lys Tyr Glu Val
65 70 75

TTT GGG GAT AGG CAT TAT T^C GAT GAT AAA GCG ATC GTG TGC TTT GTA 349
Phe Gly Asp Arg His Tyr Asn Asp Asp Lys Ala Ile Val Cys Phe Val
80 85 90 95

GGG AAA ATT GAT AAT GTC CCA GTT GTG GTG ATC GGA G7^ GAA AAG GGC 397
Gly Lys Ile Asp Asn Val Pro Val Val Val Ile Gly Glu Glu Lys Gly
100 105 110

AGA GGG ACT AAA AAC AAA CTC TTA AGA AAT TTT GGC ATG CCT AAC CCT 445
Arg Gly Thr Lys Asn Lys Leu Leu Arg Asn Phe Gly Met Pro Asn Pro
115 120 125

TGT GGC TAT CGT AAG GCT TTG AAA ATG GCA AAG TTT GCT GAA AAG TTT 493
Cys Gly Tyr Arg Lys Ala Leu Lys Met Ala Lys Phe Ala Glu Lys Phe
130 135 140

AAT TTG CCT ATT TTA ATG CTT GTG GAT ACA GCC GGG GCG TAT CCG GGG 541
Asn Leu Pro Ile Leu Met Leu Val Asp Thr Ala Gly Ala Tyr Pro Gly
145 150 155

ATT GGT GCA GAA GAA AGG GGG CAA AGT GAA GCG ATC GCT AAA AAT CTC 589
Ile Gly Ala Glu Glu Arg Gly Gln Ser Glu Ala Ile Ala Lys Asn Leu
160 165 170 175

CAA GAG TTC GCC TCT TTA AAA GTC CCT ACT ATT TCT GTA ATT ATC GGT 637
Gln Glu Phe Ala Ser Leu Lys Val Pro Thr Ile Ser Val Ile Ile Gly
180 185 190

GAG GGG GGC AGT GGT GGT GCG CTA CGA TTG CAG TGG CTG ACA AAT TGG 685
Glu Gly Gly Ser Gly Gly Ala Leu Arg Leu Gln Trp Leu Thr Asn Trp
195 200 205

CTA TGATGGAATA TTCCATTTTT AGCGTTATAT CCCCAGAAGG TTGTGCGGCG ATTCTT 744
Leu

TGGGATGACC CTAGCAAGAC TGAAGTGGCT ATTAAAGCGA TGAAAATCAC GCCTAGAGAC 804

TTAAAGGAGG CGGGGCTTAT TGATGATATT ATCTTAGAGC CTAGCAAAGG GGCTCATAGA 864
GACAAATTTT CAG 877

(2) INFORMATION FOR SEQ ID NO: 730:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid

690 695 700

Pro Asp Gly Val Ile Ile Pro Leu Ala Asn Ala Gln Ala Ala Gly Met
705 710 715 720

Leu Gly Glu Ala Gly Val Asp Gly Tyr Val Asn Asn His Phe Met Lys
725 730 735

Arg Ile Gly Phe Ala Val Ile Ala Ser Val Val Asn Ser Phe Leu Gln
740 745 750

Thr Ala Pro Ile Ile Ala Leu Asp Lys Leu Ile Gly Leu Gly Lys Gly
755 760 765

Arg Ser Glu Arg Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gln Ala Ile
770 775 780

Asn Gly Ser Met Gln Ser Ser Ala Gln Met Ser Asn Gln Ile Leu Gly
785 790 795 800

Gln Leu Met Asn Ile Pro Pro Ser Phe Tyr Lys Asn Glu Gly Asp Ser
805 810 815

Ile Lys Ile Leu Thr Met Asp Asp Ile Asp Phe Ser Gly Val Tyr Asp
820 825 830

Val Lys Ile Thr Asn Lys Ser Val Val Asp Glu Ile Ile Lys Gln Ser
835 840 845

Thr Lys Thr Leu Ser Arg Glu His Glu Glu Ile Thr Thr Ser Pro Lys
850 855 860

Gly Gly Asn
865

(2) INFORMATION FOR SEQ ID NO: 729:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 877 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence

(B) LOCATION: 65... 688 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729:

TTTGGTGGCA CTAATGGTGT TGTGATTTTC AAAAAAGCCT AGTTTTACAA AGTTAGGATT 60

TTGA ATG GCC GTT TAT TTA GAT TTT GAA AAT CAT ATT AAA GAG ATT CAA 109
Met Ala Val Tyr Leu Asp Phe Glu Asn His Ile Lys Glu Ile Gln
1 5 10 15

AAT GAA ATT GAA TTA GCC CTT ATT AGA GGC GAT GAG GAC GCT AAA GAA 157
Asn Glu Ile Glu Leu Ala Leu Ile Arg Gly Asp Glu Asp Ala Lys Glu
20 25 30

ATC TTA GAA AAA AGA TTG GAT AAG GAG GTT AAA AGC ATT TAT TCC AAT 205
Ile Leu Glu Lys Arg Leu Asp Lys Glu Val Lys Ser Ile Tyr Ser Asn

Leu Ser Gln Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu
260 265 270

Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gln Glu Val Lys Lys Ser
275 280 285

Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu
290 295 300

Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu
305 310 315 320

Ala Lys Glu Leu Gln Gln Lys Asp Lys Ala Ile Lys Asp Cys Leu Lys
325 330 335

Asn Ala Asp Pro Asn Asp Arg Ala Ala Ile Met Lys Cys Leu Asp Gly
340 345 350

Leu Ser Asp Glu Glu Lys Leu Lys Tyr Leu Gln Glu Ala Arg Glu Lys
355 360 365

Ala Val Ala Asp Cys Leu Ala Met Ala Lys Thr Asp Glu Glu Lys Arg
370 375 380

Lys Cys Gln Asn Leu Tyr Ser Asp Leu Ile Gln Glu Ile Gln Asn Lys
385 390 395 400

Arg Thr Gln Asn Lys Gln Asn Gln Leu Ser Lys Thr Glu Arg Leu His
405 410 415

Gln Ala Ser Glu Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp Gln Glu
420 425 430

Ala Ile Glu Gln Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg Ala Leu
435 440 445

Ile Leu Gly Ile Lys Arg Gln Ala Asp Glu Val Asp Leu Ile Tyr Ser
450 455 460

Asp Leu Arg Asn Arg Lys Thr Phe Asp Asn Met Ala Ala Lys Gly Tyr
465 470 475 480

Pro Leu Leu Pro Met Asp Phe Lys Asn Gly Gly Asp Ile Ala Thr Ile
485 490 495

Asn Ala Thr Asn Val Asp Ala Asp Lys Ile Ala Ser Asp Asn Pro Ile
500 505 510

Tyr Ala Ser Ile Glu Pro Asp Ile Ala Lys Gln Tyr Glu Thr Glu Lys
515 520 525

Thr Ile Lys Asp Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala Leu Gly
530 535 540

Gly Asn Lys Lys Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser Thr Ala
545 550 555 560

Glu Ala Lys Ala Glu Asn Asn Lys Ile Asp Lys Asp Val Ala Glu Thr
565 570 575

Ala Lys Asn Ile Ser Glu Ile Ala Leu Lys Asn Lys Lys Glu Lys Ser
580 585 590

Gly Glu Phe Val Asp Glu Asn Gly Asn Pro Ile Asp Asp Lys Lys Lys
595 600 605

Ala Glu Lys Gln Asp Glu Thr Ser Pro Val Lys Gln Ala Phe Ile Gly
610 615 620

Lys Ser Asp Pro Thr Phe Val Leu Ala Gln Tyr Thr Pro Ile Glu Ile
625 630 635 640

Thr Leu Thr Ser Lys Val Asp Ala Thr Leu Thr Gly Ile Val Ser Gly
645 650 655

Val Val Ala Lys Asp Val Trp Asn Met Asn Gly Thr Met Ile Leu Leu
660 665 670

Asp Lys Gly Thr Lys Val Tyr Gly Asn Tyr Gln Ser Val Lys Gly Gly
675 680 685

Thr Pro Ile Met Thr Arg Leu Met Ile Val Phe Thr Lys Ala Ile Thr

ACC AAA ACT TTG TCT AGA GAA CAT GAA GAA ATC ACC ACA AGC CCC AAA 2592
Thr Lys Thr Leu Ser Arg Glu His Glu Glu Ile Thr Thr Ser Pro Lys
850 855 860

GGT GGC AAT TAATTCAAGA GAAAGGATAA AATATATTCA TGTTACTAAA CTCGGTTCT 2650
Gly Gly Asn
865

TTACAAAATA AAAGACAAAA CCAACAACAG GCTCT 2685

(2) INFORMATION FOR SEQ ID NO: 728: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:728:

Leu Asp Cys Val Ser Gln Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys
1 5 10 15

Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys
20 25 30

Glu Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu
35 40 45

Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys
50 55 60

Leu Leu Glu Gln Gln Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Glu
65 70 75 80

Ala Asp Lys Lys Arg Cys Val Lys Asp Leu Pro Lys Asp Leu Gln Lys
85 90 95

Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser
100 105 110

Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr
115 120 125

Pro Glu Ala Lys Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu Lys Ala
130 135 140

Tyr Lys Asp Cys Leu Ser Gln Ala Arg Asn Glu Glu Glu Arg Arg Ala
145 150 155 160

Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gln Glu
165 170 175

Val Lys Lys Ser Ile Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg
180 185 190

Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala
195 200 205

Arg Lys Phe Leu Ala Lys Gln Val Leu Asn Cys Leu Glu Lys Ala Gly
210 215 220

Asn Glu Glu Glu Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys Asp Leu
225 230 235 240

Gln Glu Asn Ile Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys
245 250 255

AAG AGT GAT CCC ACA TTT GTT TTA GCG CAA TAC ACC CCC ATT GAA ATC 1920
Lys Ser Asp Pro Thr Phe Val Leu Ala Gln Tyr Thr Pro Ile Glu Ile
625 630 635 640

ACT CTG ACT TCT AAA GTA GAT GCC ACT CTC ACA GGT ATA GTG AGT GGG 1968
Thr Leu Thr Ser Lys Val Asp Ala Thr Leu Thr Gly Ile Val Ser Gly
645 650 655

GTT GTA GCC AAA GAT GTA TGG AAC ATG AAC GGC ACT ATG ATC TTA TTA 2016
Val Val Ala Lys Asp Val Trp Asn Met Asn Gly Thr Met Ile Leu Leu
660 665 670

GAC AAA GGC ACT AAG GTG TAT GGG AAT TAT CAA AGC GTG AAA GGT GGC 2064
Asp Lys Gly Thr Lys Val Tyr Gly Asn Tyr Gln Ser Val Lys Gly Gly
675 680 685

ACA CCC ATT ATG ACA CGC TTA ATG ATA GTC TTT ACT AAA GCC ATT ACG 2112
Thr Pro Ile Met Thr Arg Leu Met Ile Val Phe Thr Lys Ala Ile Thr
690 695 700

CCT GAT GGT GTG ATA ATA CCT CTA GCA AAC GCT CAA GCA GCA GGC ATG 2160
Pro Asp Gly Val Ile Ile Pro Leu Ala Asn Ala Gln Ala Ala Gly Met
705 710 715 720

TTG GGT GAA GCA GGG GTA GAT GGC TAT GTG AAT AAT CAC TTT ATG AAG 2208
Leu Gly Glu Ala Gly Val Asp Gly Tyr Val Asn Asn His Phe Met Lys
725 730 735

CGC ATA GGC TTT GCT GTG ATA GCA AGC GTG GTT AAT AGC TTC TTG CAA 2256
Arg Ile Gly Phe Ala Val Ile Ala Ser Val Val Asn Ser Phe Leu Gln
740 745 750

ACT GCG CCT ATC ATA GCT CTA GAT AAA CTC ATA GGC CTT GGC AAA GGT 2304
Thr Ala Pro Ile Ile Ala Leu Asp Lys Leu Ile Gly Leu Gly Lys Gly
755 760 765

AGA AGT GAA AGG ACA CCT GAA TTT AAT TAC GCT TTG GGT CAA GCT ATC 2352
Arg Ser Glu Arg Thr Pro Glu Phe Asn Tyr Ala Leu Gly Gln Ala Ile
770 775 780

AAT GGT AGC ATG CAA AGT TCA GCT CAG ATG TCT AAT CAA ATT CTA GGG 2400
Asn Gly Ser Met Gln Ser Ser Ala Gln Met Ser Asn Gln Ile Leu Gly
785 790 795 800

CAA CTG ATG AAT ATC CCC CCA AGT TTT TAC AAA AAC GAG GGC GAT AGT 2448
Gln Leu Met Asn Ile Pro Pro Ser Phe Tyr Lys Asn Glu Gly Asp Ser
805 810 815

ATT AAG ATT CTC ACA ATG GAC GAT ATT GAT TTT AGC GGT GTG TAT GAT 2496
Ile Lys Ile Leu Thr Met Asp Asp Ile Asp Phe Ser Gly Val Tyr Asp
820 825 830

GTT AAA ATT ACT AAC AAA TCT GTG GTA GAT GAA ATT ATC AAA CAA AGC 2544
Val Lys Ile Thr Asn Lys Ser Val Val Asp Glu Ile Ile Lys Gln Ser
835 840 845
WO 98/43478



PCT/US98/06371 



AGG ACA CAA AAC AAA CAA AAT CAA TTG AGT AAA ACA GAA AGG TTG CAT 1248
Arg Thr Gln Asn Lys Gln Asn Gln Leu Ser Lys Thr Glu Arg Leu His
405 410 415

CAA GCA AGC GAG TGC TTG GAT AAC TTA GAT GAC CCT ACT GAT CAA GAG 1296
Gln Ala Ser Glu Cys Leu Asp Asn Leu Asp Asp Pro Thr Asp Gln Glu
420 425 430

GCC ATA GAG CAA TGT TTA GAG GGC TTG AGC GAT AGT GAA AGG GCG CTA 1344
Ala Ile Glu Gln Cys Leu Glu Gly Leu Ser Asp Ser Glu Arg Ala Leu
435 440 445

ATT CTA GGA ATT AAA CGA CAA GCT GAT GAA GTG GAT CTG ATT TAT AGC 1392
Ile Leu Gly Ile Lys Arg Gln Ala Asp Glu Val Asp Leu Ile Tyr Ser
450 455 460

GAT CTA AGA AAC CGT AAA ACC TTT GAT AAC ATG GCG GCT AAA GGT TAT 1440
Asp Leu Arg Asn Arg Lys Thr Phe Asp Asn Met Ala Ala Lys Gly Tyr
465 470 475 480

CCA TTG TTA CCA ATG GAT TTC AAA AAT GGC GGC GAT ATT GCC ACT ATT 1488
Pro Leu Leu Pro Met Asp Phe Lys Asn Gly Gly Asp Ile Ala Thr Ile
485 490 495

AAC GCC ACT AAT GTT GAT GCG GAC AAA ATA GCT AGC GAT AAT CCT ATT 1536
Asn Ala Thr Asn Val Asp Ala Asp Lys Ile Ala Ser Asp Asn Pro Ile
500 505 510

TAT GCT TCC ATA GAG CCT GAT ATT GCC AAG CAA TAC GAA ACA GAA AAA 1584
Tyr Ala Ser Ile Glu Pro Asp Ile Ala Lys Gln Tyr Glu Thr Glu Lys
515 520 525

ACC ATT AAG GAT AAG AAT TTA GAA GCT AAA TTA GCT AAG GCT TTA GGT 1632
Thr Ile Lys Asp Lys Asn Leu Glu Ala Lys Leu Ala Lys Ala Leu Gly
530 535 540

GGC AAT AAA AAA GAT GAC GAT AAA GAA AAA AGT AAA AAA TCC ACA GCA 1680
Gly Asn Lys Lys Asp Asp Asp Lys Glu Lys Ser Lys Lys Ser Thr Ala
545 550 555 560

GAA GCT AAA GCA GAA AAC AAT AAG ATA GAC AAA GAT GTC GCA GAA ACT 1728
Glu Ala Lys Ala Glu Asn Asn Lys Ile Asp Lys Asp Val Ala Glu Thr
565 570 575

GCC AAG AAT ATC AGT GAA ATC GCT CTT AAG AAC AAA AAA GAA AAG AGT 1776
Ala Lys Asn Ile Ser Glu Ile Ala Leu Lys Asn Lys Lys Glu Lys Ser
580 585 590

GGG GAA TTT GTA GAT GAA AAT GGT AAT CCC ATT GAT GAC AAA AAG AAA 1824
Gly Glu Phe Val Asp Glu Asn Gly Asn Pro Ile Asp Asp Lys Lys Lys
595 600 605

GCA GAA AAA CAA GAT GAA ACA AGC CCT GTC AAA CAG GCC TTT ATA GGC 1872
Ala Glu Lys Gln Asp Glu Thr Ser Pro Val Lys Gln Ala Phe Ile Gly
610 615 620

GTT AAG AAA AGC ATT AAG GCT TAT TTG GAC TGC GTA TCA AGA GCT AGG 576
Val Lys Lys Ser Ile Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg
180 185 190

AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG 624
Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala
195 200 205

AGA AAA TTT TTA GCG AAG CAA GTG CTA AAT TGT TTG GAA AAA GCT GGA 672
Arg Lys Phe Leu Ala Lys Gln Val Leu Asn Cys Leu Glu Lys Ala Gly
210 215 220

AAT GAA GAA GAA AGA AAA GCA TGT CTT AAA AAT CTC CCT AAA GAC TTA 720
Asn Glu Glu Glu Arg Lys Ala Cys Leu Lys Asn Leu Pro Lys Asp Leu
225 230 235 240

CAG GAA AAT ATT TTA GCT AAA GAG AGT CTT AAA GCT TAT AAA GAC TGC 768
Gln Glu Asn Ile Leu Ala Lys Glu Ser Leu Lys Ala Tyr Lys Asp Cys
245 250 255

CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG AGA GCT TGC GAG AAA CTA 816
Leu Ser Gln Ala Arg Asn Glu Glu Glu Arg Arg Ala Cys Glu Lys Leu
260 265 270

CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG CAA GAA GTT AAG AAA AGC 864
Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gln Glu Val Lys Lys Ser
275 280 285

GTT AAG GCT TAT TTG GAC TGC GTA TCA AGA GCT AGG AAT GAA AAA GAG 912
Val Lys Ala Tyr Leu Asp Cys Val Ser Arg Ala Arg Asn Glu Lys Glu
290 295 300

AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG AGA AAA TTT TTA 960
Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Phe Leu
305 310 315 320

GCG AAA GAA CTC CAA CAA AAA GAT AAA GCG ATC AAA GAT TGC TTG AAA 1008
Ala Lys Glu Leu Gln Gln Lys Asp Lys Ala Ile Lys Asp Cys Leu Lys
325 330 335

AAC GCC GAT CCT AAC GAC AGA GCG GCT ATC ATG AAG TGT TTG GAT GGT 1056
Asn Ala Asp Pro Asn Asp Arg Ala Ala Ile Met Lys Cys Leu Asp Gly
340 345 350

TTG AGC GAT GAA GAG AAG CTC AAA TAC CTG CAA GAA GCT AGA GAA AAG 1104
Leu Ser Asp Glu Glu Lys Leu Lys Tyr Leu Gln Glu Ala Arg Glu Lys
355 360 365

GCT GTT GCG GAT TGT TTG GCT ATG GCT AAA ACC GAT GAA GAA AAA AGG 1152
Ala Val Ala Asp Cys Leu Ala Met Ala Lys Thr Asp Glu Glu Lys Arg
370 375 380

AAA TGC CAA AAC CTT TAT AGC GAT TTG ATC CAA GAA ATC CAA AAT AAA 1200
Lys Cys Gln Asn Leu Tyr Ser Asp Leu Ile Gln Glu Ile Gln Asn Lys
385 390 395 400

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...2601 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727: 

TTG GAT TGC GTA TCT CAA GCC AAA ACT GAA GCT GAG AAA AAA GAA TGC 48
Leu Asp Cys Val Ser Gln Ala Lys Thr Glu Ala Glu Lys Lys Glu Cys
1 5 10 15

GAG AAA TTA CTC ACC CCT GAA GCG AGA AAA CTC TTA GAA GAA GCT AAA 96
Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Glu Ala Lys
20 25 30

GAG AGC GTT AAA GCT TAT AAA GAC TGC GTA TCA AAA GCT AGG AAT GAA 144
Glu Ser Val Lys Ala Tyr Lys Asp Cys Val Ser Lys Ala Arg Asn Glu
35 40 45

AAA GAG AAA AAA GAA TGC GAG AAA TTA CTC ACG CCT GAA GCG AAA AAA 192
Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr Pro Glu Ala Lys Lys
50 55 60

CTT TTA GAG CAA CAA GTG CTA GAT TGT TTG AAA AAC GCT AAA ACC GAA 240
Leu Leu Glu Gln Gln Val Leu Asp Cys Leu Lys Asn Ala Lys Thr Glu
65 70 75 80

GCT GAT AAA AAA AGG TGT GTC AAA GAT CTC CCT AAA GAC TTG CAG AAA 288
Ala Asp Lys Lys Arg Cys Val Lys Asp Leu Pro Lys Asp Leu Gln Lys
85 90 95

AAG GTT TTA GCT AAA GAG AGC GTT AAG GCT TAT TTG GAC TGC GTA TCA 336
Lys Val Leu Ala Lys Glu Ser Val Lys Ala Tyr Leu Asp Cys Val Ser
100 105 110

AGA GCT AGG AAT GAA AAA GAG AAA AAA GAA TGC GAG AAA TTG CTC ACC 384
Arg Ala Arg Asn Glu Lys Glu Lys Lys Glu Cys Glu Lys Leu Leu Thr
115 120 125

CCT GAA GCG AAA AAA CTT TTA GAA GAA GCC AAA GAG AGT CTT AAA GCT 432
Pro Glu Ala Lys Lys Leu Leu Glu Glu Ala Lys Glu Ser Leu Lys Ala
130 135 140

TAT AAA GAC TGC CTC TCT CAA GCT AGA AAT GAA GAA GAA AGG AGA GCT 480
Tyr Lys Asp Cys Leu Ser Gln Ala Arg Asn Glu Glu Glu Arg Arg Ala
145 150 155 160

TGC GAG AAA CTA CTC ACG CCT GAA GCG AGA AAA CTC TTA GAG CAA GAA 528
Cys Glu Lys Leu Leu Thr Pro Glu Ala Arg Lys Leu Leu Glu Gln Glu
165 170 175

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726: 

Met Phe Lys Asp Phe Tyr Arg Thr Thr Leu Ser Phe Leu Lys Pro Leu
1 5 10 15

Leu Leu Leu Leu Val Leu Leu Leu Pro Phe Ser Leu Cys Ile Ala Asp
20 25 30

Glu Tyr Ile Ser Ile Ser Asp Asp Trp Asp Glu Ile Val Arg Asn His
35 40 45

Lys Thr Tyr Tyr Phe Glu Asn Gly Leu Asp His Phe Asn Gln Gly Gln
50 55 60

Tyr Gln Gln Ala Phe Lys Asp Phe Arg Leu Ala Gln Glu Tyr Ser Ile
65 70 75 80

Gly Leu Gly Ser Val Tyr Leu Ala Lys Met Tyr Leu Glu Gly Lys Gly
85 90 95

Val Lys Val Asp Tyr Lys Lys Ala Gln Phe Tyr Ala Glu Asn Ala Ile
100 105 110

Lys Gly Tyr Gly Ser Gly Leu Leu Gly Gly Ala Leu Ile Leu Gly Arg
115 120 125

Met Gln Ala Glu Gly Leu Gly Met Lys Lys Asp Leu Lys Gln Ala Leu
130 135 140

Lys Thr Tyr Arg His Val Val Arg Met Phe Ser Asn Lys Ser Thr Asn
145 150 155 160

Phe Ala Asn Asn Phe Arg Leu Pro Asn Leu Ala Glu Phe Thr Ser Met
165 170 175

Leu Ile Gly Ser Arg Phe Ile Asp Leu Ser Gly Leu Ser Ala Asn Pro
180 185 190

Ile Lys Phe Gly Lys Lys Phe Gly Ile Leu Val Lys Lys Ser Thr Gln
195 200 205

Ile Lys Asp Lys Thr Leu Leu Trp Glu Asp Ile Ala Glu Ile Ser Ser
210 215 220

Asn Ile Thr Leu Leu Lys Gln Gln Met Gly Glu Ile Leu Tyr Arg Ile
225 230 235 240

Gly Ile Ala Tyr Lys Glu Gly Leu Gly Thr Arg Lys Lys Lys Asp Arg
245 250 255

Ala Lys Lys Phe Leu Gln Lys Ser Ala Glu Phe Gly Tyr Glu Lys Ala
260 265 270

Met Glu Ala Leu
275



(2) INFORMATION FOR SEQ ID NO:727:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2685 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

70 75 80

GTT TAT TTA GCC AAA ATG TAT TTG GAG GGA AAG GGC GTG AAA GTG GAT 342
Val Tyr Leu Ala Lys Met Tyr Leu Glu Gly Lys Gly Val Lys Val Asp
85 90 95 100

TAC AAA AAA GCA CAA TTT TAT GCA GAA AAC GCT ATC AAA GGG TAT GGG 390
Tyr Lys Lys Ala Gln Phe Tyr Ala Glu Asn Ala Ile Lys Gly Tyr Gly
105 110 115

AGC GGA TTG TTA GGG GGT GCT CTT ATT TTA GGA CGC ATG CAA GCA GAA 438
Ser Gly Leu Leu Gly Gly Ala Leu Ile Leu Gly Arg Met Gln Ala Glu
120 125 130

GGC TTA GGG ATG AAA AAG GAT TTG AAA CAA GCG CTC AAG ACT TAT AGG 486
Gly Leu Gly Met Lys Lys Asp Leu Lys Gln Ala Leu Lys Thr Tyr Arg
135 140 145

CAT GTG GTT CGC ATG TTT TCT AAT AAA AGC ACA AAT TTT GCT AAC AAT 534
His Val Val Arg Met Phe Ser Asn Lys Ser Thr Asn Phe Ala Asn Asn
150 155 160

TTT AGA TTA CCA AAC CTT GCG GAA TTT ACT AGT ATG CTT ATT GGA TCG 582
Phe Arg Leu Pro Asn Leu Ala Glu Phe Thr Ser Met Leu Ile Gly Ser
165 170 175 180

CGA TTC ATT GAT CTT TCA GGT TTG AGC GCG AAT CCT ATA AAA TTT GGA 630
Arg Phe Ile Asp Leu Ser Gly Leu Ser Ala Asn Pro Ile Lys Phe Gly
185 190 195

AAG AAA TTT GGA ATA CTT GTT AAG AAA TCC ACT CAA ATC AAA GAT AAG 678
Lys Lys Phe Gly Ile Leu Val Lys Lys Ser Thr Gln Ile Lys Asp Lys
200 205 210

ACA CTT CTT TGG GAA GAT ATT GCT GAA ATT TCA AGC AAT ATT ACT TTA 726
Thr Leu Leu Trp Glu Asp Ile Ala Glu Ile Ser Ser Asn Ile Thr Leu
215 220 225

CTC AAA CAA CAA ATG GGG GAG ATC CTT TAT AGG ATT GGG ATC GCT TAT 774
Leu Lys Gln Gln Met Gly Glu Ile Leu Tyr Arg Ile Gly Ile Ala Tyr
230 235 240

AAA GAA GGG CTT GGC ACT AGA AAG AAA AAG GAC AGG GCT AAA AAA TTC 822
Lys Glu Gly Leu Gly Thr Arg Lys Lys Lys Asp Arg Ala Lys Lys Phe
245 250 255 260

CTG CAA AAA TCC GCA GAA TTT GGC TAT GAA AAA GCC ATG GAA GCT CTG T 871
Leu Gln Lys Ser Ala Glu Phe Gly Tyr Glu Lys Ala Met Glu Ala Leu
265 270 275

AGTTTTTTAA TCAAACTTGT ATCAAGCTTG ACTGAATGGG TTAGAAAAAT CCGCTTAGAT 931
TATCCTAATT AAAACAAAGC CCTATTTTTC TTATTTTTCC ATTATAATAG ACACTTGAT 990

(2) INFORMATION FOR SEQ ID NO:726:

Met Lys Phe Asn Ala Phe Lys Glu Tyr Lys Lys Arg Val Phe Ala Lys
225 230 235 240

Asn Glu Lys Lys Asn Ile Ala Phe Ser Ser Ile Asn Val Ile Pro Tyr
245 250 255

Pro Asn Ser Gln Asn Lys Arg Leu Phe Tyr Val Val Phe Asp Gln Asp
260 265 270

Tyr Lys Ala Tyr Gln His Asn Lys Leu Ser Tyr Ser Ser Asn Ser Gln
275 280 285

Lys Glu Leu Tyr Ile Glu Ile Glu Asn Asn Gln Val Ser Ile Ile Met
290 295 300

Glu Lys
305

(2) INFORMATION FOR SEQ ID NO:725:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 43...870
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:725: 

AAAACAAGCT AAAGAATGGC TCAAATTGTA AAAGGATCTG AC ATG TTT AAA GAT 54
Met Phe Lys Asp
1

TTT TAT CGC ACC ACC CTC TCT TTT TTA AAG CCT TTA TTG CTT TTA CTA 102
Phe Tyr Arg Thr Thr Leu Ser Phe Leu Lys Pro Leu Leu Leu Leu Leu
5 10 15 20

GTT TTA TTA TTG CCG TTT TCA CTT TGT ATA GCT GAT GAA TAT ATT AGC 150
Val Leu Leu Leu Pro Phe Ser Leu Cys Ile Ala Asp Glu Tyr Ile Ser
25 30 35

ATA AGT GAT GAT TGG GAT GAA ATT GTG CGA AAT CAT AAG ACA TAT TAT 198
Ile Ser Asp Asp Trp Asp Glu Ile Val Arg Asn His Lys Thr Tyr Tyr
40 45 50

TTT GAA AAT GGT TTA GAC CAT TTT AAT CAA GGC CAA TAC CAG CAA GCC 246
Phe Glu Asn Gly Leu Asp His Phe Asn Gln Gly Gln Tyr Gln Gln Ala
55 60 65

TTT AAA GAT TTT AGA TTG GCG CAA GAA TAC AGC ATC GGG CTT GGC AGT 294
Phe Lys Asp Phe Arg Leu Ala Gln Glu Tyr Ser Ile Gly Leu Gly Ser

TCT CAG AAC AAA CGC TTG TTT TAT GTG GTG TTT GAC CAA GAT TAT AAA 872
Ser Gln Asn Lys Arg Leu Phe Tyr Val Val Phe Asp Gln Asp Tyr Lys
260 265 270

GCC TAC CAG CAT AAC AAG CTC TCT TAT AGC TCC AAT TCT CAA AAA GAA 920
Ala Tyr Gln His Asn Lys Leu Ser Tyr Ser Ser Asn Ser Gln Lys Glu
275 280 285 290

CTC TAT ATA GAG ATT GAA AAC AAT CAA GTG TCT ATT ATA ATG GAA AAA T 969
Leu Tyr Ile Glu Ile Glu Asn Asn Gln Val Ser Ile Ile Met Glu Lys
295 300 305

AAGAAAAATA GGGCTTTGTT TTAATTAGGA TAATCTAAGC GGATTTTTCT AA 1021

(2) INFORMATION FOR SEQ ID NO: 724: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:724: 

Met Arg Leu Tyr Gln Lys Gln Gly Leu Glu Met Val Gly Gln Lys Leu
1 5 10 15

Asp Ser Tyr Leu Ala Asp Lys Ser Phe Trp Ala Glu Glu Leu Gln Asn
20 25 30

Lys Asp Thr Asp Phe Gly Tyr Tyr Gln Asn Lys Gln Phe Leu Phe Val
35 40 45

Ala Asn Lys Ser Lys Pro Ser Leu Glu Phe Tyr Glu Ile Glu Asn Asn
50 55 60

Met Leu Lys Lys Ile Asn Ser Ser Lys Ala Leu Val Gly Ser Lys Lys
65 70 75 80

Gly Asp Lys Thr Leu Glu Gly Asp Leu Ala Thr Pro Ile Gly Val Tyr
85 90 95

Arg Ile Thr Gln Lys Leu Glu Arg Leu Asp Gln Tyr Tyr Gly Val Leu
100 105 110

Ala Phe Val Thr Asn Tyr Pro Asn Leu Tyr Asp Thr Leu Lys Lys Arg
115 120 125

Thr Gly His Gly Ile Trp Val His Gly Met Pro Leu Asn Gly Asp Arg
130 135 140

Asn Glu Leu Asn Thr Lys Gly Cys Ile Ala Ile Glu Asn Pro Leu Leu
145 150 155 160

Ser Ser Tyr Asp Lys Val Leu Lys Gly Glu Lys Ala Phe Leu Ile Thr
165 170 175

Tyr Glu Asp Lys Phe Phe Pro Ser Thr Lys Glu Glu Leu Ser Met Ile
180 185 190

Leu Ser Ser Leu Phe Gln Trp Lys Glu Ala Trp Ala Arg Gly Asp Phe
195 200 205

Glu Arg Tyr Met Arg Phe Tyr Asn Pro Asn Phe Thr Arg Tyr Asp Gly
210 215 220

Thr Asp Phe Gly Tyr Tyr Gln Asn Lys Gln Phe Leu Phe Val Ala Asn
35 40 45 50

AAA TCC AAG CCC AGT TTG GAG TTT TAT GAG ATA GAA AAT AAC ATG CTT 248
Lys Ser Lys Pro Ser Leu Glu Phe Tyr Glu Ile Glu Asn Asn Met Leu
55 60 65

AAA AAA ATC AAC AGC TCT AAA GCT CTT GTA GGC TCT AAA AAG GGC GAT 296
Lys Lys Ile Asn Ser Ser Lys Ala Leu Val Gly Ser Lys Lys Gly Asp
70 75 80

AAG ACT TTA GAG GGC GAT TTG GCC ACG CCT ATT GGA GTG TAT CGT ATC 344
Lys Thr Leu Glu Gly Asp Leu Ala Thr Pro Ile Gly Val Tyr Arg Ile
85 90 95

ACG CAG AAA TTA GAG CGC TTG GAT CAA TAT TAT GGC GTT TTG GCT TTT 392
Thr Gln Lys Leu Glu Arg Leu Asp Gln Tyr Tyr Gly Val Leu Ala Phe
100 105 110

GTA ACG AAT TAC CCT AAT TTG TAT GAT ACC TTG AAA AAA CGC ACC GGG 440
Val Thr Asn Tyr Pro Asn Leu Tyr Asp Thr Leu Lys Lys Arg Thr Gly
115 120 125 130

CAT GGC ATT TGG GTG CAT GGA ATG CCT TTA AAT GGC GAT CGG AAT GAA 488
His Gly Ile Trp Val His Gly Met Pro Leu Asn Gly Asp Arg Asn Glu
135 140 145

TTG AAC ACC AAG GGC TGT ATT GCG ATT GAA AAC CCG CTT TTA AGC TCT 536
Leu Asn Thr Lys Gly Cys Ile Ala Ile Glu Asn Pro Leu Leu Ser Ser
150 155 160

TAT GAC AAA GTG TTA AAA GGC GAA AAA GCG TTC CTC ATC ACC TAT GAA 584
Tyr Asp Lys Val Leu Lys Gly Glu Lys Ala Phe Leu Ile Thr Tyr Glu
165 170 175

GAC AAG TTT TTC CCA AGC ACC AAA GAA GAA TTG AGC ATG ATT TTA AGC 632
Asp Lys Phe Phe Pro Ser Thr Lys Glu Glu Leu Ser Met Ile Leu Ser
180 185 190

TCC CTT TTT CAA TGG AAA GAA GCC TGG GCT AGG GGC GAT TTT GAA CGC 680
Ser Leu Phe Gln Trp Lys Glu Ala Trp Ala Arg Gly Asp Phe Glu Arg
195 200 205 210

TAC ATG CGT TTT TAT AAC CCC AAT TTC ACT CGC TAT GAC GGC ATG AAA 728
Tyr Met Arg Phe Tyr Asn Pro Asn Phe Thr Arg Tyr Asp Gly Met Lys
215 220 225

TTT AAC GCT TTT AAA GAG TAT AAA AAA AGG GTG TTT GCA AAA AAC GAA 776
Phe Asn Ala Phe Lys Glu Tyr Lys Lys Arg Val Phe Ala Lys Asn Glu
230 235 240

AAA AAG AAT ATC GCT TTT TCC TCT ATC AAT GTG ATC CCT TAC CCC AAC 824
Lys Lys Asn Ile Ala Phe Ser Ser Ile Asn Val Ile Pro Tyr Pro Asn
245 250 255

AATGCGAACA ATTATGGGAT GATATTATAA AAATTGGTGG GAATGATAAG AT 376

(2) INFORMATION FOR SEQ ID NO:722: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:722: 

Met His Ala Val Lys Ser Ala Lys Ser Arg Ser Val Ile Met Glu Ala
1 5 10 15

Asp Ala Thr Thr Leu Leu Gly Phe Phe Glu Glu Asn Gln Asn Asn Gln
20 25 30

Phe Val Ile Pro Ile Tyr Gln Arg Leu Tyr Ser Trp Lys Lys Glu
35 40 45

(2) INFORMATION FOR SEQ ID NO:723:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...968
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:723: 

TATTAATGGG GTTTGTGGGA TTGAATGCTA GTGATCGTTT GTTAGAAATC ATG CGC 56
Met Arg
1

CTT TAT CAA AAA CAA GGC TTG GAA ATG GTG GGT CAA AAG TTG GAT TCT 104
Leu Tyr Gln Lys Gln Gly Leu Glu Met Val Gly Gln Lys Leu Asp Ser
5 10 15

TAT TTA GCG GAT AAA TCT TTT TGG GCA GAA GAA CTT CAA AAC AAG GAC 152
Tyr Leu Ala Asp Lys Ser Phe Trp Ala Glu Glu Leu Gln Asn Lys Asp
20 25 30

ACG GAT TTT GGC TAT TAT CAA AAC AAG CAG TTT TTA TTT GTG GCT AAT 200

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:720:

Met Phe Lys Gly Ile Tyr Pro Met Arg Asn Phe Pro Ile His His Asn
1 5 10 15

Gly Phe Lys His Glu Val Leu Ala His Met Leu Lys Arg His Lys Glu
20 25 30

Pro Phe Ile Leu Ser Tyr Asn Asp Cys Glu Phe Val Arg Asn Ala Tyr
35 40 45

Lys Asp Phe Lys Ile Leu Glu Pro Ser Trp Gln Tyr Thr Met Gly Gln
50 55 60

Gly Glu Ile Arg Met Gly Lys Asn Arg Leu Glu Arg Gly Asp Asn Asn
65 70 75 80

His Val Lys Gln Ser His Glu Leu Leu Ile Ile Lys Glu
85 90



(2) INFORMATION FOR SEQ ID NO: 721: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 183...323
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721: 

ATCCACAAAT TCCACTTCAT TTTCTTTGCA AAATTCAAAA AATTCTTTGA TCTTGCTTTC 60

ACTATTTTGA GTTCTTACTA TCATTAACCA CCTTTAATTG TTATGAATTG AGTTTGATTG 120

ATAGGGGTGA TTATAGCATT TATGGGGCAA AAAAAGTAGA ATCTGTATCA AGTTTTATTA 180

AG ATG CAT GCG GTA AAA TCC GCT AAA TCA AGG AGT GTT ATT ATG GAA 227
Met His Ala Val Lys Ser Ala Lys Ser Arg Ser Val Ile Met Glu
1 5 10 15

GCA GAC GCA ACC ACA CTA TTA GGA TTT TTT GAA GAA AAT CAA AAC AAT 275
Ala Asp Ala Thr Thr Leu Leu Gly Phe Phe Glu Glu Asn Gln Asn Asn
20 25 30

CAA TTT GTC ATT CCT ATC TAT CAG AGG TTG TAT AGT TGG AAA AAG GAA T 324
Gln Phe Val Ile Pro Ile Tyr Gln Arg Leu Tyr Ser Trp Lys Lys Glu
35 40 45

(2) INFORMATION FOR SEQ ID NO:719:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 382 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...329
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719: 

CCTAATGATT TTTTCTATCT TGCCCCCCTT ATGTGTTAGA AAATTCTAAA ATG TTT 56
Met Phe
1

AAG GGG ATT TAT CCT ATG CGT AAT TTT CCT ATC CAC CAT AAT GGT TTT 104
Lys Gly Ile Tyr Pro Met Arg Asn Phe Pro Ile His His Asn Gly Phe
5 10 15

AAA CAT GAA GTG TTA GCT CAC ATG CTA AAA AGG CAT AAA GAG CCA TTT 152
Lys His Glu Val Leu Ala His Met Leu Lys Arg His Lys Glu Pro Phe
20 25 30

ATT TTA AGC TAT AAT GAC TGC GAA TTT GTA AGG AAT GCT TAT AAA GAT 200
Ile Leu Ser Tyr Asn Asp Cys Glu Phe Val Arg Asn Ala Tyr Lys Asp
35 40 45 50

TTT AAA ATT TTA GAA CCA TCT TGG CAA TAC ACT ATG GGA CAA GGC GAG 248
Phe Lys Ile Leu Glu Pro Ser Trp Gln Tyr Thr Met Gly Gln Gly Glu
55 60 65

ATC AGA ATG GGT AAA AAT CGC TTA GAA AGA GGC GAT AAT AAC CAT GTC 296
Ile Arg Met Gly Lys Asn Arg Leu Glu Arg Gly Asp Asn Asn His Val
70 75 80

AAA CAA TCT CAT GAG TTA TTG ATT ATC AAG GAG TAAAAATGCA TATTAGCGAA 349
Lys Gln Ser His Glu Leu Leu Ile Ile Lys Glu
85 90

GTCAAAACTG CCTTTAAAAT CGCTGATGTA GAA 382

(2) INFORMATION FOR SEQ ID NO:720:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids

(B) TYPE: amino acid

20 25 30

Ala Asn Gln Gly Lys Phe Phe Leu Arg Ile Glu Asp Thr Asp Leu Ser
35 40 45

Arg Asn Ser Ile Glu Ala Ala Asn Ala Ile Ile Glu Ala Phe Lys Trp
50 55 60

Val Gly Leu Glu Tyr Asp Gly Glu Ile Leu Tyr Gln Ser Lys Arg Phe
65 70 75 80

Glu Ile Tyr Lys Glu Tyr Ile Gln Lys Leu Leu Asp Glu Asp Lys Ala
85 90 95

Tyr Tyr Cys Tyr Met Ser Lys Glu Glu Leu Asp Ala Leu Arg Glu Glu
100 105 110

Gln Lys Ala Arg Lys Glu Thr Pro Arg Tyr Asp Asn Arg Tyr Arg Asp
115 120 125

Phe Lys Gly Thr Pro Pro Lys Gly Ile Glu Pro Val Val Arg Ile Lys
130 135 140

Val Pro Gln Asn Glu Val Ile Gly Phe Asn Asp Gly Val Lys Gly Glu
145 150 155 160

Val Lys Val Asn Thr Asn Glu Leu Asp Asp Phe Ile Ile Ala Arg Ser
165 170 175

Asp Gly Thr Pro Thr Tyr Asn Phe Val Val Thr Ile Asp Asp Ala Leu
180 185 190

Met Gly Ile Thr Asp Val Ile Arg Gly Asp Asp His Leu Ser Asn Thr
195 200 205

Pro Lys Gln Ile Val Leu Tyr Lys Ala Leu Asn Phe Lys Ile Pro Asn
210 215 220

Phe Phe His Val Pro Met Ile Leu Asn Glu Glu Gly Gln Lys Leu Ser
225 230 235 240

Lys Arg His Gly Ala Thr Asn Val Met Asp Tyr Gln Glu Met Gly Tyr
245 250 255

Leu Lys Glu Ala Leu Val Asn Phe Leu Ala Arg Leu Gly Trp Ser Tyr
260 265 270

Gln Asp Lys Glu Val Phe Ser Met Gln Glu Leu Leu Glu Leu Phe Asp
275 280 285

Pro Lys Asp Leu Asn Ser Ser Pro Ser Cys Phe Ser Trp His Lys Leu
290 295 300

Asn Trp Leu Asn Ala His Tyr Leu Lys Asn Gln Ser Val Gln Glu Leu
305 310 315 320

Leu Lys Leu Leu Lys Pro Phe Ser Phe Ser Asp Leu Ser His Leu Asn
325 330 335

Pro Thr Gln Leu Asp Arg Leu Leu Asp Ala Leu Lys Glu Arg Ser Gln
340 345 350

Thr Leu Lys Glu Leu Ala Leu Lys Ile Asp Glu Val Leu Ile Ala Pro
355 360 365

Val Glu Tyr Glu Glu Lys Val Phe Lys Lys Leu Asn Gln Ala Leu Val
370 375 380

Met Pro Leu Leu Glu Lys Phe Lys Leu Glu Leu Asn Lys Ala Asn Phe
385 390 395 400

Asn Asp Glu Ser Ala Leu Glu Asn Ala Met Arg Gln Ile Ile Glu Glu
405 410 415

Glu Lys Ile Lys Ala Gly Ser Phe Met Gln Pro Leu Arg Leu Ala Leu
420 425 430

Leu Gly Lys Gly Gly Gly Ile Gly Leu Lys Glu Ala Leu Phe Ile Leu
435 440 445

Gly Lys Thr Glu Ser Val Lys Arg Ile Glu Asp Phe Leu Lys Asn
450 455 460

CTT TTA AAG CCT TTT AGT TTT AGC GAT CTC TCG CAT TTA AAC CCC ACT 1065
Leu Leu Lys Pro Phe Ser Phe Ser Asp Leu Ser His Leu Asn Pro Thr
325 330 335

CAA TTG GAT CGC TTG TTA GAC GCT CTC AAA GAA AGA TCT CAA ACA CTA 1113
Gln Leu Asp Arg Leu Leu Asp Ala Leu Lys Glu Arg Ser Gln Thr Leu
340 345 350

AAA GAA TTA GCC CTT AAA ATA GAT GAG GTT TTA ATC GCC CCT GTG GAG 1161
Lys Glu Leu Ala Leu Lys Ile Asp Glu Val Leu Ile Ala Pro Val Glu
355 360 365 370

TAT GAA GAA AAG GTT TTT AAA AAA CTC AAT CAA GCG CTC GTT ATG CCC 1209
Tyr Glu Glu Lys Val Phe Lys Lys Leu Asn Gln Ala Leu Val Met Pro
375 380 385

TTG TTA GAA AAG TTT AAG CTA GAA TTA AAC AAA GCC AAT TTC AAC GAT 1257
Leu Leu Glu Lys Phe Lys Leu Glu Leu Asn Lys Ala Asn Phe Asn Asp
390 395 400

GAA AGC GCG CTA GAA AAC GCC ATG CGC CAA ATC ATT GAA GAA GAA AAG 1305
Glu Ser Ala Leu Glu Asn Ala Met Arg Gln Ile Ile Glu Glu Glu Lys
405 410 415

ATT AAA GCG GGT AGT TTT ATG CAG CCT TTA AGA TTG GCC CTT TTG GGT 1353
Ile Lys Ala Gly Ser Phe Met Gln Pro Leu Arg Leu Ala Leu Leu Gly
420 425 430

AAG GGA GGC GGG ATA GGC CTT AAA GAA GCG CTT TTT ATT TTA GGC AAA 1401
Lys Gly Gly Gly Ile Gly Leu Lys Glu Ala Leu Phe Ile Leu Gly Lys
435 440 445 450

ACA GAG AGC GTC AAA AGA ATA GAG GAT TTT TTG AAA AAC TAAAAAATTG GC 1452
Thr Glu Ser Val Lys Arg Ile Glu Asp Phe Leu Lys Asn
455 460

TCTGTTTTC^ TTGGGTATTA AAATAAAAGA GTTTGATTTT AGCGGTTGTT AAGCAACTCC 1512
TTTTGAAAAA TAAGAGGG 1530

(2) INFORMATION FOR SEQ ID NO: 718: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718: 

Met Ser Leu Ile Val Thr Arg Phe Ala Pro Ser Pro Thr Gly Tyr Leu
1 5 10 15

His Ile Gly Gly Leu Arg Thr Ala Ile Phe Asn Tyr Leu Phe Ala Arg
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TGT TAG ATG AGC AAA GAA GAG TTG GAG GCT TTG AGA GAA GAG CAA AAA 3 93 

Cys Tyr Met Ser Lys Glu Glu Leu Asp Ala Leu Arg Glu Glu Gin Lys 
100 105 110 

GCC AGG AAA GAA ACC CCA CGC TAT GAC AAT CGC TAT GGT GAT TTT AAA 441 
Ala Arg Lys Glu Thr Pro Arg Tyr Asp Asn Arg Tyr Arg Asp Phe Lys 
115 120 125 130 

GGC ACG GCT GGT AAA GGC ATA GAG GGT GTG GTA AGG ATT AAA GTC CGC 4 89 

Gly Thr Pro Pro Lys Gly lie Glu Pro Val Val Arg lie Lys Val Pro 
135 140 145 

CAA AAT GAG GTG ATT GGT TTT AAT GAC GGG GTT AAA GGC GAA GTG AAA 53 7 

Gin Asn Glu Val lie Gly Phe Asn Asp Gly Val Lys Gly Glu Val Lys 
150 155 160 

GTG AAT ACT AAC GAA TTA GAC GAT TTT ATT ATG GGG AGG AGC GAT GGG 5 85 

Val Asn Thr Asn Glu Leu Asp Asp Phe lie lie Ala Arg Ser Asp Gly 
165 170 175 

ACA CGC ACT TAT AAC TTT GTG GTT ACT ATT GAT GAG GGT TTA ATG GGG 633

Thr Pro Thr Tyr Asn Phe Val Val Thr Ile Asp Asp Ala Leu Met Gly
180 185 190

ATT ACT GAT GTG ATT AGA GGC GAT GAT CAC CTT TGT AAC ACG GCT AAA 681
Ile Thr Asp Val Ile Arg Gly Asp Asp His Leu Ser Asn Thr Pro Lys
195 200 205 210

CAA ATG GTT CTT TAT AAG GCT TTG AAT TTT AAA ATG GCT AAT TTT TTG 729

Gln Ile Val Leu Tyr Lys Ala Leu Asn Phe Lys Ile Pro Asn Phe Phe
215 220 225

CAT GTG GGG ATG ATT TTG AAT GAA GAA GGG CAA AAA TTA AGC AAA CGC 777
His Val Pro Met Ile Leu Asn Glu Glu Gly Gln Lys Leu Ser Lys Arg
230 235 240

CAT GGG GCC ACT AAT GTG ATG GAC TAT CAA GAA ATG GGC TAT CTT AAG 825

His Gly Ala Thr Asn Val Met Asp Tyr Gln Glu Met Gly Tyr Leu Lys
245 250 255 

GAA GCT TTA GTG AAT TTT TTA GGG GGT TTG GGG TGG AGC TAT GAG GAT 873 
Glu Ala Leu Val Asn Phe Leu Ala Arg Leu Gly Trp Ser Tyr Gln Asp
260 265 270 

AAA GAG GTT TTT AGC ATG GAA GAA TTG CTA GAA TTA TTT GAT GCT AAA 921 
Lys Glu Val Phe Ser Met Gln Glu Leu Leu Glu Leu Phe Asp Pro Lys
275 280 285 290 

GAT TTG AAT TGT TGG CGC AGT TGG TTG AGC TGG CAC AAG CTT AAT TGG 969

Asp Leu Asn Ser Ser Pro Ser Cys Phe Ser Trp His Lys Leu Asn Trp 
295 300 305 

GTG AAC GCT GAT TAT TTA AAA AAC CAA AGT GTG GAA GAA TTG TTA AAA 1017 
Leu Asn Ala His Tyr Leu Lys Asn Gln Ser Val Gln Glu Leu Leu Lys
310 315 320

Asp Glu Asn Tyr Cys His Tyr Glu Val Ile Ser Gln Thr Ile Ser Leu

115 120 125 

Pro Lys Asp Cys Leu Asn Pro Thr Phe Lys Leu Asp Phe Asn Gln Asn

130 135 140 

Lys Lys Phe 
145 

(2) INFORMATION FOR SEQ ID NO: 717: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52...1440
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717: 

TTAAGCCATT TTTGAATACA ATAAGAGTAT TTTATTTTTA AGGTTAGAAC A ATG AGT 57

Met Ser
1

TTG ATC GTT ACG CGC TTC GCT CCA TCG CCC ACT GGC TAG CTC CAC ATA 105

Leu Ile Val Thr Arg Phe Ala Pro Ser Pro Thr Gly Tyr Leu His Ile
5 10 15

GGA GGC TTA AGA ACA GCC ATT TTC AAT TAT CTT TTT GCA CGA GCC AAT 153

Gly Gly Leu Arg Thr Ala Ile Phe Asn Tyr Leu Phe Ala Arg Ala Asn
20 25 30

CAA GGA AAA TTT TTT TTA CGC ATT GAA GAC ACG GAT TTG AGC CGT AAC 201

Gln Gly Lys Phe Phe Leu Arg Ile Glu Asp Thr Asp Leu Ser Arg Asn
35 40 45 50

TCT ATA GAA GCG GCT AAC GCC ATT ATA GAA GCT TTC AAA TGG GTA GGG 249

Ser Ile Glu Ala Ala Asn Ala Ile Ile Glu Ala Phe Lys Trp Val Gly
55 60 65

CTA GAA TAG GAT GGC GAA ATC CTC TAC CAA TCC AAA CGC TTT GAG ATT 297

Leu Glu Tyr Asp Gly Glu Ile Leu Tyr Gln Ser Lys Arg Phe Glu Ile
70 75 80

TAT AAA GAA TAC ATT CAA AAA CTC TTA GAT GAA GAC AAA GCC TAT TAT 345

Tyr Lys Glu Tyr Ile Gln Lys Leu Leu Asp Glu Asp Lys Ala Tyr Tyr
85 90 95

55 60 65

GTG TTA TTG GTA AGC CAT AAC CCC AAT GAA ATC ACC AAG CTC GCG CAA 296

Val Leu Leu Val Ser His Asn Pro Asn Glu Ile Thr Lys Leu Ala Gln
70 75 80

ACT TTC CTC TTT TTA AAC AAT GGC GTT ATT GAT CCT AAT CAA GAA AAT 344

Thr Phe Leu Phe Leu Asn Asn Gly Val Ile Asp Pro Asn Gln Glu Asn
85 90 95

CGG CTT TTT TCA AAC CGC TTA TTA ATA AAA CCT CTC TTT GAA GAT GAA 392

Arg Leu Phe Ser Asn Arg Leu Leu Ile Lys Pro Leu Phe Glu Asp Glu
100 105 110

AAT TAT TGC CAT TAT GAG GTC ATT TCT CAA ACG ATT AGT TTG CCC AAA 440

Asn Tyr Cys His Tyr Glu Val Ile Ser Gln Thr Ile Ser Leu Pro Lys
115 120 125 130

GAT TGT CTG AAC CCA ACT TTT AAG CTT GAT TTC AAT CAA AAC AAA AAA 488
Asp Cys Leu Asn Pro Thr Phe Lys Leu Asp Phe Asn Gln Asn Lys Lys
135 140 145

TTT TAGAAATATT TTTTCATTTT CCTCTTAAAA CCCTCTTATT TTTCAAAAGG AGTTGC 547
Phe



TTAACAACCG CTAAAATCAA ACTCTTTTAT TTTAATACCC AATGAAAACA GAGC 601 
(2) INFORMATION FOR SEQ ID NO: 716: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716: 

Met Arg Leu Glu Asn Leu Ser Gln Gln Lys Ile Leu Gln Leu Ser Gly

1 5 10 15

Gly Gln Ala Gln Arg Val Ala Leu Ala Arg Ala Leu Ile Ala Ala Lys

20 25 30

Asn Leu Leu Leu Leu Asp Glu Pro Leu Asn Ala Leu Asp Asn Ala Leu

35 40 45

Lys Asn Glu Val Gln Gln Gly Leu Leu Asp Phe Ile Lys Arg Glu Asn

50 55 60

Leu Ser Val Leu Leu Val Ser His Asn Pro Asn Glu Ile Thr Lys Leu
65 70 75 80

Ala Gln Thr Phe Leu Phe Leu Asn Asn Gly Val Ile Asp Pro Asn Gln

85 90 95

Glu Asn Arg Leu Phe Ser Asn Arg Leu Leu Ile Lys Pro Leu Phe Glu
100 105 110

Ile Lys Ile Pro Thr Ala Leu Phe Pro Met Gly Val Leu Ser Cys Leu

100 105 110 

Glu Ser Ser Lys Asn Phe Ser Thr Ile Ala Val Glu Glu Ser Glu Ser

115 120 125

Pro Met Pro Lys Thr Arg Glu Lys Met Gly Glu Arg Pro Lys Thr Lys

130 135 140

Tyr Pro Asn Lys Lys Ala Ile Lys Ala His Lys Thr Thr Cys Lys Ser
145 150 155 160

Pro Lys Thr Ser Thr Ser Cys Leu Met Asp Leu Ser Leu Ser Lys Leu

165 170 175

Asn Ser Met Pro Ile Ile Asn Ile Lys Lys Thr Ile Pro Asn Ser Pro

180 185 190

Ile Ser Asp Asn Lys Ser Lys Ser Leu Ile Leu Lys Lys Ala Ala Lys

195 200 205

Thr Val Pro Val Gln Met
210 

(2) INFORMATION FOR SEQ ID NO: 715: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...491
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715: 

GCCTTTGCTC ACCCTAAAGA TAAAAATAAA ATTCACGAAG GTTACGCTTA ATG CGT 56

Met Arg
1

TTA GAA AAT CTA AGC CAG CAA AAA ATT CTT CAA CTC TCT GGC GGG CAA 104
Leu Glu Asn Leu Ser Gln Gln Lys Ile Leu Gln Leu Ser Gly Gly Gln
5 10 15

GCC CAA CGA GTC GCT TTA GCA AGA GCT TTA ATC GCA GCC AAG AAT CTA 152

Ala Gln Arg Val Ala Leu Ala Arg Ala Leu Ile Ala Ala Lys Asn Leu
20 25 30

TTG CTT TTA GAT GAG CCT TTA AAC GCC TTA GAT AAC GCC TTA AAA AAC 200

Leu Leu Leu Asp Glu Pro Leu Asn Ala Leu Asp Asn Ala Leu Lys Asn
35 40 45 50

GAA GTG CAA CAA GGT TTG CTT GAT TTT ATC AAG CGT GAA AAT TTA AGC 248

Glu Val Gln Gln Gly Leu Leu Asp Phe Ile Lys Arg Glu Asn Leu Ser

TCA AAG AAT TTC AGC ACA ATA GCG GTT GAA GAG AGC GAA AGC CCC ATG 440

Ser Lys Asn Phe Ser Thr Ile Ala Val Glu Glu Ser Glu Ser Pro Met
115 120 125 130

CCT AAA ACA AGG GAA AAA ATG GGT GAA AGA CCC AAA ACA AAA TAC CCC 488
Pro Lys Thr Arg Glu Lys Met Gly Glu Arg Pro Lys Thr Lys Tyr Pro
135 140 145

AAT AAA AAA GCG ATT AAA GCG CAT AAA ACC ACT TGT AAA AGC CCA AAA 536

Asn Lys Lys Ala Ile Lys Ala His Lys Thr Thr Cys Lys Ser Pro Lys
150 155 160

ACC AGC ACT TCT TGT TTG ATG GAT TTG AGC TTG TCA AAA TTA AAC TCA 584

Thr Ser Thr Ser Cys Leu Met Asp Leu Ser Leu Ser Lys Leu Asn Ser
165 170 175

ATG CCT ATC ATA AAC ATT AAA AAG ACG ATA CCA AAT TCG CCA ATA TCA 632

Met Pro Ile Ile Asn Ile Lys Lys Thr Ile Pro Asn Ser Pro Ile Ser
180 185 190

GAC AAC AAA TCA AAA TCA TTA ATT TTA AAA AAA GCC GCT AAG ACC GTT 680
Asp Asn Lys Ser Lys Ser Leu Ile Leu Lys Lys Ala Ala Lys Thr Val
195 200 205 210

CCT GTG CAA ATG TAACCAATGA TAACAGGCAT GTCTAATTTC TTTAAAAAGA TTCCA 737
Pro Val Gln Met



AAGCCCAC 745

(2) INFORMATION FOR SEQ ID NO: 714: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714: 

Met Pro Thr Met Asn Ala Pro Lys Asp Ile Glu Asn Pro Lys Lys Trp

1 5 10 15

Leu Asn Pro Ala Ala Leu Gln Thr Ile Thr Lys Ile Val Pro Ile Lys

20 25 30

Ile Ser Gly Arg Arg Val Ser Phe Ala Cys Ser Lys Met Arg Leu Ala

35 40 45

Pro Phe Phe Pro Gly Asn Asn Lys Arg Thr Lys Ile Ile Pro Ala Glu

50 55 60

Ile Lys Val Leu Arg Met Ser Lys Leu Thr Leu Glu Ser Leu Leu Pro

65 70 75 80

Arg Ile Val Arg Ile Lys Ser Met Gly Met Ala Ala Ile Ser Trp Lys

85 90 95

35 40 45

Phe Asp Pro Ser Ala Leu Ala Leu Glu Pro Lys Phe 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 713:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...692
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713: 

GCGCGAATTC TTCTTGGACA TTGATTTTAT AGCGCGATTT AGAAATGGCC ATG CCC 56

Met Pro
1

ACA ATG AAC GCT CCT AAA GAC ATA GAA AAC CCA AAA AAA TGG CTC AAC 104
Thr Met Asn Ala Pro Lys Asp Ile Glu Asn Pro Lys Lys Trp Leu Asn
5 10 15

CCT GCT GCG CTG CAA ACA ATC ACT AAA ATC GTG CCT ATA AAA ATT TCA 152
Pro Ala Ala Leu Gln Thr Ile Thr Lys Ile Val Pro Ile Lys Ile Ser
20 25 30

GGC AGG CGC GTG TCT TTT GCT TGC TCT AAG ATG AGA TTA GCC CCT TTT 200

Gly Arg Arg Val Ser Phe Ala Cys Ser Lys Met Arg Leu Ala Pro Phe
35 40 45 50

TTT CCA GGC AAT AAT AAA AGA ACT AAA ATA ATC CCT GCT GAA ATA AAG 248

Phe Pro Gly Asn Asn Lys Arg Thr Lys Ile Ile Pro Ala Glu Ile Lys
55 60 65

GTT TTA AGA ATG AGT AAA TTA ACA TTA GAA TCT TTA CTA CCT AGA ATA 296

Val Leu Arg Met Ser Lys Leu Thr Leu Glu Ser Leu Leu Pro Arg Ile
70 75 80

GTG AGG ATT AAA AGC ATG GGA ATG GCT GCA ATA TCT TGG AAA ATC AAA 344
Val Arg Ile Lys Ser Met Gly Met Ala Ala Ile Ser Trp Lys Ile Lys
85 90 95

ATC CCC ACC GCG CTC TTT CCC ATG GGC GTG CTA AGC TGT TTG GAA TCT 392

Ile Pro Thr Ala Leu Phe Pro Met Gly Val Leu Ser Cys Leu Glu Ser
100 105 110

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 1...180 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711: 

AAC AAA ACT TTT TTA AGG GGT AAA AAC CCT TTA GTT ACA AGA AAG AGA 48

Asn Lys Thr Phe Leu Arg Gly Lys Asn Pro Leu Val Thr Arg Lys Arg
1 5 10 15

ACT TTT GTC ATC AAA ATG AAG TTT TTT AAA GAA AAA GAA AAA GAA GTT 96
Ser Phe Val Ile Lys Met Lys Phe Phe Lys Glu Lys Glu Lys Glu Val
20 25 30

TCA AAA ATT AAA AGT TTG AGA AAG TTT GAG TCA AAT CCG CTA GTA AGA 144
Ser Lys Ile Lys Ser Leu Arg Lys Phe Glu Ser Asn Pro Leu Val Arg
35 40 45 

TTT GAC CCT AGC GCT CTT GCG CTA GAG CCA AAA TTT TAGTATAATG GCGTTG 196 
Phe Asp Pro Ser Ala Leu Ala Leu Glu Pro Lys Phe 

50 55 60 

CGGATACCAT TAAAACTATG ATTACTAATC TCATGTTAGG TTATCTCCTT TTCTGAGGTA 256 
ACCACCCACT ATTACCCCCA ACCCT 281 

(2) INFORMATION FOR SEQ ID NO: 712:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712: 

Asn Lys Thr Phe Leu Arg Gly Lys Asn Pro Leu Val Thr Arg Lys Arg 

1 5 10 15

Ser Phe Val Ile Lys Met Lys Phe Phe Lys Glu Lys Glu Lys Glu Val

20 25 30 

Ser Lys Ile Lys Ser Leu Arg Lys Phe Glu Ser Asn Pro Leu Val Arg

595 600 605

Val Met Glu Trp Gln Arg Ile Asp Arg Val Glu Leu Pro Thr Pro Asp

610 615 620

Ser Ile Lys Lys Pro Ser Ile Asp Glu Leu Lys Asn Asp Phe Asn Pro
625 630 635 640

Lys Arg Gly Lys Ser Ser Asp Leu Phe Glu Trp Ala Leu Glu Gln Leu

645 650 655

Glu Thr Leu Lys Asp Leu Asn Ser Pro Pro Arg Phe Val Leu Glu Lys

660 665 670

Lys Leu Glu Cys Gly Gly Ile Ser Ile Ile Gly Glu Asp Arg Asn Asn

675 680 685

Glu Leu Phe Tyr Ile Met Glu Thr Asn Gly Lys Lys Val Phe Cys His

690 695 700

Ser Arg Gln Cys Lys Gly Ser Val Asn Lys Asp Glu Leu Ser Leu Gly
705 710 715 720

Ala Arg Val Cys Leu Glu Val Gly Pro Asp Lys Asn Asp His Gly Lys

725 730 735

Tyr Arg Gly Lys Ile Tyr Gly Leu Glu Lys Asn Arg Glu Ile Val Leu

740 745 750

Leu Asn Thr Ala Lys Asn Ser Tyr Gln Arg Lys Pro Leu Asp Glu Lys

755 760 765

Ile Lys His Arg Ile Glu Ala Leu Lys Arg Ile Lys Tyr Pro Cys Leu

770 775 780

Lys Ile Phe Ser His Tyr Met Leu Glu Glu Leu Glu Thr Leu Asn Pro
785 790 795 800 

Glu Phe Ala Thr Pro Phe Lys Glu Tyr Leu Lys Arg Leu Glu Glu Tyr 

805 810 815 

Tyr Phe Asp Pro Gln Thr Asp Arg Asp Phe Lys Lys Gly Leu Leu Asp

820 825 830

Phe Phe Ser Arg Leu Asn Asp Ser Ile Pro Ala Lys Leu Gln Gln Glu

835 840 845

Phe Ile Asn Leu Pro Ser Thr Asp Phe Leu Ser Arg Cys Leu Gly Ser

850 855 860

Leu Glu Lys Asp Phe Gln Lys Thr Ile Phe Lys Lys Leu Lys Val Thr
865 870 875 880

Asn Leu Lys Thr Leu Ser Ile Val Ala Arg Ala Ser Trp Asn Asn Glu

885 890 895

Lys Phe Leu Glu Asn Leu Met Ala Gln Thr Ser Leu Glu Gln Gln Lys

900 905 910

Asp Phe Leu Lys Arg Ile Glu Glu Cys Leu Lys Asn Pro Glu Ser Phe

915 920 925

Tyr Phe Ser Ser Ala Cys Glu Leu Leu Leu Ala Phe Leu Ser Tyr Arg

930 935 940

Asn Ala Lys Arg Glu Leu Glu Leu Ile Pro Glu Ser Glu Lys Thr Met
945 950 955 960

Arg Leu Leu Asp Ser Ile Asp Lys Ala Ile Glu Lys Glu Thr Glu Ile

965 970 975

Lys Ser Phe Val Lys Leu Glu Leu Lys Asn Gln Ser Phe Asn Asn Ile

980 985 990

Pro Pro Leu Leu Ser Ala Leu Arg Leu Tyr Leu Arg Gly Asp Leu Glu

995 1000 1005

Gly Val Gly Ile Glu Ile Asn Gly Thr Glu Glu Asp Glu
1010 1015 1020 

(2) INFORMATION FOR SEQ ID NO: 711:

Phe Glu Glu Leu Ile Phe Ser Ile Thr Ser Phe Leu Lys Asn Gly Val

165 170 175 

Ala Gln Gln Ile Asp Glu Lys Gln Ala Ile Ile Pro Asn Asn Leu Leu

180 185 190

His Phe Asp Lys His Tyr Lys Arg Ile Phe Lys Ala His Asp Trp Leu

195 200 205

Tyr Asp Gly Val Gly Ser Leu Met Asn Leu Asp Gln Ile Phe Tyr Leu

210 215 220

Glu Cys Leu Tyr Gln Ala Gln Phe Tyr Thr Ser Lys Asn Ile Glu Pro
225 230 235 240

Thr Leu Ile Arg Asn Glu Gln Asp Leu Tyr Ala Leu Ile Lys Asn Ser

245 250 255

Phe Pro Ile Lys Asp Leu Ser Phe Glu Lys Met Arg Leu Lys Ala Lys

260 265 270

Glu Phe Phe Glu Asn Glu Leu Arg Gln Pro Ile Asn Leu Asp Gln Glu

275 280 285

Ile Pro Gln Leu Glu Leu Cys Lys Gly Val Tyr Lys Glu Met Tyr Ile

290 295 300

Asp Met Phe Ser Pro Glu Pro Phe Ala Leu Leu Val Gly Asn Gly Asn
305 310 315 320

Glu Glu Lys Ile Leu Lys Leu Pro Leu Leu Val Lys Lys Gln Glu Asn

325 330 335

Asn Thr Tyr Ile Asn Ala Asn Gly Ala Lys Gly Lys Ile Asp Glu Lys

340 345 350

Gly Tyr Leu Ala Asn Ala Leu Lys Asn Tyr Asp Glu Thr Leu Val Glu

355 360 365

Ala Phe Met Arg Asp Phe Lys Glu Arg Tyr Lys Ile Glu Lys Leu Tyr

370 375 380

Tyr Leu Leu Asp Asp Asn Ile Lys Asn Phe Glu Phe Ala Lys Ile Lys
385 390 395 400

His Lys Ile Ser Leu Tyr Phe Lys Asp Ala Lys Phe Tyr Pro Lys Ser

405 410 415 

Val Ala Leu Gly Phe Ser Ser Leu Phe Glu Asn Lys Leu Lys Lys Asn 

420 425 430 

Glu Arg Leu Arg Tyr Asn Ser Val Asp Leu Val Val Lys Glu Asn His 

435 440 445 

Lys Ser Lys Thr Phe Asn Asp Cys Gly Leu Val Leu Glu Arg Gln Lys

450 455 460

Ser Asp Asp Ser Lys Glu Phe Leu Ile Leu Gln Asp Ser Phe Ile Lys
465 470 475 480

Lys Ala Leu Lys Asn Phe Lys Arg Ala Leu Gly Leu Glu Lys Glu Gly

485 490 495

Phe Ile Leu Tyr Lys Glu Cys Leu Pro Lys Leu Ser Met Glu Val Val

500 505 510

Lys Asp Gly Arg Phe Lys Asn Phe Glu Ile Ile Lys Asp Lys Thr Ile

515 520 525

Leu Gly Asp Lys Glu Thr Leu Glu Ile Glu Thr Pro Phe Ile Ile Pro

530 535 540

Lys Gly Arg Glu Ser Phe Ala Leu Pro Leu Ile Leu Asn Glu Glu Lys
545 550 555 560

Ile Ala Tyr Gln Gly Lys Ile Thr Ser Lys Asp Phe Pro Leu Glu Asn

565 570 575

Asp Glu Glu Tyr Lys Leu Thr Leu Thr Tyr Asp Ile Gly Thr Glu Phe

580 585 590 

Asn Tyr Val Leu Glu Phe Lys Pro Val Asn Asn Asp Leu Lys Pro Ile

AAA AGA GAG TTG GAA TTG ATC CCT GAA AGC GAA AAA ACC ATG CGT TTA 2936

Lys Arg Glu Leu Glu Leu Ile Pro Glu Ser Glu Lys Thr Met Arg Leu
950 955 960

TTG GAC AGC ATA GAT AAA GCG ATA GAA AAA GAG ACT GAA ATT AAA AGC 2984

Leu Asp Ser Ile Asp Lys Ala Ile Glu Lys Glu Thr Glu Ile Lys Ser
965 970 975

TTT GTA AAA TTA GAG CTA AAA AAT CAA AGC TTC AAC AAT ATC CCA CCT 3032

Phe Val Lys Leu Glu Leu Lys Asn Gln Ser Phe Asn Asn Ile Pro Pro
980 985 990

TTG TTG TCG GCG TTA CGC TTG TAT TTA AGG GGG GAT TTG GAA GGT GTT 3080

Leu Leu Ser Ala Leu Arg Leu Tyr Leu Arg Gly Asp Leu Glu Gly Val
995 1000 1005 1010

GGA ATT GAA ATT AAT GGG ACA GAA GAG GAT GAA TAAATCAAAC AAATTAGTCA 3133
Gly Ile Glu Ile Asn Gly Thr Glu Glu Asp Glu
1015 1020 

TTATCAATCG CGCCATTCCA GGTGGGGGCA AGA 3166 

(2) INFORMATION FOR SEQ ID NO: 710: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710: 

Met Asp Leu Glu Glu Leu Tyr Ala Pro Asn His Ile Glu Arg Leu Lys

1 5 10 15

Ala Arg Ser Phe Leu Arg Ser Ile Ala Phe Phe Asp Asp Phe Ser Ala

20 25 30

Ser Phe Glu Tyr Arg Asp Leu Phe Ser Val Leu Glu Asn Ile Val Gln

35 40 45

Phe Asp Tyr Glu Lys Lys Pro Tyr Lys Asp Asp Leu Tyr Phe Leu Cys

50 55 60

Lys Phe Val Glu Pro Ala Leu Lys Ala Ile Phe Ser Asn Leu Asn Thr
65 70 75 80

Asn Ile Tyr Arg Lys His Leu Lys Met Pro Leu Glu Lys Ala Arg Glu

85 90 95 

Phe Asp Ala Lys Cys Ala Leu Asp Leu Ala Lys Arg Pro Gly Arg Ser 

100 105 110 

Leu Lys Glu Lys Leu Cys Asp Asn Lys Val Leu Ser Val Lys Arg Tyr 

115 120 125 

Val Asn Ala Asn Thr His Glu Asn Arg Phe Leu Lys Arg Phe Ile Lys

130 135 140

Glu Leu Leu Arg Ile Ile His Trp Arg Glu Ile Glu Phe Gln Gln Val
145 150 155 160

GTG TGT TTG GAA GTG GGG CCA GAT AAG AAC GAC CAT GGT AAA TAT CGA 2264

Val Cys Leu Glu Val Gly Pro Asp Lys Asn Asp His Gly Lys Tyr Arg 
725 730 735 

GGT AAA ATT TAT GGT TTG GAA AAA AAT AGA GAA ATT GTT TTA TTA AAT 2312

Gly Lys Ile Tyr Gly Leu Glu Lys Asn Arg Glu Ile Val Leu Leu Asn
740 745 750

ACA GCT AAA AAT TCT TAT CAA AGA AAA CCT CTA GAT GAG AAA ATT AAA 2360

Thr Ala Lys Asn Ser Tyr Gln Arg Lys Pro Leu Asp Glu Lys Ile Lys
755 760 765 770

CAC AGA ATA GAA GCG CTC AAA AGA ATC AAG TAT CCT TGT TTA AAA ATT 2408

His Arg Ile Glu Ala Leu Lys Arg Ile Lys Tyr Pro Cys Leu Lys Ile
775 780 785

TTT TCA CAT TAC ATG CTT GAA GAG TTA GAA ACC TTA AAT CCT GAA TTT 2456

Phe Ser His Tyr Met Leu Glu Glu Leu Glu Thr Leu Asn Pro Glu Phe
790 795 800

GCT ACT CCC TTT AAA GAA TAT TTG AAG CGG TTA GAA GAA TAT TAT TTT 2504

Ala Thr Pro Phe Lys Glu Tyr Leu Lys Arg Leu Glu Glu Tyr Tyr Phe
805 810 815

GAC CCA CAA ACA GAC AGA GAT TTT AAA AAA GGA CTC TTG GAT TTC TTT 2552

Asp Pro Gln Thr Asp Arg Asp Phe Lys Lys Gly Leu Leu Asp Phe Phe
820 825 830

AGC CGC TTG AAT GAT AGT ATT CCC GCA AAA TTA CAA CAA GAA TTT ATT 2600

Ser Arg Leu Asn Asp Ser Ile Pro Ala Lys Leu Gln Gln Glu Phe Ile
835 840 845 850 

AAT TTA CCT TCT ACG GAT TTT TTA AGC AGA TGT TTA GGC TCT CTT GAA 2648

Asn Leu Pro Ser Thr Asp Phe Leu Ser Arg Cys Leu Gly Ser Leu Glu
855 860 865

AAA GAC TTT CAA AAA ACG ATT TTT AAG AAG CTT AAA GTT ACT AAC CTA 2696

Lys Asp Phe Gln Lys Thr Ile Phe Lys Lys Leu Lys Val Thr Asn Leu
870 875 880

AAG ACT TTA AGT ATT GTG GCT AGG GCT AGT TGG AAT AAT GAG AAA TTT 2744

Lys Thr Leu Ser Ile Val Ala Arg Ala Ser Trp Asn Asn Glu Lys Phe
885 890 895

TTA GAG AAC TTG ATG GCT CAA ACC AGC TTG GAG CAG CAA AAA GAC TTT 2792

Leu Glu Asn Leu Met Ala Gln Thr Ser Leu Glu Gln Gln Lys Asp Phe
900 905 910

TTG AAG CGT ATA GAA GAG TGT TTG AAA AAT CCT GAG TCA TTT TAT TTC 2840

Leu Lys Arg Ile Glu Glu Cys Leu Lys Asn Pro Glu Ser Phe Tyr Phe
915 920 925 930

AGT AGC GCA TGC GAA TTG CTG TTA GCG TTT TTG TCT TAT CGC AAC GCT 2888

Ser Ser Ala Cys Glu Leu Leu Leu Ala Phe Leu Ser Tyr Arg Asn Ala 
935 940 945

CTG TAT AAA GAA TGC TTG CCT AAG CTC TCT ATG GAA GTG GTT AAA GAC 1592

Leu Tyr Lys Glu Cys Leu Pro Lys Leu Ser Met Glu Val Val Lys Asp 
500 505 510 

GGG CGG TTT AAA AAT TTT GAG ATC ATT AAA GAT AAA ACC ATT TTA GGA 1640

Gly Arg Phe Lys Asn Phe Glu Ile Ile Lys Asp Lys Thr Ile Leu Gly
515 520 525 530

GAT AAA GAA ACC CTA GAG ATT GAA ACG CCT TTT ATT ATC CCT AAA GGG 1688

Asp Lys Glu Thr Leu Glu Ile Glu Thr Pro Phe Ile Ile Pro Lys Gly
535 540 545

CGA GAA AGT TTT GCT TTG CCC TTG ATC CTA AAT GAA GAA AAA ATC GCC 1736

Arg Glu Ser Phe Ala Leu Pro Leu Ile Leu Asn Glu Glu Lys Ile Ala
550 555 560

TAT CAA GGT AAA ATC ACC TCT AAA GAT TTT CCC CTA GAA AAT GAC GAA 1784

Tyr Gln Gly Lys Ile Thr Ser Lys Asp Phe Pro Leu Glu Asn Asp Glu
565 570 575

GAA TAG AAA CTC ACG CTC ACT TAT GAC ATT GGC ACC GAG TTT AAC TAT 1832

Glu Tyr Lys Leu Thr Leu Thr Tyr Asp Ile Gly Thr Glu Phe Asn Tyr
580 585 590

GTG TTA GAG TTT AAA CCT GTC AAT AAT GAT TTA AAG CCC ATT GTC ATG 1880

Val Leu Glu Phe Lys Pro Val Asn Asn Asp Leu Lys Pro Ile Val Met
595 600 605 610

GAA TGG CAG CGT ATT GAT AGG GTT GAA CTC CCT ACG CCC GAT TCC ATC 1928

Glu Trp Gln Arg Ile Asp Arg Val Glu Leu Pro Thr Pro Asp Ser Ile
615 620 625 

AAA AAA CCA AGT ATT GAT GAA CTA AAA AAT GAC TTT AAT CCT AAA AGG 1976

Lys Lys Pro Ser Ile Asp Glu Leu Lys Asn Asp Phe Asn Pro Lys Arg
630 635 640

GGC AAA AGT TCT GAT TTG TTT GAG TGG GCG CTA GAG CAA TTA GAG ACA 2024

Gly Lys Ser Ser Asp Leu Phe Glu Trp Ala Leu Glu Gln Leu Glu Thr
645 650 655

TTG AAA GAT TTA AAT AGT CCA CCC AGA TTT GTT TTA GAG AAA AAA CTA 2072

Leu Lys Asp Leu Asn Ser Pro Pro Arg Phe Val Leu Glu Lys Lys Leu
660 665 670

GAA TGC GGT GGA ATC TCA ATA ATA GGG GAA GAT AGA AAC AAT GAA CTT 2120

Glu Cys Gly Gly Ile Ser Ile Ile Gly Glu Asp Arg Asn Asn Glu Leu
675 680 685 690

TTT TAG ATA ATG GAA ACA AAT GGT AAA AAA GTT TTT TGT CAT AGC CGT 2168
Phe Tyr Ile Met Glu Thr Asn Gly Lys Lys Val Phe Cys His Ser Arg
695 700 705

CAA TGC AAA GGG AGC GTG AAC AAA GAT GAG CTT TCA TTA GGC GCG CGA 2216
Gln Cys Lys Gly Ser Val Asn Lys Asp Glu Leu Ser Leu Gly Ala Arg
710 715 720

Phe Glu Asn Glu Leu Arg Gln Pro Ile Asn Leu Asp Gln Glu Ile Pro
275 280 285 290 

CAA TTG GAA TTG TGT AAG GGA GTT TAT AAA GAA ATG TAT ATT GAT ATG 968

Gln Leu Glu Leu Cys Lys Gly Val Tyr Lys Glu Met Tyr Ile Asp Met
295 300 305

TTT AGC CCT GAA CCT TTC GCT TTG TTA GTG GGT AAT GGC AAT GAA GAA 1016
Phe Ser Pro Glu Pro Phe Ala Leu Leu Val Gly Asn Gly Asn Glu Glu
310 315 320

AAG ATT TTA AAG CTC CCC CTT TTA GTC AAA AAG GAG GAG AAT AAT ACT 1064

Lys Ile Leu Lys Leu Pro Leu Leu Val Lys Lys Gln Glu Asn Asn Thr
325 330 335

TAT ATC AAC GCT AAT GGC GCT AAG GGT AAG ATA GAT GAA AAA GGT TAT 1112
Tyr Ile Asn Ala Asn Gly Ala Lys Gly Lys Ile Asp Glu Lys Gly Tyr
340 345 350

TTG GCC AAC GCT CTC AAA AAC TAT GAT GAG ACT CTT GTG GAA GCT TTT 1160
Leu Ala Asn Ala Leu Lys Asn Tyr Asp Glu Thr Leu Val Glu Ala Phe
355 360 365 370

ATG AGA GAT TTC AAG GAA CGC TAT AAG ATA GAA AAA CTA TAT TAT TTA 1208

Met Arg Asp Phe Lys Glu Arg Tyr Lys Ile Glu Lys Leu Tyr Tyr Leu
375 380 385

TTA GAT GAT AAT ATT AAA AAT TTT GAA TTT GCT AAG ATC AAG CAT AAA 1256

Leu Asp Asp Asn Ile Lys Asn Phe Glu Phe Ala Lys Ile Lys His Lys
390 395 400 

ATA AGC TTG TAT TTT AAA GAC GCA AAA TTC TAT CCT AAA AGC GTT GCT 1304

Ile Ser Leu Tyr Phe Lys Asp Ala Lys Phe Tyr Pro Lys Ser Val Ala
405 410 415

TTA GGA TTT AGT TCT TTG TTT GAA AAT AAA TTA AAG AAA AAT GAG CGT 1352

Leu Gly Phe Ser Ser Leu Phe Glu Asn Lys Leu Lys Lys Asn Glu Arg
420 425 430

TTG CGT TAT AAC AGC GTG GAT TTG GTC GTT AAA GAA AAC CAT AAA AGT 1400

Leu Arg Tyr Asn Ser Val Asp Leu Val Val Lys Glu Asn His Lys Ser
435 440 445 450

AAG ACC TTT AAT GAT TGT GGC TTG GTT TTG GAG AGG CAA AAA AGC GAT 1448

Lys Thr Phe Asn Asp Cys Gly Leu Val Leu Glu Arg Gln Lys Ser Asp
455 460 465

GAT TCA AAA GAG TTC CTT ATT CTA CAA GAT TCT TTT ATC AAA AAA GCT 1496

Asp Ser Lys Glu Phe Leu Ile Leu Gln Asp Ser Phe Ile Lys Lys Ala
470 475 480

TTA AAA AAT TTT AAA AGA GCC TTA GGA TTA GAA AAA GAA GGC TTT ATT 1544

Leu Lys Asn Phe Lys Arg Ala Leu Gly Leu Glu Lys Glu Gly Phe Ile
485 490 495

55 60 65

GTG GAG CCA GCC CTA AAG GCT ATC TTT AGC AAT CTA AAT ACC AAT ATC 296

Val Glu Pro Ala Leu Lys Ala Ile Phe Ser Asn Leu Asn Thr Asn Ile
70 75 80

TAG CGA AAA CAT TTA AAA ATG CCT TTA GAA AAG GCT AGG GAA TTT GAC 344

Tyr Arg Lys His Leu Lys Met Pro Leu Glu Lys Ala Arg Glu Phe Asp
85 90 95

GCT AAA TGC GCG TTG GAT TTA GCC AAG CGA CCA GGT CGT AGT TTG AAA 392

Ala Lys Cys Ala Leu Asp Leu Ala Lys Arg Pro Gly Arg Ser Leu Lys
100 105 110

GAA AAG TTG TGC GAC AAT AAA GTA TTG AGC GTC AAG CGT TAT GTG AAT 440

Glu Lys Leu Cys Asp Asn Lys Val Leu Ser Val Lys Arg Tyr Val Asn
115 120 125 130

GCC AAT ACG CAT GAA AAC AGG TTT CTC AAG CGT TTC ATT AAA GAA CTT 488

Ala Asn Thr His Glu Asn Arg Phe Leu Lys Arg Phe Ile Lys Glu Leu
135 140 145

TTA AGA ATA ATT CAT TGG CGC GAG ATA GAA TTC CAA CAG GTT TTT GAA 536
Leu Arg Ile Ile His Trp Arg Glu Ile Glu Phe Gln Gln Val Phe Glu
150 155 160

GAG TTA ATT TTC AGC ATA ACA AGT TTT TTA AAG AAT GGA GTA GCC CAA 584

Glu Leu Ile Phe Ser Ile Thr Ser Phe Leu Lys Asn Gly Val Ala Gln
165 170 175 

CAA ATT GAT GAA AAA CAA GCC ATC ATT CCT AAT AAC TTG TTG CAT TTT 632

Gln Ile Asp Glu Lys Gln Ala Ile Ile Pro Asn Asn Leu Leu His Phe
180 185 190

GAT AAG CAC TAC AAA CGC ATT TTT AAA GCC CAT GAT TGG CTT TAT GAT 680

Asp Lys His Tyr Lys Arg Ile Phe Lys Ala His Asp Trp Leu Tyr Asp
195 200 205 210

GGT GTG GGG TCA TTG ATG AAT TTG GAT CAA ATT TTC TAT TTG GAG TGT 728

Gly Val Gly Ser Leu Met Asn Leu Asp Gln Ile Phe Tyr Leu Glu Cys
215 220 225

TTA TAC CAA GCC CAA TTT TAT ACT TCT AAA AAC ATT GAA CCC ACG CTA 776

Leu Tyr Gln Ala Gln Phe Tyr Thr Ser Lys Asn Ile Glu Pro Thr Leu
230 235 240

ATT AGA AAT GAA CAA GAT TTA TAC GCG CTA ATT AAA AAT AGT TTT CCA 824
Ile Arg Asn Glu Gln Asp Leu Tyr Ala Leu Ile Lys Asn Ser Phe Pro
245 250 255

ATA AAA GAT TTA TCG TTT GAA AAG ATG CGT TTA AAA GCG AAA GAG TTT 872
Ile Lys Asp Leu Ser Phe Glu Lys Met Arg Leu Lys Ala Lys Glu Phe
260 265 270 

TTT GAA AAT GAA TTA AGA CAG CCT ATA AAT TTA GAT CAA GAA ATT CCG 920

Gly Leu Asn Gly Glu Ser Asn Leu Pro Asn Ser Phe Phe Leu Gly Ile

145 150 155 160 

Cys Gly Tyr Leu Lys Lys Leu Phe Lys Lys Leu Lys Asp Arg Ala Phe 

165 170 175 

Arg Leu Ala Asn Lys His Gly Val Phe Phe Leu Lys Ile Pro Lys His

180 185 190

Phe Gln Met Gln Asn Phe Asn Asn Ile Phe Leu Glu Phe Val Ser Val

195 200 205

Asn Asn Pro Cys Phe Ser Tyr Arg Leu Thr Tyr Asp Gln Leu Val Gly

210 215 220

Lys Lys Ile Pro Asn Ile Lys Cys Ser Tyr Gln Gln Ala Ile Val Lys

225 230 235 240

Lys Asn Ile His Arg Ala Leu Asp Glu Leu Ser Ile Asp Lys Glu Ile
245 250 255 

Leu Ala Ser 



(2) INFORMATION FOR SEQ ID NO: 709:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...3113
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709: 

ACTATTTGT^ TAATGTCGAA TATGAAAAAC TTCTTAAAAA GTAGCATGCA ATG GAT 56

Met Asp
1

CTA GAA GAA CTC TAT GCG CCT AAT CAC ATA GAG CGT TTG AAA GCG CGG 104
Leu Glu Glu Leu Tyr Ala Pro Asn His Ile Glu Arg Leu Lys Ala Arg
5 10 15

AGT TTT TTA AGA TCG ATT GCT TTT TTT GAT GAT TTT AGC GCT TCT TTT 152
Ser Phe Leu Arg Ser Ile Ala Phe Phe Asp Asp Phe Ser Ala Ser Phe
20 25 30

GAA TAG AGA GAT CTA TTT AGC GTT TTG GAA AAT ATC GTG CAA TTT GAT 200
Glu Tyr Arg Asp Leu Phe Ser Val Leu Glu Asn Ile Val Gln Phe Asp
35 40 45 50

TAT GAA AAA AAG CCG TAT AAA GAT GAT TTG TAT TTT TTG TGC AAA TTT 248

Tyr Glu Lys Lys Pro Tyr Lys Asp Asp Leu Tyr Phe Leu Cys Lys Phe

170 175 180 185

TTT TTG AAA ATT CCT AAG CAT TTT CAA ATG CAA AAC TTT AAC AAT ATT 688

Phe Leu Lys Ile Pro Lys His Phe Gln Met Gln Asn Phe Asn Asn Ile
190 195 200

TTT TTG GAG TTT GTG TCG GTT AAT AAT CCT TGT TTT TCT TAT AGA TTG 736
Phe Leu Glu Phe Val Ser Val Asn Asn Pro Cys Phe Ser Tyr Arg Leu
205 210 215

ACT TAT GAT CAA CTT GTT GGT AAA AAA ATT CCA AAT ATC AAG TGC TCT 784

Thr Tyr Asp Gln Leu Val Gly Lys Lys Ile Pro Asn Ile Lys Cys Ser
220 225 230

TAC CAA CAA GCA ATT GTA AAA AAG AAT ATC CAT AGA GCA TTA GAT GAA 832

Tyr Gln Gln Ala Ile Val Lys Lys Asn Ile His Arg Ala Leu Asp Glu
235 240 245

CTA TCT ATA GAT AAG GAA ATT TTA GCA TCA TAAAGAAGAC AAAGGATAAA AAT 885
Leu Ser Ile Asp Lys Glu Ile Leu Ala Ser
250 255

GCAATTCCCA CTCAAAAAAG ATTTAAGA 913
(2) INFORMATION FOR SEQ ID NO: 708:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708: 

Met Gln Glu Arg Val Phe Lys Arg Lys Val Leu Asp Ala Asn Ile Leu

1 5 10 15

Lys Glu Met His Ala Asn Asn Val Cys Tyr Ser Lys His Ser Lys Asp

20 25 30

Arg Phe Ile Pro Phe Lys Phe Asp Lys Phe Gly Tyr Val Gly Cys Lys

35 40 45

Leu Phe Lys Lys Ile Leu Asn Phe Pro Ser Asn Thr Thr Phe Phe Gly

50 55 60

Gly Thr Gly Cys Lys Lys Leu Met Glu Leu Leu Ser Glu Ile Val Ile
65 70 75 80

Asp Ser Arg Ser Ser Lys Ile Ala Leu Asn Arg His Tyr Ala Leu Thr

85 90 95

Arg Leu Gln Trp Cys Asp Arg Thr Leu Arg His Asn Leu Gln Ile Leu

100 105 110

Glu Arg Ile Gly Phe Leu Thr Ala Phe Lys Asn Lys Lys Gly Tyr Ile

115 120 125

Phe Leu Ser Met His Asp Phe Thr Lys Ile Glu Asn Tyr Glu His Ser
130 135 140

(B) LOCATION: 86...862
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707: 

ATTTCATATT TTCTTTTAGG GGCATTCTTT AGAGAATGCA ATTATTTTCT AATCTCTAAG 60

AAATCAATAT CTAGGAATTA GGACC ATG CAA GAA AGA GTT TTT AAA AGA AAA 112

Met Gln Glu Arg Val Phe Lys Arg Lys
1 5

GTT TTA GAT GCG AAT ATC TTA AAA GAA ATG CAT GCG AAC AAT GTC TGT 160

Val Leu Asp Ala Asn Ile Leu Lys Glu Met His Ala Asn Asn Val Cys
10 15 20 25

TAT TCC AAG CAT TCA AAA GAT AGG TTT ATT CCT TTC AAA TTT GAT AAA 208

Tyr Ser Lys His Ser Lys Asp Arg Phe Ile Pro Phe Lys Phe Asp Lys
30 35 40

TTT GGT TAT GTT GGA TGT AAA CTT TTT AAA AAG ATA TTA AAC TTT CCT 256
Phe Gly Tyr Val Gly Cys Lys Leu Phe Lys Lys Ile Leu Asn Phe Pro
45 50 55

AGC AAT ACA ACT TTC TTT GGT GGC ACA GGT TGT AAG AAA CTC ATG GAA 304

Ser Asn Thr Thr Phe Phe Gly Gly Thr Gly Cys Lys Lys Leu Met Glu
60 65 70

CTT TTA AGT GAA ATC GTT ATA GAT TCT AGA AGT TCT AAA ATT GCG TTA 352

Leu Leu Ser Glu Ile Val Ile Asp Ser Arg Ser Ser Lys Ile Ala Leu
75 80 85 

AAC CGC CAT TAT GCC TTA ACT CGC TTG CAA TGG TGC GAT AGA ACC TTA 400

Asn Arg His Tyr Ala Leu Thr Arg Leu Gln Trp Cys Asp Arg Thr Leu
90 95 100 105

AGA CAT AAT CTC CAA ATT TTA GAG AGA ATA GGA TTT CTA ACT GCT TTT 448

Arg His Asn Leu Gln Ile Leu Glu Arg Ile Gly Phe Leu Thr Ala Phe
110 115 120

AAG AAC AAA AAA GGT TAT ATT TTT TTG TCT ATG CAT GAC TTC ACT AAA 496

Lys Asn Lys Lys Gly Tyr Ile Phe Leu Ser Met His Asp Phe Thr Lys
125 130 135

ATA GAA AAC TAC GAA CAT TCA GGT TTG AAT GGG GAG AGC AAT TTA CCT 544
Ile Glu Asn Tyr Glu His Ser Gly Leu Asn Gly Glu Ser Asn Leu Pro
140 145 150

AAT AGC TTC TTT TTA GGA ATT TGT GGG TAT TTG AAA AAA CTC TTC AAG 592

Asn Ser Phe Phe Leu Gly Ile Cys Gly Tyr Leu Lys Lys Leu Phe Lys
155 160 165

AAA TTA AAA GAT AGA GCA TTC AGG CTC GCA AAC AAG CAC GGT GTA TTC 640

Lys Leu Lys Asp Arg Ala Phe Arg Leu Ala Asn Lys His Gly Val Phe

Glu Phe Leu Lys Glu Ile Glu Lys Asn Glu Ser Phe Phe
180 185 190 

GCGTGAAAAG CTAAGCCAAT TAAAAGCAAG GCAGCAAAAA C 676



(2) INFORMATION FOR SEQ ID NO: 706: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706:



Met Asp Glu Val Leu Lys Glu Ile Leu Ser Ser Tyr Gln Lys Arg Ala

1 5 10 15

Leu Lys Leu Thr Lys Arg Val Arg Lys Lys Ile Phe Lys Asn Asp Pro

20 25 30

Thr Glu Asn Gln Lys Lys Ala Ile Lys Ile Ala Leu Asn Thr Pro Asp

35 40 45

Ile Ala Ile Ile Gln Gly Pro Pro Gly Thr Gly Lys Thr Thr Val Ile

50 55 60

Asn Ala Ile Cys Glu Arg Leu Phe Glu Glu Tyr Pro Lys Asp Lys Asn
65 70 75 80

Ile Lys Gly Gln Ile Leu Leu Cys Ala Gln Gly His Asp Ala Thr Asn

85 90 95

Asn Ala Arg Glu Arg Ile Lys Val Gly Gly Leu Pro Thr Phe Lys Phe

100 105 110

Gly Ala Lys Lys Asn Ala Lys Glu Glu Gln Tyr Lys Gln Asp Glu Arg

115 120 125

Leu Asn Glu Arg Leu Arg Glu Phe Ala Glu Thr Leu Ile Glu Ser Val

130 135 140

Arg Lys Lys Leu Gln Lys Leu Gly Asp Tyr Glu Asn Ile Glu Lys Ile
145 150 155 160

Leu Asp Leu Glu Glu Ala Leu Arg Arg Tyr Tyr Ser Ser Pro Ile Ser

165 170 175

Glu Leu Glu Phe Leu Lys Glu Ile Glu Lys Asn Glu Ser Phe Phe
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 707: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705: 

TTAAAGAGGA TAAAGAGAAT AGTGATAGCG ATAATGATAC TGCAGACACG ATG GAT 56 
Met Asp 
1 

GAA GTC TTA AAA GAG ATT TTA TCA AGT TAT CAA AAA AGA GCT TTA AAA 104 
Glu Val Leu Lys Glu Ile Leu Ser Ser Tyr Gln Lys Arg Ala Leu Lys 
5 10 15 

TTA ACC AAA AGA GTT AGA AAG AAG ATT TTT AAG AAT GAT CCC ACA GAA 152 
Leu Thr Lys Arg Val Arg Lys Lys Ile Phe Lys Asn Asp Pro Thr Glu 
20 25 30 

AAT CAA AAA AAA GCC ATA AAG ATC GCT CTA AAT ACC CCT GAT ATT GCT 200 
Asn Gln Lys Lys Ala Ile Lys Ile Ala Leu Asn Thr Pro Asp Ile Ala 
35 40 45 50 

ATT ATC CAA GGG CCT CCT GGA ACG GGC AAA ACC ACT GTG ATC AAT GCC 248 
Ile Ile Gln Gly Pro Pro Gly Thr Gly Lys Thr Thr Val Ile Asn Ala 
55 60 65 

ATT TGT GAG AGA TTG TTT GAA GAA TAC CCT AAG GAT AAA AAT ATC AAG 296 
Ile Cys Glu Arg Leu Phe Glu Glu Tyr Pro Lys Asp Lys Asn Ile Lys 
70 75 80 

GGG CAA ATT TTA CTG TGC GCT CAA GGG CAT GAT GCG ACT AAC AAT GCG 344 
Gly Gln Ile Leu Leu Cys Ala Gln Gly His Asp Ala Thr Asn Asn Ala 
85 90 95 

CGT GAG CGC ATC AAA GTA GGG GGA TTG CCC ACT TTT AAA TTT GGT GCT 392 
Arg Glu Arg Ile Lys Val Gly Gly Leu Pro Thr Phe Lys Phe Gly Ala 
100 105 110 

AAA AAA AAT GCT AAA GAA GAA CAA TAC AAG CAA GAT GAA AGA TTG AAT 440 
Lys Lys Asn Ala Lys Glu Glu Gln Tyr Lys Gln Asp Glu Arg Leu Asn 
115 120 125 130 

GAG CGA TTG AGA GAG TTT GCT GAA ACG CTC ATA GAA AGC GTG AGA AAA 488 
Glu Arg Leu Arg Glu Phe Ala Glu Thr Leu Ile Glu Ser Val Arg Lys 
135 140 145 

AAA CTG CAA AAA TTA GGG GAT TAT GAA AAT ATA GAA AAA ATT TTG GAT 536 
Lys Leu Gln Lys Leu Gly Asp Tyr Glu Asn Ile Glu Lys Ile Leu Asp 
150 155 160 

TTA GAA GAA GCC CTT AGA CGC TAC TAT AGT TCG CCT ATC AGT GAA TTG 584 
Leu Glu Glu Ala Leu Arg Arg Tyr Tyr Ser Ser Pro Ile Ser Glu Leu 
165 170 175 

GAA TTT TTA AAA GAA ATA GAA AAA AAT GAG AGC TTT TTT TAATTCTTCT AT 635 
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Asn Leu Tyr Ala Glu Leu Arg Lys Met Gly Ala Lys Asn Leu Asn Ser 

260 265 270 

Val Asn Ile Gly Gly Gly Leu Ala Val Glu Tyr Thr Gln His Lys His 
275 280 285 

His Gln Asp Lys Asn Tyr Thr Leu Glu Glu Phe Ser Ala Asp Val Val 
290 295 300 

Phe Leu Leu Arg Glu Ile Val Lys Asn Lys Gln Glu Ile Glu Pro Asp 
305 310 315 320 

Ile Phe Ile Glu Ser Gly Arg Tyr Ile Ser Ala Asn His Ala Val Leu 
325 330 335 

Val Ala Pro Val Leu Glu Leu Phe Ser His Glu Tyr Asn Glu Lys Ser 
340 345 350 

Leu Lys Ile Lys Glu Asn Asn Asn Pro Pro Leu Ile Asp Glu Met Leu 
355 360 365 

Asp Leu Leu Ala Asn Ile Asn Glu Lys Asn Ala Ile Glu Tyr Leu His 
370 375 380 

Asp Ser Phe Asp His Thr Glu Ser Leu Phe Thr Leu Phe Asp Leu Gly 
385 390 395 400 

Tyr Ile Asp Leu Ile Asp Arg Ser Asn Thr Glu Val Leu Ala His Leu 
405 410 415 

Ile Val Lys Lys Ala Val Gln Leu Leu Tyr Val Lys Asp His Asn Asp 
420 425 430 

Ile Leu Arg Ile Gln Glu Gln Val Gln Glu Arg Tyr Leu Leu Asn Cys 
435 440 445 

Ser Phe Phe Gln Ser Leu Pro Asp Tyr Trp Gly Leu Arg Gln Asn Phe 
450 455 460 

Pro Val Met Pro Leu Asn Lys Leu Asp Glu Lys Pro Thr Arg Ser Ala 
465 470 475 480 

Ser Leu Trp Asp Ile Thr Cys Asp Ser Asp Gly Glu Ile Ala Phe Asp 
485 490 495 

Ser Thr Lys Pro Leu Phe Leu His Asp Ile Asp Ile Asp Glu Glu Glu 
500 505 510 

Tyr Phe Leu Ala Phe Phe Leu Val Gly Ala Tyr Gln Glu Val Leu Gly 
515 520 525 

Met Lys His Asn Leu Phe Thr His Leu Arg Asn Leu Ala Trp Phe Leu 
530 535 540 

Met Lys Lys Ala Ile Met Lys Trp Lys Ile Phe Val Lys Pro Lys Arg 
545 550 555 560 

Phe 



(2) INFORMATION FOR SEQ ID NO: 705: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...623 
(D) OTHER INFORMATION: 
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CAC AAT TTA TTC ACG CAC CTA CGG AAT TTA GCG TGG TTT TTG ATG AAA 1688 

His Asn Leu Phe Thr His Leu Arg Asn Leu Ala Trp Phe Leu Met Lys 
535 540 545 

AAG GCG ATT ATG AAG TGG AAG ATA TTT GTG AAG CCC AAA CGA TTT TAGAT 1738 
Lys Ala Ile Met Lys Trp Lys Ile Phe Val Lys Pro Lys Arg Phe 
550 555 560 

GTGCTAGACG ATTTAGACTA TGACACTAAA GAGATCGAGC GCCTTTTA 1786 

(2) INFORMATION FOR SEQ ID NO: 704: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:704: 



Met Gln Glu Val His Asp Tyr Gly Ile Lys Phe Trp Ser Asn Asn Glu 
1 5 10 15 

Phe Lys Ile Glu Lys Gly Leu Val Lys Val Cys His Gly Lys Asn Pro 
20 25 30 

Ser Leu Leu Glu Ile Val Gln Ser Val Arg Asp Lys Gly Tyr Arg Gly 
35 40 45 

Pro Leu Leu Val Arg Phe Pro His Leu Val Gln Lys Gln Ile Lys Ser 
50 55 60 

Leu Phe Asp Ala Phe Ser Ser Ala Ile Lys Glu Tyr Gln Tyr Ser Gly 
65 70 75 80 

Ala Phe Lys Ala Val Phe Pro Leu Lys Val Asn Gln Met Pro Ser Phe 
85 90 95 

Val Phe Pro Leu Val Gln Gly Ala Lys Gly Leu Asn Tyr Gly Leu Glu 
100 105 110 

Ala Gly Ser Lys Ser Glu Leu Ile Ile Ala Met Ser Tyr Thr Asn Pro 
115 120 125 

Lys Ala Pro Ile Thr Val Asn Gly Phe Lys Asp Lys Glu Met Ile Glu 
130 135 140 

Leu Gly Phe Ile Ala Lys Ser Met Gln His Glu Ile Thr Leu Thr Ile 
145 150 155 160 

Glu Gly Leu Asn Glu Leu Lys Thr Ile Ile Ala Val Ala Lys Gln Asn 
165 170 175 

Glu Phe Leu Ala Cys Pro Lys Ile Gly Ile Arg Ile Arg Leu His Ser 
180 185 190 

Thr Gly Thr Gly Val Trp Ala Lys Ser Gly Gly Ile Asn Ser Lys Phe 
195 200 205 

Gly Leu Ser Ser Thr Glu Val Leu Glu Ala Met Arg Leu Leu Glu Glu 
210 215 220 

Asn Asp Leu Leu Glu His Phe His Met Ile His Phe His Ile Gly Ser 
225 230 235 240 

Gln Ile Ser Asp Ile Ser Pro Leu Lys Lys Ala Leu Arg Glu Ala Gly 
245 250 255 








WO 98/43478 PCT/US98/06371 

TTG AGA GAA ATT GTG AAA AAT AAG CAG GAA ATC GAG CCG GAC ATT TTC 1016 
Leu Arg Glu Ile Val Lys Asn Lys Gln Glu Ile Glu Pro Asp Ile Phe 
310 315 320 

ATT GAA TCA GGC CGT TAT ATT TCC GCT AAC CAT GCC GTT TTA GTG GCC 1064 
Ile Glu Ser Gly Arg Tyr Ile Ser Ala Asn His Ala Val Leu Val Ala 
325 330 335 

CCG GTG TTA GAA TTG TTT TCG CAT GAA TAC AAT GAA AAA TCC CTA AAA 1112 
Pro Val Leu Glu Leu Phe Ser His Glu Tyr Asn Glu Lys Ser Leu Lys 
340 345 350 

ATC AAA GAA AAT AAT AAC CCC CCT TTG ATT GAT GAA ATG CTA GAC TTG 1160 
Ile Lys Glu Asn Asn Asn Pro Pro Leu Ile Asp Glu Met Leu Asp Leu 
355 360 365 370 

CTC GCT AAT ATC AAT GAA AAA AAC GCC ATT GAA TAC TTG CAT GAT AGT 1208 
Leu Ala Asn Ile Asn Glu Lys Asn Ala Ile Glu Tyr Leu His Asp Ser 
375 380 385 

TTT GAT CAC ACC GAG TCG CTA TTC ACG CTT TTT GAT CTG GGC TAT ATT 1256 
Phe Asp His Thr Glu Ser Leu Phe Thr Leu Phe Asp Leu Gly Tyr Ile 
390 395 400 

GAT TTG ATT GAC AGG AGC AAC ACT GAA GTT TTA GCC CAT TTG ATC GTC 1304 
Asp Leu Ile Asp Arg Ser Asn Thr Glu Val Leu Ala His Leu Ile Val 
405 410 415 

AAA AAA GCG GTG CAA TTG CTT TAT GTT AAG GAT CAT AAC GAT ATT TTA 1352 
Lys Lys Ala Val Gln Leu Leu Tyr Val Lys Asp His Asn Asp Ile Leu 
420 425 430 

CGC ATT CAA GAG CAG GTC CAA GAG CGC TAT TTA TTG AAT TGC TCG TTT 1400 
Arg Ile Gln Glu Gln Val Gln Glu Arg Tyr Leu Leu Asn Cys Ser Phe 
435 440 445 450 

TTC CAA AGC TTG CCG GAT TAT TGG GGC TTG AGA CAG AAT TTC CCG GTC 1448 
Phe Gln Ser Leu Pro Asp Tyr Trp Gly Leu Arg Gln Asn Phe Pro Val 
455 460 465 

ATG CCC TTG AAT AAA TTA GAT GAA AAG CCC ACC AGG AGT GCG AGC TTG 1496 
Met Pro Leu Asn Lys Leu Asp Glu Lys Pro Thr Arg Ser Ala Ser Leu 
470 475 480 

TGG GAT ATT ACT TGC GAT AGC GAT GGG GAA ATC GCT TTT GAT TCC ACG 1544 
Trp Asp Ile Thr Cys Asp Ser Asp Gly Glu Ile Ala Phe Asp Ser Thr 
485 490 495 

AAG CCC TTG TTT TTG CAC GAT ATA GAT ATA GAT GAA GAA GAA TAC TTT 1592 
Lys Pro Leu Phe Leu His Asp Ile Asp Ile Asp Glu Glu Glu Tyr Phe 
500 505 510 

TTA GCG TTC TTT TTA GTG GGA GCG TAT CAA GAA GTT TTA GGC ATG AAA 1640 
Leu Ala Phe Phe Leu Val Gly Ala Tyr Gln Glu Val Leu Gly Met Lys 
515 520 525 530 


AAG GCG GTT TTC CCT TTA AAA GTC AAT CAA ATG CCC TCG TTT GTT TTC 344 
Lys Ala Val Phe Pro Leu Lys Val Asn Gln Met Pro Ser Phe Val Phe 
85 90 95 

CCT TTA GTG CAG GGG GCT AAG GGT TTG AAT TAC GGA TTA GAG GCT GGG 392 
Pro Leu Val Gln Gly Ala Lys Gly Leu Asn Tyr Gly Leu Glu Ala Gly 
100 105 110 

AGC AAG TCT GAA CTC ATC ATC GCA ATG AGT TAC ACT AAC CCT AAA GCC 440 
Ser Lys Ser Glu Leu Ile Ile Ala Met Ser Tyr Thr Asn Pro Lys Ala 
115 120 125 130 

CCT ATC ACC GTG AAT GGC TTT AAA GAC AAA GAA ATG ATT GAG CTT GGC 488 
Pro Ile Thr Val Asn Gly Phe Lys Asp Lys Glu Met Ile Glu Leu Gly 
135 140 145 

TTT ATC GCT AAA AGC ATG CAG CAT GAG ATC ACT TTA ACG ATT GAG GGT 536 
Phe Ile Ala Lys Ser Met Gln His Glu Ile Thr Leu Thr Ile Glu Gly 
150 155 160 

TTG AAT GAA TTG AAA ACC ATT ATC GCC GTG GCT AAA CAA AAC GAG TTT 584 
Leu Asn Glu Leu Lys Thr Ile Ile Ala Val Ala Lys Gln Asn Glu Phe 
165 170 175 

TTA GCC TGC CCT AAA ATT GGC ATC CGC ATC CGT TTG CAC AGC ACT GGC 632 
Leu Ala Cys Pro Lys Ile Gly Ile Arg Ile Arg Leu His Ser Thr Gly 
180 185 190 

ACT GGC GTT TGG GCA AAG AGT GGG GGG ATC AAT TCT AAA TTT GGT CTT 680 
Thr Gly Val Trp Ala Lys Ser Gly Gly Ile Asn Ser Lys Phe Gly Leu 
195 200 205 210 

AGC AGC ACT GAA GTT TTA GAG GCG ATG CGC CTT TTA GAA GAA AAC GAC 728 
Ser Ser Thr Glu Val Leu Glu Ala Met Arg Leu Leu Glu Glu Asn Asp 
215 220 225 

TTG TTA GAG CAT TTC CAC ATG ATA CAT TTC CAT ATA GGC TCT CAA ATC 776 
Leu Leu Glu His Phe His Met Ile His Phe His Ile Gly Ser Gln Ile 
230 235 240 

AGC GAT ATT TCG CCC TTA AAA AAG GCT TTA AGA GAA GCG GGT AAC TTG 824 
Ser Asp Ile Ser Pro Leu Lys Lys Ala Leu Arg Glu Ala Gly Asn Leu 
245 250 255 

TAT GCA GAA TTG CGT AAA ATG GGC GCT AAA AAT CTT AAT AGC GTG AAT 872 
Tyr Ala Glu Leu Arg Lys Met Gly Ala Lys Asn Leu Asn Ser Val Asn 
260 265 270 

ATT GGA GGG GGG TTA GCC GTA GAA TAC ACC CAA CAC AAG CAC CAC CAA 920 
Ile Gly Gly Gly Leu Ala Val Glu Tyr Thr Gln His Lys His His Gln 
275 280 285 290 

GAC AAA AAC TAC ACT TTA GAG GAA TTC AGC GCT GAT GTG GTG TTT TTA 968 
Asp Lys Asn Tyr Thr Leu Glu Glu Phe Ser Ala Asp Val Val Phe Leu 
295 300 305 
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Val His Leu Leu Ile Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu 
65 70 75 80 

Val Asn Ser Leu Lys Gly Val Ser Ser Arg Leu Thr Arg Gln His His 
85 90 95 

Phe Lys Ser Val Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro 
100 105 110 

Ser Tyr Phe Ala Gly Ser Cys Gly Asp Ala Pro Leu Glu Met Ile Lys 
115 120 125 

Gln Tyr Ile Gln Asp Gln Glu Thr Pro His 
130 135 

(2) INFORMATION FOR SEQ ID NO: 703: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...1733 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:703: 

AAAAATAACC CCCATCTCTT TAAAACCTTA TCATAATGAA AGGATAAAAA ATG CAA 56 
Met Gln 
1 

GAA GTC CAT GAT TAT GGG ATT AAA TTT TGG AGC AAT AAC GAA TTT AAG 104 
Glu Val His Asp Tyr Gly Ile Lys Phe Trp Ser Asn Asn Glu Phe Lys 
5 10 15 

ATA GAA AAA GGC TTG GTT AAA GTC TGT CAT GGT AAA AAC CCC TCG CTT 152 
Ile Glu Lys Gly Leu Val Lys Val Cys His Gly Lys Asn Pro Ser Leu 
20 25 30 

TTA GAA ATC GTT CAA AGC GTG CGC GAT AAG GGC TAT AGA GGA CCT TTG 200 
Leu Glu Ile Val Gln Ser Val Arg Asp Lys Gly Tyr Arg Gly Pro Leu 
35 40 45 50 

TTG GTG CGA TTC CCC CAT TTG GTG CAA AAA CAA ATC AAA AGC CTG TTT 248 
Leu Val Arg Phe Pro His Leu Val Gln Lys Gln Ile Lys Ser Leu Phe 
55 60 65 

GAT GCG TTT TCT TCA GCG ATT AAA GAG TAT CAA TAC AGC GGG GCT TTT 296 
Asp Ala Phe Ser Ser Ala Ile Lys Glu Tyr Gln Tyr Ser Gly Ala Phe 
70 75 80 
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GTG CAT TTT GTA TTT GTT ACT AAA TAC AGG CGT TCA GCA TTC AAT AAG 152 
Val His Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe Asn Lys 
20 25 30 

GAA GTG ATA GAT TTT TTA GGA TCG GTG TTT GCC AAA GTG TGT AAG GAC 200 
Glu Val Ile Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys Lys Asp 
35 40 45 50 

TTT GAG AGC GAA TTG GTA GAA TTT GAT GGG GAG AGC GAT CAT GTG CAT 248 
Phe Glu Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His Val His 
55 60 65 

TTG CTT ATC AAC TAC CCT CCA AAA GTG AGC GTG AGT AAG TTA GTT AAT 296 
Leu Leu Ile Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu Val Asn 
70 75 80 

TCT TTA AAA GGC GTT AGC AGT CGT TTG ACT AGA CAA CAC CAT TTC AAA 344 
Ser Leu Lys Gly Val Ser Ser Arg Leu Thr Arg Gln His His Phe Lys 
85 90 95 

AGC GTT GAA GCT AGT TTG TGG GGG AAG CAT TTA TGG TCG CCT AGT TAT 392 
Ser Val Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro Ser Tyr 
100 105 110 

TTC GCT GGG AGT TGT GGG GAC GCG CCT TTA GAG ATG ATT AAG CAA TAC 440 
Phe Ala Gly Ser Cys Gly Asp Ala Pro Leu Glu Met Ile Lys Gln Tyr 
115 120 125 130 

ATA CAA GAT CAA GAA ACA CCG CAT TAAATTAGCT AACTTTGATT TTTAAGTAGA 494 
Ile Gln Asp Gln Glu Thr Pro His 
135 

ACGCGCTAAA AAGCGAATGG ATC 517 
(2) INFORMATION FOR SEQ ID NO: 702: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702: 

Met Lys Lys Ile Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu 
1 5 10 15 

Met His Val His Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe 
20 25 30 

Asn Lys Glu Val Ile Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys 
35 40 45 

Lys Asp Phe Glu Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His 
50 55 60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700: 

Met Gln Asp Thr Phe Tyr Phe Lys Asp His Lys Leu Leu Arg Thr His 
1 5 10 15 

Thr Ser Pro Val Gln Ile His Thr Met Gln Glu Gln Thr Pro Pro Ile 
20 25 30 

Lys Met Ile Cys Leu Gly Glu Thr Phe Arg Arg Asp Tyr Asp Leu Thr 
35 40 45 

His Thr Pro Met Phe His Gln Ile Glu Gly Leu Val Val Asp Gln Lys 
50 55 60 

Gly Asn Ile Arg Phe Thr His Leu Lys Gly Val Ile Glu Asp Phe Leu 
65 70 75 80 

His Tyr Phe Phe Gly Gly Val Lys Leu Arg Trp Arg Ser Ser Phe Phe 
85 90 95 

Pro Phe Thr Glu Pro Ser Ala Glu Val Asp Ile Ser Cys Val Phe Cys 
100 105 110 

Lys Gln Glu Gly Cys Arg Val Cys Ser His Thr Gly Trp Leu Glu Val 
115 120 125 

Leu Gly Cys Gly Met Val Asn Asn Ala Val Phe Glu Ala Ile Gly Tyr 
130 135 140 

Glu Asn Val Ser Gly Phe Ala Phe Gly Met Gly Ile Glu Arg Leu Ala 
145 150 155 160 

Met Leu Thr Cys Gln Ile Asn Asp Leu Arg Ser Phe Phe Glu Thr Asp 
165 170 175 

Leu Arg Val Leu Glu Ser Phe 
180 

(2) INFORMATION FOR SEQ ID NO: 701: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...464 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701: 

TATTGACTTT CATAGAAAGT ATTTTAACCT CTTTTTGTTA AAATAGGTCT ATG AAA 56 
Met Lys 
1 

AAA ATT GAT GAT ATG AGA CAC GGA AGA CAT TGT GTT TTT TTA ATG CAT 104 
Lys Ile Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu Met His 
5 10 15 
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Arg Thr His Thr Ser Pro Val Gln Ile His Thr Met Gln Glu Gln Thr 
15 20 25 

CCA CCC ATT AAG ATG ATT TGT TTA GGC GAA ACC TTT AGG CGC GAT TAT 626 
Pro Pro Ile Lys Met Ile Cys Leu Gly Glu Thr Phe Arg Arg Asp Tyr 
30 35 40 45 

GAT TTG ACC CAC ACG CCC ATG TTC CAC CAA ATT GAA GGG CTT GTC GTG 674 
Asp Leu Thr His Thr Pro Met Phe His Gln Ile Glu Gly Leu Val Val 
50 55 60 

GAT CAA AAA GGG AAT ATC CGT TTC ACA CAT TTA AAA GGT GTG ATC GAA 722 
Asp Gln Lys Gly Asn Ile Arg Phe Thr His Leu Lys Gly Val Ile Glu 
65 70 75 

GAC TTT TTG CAT TAT TTC TTT GGG GGC GTG AAG TTA AGG TGG CGC TCT 770 
Asp Phe Leu His Tyr Phe Phe Gly Gly Val Lys Leu Arg Trp Arg Ser 
80 85 90 

AGC TTT TTC CCT TTC ACA GAG CCA AGC GCT GAA GTG GAT ATT AGT TGC 818 
Ser Phe Phe Pro Phe Thr Glu Pro Ser Ala Glu Val Asp Ile Ser Cys 
95 100 105 

GTG TTT TGC AAG CAA GAA GGC TGT AGG GTT TGC TCG CAC ACA GGC TGG 866 
Val Phe Cys Lys Gln Glu Gly Cys Arg Val Cys Ser His Thr Gly Trp 
110 115 120 125 

TTA GAA GTG TTG GGC TGT GGC ATG GTC AAT AAT GCG GTG TTT GAA GCC 914 
Leu Glu Val Leu Gly Cys Gly Met Val Asn Asn Ala Val Phe Glu Ala 
130 135 140 

ATA GGG TAT GAG AAT GTG AGC GGG TTT GCT TTT GGC ATG GGG ATT GAA 962 
Ile Gly Tyr Glu Asn Val Ser Gly Phe Ala Phe Gly Met Gly Ile Glu 
145 150 155 

AGA TTA GCC ATG CTG ACT TGC CAG ATC AAT GAT TTG CGC AGT TTC TTT 1010 
Arg Leu Ala Met Leu Thr Cys Gln Ile Asn Asp Leu Arg Ser Phe Phe 
160 165 170 

GAA ACT GAT TTG AGA GTG TTG GAG AGC TTT TAATGAAACT GAGCATTAAT GAT 1063 
Glu Thr Asp Leu Arg Val Leu Glu Ser Phe 
175 180 

TTGAATGTTT TTGTCAATAC GCCTAAAGAT ATAG 1097 

(2) INFORMATION FOR SEQ ID NO: 700: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
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595 600 605 

Glu Lys Leu Thr Thr Gln Thr Pro Ile Asn His Pro Tyr Gln Ser Ala 
610 615 620 

Lys Ile Ile Gln Asn His Glu Ile Ile Gly Val Ile Ala Lys Ile His 
625 630 635 640 

Pro Lys Val Ile Gln Glu Leu Asp Leu Phe Glu Ser Tyr Tyr Ala Glu 
645 650 655 

Ile Asp Ala Phe Lys Leu Lys Arg Pro Ala Met Leu Leu Lys Pro Phe 
660 665 670 

Ser Ile Tyr Pro Ser Ser Val Arg Asp Leu Thr Leu Ile Ile Asp Glu 
675 680 685 

Asn Thr Ala Phe Ser Gly Ile Lys Lys Ala Leu Lys Asp Ala Gln Ile 
690 695 700 

Pro Asn Leu Ser Glu Ile Leu Pro Leu Asp Ile Phe Lys Glu Ser Asn 
705 710 715 720 

Asn Ser Ile Ala Leu Ser Val Arg Cys Val Ile His Ser Leu Glu Lys 
725 730 735 

Thr Leu Asn Asp Glu Glu Val Asn Ser Ala Val Gln Lys Ala Leu Glu 
740 745 750 

Ile Leu Glu Lys Glu Phe Asn Ala Arg Leu Lys Gly 
755 760 



(2) INFORMATION FOR SEQ ID NO: 699: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 492...1040 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699: 

TTTCTTTTAT TAAAGTTGTA TTATAGCTTT TTAAAAATAA AATATAAGGA TCGTTATTGC 60 

ACACCTTAAT AGAGCGATTA GAAAAGGTTA CTAATAGCAA AGAGTTAGAA GAAGCGCGCT 120 

TGAATGCTTT GGGTAAAAAA GGGGTTTTTG CGGATAAATT CAACCAGCTC AAACATCTGA 180 

ACGGCGAAGA AAAAAACGCC TTTGCTAAAG AAATCCACCA TTATAAACAA GCGTTTGAAA 240 

AAGCCTTTGA ATGGAAAAAA AAGGCTATTA TAGAGCTTGA ATTAGAAGAA CGCTTGAAAA 300 

AAGAAAAAAT TGATGTGAGC TTGTTTAACG CTATCAAAAC AAGCTCTTCT CACCCTTTAA 360 

ACTACACTAA AAATAAAATC ATTGAATTTT TCACCCCATT AGGATACAAG CTTGAAATCG 420 

GCTCTTTAGT GGAAGATGAT TTCCATAATT TCAGCGCTTT AAACTTGCCC CCTTACCATC 480 

CTGCAAGAGA C ATG CAA GAC ACT TTT TAT TTT AAA GAT CAC AAG CTT TTA 530 
Met Gln Asp Thr Phe Tyr Phe Lys Asp His Lys Leu Leu 
1 5 10 

AGG ACC CAC ACT TCG CCC GTG CAA ATC CAC ACC ATG CAA GAA CAA ACC 578 
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Ser Leu Thr Pro Asn Arg Gly Asp Cys Leu Ser Val Leu Gly Ile Ala 
165 170 175 

Arg Glu Ile Ser Ala Phe Tyr His Thr Pro Leu Lys Pro Ile Lys Ala 
180 185 190 

Leu Asn Phe Thr Pro Lys Ser Gly Leu Ile Thr Leu Ser Ala Gly Glu 
195 200 205 

Asn Ile Glu Ser His Leu Ala Tyr Tyr Leu Ile Cys Asn His Ser Leu 
210 215 220 

Lys Thr Pro Leu Asn Ile Lys Leu Ser Leu Ala His Asn Asn Ala Leu 
225 230 235 240 

Ser Glu Asn Asp Leu Asn Asn Phe Ile Glu Phe Ser Thr His Phe Ser 
245 250 255 

Gly Val Ile Met Asn Ala Tyr Ser Leu Asn Thr Thr Pro Met Asp Leu 
260 265 270 

Ser Val Lys Asn Asp Glu Asn Asn Leu Glu Ser Val Tyr Ile Asn His 
275 280 285 

Gln Lys Arg Ser Thr Ile Ala Ile Lys His Gln Val Gln Lys Asp Leu 
290 295 300 

Ser Glu Cys Leu Leu Leu Glu Ala Ser Tyr Thr Asp Pro Ile Ser Leu 
305 310 315 320 

Ser Leu Lys Leu His Ala Leu Lys Asp Lys Thr Leu Gln Lys Asp Asn 
325 330 335 

Ala Leu Ile Tyr Arg Ser Ala Arg Gly Ser Asn Pro Asn Leu Ser Asp 
340 345 350 

Gly Leu Asn Phe Leu Ser Ala His Leu Lys Ala Thr Ile Leu Glu Ser 
355 360 365 

Lys Gln Thr Glu His Ser Leu Lys Asp Arg Thr Leu Thr Phe Gln Leu 
370 375 380 

Glu Asp Ile Thr Glu Ile Leu Gly Leu Ala Val Glu Lys Glu Lys Ile 
385 390 395 400 

Gln Gly Ile Leu Lys Asn Leu Gly Phe Lys Val Ser Val Lys Glu Pro 
405 410 415 

Asn Ser Lys Pro Gln Ile Leu Glu Val Ile Ala Pro Asn Phe Arg His 
420 425 430 

Asp Ile Lys Thr Ile Gln Asp Ile Ala Glu Glu Ile Leu Arg Phe Val 
435 440 445 

Gly Ile Asp Asn Leu Val Ser Lys Pro Leu His Cys Val Ser Ser Lys 
450 455 460 

Asn Ser Asn Pro Asn Tyr Asp Thr His Arg Phe Phe Glu Asn Leu Lys 
465 470 475 480 

His Lys Ala Leu Ala Cys Gly Phe Lys Glu Val Ile His Tyr Val Phe 
485 490 495 

Tyr Ser Lys Glu Lys Gln Gln Lys Leu Gly Phe Glu Val Leu Glu Asp 
500 505 510 

Pro Leu Glu Leu Gln Asn Pro Ile Thr Thr Glu Leu Asn Thr Leu Arg 
515 520 525 

Thr Ser Leu Val Cys Gly Leu Leu Asp Ala Ser Leu Arg Asn Lys Asn 
530 535 540 

Leu Gly Phe Lys Ser Ile Ala Leu Tyr Glu Lys Gly Ser Val Tyr Asn 
545 550 555 560 

Ser Lys Arg Glu Glu Ile Gln Lys Leu Gly Phe Leu Ile Ser Gly Leu 
565 570 575 

Gln Lys Lys Glu Ser Tyr Pro Asp Thr Lys Gly Lys Ala Trp Asp Phe 
580 585 590 

Tyr Ser Phe Ala Glu Cys Val Ser Lys Val Ile Gly Asp Phe Ser Leu 
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CTA AAG GAC GCT CAA ATC CCT AAT TTA AGC GAG ATT CTA CCC CTT GAT 2223 

Leu Lys Asp Ala Gln Ile Pro Asn Leu Ser Glu Ile Leu Pro Leu Asp 
700 705 710 

ATT TTT AAA GAA AGT AAT AAT TCC ATA GCC TTA AGC GTG CGT TGC GTG 2271 
Ile Phe Lys Glu Ser Asn Asn Ser Ile Ala Leu Ser Val Arg Cys Val 
715 720 725 730 

ATC CAT TCT TTA GAA AAA ACC CTG AAT GAT GAA GAG GTC AAT TCA GCC 2319 
Ile His Ser Leu Glu Lys Thr Leu Asn Asp Glu Glu Val Asn Ser Ala 
735 740 745 

GTG CAA AAA GCA CTT GAA ATT TTA GAA AAA GAA TTT AAC GCC CGC CTT 2367 
Val Gln Lys Ala Leu Glu Ile Leu Glu Lys Glu Phe Asn Ala Arg Leu 
750 755 760 

AAA GGA TAATATAAAG GATAATATGT GATAGAGCTT GACATTAACG CTAGCGATAA AT 2425 
Lys Gly 

CGCTCTCACA CAG 2438 

(2) INFORMATION FOR SEQ ID NO: 698: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698: 

Met Lys Leu Ser Ile Asn Asp Leu Asn Val Phe Val Asn Thr Pro Lys 
1 5 10 15 

Asp Ile Ala Lys Leu Cys Glu Asp Leu Ser Arg Leu Gly Leu Glu Val 
20 25 30 

Glu Ser Cys Ile Pro Cys Ile Ala Pro Lys Asn Val Val Val Gly Lys 
35 40 45 

Ile Leu Glu Lys Ala Pro His Lys Asn Ala Glu Lys Leu Ser Val Cys 
50 55 60 

Gln Val Asp Val Gly Lys Glu Val Leu Gln Ile Val Cys Gly Ala Lys 
65 70 75 80 

Asn Val Ala Pro Asn Gln Phe Val Pro Val Ala Leu Asn Gly Ala Leu 
85 90 95 

Ile Gly Ser Thr Thr Ile Ala Lys Thr Glu Leu Arg Gly Val Glu Ser 
100 105 110 

His Gly Met Ile Cys Ser Ser Ile Glu Leu Gly Phe Pro Lys Ile Asn 
115 120 125 

Asp Gly Ile Leu Glu Leu Asp Glu Ser Val Gly Glu Leu Val Leu Gly 
130 135 140 

Lys Glu Leu Asn Glu Tyr Ala Pro Phe Asn Thr His Val Leu Glu Ile 
145 150 155 160 
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TTT TTT GAA AAC CTT AAA CAC AAG GCT CTC GCT TGC GGT TTT AAA GAA 1551 
Phe Phe Glu Asn Leu Lys His Lys Ala Leu Ala Cys Gly Phe Lys Glu 
475 480 485 490 

GTC ATT CAT TAC GTG TTT TAC TCT AAA GAA AAA CAG CAA AAA TTA GGC 1599 
Val Ile His Tyr Val Phe Tyr Ser Lys Glu Lys Gln Gln Lys Leu Gly 
495 500 505 

TTT GAA GTT TTA GAA GAT CCC CTA GAA TTG CAA AAC CCT ATC ACA ACG 1647 
Phe Glu Val Leu Glu Asp Pro Leu Glu Leu Gln Asn Pro Ile Thr Thr 
510 515 520 

GAG TTA AAC ACC CTA AGG ACG AGT CTT GTT TGC GGG CTT TTA GAC GCC 1695 
Glu Leu Asn Thr Leu Arg Thr Ser Leu Val Cys Gly Leu Leu Asp Ala 
525 530 535 

AGT TTA AGG AAT AAA AAT TTA GGG TTT AAA AGC ATA GCC CTT TAT GAA 1743 
Ser Leu Arg Asn Lys Asn Leu Gly Phe Lys Ser Ile Ala Leu Tyr Glu 
540 545 550 

AAG GGG AGC GTG TAT AAC TCT AAA AGA GAA GAA ATC CAA AAA CTA GGC 1791 
Lys Gly Ser Val Tyr Asn Ser Lys Arg Glu Glu Ile Gln Lys Leu Gly 
555 560 565 570 

TTT TTA ATA AGC GGC TTG CAA AAA AAA GAA AGC TAC CCT GAT ACT AAG 1839 
Phe Leu Ile Ser Gly Leu Gln Lys Lys Glu Ser Tyr Pro Asp Thr Lys 
575 580 585 

GGC AAG GCT TGG GAT TTT TAC TCT TTT GCC GAA TGC GTT TCA AAA GTT 1887 
Gly Lys Ala Trp Asp Phe Tyr Ser Phe Ala Glu Cys Val Ser Lys Val 
590 595 600 

ATA GGG GAT TTC AGC TTG GAA AAA CTA ACC ACT CAA ACC CCC ATT AAC 1935 
Ile Gly Asp Phe Ser Leu Glu Lys Leu Thr Thr Gln Thr Pro Ile Asn 
605 610 615 

CAC CCC TAC CAG AGC GCT AAA ATC ATT CAA AAT CAT GAA ATC ATA GGC 1983 
His Pro Tyr Gln Ser Ala Lys Ile Ile Gln Asn His Glu Ile Ile Gly 
620 625 630 

GTG ATC GCT AAA ATC CAC CCT AAA GTG ATC CAG GAA TTG GAT TTG TTT 2031 
Val Ile Ala Lys Ile His Pro Lys Val Ile Gln Glu Leu Asp Leu Phe 
635 640 645 650 

GAA AGC TAT TAC GCT GAG ATA GAC GCT TTT AAA CTC AAA CGC CCT GCT 2079 
Glu Ser Tyr Tyr Ala Glu Ile Asp Ala Phe Lys Leu Lys Arg Pro Ala 
655 660 665 

ATG CTA TTA AAA CCC TTT AGC ATT TAT CCT AGC AGT GTT AGG GAT TTG 2127 
Met Leu Leu Lys Pro Phe Ser Ile Tyr Pro Ser Ser Val Arg Asp Leu 
670 675 680 

ACT CTC ATC ATT GAT GAG AAT ACC GCT TTT AGT GGG ATT AAA AAA GCC 2175 
Thr Leu Ile Ile Asp Glu Asn Thr Ala Phe Ser Gly Ile Lys Lys Ala 
685 690 695 
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Phe Ser Thr His Phe Ser Gly Val Ile Met Asn Ala Tyr Ser Leu Asn 
255 260 265 

ACA ACC CCT ATG GAT TTG AGC GTG AAA AAC GAT GAA AAC AAC CTT GAA 927 
Thr Thr Pro Met Asp Leu Ser Val Lys Asn Asp Glu Asn Asn Leu Glu 
270 275 280 

AGC GTT TAT ATC AAC CAT CAA AAA CGC TCC ACG ATC GCT ATC AAG CAT 975 
Ser Val Tyr Ile Asn His Gln Lys Arg Ser Thr Ile Ala Ile Lys His 
285 290 295 

CAA GTT CAA AAA GAT TTG AGC GAG TGT TTG CTT TTA GAG GCA AGT TAC 1023 
Gln Val Gln Lys Asp Leu Ser Glu Cys Leu Leu Leu Glu Ala Ser Tyr 
300 305 310 

ACC GAT CCG ATA AGC CTG TCT TTA AAA TTA CAC GCC CTA AAA GAT AAA 1071 
Thr Asp Pro Ile Ser Leu Ser Leu Lys Leu His Ala Leu Lys Asp Lys 
315 320 325 330 

ACG CTT CAA AAA GAC AAC GCC CTT ATT TAT AGA AGC GCT AGG GGG AGT 1119 
Thr Leu Gln Lys Asp Asn Ala Leu Ile Tyr Arg Ser Ala Arg Gly Ser 
335 340 345 

AAC CCT AAT TTA TCA GAC GGC TTG AAT TTT TTA AGC GCT CAT TTG AAA 1167 
Asn Pro Asn Leu Ser Asp Gly Leu Asn Phe Leu Ser Ala His Leu Lys 
350 355 360 

GCC ACG ATT TTA GAA AGC AAA CAA ACT GAG CAT TCT TTA AAA GAT CGC 1215 
Ala Thr Ile Leu Glu Ser Lys Gln Thr Glu His Ser Leu Lys Asp Arg 
365 370 375 

ACC CTT ACA TTC CAG CTT GAA GAC ATT ACT GAA ATT TTG GGG CTT GCT 1263 
Thr Leu Thr Phe Gln Leu Glu Asp Ile Thr Glu Ile Leu Gly Leu Ala 
380 385 390 

GTA GAG AAA GAA AAA ATT CAA GGC ATT TTA AAA AAT TTA GGC TTT AAA 1311 
Val Glu Lys Glu Lys Ile Gln Gly Ile Leu Lys Asn Leu Gly Phe Lys 
395 400 405 410 

GTC AGC GTA AAA GAG CCA AAC TCA AAA CCC CAA ATT TTA GAG GTT ATT 1359 
Val Ser Val Lys Glu Pro Asn Ser Lys Pro Gln Ile Leu Glu Val Ile 
415 420 425 

GCG CCA AAT TTC AGG CAT GAC ATT AAA ACG ATC CAA GAT ATT GCT GAA 1407 
Ala Pro Asn Phe Arg His Asp Ile Lys Thr Ile Gln Asp Ile Ala Glu 
430 435 440 

GAA ATT TTG CGC TTT GTA GGG ATT GAT AAT CTA GTC TCA AAG CCC CTT 1455 
Glu Ile Leu Arg Phe Val Gly Ile Asp Asn Leu Val Ser Lys Pro Leu 
445 450 455 

CAT TGT GTC AGT AGC AAA AAT TCA AAC CCC AAT TAC GAC ACG CAC CGC 1503 
His Cys Val Ser Ser Lys Asn Ser Asn Pro Asn Tyr Asp Thr His Arg 
460 465 470 
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30 35 40 

AAT GTG GTT GTG GGT AAA ATT TTA GAA AAA GCC CCC CAT AAA AAC GCT 255 
Asn Val Val Val Gly Lys Ile Leu Glu Lys Ala Pro His Lys Asn Ala 
45 50 55 

GAA AAA CTC AGC GTG TGT CAA GTG GAT GTG GGT AAA GAA GTG TTG CAA 303 
Glu Lys Leu Ser Val Cys Gln Val Asp Val Gly Lys Glu Val Leu Gln 
60 65 70 

ATC GTG TGT GGG GCT AAA AAT GTC GCG CCA AAC CAA TTC GTG CCA GTC 351 
Ile Val Cys Gly Ala Lys Asn Val Ala Pro Asn Gln Phe Val Pro Val 
75 80 85 90 

GCT TTA AAC GGG GCG CTA ATC GGC TCA ACC ACC ATC GCT AAA ACG GAG 399 
Ala Leu Asn Gly Ala Leu Ile Gly Ser Thr Thr Ile Ala Lys Thr Glu 
95 100 105 

CTT AGG GGG GTT GAA AGC CAT GGC ATG ATT TGC TCT AGC ATT GAA TTA 447 
Leu Arg Gly Val Glu Ser His Gly Met Ile Cys Ser Ser Ile Glu Leu 
110 115 120 

GGC TTC CCT AAA ATC AAT GAT GGC ATC TTG GAA TTA GAT GAG AGC GTT 495 
Gly Phe Pro Lys Ile Asn Asp Gly Ile Leu Glu Leu Asp Glu Ser Val 
125 130 135 

GGG GAG TTG GTT TTA GGG AAA GAA TTA AAC GAA TAC GCC CCT TTC AAC 543 
Gly Glu Leu Val Leu Gly Lys Glu Leu Asn Glu Tyr Ala Pro Phe Asn 
140 145 150 

ACG CAT GTT TTA GAA ATT TCA TTG ACT CCC AAT CGT GGG GAT TGC TTG 591 
Thr His Val Leu Glu Ile Ser Leu Thr Pro Asn Arg Gly Asp Cys Leu 
155 160 165 170 

AGC GTT TTA GGT ATT GCC AGA GAA ATT AGC GCC TTT TAT CAC ACG CCC 639 
Ser Val Leu Gly Ile Ala Arg Glu Ile Ser Ala Phe Tyr His Thr Pro 
175 180 185 

CTA AAG CCT ATT AAG GCT TTA AAT TTT ACG CCC AAA AGC GGT TTG ATC 687 
Leu Lys Pro Ile Lys Ala Leu Asn Phe Thr Pro Lys Ser Gly Leu Ile 
190 195 200 

ACG CTT AGT GCG GGT GAA AAT ATT GAA TCG CAT CTG GCT TAT TAT TTG 735 
Thr Leu Ser Ala Gly Glu Asn Ile Glu Ser His Leu Ala Tyr Tyr Leu 
205 210 215 

ATT TGC AAC CAT TCA TTA AAA ACC CCT TTA AAT ATC AAA CTT TCG CTC 783 
Ile Cys Asn His Ser Leu Lys Thr Pro Leu Asn Ile Lys Leu Ser Leu 
220 225 230 

GCT CAT AAT AAT GCC TTG AGT GAG AAC GAT CTG AAC AAT TTC ATA GAA 831 
Ala His Asn Asn Ala Leu Ser Glu Asn Asp Leu Asn Asn Phe Ile Glu 
235 240 245 250 

TTT AGC ACG CAT TTT AGT GGG GTA ATA ATG AAC GCT TAT AGC CTA AAT 879 
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435 440 445 

Ala Lys Val Ser Leu Ala Val Val Leu Gly Ile Asp Asn Ile Ile Lys 
450 455 460 

Ser Asn Ser Tyr Arg Ala Leu Glu Glu Gly Val Ser Leu Leu Tyr Gln 
465 470 475 480 

Ile Ala Gly Arg Ser Ala Arg Gln Ile Ser Gly Gln Val Phe Ile Gln 
485 490 495 

Ser Thr Glu Thr Asp Leu Leu Glu Asn Phe Leu Glu Asp Tyr Glu Asp 
500 505 510 

Phe Leu Gln Tyr Glu Leu Gln Glu Arg Cys Glu Leu Tyr Pro Pro Phe 
515 520 525 

Ser Arg Leu Cys Leu Leu Glu Phe Lys His Lys Asn Glu Glu Lys Ala 
530 535 540 

Gln Gln Leu Ser Leu Lys Ala Ser Gln Thr Leu Ser Ser Cys Leu Glu 
545 550 555 560 

Lys Gly Val Thr Leu Ser Asn Phe Lys Ala Pro Ile Glu Lys Ile Ala 
565 570 575 

Ser Ser Tyr Arg Tyr Leu Ile Leu Leu Arg Ser Lys Asn Pro Leu Ser 
580 585 590 

Leu Ile Lys Ser Val His Ala Phe Leu Lys Ser Ala Pro Ser Ile Pro 
595 600 605 

Cys Ser Val Asn Met Asp Pro Val Asp Ile Phe 
610 615 



(2) INFORMATION FOR SEQ ID NO: 697:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2438 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 82...2373
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697:

AAGATTAGCC ATGCTGACTT GCCAGATCAA TGATTTGCGC AGTTTCTTTG AAACTGATTT 60
GAGAGTGTTG GAGAGCTTTT A ATG AAA CTG AGC ATT AAT GAT TTG AAT GTT 111
Met Lys Leu Ser Ile Asn Asp Leu Asn Val
1 5 10

TTT GTC AAT ACG CCT AAA GAT ATA GCC AAA CTC TGT GAG GAT TTG AGT 159
Phe Val Asn Thr Pro Lys Asp Ile Ala Lys Leu Cys Glu Asp Leu Ser
15 20 25

CGC TTA GGT TTA GAA GTG GAA AGC TGT ATC CCT TGT ATC GCT CCT AAA 207
Arg Leu Gly Leu Glu Val Glu Ser Cys Ile Pro Cys Ile Ala Pro Lys
Met Phe Tyr His Leu Ile Ala Pro Leu Lys Asn Lys Thr Pro Pro Leu
1 5 10 15

Thr Tyr Phe Ser Lys Glu Gln His Gln Lys Gly Ala Leu Val Asn Ile
20 25 30

Pro Leu Arg Asn Lys Thr Leu Leu Gly Val Val Leu Glu Glu Val Ser
35 40 45

Lys Pro Ser Phe Glu Cys Leu Glu Leu Glu Lys Thr Pro Tyr Phe Leu
50 55 60

Leu Pro Phe Gln Met Glu Leu Ala Ile Phe Ile Ala Gln Tyr Tyr Ser
65 70 75 80

Ala Asn Leu Ser Ser Val Leu Ser Leu Phe Ala Pro Phe Lys Glu Cys
85 90 95

Asp Leu Val Gly Leu Glu Lys Ile Glu Pro Ile Leu Asn Ile Leu Ser
100 105 110

Gln Thr Gln Thr Asn Ala Leu Lys Glu Leu Gln Lys His Ser Ala Ser
115 120 125

Leu Leu Phe Gly Asp Thr Gly Ser Gly Lys Thr Glu Ile Tyr Met His
130 135 140

Ala Ile Ala Gln Thr Leu Glu Gln Lys Lys Ser Ala Leu Leu Leu Val
145 150 155 160

Pro Glu Ile Ala Leu Thr Pro Gln Met Gln Gln Arg Leu Lys Arg Val
165 170 175

Phe Lys Glu Asn Leu Gly Leu Trp His Ser Lys Leu Ser Gln Asn Gln
180 185 190

Lys Lys Gln Phe Leu Glu Lys Leu Tyr Ser Gln Glu Ile Lys Leu Val
195 200 205

Val Gly Thr Arg Ser Ala Leu Phe Leu Pro Leu Lys Glu Leu Gly Leu
210 215 220

Ile Ile Val Asp Glu Glu His Asp Phe Ser Tyr Lys Ser His Gln Ser
225 230 235 240

Pro Met Tyr Asn Ala Arg Asp Leu Cys Leu Tyr Leu Ser His Lys Phe
245 250 255

Pro Ile Gln Val Ile Leu Gly Ser Ala Thr Pro Ser Leu Asn Ser Tyr
260 265 270

Lys Arg Phe Lys Asp Lys Ala Leu Val Arg Leu Lys Gly Arg Tyr Thr
275 280 285

Pro Thr Gln Lys Asn Ile Ile Phe Glu Lys Thr Glu Arg Phe Ile Thr
290 295 300

Pro Lys Leu Leu Glu Ala Leu Gln Gln Val Leu Asp Lys Asn Glu Gln
305 310 315 320

Ala Ile Ile Phe Val Pro Thr Arg Ala Asn Phe Lys Thr Leu Leu Cys
325 330 335

Gln Ser Cys Tyr Lys Ser Val Gln Cys Pro Phe Cys Ser Val Asn Met
340 345 350

Ser Leu His Leu Lys Thr Asn Lys Leu Met Cys His Tyr Cys His Phe
355 360 365

Ser Ser Pro Ile Pro Lys Ile Cys Ser Ala Cys Gln Ser Glu Val Leu
370 375 380

Val Gly Lys Arg Ile Gly Thr Met Gln Val Leu Lys Glu Leu Glu Ser
385 390 395 400

Leu Leu Glu Gly Ala Lys Ile Ala Ile Leu Asp Lys Asp His Thr Ser
405 410 415

Thr Gln Lys Lys Leu His Asn Ile Leu Asn Asp Phe Asn Ala Gln Lys
420 425 430

Thr Asn Ile Leu Ile Gly Thr Gln Met Ile Ser Lys Gly His Asp Tyr
AGT TAT AGG GCT TTA GAA GAA GGC GTG TCG TTA CTT TAT CAA ATC GCT 1496
Ser Tyr Arg Ala Leu Glu Glu Gly Val Ser Leu Leu Tyr Gln Ile Ala
470 475 480

GGG AGG AGC GCT AGG CAA ATT TCT GGC CAA GTG TTC ATT CAA AGC ACC 1544
Gly Arg Ser Ala Arg Gln Ile Ser Gly Gln Val Phe Ile Gln Ser Thr
485 490 495

GAA ACC GAT CTG TTA GAA AAT TTC TTA GAA GAT TAT GAA GAT TTT TTA 1592
Glu Thr Asp Leu Leu Glu Asn Phe Leu Glu Asp Tyr Glu Asp Phe Leu
500 505 510

CAA TAC GAA TTG CAA GAA AGG TGC GAA CTC TAC CCG CCT TTT TCT AGG 1640
Gln Tyr Glu Leu Gln Glu Arg Cys Glu Leu Tyr Pro Pro Phe Ser Arg
515 520 525 530

CTG TGT TTG TTG GAG TTT AAG CAT AAA AAC GAA GAA AAA GCC CAA CAA 1688
Leu Cys Leu Leu Glu Phe Lys His Lys Asn Glu Glu Lys Ala Gln Gln
535 540 545

TTG AGC CTA AAA GCC TCT CAA ACC CTT TCT TCG TGT TTA GAA AAG GGC 1736
Leu Ser Leu Lys Ala Ser Gln Thr Leu Ser Ser Cys Leu Glu Lys Gly
550 555 560

GTA ACG CTC TCT AAT TTC AAA GCC CCC ATT GAA AAA ATC GCT TCT TCT 1784
Val Thr Leu Ser Asn Phe Lys Ala Pro Ile Glu Lys Ile Ala Ser Ser
565 570 575

TAT CGC TAC CTT ATT TTA TTG CGT TCC AAA AAC CCT TTA AGC CTA ATC 1832
Tyr Arg Tyr Leu Ile Leu Leu Arg Ser Lys Asn Pro Leu Ser Leu Ile
580 585 590

AAA AGC GTG CAT GCG TTT TTA AAA TCC GCC CCT AGT ATC CCT TGC AGC 1880
Lys Ser Val His Ala Phe Leu Lys Ser Ala Pro Ser Ile Pro Cys Ser
595 600 605 610

GTG AAC ATG GAT CCT GTG GAT ATT TTT TAAAAAACTC ATGTTTTATA TATTATT 1934
Val Asn Met Asp Pro Val Asp Ile Phe
615

TCAAAAAACT TAAGTTTTTC TGGCGA 1960

(2) INFORMATION FOR SEQ ID NO: 696:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 619 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696: 
TAT AAC GCT AGG GAT TTA TGC TTG TAT TTA TCT CAT AAA TTC CCT ATT 824
Tyr Asn Ala Arg Asp Leu Cys Leu Tyr Leu Ser His Lys Phe Pro Ile
245 250 255

CAA GTG ATC TTA GGC TCT GCT ACG CCA AGT TTG AAT AGT TAT AAA CGC 872
Gln Val Ile Leu Gly Ser Ala Thr Pro Ser Leu Asn Ser Tyr Lys Arg
260 265 270

TTT AAA GAT AAG GCT TTA GTG CGC TTA AAG GGG CGC TAC ACC CCC ACG 920
Phe Lys Asp Lys Ala Leu Val Arg Leu Lys Gly Arg Tyr Thr Pro Thr
275 280 285 290

CAA AAA AAC ATT ATT TTT GAA AAA ACC GAG CGT TTT ATC ACG CCC AAA 968
Gln Lys Asn Ile Ile Phe Glu Lys Thr Glu Arg Phe Ile Thr Pro Lys
295 300 305

CTC CTA GAA GCG CTA CAA CAA GTC CTA GAC AAA AAC GAG CAA GCC ATT 1016
Leu Leu Glu Ala Leu Gln Gln Val Leu Asp Lys Asn Glu Gln Ala Ile
310 315 320

ATT TTT GTG CCT ACA AGG GCT AAT TTC AAA ACC TTG CTG TGC CAA AGT 1064
Ile Phe Val Pro Thr Arg Ala Asn Phe Lys Thr Leu Leu Cys Gln Ser
325 330 335

TGT TAC AAA AGC GTT CAA TGC CCC TTT TGC AGC GTG AAT ATG AGC TTG 1112
Cys Tyr Lys Ser Val Gln Cys Pro Phe Cys Ser Val Asn Met Ser Leu
340 345 350

CAT TTA AAG ACC AAC AAA CTC ATG TGC CAT TAT TGC CAT TTT TCA AGC 1160
His Leu Lys Thr Asn Lys Leu Met Cys His Tyr Cys His Phe Ser Ser
355 360 365 370

CCT ATC CCT AAA ATT TGC AGC GCG TGT CAA AGC GAA GTC TTA GTG GGT 1208
Pro Ile Pro Lys Ile Cys Ser Ala Cys Gln Ser Glu Val Leu Val Gly
375 380 385

AAA AGG ATA GGC ACT ATG CAA GTG CTA AAG GAA TTA GAG AGC CTT TTA 1256
Lys Arg Ile Gly Thr Met Gln Val Leu Lys Glu Leu Glu Ser Leu Leu
390 395 400

GAG GGG GCT AAA ATA GCG ATT TTA GAT AAA GAT CAC ACT AGC ACG CAA 1304
Glu Gly Ala Lys Ile Ala Ile Leu Asp Lys Asp His Thr Ser Thr Gln
405 410 415

AAA AAA CTC CAC AAT ATT TTA AAC GAT TTC AAC GCT CAA AAA ACG AAT 1352
Lys Lys Leu His Asn Ile Leu Asn Asp Phe Asn Ala Gln Lys Thr Asn
420 425 430

ATC TTA ATC GGC ACT CAA ATG ATA AGC AAA GGG CAT GAT TAC GCT AAA 1400
Ile Leu Ile Gly Thr Gln Met Ile Ser Lys Gly His Asp Tyr Ala Lys
435 440 445 450

GTG AGT TTG GCG GTT GTT TTA GGC ATA GAC AAT ATC ATC AAA TCT AAT 1448
Val Ser Leu Ala Val Val Leu Gly Ile Asp Asn Ile Ile Lys Ser Asn
455 460 465
TTT TCT AAA GAG CAA CAC CAA AAA GGA GCG TTA GTC AAT ATC CCT TTA 152
Phe Ser Lys Glu Gln His Gln Lys Gly Ala Leu Val Asn Ile Pro Leu
20 25 30

AGG AAT AAA ACG CTT TTA GGC GTC GTC CTT GAA GAA GTT TCA AAA CCC 200
Arg Asn Lys Thr Leu Leu Gly Val Val Leu Glu Glu Val Ser Lys Pro
35 40 45 50

TCT TTT GAA TGC CTA GAG CTA GAA AAA ACC CCT TAT TTT TTA CTC CCC 248
Ser Phe Glu Cys Leu Glu Leu Glu Lys Thr Pro Tyr Phe Leu Leu Pro
55 60 65

TTT CAA ATG GAG CTC GCT ATT TTT ATC GCT CAA TAT TAC TCA GCT AAT 296
Phe Gln Met Glu Leu Ala Ile Phe Ile Ala Gln Tyr Tyr Ser Ala Asn
70 75 80

CTT TCT TCA GTT TTA AGC CTT TTT GCC CCT TTT AAA GAA TGC GAT TTA 344
Leu Ser Ser Val Leu Ser Leu Phe Ala Pro Phe Lys Glu Cys Asp Leu
85 90 95

GTG GGG TTA GAA AAA ATT GAG CCT ATT CTT AAT ATA TTA AGC CAA ACG 392
Val Gly Leu Glu Lys Ile Glu Pro Ile Leu Asn Ile Leu Ser Gln Thr
100 105 110

CAA ACA AAC GCT TTA AAA GAA TTG CAA AAA CAT TCA GCA AGC TTG CTC 440
Gln Thr Asn Ala Leu Lys Glu Leu Gln Lys His Ser Ala Ser Leu Leu
115 120 125 130

TTT GGC GAT ACG GGT AGC GGG AAA ACC GAG ATT TAT ATG CAT GCA ATC 488
Phe Gly Asp Thr Gly Ser Gly Lys Thr Glu Ile Tyr Met His Ala Ile
135 140 145

GCC CAA ACT TTA GAG CAA AAA AAA AGC GCT TTA TTG TTG GTG CCA GAA 536
Ala Gln Thr Leu Glu Gln Lys Lys Ser Ala Leu Leu Leu Val Pro Glu
150 155 160

ATC GCT CTC ACC CCT CAA ATG CAA CAA CGC CTT AAA AGG GTT TTT AAA 584
Ile Ala Leu Thr Pro Gln Met Gln Gln Arg Leu Lys Arg Val Phe Lys
165 170 175

GAA AAT TTA GGC TTG TGG CAT AGC AAA CTC TCT CAA AAT CAA AAA AAA 632
Glu Asn Leu Gly Leu Trp His Ser Lys Leu Ser Gln Asn Gln Lys Lys
180 185 190

CAA TTT TTA GAA AAG CTT TAT TCG CAA GAA ATC AAA TTA GTG GTA GGC 680
Gln Phe Leu Glu Lys Leu Tyr Ser Gln Glu Ile Lys Leu Val Val Gly
195 200 205 210

ACA CGA AGC GCG TTG TTT TTA CCC CTT AAA GAG CTG GGT TTA ATC ATT 728
Thr Arg Ser Ala Leu Phe Leu Pro Leu Lys Glu Leu Gly Leu Ile Ile
215 220 225

GTA GAT GAA GAG CAT GAC TTT TCT TAT AAA TCC CAT CAA AGC CCT ATG 776
Val Asp Glu Glu His Asp Phe Ser Tyr Lys Ser His Gln Ser Pro Met
230 235 240
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694:

Met Lys Asp Phe Tyr Phe Met Glu Met Asn Thr Arg Leu Gln Val Glu
1 5 10 15

His Thr Ile Ser Glu Met Val Ser Gly Leu Asn Leu Ile Glu Trp Met
20 25 30

Ile Lys Ile Ala Gln Gly Glu Lys Leu Pro Lys Gln Glu Ser Phe Ser
35 40 45

Leu Lys Gly His Ala Ile Glu Cys Arg Ile Thr Ala Glu Asp Pro Lys
50 55 60

Lys Phe Tyr Pro Ser Pro Gly Lys Ile Thr Glu Trp Ile Ala Pro Gly
65 70 75 80

Gly Val Asn Val Arg Leu Asp Ser His Ala His Ala Asn Tyr Val Val
85 90 95

Pro Thr His Tyr Asp Ser Met Ile Gly Lys Leu Ile Val Trp Gly Glu
100 105 110

Asn Arg Glu Arg Ala Ile Ala Lys Met Lys Arg Ala Leu Lys Glu Phe
115 120 125

Lys Val Glu Gly Ile Lys Thr Thr Ile Pro Phe His Leu Glu Met Leu
130 135 140

Glu Asn Ala Asp Phe Arg Gln Ala Lys Ile His Thr Lys Tyr Leu Glu
145 150 155 160

Glu Asn Phe



(2) INFORMATION FOR SEQ ID NO: 695:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1960 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1907
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695:

AACGAGTATT TGCACAATGA ACTCCAAAAG CTTTTAGAAA AAATCTCATC ATG TTC 56
Met Phe
1

TAT CAC TTA ATC GCT CCT TTA AAA AAT AAA ACC CCC CCT TTA ACC TAT 104
Tyr His Leu Ile Ala Pro Leu Lys Asn Lys Thr Pro Pro Leu Thr Tyr
5 10 15
TTTTTGTTGG ATTCTAAC ATG AAA GAT TTT TAT TTC ATG GAG ATG AAC ACT 171
Met Lys Asp Phe Tyr Phe Met Glu Met Asn Thr
1 5 10

CGT TTG CAA GTG GAA CAC ACC ATT AGC GAA ATG GTG AGC GGG TTA AAC 219
Arg Leu Gln Val Glu His Thr Ile Ser Glu Met Val Ser Gly Leu Asn
15 20 25

CTC ATT GAG TGG ATG ATT AAA ATC GCT CAA GGC GAA AAA TTG CCC AAG 267
Leu Ile Glu Trp Met Ile Lys Ile Ala Gln Gly Glu Lys Leu Pro Lys
30 35 40

CAA GAA AGC TTT TCT CTC AAA GGG CAT GCG ATA GAA TGC CGA ATC ACG 315
Gln Glu Ser Phe Ser Leu Lys Gly His Ala Ile Glu Cys Arg Ile Thr
45 50 55

GCA GAA GAT CCT AAA AAA TTC TAC CCA AGC CCG GGC AAA ATT ACC GAA 363
Ala Glu Asp Pro Lys Lys Phe Tyr Pro Ser Pro Gly Lys Ile Thr Glu
60 65 70 75

TGG ATC GCT CCT GGT GGG GTG AAT GTG CGC CTT GAT TCG CAC GCG CAT 411
Trp Ile Ala Pro Gly Gly Val Asn Val Arg Leu Asp Ser His Ala His
80 85 90

GCC AAT TAT GTC GTG CCT ACG CAC TAT GAT TCG ATG ATT GGC AAG CTC 459
Ala Asn Tyr Val Val Pro Thr His Tyr Asp Ser Met Ile Gly Lys Leu
95 100 105

ATT GTG TGG GGT GAA AAC AGA GAA AGA GCG ATC GCT AAG ATG AAA AGG 507
Ile Val Trp Gly Glu Asn Arg Glu Arg Ala Ile Ala Lys Met Lys Arg
110 115 120

GCT TTA AAG GAA TTT AAA GTA GAA GGC ATT AAA ACG ACC ATT CCT TTC 555
Ala Leu Lys Glu Phe Lys Val Glu Gly Ile Lys Thr Thr Ile Pro Phe
125 130 135

CAC CTT GAA ATG CTT GAA AAT GCG GAT TTC AGG CAA GCA AAA ATC CAC 603
His Leu Glu Met Leu Glu Asn Ala Asp Phe Arg Gln Ala Lys Ile His
140 145 150 155

ACG AAG TAT TTA GAA GAA AAT TTT TAAGTTTTAA GGATTCTTTT AAGCATAGTT 657
Thr Lys Tyr Leu Glu Glu Asn Phe
160

TAAGGGTTTT AAGCGATCAG AAAAAGTCAG CA 689

(2) INFORMATION FOR SEQ ID NO: 694:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
Ile Asp Asn Ile Asn Pro Phe Ile Thr Ser Pro Glu Ile Asn Leu Cys
100 105 110

Leu Val Arg Gln Ala Tyr Glu Glu Ala Leu His Ser His Ala Tyr Ala
115 120 125

Val Met Val Glu Ser Ile Ser Ala Asn Thr Glu Glu Ile Tyr Asp Met
130 135 140

Trp Arg Asn Asp Met Gln Leu Lys Ser Lys Asn Asp Tyr Ile Ala Gln
145 150 155 160

Val Tyr Met Glu Leu Ala Lys Asn Pro Thr Glu Glu Asn Ile Leu Lys
165 170 175

Ala Leu Phe Ala Asn Gln Ile Leu Glu Gly Ile Tyr Phe Tyr Ser Gly
180 185 190

Phe Ser Tyr Phe Tyr Thr Leu Ala Arg Ser Gly Lys Met Leu Gly Ser
195 200 205

Ala Gln Met Ile Arg Phe Ile Gln Arg Asp Glu Val Thr His Leu Ile
210 215 220

Leu Phe Gln Asn Met Ile Asn Ala Leu Arg Asn Glu Arg Ala Asp Leu
225 230 235 240

Phe Thr Pro Gln Leu Ile Asn Glu Val Ile Gly Met Phe Lys Lys Ala
245 250 255

Val Glu Ile Glu Ala Leu Trp Gly Asp Tyr Ile Thr Gln Gly Lys Ile
260 265 270

Leu Gly Leu Thr Ser Ser Leu Ile Glu Gln Tyr Ile Gln Phe Leu Ala
275 280 285

Asp Ser Arg Leu Ser Lys Val Gly Ile Ala Lys Val Tyr Gly Val Gln
290 295 300

His Pro Ile Lys Trp Val Glu Ser Phe Ser Ser Phe Asn Glu Gln Arg
305 310 315 320

Ser Asn Phe Phe Glu Ala Arg Val Ser Asn Tyr Ala Lys Gly Ser Val
325 330 335

Ser Phe Asp Asp Phe
340

(2) INFORMATION FOR SEQ ID NO: 693:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 689 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 139...627
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693:

CAAAAGCTCA TTGAAGAAAC CCCGGCAGTG GTTTTAGAAG AGGGCGTGCG TGACGTTTGC 60

TAGAAACAGC GATCAAGGCC GCTAAATATA TCGGCTRRTG TGGGGGCGGG GACTTTTGAA 120
AAC GCT TTA AGG AAT GAA AGA GCG GAT CTC TTC ACG CCG CAA TTG ATT 834
Asn Ala Leu Arg Asn Glu Arg Ala Asp Leu Phe Thr Pro Gln Leu Ile
235 240 245

AAT GAA GTC ATA GGA ATG TTT AAA AAA GCG GTA GAA ATT GAA GCT TTG 882
Asn Glu Val Ile Gly Met Phe Lys Lys Ala Val Glu Ile Glu Ala Leu
250 255 260

TGG GGG GAT TAT ATC ACG CAA GGC AAG ATT TTA GGG CTC ACT TCA AGC 930
Trp Gly Asp Tyr Ile Thr Gln Gly Lys Ile Leu Gly Leu Thr Ser Ser
265 270 275

TTG ATT GAG CAA TAC ATC CAG TTT TTA GCG GAT AGC CGT TTG AGT AAG 978
Leu Ile Glu Gln Tyr Ile Gln Phe Leu Ala Asp Ser Arg Leu Ser Lys
280 285 290

GTG GGC ATC GCT AAA GTT TAT GGC GTC CAA CAC CCC ATT AAA TGG GTA 1026
Val Gly Ile Ala Lys Val Tyr Gly Val Gln His Pro Ile Lys Trp Val
295 300 305 310

GAG AGC TTT TCA AGT TTC AAT GAG CAG CGC TCT AAT TTC TTT GAG GCT 1074
Glu Ser Phe Ser Ser Phe Asn Glu Gln Arg Ser Asn Phe Phe Glu Ala
315 320 325

AGG GTG AGC AAT TAC GCT AAA GGG AGC GTG AGT TTT GAT GAT TTT TAAGG 1124
Arg Val Ser Asn Tyr Ala Lys Gly Ser Val Ser Phe Asp Asp Phe
330 335 340

GGCTTGTTTG AATAGTATTA AAAACCATTT AATGTGTGAA GAAATC 1170

(2) INFORMATION FOR SEQ ID NO: 692:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 341 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692:

Met Glu Val Ser Arg Lys Lys Ile Tyr Asn Pro Asn Ser Thr Glu Ser
1 5 10 15

Val Asn Glu Arg Lys Ile Phe Gly Gly Asn Pro Thr Ser Met Phe Asp
20 25 30

Leu Asn Lys Ile Lys Tyr Gln Trp Ala Asp His Leu Trp Lys Thr Met
35 40 45

Leu Ala Asn Thr Trp Phe Ala Glu Glu Val Ser Met Asn Asp Asp Lys
50 55 60

Arg Asp Tyr Leu Lys Leu Ser Ala Glu Glu Lys Ile Gly Tyr Asp Arg
65 70 75 80

Ala Leu Ala Gln Leu Ile Phe Met Asp Ser Leu Gln Ala Asn Asn Leu
85 90 95
Lys Ile Tyr Asn Pro Asn Ser Thr Glu Ser Val Asn Glu Arg Lys Ile
10 15 20

TTT GGG GGC AAT CCT ACA AGC ATG TTT GAT TTG AAT AAG ATC AAG TAT 210
Phe Gly Gly Asn Pro Thr Ser Met Phe Asp Leu Asn Lys Ile Lys Tyr
25 30 35

CAA TGG GCG GAT CAT TTG TGG AAA ACG ATG CTC GCT AAC ACC TGG TTT 258
Gln Trp Ala Asp His Leu Trp Lys Thr Met Leu Ala Asn Thr Trp Phe
40 45 50

GCT GAA GAA GTG AGC ATG AAT GAT GAC AAA AGG GAT TAT TTG AAA TTA 306
Ala Glu Glu Val Ser Met Asn Asp Asp Lys Arg Asp Tyr Leu Lys Leu
55 60 65 70

AGC GCA GAG GAA AAG ATC GGT TAT GAC AGA GCT TTA GCG CAA CTC ATT 354
Ser Ala Glu Glu Lys Ile Gly Tyr Asp Arg Ala Leu Ala Gln Leu Ile
75 80 85

TTT ATG GAC AGC TTG CAA GCG AAT AAT TTA ATT GAC AAT ATC AAT CCC 402
Phe Met Asp Ser Leu Gln Ala Asn Asn Leu Ile Asp Asn Ile Asn Pro
90 95 100

TTC ATC ACC AGC CCC GAA ATC AAT TTG TGT TTG GTG CGT CAA GCT TAT 450
Phe Ile Thr Ser Pro Glu Ile Asn Leu Cys Leu Val Arg Gln Ala Tyr
105 110 115

GAA GAA GCC CTA CAC AGC CAT GCG TAT GCG GTG ATG GTA GAA AGC ATA 498
Glu Glu Ala Leu His Ser His Ala Tyr Ala Val Met Val Glu Ser Ile
120 125 130

AGT GCG AAT ACT GAA GAG ATT TAT GAC ATG TGG CGT AAC GAT ATG CAA 546
Ser Ala Asn Thr Glu Glu Ile Tyr Asp Met Trp Arg Asn Asp Met Gln
135 140 145 150

TTA AAA AGC AAG AAC GAC TAT ATC GCG CAA GTG TAT ATG GAA TTA GCC 594
Leu Lys Ser Lys Asn Asp Tyr Ile Ala Gln Val Tyr Met Glu Leu Ala
155 160 165

AAA AAC CCC ACA GAA GAA AAC ATT CTC AAA GCG CTT TTT GCT AAC CAG 642
Lys Asn Pro Thr Glu Glu Asn Ile Leu Lys Ala Leu Phe Ala Asn Gln
170 175 180

ATT TTA GAG GGG ATT TAT TTT TAT AGC GGG TTT AGC TAT TTT TAC ACT 690
Ile Leu Glu Gly Ile Tyr Phe Tyr Ser Gly Phe Ser Tyr Phe Tyr Thr
185 190 195

TTG GCT AGG AGC GGT AAA ATG CTA GGA TCG GCA CAA ATG ATT CGT TTT 738
Leu Ala Arg Ser Gly Lys Met Leu Gly Ser Ala Gln Met Ile Arg Phe
200 205 210

ATC CAA AGA GAT GAG GTA ACG CAT TTG ATT TTG TTC CAA AAC ATG ATC 786
Ile Gln Arg Asp Glu Val Thr His Leu Ile Leu Phe Gln Asn Met Ile
215 220 225 230
Thr Ala Val Val Tyr Gly Glu Ser Ser Ser Ser Leu Asn Glu Glu Ser
130 135 140

Pro Leu Asn Pro Ile Asn Pro Tyr Gly Ala Ser Lys Met Met Ser Glu
145 150 155 160

Arg Ile Leu Leu Asp Thr Ser Lys Ile Ala Asp Phe Lys Cys Val Ile
165 170 175

Leu Arg Tyr Phe Asn Val Ala Gly Ala Cys Met His Asn Asp Tyr Thr
180 185 190

Thr Pro Tyr Thr Leu Gly Gln Arg Thr Leu Asn Ala Thr His Leu Ile
195 200 205

Lys Ile Ala Cys Glu Cys Ala Val Gly Lys Arg Lys Lys Met Gly Ile
210 215 220

Phe Gly Thr Asn Tyr Pro Thr Arg Asp Gly Thr Cys Ile Arg Asp Tyr
225 230 235 240

Ile His Val Asp Asp Leu Ala Asn Ala His Leu Ala Ser Tyr Gln Thr
245 250 255

Leu Leu Glu Lys Asn Lys Ser Glu Ile Tyr Asn Val Gly Tyr Asn Gln
260 265 270

Gly His Ser Val Lys Glu Val Ile Glu Lys Val Lys Glu Ile Ser Asn
275 280 285

Asn Asp Phe Leu Val Glu Ile Leu Asp Lys Arg Gln Gly Asp Pro Ala
290 295 300

Ser Leu Ile Ala Asn Asn Ala Lys Ile Leu Gln Asn Thr Ser Phe Lys
305 310 315 320

Pro Leu Tyr Asn Asn Leu Asp Thr Ile Ile Lys Ser Ala Leu Asp Trp
325 330 335

Glu Glu His Leu Leu Arg Phe Gln
340

(2) INFORMATION FOR SEQ ID NO: 691:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1170 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 97...1119
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691:

GGAGTTTTTT CTGGCTTTTA GGGGGTTTTA AATTCTTTTA AGGTATTCTA ACAAGACTAT 60

ATCATTGAGA TAGTTTTAAG GAAATTAAGG AACAAA ATG GAA GTT TCA CGC AAG 114
Met Glu Val Ser Arg Lys
1 5

AAA ATT TAC AAC CCC AAT TCT ACA GAA AGT GTG AAT GAA AGA AAG ATT 162
GAA AAA AAT AAG AGC GAG ATC TAT AAT GTC GGC TAC AAT CAA GGC CAT 872
Glu Lys Asn Lys Ser Glu Ile Tyr Asn Val Gly Tyr Asn Gln Gly His
260 265 270

AGC GTG AAA GAA GTG ATA GAA AAG GTC AAA GAA ATC TCA AAC AAC GAT 920
Ser Val Lys Glu Val Ile Glu Lys Val Lys Glu Ile Ser Asn Asn Asp
275 280 285 290

TTT TTA GTG GAA ATT TTA GAC AAA CGA CAG GGC GAT CCA GCA AGC CTT 968
Phe Leu Val Glu Ile Leu Asp Lys Arg Gln Gly Asp Pro Ala Ser Leu
295 300 305

ATT GCC AAT AAC GCT AAA ATC TTA CAA AAC ACC TCT TTC AAA CCC CTT 1016
Ile Ala Asn Asn Ala Lys Ile Leu Gln Asn Thr Ser Phe Lys Pro Leu
310 315 320

TAT AAC AAC CTA GAC ACC ATT ATC AAA AGC GCT CTA GAT TGG GAA GAA 1064
Tyr Asn Asn Leu Asp Thr Ile Ile Lys Ser Ala Leu Asp Trp Glu Glu
325 330 335

CAC CTT TTG AGG TTT CAA TAATACACCC TGTGCAAATA CAAGCCATTA GCCATTAT 1120
His Leu Leu Arg Phe Gln
340

GGGCGTTCTT ATAGT 1135

(2) INFORMATION FOR SEQ ID NO: 690:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 344 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690:

Met Ala Leu Leu Phe Thr Gly Ala Cys Gly Tyr Ile Gly Ser His Thr
1 5 10 15

Ala Arg Ala Phe Leu Glu Lys Thr Lys Glu Asn Ile Ile Ile Val Asp
20 25 30

Asp Leu Ser Thr Gly Phe Leu Glu His Leu Lys Ala Leu Glu His Tyr
35 40 45

Tyr Pro Asn Arg Val Val Phe Ile Gln Ala Asn Leu Asn Glu Thr His
50 55 60

Lys Leu Asp Ala Phe Leu Asn Lys Gln Gln Leu Lys Asp Pro Ile Glu
65 70 75 80

Ala Ile Leu His Phe Gly Ala Lys Ile Ser Val Glu Glu Ser Thr His
85 90 95

Leu Pro Leu Glu Tyr Tyr Thr Asn Asn Thr Leu Asn Thr Leu Glu Leu
100 105 110

Val Lys Leu Cys Leu Lys His Ala Ile Lys Arg Phe Ile Phe Ser Ser
115 120 125






Ser Thr Gly Phe Leu Glu His Leu Lys Ala Leu Glu His Tyr Tyr Pro
35 40 45 50

AAT AGG GTT GTG TTT ATT CAA GCG AAT TTG AAT GAA ACG CAC AAA TTA 248
Asn Arg Val Val Phe Ile Gln Ala Asn Leu Asn Glu Thr His Lys Leu
55 60 65

GAC GCC TTT TTG AAT AAG CAG CAG CTA AAA GAT CCC ATT GAA GCC ATC 296
Asp Ala Phe Leu Asn Lys Gln Gln Leu Lys Asp Pro Ile Glu Ala Ile
70 75 80

TTG CAC TTT GGG GCT AAA ATC TCA GTA GAA GAA TCC ACG CAC TTG CCT 344
Leu His Phe Gly Ala Lys Ile Ser Val Glu Glu Ser Thr His Leu Pro
85 90 95

TTA GAA TAC TAC ACC AAC AAC ACG CTC AAC ACT TTA GAG CTT GTC AAA 392
Leu Glu Tyr Tyr Thr Asn Asn Thr Leu Asn Thr Leu Glu Leu Val Lys
100 105 110

CTT TGC TTA AAA CAT GCA ATC AAG CGT TTT ATT TTT TCT TCT ACG GCC 440
Leu Cys Leu Lys His Ala Ile Lys Arg Phe Ile Phe Ser Ser Thr Ala
115 120 125 130

GTG GTT TAT GGC GAA TCT AGT TCA AGT TTG AAT GAA GAA AGC CCC TTA 488
Val Val Tyr Gly Glu Ser Ser Ser Ser Leu Asn Glu Glu Ser Pro Leu
135 140 145

AAC CCC ATT AAT CCT TAT GGA GCG TCT AAA ATG ATG AGC GAA AGA ATC 536
Asn Pro Ile Asn Pro Tyr Gly Ala Ser Lys Met Met Ser Glu Arg Ile
150 155 160

TTG TTA GAC ACT TCT AAA ATA GCG GAT TTT AAA TGC GTT ATT TTG CGC 584
Leu Leu Asp Thr Ser Lys Ile Ala Asp Phe Lys Cys Val Ile Leu Arg
165 170 175

TAT TTC AAT GTG GCT GGG GCA TGC ATG CAC AAT GAT TAT ACC ACC CCT 632
Tyr Phe Asn Val Ala Gly Ala Cys Met His Asn Asp Tyr Thr Thr Pro
180 185 190

TAC ACG CTA GGG CAA CGC ACG CTC AAC GCC ACG CAT TTG ATC AAA ATC 680
Tyr Thr Leu Gly Gln Arg Thr Leu Asn Ala Thr His Leu Ile Lys Ile
195 200 205 210

GCA TGC GAA TGC GCG GTG GGG AAA AGG AAA AAA ATG GGG ATT TTT GGC 728
Ala Cys Glu Cys Ala Val Gly Lys Arg Lys Lys Met Gly Ile Phe Gly
215 220 225

ACT AAC TAC CCC ACA AGA GAT GGC ACT TGC ATT AGG GAT TAT ATC CAT 776
Thr Asn Tyr Pro Thr Arg Asp Gly Thr Cys Ile Arg Asp Tyr Ile His
230 235 240

GTA GAT GAT TTG GCT AAC GCA CAT TTA GCG AGC TAT CAA ACC CTT TTA 824
Val Asp Asp Leu Ala Asn Ala His Leu Ala Ser Tyr Gln Thr Leu Leu
245 250 255
130 135 140

Ala Leu Lys Glu Gly Val Phe Glu Pro Ser Gly Phe Val Leu Gly Gln
145 150 155 160

Ser Glu Leu Leu Lys Lys Glu Gly Glu Ile Leu Leu Ile Gly Tyr Gly
165 170 175

Asn Gly Val Gly Arg Ala His Leu Val Gln Leu Ala Leu Lys Glu Lys
180 185 190

Asn Ile Glu Cys Ala Leu Leu Asp Leu Arg Phe Leu Lys Pro Leu Asp
195 200 205

Pro Asn Leu Ser Ala Ile Val Ala Pro Tyr Gln Lys Leu Tyr Val Phe
210 215 220

Ser Asp Asn Tyr Lys Leu Gly Gly Val Ala Ser Ala Ile Leu Glu Phe
225 230 235 240

Leu Ser Glu Gln Asn Ile Leu Lys Pro Val Lys Ser Phe Glu Ile Ile
245 250 255

Asp Glu Phe Ile Met His Gly Asn Thr Ala Leu Val Glu Lys Ser Leu
260 265 270

Gly Leu Asp Thr Glu Ser Leu Thr Asp Ala Ile Leu Lys Asp Leu Gly
275 280 285

Gln Glu Arg
290



(2) INFORMATION FOR SEQ ID NO: 689:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1135 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1082
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689:

GGATTAAGAT GTTATAATAG TTGTTATTTT TTCATTTTAA AAGGGGTTTT ATG GCA 56
Met Ala
1

TTA TTA TTC ACA GGA GCG TGC GGG TAT ATA GGC TCG CAT ACC GCA AGG 104
Leu Leu Phe Thr Gly Ala Cys Gly Tyr Ile Gly Ser His Thr Ala Arg
5 10 15

GCG TTT TTA GAA AAA ACC AAA GAA AAT ATC ATT ATT GTA GAT GAC TTA 152
Ala Phe Leu Glu Lys Thr Lys Glu Asn Ile Ile Ile Val Asp Asp Leu
20 25 30

AGC ACC GGT TTT TTA GAG CAC CTC AAA GCG TTA GAG CAT TAT TAC CCT 200
TTA AGC GCG ATC GTT GCC CCT TAT CAA AAG CTC TAT GTT TTT AGC GAT 728
Leu Ser Ala Ile Val Ala Pro Tyr Gln Lys Leu Tyr Val Phe Ser Asp
215 220 225

AAT TAC AAG CTT GGA GGG GTG GCT AGC GCG ATT TTA GAG TTT TTG AGC 776
Asn Tyr Lys Leu Gly Gly Val Ala Ser Ala Ile Leu Glu Phe Leu Ser
230 235 240

GAA CAA AAT ATT TTA AAG CCT GTT AAA AGC TTT GAA ATC ATT GAT GAA 824
Glu Gln Asn Ile Leu Lys Pro Val Lys Ser Phe Glu Ile Ile Asp Glu
245 250 255

TTT ATC ATG CAT GGG AAC ACC GCT TTA GTG GAA AAA TCC TTA GGA TTA 872
Phe Ile Met His Gly Asn Thr Ala Leu Val Glu Lys Ser Leu Gly Leu
260 265 270

GAC ACA GAG AGT TTG ACT GAC GCT ATT TTA AAA GAT TTA GGA CAA GAG 920
Asp Thr Glu Ser Leu Thr Asp Ala Ile Leu Lys Asp Leu Gly Gln Glu
275 280 285 290

AGA TGAAAACAAA AGCGCCAATG AAAAATATCC GCAATTTTTC CATTATCGCT CAC 976
Arg

(2) INFORMATION FOR SEQ ID NO: 688:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 291 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688:

Lys Asn Arg Arg Arg Asn Arg Gly Asp Ala Ser Gly Thr Gly Leu Asp
1 5 10 15

Lys Leu Ile Asp Ala Xaa Pro Leu Arg Phe Phe Asp Val Ala Ile Ala
20 25 30

Glu Gln His Ala Leu Thr Ser Ser Ser Ala Met Ala Lys Glu Gly Phe
35 40 45

Lys Pro Phe Val Ser Ile Tyr Ser Thr Phe Leu Gln Arg Ala Tyr Asp
50 55 60

Ser Ile Val His Asp Ala Cys Ile Ser Ser Leu Pro Ile Lys Leu Ala
65 70 75 80

Ile Asp Arg Ala Gly Ile Val Gly Glu Asp Gly Glu Thr His Gln Gly
85 90 95

Leu Leu Asp Val Ser Tyr Leu Arg Ser Ile Pro Asn Met Val Ile Phe
100 105 110

Ala Pro Arg Asp Asn Glu Thr Leu Lys Asn Ala Val Arg Phe Ala Asn
115 120 125

Glu His Asp Ser Ser Pro Cys Ala Phe Arg Tyr Pro Arg Gly Ser Phe
Lys Asn
1

CGT AGG CGT AAC CGC GGC GAT GCT AGC GGC ACA GGA TTA GAC AAA CTC 104
Arg Arg Arg Asn Arg Gly Asp Ala Ser Gly Thr Gly Leu Asp Lys Leu
5 10 15

ATT GAC GCT TAN CCT TTG CGC TTT TTT GAT GTC GCT ATC GCT GAG CAA 152
Ile Asp Ala Xaa Pro Leu Arg Phe Phe Asp Val Ala Ile Ala Glu Gln
20 25 30

CAC GCT TTA ACT TCT AGC AGC GCT ATG GCT AAA GAG GGG TTT AAA CCT 200
His Ala Leu Thr Ser Ser Ser Ala Met Ala Lys Glu Gly Phe Lys Pro
35 40 45 50

TTT GTG AGC ATC TAT TCT ACT TTT TTG CAG AGG GCT TAT GAT TCT ATT 248
Phe Val Ser Ile Tyr Ser Thr Phe Leu Gln Arg Ala Tyr Asp Ser Ile
55 60 65

GTG CAT GAC GCT TGT ATT TCT AGC TTG CCG ATT AAA TTA GCC ATT GAC 296
Val His Asp Ala Cys Ile Ser Ser Leu Pro Ile Lys Leu Ala Ile Asp
70 75 80

AGG GCT GGG ATT GTG GGC GAA GAT GGC GAG ACG CAC CAA GGG CTT TTA 344
Arg Ala Gly Ile Val Gly Glu Asp Gly Glu Thr His Gln Gly Leu Leu
85 90 95

GAC GTG TCG TAT TTG CGC TCT ATC CCT AAC ATG GTC ATT TTT GCC CCA 392
Asp Val Ser Tyr Leu Arg Ser Ile Pro Asn Met Val Ile Phe Ala Pro
100 105 110

CGA GAC AAT GAG ACT TTA AAA AAC GCC GTG CGT TTT GCC AAT GAA CAC 440
Arg Asp Asn Glu Thr Leu Lys Asn Ala Val Arg Phe Ala Asn Glu His
115 120 125 130

GAT TCA AGC CCT TGC GCG TTC CGA TAC CCT AGG GGG TCG TTT GCG TTA 488
Asp Ser Ser Pro Cys Ala Phe Arg Tyr Pro Arg Gly Ser Phe Ala Leu
135 140 145

AAA GAG GGG GTT TTT GAG CCT AGC GGT TTT GTT TTA GGC CAA AGC GAA 536
Lys Glu Gly Val Phe Glu Pro Ser Gly Phe Val Leu Gly Gln Ser Glu
150 155 160

TTG TTG AAA AAA GAG GGC GAA ATT TTA CTC ATA GGC TAT GGT AAT GGC 584
Leu Leu Lys Lys Glu Gly Glu Ile Leu Leu Ile Gly Tyr Gly Asn Gly
165 170 175

GTG GGG CGG GCG CAT TTA GTC CAA CTG GCT TTA AAA GAA AAA AAC ATA 632
Val Gly Arg Ala His Leu Val Gln Leu Ala Leu Lys Glu Lys Asn Ile
180 185 190

GAA TGC GCT CTC TTG GAT CTC AGG TTT TTA AAG CCT TTA GAT CCA AAT 680
Glu Cys Ala Leu Leu Asp Leu Arg Phe Leu Lys Pro Leu Asp Pro Asn
195 200 205 210
GGACTCTTTT CTTCATTGGC TTGAGTTGC 522

(2) INFORMATION FOR SEQ ID NO: 686:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 140 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686:

Met Ser Tyr Phe Phe Lys Ile Ile Leu Gly Thr Ser Val Ile Val Gly
1 5 10 15

Val Leu Leu Gly Leu Trp Arg Leu Thr Tyr Asp Lys Phe Tyr Phe Ser
20 25 30

Leu Val Phe Val Leu Leu Ile Leu Gly Ile Val Ala Cys Ser Tyr Ile
35 40 45

Ser Leu Lys Met His Gln Arg Lys Cys Phe Ala Lys Cys Phe Val Asn
50 55 60

Ser Glu Ser Phe Leu Ser Lys Met Leu His Ser Pro Ile Met Val Ile
65 70 75 80

Cys Phe Tyr Phe Ile Phe Ser Ile Phe Thr Ser Ile Ser Ile Val Tyr
85 90 95

Ser Val Leu Asp Tyr Asp Gln Met Met Trp Gly Phe Val Phe Cys Thr
100 105 110

Ile Val Val Cys Ala Val Val Phe Gly Thr Leu Glu Lys Asn Ala Gln
115 120 125

Glu Tyr His Gln Arg Arg Leu Phe Asp Ala Asp Val
130 135 140

(2) INFORMATION FOR SEQ ID NO: 687:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 976 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...923
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687:
CCCACTGAAG CGTATTCTAA CACCCTTTTA G^lATTAGCTA AAAAAGATGA AAA AAT 56
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 470 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:685: 

AAAGACAACC CTAACAAACG CTTCAAAAAC AACAAGAGGG ATAAAAAATA ATG TCT 56
Met Ser
1

TAT TTT TTT AAA ATC ATT CTG GGC ACA AGC GTG ATC GTG GGG GTG TTG 104
Tyr Phe Phe Lys Ile Ile Leu Gly Thr Ser Val Ile Val Gly Val Leu
5 10 15

TTG GGC TTG TGG CGT TTG ACT TAC GAT AAG TTC TAT TTC TCG TTG GTC 152
Leu Gly Leu Trp Arg Leu Thr Tyr Asp Lys Phe Tyr Phe Ser Leu Val
20 25 30

TTT GTG TTG CTG ATA CTA GGG ATT GTC GCT TGT AGC TAT ATT TCT TTA 200
Phe Val Leu Leu Ile Leu Gly Ile Val Ala Cys Ser Tyr Ile Ser Leu
35 40 45 50

AAA ATG CAT CAG AGG AAA TGC TTC GCC AAG TGT TTC GTG AAT AGT GAA 248
Lys Met His Gln Arg Lys Cys Phe Ala Lys Cys Phe Val Asn Ser Glu
55 60 65

TCT TTT TTA TCC AAG ATG TTA CAC TCC CCA ATA ATG GTA ATT TGC TTT 296
Ser Phe Leu Ser Lys Met Leu His Ser Pro Ile Met Val Ile Cys Phe
70 75 80

TAT TTT ATT TTT TCA ATT TTC ACA TCC ATA TCC ATC GTC TAT AGC GTG 344
Tyr Phe Ile Phe Ser Ile Phe Thr Ser Ile Ser Ile Val Tyr Ser Val
85 90 95

CTG GAC TAT GAT CAG ATG ATG TGG GGG TTT GTT TTT TGC ACT ATC GTT 392
Leu Asp Tyr Asp Gln Met Met Trp Gly Phe Val Phe Cys Thr Ile Val
100 105 110

GTT TGC GCT GTG GTG TTT GGC ACG CTT GAA AAA AAT GCT CAA GAG TAC 440
Val Cys Ala Val Val Phe Gly Thr Leu Glu Lys Asn Ala Gln Glu Tyr
115 120 125 130

CAT CAA AGA AGA TTA TTT GAT GCT GAT GTC TAGAGAAGTG AGTGCTTTTG TGG 493
His Gln Arg Arg Leu Phe Asp Ala Asp Val
135 140
CAC CAG CAT GAG AAT AAA AAT CAG AAT ATT GAC TAT GAG CAA CCA AAC 154
His Gln His Glu Asn Lys Asn Gln Asn Ile Asp Tyr Glu Gln Pro Asn
20 25 30 35

GAT AGA AGC AAA TTC CAC GCT CAC CCT TTC AAG AGC GTT TTA ACA ACC 202
Asp Arg Ser Lys Phe His Ala His Pro Phe Lys Ser Val Leu Thr Thr
40 45 50

CAA ACG CTA CCA CTT GGT TTT TTA GAG AGA GAA AGA GAG AGA AAG CAA 250
Gln Thr Leu Pro Leu Gly Phe Leu Glu Arg Glu Arg Glu Arg Lys Gln
55 60 65

AAT TTT AAG ATT GAT TCT CAA ATC TAT TCC TTT GCA AAA GTT AAG ATT 298
Asn Phe Lys Ile Asp Ser Gln Ile Tyr Ser Phe Ala Lys Val Lys Ile
70 75 80

GGG TGT TTT AAC ATG ATT TTT GGC CTG CTC GCA TCA AGC CCT TAT TTT T 347
Gly Cys Phe Asn Met Ile Phe Gly Leu Leu Ala Ser Ser Pro Tyr Phe
85 90 95

AACATTTCCG CTCCCTTGCT TTTTTAAAGC CTCCCTAAAT TACTACACCA CT 399

(2) INFORMATION FOR SEQ ID NO: 684: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:684: 

Asn Pro Gln Ser Phe Phe Arg Asn Thr Pro His Phe Leu Ile Ser Arg
1 5 10 15

Ile Phe Ile His Gln His Glu Asn Lys Asn Gln Asn Ile Asp Tyr Glu
20 25 30

Gln Pro Asn Asp Arg Ser Lys Phe His Ala His Pro Phe Lys Ser Val
35 40 45

Leu Thr Thr Gln Thr Leu Pro Leu Gly Phe Leu Glu Arg Glu Arg Glu
50 55 60

Arg Lys Gln Asn Phe Lys Ile Asp Ser Gln Ile Tyr Ser Phe Ala Lys
65 70 75 80

Val Lys Ile Gly Cys Phe Asn Met Ile Phe Gly Leu Leu Ala Ser Ser
85 90 95

Pro Tyr Phe



(2) INFORMATION FOR SEQ ID NO: 685:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682: 

Val Asp Ser Glu Gly Phe Ser Pro Ser Ile Tyr Thr Asp Lys Thr Gly
1 5 10 15

His Pro Thr Ile Gly Tyr Gly Tyr Asn Leu Ser Val Tyr Ser Tyr Glu
20 25 30

Gly Lys Arg Ile Thr Lys Thr Tyr Gly Leu Leu Thr Asp Ile Leu Ser
35 40 45

Tyr Gly Trp Tyr Lys Asn Leu Asp Ala Met Arg Arg Met Val Ile Leu
50 55 60

Asp Leu Ser Tyr Asn Leu Gly Leu Asn Gly Leu Leu Lys Phe Lys Gln
65 70 75 80

Phe Ile Lys Ala Ile Glu Asp Lys Asn Tyr Ala Leu Ala Val Glu Arg
85 90 95

Leu Gln Lys Ser Pro Tyr Phe Asn Gln Val Lys Lys Glu Arg Gln Gly
100 105 110

Ile Trp Lys Phe
115

(2) INFORMATION FOR SEQ ID NO: 683:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 50... 346 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:683: 

ACACAACCAT AGCGACCAAA AACCCGACAG ACCCGCCAAG GAGCGATAA AAC CCA CAA 58 

Asn Pro Gln
1

AGC TTT TTC AGG AAT ACG CCA CAT TTT TTG ATC AGC CGA ATT TTT ATC 106
Ser Phe Phe Arg Asn Thr Pro His Phe Leu Ile Ser Arg Ile Phe Ile
5 10 15
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 398 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681: 

GTGGAACGCT CTGTCTTAGC AAATTGATCT TAGCGGCGTC GTTTTTGATA GTG GAT 56
Val Asp
1

TCA GAG GGG TTT TCG CCT TCT ATT TAT ACC GAC AAG ACA GGG CAT CCC 104
Ser Glu Gly Phe Ser Pro Ser Ile Tyr Thr Asp Lys Thr Gly His Pro
5 10 15

ACG ATT GGC TAT GGC TAT AAT TTG AGC GTT TAT TCT TAT GAG GGT AAG 152
Thr Ile Gly Tyr Gly Tyr Asn Leu Ser Val Tyr Ser Tyr Glu Gly Lys
20 25 30

CGT ATC ACC AAA ACA TAT GGG CTT TTA ACT GAC ATA CTC TCT TAT GGG 200
Arg Ile Thr Lys Thr Tyr Gly Leu Leu Thr Asp Ile Leu Ser Tyr Gly
35 40 45 50

TGG TAT AAA AAT TTG GAC GCA ATG AGG AGA ATG GTC ATC TTG GAT TTG 248
Trp Tyr Lys Asn Leu Asp Ala Met Arg Arg Met Val Ile Leu Asp Leu
55 60 65

AGC TAC AAT TTA GGC TTG AAC GGA CTG CTC AAA TTC AAG CAA TTC ATC 296
Ser Tyr Asn Leu Gly Leu Asn Gly Leu Leu Lys Phe Lys Gln Phe Ile
70 75 80

AAG GCC ATA GAG GAT AAA AAT TAT GCT TTG GCT GTG GAG AGA CTG CAA 344
Lys Ala Ile Glu Asp Lys Asn Tyr Ala Leu Ala Val Glu Arg Leu Gln
85 90 95

AAA AGC CCG TAT TTC AAT CAA GTG AAA AAA GAG CGT CAA GGA ATA TGG 392
Lys Ser Pro Tyr Phe Asn Gln Val Lys Lys Glu Arg Gln Gly Ile Trp
100 105 110

AAA TTT TGAAATTGGA GGGTTGCGAA AAACATTGTA AGAAAAAATA CGCAATAGAA AA 450
Lys Phe
115



451 



(2) INFORMATION FOR SEQ ID NO: 682: 
TAC TAC TCT AAA GCT TGT GGA TTA AAT GAT CAA GAT GGG TGT TTA ATA 344
Tyr Tyr Ser Lys Ala Cys Gly Leu Asn Asp Gln Asp Gly Cys Leu Ile
85 90 95

CTA GGA TAT AAG CAA TAT GCT GGG AAG GGG GTA GTC AAA AAT GAA AAA 392
Leu Gly Tyr Lys Gln Tyr Ala Gly Lys Gly Val Val Lys Asn Glu Lys
100 105 110

CAA GCG GTG AAA ACC TTT GAA AAG GCT TGT AGG TTA GGA TCT GAA GAC 440
Gln Ala Val Lys Thr Phe Glu Lys Ala Cys Arg Leu Gly Ser Glu Asp
115 120 125 130

GCA TGT GGT ATT TTA AAC AAC TAC TAGATTTGAA ATAAATGCTG TTTTTTAGCT 494
Ala Cys Gly Ile Leu Asn Asn Tyr
135

GGCTTTCATG TTTTTGTAAC CCC 517

(2) INFORMATION FOR SEQ ID NO: 680:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:680: 

Met Val Gly Gly Gly Thr Val Lys Lys Asp Leu Lys Lys Ala Ile Gln
1 5 10 15

Tyr Tyr Val Lys Ala Cys Glu Leu Asn Glu Met Phe Gly Cys Leu Ser
20 25 30

Leu Val Ser Asn Ser Gln Ile Asn Lys Gln Lys Leu Phe Gln Tyr Leu
35 40 45

Ser Lys Ala Cys Glu Leu Asn Ser Gly Asn Gly Cys Arg Phe Leu Gly
50 55 60

Asp Phe Tyr Glu Asn Gly Lys Tyr Val Lys Lys Asp Leu Arg Lys Ala
65 70 75 80

Ala Gln Tyr Tyr Ser Lys Ala Cys Gly Leu Asn Asp Gln Asp Gly Cys
85 90 95

Leu Ile Leu Gly Tyr Lys Gln Tyr Ala Gly Lys Gly Val Val Lys Asn
100 105 110

Glu Lys Gln Ala Val Lys Thr Phe Glu Lys Ala Cys Arg Leu Gly Ser
115 120 125

Glu Asp Ala Cys Gly Ile Leu Asn Asn Tyr
130 135

(2) INFORMATION FOR SEQ ID NO: 681: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 451 base pairs 
(B) TYPE: nucleic acid
Ile Thr Asn Ser Ser Ala Val Asp Asn Thr Ser Phe Gln Phe Leu Phe
145 150 155 160

Asn Phe Gly Leu Lys Ala Leu Phe Val Asp Glu His Glu Phe Glu Ile
165 170 175

Gly Phe Lys Phe Pro Thr Ile Asn Asn Lys Tyr Tyr Thr Thr Asp Ala
180 185 190

Leu Lys Val Gln Met Arg Arg Val Phe Ala Phe Tyr Val Gly Tyr Asn
195 200 205

Tyr His Phe
210


(2) INFORMATION FOR SEQ ID NO: 679: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 464 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679:



AGATTTCATT CGAGGTAGAA AATACATTGA AAAAGCGTGT GAATTAAACG ATG GTA 56
Met Val
1

GGG GGT GGA ACG GTA AAA AAA GAC TTG AAG AAA GCC ATT CAA TAC TAT 104
Gly Gly Gly Thr Val Lys Lys Asp Leu Lys Lys Ala Ile Gln Tyr Tyr
5 10 15

GTT AAA GCG TGT GAA TTG AAT GAA ATG TTT GGG TGT CTG TCA TTA GTT 152
Val Lys Ala Cys Glu Leu Asn Glu Met Phe Gly Cys Leu Ser Leu Val
20 25 30

TCG AAC TCT CAA ATA AAC AAA CAA AAA CTC TTT CAA TAT CTC TCT AAA 200
Ser Asn Ser Gln Ile Asn Lys Gln Lys Leu Phe Gln Tyr Leu Ser Lys
35 40 45 50

GCT TGT GAA TTA AAT AGT GGT AAT GGA TGT AGG TTT TTA GGG GAT TTT 248
Ala Cys Glu Leu Asn Ser Gly Asn Gly Cys Arg Phe Leu Gly Asp Phe
55 60 65

TAT GAG AAT GGA AAA TAT GTA AAA AAG GAT TTA AGA AAA GCT GCT CAA 296
Tyr Glu Asn Gly Lys Tyr Val Lys Lys Asp Leu Arg Lys Ala Ala Gln
70 75 80
120 125 130 135

GGC AGT AAA GGG CAG GAA TTG TTA GGT ATC ACT AAC AGC AGC GCG GTT 547
Gly Ser Lys Gly Gln Glu Leu Leu Gly Ile Thr Asn Ser Ser Ala Val
140 145 150

GAT AAC ACC TCT TTT CAA TTC CTC TTT AAC TTT GGC CTC AAG GCT TTA 595
Asp Asn Thr Ser Phe Gln Phe Leu Phe Asn Phe Gly Leu Lys Ala Leu
155 160 165

TTT GTA GAT GAG CAT GAA TTT GAA ATC GGT TTT AAA TTC CCC ACC ATT 643
Phe Val Asp Glu His Glu Phe Glu Ile Gly Phe Lys Phe Pro Thr Ile
170 175 180

AAT AAC AAA TAC TAC ACC ACT GAC GCG CTC AAG GTT CAA ATG CGT AGG 691
Asn Asn Lys Tyr Tyr Thr Thr Asp Ala Leu Lys Val Gln Met Arg Arg
185 190 195

GTC TTT GCC TTT TAT GTG GGG TAT AAT TAC CAC TTC TAAAGGGCTT TTAAAA 743
Val Phe Ala Phe Tyr Val Gly Tyr Asn Tyr His Phe
200 205 210

CCCAACGCAA CTCCCTAACA TCTTTTGGTA ATAGCTCTTG GCTTTGAG 791

(2) INFORMATION FOR SEQ ID NO: 678:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678: 

Met Gly Val Asn Tyr Gln Thr Gly Ser Ile Asn Leu Met Thr Asn Ile
1 5 10 15

His Glu Val Arg Glu Val Thr Asn Tyr Gln Thr Gly Tyr Thr Asn Ile
20 25 30

Ile Thr Ser Val Asn Ser Val Lys Lys Leu Thr Asn Met Gly Ser Asn
35 40 45

Gly Ile Gly Leu Val Met Gly Tyr Asn His Phe Phe His Pro Asp Lys
50 55 60

Ile Leu Gly Leu Arg Tyr Phe Ala Phe Leu Asp Trp Gln Gly Tyr Gly
65 70 75 80

Met Arg Tyr Pro Lys Gly Tyr Tyr Gly Gly Asn Asn Met Ile Thr Tyr
85 90 95

Gly Val Gly Val Asp Ala Val Trp Asn Phe Phe Gln Gly Ser Phe Tyr
100 105 110

Gln Asp Asp Ile Ser Val Asp Ile Gly Val Phe Gly Gly Ile Ala Ile
115 120 125

Ala Gly Asn Ser Trp Tyr Ile Gly Ser Lys Gly Gln Glu Leu Leu Gly
130 135 140
(2) INFORMATION FOR SEQ ID NO: 677:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 95... 727 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677: 

CCTTGTTAAA AATGTTAGTT GGTGCAAGCT TGCTGACACA CGCCTTAATA GCTAAAGAAG 60

AAAGCGCAGC NCTTCTTGGA CAAAAAATTT GTAT ATG GGA GTC AAT TAC CAA ACA 115
Met Gly Val Asn Tyr Gln Thr
1 5

GGT TCT ATC AAT TTA ATG ACT AAT ATC CAT GAA GTT AGA GAA GTT ACT 163
Gly Ser Ile Asn Leu Met Thr Asn Ile His Glu Val Arg Glu Val Thr
10 15 20

AAC TAT CAA ACC GGT TAC ACC AAT ATT ATA ACT AGC GTT AAT AGC GTT 211
Asn Tyr Gln Thr Gly Tyr Thr Asn Ile Ile Thr Ser Val Asn Ser Val
25 30 35

AAA AAG CTC ACC AAC ATG GGA TCT AAT GGG ATT GGA TTA GTC ATG GGT 259
Lys Lys Leu Thr Asn Met Gly Ser Asn Gly Ile Gly Leu Val Met Gly
40 45 50 55

TAT AAC CAC TTT TTC CAT CCG GAT AAA ATC TTG GGC TTG CGC TAT TTC 307
Tyr Asn His Phe Phe His Pro Asp Lys Ile Leu Gly Leu Arg Tyr Phe
60 65 70

GCT TTT TTA GAT TGG CAA GGC TAT GGC ATG AGA TAC CCT AAA GGC TAT 355
Ala Phe Leu Asp Trp Gln Gly Tyr Gly Met Arg Tyr Pro Lys Gly Tyr
75 80 85

TAT GGC GGC AAT AAC ATG ATC ACT TAT GGC GTG GGC GTG GAT GCA GTG 403
Tyr Gly Gly Asn Asn Met Ile Thr Tyr Gly Val Gly Val Asp Ala Val
90 95 100

TGG AAT TTC TTT CAA GGG AGT TTC TAT CAA GAT GAC ATT AGC GTG GAT 451
Trp Asn Phe Phe Gln Gly Ser Phe Tyr Gln Asp Asp Ile Ser Val Asp
105 110 115

ATT GGC GTT TTT GGG GGG ATT GCG ATT GCG GGG AAT AGC TGG TAT ATT 499
Ile Gly Val Phe Gly Gly Ile Ala Ile Ala Gly Asn Ser Trp Tyr Ile
ATC ATG GAC GCT TTA GAT CCT AAA CGC ACC TCT TAAAAGGAGC TTGCATGATT 925
Ile Met Asp Ala Leu Asp Pro Lys Arg Thr Ser
275 280 285

TTAGAAGTTA AAGATTTAAA AACTTATTTT TTC 958

(2) INFORMATION FOR SEQ ID NO: 676:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676: 

Met Glu Ser Phe Arg Glu Phe Ile Gln Gln Phe Lys Lys Asn Lys Ala
1 5 10 15

Ala Val Val Gly Ala Trp Ile Val Leu Leu Leu Val Ile Cys Ala Ile
20 25 30

Phe Ala Pro Leu Leu Ala Pro His Asp Pro Tyr Val Gln Asn Ala Gln
35 40 45

Asp Arg Leu Leu Lys Pro Ile Trp Glu His Gly Gly Asn Ala Lys Tyr
50 55 60

Leu Leu Gly Thr Asp Asp Leu Gly Arg Asp Ile Leu Ser Arg Leu Ile
65 70 75 80

Tyr Gly Ala Arg Ile Ser Leu Thr Ile Gly Ile Val Ser Met Gly Ile
85 90 95

Ala Val Phe Phe Gly Thr Ile Leu Gly Leu Ile Ala Gly Tyr Phe Gly
100 105 110

Gly Lys Thr Asp Ala Ile Ile Met Arg Ile Met Asp Ile Met Phe Ala
115 120 125

Leu Pro Ser Ile Leu Leu Ile Val Ile Val Val Ala Val Leu Gly Pro
130 135 140

Ser Leu Thr Asn Ala Met Leu Ala Ile Gly Phe Val Gly Ile Pro Gly
145 150 155 160

Phe Ala Arg Leu Val Arg Ser Ser Val Leu Gly Glu Lys Glu Lys Glu
165 170 175

Tyr Val Ile Ala Ser Lys Ile Asn Gly Ser Ser His Leu Arg Leu Met
180 185 190

Cys Lys Val Ile Phe Pro Asn Cys Ile Ile Pro Leu Ile Val Gln Thr
195 200 205

Thr Met Gly Phe Ala Ser Thr Val Leu Glu Ala Ala Ala Leu Ser Phe
210 215 220

Leu Gly Leu Gly Ala Gln Pro Pro Lys Pro Glu Trp Gly Ala Met Leu
225 230 235 240

Met Asn Ser Met Gln Tyr Ile Ala Thr Ala Pro Trp Met Leu Val Phe
245 250 255

Pro Gly Val Met Ile Phe Leu Thr Val Met Ser Phe Asn Leu Val Gly
260 265 270

Asp Gly Ile Met Asp Ala Leu Asp Pro Lys Arg Thr Ser
275 280 285
CTT TTG AAG CCT ATA TGG GAG CAT GGA GGG AAT GCT AAA TAC CTT TTA 248
Leu Leu Lys Pro Ile Trp Glu His Gly Gly Asn Ala Lys Tyr Leu Leu
55 60 65

GGC ACC GAT GAT TTG GGG CGC GAT ATT TTG AGC CGC TTG ATC TAT GGG 296
Gly Thr Asp Asp Leu Gly Arg Asp Ile Leu Ser Arg Leu Ile Tyr Gly
70 75 80

GCC AGG ATT TCT TTA ACC ATA GGG ATT GTT TCT ATG GGG ATT GCG GTG 344
Ala Arg Ile Ser Leu Thr Ile Gly Ile Val Ser Met Gly Ile Ala Val
85 90 95

TTT TTT GGC ACG ATA CTA GGG CTA ATA GCG GGG TAT TTT GGG GGG AAA 392
Phe Phe Gly Thr Ile Leu Gly Leu Ile Ala Gly Tyr Phe Gly Gly Lys
100 105 110

ACA GAT GCA ATT ATC ATG CGT ATC ATG GAC ATC ATG TTC GCT TTG CCC 440
Thr Asp Ala Ile Ile Met Arg Ile Met Asp Ile Met Phe Ala Leu Pro
115 120 125 130

TCT ATT TTA TTG ATC GTG ATT GTG GTC GCG GTG TTA GGG CCT TCA CTC 488
Ser Ile Leu Leu Ile Val Ile Val Val Ala Val Leu Gly Pro Ser Leu
135 140 145

ACT AAC GCC ATG CTC GCT ATT GGG TTT GTG GGG ATT CCT GGG TTT GCA 536
Thr Asn Ala Met Leu Ala Ile Gly Phe Val Gly Ile Pro Gly Phe Ala
150 155 160

AGA TTG GTG CGC AGT TCC GTG CTA GGT GAA AAA GAA AAA GAA TAC GTG 584
Arg Leu Val Arg Ser Ser Val Leu Gly Glu Lys Glu Lys Glu Tyr Val
165 170 175

ATC GCT TCT AAA ATC AAT GGC TCT TCG CAT CTT CGT TTG ATG TGT AAG 632
Ile Ala Ser Lys Ile Asn Gly Ser Ser His Leu Arg Leu Met Cys Lys
180 185 190

GTG ATT TTC CCT AAT TGC ATT ATC CCT TTA ATC GTT CAA ACG ACA ATG 680
Val Ile Phe Pro Asn Cys Ile Ile Pro Leu Ile Val Gln Thr Thr Met
195 200 205 210

GGT TTT GCT TCC ACG GTT TTA GAA GCG GCT GCA CTG AGC TTC TTA GGT 728
Gly Phe Ala Ser Thr Val Leu Glu Ala Ala Ala Leu Ser Phe Leu Gly
215 220 225

CTT GGG GCC CAA CCT CCC AAA CCC GAA TGG GGA GCG ATG TTG ATG AAT 776
Leu Gly Ala Gln Pro Pro Lys Pro Glu Trp Gly Ala Met Leu Met Asn
230 235 240

TCC ATG CAA TAC ATC GCT ACC GCT CCT TGG ATG CTT GTT TTC CCT GGG 824
Ser Met Gln Tyr Ile Ala Thr Ala Pro Trp Met Leu Val Phe Pro Gly
245 250 255

GTG ATG ATT TTT TTA ACG GTC ATG AGT TTT AAT CTG GTA GGC GAT GGC 872
Val Met Ile Phe Leu Thr Val Met Ser Phe Asn Leu Val Gly Asp Gly
260 265 270
405 410 415

Val Leu Leu Asn Lys Lys Gly Lys Leu Ile Lys Glu Tyr Lys Thr Leu
420 425 430

Glu Pro Leu Lys Ser Leu Glu Ile Arg Leu Ser Glu Ala Pro Ile Asp
435 440 445

Lys Arg Asn Asp Phe Leu Tyr His Lys Thr Thr Tyr Ala Pro Phe Tyr
450 455 460

Gln Lys Ala Arg Ala Leu Ile Lys Lys Gly Val Met Phe Asp Glu Ile
465 470 475 480

Phe Tyr Asn Gln Asp Leu Glu Leu Thr Glu Gly Ala Arg Ser Asn Leu
485 490 495

Val Leu Glu Ile His Asn Arg Leu Leu Thr Pro Tyr Phe Ser Ala Gly
500 505 510

Ala Leu Asn Gly Thr Gly Val Val Gly Leu Leu Lys Lys Gly Leu Val
515 520 525

Gly His Ala Pro Leu Lys Leu Gln Asp Leu Gln Lys Ala Ser Lys Ile
530 535 540

Tyr Cys Ile Asn Ala Leu Tyr Gly Leu Val Glu Val Lys Ile Lys
545 550 555

(2) INFORMATION FOR SEQ ID NO: 675:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 905 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675: 

TTAGTGGATA TTTTATACGC TTTTATTGAT CCTAGAATAA GGTTGTCATA ATG GAG 56
Met Glu
1

TCT TTT AGA GAG TTT ATC CAA CAA TTC AAA AAA AAT AAG GCA GCG GTC 104
Ser Phe Arg Glu Phe Ile Gln Gln Phe Lys Lys Asn Lys Ala Ala Val
5 10 15

GTT GGG GCT TGG ATT GTG CTT TTA TTG GTA ATT TGC GCG ATT TTT GCG 152
Val Gly Ala Trp Ile Val Leu Leu Leu Val Ile Cys Ala Ile Phe Ala
20 25 30

CCC CTT TTA GCC CCG CAT GAT CCT TAT GTC CAA AAC GCG CAA GAT CGC 200
Pro Leu Leu Ala Pro His Asp Pro Tyr Val Gln Asn Ala Gln Asp Arg
35 40 45 50
(v) FRAGMENT TYPE: internal



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:674: 



Met Ile Phe Gly Asp Phe Lys Tyr Gln Lys Ser Val Lys Lys Leu Thr
1 5 10 15

Ala Thr Asn Leu Asn Glu Leu Lys Asn Ala Leu Asp Phe Ile Ser Gln
20 25 30

Asn Arg Gly Asn Gly Tyr Phe Val Gly Tyr Leu Leu Tyr Glu Ala Arg
35 40 45

Leu Ala Phe Leu Asp Glu Asn Phe Gln Ser Gln Thr Pro Phe Leu Tyr
50 55 60

Phe Glu Gln Phe Leu Glu Arg Lys Lys Tyr Ser Leu Glu Pro Leu Lys
65 70 75 80

Glu His Ala Phe Tyr Pro Lys Ile His Ser Ser Leu Asp Gln Lys Thr
85 90 95

Tyr Phe Lys Gln Phe Lys Ala Val Lys Glu Arg Leu Lys Asn Gly Asp
100 105 110

Thr Tyr Gln Val Asn Leu Thr Met Asp Leu Phe Leu Asp Thr Lys Ala
115 120 125

Lys Pro Lys Arg Val Phe Lys Glu Val Val His Asn Gln Asn Thr Pro
130 135 140

Phe Lys Ala Phe Ile Glu Asn Glu Phe Gly Ser Val Leu Ser Phe Ser
145 150 155 160

Pro Glu Leu Phe Phe Glu Leu Glu Phe Leu Asp Thr Ala Ile Lys Ile
165 170 175

Ile Thr Lys Pro Met Lys Gly Thr Ile Ala Arg Ser Lys Asn Pro Leu
180 185 190

Ile Asp Glu Lys Asn Arg Leu Phe Leu Gln Asn Asp Asp Lys Asn Arg
195 200 205

Ser Glu Asn Val Met Ile Val Asp Leu Leu Arg Asn Asp Leu Ser Arg
210 215 220

Leu Ala Leu Lys Asn Ser Val Lys Val Asn Gln Leu Phe Glu Ile Ile
225 230 235 240

Ser Leu Pro Ser Val Tyr Gln Met Ile Ser Glu Ile Glu Ala Lys Leu
245 250 255

Pro Leu Lys Thr Ser Leu Phe Glu Ile Phe Lys Ala Leu Phe Pro Cys
260 265 270

Gly Ser Val Thr Gly Cys Pro Lys Ile Lys Thr Met Gln Ile Ile Glu
275 280 285

Ser Leu Glu Lys Arg Pro Arg Gly Val Tyr Cys Gly Ala Ile Gly Met
290 295 300

Val Glu Glu Lys Lys Ala Leu Phe Ser Val Pro Ile Arg Thr Leu Glu
305 310 315 320

Lys Arg Val His Glu Asn Phe Leu His Leu Gly Val Gly Ser Gly Val
325 330 335

Thr Tyr Lys Ser Lys Ala Pro Lys Glu Tyr Glu Glu Ser Phe Leu Lys
340 345 350

Ser Phe Phe Val Met Pro Lys Ile Glu Phe Glu Ile Val Glu Thr Met
355 360 365

Lys Ile Ile Lys Lys Asp Gln Lys Leu Glu Ile Asn Asn Lys Asn Ala
370 375 380

His Lys Glu Arg Leu Met Asn Ser Thr Arg Tyr Phe Asn Phe Lys Tyr
385 390 395 400

Asp Glu Asn Leu Leu Asp Phe Glu Leu Glu Lys Glu Gly Val Leu Arg
GAA CGC TTA ATG AAT AGC ACT CGA TAT TTT AAC TTT AAA TAC GAT GAA 1256
Glu Arg Leu Met Asn Ser Thr Arg Tyr Phe Asn Phe Lys Tyr Asp Glu
390 395 400

AAT CTT TTA GAT TTT GAA TTA GAA AAA GAA GGG GTT TTA AGG GTT TTA 1304
Asn Leu Leu Asp Phe Glu Leu Glu Lys Glu Gly Val Leu Arg Val Leu
405 410 415

CTC AAT AAA AAG GGC AAG CTC ATT AAA GAA TAC AAA ACC TTA GAG CCT 1352
Leu Asn Lys Lys Gly Lys Leu Ile Lys Glu Tyr Lys Thr Leu Glu Pro
420 425 430

TTA AAA AGC CTA GAA ATC CGT TTG AGT GAA GCC CCC ATT GAT AAA CGC 1400
Leu Lys Ser Leu Glu Ile Arg Leu Ser Glu Ala Pro Ile Asp Lys Arg
435 440 445 450

AAT GAT TTT TTA TAC CAT AAG ACC ACT TAT GCC CCT TTT TAT CAA AAG 1448
Asn Asp Phe Leu Tyr His Lys Thr Thr Tyr Ala Pro Phe Tyr Gln Lys
455 460 465

GCT CGA GCG CTC ATT AAA AAG GGC GTT ATG TTT GAT GAA ATC TTT TAT 1496
Ala Arg Ala Leu Ile Lys Lys Gly Val Met Phe Asp Glu Ile Phe Tyr
470 475 480

AAC CAG GAT TTG GAA CTC ACT GAG GGC GCT AGG AGC AAT CTT GTT TTA 1544
Asn Gln Asp Leu Glu Leu Thr Glu Gly Ala Arg Ser Asn Leu Val Leu
485 490 495

GAA ATC CAT AAC AGG CTT TTA ACC CCT TAT TTT AGC GCG GGC GCG TTA 1592
Glu Ile His Asn Arg Leu Leu Thr Pro Tyr Phe Ser Ala Gly Ala Leu
500 505 510

AAC GGG ACG GGT GTT GTG GGG TTG TTA AAA AAG GGT CTT GTT GGG CAT 1640
Asn Gly Thr Gly Val Val Gly Leu Leu Lys Lys Gly Leu Val Gly His
515 520 525 530

GCA CCT TTG AAA TTG CAA GAT TTG CAA AAA GCG TCT AAA ATC TAT TGT 1688
Ala Pro Leu Lys Leu Gln Asp Leu Gln Lys Ala Ser Lys Ile Tyr Cys
535 540 545

ATT AAC GCG CTA TAT GGC TTA GTG GAA GTG AAA ATA AAA TAACTATAAA AA 1739
Ile Asn Ala Leu Tyr Gly Leu Val Glu Val Lys Ile Lys
550 555

CAGAGCGGCT AAAACCTCAT TTTTAGAAAT AGGTTACCCA A 1780

(2) INFORMATION FOR SEQ ID NO: 674: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
TTG TTT TTT GAA TTA GAG TTT TTA GAT ACA GCG ATT AAG ATT ATT ACA 584
Leu Phe Phe Glu Leu Glu Phe Leu Asp Thr Ala Ile Lys Ile Ile Thr
165 170 175

AAA CCC ATG AAA GGC ACG ATC GCT CGC TCA AAA AAC CCC TTA ATA GAT 632
Lys Pro Met Lys Gly Thr Ile Ala Arg Ser Lys Asn Pro Leu Ile Asp
180 185 190

GAA AAA AAC CGA TTG TTT TTG CAA AAT GAT GAC AAA AAC AGA AGC GAA 680
Glu Lys Asn Arg Leu Phe Leu Gln Asn Asp Asp Lys Asn Arg Ser Glu
195 200 205 210

AAT GTG ATG ATT GTG GAT TTA TTG CGT AAC GAT TTG AGC CGC TTG GCC 728
Asn Val Met Ile Val Asp Leu Leu Arg Asn Asp Leu Ser Arg Leu Ala
215 220 225

TTA AAA AAT AGC GTG AAA GTC AAT CAA TTG TTT GAA ATC ATC AGC TTG 776
Leu Lys Asn Ser Val Lys Val Asn Gln Leu Phe Glu Ile Ile Ser Leu
230 235 240

CCT AGC GTG TAT CAA ATG ATA AGC GAG ATT GAA GCG AAA TTG CCC CTA 824
Pro Ser Val Tyr Gln Met Ile Ser Glu Ile Glu Ala Lys Leu Pro Leu
245 250 255

AAA ACC AGC TTG TTT GAG ATT TTT AAG GCG TTG TTC CCT TGC GGC TCT 872
Lys Thr Ser Leu Phe Glu Ile Phe Lys Ala Leu Phe Pro Cys Gly Ser
260 265 270

GTG ACC GGA TGC CCT AAA ATC AAA ACC ATG CAA ATC ATT GAA AGT TTA 920
Val Thr Gly Cys Pro Lys Ile Lys Thr Met Gln Ile Ile Glu Ser Leu
275 280 285 290

GAA AAA CGC CCT AGG GGG GTG TAT TGC GGG GCG ATA GGC ATG GTT GAA 968
Glu Lys Arg Pro Arg Gly Val Tyr Cys Gly Ala Ile Gly Met Val Glu
295 300 305

GAA AAA AAA GCC CTT TTT AGC GTG CCT ATC CGC ACT TTA GAA AAA AGA 1016
Glu Lys Lys Ala Leu Phe Ser Val Pro Ile Arg Thr Leu Glu Lys Arg
310 315 320

GTG CAC GAA AAT TTT TTG CAT TTA GGG GTA GGG AGT GGG GTA ACT TAT 1064
Val His Glu Asn Phe Leu His Leu Gly Val Gly Ser Gly Val Thr Tyr
325 330 335

AAA AGT AAA GCG CCA AAA GAA TAT GAA GAA AGC TTT TTG AAA TCC TTT 1112
Lys Ser Lys Ala Pro Lys Glu Tyr Glu Glu Ser Phe Leu Lys Ser Phe
340 345 350

TTT GTG ATG CCC AAA ATA GAA TTT GAG ATT GTA GAA ACG ATG AAA ATT 1160
Phe Val Met Pro Lys Ile Glu Phe Glu Ile Val Glu Thr Met Lys Ile
355 360 365 370

ATC AAA AAG GAT CAA AAA TTA GAG ATT AAT AAT AAA AAC GCC CAT AAA 1208
Ile Lys Lys Asp Gln Lys Leu Glu Ile Asn Asn Lys Asn Ala His Lys
375 380 385
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1727 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673: 

ATAACCCGGT GTTTTTGATA GACAAATACG AAGCCGTTGC TAAAAATTAA ATG ATT 56
Met Ile
1

TTT GGG GAT TTT AAA TAT CAA AAA AGC GTT AAA AAA CTC ACA GCC ACC 104
Phe Gly Asp Phe Lys Tyr Gln Lys Ser Val Lys Lys Leu Thr Ala Thr
5 10 15

AAT CTT AAT GAG CTT AAA AAC GCC CTG GAT TTC ATC TCT CAA AAT AGG 152
Asn Leu Asn Glu Leu Lys Asn Ala Leu Asp Phe Ile Ser Gln Asn Arg
20 25 30

GGG AAT GGG TAT TTT GTG GGG TAT CTT TTA TAT GAA GCG CGC TTA GCG 200
Gly Asn Gly Tyr Phe Val Gly Tyr Leu Leu Tyr Glu Ala Arg Leu Ala
35 40 45 50

TTT TTA GAT GAA AAT TTT CAA AGC CAA ACC CCT TTT TTG TAT TTT GAA 248
Phe Leu Asp Glu Asn Phe Gln Ser Gln Thr Pro Phe Leu Tyr Phe Glu
55 60 65

CAA TTT TTA GAA AGA AAA AAA TAT TCT TTA GAG CCT TTA AAA GAG CAT 296
Gln Phe Leu Glu Arg Lys Lys Tyr Ser Leu Glu Pro Leu Lys Glu His
70 75 80

GCG TTT TAC CCT AAA ATC CAT AGT TCT TTA GAT CAA AAA ACT TAT TTC 344
Ala Phe Tyr Pro Lys Ile His Ser Ser Leu Asp Gln Lys Thr Tyr Phe
85 90 95

AAG CAG TTT AAA GCC GTT AAA GAG CGT CTC AAA AAC GGC GAC ACC TAT 392
Lys Gln Phe Lys Ala Val Lys Glu Arg Leu Lys Asn Gly Asp Thr Tyr
100 105 110

CAA GTG AAT CTC ACA ATG GAT TTA TTT TTA GAC ACT AAA GCC AAA CCA 440
Gln Val Asn Leu Thr Met Asp Leu Phe Leu Asp Thr Lys Ala Lys Pro
115 120 125 130

AAG CGC GTT TTT AAG GAG GTG GTA CAC AAC CAA AAC ACG CCT TTT AAG 488
Lys Arg Val Phe Lys Glu Val Val His Asn Gln Asn Thr Pro Phe Lys
135 140 145

GCT TTT ATA GAA AAT GAG TTT GGG AGC GTT TTA AGC TTT TCG CCG GAA 536
Ala Phe Ile Glu Asn Glu Phe Gly Ser Val Leu Ser Phe Ser Pro Glu
150 155 160
Ala Gly Glu Ala Ala Leu Gln Ser Ala Glu Phe Ala Arg Ser His Ala
115 120 125 130

CAA AGC GAA TGG TTG TTT AAG GAA TTG TTT GTG CTG GTG TGC GCT TTG 488
Gln Ser Glu Trp Leu Phe Lys Glu Leu Phe Val Leu Val Cys Ala Leu
135 140 145

TTT TTT TGG CGT TTG CTT GGA AAA AAT GTG CTT TAGTCCCTTT GATTTAATCA 541
Phe Phe Trp Arg Leu Leu Gly Lys Asn Val Leu
150 155

AATGAGAGAG TTTTTGGCTA CTATCTAGGA AAT 574

(2) INFORMATION FOR SEQ ID NO: 672: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672:

Met Lys Lys Phe Gly Leu Gly Val Tyr Leu Leu Leu Leu Gly Ile Leu
1 5 10 15

Gly Gly Ser Leu Ile Ile Leu Gly Ala Ile Val Ala Pro Ile Val Phe
20 25 30

Lys Ala Ser Ser Val Leu Pro Glu Leu His Leu Thr Pro Phe Glu Ser
35 40 45

Gly Lys Leu Met Ala Gln Ile Phe Val Arg Phe Asn Tyr Val Leu Gly
50 55 60

Ala Ile Gly Phe Val Val Leu Leu Tyr Glu Ile Ile Ser Phe Ile Tyr
65 70 75 80

Tyr Lys Arg Ser Leu Val Tyr Leu Ile Leu Gly Val Ala Ile Gly Ala
85 90 95

Leu Cys Leu Leu Phe Val Phe Tyr Tyr Thr Pro Tyr Ile Leu Asn Ala
100 105 110

Gln Lys Ala Gly Glu Ala Ala Leu Gln Ser Ala Glu Phe Ala Arg Ser
115 120 125

His Ala Gln Ser Glu Trp Leu Phe Lys Glu Leu Phe Val Leu Val Cys
130 135 140

Ala Leu Phe Phe Trp Arg Leu Leu Gly Lys Asn Val Leu
145 150 155

(2) INFORMATION FOR SEQ ID NO: 673: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(2) INFORMATION FOR SEQ ID NO: 671:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 521 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671: 

AGTTTAAGGA TTGCAACCAT TTGTTGCAAA AATCTTAAAA AAGGTGGAAG ATG AAA 56
Met Lys
1

AAA TTT GGT TTG GGG GTG TAT TTG CTT CTT TTA GGT ATT TTG GGC GGC 104
Lys Phe Gly Leu Gly Val Tyr Leu Leu Leu Leu Gly Ile Leu Gly Gly
5 10 15

TCT TTG ATC ATT CTA GGA GCG ATA GTC GCG CCC ATT GTT TTC AAA GCT 152
Ser Leu Ile Ile Leu Gly Ala Ile Val Ala Pro Ile Val Phe Lys Ala
20 25 30

TCA AGC GTT TTA CCT GAA TTG CAT CTG ACT CCC TTT GAG AGC GGG AAA 200
Ser Ser Val Leu Pro Glu Leu His Leu Thr Pro Phe Glu Ser Gly Lys
35 40 45 50

CTC ATG GCG CAA ATC TTT GTG CGT TTC AAT TAT GTT TTA GGC GCG ATC 248
Leu Met Ala Gln Ile Phe Val Arg Phe Asn Tyr Val Leu Gly Ala Ile
55 60 65

GGT TTT GTA GTG TTA CTT TAT GAA ATC ATT TCG TTT ATT TAT TAC AAA 296
Gly Phe Val Val Leu Leu Tyr Glu Ile Ile Ser Phe Ile Tyr Tyr Lys
70 75 80

AGA TCG TTA GTG TAT TTG ATC CTT GGC GTG GCG ATA GGG GCG TTG TGT 344
Arg Ser Leu Val Tyr Leu Ile Leu Gly Val Ala Ile Gly Ala Leu Cys
85 90 95

TTG CTC TTT GTT TTT TAT TAC ACG CCT TAT ATT TTA AAC GCT CAA AAA 392
Leu Leu Phe Val Phe Tyr Tyr Thr Pro Tyr Ile Leu Asn Ala Gln Lys
100 105 110

GCG GGC GAA GCC GCG CTT CAA AGT GCT GAA TTT GCC CGC TCG CAC GCT 440
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:670:



Met Lys Lys Val Tyr Phe Lys Thr Phe Gly Cys Arg Thr Asn Leu Phe
1 5 10 15

Asp Thr Gln Val Met Ser Glu Asn Leu Lys Asp Phe Ser Thr Thr Leu
20 25 30

Glu Glu Gln Glu Ala Asp Ile Ile Ile Ile Asn Ser Cys Thr Val Thr
35 40 45

Asn Gly Ala Asp Ser Ala Val Arg Ser Tyr Ala Lys Lys Met Ala Arg
50 55 60

Leu Asp Lys Glu Val Leu Phe Thr Gly Cys Gly Val Lys Thr Gln Gly
65 70 75 80

Lys Glu Leu Phe Glu Lys Gly Phe Leu Lys Gly Val Phe Gly His Asp
85 90 95

Asn Lys Glu Lys Ile Asn Ala Leu Leu Gln Glu Lys Lys Arg Phe Phe
100 105 110

Ile Asp Asp Asn Leu Glu Asn Lys His Leu Asp Thr Thr Met Val Ser
115 120 125

Glu Phe Val Gly Lys Thr Arg Ala Phe Ile Lys Ile Gln Glu Gly Cys
130 135 140

Asp Phe Asp Cys Asn Tyr Cys Ile Ile Pro Ser Val Arg Gly Arg Ala
145 150 155 160

Arg Ser Phe Glu Glu Arg Lys Ile Leu Glu Gln Val Gly Leu Leu Cys
165 170 175

Ser Lys Gly Val Gln Glu Val Val Leu Thr Gly Thr Asn Val Gly Ser
180 185 190

Tyr Gly Lys Asp Arg Gly Ser Asn Ile Ala Arg Leu Ile Lys Lys Leu
195 200 205

Ser Gln Ile Ala Gly Leu Lys Arg Ile Arg Ile Gly Ser Leu Glu Pro
210 215 220

Asn Gln Ile Asn Asp Glu Phe Leu Glu Leu Leu Glu Glu Asp Phe Leu
225 230 235 240

Glu Lys His Leu His Ile Ala Leu Gln His Ser His Asp Leu Met Leu
245 250 255

Glu Arg Met Asn Arg Arg Asn Arg Thr Lys Ser Asp Arg Glu Leu Leu
260 265 270

Glu Thr Ile Ala Ser Lys Asn Phe Ala Ile Gly Thr Asp Phe Ile Val
275 280 285

Gly His Pro Gly Glu Ser Gly Ser Val Phe Glu Lys Ala Phe Lys Asn
290 295 300

Leu Glu Ser Leu Pro Leu Thr His Ile His Pro Phe Ile Tyr Ser Lys
305 310 315 320

Arg Lys Asp Thr Pro Ser Ser Leu Met Thr Asp Ser Val Ser Leu Glu
325 330 335

Asp Ser Lys Lys Arg Leu Asn Ala Ile Lys Asp Leu Ile Phe His Lys
340 345 350

Asn Lys Ala Phe Arg Gln Leu Gln Leu Lys Leu Asn Thr Pro Leu Lys
355 360 365

Ala Leu Val Glu Val Gln Lys Asp Gly Glu Phe Lys Ala Leu Asp Gln
370 375 380

Phe Phe Asn Pro Ile Lys Ile Lys Ser Asp Lys Pro Leu Arg Ala Ser
385 390 395 400

Phe Leu Glu Ile Lys Glu Tyr Glu Ile Lys Glu Arg Glu Asn His Ala
405 410 415

Val Phe
His Leu His Ile Ala Leu Gln His Ser His Asp Leu Met Leu Glu Arg
245 250 255 

ATG AAT CGA AGA AAC CGC ACT AAA AGC GAT AGG GAA TTA TTA GAA ACA 872

Met Asn Arg Arg Asn Arg Thr Lys Ser Asp Arg Glu Leu Leu Glu Thr
260 265 270

ATC GCT TCT AAG AAT TTT GCT ATT GGC ACG GAT TTT ATT GTG GGG CAT 920

Ile Ala Ser Lys Asn Phe Ala Ile Gly Thr Asp Phe Ile Val Gly His
275 280 285 290

CCG GGC GAG AGC GGA AGC GTT TTT GAA AAA GCG TTT AAA AAT TTA GAA 968

Pro Gly Glu Ser Gly Ser Val Phe Glu Lys Ala Phe Lys Asn Leu Glu
295 300 305

AGC TTG CCT TTA ACG CAC ATC CAC CCT TTT ATT TAC AGC AAA CGA AAA 1016
Ser Leu Pro Leu Thr His Ile His Pro Phe Ile Tyr Ser Lys Arg Lys
310 315 320

GAC ACC CCC TCT AGC TTG ATG ACT GAT AGC GTG AGT TTG GAA GAT TCT 1064

Asp Thr Pro Ser Ser Leu Met Thr Asp Ser Val Ser Leu Glu Asp Ser
325 330 335

AAA AAG CGT TTG AAT GCG ATT AAA GAT TTG ATT TTT CAT AAA AAT AAG 1112
Lys Lys Arg Leu Asn Ala Ile Lys Asp Leu Ile Phe His Lys Asn Lys
340 345 350

GCG TTC AGG CAA TTG CAG CTC AAG CTC AAT ACG CCT CTA AAA GCC TTA 1160

Ala Phe Arg Gln Leu Gln Leu Lys Leu Asn Thr Pro Leu Lys Ala Leu
355 360 365 370

GTG GAA GTG CAA AAA GAC GGC GAA TTT AAA GCC TTA GAT CAA TTT TTC 1208

Val Glu Val Gln Lys Asp Gly Glu Phe Lys Ala Leu Asp Gln Phe Phe
375 380 385

AAC CCC ATT AAA ATC AAA AGC GAT AAG CCT CTA AGG GCT AGT TTT TTA 1256

Asn Pro Ile Lys Ile Lys Ser Asp Lys Pro Leu Arg Ala Ser Phe Leu
390 395 400

GAA ATC AAA GAG TAT GAA ATT AAG GAG AGG GAA AAT CAT GCC GTT TTC T 1305
Glu Ile Lys Glu Tyr Glu Ile Lys Glu Arg Glu Asn His Ala Val Phe
405 410 415

AAAAATTTAG AAAATCTTAC CGCTCCCTTC AAACGCATTA AAAACCGCTC GC 1357

(2) INFORMATION FOR SEQ ID NO: 670:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 418 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 
20 25 30

CAA GAA GCC GAT ATT ATT ATC ATC AAT TCT TGC ACC GTG ACC AAT GGG 200

Gln Glu Ala Asp Ile Ile Ile Ile Asn Ser Cys Thr Val Thr Asn Gly
35 40 45 50

GCC GAT AGC GCG GTA AGG AGT TAC GCT AAA AAA ATG GCA CGG TTG GAT 248

Ala Asp Ser Ala Val Arg Ser Tyr Ala Lys Lys Met Ala Arg Leu Asp
55 60 65

AAG GAA GTG CTA TTT ACT GGT TGC GGG GTG AAA ACC CAA GGC AAA GAG 296

Lys Glu Val Leu Phe Thr Gly Cys Gly Val Lys Thr Gln Gly Lys Glu
70 75 80

CTT TTT GAA AAA GGG TTT TTA AAG GGC GTT TTT GGG CAT GAC AAT AAA 344

Leu Phe Glu Lys Gly Phe Leu Lys Gly Val Phe Gly His Asp Asn Lys
85 90 95

GAA AAG ATT AAC GCG CTT TTA CAA GAA AAA AAG CGT TTT TTT ATA GAT 392

Glu Lys Ile Asn Ala Leu Leu Gln Glu Lys Lys Arg Phe Phe Ile Asp
100 105 110

GAC AAT TTA GAA AAC AAG CAC TTA GAC ACC ACG ATG GTG AGC GAG TTT 440

Asp Asn Leu Glu Asn Lys His Leu Asp Thr Thr Met Val Ser Glu Phe
115 120 125 130

GTG GGA AAA ACT AGG GCG TTT ATT AAG ATC CAA GAA GGC TGT GAT TTT 488

Val Gly Lys Thr Arg Ala Phe Ile Lys Ile Gln Glu Gly Cys Asp Phe
135 140 145

GAT TGC AAT TAT TGC ATT ATC CCA AGC GTG AGA GGG AGG GCT AGG AGT 536

Asp Cys Asn Tyr Cys Ile Ile Pro Ser Val Arg Gly Arg Ala Arg Ser
150 155 160

TTT GAA GAG AGA AAA ATT TTA GAG CAA GTG GGC CTT TTA TGC TCT AAA 584

Phe Glu Glu Arg Lys Ile Leu Glu Gln Val Gly Leu Leu Cys Ser Lys
165 170 175

GGG GTT CAA GAA GTG GTT TTA ACC GGC ACC AAT GTG GGG AGC TAT GGG 632

Gly Val Gln Glu Val Val Leu Thr Gly Thr Asn Val Gly Ser Tyr Gly
180 185 190

AAA GAT AGA GGA AGC AAT ATC GCG CGA TTG ATT AAA AAA TTA AGC CAG 680

Lys Asp Arg Gly Ser Asn Ile Ala Arg Leu Ile Lys Lys Leu Ser Gln
195 200 205 210

ATC GCT GGA TTA AAA CGC ATA AGG ATT GGG AGC TTA GAA CCT AAT CAA 728

Ile Ala Gly Leu Lys Arg Ile Arg Ile Gly Ser Leu Glu Pro Asn Gln
215 220 225

ATT AAC GAT GAA TTT TTA GAG CTT TTA GAA GAG GAT TTT TTA GAA AAA 776

Ile Asn Asp Glu Phe Leu Glu Leu Leu Glu Glu Asp Phe Leu Glu Lys
230 235 240

CAT TTG CAT ATC GCT TTA CAG CAC AGC CAT GAT CTC ATG CTA GAG AGG 824
His Leu Lys Met Arg Ser Leu Ser Val Gly Leu Thr His Glu Gly Phe

195 200 205 

Asp Gly Tyr Ala Ile Gly Gly Leu Ala Val Gly Glu Ser Ala Asp Glu

210 215 220

Met Leu Glu Thr Ile Ala His Thr Ala Pro Leu Leu Pro Lys Asp Lys
225 230 235 240

Pro Arg Tyr Leu Met Gly Val Gly Thr Pro Glu Asn Ile Leu Asp Ala

245 250 255

Ile Ser Leu Gly Val Asp Met Phe Asp Cys Val Met Pro Thr Arg Asn

260 265 270

Ala Arg Asn Ala Thr Leu Phe Thr His Ser Gly Lys Ile Ser Ile Lys

275 280 285

Asn Ala Pro Tyr Lys Leu Asp Asn Thr Pro Ile Glu Glu Asn Cys Ala

290 295 300

Cys Tyr Ala Cys Lys Arg Tyr Ser Lys Ala Tyr Leu His His Leu Phe
305 310 315 320

Arg Ala Lys Glu Leu Thr Tyr Ala Arg Leu Ala Ser Leu His Asn Leu

325 330 335

His Phe Tyr Leu Glu Leu Val Lys Asn Ala Arg Asn Ala Ile Leu Glu

340 345 350 

Lys Arg Phe Leu Ser Phe Lys Lys Glu Phe Leu Glu Lys Tyr Asn Ser 

355 360 365 

Arg Ser His 
370 

(2) INFORMATION FOR SEQ ID NO: 669: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1304
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669: 

GTGGATGCTG ATTTTTTAGG CGATGGTTTT GGGAATAAAA GGGAACAAAA ATG AAA 56

Met Lys
1

AAA GTC TAT TTC AAA ACT TTT GGG TGC AGG ACG AAT CTT TTT GAC ACG 104
Lys Val Tyr Phe Lys Thr Phe Gly Cys Arg Thr Asn Leu Phe Asp Thr
5 10 15

CAA GTG ATG AGC GAG AAT TTG AAG GAC TTT AGC ACG ACC TTA GAA GAA 152

Gln Val Met Ser Glu Asn Leu Lys Asp Phe Ser Thr Thr Leu Glu Glu
GCT AAA GAA CTC ACT TAC GCT CGT TTG GCC AGC TTG CAC AAT TTG CAT 1068

Ala Lys Glu Leu Thr Tyr Ala Arg Leu Ala Ser Leu His Asn Leu His 
325 330 335 

TTT TAT TTA GAG CTG GTG AAG AAC GCC AGA AAC GCC ATT TTA GAA AAG 1116 
Phe Tyr Leu Glu Leu Val Lys Asn Ala Arg Asn Ala Ile Leu Glu Lys
340 345 350

CGG TTT TTG AGT TTT AAA AAA GAA TTT TTG GAG AAA TAC AAC TCC CGC 1164
Arg Phe Leu Ser Phe Lys Lys Glu Phe Leu Glu Lys Tyr Asn Ser Arg
355 360 365

TCT CAT TGAATGATGG AATGCAAAAA TACTAAAAAG CGTTTTTTAC CATCAATAAA AG 1222
Ser His
370

TTTTCTTAAA A 1233

(2) INFORMATION FOR SEQ ID NO: 668:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 371 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668: 

Met Asp Phe Gln Leu Gln Ala Thr Asp Asn Asn Ala Arg Ala Gly Leu

1 5 10 15

Leu Asn Leu Ala His Ser Gln Val Ala Thr Pro Val Phe Met Pro Val

20 25 30

Gly Thr Gln Gly Cys Ile Lys Ser Leu Asp Ala Thr Asp Ala Gln Glu

35 40 45

Ile Leu Gly Ala Lys Leu Ile Leu Ala Asn Thr Tyr His Met Tyr Leu

50 55 60

Arg Pro Gly Glu Lys Val Val Glu Glu Leu Gly Gly Leu His Arg Phe
65 70 75 80

Ala Gln Phe Tyr Gly Ser Phe Leu Thr Asp Ser Gly Gly Phe Gln Ala

85 90 95

Phe Ser Leu Ser Asp Asn Val Lys Leu Gln Glu Asp Gly Ile Val Phe

100 105 110

Lys Ser His Ile Asp Gly Ser Lys His Leu Phe Thr Pro Ala Lys Val

115 120 125

Leu Asp Ile Gln Tyr Ser Leu Asn Ser Asp Ile Met Met Val Leu Asp

130 135 140

Asp Leu Val Gly Leu Pro Ala Pro Leu Lys Arg Leu Glu Glu Ser Ile
145 150 155 160

Lys Arg Ser Ala Lys Trp Ala Asn Met Ser Leu Glu Tyr His Lys Glu

165 170 175

Lys Asn Arg Pro Ser Asn Asn Leu Phe Ala Ile Ile Gln Gly Gly Thr
180 185 190 
AGT TTG AGC GAT AAT GTC AAA TTG CAA GAA GAT GGG ATC GTT TTT AAA 396

Ser Leu Ser Asp Asn Val Lys Leu Gln Glu Asp Gly Ile Val Phe Lys
100 105 110

TCC CAT ATT GAT GGG AGC AAG CAT CTA TTC ACG CCC GCT AAA GTT TTG 444
Ser His Ile Asp Gly Ser Lys His Leu Phe Thr Pro Ala Lys Val Leu
115 120 125

GAC ATT CAA TAT TCT TTG AAT AGC GAT ATT ATG ATG GTT TTA GAC GAT 492

Asp Ile Gln Tyr Ser Leu Asn Ser Asp Ile Met Met Val Leu Asp Asp
130 135 140 145

TTA GTG GGC TTG CCC GCT CCC TTA AAA CGC CTT GAA GAA TCC ATT AAA 540

Leu Val Gly Leu Pro Ala Pro Leu Lys Arg Leu Glu Glu Ser Ile Lys
150 155 160

AGA AGT GCT AAA TGG GCG AAT ATG AGC CTA GAA TAC CAC AAA GAA AAA 588

Arg Ser Ala Lys Trp Ala Asn Met Ser Leu Glu Tyr His Lys Glu Lys
165 170 175

AAC CGC CCG AGC AAC AAC CTT TTT GCC ATT ATC CAG GGC GGG ACG CAT 636

Asn Arg Pro Ser Asn Asn Leu Phe Ala Ile Ile Gln Gly Gly Thr His
180 185 190

TTG AAA ATG CGC AGC CTT AGC GTG GGA TTA ACG CAT GAG GGT TTT GAT 684

Leu Lys Met Arg Ser Leu Ser Val Gly Leu Thr His Glu Gly Phe Asp
195 200 205

GGC TAC GCT ATA GGC GGT TTA GCG GTG GGG GAA AGC GCT GAT GAA ATG 732

Gly Tyr Ala Ile Gly Gly Leu Ala Val Gly Glu Ser Ala Asp Glu Met
210 215 220 225

CTA GAA ACC ATC GCG CAC ACC GCC CCC TTG CTC CCC AAA GAC AAG CCT 780

Leu Glu Thr Ile Ala His Thr Ala Pro Leu Leu Pro Lys Asp Lys Pro
230 235 240

CGC TAC TTA ATG GGC GTA GGC ACG CCT GAA AAT ATC CTA GAC GCT ATC 828

Arg Tyr Leu Met Gly Val Gly Thr Pro Glu Asn Ile Leu Asp Ala Ile
245 250 255

AGT TTG GGG GTG GAT ATG TTT GAT TGC GTG ATG CCC ACC AGA AAC GCC 876

Ser Leu Gly Val Asp Met Phe Asp Cys Val Met Pro Thr Arg Asn Ala
260 265 270

AGA AAC GCC ACC CTT TTC ACG CAT TCT GGC AAA ATT TCT ATC AAA AAC 924
Arg Asn Ala Thr Leu Phe Thr His Ser Gly Lys Ile Ser Ile Lys Asn
275 280 285

GCG CCC TAT AAA TTG GAT AAT ACC CCT ATT GAA GAA AAT TGC GCA TGT 972

Ala Pro Tyr Lys Leu Asp Asn Thr Pro Ile Glu Glu Asn Cys Ala Cys
290 295 300 305

TAT GCT TGC AAA CGC TAT TCT AAA GCC TAT TTG CAC CAT TTA TTT AGG 1020

Tyr Ala Cys Lys Arg Tyr Ser Lys Ala Tyr Leu His His Leu Phe Arg 
310 315 320 
Gln Lys Ala Leu Met His Leu Gln Glu Thr Leu Met Ile Glu Val Asp

290 295 300 

Arg Leu Asp Phe Ser Leu Val Glu Arg Leu Asn Ile Leu Ala Arg Met
305 310 315 320 

Glu Asn Glu Lys His Ala Phe 
325 

(2) INFORMATION FOR SEQ ID NO: 667: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 58...1170
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667: 

ATTCCAAACT TCCTAAAAAA TGATTACGCC ATTATATCCA AGATTAAGGC TTAAACG ATG 60

Met
1

GAT TTT CAA CTC CAA GCG ACT GAC AAT AAC GCG CGA GCT GGT CTT TTA 108

Asp Phe Gln Leu Gln Ala Thr Asp Asn Asn Ala Arg Ala Gly Leu Leu
5 10 15

AAT TTA GCC CAT TCT CAA GTG GCA ACG CCT GTT TTT ATG CCC GTA GGC 156

Asn Leu Ala His Ser Gln Val Ala Thr Pro Val Phe Met Pro Val Gly
20 25 30

ACG CAA GGC TGC ATC AAA TCT TTA GAC GCT ACA GAT GCG CAA GAA ATT 204

Thr Gln Gly Cys Ile Lys Ser Leu Asp Ala Thr Asp Ala Gln Glu Ile
35 40 45

TTA GGC GCT AAA CTC ATT TTA GCC AAC ACC TAT CAC ATG TAT TTA AGG 252

Leu Gly Ala Lys Leu Ile Leu Ala Asn Thr Tyr His Met Tyr Leu Arg
50 55 60 65

CCG GGT GAA AAG GTC GTT GAG GAG TTA GGG GGC TTG CAT CGT TTC GCT 300

Pro Gly Glu Lys Val Val Glu Glu Leu Gly Gly Leu His Arg Phe Ala
70 75 80

CAA TTT TAT GGG AGT TTT TTA ACC GAT AGT GGA GGG TTT CAA GCC TTT 348

Gln Phe Tyr Gly Ser Phe Leu Thr Asp Ser Gly Gly Phe Gln Ala Phe
85 90 95 
GAA AAG CAT GCG TTT TAGTTACATT GAGCCAAGAG CGAAATACCT TATCAGCAAG C 1072
Glu Lys His Ala Phe 
325 

TTTCTAAAAT T 1083

(2) INFORMATION FOR SEQ ID NO: 666:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 327 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666: 



Val Leu Lys Gly Leu Lys Lys Ala Phe Lys Glu Arg Phe Cys Ser Gln
1 5 10 15

Val Tyr Ile Ser Phe Asn Val Asp His Asn Leu Leu Ser Thr Gln Val
20 25 30

Ile Arg Ile Lys Asn Asp Arg Ile Lys Glu Lys Phe Phe Lys Thr Phe
35 40 45

Glu Thr Lys Val Glu Thr Lys Asn Gly Glu Val Pro Ile Gln Ala Leu
50 55 60

Lys Ile Ala Arg Thr Tyr Ser Gln Lys Tyr Pro Tyr Thr Tyr Phe Ser
65 70 75 80

Ala Met Ser Lys Ala Lys Glu Val Leu Cys Glu Lys Gln Ala Phe Glu
85 90 95

Gln Ile Lys Gln Glu Asn Gln Asp Tyr His Ala Cys Glu Val Asn Gln
100 105 110

Lys Tyr Cys Val Tyr Val Glu Ser Lys Asp Phe Leu Lys Asp Phe Lys
115 120 125

Arg Phe Lys Ile Gln Asp Val Asp Phe Leu Phe Ser Pro Phe Ser Leu
130 135 140

Ile Tyr Asp Phe Val Arg Asp Asn Leu Glu Asn Lys Pro Leu Leu Tyr
145 150 155 160

Leu Leu Leu Glu Arg Ser Arg Phe Tyr Phe Leu Ile Ala Asp Lys Lys
165 170 175

Glu Ile Phe Leu Ala Lys Ser Val Phe Leu Glu Glu Gln Pro Glu Glu
180 185 190

Phe Ile Glu Ser Lys Glu Glu Asp Phe Met Gly Met Asp Asn Glu Ala
195 200 205

Val Asp Leu Phe Leu Ser Glu Ile Gln Glu Asp Ile Asp Ser Leu Glu
210 215 220

Glu Ala Ile Gly Leu Asp Ser Ser Lys Asp Asn Ser Glu Lys Ile Thr
225 230 235 240

Glu Asp Ala Tyr Ser Leu Ile Glu Gly Met Thr Asn Ile Pro Leu Ile
245 250 255

Ala Asp Val Leu Gln Glu Gly Leu Arg Gly Val Tyr His Ser Arg Glu
260 265 270

Ile Asp Phe Val Glu Lys Val Val Val Leu Asp Ser Cys Gln Ile His
275 280 285
AAA CAA GAA AAT CAA GAT TAT CAT GCT TGT GAA GTC AAT CAA AAG TAT 392

Lys Gln Glu Asn Gln Asp Tyr His Ala Cys Glu Val Asn Gln Lys Tyr
100 105 110

TGC GTT TAT GTG GAA TCT AAG GAT TTT TTA AAG GAT TTT AAG CGT TTT 440

Cys Val Tyr Val Glu Ser Lys Asp Phe Leu Lys Asp Phe Lys Arg Phe
115 120 125 130

AAA ATC CAG GAT GTG GAT TTT TTG TTT TCG CCT TTT AGC CTT ATT TAT 488

Lys Ile Gln Asp Val Asp Phe Leu Phe Ser Pro Phe Ser Leu Ile Tyr
135 140 145

GAT TTT GTG CGC GAT AAT TTA GAA AAT AAG CCG TTG TTG TAT TTG CTT 536

Asp Phe Val Arg Asp Asn Leu Glu Asn Lys Pro Leu Leu Tyr Leu Leu
150 155 160

TTG GAG CGT TCA AGA TTT TAT TTT TTG ATT GCG GAT AAA AAA GAG ATT 584

Leu Glu Arg Ser Arg Phe Tyr Phe Leu Ile Ala Asp Lys Lys Glu Ile
165 170 175

TTT TTA GCC AAA TCC GTG TTT TTA GAA GAA CAA CCT GAA GAG TTT ATA 632

Phe Leu Ala Lys Ser Val Phe Leu Glu Glu Gln Pro Glu Glu Phe Ile
180 185 190

GAG AGC AAA GAA GAA GAT TTT ATG GGA ATG GAT AAT GAG GCT GTG GAT 680

Glu Ser Lys Glu Glu Asp Phe Met Gly Met Asp Asn Glu Ala Val Asp
195 200 205 210

TTG TTT TTG AGT GAA ATC CAA GAA GAT ATT GAC AGC CTT GAA GAA GCG 728

Leu Phe Leu Ser Glu Ile Gln Glu Asp Ile Asp Ser Leu Glu Glu Ala
215 220 225

ATA GGC CTA GAC AGC AGT AAG GAT AAT AGC GAA AAA ATA ACA GAG GAC 776

Ile Gly Leu Asp Ser Ser Lys Asp Asn Ser Glu Lys Ile Thr Glu Asp
230 235 240

GCT TAT AGT TTG ATT GAA GGC ATG ACG AAT ATC CCC TTA ATT GCA GAT 824
Ala Tyr Ser Leu Ile Glu Gly Met Thr Asn Ile Pro Leu Ile Ala Asp
245 250 255

GTT TTG CAA GAG GGA TTG CGT GGC GTC TAT CAT TCT AGA GAG ATA GAC 872

Val Leu Gln Glu Gly Leu Arg Gly Val Tyr His Ser Arg Glu Ile Asp
260 265 270

TTT GTA GAA AAA GTG GTT GTT TTA GAC AGC TGT CAA ATC CAC CAA AAA 920

Phe Val Glu Lys Val Val Val Leu Asp Ser Cys Gln Ile His Gln Lys
275 280 285 290

GCG TTA ATG CAT TTG CAA GAA ACT TTG ATG ATA GAA GTG GAT AGG CTT 968
Ala Leu Met His Leu Gln Glu Thr Leu Met Ile Glu Val Asp Arg Leu
295 300 305

GAT TTT TCT TTA GTG GAG CGC TTG AAC ATT TTA GCG CGC ATG GAG AAT 1016
Asp Phe Ser Leu Val Glu Arg Leu Asn Ile Leu Ala Arg Met Glu Asn
310 315 320 
Lys Asn Asn Gln Ala Ile Glu Leu Glu Ala Leu Lys Arg Gly Asp Lys

385 390 395 400 

Ile Glu Leu Ser Asn Glu Lys Thr Arg Ala Ser Ala Glu Ile Leu Ser

405 410 415 

Val Asp Arg Val 
420 

(2) INFORMATION FOR SEQ ID NO: 665:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1083 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1031
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665: 

AAAGACGCGC ATGGATTCTG TTGGGTTTGA AGAACCTCAA AGGGGTTTGG GTG CTT 56

Val Leu
1

AAG GGG TTA AAA AAA GCG TTT AAG GAG AGG TTT TGC TCT CAA GTG TAT 104
Lys Gly Leu Lys Lys Ala Phe Lys Glu Arg Phe Cys Ser Gln Val Tyr
5 10 15

ATC TCT TTT AAT GTG GAT CAC AAT CTT TTA TCC ACT CAA GTC ATA AGG 152
Ile Ser Phe Asn Val Asp His Asn Leu Leu Ser Thr Gln Val Ile Arg
20 25 30

ATC AAA AAC GAT CGC ATT AAA GAG AAA TTT TTT AAA ACT TTT GAG ACT 200
Ile Lys Asn Asp Arg Ile Lys Glu Lys Phe Phe Lys Thr Phe Glu Thr
35 40 45 50

AAA GTG GAG ACT AAA AAT GGT GAA GTC CCT ATT CAA GCC TTA AAA ATC 248

Lys Val Glu Thr Lys Asn Gly Glu Val Pro Ile Gln Ala Leu Lys Ile
55 60 65

GCC AGA ACT TAT AGC CAA AAA TAC CCC TAC ACT TAT TTT AGC GCG ATG 296

Ala Arg Thr Tyr Ser Gln Lys Tyr Pro Tyr Thr Tyr Phe Ser Ala Met
70 75 80

AGT AAA GCT AAA GAG GTT TTA TGC GAA AAG CAG GCG TTT GAA CAA ATC 344
Ser Lys Ala Lys Glu Val Leu Cys Glu Lys Gln Ala Phe Glu Gln Ile
85 90 95
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664:



Val Asp Val Leu Ser Val Ser Glu Ile Asn Ala Gln Ile Lys Ala Leu
1 5 10 15

Leu Glu Ala Thr Phe Leu Gln Val Arg Val Gln Gly Glu Val Ser Asn
20 25 30

Leu Thr Ile His Lys Val Ser Gly His Ala Tyr Phe Ser Leu Lys Asp
35 40 45

Ser Gln Ser Val Ile Lys Cys Val Leu Phe Lys Gly Asn Ala Asn Arg
50 55 60

Leu Lys Phe Ala Leu Lys Glu Gly Gln Glu Val Val Val Phe Gly Gly
65 70 75 80

Ile Ser Val Tyr Val Pro Arg Gly Asp Tyr Gln Ile Asn Cys Phe Glu
85 90 95

Ile Glu Pro Lys Asp Ile Gly Ser Leu Thr Leu Ala Leu Glu Gln Leu
100 105 110

Lys Glu Lys Leu Arg Leu Lys Gly Tyr Phe Asp Glu Glu Asn Lys Leu
115 120 125

Pro Lys Pro His Phe Pro Lys Arg Val Ala Val Ile Thr Ser Gln Asn
130 135 140

Ser Ala Ala Trp Ala Asp Met Lys Lys Ile Ala Ser Lys Arg Trp Pro
145 150 155 160

Met Cys Glu Leu Val Cys Ile Asn Thr Leu Met Gln Gly Glu Gly Cys
165 170 175

Val Gln Ser Val Val Glu Ser Ile Val Tyr Ala Asp Ser Phe His Asp
180 185 190

Thr Lys Asn Ala Phe Asp Ala Ile Val Val Ala Arg Gly Gly Gly Ser
195 200 205

Met Glu Asp Leu Tyr Ser Phe Asn Asp Glu Lys Ile Ala Asp Ala Leu
210 215 220

Tyr Leu Ala Lys Thr Phe Ser Met Ser Ala Ile Gly His Glu Ser Asp
225 230 235 240

Phe Leu Leu Ser Asp Leu Val Ala Asp Leu Arg Ala Ser Thr Pro Ser
245 250 255

Asn Ala Met Glu Ile Leu Leu Pro Ser Ser Asp Glu Trp Leu Gln Arg
260 265 270

Leu Asp Gly Phe Asn Val Lys Leu His Arg Ser Phe Lys Thr Leu Leu
275 280 285

His Gln Lys Lys Ala His Leu Glu His Leu Val Ala Ser Leu Lys Arg
290 295 300

Leu Ser Phe Glu Asn Lys His His Leu Asn Ala Leu Lys Leu Glu Lys
305 310 315 320

Leu Lys Ile Ala Leu Glu Asn Lys Thr Leu Glu Phe Leu Arg Phe Lys
325 330 335

Lys Thr Leu Leu Glu Lys Ile Ser Thr Gln Thr Leu Thr Ser Pro Phe
340 345 350

Leu Gln Thr Lys Thr Glu Arg Leu Asn Arg Leu Glu Asn Ala Leu Lys
355 360 365

Leu Ala His Ala Asn Leu Lys Leu Pro Gln Phe Gly Ala Leu Val Ser
370 375 380
Leu Ser Asp Leu Val Ala Asp Leu Arg Ala Ser Thr Pro Ser Asn Ala
245 250 255 

ATG GAA ATT TTA CTC CCC AGC AGC GAT GAA TGG TTG CAA AGA CTT GAT 872

Met Glu Ile Leu Leu Pro Ser Ser Asp Glu Trp Leu Gln Arg Leu Asp
260 265 270

GGG TTT AAT GTG AAA TTG CAC CGC TCG TTT AAA ACT TTG CTC CAC CAA 920

Gly Phe Asn Val Lys Leu His Arg Ser Phe Lys Thr Leu Leu His Gln
275 280 285 290

AAA AAG GCG CAT TTA GAG CAT TTA GTG GCT TCT TTA AAA CGA TTG AGT 968

Lys Lys Ala His Leu Glu His Leu Val Ala Ser Leu Lys Arg Leu Ser
295 300 305

TTT GAA AAC AAG CAC CAT TTA AAC GCT TTA AAA CTA GAA AAA TTA AAA 1016
Phe Glu Asn Lys His His Leu Asn Ala Leu Lys Leu Glu Lys Leu Lys
310 315 320

ATC GCC CTA GAA AAT AAA ACT CTA GAA TTT TTA CGC TTT AAA AAA ACG 1064

Ile Ala Leu Glu Asn Lys Thr Leu Glu Phe Leu Arg Phe Lys Lys Thr
325 330 335

CTT TTA GAA AAA ATC TCT ACT CAA ACA TTA ACA AGC CCT TTT TTA CAA 1112
Leu Leu Glu Lys Ile Ser Thr Gln Thr Leu Thr Ser Pro Phe Leu Gln
340 345 350

ACT AAA ACA GAG CGA TTG AAC AGG CTA GAA AAC GCC CTT AAA CTC GCT 1160

Thr Lys Thr Glu Arg Leu Asn Arg Leu Glu Asn Ala Leu Lys Leu Ala
355 360 365 370

CAT GCT AAT TTG AAA TTA CCC CAA TTC GGG GCG TTG GTG AGC AAA AAT 1208

His Ala Asn Leu Lys Leu Pro Gln Phe Gly Ala Leu Val Ser Lys Asn
375 380 385

AAT CAA GCG ATA GAA TTA GAG GCA TTA AAA AGG GGC GAT AAA ATT GAA 1256

Asn Gln Ala Ile Glu Leu Glu Ala Leu Lys Arg Gly Asp Lys Ile Glu
390 395 400

TTA AGT AAT GAA AAA ACC AGA GCG AGC GCT GAA ATT TTG AGC GTG GAT 1304

Leu Ser Asn Glu Lys Thr Arg Ala Ser Ala Glu Ile Leu Ser Val Asp
405 410 415

AGG GTG TAGGGGTTTG AAAAATAATA TTTAAAACGC TATTTGTTTT ATCTTAAAAT TT 1362
Arg Val
420

C 1363

(2) INFORMATION FOR SEQ ID NO: 664:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 420 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
20 25 30

ATC CAT AAG GTG AGC GGC CAT GCG TAT TTT TCG CTC AAA GAC AGC CAG 200

Ile His Lys Val Ser Gly His Ala Tyr Phe Ser Leu Lys Asp Ser Gln
35 40 45 50

TCG GTT ATT AAA TGC GTG CTG TTT AAA GGG AAC GCT AAC AGG CTC AAA 248

Ser Val Ile Lys Cys Val Leu Phe Lys Gly Asn Ala Asn Arg Leu Lys
55 60 65

TTC GCT TTA AAA GAA GGG CAG GAA GTG GTT GTT TTT GGG GGT ATT AGC 296

Phe Ala Leu Lys Glu Gly Gln Glu Val Val Val Phe Gly Gly Ile Ser
70 75 80

GTG TAT GTC CCA AGG GGG GAT TAT CAA ATC AAT TGC TTT GAA ATA GAG 344

Val Tyr Val Pro Arg Gly Asp Tyr Gln Ile Asn Cys Phe Glu Ile Glu
85 90 95

CCT AAG GAT ATA GGT TCA TTA ACT TTA GCT TTA GAG CAA TTG AAA GAA 392

Pro Lys Asp Ile Gly Ser Leu Thr Leu Ala Leu Glu Gln Leu Lys Glu
100 105 110

AAA TTA CGC CTT AAA GGC TAT TTT GAT GAA GAA AAT AAA TTA CCC AAA 440

Lys Leu Arg Leu Lys Gly Tyr Phe Asp Glu Glu Asn Lys Leu Pro Lys
115 120 125 130

CCG CAT TTT CCT AAA CGA GTG GCA GTC ATC ACT TCT CAA AAT TCA GCC 488

Pro His Phe Pro Lys Arg Val Ala Val Ile Thr Ser Gln Asn Ser Ala
135 140 145

GCT TGG GCG GAC ATG AAA AAG ATC GCT TCC AAA CGA TGG CCG ATG TGT 536

Ala Trp Ala Asp Met Lys Lys Ile Ala Ser Lys Arg Trp Pro Met Cys
150 155 160

GAA TTA GTT TGT ATC AAC ACC TTA ATG CAA GGG GAG GGC TGC GTT CAA 584

Glu Leu Val Cys Ile Asn Thr Leu Met Gln Gly Glu Gly Cys Val Gln
165 170 175

AGC GTG GTG GAA AGC ATC GTT TAT GCG GAT ACT TTT CAT GAC ACA AAA 632

Ser Val Val Glu Ser Ile Val Tyr Ala Asp Ser Phe His Asp Thr Lys
180 185 190

AAC GCT TTT GAT GCG ATT GTA GTG GCT AGG GGT GGG GGG AGC ATG GAG 680
Asn Ala Phe Asp Ala Ile Val Val Ala Arg Gly Gly Gly Ser Met Glu
195 200 205 210

GAT TTG TAT TCT TTC AAT GAT GAA AAA ATC GCT GAT GCT CTG TAT TTG 728

Asp Leu Tyr Ser Phe Asn Asp Glu Lys Ile Ala Asp Ala Leu Tyr Leu
215 220 225

GCC AAA ACC TTC AGC ATG TCA GCT ATT GGG CAT GAG AGC GAT TTT TTA 776

Ala Lys Thr Phe Ser Met Ser Ala Ile Gly His Glu Ser Asp Phe Leu
230 235 240

TTG AGC GAT TTA GTG GCG GAT TTA AGG GCT TCT ACG CCT TCA AAC GCG 824
Thr Tyr Pro Phe Val Thr Ser Ser Asn Thr Thr Ser Ala Ser Ala Cys
225 230 235 240 

Val Ser Thr Gly Leu Asn Pro Lys Ala Ile Asn Glu Val Ile Gly Ile

245 250 255 

Thr Lys Ala Tyr Ser Thr Arg Val Gly Asn Gly Pro Phe Pro Ser Glu 

260 265 270 

Asp Thr Thr Pro Met Gly Asp His Leu Arg Thr Lys Gly Ala Glu Phe 

275 280 285 

Gly Thr Thr Thr Lys Arg Pro Arg Arg Cys Gly Trp Leu Asp Leu Val 

290 295 300 

Ala Leu Lys Tyr Ala Cys Ala Leu Asn Gly Cys Thr Gln Leu Ala Leu
305 310 315 320

Met Lys Leu Asp Val Leu Asp Gly Ile Asp Ala Ile Lys Val Cys Val

325 330 335

Ala Tyr Glu Arg Lys Gly Glu Arg Leu Glu Ile Phe Pro Ser Asp Leu

340 345 350

Lys Asp Cys Val Pro Ile Tyr Gln Thr Phe Lys Gly Trp Glu Lys Ser

355 360 365

Val Gly Val Arg Lys Leu Asp Asp Leu Glu Pro Asn Val Arg Glu Tyr

370 375 380

Ile Arg Phe Ile Glu Lys Glu Val Gly Val Lys Ile Arg Leu Ile Ser
385 390 395 400

Thr Ser Pro Glu Arg Glu Asp Thr Ile Phe Leu
405 410 



(2) INFORMATION FOR SEQ ID NO: 663:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1363 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1310
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663: 



TTTTAGGGCT TTTCAATGAT CTCACTCGTT TGCTATAAAA GGGGGAGCTG GTG GAT 56

Val Asp
1

GTA TTG AGC GTG AGC GAA ATC AAT GCG CAA ATC AAA GCC CTT TTA GAA 104

Val Leu Ser Val Ser Glu Ile Asn Ala Gln Ile Lys Ala Leu Leu Glu
5 10 15

GCG ACT TTT TTG CAA GTT AGG GTT CAA GGG GAA GTG AGT AAT TTG ACT 152

Ala Thr Phe Leu Gln Val Arg Val Gln Gly Glu Val Ser Asn Leu Thr
GAA AAA AGC GTG GGC GTG AGA AAA TTA GAC GAT TTA GAG CCA AAC GTT 1215
Glu Lys Ser Val Gly Val Arg Lys Leu Asp Asp Leu Glu Pro Asn Val 
370 375 380 

AGA GAG TAT ATC CGT TTT ATT GAA AAA GAA GTG GGG GTA AAA ATC CGC 1263

Arg Glu Tyr Ile Arg Phe Ile Glu Lys Glu Val Gly Val Lys Ile Arg
385 390 395

CTT ATT TCT ACA AGC CCT GAA AGA GAA GAC ACG ATT TTT CTA TGAAAAAAT 1314
Leu Ile Ser Thr Ser Pro Glu Arg Glu Asp Thr Ile Phe Leu
400 405 410

TCGCTTCTGT ATTGGTGCAA TTAAAAACCC TTGCGT 1350

(2) INFORMATION FOR SEQ ID NO:662: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662: 

Met Ala Asp Val Val Val Gly Ile Gln Trp Gly Asp Glu Gly Lys Gly

1 5 10 15

Lys Ile Val Asp Arg Ile Ala Lys Asp Tyr Asp Phe Val Val Arg Tyr

20 25 30

Gln Gly Gly His Asn Ala Gly His Thr Ile Val His Lys Gly Val Lys

35 40 45

His Ser Leu His Leu Met Pro Ser Gly Val Leu Tyr Pro Lys Cys Lys

50 55 60

Asn Ile Ile Ser Ser Ala Val Val Val Ser Val Lys Asp Leu Cys Glu
65 70 75 80

Glu Ile Ser Ala Phe Glu Asp Leu Glu Asn Arg Leu Phe Val Ser Asp

85 90 95

Arg Ala His Val Ile Leu Pro Tyr His Ala Lys Lys Asp Ala Phe Lys

100 105 110

Glu Lys Ser Gln Asn Ile Gly Thr Thr Lys Lys Gly Ile Gly Pro Cys

115 120 125

Tyr Glu Asp Lys Met Ala Arg Ser Gly Ile Arg Met Gly Asp Leu Leu

130 135 140

Asp Asp Lys Ile Leu Glu Glu Lys Leu Asn Ala His Phe Lys Ala Ile
145 150 155 160

Glu Pro Phe Lys Lys Ala Tyr Asp Leu Gly Glu Asn Tyr Glu Lys Asp

165 170 175

Leu Met Gly Tyr Phe Lys Thr Tyr Ala Pro Lys Ile Cys Pro Phe Ile

180 185 190

Lys Asp Thr Thr Ser Met Leu Ile Glu Ala Asn Gln Lys Gly Glu Lys

195 200 205

Ile Leu Leu Glu Gly Ala Gln Gly Thr Leu Leu Asp Ile Asp Leu Gly
210 215 220 
GAT TTA TTA GAC GAT AAA ATC TTA GAA GAA AAG CTA AAC GCT CAT TTC 543

Asp Leu Leu Asp Asp Lys Ile Leu Glu Glu Lys Leu Asn Ala His Phe
145 150 155

AAA GCC ATT GAG CCT TTT AAA AAA GCG TAT GAT TTG GGC GAG AAT TAC 591

Lys Ala Ile Glu Pro Phe Lys Lys Ala Tyr Asp Leu Gly Glu Asn Tyr
160 165 170

GAA AAA GAT TTG ATG GGG TAT TTT AAA ACT TAC GCT CCA AAA ATT TGC 639

Glu Lys Asp Leu Met Gly Tyr Phe Lys Thr Tyr Ala Pro Lys Ile Cys
175 180 185

CCC TTT ATC AAA GAC ACG ACA AGC ATG CTG ATA GAA GCG AAT CAA AAG 687
Pro Phe Ile Lys Asp Thr Thr Ser Met Leu Ile Glu Ala Asn Gln Lys
190 195 200 205

GGT GAA AAA ATC CTA TTA GAA GGG GCA CAA GGC ACG CTT TTA GAC ATT 735

Gly Glu Lys Ile Leu Leu Glu Gly Ala Gln Gly Thr Leu Leu Asp Ile
210 215 220

GAT TTA GGG ACT TAC CCT TTT GTA ACA AGC TCT AAC ACC ACG AGC GCT 783

Asp Leu Gly Thr Tyr Pro Phe Val Thr Ser Ser Asn Thr Thr Ser Ala
225 230 235

AGC GCA TGC GTG AGC ACC GGC TTA AAC CCT AAA GCG ATC AAT GAA GTC 831
Ser Ala Cys Val Ser Thr Gly Leu Asn Pro Lys Ala Ile Asn Glu Val
240 245 250

ATA GGT ATC ACA AAA GCC TAC TCC ACT CGT GTG GGT AAT GGG CCT TTC 879
Ile Gly Ile Thr Lys Ala Tyr Ser Thr Arg Val Gly Asn Gly Pro Phe
255 260 265

CCT AGC GAA GAC ACT ACA CCC ATG GGC GAT CAT TTA AGG ACT AAG GGT 927

Pro Ser Glu Asp Thr Thr Pro Met Gly Asp His Leu Arg Thr Lys Gly
270 275 280 285

GCG GAG TTT GGC ACG ACA ACC AAG CGC CCA AGG CGT TGC GGG TGG CTG 975

Ala Glu Phe Gly Thr Thr Thr Lys Arg Pro Arg Arg Cys Gly Trp Leu
290 295 300

GAT TTG GTG GCT TTG AAA TAC GCT TGC GCT TTG AAT GGT TGC ACG CAA 1023

Asp Leu Val Ala Leu Lys Tyr Ala Cys Ala Leu Asn Gly Cys Thr Gln
305 310 315

TTA GCC TTA ATG AAA TTA GAC GTT TTA GAC GGG ATT GAT GCG ATT AAG 1071

Leu Ala Leu Met Lys Leu Asp Val Leu Asp Gly Ile Asp Ala Ile Lys
320 325 330

GTG TGC GTG GCT TAT GAA AGA AAG GGC GAA AGA TTG GAG ATT TTC CCT 1119
Val Cys Val Ala Tyr Glu Arg Lys Gly Glu Arg Leu Glu Ile Phe Pro
335 340 345

AGC GAT TTG AAA GAT TGC GTG CCG ATC TAT CAA ACT TTT AAA GGT TGG 1167
Ser Asp Leu Lys Asp Cys Val Pro Ile Tyr Gln Thr Phe Lys Gly Trp
350 355 360 365 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 73...1305
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661: 

TTTTACTTTT TTTTTGGTAT TCTAACAAGC TTTTAAATAA TCCAATCTAC TTTGTTTTAA 60

GGATAATATT TT ATG GCA GAT GTC GTT GTG GGG ATC CAG TGG GGA GAT GAG 111
Met Ala Asp Val Val Val Gly Ile Gln Trp Gly Asp Glu
1 5 10

GGG AAG GGA AAA ATT GTT GAT AGG ATC GCT AAA GAT TAT GAC TTT GTG 159

Gly Lys Gly Lys Ile Val Asp Arg Ile Ala Lys Asp Tyr Asp Phe Val
15 20 25

GTG CGC TAT CAG GGC GGG CAT AAT GCT GGG CAT ACC ATT GTG CAT AAG 207

Val Arg Tyr Gln Gly Gly His Asn Ala Gly His Thr Ile Val His Lys
30 35 40 45

GGG GTT AAG CAT TCT TTG CAT TTA ATG CCT TCA GGG GTT TTA TAC CCC 255
Gly Val Lys His Ser Leu His Leu Met Pro Ser Gly Val Leu Tyr Pro
50 55 60

AAA TGC AAG AAC ATC ATT TCT AGC GCG GTG GTC GTG AGC GTT AAG GAT 303

Lys Cys Lys Asn Ile Ile Ser Ser Ala Val Val Val Ser Val Lys Asp
65 70 75

TTG TGC GAA GAA ATC AGC GCG TTT GAG GAT TTA GAA AAT CGT TTG TTT 351

Leu Cys Glu Glu Ile Ser Ala Phe Glu Asp Leu Glu Asn Arg Leu Phe
80 85 90

GTC AGC GAC AGA GCC CAT GTG ATC TTG CCC TAT CAT GCC AAA AAA GAC 399

Val Ser Asp Arg Ala His Val Ile Leu Pro Tyr His Ala Lys Lys Asp
95 100 105

GCT TTT AAA GAA AAA TCT CAA AAC ATC GGC ACG ACT AAA AAA GGC ATA 447

Ala Phe Lys Glu Lys Ser Gln Asn Ile Gly Thr Thr Lys Lys Gly Ile
110 115 120 125

GGC CCT TGC TAT GAG GAT AAA ATG GCC AGG AGC GGG ATA AGA ATG GGG 495

Gly Pro Cys Tyr Glu Asp Lys Met Ala Arg Ser Gly Ile Arg Met Gly
130 135 140 
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51... 212 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659: 

TAAGCGCTTT GCGTGTTTAA AAATTCCTTA TCCAAGCTCT CAGCGTCTTC ATG AAA 56
Met Lys
1

GCT GTA ACC ATC TTT CAT GAT AAA TTC CCT CGC TCT CAC CAG CCC AAA 104
Ala Val Thr Ile Phe His Asp Lys Phe Pro Arg Ser His Gln Pro Lys
5 10 15

TCT TGG GCG GAT TTC ATC ACG GAA TTT CGT GTG GAT TTG ATA GAG ATG 152
Ser Trp Ala Asp Phe Ile Thr Glu Phe Arg Val Asp Leu Ile Glu Met
20 25 30

GAC GGG CAA TTG CTT GTA ACT TTT AAT GAA ATT AGC GGC AAT TTC GGT 200
Asp Gly Gln Leu Leu Val Thr Phe Asn Glu Ile Ser Gly Asn Phe Gly
35 40 45 50

GAT ATT TTC TTC TAAAGTGGGG CTTAAAACAA AATCATTGTC TTTTCGGTCT TTAAA 257
Asp Ile Phe Phe

AACCAATA 265

(2) INFORMATION FOR SEQ ID NO: 660:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660: 

Met Lys Ala Val Thr Ile Phe His Asp Lys Phe Pro Arg Ser His Gln
1 5 10 15

Pro Lys Ser Trp Ala Asp Phe Ile Thr Glu Phe Arg Val Asp Leu Ile
20 25 30

Glu Met Asp Gly Gln Leu Leu Val Thr Phe Asn Glu Ile Ser Gly Asn
35 40 45

Phe Gly Asp Ile Phe Phe
50
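
As a consistency check on the listing above, the following minimal Python sketch translates the coding sequence annotated for SEQ ID NO: 659 (LOCATION 51...212) and compares it with the 54-residue protein of SEQ ID NO: 660. The helper name `translate_cds`, the one-letter output, and the use of the standard genetic code table are assumptions made for illustration only; they are not part of the original listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the listing itself does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# SEQ ID NO: 659 as printed in the listing (265 bases); spaces are layout only.
SEQ_659 = (
    "TAAGCGCTTT GCGTGTTTAA AAATTCCTTA TCCAAGCTCT CAGCGTCTTC ATG AAA "
    "GCT GTA ACC ATC TTT CAT GAT AAA TTC CCT CGC TCT CAC CAG CCC AAA "
    "TCT TGG GCG GAT TTC ATC ACG GAA TTT CGT GTG GAT TTG ATA GAG ATG "
    "GAC GGG CAA TTG CTT GTA ACT TTT AAT GAA ATT AGC GGC AAT TTC GGT "
    "GAT ATT TTC TTC TAAAGTGGGG CTTAAAACAA AATCATTGTC TTTTCGGTCT TTAAA "
    "AACCAATA"
).replace(" ", "")


def translate_cds(dna: str, start: int, end: int) -> str:
    """Translate the 1-based, inclusive range start..end of dna into one-letter residues."""
    cds = dna[start - 1:end]
    if len(cds) % 3 != 0:
        raise ValueError("coding range is not a whole number of codons")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    protein = translate_cds(SEQ_659, 51, 212)
    print(len(SEQ_659), len(protein))
    print(protein)
```

The same check can be repeated for any other nucleic acid entry by substituting its sequence and LOCATION range; a mismatch against the printed translation usually points to a scanning error in one of the two lines.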

(2) INFORMATION FOR SEQ ID NO: 661: 
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lie Leu Glu Lys Ala Gin Leu Phe 
190 

TTTGGTGGTG GTTAAGCTTG AACGCT 1020

(2) INFORMATION FOR SEQ ID NO: 658:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658:

Met Ser Leu Pro Lys Gly Ala Lys Val Gly Thr Thr Ser Leu Arg Arg
1 5 10 15

Ser Met Gln Ile Lys Leu Lys Arg Gln Asp Leu Asp Thr Glu Ser Leu
20 25 30

Arg Gly Asn Val Gln Thr Arg Leu Lys Lys Leu Glu Cys Gly Glu Phe
35 40 45

Asp Ala Ile Ile Leu Ala Glu Ala Gly Leu Cys Arg Leu Glu Ile Gln
50 55 60

Gly Ala Lys Tyr Arg Lys Ala Phe Ser Val Glu Glu Met Ile Pro Ser
65 70 75 80

Met Gly Gln Gly Ala Leu Gly Val Glu Met Leu Lys Asn His Lys His
85 90 95

Phe Ala Thr Leu Gln Lys Leu Asn Asp Glu Lys Ser Ala Phe Cys Cys
100 105 110

Arg Leu Glu Arg Glu Phe Ile Lys Gly Leu Asn Gly Gly Cys Gln Ile
115 120 125

Pro Ile Gly Val His Ala Ser Leu Met Gly Asp Arg Val Lys Ile Gln
130 135 140

Ala Val Leu Gly Leu Pro Asn Gly Lys Glu Val Ile Thr Lys Glu Lys
145 150 155 160

Gln Gly Asp Lys Thr Lys Ala Phe Asp Leu Val Gln Glu Leu Leu Glu
165 170 175

Glu Phe Leu Gln Ser Gly Ala Lys Glu Ile Leu Glu Lys Ala Gln Leu
180 185 190

Phe



(2) INFORMATION FOR SEQ ID NO: 659:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
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GAAGAT^AAAA TTTTAGAAAT GTTAGAAAGC GAATAAGGGG GAGTGAGTGG GAAATTTAGT 6 0 

GATTGGCTCT AGGGGGAGCG AATTAGCCTT ATGGCAAGCG AATCACATTA AAGAACGCCT 120

GAAAAAAGAA TGCTTGATAG AAAGCGAAAT TCAAATCGTT AAGACTAAGG GCGATAAAAT 180

CTTAGACACC CCTTTAAATA AGATCGGCGG TAAGGGGCTA TTCACTAAGG AATTAGAAGA 240

ATTGCTTTTA AAGGGCGCAA TTGATTTGGC GGTGCATTCT TTAAAAGACG TGCCGGTCGT 300

GTTTGAAAAG GGGTTAGACT TGGCATGCAT CACCAAAAGG GCTGATGTGA GGGACACTTT 360

TTTAAGCGTG AAATTCCCTG ATTTG ATG AGT TTG CCT AAA GGG GCA AAG GTT 412
Met Ser Leu Pro Lys Gly Ala Lys Val
1 5

GGC ACG ACT TCT TTA AGG CGC TCT ATG CAG ATC AAA TTA AAG CGC CAG 460
Gly Thr Thr Ser Leu Arg Arg Ser Met Gln Ile Lys Leu Lys Arg Gln
10 15 20 25

GAT TTG GAC ACA GAA AGC TTG AGA GGG AAT GTC CAA ACC CGT TTG AAA 508
Asp Leu Asp Thr Glu Ser Leu Arg Gly Asn Val Gln Thr Arg Leu Lys
30 35 40

AAG CTT GAA TGC GGA GAA TTT GAC GCT ATC ATT TTA GCT GAA GCC GGG 556
Lys Leu Glu Cys Gly Glu Phe Asp Ala Ile Ile Leu Ala Glu Ala Gly
45 50 55

TTG TGC CGC CTA GAA ATT CAA GGA GCG AAA TAC CGC AAG GCT TTT AGC 604
Leu Cys Arg Leu Glu Ile Gln Gly Ala Lys Tyr Arg Lys Ala Phe Ser
60 65 70

GTA GAA GAA ATG ATT CCT AGC ATG GGT CAG GGG GCT TTA GGG GTA GAA 652
Val Glu Glu Met Ile Pro Ser Met Gly Gln Gly Ala Leu Gly Val Glu
75 80 85

ATG CTC AAA AAC CAC AAG CAT TTT GCC ACG CTT CAA AAA CTC AAC GAC 700
Met Leu Lys Asn His Lys His Phe Ala Thr Leu Gln Lys Leu Asn Asp
90 95 100 105

GAG AAA AGC GCG TTT TGC TGC CGT TTA GAA AGG GAG TTT ATC AAG GGG 748
Glu Lys Ser Ala Phe Cys Cys Arg Leu Glu Arg Glu Phe Ile Lys Gly
110 115 120

CTT AAT GGA GGG TGT CAG ATC CCT ATA GGC GTG CAT GCG AGT TTA ATG 796
Leu Asn Gly Gly Cys Gln Ile Pro Ile Gly Val His Ala Ser Leu Met
125 130 135

GGC GAT AGG GTT AAA ATC CAG GCG GTT TTA GGC TTG CCT AAC GGG AAA 844
Gly Asp Arg Val Lys Ile Gln Ala Val Leu Gly Leu Pro Asn Gly Lys
140 145 150

GAA GTC ATT ACT AAA GAA AAA CAA GGG GAT AAA ACT AAA GCG TTT GAT 892
Glu Val Ile Thr Lys Glu Lys Gln Gly Asp Lys Thr Lys Ala Phe Asp
155 160 165

TTA GTT CAA GAG CTT TTA GAA GAA TTT TTG CAA AGC GGG GCT AAA GAG 940
Leu Val Gln Glu Leu Leu Glu Glu Phe Leu Gln Ser Gly Ala Lys Glu
170 175 180 185

ATT TTA GAA AAG GCG CAG TTG TTT TAATGCGTTT GTTTATCGCG CTAGTTTTGT 994
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Val Asn Ala Leu 
Gly Ile Leu Leu Leu Met Glu Gln Phe Lys Phe Leu
130 135 140

Gln Asn Gln Asn Leu Gly Val Phe Val Leu Leu Ala Ile Gly Ile Leu
145 150 155 160

Ile Ile Tyr Leu Phe Pro Leu Ile Thr Lys Lys Ile Pro Ser Asn Leu
165 170 175

Ile Cys Ile Leu Ile Val Ser Ala Ile Ala Leu Ile Phe Asp Met His
180 185 190

Ala Pro Asn Leu Gly Ser Ile Glu Gln Gly Val Ser Gly Phe His Phe
195 200 205

Ile Ile Ile Pro Lys Asn Leu Asp Phe Lys Ile Met Ile Glu Leu Leu
210 215 220

Pro Tyr Ala Leu Ser Leu Ala Leu Val Gly Thr Ile Glu Ser Leu Leu
225 230 235 240

Thr Ala Lys Thr Leu Asp Val Ile Leu Lys Asp Gly Val Ser Asp Lys
245 250 255

Asn Lys Glu Thr Lys Ala Gln Gly Leu Gly Asn Ile Ile Ser Gly Leu
260 265 270

Leu Gly Gly Met Thr Gly Cys Ala Leu Val Gly Gln Ser Ile Ile Asn
275 280 285

Ala Lys Ser Gly Ala Lys Thr Arg Leu Ser Thr Phe Phe Ala Gly Phe
290 295 300

Ser Leu Met Val Leu Ile Leu Val Phe Asn Glu Tyr Val Val Lys Ile
305 310 315 320

Pro Ile Val Ala Val Val Ala Val Met Val Met Ile Ser Phe Thr Thr
325 330 335

Phe Asn Phe Gln Ser Ile Ile Asn Ile Lys Lys Ile Lys Leu Tyr Asp
340 345 350

Thr Leu Asn Met Leu Leu Val Val Ala Val Val Leu Tyr Thr His Asn
355 360 365

Leu Ala Ile Gly Val Val Val Gly Val Leu Val Asn Ala Leu Trp Ile
370 375 380

Lys Ser Lys Gly Ile Ala
385 390



(2) INFORMATION FOR SEQ ID NO:657: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 386... 964 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657:
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ATG GTG CTC ATA TTA GTG TTT AAT GAA TAT GTG GTT AAG ATC CCC ATT 1016 
Met Val Leu Ile Leu Val Phe Asn Glu Tyr Val Val Lys Ile Pro Ile
310 315 320

GTG GCG GTT GTG GCG GTA ATG GTG ATG ATT TCT TTC ACC ACT TTT AAT 1064
Val Ala Val Val Ala Val Met Val Met Ile Ser Phe Thr Thr Phe Asn
325 330 335

TTC CAA TCC ATT ATT AAC ATT AAA AAA ATC AAG CTC TAT GAC ACG CTC 1112
Phe Gln Ser Ile Ile Asn Ile Lys Lys Ile Lys Leu Tyr Asp Thr Leu
340 345 350

AAC ATG CTC TTA GTC GTG GCG GTG GTT TTA TAC ACG CAT AAT TTA GCG 1160
Asn Met Leu Leu Val Val Ala Val Val Leu Tyr Thr His Asn Leu Ala
355 360 365 370

ATA GGG GTT GTG GTG GGG GTT TTA GTC AAT GCG TTA TGG ATC AAA TCT 1208
Ile Gly Val Val Val Gly Val Leu Val Asn Ala Leu Trp Ile Lys Ser
375 380 385

AAA GGG ATT GCA TGAAATTTTA TTTTAAAAAG TTGGGTAGCT AGAGATATGG CTCCA 1265
Lys Gly Ile Ala
390

GATGTAGG 1273

(2) INFORMATION FOR SEQ ID NO: 656:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656: 

Met Arg Lys Lys Gly Met Phe Glu Lys Ile Gln Lys Glu Trp Leu Ser
1 5 10 15

Asn Ile Gln Lys Asp Leu Leu Ser Gly Phe Val Val Gly Leu Ser Val
20 25 30

Ile Pro Glu Thr Ala Gly Phe Ala Ile Met Val Gly Leu Asp Val Gly
35 40 45

Val Ala Phe Tyr Thr Thr Phe Tyr Met Ala Phe Val Leu Ser Leu Phe
50 55 60

Gly Ala Arg Lys Ala Met Ile Ser Ala Ala Ala Gly Ser Val Ala Leu
65 70 75 80

Ile Leu Val Gly Val Val Lys Asn Tyr Gly Leu Glu Tyr Ala Gly Val
85 90 95

Ala Thr Leu Met Ala Gly Val Leu Gln Ile Leu Leu Gly Tyr Leu Lys
100 105 110

Ile Gly Asn Leu Leu Arg Phe Ile Pro Gln Ser Val Met Tyr Gly Phe
115 120 125
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GTG GGC GTG GTT AAA AAC TAT GGG CTT GAA TAG GOG GGC GTG GCG ACT 3 44 

Val Gly Val Val Lys Asn Tyr Gly Leu Glu Tyr Ala Gly Val Ala Thr 
85 90 95 

CTT ATG GCA GGG GTG TTG CAA ATT CTT TTA GGC TAT TTG AAA ATA GGG 392
Leu Met Ala Gly Val Leu Gln Ile Leu Leu Gly Tyr Leu Lys Ile Gly
100 105 110

AAT CTT TTG AGG TTT ATC CCC CAA TCA GTG ATG TAT GGC TTT GTG AAC 440
Asn Leu Leu Arg Phe Ile Pro Gln Ser Val Met Tyr Gly Phe Val Asn
115 120 125 130

GCG CTA GGC ATT TTG CTT TTA ATG GAG CAA TTC AAA TTC CTT CAA AAC 488
Ala Leu Gly Ile Leu Leu Leu Met Glu Gln Phe Lys Phe Leu Gln Asn
135 140 145

CAA AAT TTG GGG GTG TTT GTC TTG CTC GCT ATT GGG ATA CTC ATC ATT 536
Gln Asn Leu Gly Val Phe Val Leu Leu Ala Ile Gly Ile Leu Ile Ile
150 155 160

TAT CTT TTT CCT CTA ATC ACT AAA AAA ATC CCC TCT AAT CTG ATT TGT 584
Tyr Leu Phe Pro Leu Ile Thr Lys Lys Ile Pro Ser Asn Leu Ile Cys
165 170 175

ATC CTT ATA GTG AGC GCG ATC GCT TTA ATT TTT GAT ATG CAT GCG CCG 632
Ile Leu Ile Val Ser Ala Ile Ala Leu Ile Phe Asp Met His Ala Pro
180 185 190

AAT TTG GGG AGC ATT GAG CAA GGG GTT TCA GGC TTT CAT TTC ATC ATT 680
Asn Leu Gly Ser Ile Glu Gln Gly Val Ser Gly Phe His Phe Ile Ile
195 200 205 210

ATC CCC AAA AAT TTG GAT TTT AAA ATA ATG ATA GAG TTG TTG CCT TAC 728
Ile Pro Lys Asn Leu Asp Phe Lys Ile Met Ile Glu Leu Leu Pro Tyr
215 220 225

GCT CTT TCT TTA GCA CTA GTG GGA ACG ATA GAA AGC TTA TTG ACG GCT 776
Ala Leu Ser Leu Ala Leu Val Gly Thr Ile Glu Ser Leu Leu Thr Ala
230 235 240

AAA ACT TTA GAT GTG ATT TTA AAA GAC GGC GTG AGC GAT AAA AAT AAA 824
Lys Thr Leu Asp Val Ile Leu Lys Asp Gly Val Ser Asp Lys Asn Lys
245 250 255

GAA ACT AAA GCG CAA GGC TTG GGG AAT ATC ATC TCA GGG CTT TTG GGG 872
Glu Thr Lys Ala Gln Gly Leu Gly Asn Ile Ile Ser Gly Leu Leu Gly
260 265 270

GGA ATG ACA GGG TGC GCT TTA GTG GGG CAG TCT ATC ATT AAC GCA AAA 920
Gly Met Thr Gly Cys Ala Leu Val Gly Gln Ser Ile Ile Asn Ala Lys
275 280 285 290

TCC GGG GCT AAA ACA AGG CTT TCT ACT TTT TTT GCC GGC TTT TCT TTA 968
Ser Gly Ala Lys Thr Arg Leu Ser Thr Phe Phe Ala Gly Phe Ser Leu
295 300 305
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Met Leu Trp Asp Leu Asn Arg Ser Asn lie Ala Ala Ser Thr Gly Ser 

290 295 300 

Ala Cys Ala Ser Glu Asp Leu Glu Ala Asn Pro Val Met Val Ala Ile
305 310 315 320

Gly Ala Ser Lys Glu Leu Ala His Thr Ala Ile Arg Leu Ser Leu Ser
325 330 335

Arg Phe Asn Thr Glu Ala Glu Ile Asp Lys Thr Ile Glu Val Phe Ser
340 345 350

Gln Ala Ala Val Arg Leu Arg Asn Ile Ser Ser Ser Tyr
355 360 365

(2) INFORMATION FOR SEQ ID NO: 655: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1273 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1220
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655: 

TCCGGTTTGA AGTTGGTTTT TAAATCTTTG GCTATAATCA AGCCATTCTA ATG AGA 56
Met Arg
1

AAG AAA GGC ATG TTT GAA AAG ATA CAA AAA GAA TGG CTG AGC AAC ATT 104
Lys Lys Gly Met Phe Glu Lys Ile Gln Lys Glu Trp Leu Ser Asn Ile
5 10 15

CAA AAG GAT TTG TTG TCT GGT TTT GTG GTG GGG CTT TCT GTG ATC CCA 152
Gln Lys Asp Leu Leu Ser Gly Phe Val Val Gly Leu Ser Val Ile Pro
20 25 30

GAG ACG GCC GGC TTT GCG ATC ATG GTG GGT TTA GAT GTG GGC GTG GCG 200
Glu Thr Ala Gly Phe Ala Ile Met Val Gly Leu Asp Val Gly Val Ala
35 40 45 50

TTT TAT ACG ACC TTT TAC ATG GCT TTT GTT TTG TCT CTT TTT GGG GCT 248
Phe Tyr Thr Thr Phe Tyr Met Ala Phe Val Leu Ser Leu Phe Gly Ala
55 60 65

AGA AAG GCG ATG ATT AGC GCA GCG GCC GGC TCA GTG GCG CTC ATT TTA 296
Arg Lys Ala Met Ile Ser Ala Ala Ala Gly Ser Val Ala Leu Ile Leu
70 75 80
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GCT GTA AGG TTG AGA AAT ATT TCA AGC TCT TAT TAAAAAGAAT ATAAAGGAAT 116 5 
Ala Val Arg Leu Arg Asn Ile Ser Ser Ser Tyr



(2) INFORMATION FOR SEQ ID NO:654: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654: 



Met Asp Pro Phe Leu Arg Asp His Tyr Gly Asn Pro Ser Ser Leu His
1 5 10 15

Gln Phe Gly Thr Glu Thr His Pro Ala Ile Ala Glu Ala Leu Asp Lys
20 25 30

Leu Tyr Lys Gly Ile Asn Ala Arg Asp Ile Asp Asp Val Ile Ile Thr
35 40 45

Ser Cys Ala Thr Glu Ser Asn Asn Trp Val Leu Lys Gly Val Tyr Phe
50 55 60

Asp Glu Cys Leu Lys Lys Gly Lys Asn His Ile Val Thr Thr Val Ala
65 70 75 80

Glu His Pro Ala Val Arg Ser Thr Cys Asn Phe Leu Glu Ser Leu Gly
85 90 95

Val Glu Val Thr Tyr Leu Pro Ile Asn Glu His Gly Ser Ile Thr Ala
100 105 110

Glu Gln Val Lys Glu Ala Ile Thr Glu Lys Thr Ala Leu Val Ser Val
115 120 125

Met Trp Ala Asn Asn Glu Thr Gly Leu Ile Phe Pro Ile Glu Glu Ile
130 135 140

Gly Ala Ile Cys Lys Glu Lys Gly Val Leu Phe His Thr Asp Ala Val
145 150 155 160

Gln Ala Ile Gly Lys Ile Pro Val Asp Val Leu Lys Ala Asn Ala Asp
165 170 175

Phe Leu Ser Phe Ser Ala His Lys Phe His Gly Pro Lys Gly Ile Gly
180 185 190

Gly Leu Tyr Ile Arg Ser Gly Val Gly Leu Thr Pro Leu Phe His Gly
195 200 205

Gly Glu His Met Asn Gly Arg Arg Ser Gly Thr Leu Asn Val Pro Tyr
210 215 220

Ile Val Gly Met Gly Glu Ala Met Lys Leu Ala Val Glu His Leu Asp
225 230 235 240

Tyr Glu Lys Glu Val Val Gly Lys Leu Arg Asp Lys Leu Glu Glu Ala
245 250 255

Leu Leu Lys Ile Pro Asp Val Met Val Val Gly Asp Arg Ile His Arg
260 265 270

Val Pro Asn Thr Thr Leu Val Ser Val Arg Gly Ile Glu Gly Glu Ala
275 280 285

355 360 365

CAAAATGGCA AAACATGATT TAGTGGGTTC GGT 1198
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GCG AAT AAT GAA ACC GGT CTC ATT TTC CCT ATT GAA GAA ATT GGG GCT 4 88 

Ala Asn Asn Glu Thr Gly Leu Ile Phe Pro Ile Glu Glu Ile Gly Ala
135 140 145

ATT TGT AAA GAA AAG GGC GTG TTG TTC CAT ACC GAT GCC GTG CAA GCG 536
Ile Cys Lys Glu Lys Gly Val Leu Phe His Thr Asp Ala Val Gln Ala
150 155 160

ATT GGT AAA ATC CCT GTA GAT GTG TTA AAA GCG AAT GCA GAT TTC CTT 584
Ile Gly Lys Ile Pro Val Asp Val Leu Lys Ala Asn Ala Asp Phe Leu
165 170 175

TCT TTT AGC GCG CAC AAG TTT CAT GGG CCT AAA GGC ATT GGG GGG TTG 632
Ser Phe Ser Ala His Lys Phe His Gly Pro Lys Gly Ile Gly Gly Leu
180 185 190

TAT ATT AGA AGT GGG GTG GGA TTG ACC CCT CTT TTT CAT GGC GGG GAG 680
Tyr Ile Arg Ser Gly Val Gly Leu Thr Pro Leu Phe His Gly Gly Glu
195 200 205 210

CAT ATG AAT GGC AGG CGC AGC GGG ACT TTG AAT GTG CCT TAT ATT GTG 728
His Met Asn Gly Arg Arg Ser Gly Thr Leu Asn Val Pro Tyr Ile Val
215 220 225

GGA ATG GGC GAA GCG ATG AAA TTA GCC GTA GAG CAT TTA GAC TAT GAA 776
Gly Met Gly Glu Ala Met Lys Leu Ala Val Glu His Leu Asp Tyr Glu
230 235 240

AAA GAA GTG GTG GGG AAA TTG CGC GAC AAA TTA GAA GAA GCG CTT TTG 824
Lys Glu Val Val Gly Lys Leu Arg Asp Lys Leu Glu Glu Ala Leu Leu
245 250 255

AAA ATC CCT GAT GTG ATG GTG GTG GGC GAT AGA ATC CAT CGT GTG CCT 872
Lys Ile Pro Asp Val Met Val Val Gly Asp Arg Ile His Arg Val Pro
260 265 270

AAC ACG ACT TTA GTC AGC GTG AGA GGG ATT GAA GGA GAG GCC ATG CTG 920
Asn Thr Thr Leu Val Ser Val Arg Gly Ile Glu Gly Glu Ala Met Leu
275 280 285 290

TGG GAT TTA AAC CGC TCT AAT ATC GCC GCT TCC ACA GGG AGC GCG TGC 968
Trp Asp Leu Asn Arg Ser Asn Ile Ala Ala Ser Thr Gly Ser Ala Cys
295 300 305

GCG AGT GAG GAT TTA GAG GCT AAT CCG GTG ATG GTA GCG ATT GGA GCG 1016
Ala Ser Glu Asp Leu Glu Ala Asn Pro Val Met Val Ala Ile Gly Ala
310 315 320

AGT AAG GAA TTG GCT CAT ACC GCT ATC AGG CTT TCA TTG AGC CGT TTT 1064
Ser Lys Glu Leu Ala His Thr Ala Ile Arg Leu Ser Leu Ser Arg Phe
325 330 335

AAC ACG GAA GCT GAA ATT GAC AAA ACG ATT GAA GTT TTC TCT CAA GCG 1112
Asn Thr Glu Ala Glu Ile Asp Lys Thr Ile Glu Val Phe Ser Gln Ala
340 345 350
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(2) INFORMATION FOR SEQ ID NO: 653: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...1145
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:653: 
ATTTAGACAA TAACGCTACA ACTAGGATTG ACCCTAAAGT CAAAGAGATC ATG GAT 56
Met Asp
1

CCT TTT TTA AGG GAT CAT TAT GGG AAT CCT AGC TCG TTG CAC CAG TTT 104
Pro Phe Leu Arg Asp His Tyr Gly Asn Pro Ser Ser Leu His Gln Phe
5 10 15

GGC ACA GAA ACC CAC CCA GCC ATT GCA GAA GCG CTA GAT AAG CTT TAT 152
Gly Thr Glu Thr His Pro Ala Ile Ala Glu Ala Leu Asp Lys Leu Tyr
20 25 30

AAG GGC ATT AAC GCT AGG GAT ATA GAT GAT GTG ATC ATC ACT TCT TGT 200
Lys Gly Ile Asn Ala Arg Asp Ile Asp Asp Val Ile Ile Thr Ser Cys
35 40 45 50

GCG ACA GAA AGC AAT AAT TGG GTT TTA AAG GGC GTG TAT TTT GAT GAA 248
Ala Thr Glu Ser Asn Asn Trp Val Leu Lys Gly Val Tyr Phe Asp Glu
55 60 65

TGC TTG AAA AAA GGC AAA AAC CAT ATT GTA ACC ACG GTT GCA GAG CAT 296
Cys Leu Lys Lys Gly Lys Asn His Ile Val Thr Thr Val Ala Glu His
70 75 80

CCG GCG GTG CGA TCC ACT TGC AAT TTT TTA GAA AGC TTG GGG GTG GAG 344
Pro Ala Val Arg Ser Thr Cys Asn Phe Leu Glu Ser Leu Gly Val Glu
85 90 95

GTT ACT TAC TTG CCC ATT AAT GAG CAT GGG AGC ATC ACC GCA GAG CAA 392
Val Thr Tyr Leu Pro Ile Asn Glu His Gly Ser Ile Thr Ala Glu Gln
100 105 110

GTC AAA GAA GCG ATC ACA GAA AAA ACC GCT CTA GTG AGC GTG ATG TGG 440
Val Lys Glu Ala Ile Thr Glu Lys Thr Ala Leu Val Ser Val Met Trp
115 120 125 130
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CAA AAA GTG AAG CGA CTG ATT AAA AGO GTT TTT TAAAACCCTT TTTAAACTAA 12 3 0 
Gln Lys Val Lys Arg Leu Ile Lys Arg Val Phe
275 280

TGCGAGCAAG CATGGGTTTG TGTGGGCTGG ATGTCCTTAT T 1271

(2) INFORMATION FOR SEQ ID NO: 652: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652: 

Met Ile Met Cys Arg Phe Phe Phe Ala Ser Leu Phe Pro Gln Tyr Asp
1 5 10 15

Lys Met Ile Met Phe Asp Val Asp Thr Leu Phe Val Asn Asp Ile Ser
20 25 30

Glu Ser Phe Phe Ile Pro Leu Glu Thr His Tyr Phe Gly Ala Val Arg
35 40 45

Glu Lys Asp Leu Ile Ala Ile Asn Arg Asn Ser Ala Lys Asp Leu Tyr
50 55 60

Glu Leu Arg Gln Met His Ala Lys Ser Ile Gly Ile Ala Asn Ala Phe
65 70 75 80

Pro Asn Leu Glu Glu Ala Gln Ile Leu Phe Asp Asn Tyr Phe Asn Ala
85 90 95

Gly Phe Leu Ala Leu Asn Leu Lys Ser Trp Arg Lys Glu Asn Leu Glu
100 105 110

Asn Gln Leu Ile Thr Phe Phe Ile Leu Lys Asn Glu Lys Leu Leu Phe
115 120 125

Asn Asp Gln Asp Ala Leu Cys Phe Val Cys Arg Gly Arg Ile Leu Glu
130 135 140

Leu Pro Tyr Pro Tyr Asn Ala His Pro Ser Phe Leu Asp Thr Leu Ser
145 150 155 160

Phe Pro Ser Ile Lys Glu Ala Arg Met Leu His Phe Trp Gly Asp Lys
165 170 175

Pro Trp Lys Leu Leu Ser Val Ile Gly Ala Lys Lys Trp His Glu Ala
180 185 190

Leu Ile Gln Thr Pro Phe Lys Asp Ala Tyr Phe Asn Ala Ser Phe Leu
195 200 205

Asp His Leu Phe Glu Ser Leu Gln Asn Lys Asp Asn Glu Ile Lys Arg
210 215 220

Arg Asp Glu Arg Ile Ile Glu Ala Leu Gln Ala Arg Asp Lys Ile Leu
225 230 235 240

Ser Phe Ser Asp Lys Arg His Ser Phe Glu Ser Leu Leu Pro Lys Leu
245 250 255

Ser Ser Lys Leu Leu Ile Glu Phe Leu Leu Phe Lys Ala Lys Gln Lys
260 265 270

Val Lys Arg Leu Ile Lys Arg Val Phe
275 280
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Val Arg Glu Lys Asp Leu lie Ala lie Asn Arg Asn Ser Ala Lys Asp 
50 55 60 

TTA TAC GAA TTG CGC CAA ATG CAT GCA AAA TCT ATC GGC ATC GCC AAC 601
Leu Tyr Glu Leu Arg Gln Met His Ala Lys Ser Ile Gly Ile Ala Asn
65 70 75

GCT TTC CCT AAT TTA GAA GAA GCT CAA ATC CTT TTT GAC AAC TAC TTT 649
Ala Phe Pro Asn Leu Glu Glu Ala Gln Ile Leu Phe Asp Asn Tyr Phe
80 85 90

AAC GCC GGG TTT TTA GCC TTA AAT TTA AAA TCA TGG CGT AAA GAA AAT 697
Asn Ala Gly Phe Leu Ala Leu Asn Leu Lys Ser Trp Arg Lys Glu Asn
95 100 105 110

CTT GAA AAC CAA TTG ATT ACC TTT TTC ATT TTG AAA AAT GAA AAA CTT 745
Leu Glu Asn Gln Leu Ile Thr Phe Phe Ile Leu Lys Asn Glu Lys Leu
115 120 125

TTA TTT AAC GAT CAA GAT GCT TTG TGT TTT GTG TGC CGT GGG AGG ATT 793
Leu Phe Asn Asp Gln Asp Ala Leu Cys Phe Val Cys Arg Gly Arg Ile
130 135 140

TTA GAA TTG CCT TAT CCA TAC AAT GCC CAC CCT AGT TTC CTT GAT AGG 841
Leu Glu Leu Pro Tyr Pro Tyr Asn Ala His Pro Ser Phe Leu Asp Thr
145 150 155

CTC TCA TTC CCT AGC ATC AAA GAA GCG CGC ATG GTG CAT TTT TGG GGC 889
Leu Ser Phe Pro Ser Ile Lys Glu Ala Arg Met Leu His Phe Trp Gly
160 165 170

GAT AAA GCG TGG AAA CTC TTA AGC GTC ATT GGC GCG AAA AAA TGG CAT 937
Asp Lys Pro Trp Lys Leu Leu Ser Val Ile Gly Ala Lys Lys Trp His
175 180 185 190

GAA GCG TTG ATC CAA ACG CCT TTT AAA GAC GCC TAT TTC AAC GCT TCT 985
Glu Ala Leu Ile Gln Thr Pro Phe Lys Asp Ala Tyr Phe Asn Ala Ser
195 200 205

TTT TTA GAT CAC CTC TTT GAA TGG CTT CAA AAC AAG GAT AAT GAG ATC 1033
Phe Leu Asp His Leu Phe Glu Ser Leu Gln Asn Lys Asp Asn Glu Ile
210 215 220

AAA AGA AGA GAT GAA AGG ATC ATT GAA GCA CTT GAA GCA AGG GAT AAA 1081
Lys Arg Arg Asp Glu Arg Ile Ile Glu Ala Leu Gln Ala Arg Asp Lys
225 230 235

ATC GTG TCT TTT TCA GAC AAG CGA GAT TCT TTT GAA TGT CTT GTG CGC 1129
Ile Leu Ser Phe Ser Asp Lys Arg His Ser Phe Glu Ser Leu Leu Pro
240 245 250

AAG CTT TCT TCT AAA CTC CTT ATA GAA TTT TTG CTT TTT AAA GCC AAA 1177
Lys Leu Ser Ser Lys Leu Leu Ile Glu Phe Leu Leu Phe Lys Ala Lys
255 260 265 270
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 

Met Asp Leu Leu Phe Ala Thr Pro Thr Ile Ser Pro Phe Leu Pro Phe
1 5 10 15

Lys Asn Pro Met Ile Phe Leu Tyr Lys Asn Glu Met Ile Val Leu Lys
20 25 30

Phe Ser Asn Ser Gln Asp Ala Leu Pro Ile Asn Lys Pro Ser Thr Leu
35 40 45

Ser Ile Pro Lys Ile Ser Leu Ala Phe Cys Val Phe Thr Leu Pro Pro
50 55 60

Tyr Asn Lys Gly Val Phe
65 70

(2) INFORMATION FOR SEQ ID NO: 651: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 368... 1210 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 

TTTAAGGACT ATGCATGCAA GAGATTATCC CTATTGTCGT GGCTTTTGAT AACAACTATT 60

GTATCCCTGC TGGCGTGAGC TTATTTTCCA TGCTAGCAAA CGCCAAACGA GAGAGAGAGA 120

GAGAGAGAGA GTAAAACTCT TTTATCAAAT CCATTGTTTG GTGGAGAGTT TAACCCCAGA 180

GAATATAGCC AAATTAGAAG AAACGATCGC TCCTTTTAGA GCTTTTTCTA GCATAGAGTT 240

TTTGGATATT ACCGATAAAG AATTAGAACC ACGCCACAAT TATAATAAGC TTGATCCTTT 300

AATAGCGAGT GAAATTAAAA AATTGTATTT AAAACTCAAT GCTTTTTCGC AAAAACGCTT 360

TTCTAAA ATG ATC ATG TGC CGT TTC TTT TTT GCC TCC CTT TTC CCC CAA 409
Met Ile Met Cys Arg Phe Phe Phe Ala Ser Leu Phe Pro Gln
1 5 10

TAC GAT AAG ATG ATC ATG TTT GAT GTG GAC ACT TTG TTT GTG AAT GAT 457
Tyr Asp Lys Met Ile Met Phe Asp Val Asp Thr Leu Phe Val Asn Asp
15 20 25 30

ATT AGC GAG AGC TTT TTT ATC CCC CTT GAA ACG CAT TAT TTT GGG GCT 505
Ile Ser Glu Ser Phe Phe Ile Pro Leu Glu Thr His Tyr Phe Gly Ala
35 40 45

GTG AGG GAA AAA GAT TTG ATC GCT ATA AAT AGG AAT TCG GCT AAG GAT 553



-1000- 



BNSDOCID: <WO 9843478A1_I_> 



wo 98/43478 



PCT/US98/06371 



210 215 220 

(2) INFORMATION FOR SEQ ID NO: 649:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 53...262
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649:

GAAAGCCAAA GTAGCCCCTT GATTGAAACA AGATTGAGCG ATCCCATAAG CG ATG GAT 58
Met Asp
1

TTA TTG TTC GCC ACC CCT ACA ATA AGC CCT TTT TTA CCT TTT AAA AAT 106
Leu Leu Phe Ala Thr Pro Thr Ile Ser Pro Phe Leu Pro Phe Lys Asn
5 10 15

CCC ATG ATT TTC CTT TAT AAA AAT GAA ATG ATT GTT TTA AAA TTT TCT 154
Pro Met Ile Phe Leu Tyr Lys Asn Glu Met Ile Val Leu Lys Phe Ser
20 25 30

AAT TCC CAA GAC GCG CTC CCG ATC AAC AAG CCA TCC ACG CTA TCA ATC 202
Asn Ser Gln Asp Ala Leu Pro Ile Asn Lys Pro Ser Thr Leu Ser Ile
35 40 45 50

CCT AAA ATT TCT TTA GCG TTT TGT GTG TTC ACG CTC CCC CCA TAC AAC 250
Pro Lys Ile Ser Leu Ala Phe Cys Val Phe Thr Leu Pro Pro Tyr Asn
55 60 65

AAG GGC GTT TTT TGATTTAAGA TTTGCTTTAA AAAACCATGC GTGAGATAAA TATCT 307
Lys Gly Val Phe
70

TCTAAAGAAG CGCTTTTTTT GGTGCCAATC GCCCAAATAG GCTC 351

(2) INFORMATION FOR SEQ ID NO: 650: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


GGC ATG AAC AGA GCC ATG CGT TTT ATG ATT GAG AGC CAC GAT GAA AGA 632
Gly Met Asn Arg Ala Met Arg Phe Met Ile Asp Ser His Asp Glu Arg
180 185 190

AAC GAT GAT GAT TTT TAT GAG TTA GTC GGC GAT GAT GAT GGT GTT TTT 680
Asn Asp Asp Asp Phe Tyr Glu Leu Val Gly Asp Asp Asp Gly Val Phe
195 200 205

GGG TGC ATG AGC CTT ATC GCT TGC CAT GAC ACT TGC CC TAAAGAATTA CCC 731
Gly Cys Met Ser Leu Ile Ala Cys His Asp Thr Cys Pro
210 215 220

TTGCAAAGCA GTATCGCCAC TTTGCGTAAC AGGATGTTGA AAGTGGGTAA AAGCCGCTAA 791
TTTCTTTTTA GTGGGTCGTT TTTGAAAATC TTTTTAGTCT TTTTAAGCG 840

(2) INFORMATION FOR SEQ ID NO: 648:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648:

Met Ser Asp Asn Glu Arg Thr Ile Val Val Arg Val Leu Lys Phe Asp
1 5 10 15

Pro Gln Ser Ala Val Ser Lys Pro His Phe Lys Glu Tyr Gln Leu Lys
20 25 30

Glu Thr Pro Ser Met Thr Leu Phe Ile Ala Leu Asn Leu Ile Arg Glu
35 40 45

His Gln Asp Pro Asp Leu Ser Phe Asp Phe Val Cys Arg Ala Gly Ile
50 55 60

Cys Gly Ser Cys Ala Met Met Val Asn Gly Arg Pro Arg Leu Ala Cys
65 70 75 80

Lys Thr Leu Thr Ser Ser Phe Glu Ser Gly Val Ile Thr Leu Met Pro
85 90 95

Met Pro Ser Phe Thr Leu Ile Lys Asp Leu Ser Val Asn Thr Gly Asp
100 105 110

Trp Phe Leu Asp Met Thr Lys Arg Val Glu Ser Trp Ala His Ser Lys
115 120 125

Glu Glu Val Asp Ile Thr Arg Pro Glu Lys Arg Val Glu Pro Asp Glu
130 135 140

Ala Gln Glu Val Phe Glu Leu Asp Arg Cys Ile Glu Cys Gly Cys Cys
145 150 155 160

Ile Ala Ser Cys Gly Thr Lys Leu Met Arg Pro Asn Phe Ile Gly Ala
165 170 175

Ala Gly Met Asn Arg Ala Met Arg Phe Met Ile Asp Ser His Asp Glu
180 185 190

Arg Asn Asp Asp Asp Phe Tyr Glu Leu Val Gly Asp Asp Asp Gly Val
195 200 205

Phe Gly Cys Met Ser Leu Ile Ala Cys His Asp Thr Cys Pro
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(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647:

CAATGTCCAT GAGTTATTGG ACCAACACAA CGCTAACCTA AAAGGAGAAC ACC ATG 56
Met
1

AGT GAT AAT GAA CGA ACG ATT GTA GTT AGA GTG CTA AAA TTT GAC CCT 104
Ser Asp Asn Glu Arg Thr Ile Val Val Arg Val Leu Lys Phe Asp Pro
5 10 15

CAA AGC GCG GTG AGT AAG CCG CAT TTT AAA GAG TAT CAG TTG AAA GAA 152
Gln Ser Ala Val Ser Lys Pro His Phe Lys Glu Tyr Gln Leu Lys Glu
20 25 30

ACG CCA TCC ATG ACG CTT TTT ATC GCT TTG AAC CTC ATT AGA GAG CAT 200
Thr Pro Ser Met Thr Leu Phe Ile Ala Leu Asn Leu Ile Arg Glu His
35 40 45

CAA GAT CCG GAT TTG AGT TTT GAT TTT GTG TGC CGC GCT GGG ATT TGC 248
Gln Asp Pro Asp Leu Ser Phe Asp Phe Val Cys Arg Ala Gly Ile Cys
50 55 60 65

GGC TCT TGC GCG ATG ATG GTT AAT GGG AGA CCG AGG CTA GCT TGT AAA 296
Gly Ser Cys Ala Met Met Val Asn Gly Arg Pro Arg Leu Ala Cys Lys
70 75 80

ACC CTA ACT TCT AGC TTT GAA AGC GGG GTG ATC ACG CTC ATG CCC ATG 344
Thr Leu Thr Ser Ser Phe Glu Ser Gly Val Ile Thr Leu Met Pro Met
85 90 95

CCC AGT TTT ACG CTC ATT AAA GAT TTG AGC GTG AAT ACG GGC GAT TGG 392
Pro Ser Phe Thr Leu Ile Lys Asp Leu Ser Val Asn Thr Gly Asp Trp
100 105 110

TTT TTG GAT ATG ACT AAA AGG GTG GAG AGT TGG GCG CAT TCT AAA GAA 440
Phe Leu Asp Met Thr Lys Arg Val Glu Ser Trp Ala His Ser Lys Glu
115 120 125

GAA GTG GAT ATT ACT AGA CCG GAA AAA AGG GTT GAG CCT GAC GAA GCC 488
Glu Val Asp Ile Thr Arg Pro Glu Lys Arg Val Glu Pro Asp Glu Ala
130 135 140 145

CAA GAA GTC TTT GAA CTA GAC AGG TGT ATT GAA TGC GGG TGT TGT ATC 536
Gln Glu Val Phe Glu Leu Asp Arg Cys Ile Glu Cys Gly Cys Cys Ile
150 155 160

GCT TCT TGC GGG ACT AAA CTC ATG CGC CCT AAT TTC ATT GGA GCT GCT 584
Ala Ser Cys Gly Thr Lys Leu Met Arg Pro Asn Phe Ile Gly Ala Ala
165 170 175
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40 45 



50 



GGG GAA TTA AAA TTA GAA GTC GCC CCC AAA AAC CAA AGC TTC TTC AAA 308
Gly Glu Leu Lys Leu Glu Val Ala Pro Lys Asn Gln Ser Phe Phe Lys
55 60 65

ACG CTC TTG GAA GCG GCA GAA AAG GAG AAT ATC AGC CAG AAA AAG ATG 356
Thr Leu Leu Glu Ala Ala Glu Lys Glu Asn Ile Ser Gln Lys Lys Met
70 75 80

CAG ACA CCG AGG GCA AGC AAA ATC TTT TTG AAG TTT TGAAGGCTAT CTTGGA 408
Gln Thr Pro Arg Ala Ser Lys Ile Phe Leu Lys Phe
85 90 95

TAAGGTCAGC AAATTGAGTG AAGAAGAAGT GAATGAGCGT TTG 451

(2) INFORMATION FOR SEQ ID NO: 646:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646:

Met Val Val Ala Lys Asn Glu Asp Asn Lys Lys Leu Tyr Asp Ile Ile
1 5 10 15

Asp Gly Gln Gln Arg Thr Thr Thr Ile Phe Met Leu Leu His Val Leu
20 25 30

Ala Asn Lys Gln Asn Glu Lys Asp Lys Gln Glu Thr Arg Lys Tyr Leu
35 40 45

Tyr Gln Lys Gly Glu Leu Lys Leu Glu Val Ala Pro Lys Asn Gln Ser
50 55 60

Phe Phe Lys Thr Leu Leu Glu Ala Ala Glu Lys Glu Asn Ile Ser Gln
65 70 75 80

Lys Lys Met Gln Thr Pro Arg Ala Ser Lys Ile Phe Leu Lys Phe
85 90 95

(2) INFORMATION FOR SEQ ID NO: 647:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 54...718
85 90 95

Cys Met Thr Asp Asn Pro Thr Arg Thr Ile Ala Asn Leu Lys Ser Tyr
100 105 110

Phe Asn Lys Thr Gln Gly Ala Ser Ile Val Pro Asn Gly Ser Leu Glu
115 120 125

Phe Met Phe Asn Arg Lys Ser Val Phe Glu Cys Leu Lys Asn Glu Val
130 135 140

Glu Asn Leu Lys Leu Ser Leu Glu Asp Leu Glu Phe Ala Leu Ile Asp
145 150 155 160

Tyr Gly Leu Glu Glu Leu Glu Glu Val Glu Asp Lys Ile Ile Ile Arg
165 170 175

Gly Asp Tyr Asn Ser Phe Lys Leu Leu Asn Glu Gly Phe Glu Ser Leu
180 185 190

Lys Leu Pro Ile Leu Lys Ala Ser Leu Gln Arg Ile Ala Thr Thr Pro
195 200 205

Ile Glu Leu Asn Asp Glu Gln Met Glu Leu Thr Glu Lys Leu Leu Asp
210 215 220

Arg Ile Glu Asp Asp Asp Asp Val Val Ala Leu Tyr Thr Asn Ile Glu
225 230 235 240



(2) INFORMATION FOR SEQ ID NO: 645:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 108... 392 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 

ATCAGAGGGA TTACGCCTGG AAGGATAAGA ATTTTAGGGA TCTGTAAGAA GATCTAGAAG 60

AAGCGATAGA ATACAACAAA AAGGGTCAGG GGCATTTTAT AGGCACC ATG GTT GTC 116 

Met Val Val 
1 

GCT AAG AAT GAA GAC AAC AAA AAA TTG TAT GAC ATC ATT GAC GGC CAG 164 
Ala Lys Asn Glu Asp Asn Lys Lys Leu Tyr Asp Ile Ile Asp Gly Gln
5 10 15 

CAA CGA ACG ACT ACC ATC TTC ATG CTC TTG CAT GTC TTG GCG AAC AAA 212 
Gln Arg Thr Thr Thr Ile Phe Met Leu Leu His Val Leu Ala Asn Lys
20 25 30 35 

CAA AAC GAG AAA GAC AAG CAA GAA ACA AGA AAA TAT CTA TAC CAA AAG 260
Gln Asn Glu Lys Asp Lys Gln Glu Thr Arg Lys Tyr Leu Tyr Gln Lys
TTT AAC CGA AAA AGC GTG TTT GAA TGC TTG AAA AAT GAA GTG GAA AAT 488
Phe Asn Arg Lys Ser Val Phe Glu Cys Leu Lys Asn Glu Val Glu Asn
135 140 145 

TTA AAA CTC AGT CTA GAA GAT TTA GAA TTC GCT CTC ATT GAT TAT GGT 536
Leu Lys Leu Ser Leu Glu Asp Leu Glu Phe Ala Leu Ile Asp Tyr Gly
150 155 160 

TTG GAA GAA TTA GAA GAA GTG GAA GAG AAG ATC ATT ATT AGG GGG GAT 584
Leu Glu Glu Leu Glu Glu Val Glu Asp Lys Ile Ile Ile Arg Gly Asp
165 170 175 

TAT AAC AGC TTC AAG CTT TTA AAT GAG GGG TTT GAA AGC TTG AAA TTA 632 
Tyr Asn Ser Phe Lys Leu Leu Asn Glu Gly Phe Glu Ser Leu Lys Leu 
180 185 190 

CCC ATT TTA AAA GCG AGT TTG CAA CGC ATC GCC ACA ACG CCC ATT GAA 680 
Pro Ile Leu Lys Ala Ser Leu Gln Arg Ile Ala Thr Thr Pro Ile Glu
195 200 205 210 

TTG AAT GAC GAA CAA ATG GAG CTT ACC GAA AAA TTA CTG GAC AGG ATT 728
Leu Asn Asp Glu Gln Met Glu Leu Thr Glu Lys Leu Leu Asp Arg Ile
215 220 225 

GAA GAC GAT GAT GAT GTG GTC GCG CTT TAT ACC AAT ATT GAG TGAAATGCA 779 
Glu Asp Asp Asp Asp Val Val Ala Leu Tyr Thr Asn Ile Glu
230 235 240 

AAAAGACTCA AAGTATTTTT TTAAACCACC CAAGCATTCA TCCCAATAGG GAATGCGTTG 839

GAGA 843

(2) INFORMATION FOR SEQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

Met Gly Arg Ala Phe Glu Tyr Arg Arg Ala Ala Lys Glu Lys Arg Trp 

1 5 10 15

Asp Lys Met Ser Lys Val Phe Pro Lys Leu Ala Lys Ala Ile Thr Leu

20 25 30 

Ala Ala Lys Asp Gly Gly Ser Glu Pro Asp Thr Asn Ala Lys Leu Arg 

35 40 45 

Thr Ala Ile Leu Asn Ala Lys Ala Gln Asn Met Pro Lys Asp Asn Ile

50 55 60 

Asp Ala Ala Ile Lys Arg Ala Ser Ser Lys Glu Gly Asn Leu Ser Glu
65 70 75 80 

Ile Thr Tyr Glu Gly Lys Ala Asn Phe Gly Val Leu Ile Ile Met Glu
(2) INFORMATION FOR SEQ ID NO: 643:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...770
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:643: 

TTATTAGCTA TCATGCTAAA GAAGTGGCTA ACTTATTACA AAGGAATTGA ATG GGA 56

Met Gly 
1 

CGA GCG TTT GAA TAC AGA AGA GCG GCT AAA GAA AAA CGA TGG GAT AAG 104
Arg Ala Phe Glu Tyr Arg Arg Ala Ala Lys Glu Lys Arg Trp Asp Lys 
5 10 15 

ATG AGT AAG GTT TTC CCA AAG CTC GCT AAA GCG ATC ACT CTA GCG GCA 152
Met Ser Lys Val Phe Pro Lys Leu Ala Lys Ala Ile Thr Leu Ala Ala
20 25 30 

AAA GAT GGC GGG AGC GAA CCG GAC ACG AAC GCC AAA CTA CGA ACA GCG 200

Lys Asp Gly Gly Ser Glu Pro Asp Thr Asn Ala Lys Leu Arg Thr Ala 
35 40 45 50 

ATT TTA AAC GCT AAA GCG CAA AAC ATG CCT AAA GAC AAT ATT GAC GCA 248
Ile Leu Asn Ala Lys Ala Gln Asn Met Pro Lys Asp Asn Ile Asp Ala
55 60 65 

GCG ATT AAA AGA GCG AGC AGT AAA GAA GGG AAT TTG AGT GAA ATC ACT 296
Ala Ile Lys Arg Ala Ser Ser Lys Glu Gly Asn Leu Ser Glu Ile Thr
70 75 80 

TAT GAA GGT AAG GCG AAT TTT GGC GTG CTA ATC ATC ATG GAA TGC ATG 344
Tyr Glu Gly Lys Ala Asn Phe Gly Val Leu Ile Ile Met Glu Cys Met
85 90 95 

ACT GAT AAC CCC ACC AGA ACC ATT GCC AAC CTT AAA AGC TAT TTC AAT 392
Thr Asp Asn Pro Thr Arg Thr Ile Ala Asn Leu Lys Ser Tyr Phe Asn
100 105 110 

AAA ACG CAA GGG GCA AGC ATC GTG CCT AAT GGC TCT TTA GAG TTT ATG 440
Lys Thr Gln Gly Ala Ser Ile Val Pro Asn Gly Ser Leu Glu Phe Met
115 120 125 130 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641:

TACTGAATTT ATCCAATCGC TAAAACCCTT AGAAGATTAA AGGAAAAGAG ATG AAA 56

Met Lys 
1 

TTT TTA AAC GGA TTA GCA GGG AAT TTA CTG ATT GTG GTT ATT TTA TTG 104 
Phe Leu Asn Gly Leu Ala Gly Asn Leu Leu Ile Val Val Ile Leu Leu
5 10 15 

TGT GTG GCC GTT TTT TTT ACG CTC AAA GCG ATC CAT ATC CAA AAA GAG 152
Cys Val Ala Val Phe Phe Thr Leu Lys Ala Ile His Ile Gln Lys Glu
20 25 30 

CAA GCC ACC AAT TAT TAC CGC TAT AAG GAT ATT AAC GCT TTA GAG ACA 200 
Gln Ala Thr Asn Tyr Tyr Arg Tyr Lys Asp Ile Asn Ala Leu Glu Thr
35 40 45 50 

AAA AAC ACC CAA AAC CGG GCT AAC TAT GAA TTA GTC AAT CAA GGG AGT 248
Lys Asn Thr Gln Asn Arg Ala Asn Tyr Glu Leu Val Asn Gln Gly Ser
55 60 65 

AAA AAA TGAAATTCAC GACTTTAGAA AAAATTTTAG CTTTGATGGT AGTAGCGACC AT 306
Lys Lys 



T 307 
(2) INFORMATION FOR SEQ ID NO:642: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 

Met Lys Phe Leu Asn Gly Leu Ala Gly Asn Leu Leu Ile Val Val Ile

1 5 10 15

Leu Leu Cys Val Ala Val Phe Phe Thr Leu Lys Ala Ile His Ile Gln

20 25 30 

Lys Glu Gln Ala Thr Asn Tyr Tyr Arg Tyr Lys Asp Ile Asn Ala Leu

35 40 45 

Glu Thr Lys Asn Thr Gln Asn Arg Ala Asn Tyr Glu Leu Val Asn Gln

50 55 60 

Gly Ser Lys Lys 
65 
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640: 

Met Asp Phe Leu Asn Asp His Ile Asn Val Phe Gly Leu Ile Ala Ala

1 5 10 15 

Leu Val Ile Leu Val Leu Thr Ile Tyr Glu Ser Ser Ser Leu Ile Lys

20 25 30 

Glu Met Arg Asp Ser Lys Ser Gln Gly Glu Leu Val Glu Asn Gly His

35 40 45 

Leu Ile Asp Gly Ile Gly Glu Phe Ala Asn Asn Val Pro Val Gly Trp

50 55 60 

Ile Ala Ser Phe Met Cys Thr Ile Val Trp Ala Phe Trp Tyr Phe Phe
65 70 75 80 

Phe Gly Tyr Pro Leu Asn Ser Phe Ser Gln Ile Gly Gln Tyr Asn Glu

85 90 95 

Glu Val Lys Ala His Asn Gln Lys Phe Glu Ala Lys Trp Lys His Leu

100 105 110 

Gly Gln Lys Glu Leu Val Asp Met Gly Gln Gly Ile Phe Leu Val His

115 120 125 

Cys Ser Gln Cys His Gly Ile Thr Ala Glu Gly Leu His Gly Ser Ala

130 135 140 

Gln Asn Leu Val Arg Trp Gly Lys Glu Glu Gly Ile Met Asp Thr Ile
145 150 155 160

Lys His Gly Ser Lys Gly Met Asp Tyr Leu Ala Gly Glu Met Pro Ala 

165 170 175 

Met Glu Leu Asp Glu Lys Asp Ala Lys Ala Ile Ala Ser Tyr Val Met

180 185 190 

Ala Glu Leu Ser Ser Val Lys Lys Thr Lys Asn Pro Gln Leu Ile Asp

195 200 205 

Lys Gly Lys Glu Leu Phe Glu Ser Met Gly Cys Thr Gly Cys His Gly 

210 215 220 

Asn Asp Gly Lys Gly Leu Gln Glu Asn Gln Val Phe Ala Ala Asp Leu
225 230 235 240 

Thr Ala Tyr Gly Thr Glu Asn Phe Leu Arg Asn Ile Leu Thr His Gly

245 250 255 

Lys Lys Gly Asn Ile Gly His Met Pro Ser Phe Lys Tyr Lys Asn Phe

260 265 270 

Ser Asp Leu Gln Val Lys Ala Leu Leu Asn Leu Ser Asn Arg
275 280 285 



(2) INFORMATION FOR SEQ ID NO: 641: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...254
(D) OTHER INFORMATION: 
AAG GAA CTG GTG GAT ATG GGT CAA GGC ATC TTT TTA GTC CAT TGT TCG 440
Lys Glu Leu Val Asp Met Gly Gln Gly Ile Phe Leu Val His Cys Ser
115 120 125 130 

CAA TGC CAT GGC ATC ACC GCT GAG GGC TTG CAT GGG AGC GCT CAA AAT 488
Gln Cys His Gly Ile Thr Ala Glu Gly Leu His Gly Ser Ala Gln Asn
135 140 145

CTG GTG CGC TGG GGT AAA GAA GAG GGT ATT ATG GAT ACC ATT AAG CAT 536
Leu Val Arg Trp Gly Lys Glu Glu Gly Ile Met Asp Thr Ile Lys His
150 155 160 

GGC TCT AAG GGC ATG GAT TAT CTC GCT GGG GAA ATG CCC GCT ATG GAA 584

Gly Ser Lys Gly Met Asp Tyr Leu Ala Gly Glu Met Pro Ala Met Glu 
165 170 175 

TTG GAC GAA AAA GAC GCT AAA GCG ATC GCA AGC TAT GTG ATG GCA GAA 632
Leu Asp Glu Lys Asp Ala Lys Ala Ile Ala Ser Tyr Val Met Ala Glu
180 185 190 

CTT TCT AGC GTT AAA AAA ACC AAA AAC CCT CAA CTC ATT GAT AAA GGC 680
Leu Ser Ser Val Lys Lys Thr Lys Asn Pro Gln Leu Ile Asp Lys Gly
195 200 205 210 

AAG GAA TTG TTT GAA AGC ATG GGT TGC ACA GGC TGT CAT GGC AAT GAT 728

Lys Glu Leu Phe Glu Ser Met Gly Cys Thr Gly Cys His Gly Asn Asp 
215 220 225 

GGT AAG GGC TTG CAA GAA AAT CAA GTG TTT GCA GCC GAT TTG ACC GCT 776
Gly Lys Gly Leu Gln Glu Asn Gln Val Phe Ala Ala Asp Leu Thr Ala
230 235 240 

TAC GGC ACA GAG AAT TTT TTG AGA AAT ATC TTA ACG CAT GGC AAA AAG 824
Tyr Gly Thr Glu Asn Phe Leu Arg Asn Ile Leu Thr His Gly Lys Lys
245 250 255 

GGC AAT ATA GGG CAT ATG CCA TCA TTC AAG TAT AAA AAC TTT AGC GAT 872
Gly Asn Ile Gly His Met Pro Ser Phe Lys Tyr Lys Asn Phe Ser Asp
260 265 270 

TTG CAA GTT AAA GCG TTA CTG AAT TTA TCC AAT CGC TAAAACCCTT AGAAGA 924
Leu Gln Val Lys Ala Leu Leu Asn Leu Ser Asn Arg
275 280 285 

TTAAAGGAAA AGAGATGAAA TTTTTAAACG GATTAGC 961 
(2) INFORMATION FOR SEQ ID NO: 640:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
530 535 540

Glu Val Val Pro Ala Ile Lys
545 550 

(2) INFORMATION FOR SEQ ID NO:639: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...908
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639:

TGAACCACGC CATAAAAAAG TTCATGATAA TGGCATAAAG GAAAGTTGAA ATG GAT 56 

Met Asp 
1 

TTT TTA AAC GAC CAT ATA AAT GTT TTT GGC TTG ATT GCA GCG CTT GTG 104 
Phe Leu Asn Asp His Ile Asn Val Phe Gly Leu Ile Ala Ala Leu Val
5 10 15 

ATT TTA GTT TTA ACC ATC TAT GAA TCC AGT TCG CTC ATT AAA GAA ATG 152 
Ile Leu Val Leu Thr Ile Tyr Glu Ser Ser Ser Leu Ile Lys Glu Met
20 25 30 

CGC GAC AGC AAA TCT CAA GGT GAG CTT GTA GAA AAT GGG CAT TTG ATT 200
Arg Asp Ser Lys Ser Gln Gly Glu Leu Val Glu Asn Gly His Leu Ile
35 40 45 50 

GAT GGG ATA GGG GAG TTT GCC AAT AAT GTG CCA GTA GGC TGG ATC GCA 248
Asp Gly Ile Gly Glu Phe Ala Asn Asn Val Pro Val Gly Trp Ile Ala
55 60 65 

AGC TTT ATG TGC ACG ATT GTG TGG GCT TTT TGG TAT TTC TTC TTT GGG 296
Ser Phe Met Cys Thr Ile Val Trp Ala Phe Trp Tyr Phe Phe Phe Gly
70 75 80 

TAT CCG CTG AAT AGC TTT TCT CAA ATC GGG CAA TAC AAT GAA GAG GTT 344
Tyr Pro Leu Asn Ser Phe Ser Gln Ile Gly Gln Tyr Asn Glu Glu Val
85 90 95 

AAA GCG CAC AAC CAA AAA TTT GAG GCC AAG TGG AAG CAT TTG GGT CAA 392
Lys Ala His Asn Gln Lys Phe Glu Ala Lys Trp Lys His Leu Gly Gln
100 105 110
Ser Gly Tyr Phe Glu Ile Leu Lys Glu Ser Val Gln Ser Ile Thr Leu

100 105 110 

Asp His Arg Ile Leu Val Ile Leu Ile Gly Phe Cys Phe Gly Ser Phe

115 120 125 

Leu Glu Gly Ala Ile Gly Phe Gly Gly Pro Ile Ala Ile Thr Ala Ala

130 135 140 

Ile Leu Val Gly Leu Gly Leu Ser Pro Leu Tyr Ser Ala Gly Leu Cys
145 150 155 160 

Leu Ile Ala Asn Thr Ala Pro Val Ala Phe Gly Ala Val Gly Ile Pro

165 170 175

Ile Ser Ala Met Ala Ser Ala Val Gly Val Pro Ala Ile Leu Ile Ser

180 185 190 

Ala Met Thr Gly Lys Ile Leu Phe Phe Val Ser Leu Leu Val Pro Phe

195 200 205

Phe Ile Val Phe Leu Met Asp Gly Phe Lys Gly Ile Lys Glu Thr Phe

210 215 220 

Pro Ala Val Phe Ile Ala Ala Phe Ser Phe Ala Gly Ala Gln Phe Leu
225 230 235 240 

Ser Ser Asn Tyr Leu Gly Pro Glu Leu Pro Gly Ile Ile Ser Ala Leu

245 250 255 

Val Ser Leu Val Ala Thr Ala Leu Phe Leu Lys Phe Trp Gln Pro Lys

260 265 270 

Ala Ile Phe Arg Ser Asp Gly Lys Ala Ala Ser Phe Thr Lys Ser Asn

275 280 285 

His His Ile Cys Lys Ile Tyr Val Ala Trp Ser Pro Phe Val Ile Leu

290 295 300 

Val Leu Val Ile Val Leu Trp Ile Gln Pro Phe Phe Lys Ala Leu Phe
305 310 315 320 

Glu Lys Asp Gly Leu Leu Ala Phe Ser Asn Phe Tyr Phe Glu Phe Asn 

325 330 335 

Asn Ile Ser Asn His Ile Phe Lys Ser Pro Pro Phe Val Glu Ala Asn

340 345 350 

Gln Ser Val Ser Phe Pro Val Val Phe Lys Phe Leu Leu Ile Asn Thr

355 360 365 

Val Gly Thr Ser Ile Phe Leu Ala Ala Leu Val Ser Met Leu Val Leu

370 375 380 

Arg Val Arg Val Ser Asp Ala Leu Ser Val Phe Gly Glu Thr Leu Lys 
385 390 395 400 

Glu Met Arg Tyr Pro Ile Leu Thr Ile Gly Leu Val Leu Ser Phe Ala

405 410 415 

Tyr Val Ser Asn Tyr Ser Gly Ile Ser Ser Thr Leu Ala Leu Ala Leu

420 425 430 

Thr His Thr Gly Leu Ala Phe Thr Phe Phe Ser Pro Leu Ile Gly Trp

435 440 445 

Val Gly Val Phe Leu Thr Gly Ser Asp Thr Ser Ser Asn Leu Leu Phe 

450 455 460 

Gly Ser Leu Gln Gln Leu Thr Ala Gln Arg Leu His Leu Pro Glu Val
465 470 475 480 

Leu Thr Leu Thr Ala Asn Thr Val Gly Gly Thr Leu Gly Lys Met Ile

485 490 495 

Ser Pro Gln Ser Ile Ala Ile Ala Cys Ala Ala Val Gly Leu Ala Gly

500 505 510 

Lys Glu Ser Asp Leu Phe Lys Phe Thr Val Lys Tyr Ser Leu Ile Phe

515 520 525 

Val Ala Ile Met Gly Val Val Ile Ser Ala Ile Ala Tyr Leu Ile Pro
GTG TTT TTA ACC GGG AGC GAT AGO AGT TCC AAT CTT TTG TTT GGC TCT 1448

Val Phe Leu Thr Gly Ser Asp Thr Ser Ser Asn Leu Leu Phe Gly Ser 
455 460 465 

TTA CAG CAA CTC ACC GCC CAA CGA TTG CAC CTC CCT GAG GTT TTA ACC 1496
Leu Gln Gln Leu Thr Ala Gln Arg Leu His Leu Pro Glu Val Leu Thr
470 475 480 

CTA ACG GCT AAT ACC GTG GGT GGC ACT TTA GGC AAG ATG ATA AGC CCT 1544 
Leu Thr Ala Asn Thr Val Gly Gly Thr Leu Gly Lys Met Ile Ser Pro
485 490 495 

CAA AGC ATC GCT ATC GCT TGC GCG GCG GTG GGG TTA GCC GGG AAA GAG 1592
Gln Ser Ile Ala Ile Ala Cys Ala Ala Val Gly Leu Ala Gly Lys Glu
500 505 510 

AGC GAT TTG TTC AAA TTC ACG GTT AAA TAC TCC CTT ATT TTT GTA GCG 1640
Ser Asp Leu Phe Lys Phe Thr Val Lys Tyr Ser Leu Ile Phe Val Ala
515 520 525 530 

ATC ATG GGA GTT GTG ATC AGC GCG ATT GCG TAT TTG ATC CCT GAA GTG 1688
Ile Met Gly Val Val Ile Ser Ala Ile Ala Tyr Leu Ile Pro Glu Val
535 540 545 

GTG CCT GCG ATA AAG TAGGGCCATT TTAGATTTAG CAGGGTTTAA CCCCCAAATA A 1744
Val Pro Ala Ile Lys
550 

ATTTTTTTGT TT 1756

(2) INFORMATION FOR SEQ ID NO:638: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638:

Val Leu Glu Phe His Gln Ile Tyr Asp Pro Leu Gly Asn Ile Trp Leu

1 5 10 15

Ser Ala Leu Val Ala Leu Leu Pro Ile Leu Leu Phe Phe Leu Ser Leu

20 25 30 

Met Val Phe Lys Leu Lys Gly Tyr Thr Ala Ala Phe Leu Ser Val Ala 

35 40 45 

Leu Ser Ala Val Ile Ala Val Leu Val Tyr Lys Met Pro Val Ser Met

50 55 60 

Val Gly Ser Ser Phe Leu Tyr Gly Phe Leu Tyr Gly Leu Trp Pro Ile
65 70 75 80 

Ala Trp Ile Ile Ile Ala Ala Ile Phe Leu Tyr Lys Leu Ser Val Lys
85 90 95 
GTT TTT ATC GCG GCT TTT TCT TTC GCT GGT GCG CAA TTT TTA AGC TCT 776
Val Phe Ile Ala Ala Phe Ser Phe Ala Gly Ala Gln Phe Leu Ser Ser
230 235 240 

AAT TAT TTA GGG CCA GAA TTG CCT GGT ATT ATT TCA GCC GTT GTT TCA 824 
Asn Tyr Leu Gly Pro Glu Leu Pro Gly Ile Ile Ser Ala Leu Val Ser
245 250 255 

CTC GTT GCA ACA GCG CTC TTT TTG AAA TTT TGG CAG CCT AAA GCG ATT 872
Leu Val Ala Thr Ala Leu Phe Leu Lys Phe Trp Gln Pro Lys Ala Ile
260 265 270 

TTT AGA AGC GAC GGC AAA GCG GCT TCG TTC ACT AAG AGT AAC CAT CAT 920

Phe Arg Ser Asp Gly Lys Ala Ala Ser Phe Thr Lys Ser Asn His His 
275 280 285 290 

ATT TGT AAG ATC TAT GTC GCT TGG TCT CCT TTT GTG ATT TTA GTT TTA 968
Ile Cys Lys Ile Tyr Val Ala Trp Ser Pro Phe Val Ile Leu Val Leu
295 300 305 

GTG ATT GTG CTA TGG ATA CAG CCT TTT TTT AAA GCC TTG TTT GAA AAA 1016 
Val Ile Val Leu Trp Ile Gln Pro Phe Phe Lys Ala Leu Phe Glu Lys
310 315 320 

GAC GGC TTG TTA GCT TTT TCT AAT TTT TAT TTT GAA TTC AAT AAC ATC 1064
Asp Gly Leu Leu Ala Phe Ser Asn Phe Tyr Phe Glu Phe Asn Asn Ile
325 330 335 

AGT AAC CAC ATC TTT AAA AGC CCG CCT TTT GTA GAA GCC AAT CAA AGC 1112 
Ser Asn His Ile Phe Lys Ser Pro Pro Phe Val Glu Ala Asn Gln Ser
340 345 350 

GTG AGT TTT CCG GTG GTG TTT AAA TTT CTC TTA ATC AAC ACG GTT GGC 1160
Val Ser Phe Pro Val Val Phe Lys Phe Leu Leu Ile Asn Thr Val Gly
355 360 365 370 

ACT TCC ATT TTT TTA GCC GCT CTT GTT AGC ATG CTC GTT TTA AGG GTG 1208
Thr Ser Ile Phe Leu Ala Ala Leu Val Ser Met Leu Val Leu Arg Val
375 380 385 

CGA GTG AGC GAT GCG CTG AGC GTC TTT GGC GAG ACT TTA AAA GAA ATG 1256

Arg Val Ser Asp Ala Leu Ser Val Phe Gly Glu Thr Leu Lys Glu Met 
390 395 400 

CGT TAC CCC ATT CTC ACC ATT GGT TTA GTC TTA AGC TTT GCC TAT GTG 1304
Arg Tyr Pro Ile Leu Thr Ile Gly Leu Val Leu Ser Phe Ala Tyr Val
405 410 415 

TCT AAT TAC AGC GGG ATT TCT TCC ACT CTA GCC TTA GCG CTC ACG CAT 1352
Ser Asn Tyr Ser Gly Ile Ser Ser Thr Leu Ala Leu Ala Leu Thr His
420 425 430 

ACG GGT TTG GCT TTC ACC TTT TTC TCG CCC TTG ATC GGG TGG GTA GGC 1400
Thr Gly Leu Ala Phe Thr Phe Phe Ser Pro Leu Ile Gly Trp Val Gly
435 440 445 450 
GAA TTT CAT CAA ATT TAT GAT CCT TTG GGT AAT ATT TGG CTG AGC GCT 104
Glu Phe His Gln Ile Tyr Asp Pro Leu Gly Asn Ile Trp Leu Ser Ala
5 10 15 

CTT GTG GCC TTA TTG CCG ATT TTA TTG TTT TTC TTA TCT TTA ATG GTT 152 
Leu Val Ala Leu Leu Pro Ile Leu Leu Phe Phe Leu Ser Leu Met Val
20 25 30 

TTT AAA CTC AAA GGT TAT ACA GCG GCC TTT TTG AGC GTG GCC TTA TCA 200

Phe Lys Leu Lys Gly Tyr Thr Ala Ala Phe Leu Ser Val Ala Leu Ser 
35 40 45 50 

GCC GTT ATT GCG GTT TTA GTG TAT AAA ATG CCT GTT AGC ATG GTG GGT 248
Ala Val Ile Ala Val Leu Val Tyr Lys Met Pro Val Ser Met Val Gly
55 60 65 

TCA AGC TTC CTT TAC GGC TTT CTT TAT GGC TTA TGG CCG ATC GCT TGG 296
Ser Ser Phe Leu Tyr Gly Phe Leu Tyr Gly Leu Trp Pro Ile Ala Trp
70 75 80 

ATC ATT ATT GCG GCG ATT TTT TTA TAC AAA CTC AGC GTT AAA TCC GGC 344
Ile Ile Ile Ala Ala Ile Phe Leu Tyr Lys Leu Ser Val Lys Ser Gly
85 90 95 

TAT TTT GAA ATT TTA AAA GAA AGC GTC CAG TCC ATC ACT TTA GAT CAC 392
Tyr Phe Glu Ile Leu Lys Glu Ser Val Gln Ser Ile Thr Leu Asp His
100 105 110 

CGC ATT TTA GTG ATT TTG ATT GGC TTT TGT TTT GGC TCG TTT TTA GAA 440
Arg Ile Leu Val Ile Leu Ile Gly Phe Cys Phe Gly Ser Phe Leu Glu
115 120 125 130 

GGG GCG ATC GGC TTT GGA GGG CCT ATT GCC ATT ACC GCA GCG ATT TTA 488
Gly Ala Ile Gly Phe Gly Gly Pro Ile Ala Ile Thr Ala Ala Ile Leu
135 140 145 

GTG GGC TTA GGG TTA AGC CCT TTG TAT TCT GCC GGG TTA TGT TTG ATC 536 
Val Gly Leu Gly Leu Ser Pro Leu Tyr Ser Ala Gly Leu Cys Leu Ile
150 155 160 

GCT AAT ACC GCT CCT GTA GCT TTT GGC GCG GTG GGT ATC CCT ATA AGT 584
Ala Asn Thr Ala Pro Val Ala Phe Gly Ala Val Gly Ile Pro Ile Ser
165 170 175 

GCT ATG GCG AGC GCG GTA GGG GTG CCA GCG ATT TTA ATT TCA GCC ATG 632 
Ala Met Ala Ser Ala Val Gly Val Pro Ala Ile Leu Ile Ser Ala Met
180 185 190 

ACG GGT AAA ATC CTC TTT TTT GTG AGC TTG TTA GTG CCG TTT TTC ATT 680 
Thr Gly Lys Ile Leu Phe Phe Val Ser Leu Leu Val Pro Phe Phe Ile
195 200 205 210 

GTG TTT TTA ATG GAT GGC TTT AAA GGG ATT AAA GAA ACT TTT CCG GCC 728
Val Phe Leu Met Asp Gly Phe Lys Gly Ile Lys Glu Thr Phe Pro Ala
215 220 225 
TCA AAC AAC TCT TCA CGC ATC GCT TCA ATT TCT TTA TCA TAAGGCTGTA AA 347
Ser Asn Asn Ser Ser Arg Ile Ala Ser Ile Ser Leu Ser
85 90 95 

GTAAAATCCT TAAACGCTTC AAAATTCAAA TTCAAATCTG T 388

(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636: 

Met Ser Ala Val Arg Leu Gly Pro Pro Ile Arg Ser Met Phe Val Gly

1 5 10 15

Asn Ala Pro Phe Trp Leu Trp Phe Phe Lys Ala Cys Met Pro Ser Leu 

20 25 30 

Lys Leu Arg Thr Met Asp Ser Phe Phe Asn Lys Met Met Gln Leu Arg

35 40 45 

Val Met Ser Glu Ser Asn Arg Gly Leu Ala Cys Glu Glu Thr Ala Pro 

50 55 60 

Ile Met Pro Lys Leu Glu Ile Pro Gln Thr Pro Cys Asn Lys Ala Val
65 70 75 80 

Ser Ile Ser Asn Asn Ser Ser Arg Ile Ala Ser Ile Ser Leu Ser
85 90 95 

(2) INFORMATION FOR SEQ ID NO:637: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1703 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:637: 

TTTTAAAGAG AACTAGCACT AAGAGAATAT TTTTAAAAAG GGATTTTTTA GTG CTA 56

Val Leu 
1 
Asp Glu Val Lys Ser Phe Phe Glu Ile His Arg Ala Glu Gly Ser Leu

290 295 300 

Ala Ser Gly Val His Leu Glu Met Thr Gly Glu Asn Val Thr Glu Cys 
305 310 315 320 

Ile Gly Gly Ser Gln Ala Ile Thr Glu Glu Gly Leu Ser Cys His Tyr

325 330 335 

Tyr Thr Gln Cys Asp Pro Arg Leu Asn Ala Thr Gln Ala Leu Glu Leu

340 345 350 

Ala Phe Leu Ile Ala Asp Met Leu Lys Lys Gln His Ala
355 360 365 

(2) INFORMATION FOR SEQ ID NO:635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...335
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635:

TGATAGCGAC TTTTTGAGGC CCATGCACCC CAAAAACGGT TTTTAATTCA ATG TCA 56 

Met Ser 
1 

GCT GTC CGG CTA GGC CCG CCA ATA AGG AGC ATG TTT GTG GGT AAT GCA 104 
Ala Val Arg Leu Gly Pro Pro Ile Arg Ser Met Phe Val Gly Asn Ala
5 10 15 

CCG TTT TGG CTT TGG TTT TTT AAA GCT TGC ATG CCT TCA CTC AAA TTG 152 
Pro Phe Trp Leu Trp Phe Phe Lys Ala Cys Met Pro Ser Leu Lys Leu 
20 25 30 

CGC ACA ATG GAT TCT TTT TTC AAT AAG ATG ATG CAA TTA AGG GTG ATG 200 
Arg Thr Met Asp Ser Phe Phe Asn Lys Met Met Gln Leu Arg Val Met
35 40 45 50 

AGC GAA AGC AAT CGC GGG CTT GCA TGC GAA GAG ACC GCC CCA ATC ATG 248
Ser Glu Ser Asn Arg Gly Leu Ala Cys Glu Glu Thr Ala Pro Ile Met
55 60 65 

CCC AAG CTT GAA ATC CCA CAA ACC CCA TGC AAT AAA GCC GTA TCA ATC 296
Pro Lys Leu Glu Ile Pro Gln Thr Pro Cys Asn Lys Ala Val Ser Ile
70 75 80 
TTA ATC GCT GAC ATG CTC AAA AAA CAG CAC GCT TAGTTAAAAA GAGATTAATC 1165
Leu Ile Ala Asp Met Leu Lys Lys Gln His Ala
355 360 365 

TTTTTTTAAC TCTTTTACTT TATAATTATC GTT 1198 
(2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:634: 

Met Phe Lys Val Met Met Gln Met Ala Ile Val Leu Thr Phe Ala Gly

1 5 10 15

Ser Ile Pro Ile Val Lys Val Gly Arg Ile Ala Gly Gln Phe Ala Lys

20 25 30 

Pro Arg Ser Asn Ala Thr Glu Met Leu Asp Asn Glu Glu Val Leu Ser 

35 40 45 

Tyr Arg Gly Asp Ile Ile Asn Gly Ile Ser Lys Lys Glu Arg Glu Pro

50 55 60 

Asn Pro Glu Arg Met Leu Lys Ala Tyr His Gln Ser Val Ala Thr Leu
65 70 75 80 

Asn Leu Ile Arg Ala Phe Ala Gln Gly Gly Leu Ala Asp Leu Glu Gln

85 90 95 

Val His Arg Phe Asn Leu Asp Phe Val Lys Asn Asn Asp Phe Gly Gln

100 105 110 

Lys Tyr Gln Gln Ile Ala Asp Arg Ile Thr Gln Ala Leu Gly Phe Met

115 120 125 

Arg Ala Cys Gly Val Glu Ile Glu Arg Thr Pro Ile Leu Arg Glu Val

130 135 140 

Glu Phe Tyr Thr Ser His Glu Ala Leu Leu Leu His Tyr Glu Glu Pro 
145 150 155 160 

Leu Val Arg Lys Asp Ser Leu Thr Asn Gln Phe Tyr Asp Cys Ser Ala

165 170 175 

His Met Leu Trp Ile Gly Glu Arg Thr Arg Asp Pro Lys Gly Ala His

180 185 190 

Val Glu Phe Leu Arg Gly Val Cys Asn Pro Ile Gly Val Lys Ile Gly

195 200 205 

Pro Asn Ala Ser Val Ser Glu Val Leu Glu Leu Cys Asp Val Leu Asn 

210 215 220 

Pro Arg Asn Ile Lys Gly Arg Leu Asn Leu Ile Val Arg Met Gly Ser
225 230 235 240 

Lys Met Ile Lys Glu Arg Leu Pro Lys Leu Leu Gln Gly Val Leu Glu

245 250 255 

Glu Lys Arg His Ile Leu Trp Ser Ile Asp Pro Met His Gly Asn Thr

260 265 270 

Val Lys Thr Ser Leu Gly Val Lys Thr Arg Ala Phe Asp Ser Val Leu 
275 280 285 
TGC GGG GTG GAG ATA GAG CGA ACG CCT ATT CTT AGG GAA GTG GAA TTT 488
Cys Gly Val Glu Ile Glu Arg Thr Pro Ile Leu Arg Glu Val Glu Phe
135 140 145 

TAC ACC AGC CAC GAA GCG TTA CTG CTC CAT TAT GAA GAG CCG TTG GTG 536
Tyr Thr Ser His Glu Ala Leu Leu Leu His Tyr Glu Glu Pro Leu Val 
150 155 160 

CGT AAG GAT AGT CTG ACT AAC CAG TTT TAT GAT TGC TCC GCG CAC ATG 584
Arg Lys Asp Ser Leu Thr Asn Gln Phe Tyr Asp Cys Ser Ala His Met
165 170 175 

CTA TGG ATT GGC GAA AGG ACA AGA GAC CCT AAG GGT GCG CAT GTG GAG 632 
Leu Trp Ile Gly Glu Arg Thr Arg Asp Pro Lys Gly Ala His Val Glu
180 185 190 

TTT TTA AGG GGG GTT TGT AAC CCT ATT GGC GTG AAA ATC GGG CCT AAT 680
Phe Leu Arg Gly Val Cys Asn Pro Ile Gly Val Lys Ile Gly Pro Asn
195 200 205 210 

GCG AGC GTG AGC GAA GTG TTA GAA TTG TGC GAT GTT TTA AAC CCG CGC 728

Ala Ser Val Ser Glu Val Leu Glu Leu Cys Asp Val Leu Asn Pro Arg 
215 220 225 

AAC ATT AAG GGG CGT TTG AAT TTG ATC GTG CGC ATG GGT TCT AAG ATG 776
Asn Ile Lys Gly Arg Leu Asn Leu Ile Val Arg Met Gly Ser Lys Met
230 235 240 

ATT AAA GAG CGT TTG CCT AAA CTT TTA CAA GGG GTG TTG GAA GAA AAA 824
Ile Lys Glu Arg Leu Pro Lys Leu Leu Gln Gly Val Leu Glu Glu Lys
245 250 255 

CGC CAT ATT TTA TGG AGC ATT GAT CCC ATG CAT GGC AAC ACG GTT AAA 872 
Arg His Ile Leu Trp Ser Ile Asp Pro Met His Gly Asn Thr Val Lys
260 265 270 

ACC AGC TTG GGG GTT AAA ACA AGG GCT TTT GAT AGC GTG TTA GAT GAA 920

Thr Ser Leu Gly Val Lys Thr Arg Ala Phe Asp Ser Val Leu Asp Glu 
275 280 285 290 

GTG AAA AGC TTT TTT GAA ATC CAT AGG GCT GAA GGG AGT TTG GCT TCA 968
Val Lys Ser Phe Phe Glu Ile His Arg Ala Glu Gly Ser Leu Ala Ser
295 300 305 

GGG GTT CAT TTG GAA ATG ACA GGT GAG AAT GTT ACA GAA TGT ATC GGT 1016 
Gly Val His Leu Glu Met Thr Gly Glu Asn Val Thr Glu Cys Ile Gly
310 315 320 

GGC TCG CAA GCG ATC ACC GAA GAG GGT TTG AGC TGC CAT TAC TAC ACG 1064
Gly Ser Gln Ala Ile Thr Glu Glu Gly Leu Ser Cys His Tyr Tyr Thr
325 330 335 

CAA TGC GAT CCA AGA TTA AAC GCC ACC CAA GCC CTA GAA CTC GCC TTC 1112 
Gln Cys Asp Pro Arg Leu Asn Ala Thr Gln Ala Leu Glu Leu Ala Phe
340 345 350 
(2) INFORMATION FOR SEQ ID NO:633:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1145 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633:

GCGATTGTGC GGAGTCGTTT TCTCAATTTA GCGCCAATCG GATTAGAGAC ATG TTT 56

Met Phe 
1 

AAA GTA ATG ATG CAA ATG GCG ATC GTT CTC ACT TTT GCT GGC TCT ATA 104 
Lys Val Met Met Gln Met Ala Ile Val Leu Thr Phe Ala Gly Ser Ile
5 10 15 

CCG ATC GTG AAA GTG GGG CGC ATT GCC GGG CAA TTT GCC AAG CCT CGC 152 
Pro Ile Val Lys Val Gly Arg Ile Ala Gly Gln Phe Ala Lys Pro Arg
20 25 30 

TCC AAT GCG ACT GAA ATG CTG GAT AAT GAA GAA GTG TTG AGT TAC AGA 200

Ser Asn Ala Thr Glu Met Leu Asp Asn Glu Glu Val Leu Ser Tyr Arg 
35 40 45 50 

GGG GAT ATT ATC AAT GGG ATT TCC AAA AAA GAA AGA GAG CCA AAT CCT 248
Gly Asp Ile Ile Asn Gly Ile Ser Lys Lys Glu Arg Glu Pro Asn Pro
55 60 65 

GAA AGA ATG CTT AAG GCC TAC CAT CAA AGC GTA GCG ACT TTA AAC CTT 296
Glu Arg Met Leu Lys Ala Tyr His Gln Ser Val Ala Thr Leu Asn Leu
70 75 80 

ATC AGA GCC TTT GCT CAA GGC GGG TTA GCG GAT TTG GAG CAA GTG CAT 344 
Ile Arg Ala Phe Ala Gln Gly Gly Leu Ala Asp Leu Glu Gln Val His
85 90 95 

CGT TTC AAT TTG GAT TTT GTC AAA AAC AAC GAC TTT GGG CAA AAA TAC 392
Arg Phe Asn Leu Asp Phe Val Lys Asn Asn Asp Phe Gly Gln Lys Tyr
100 105 110 

CAG CAA ATC GCT GAC CGG ATC ACG CAA GCT TTA GGG TTT ATG CGA GCA 440
Gln Gln Ile Ala Asp Arg Ile Thr Gln Ala Leu Gly Phe Met Arg Ala
115 120 125 130
Ile Tyr Gly Ile Ala Phe Ala Thr Lys Glu Gly Leu Lys Asp Tyr Leu

195 200 205 

Phe Gln Ile Glu Glu Ala Lys Lys Arg Asp His Arg Lys Leu Gly Val

210 215 220 

Glu Leu Gly Leu Phe Ser Phe Asp Asp Glu Ile Gly Ala Gly Leu Pro
225 230 235 240 

Leu Trp Leu Pro Lys Gly Ala Arg Leu Arg Lys Arg Ile Glu Asp Leu

245 250 255 

Leu Ser Gln Ala Leu Leu Leu Arg Gly Tyr Glu Pro Val Lys Gly Pro

260 265 270 

Glu Ile Leu Lys Ser Asp Val Trp Lys Ile Ser Gly His Tyr Asp Asn

275 280 285 

Tyr Lys Glu Asn Met Tyr Phe Thr Thr Ile Asp Glu Gln Glu Tyr Gly

290 295 300 

Ile Lys Pro Met Asn Cys Val Gly His Ile Lys Val Tyr Gln Ser Ala
305 310 315 320 

Leu His Ser Tyr Arg Asp Leu Pro Leu Arg Phe Tyr Glu Tyr Gly Val 

325 330 335 

Val His Arg His Glu Lys Ser Gly Val Leu His Gly Leu Leu Arg Val 

340 345 350 

Arg Glu Phe Thr Gln Asp Asp Ala His Ile Phe Cys Ser Phe Glu Gln

355 360 365 

Ile Gln Ser Glu Val Ser Ala Ile Leu Asp Phe Thr His Lys Ile Met

370 375 380 

Gln Ala Phe Asp Phe Ser Tyr Glu Met Glu Leu Ser Thr Arg Pro Ala
385 390 395 400 

Lys Ser Ile Gly Asp Asp Lys Val Trp Glu Lys Ala Thr Asn Ala Leu

405 410 415 

Lys Glu Ala Leu Lys Glu His Arg Ile Asp Tyr Lys Ile Asp Glu Gly

420 425 430 

Gly Gly Ala Phe Tyr Gly Pro Lys Ile Asp Ile Lys Ile Thr Asp Ala

435 440 445 

Leu Lys Arg Lys Trp Gln Cys Gly Thr Ile Gln Val Asp Met Asn Leu

450 455 460 

Pro Glu Arg Phe Lys Leu Ala Phe Thr Asn Glu Tyr Asn His Ala Glu 
465 470 475 480 

Gln Pro Val Met Ile His Arg Ala Ile Leu Gly Ser Phe Glu Arg Phe

485 490 495 

Ile Ala Ile Leu Ser Glu His Phe Gly Gly Asn Phe Pro Phe Phe Val

500 505 510 

Ala Pro Thr Gln Ile Ala Leu Ile Pro Ile Asn Glu Glu His His Val

515 520 525 

Phe Ala Leu Lys Leu Lys Glu Ala Leu Lys Lys Arg Asp Ile Phe Val

530 535 540 

Glu Val Leu Asp Lys Asn Asp Ser Leu Asn Lys Lys Val Arg Leu Ala 
545 550 555 560 

Glu Lys Gln Lys Ile Pro Met Ile Leu Val Leu Gly Asn Glu Glu Val

565 570 575 

Glu Thr Glu Ile Leu Ser Ile Arg Asp Arg Glu Lys Gln Asp Gln Tyr

580 585 590 

Lys Met Pro Leu Lys Glu Phe Leu Asn Met Val Glu Ser Lys Met Gln

595 600 605

Glu Val Ser Phe 
610 
CAA AAA ATC CCT ATG ATT TTA GTG TTA GGG AAT GAA GAA GTG GAG ACC 1784
Gln Lys Ile Pro Met Ile Leu Val Leu Gly Asn Glu Glu Val Glu Thr
565 570 575 

GAA ATT TTA TCC ATT AGA GAG AGA GAA AAA CAA GAT CAA TAT AAA ATG 1832 
Glu lie Leu Ser lie Arg Asp Arg Glu Lys Gin Asp Gin Tyr Lys Met 
580 585 590 

CCC TTA AAG GAG TTT TTA AAC ATG GTT GAA TCT AAG ATG CAA GAG GTT 18 80 

Pro Leu Lys Glu Phe Leu Asn Met Val Glu Ser Lys Met Gin Glu Val 
595 600 605 610 

AGT TTT TGAGTAGAAA CGAAGTGTTG TTAAACGGAG ACATTAATTT TAAAGAAGTG CG 193 8 
Ser Phe 



(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 612 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:632: 

Met Ser Ala Glu Leu Ile Ala Val Tyr Lys Asp Glu Gln Ile Ile Asp
1 5 10 15

Leu Glu Ser Ala Lys Val Leu Gly Leu Ser Asp Gly Ile Lys Ala Leu
20 25 30

Asn Gly Thr Glu Pro Ile Tyr Phe Asp Asp Ser Pro Leu Ala Leu Glu
35 40 45

Val Ile Arg His Ser Cys Ala His Leu Leu Ala Gln Ser Leu Lys Ala
50 55 60

Leu Tyr Pro Asp Ala Lys Phe Phe Val Gly Pro Val Val Glu Glu Gly
65 70 75 80

Phe Tyr Tyr Asp Phe Lys Thr Ser Ser Lys Ile Ser Glu Glu Asp Leu
85 90 95

Pro Lys Ile Glu Ala Lys Met Lys Glu Phe Ala Lys Leu Lys Leu Ala
100 105 110

Ile Thr Lys Glu Thr Leu Thr Arg Glu Gln Ala Leu Glu Arg Phe Lys
115 120 125

Gly Asp Glu Leu Lys His Ala Val Met Ser Lys Ile Gly Gly Asp Ala
130 135 140

Phe Gly Val Tyr Gln Gln Gly Glu Phe Glu Asp Leu Cys Lys Gly Pro
145 150 155 160

His Leu Pro Asn Thr Arg Phe Leu Asn His Phe Lys Leu Thr Lys Leu
165 170 175

Ala Gly Ala Tyr Leu Gly Gly Asp Glu Asn Asn Glu Met Leu Ile Arg
180 185 190

CGG CAT GAA AAA AGC GGC GTG TTG CAT GGG CTT TTA AGG GTT AGG GAA 1112
Arg His Glu Lys Ser Gly Val Leu His Gly Leu Leu Arg Val Arg Glu
340 345 350

TTT ACC CAA GAT GAT GCA CAT ATT TTT TGC TCT TTT GAA CAG ATC CAA 1160
Phe Thr Gln Asp Asp Ala His Ile Phe Cys Ser Phe Glu Gln Ile Gln
355 360 365 370

AGC GAA GTG AGC GCG ATT TTA GAT TTT ACG CAC AAA ATC ATG CAA GCG 1208
Ser Glu Val Ser Ala Ile Leu Asp Phe Thr His Lys Ile Met Gln Ala
375 380 385

TTT GAT TTT AGC TAT GAA ATG GAA TTA TCC ACA AGG CCG GCT AAA TCC 1256
Phe Asp Phe Ser Tyr Glu Met Glu Leu Ser Thr Arg Pro Ala Lys Ser
390 395 400

ATA GGC GAT GAT AAA GTT TGG GAA AAG GCC ACT AAC GCT TTA AAA GAA 1304
Ile Gly Asp Asp Lys Val Trp Glu Lys Ala Thr Asn Ala Leu Lys Glu
405 410 415

GCC TTA AAA GAA CAC CGC ATT GAT TAC AAG ATT GAT GAA GGG GGA GGG 1352
Ala Leu Lys Glu His Arg Ile Asp Tyr Lys Ile Asp Glu Gly Gly Gly
420 425 430

GCT TTC TAT GGG CCT AAG ATT GAC ATT AAA ATC ACT GAC GCT TTA AAG 1400
Ala Phe Tyr Gly Pro Lys Ile Asp Ile Lys Ile Thr Asp Ala Leu Lys
435 440 445 450

CGT AAA TGG CAG TGT GGC ACG ATT CAA GTG GAT ATG AAT TTG CCT GAA 1448
Arg Lys Trp Gln Cys Gly Thr Ile Gln Val Asp Met Asn Leu Pro Glu
455 460 465

CGC TTC AAG CTC GCT TTC ACT AAT GAG TAT AAT CAC GCT GAG CAG CCG 1496
Arg Phe Lys Leu Ala Phe Thr Asn Glu Tyr Asn His Ala Glu Gln Pro
470 475 480

GTG ATG ATC CAC AGA GCG ATT TTA GGC TCG TTT GAA AGG TTT ATT GCG 1544
Val Met Ile His Arg Ala Ile Leu Gly Ser Phe Glu Arg Phe Ile Ala
485 490 495

ATT TTG AGC GAA CAT TTT GGG GGG AAT TTC CCT TTC TTT GTC GCG CCC 1592
Ile Leu Ser Glu His Phe Gly Gly Asn Phe Pro Phe Phe Val Ala Pro
500 505 510

ACT CAA ATC GCT CTC ATC CCT ATT AAT GAA GAG CAT CAT GTT TTT GCT 1640
Thr Gln Ile Ala Leu Ile Pro Ile Asn Glu Glu His His Val Phe Ala
515 520 525 530

TTG AAA TTA AAA GAG GCG CTA AAA AAG CGC GAT ATT TTT GTA GAA GTG 1688
Leu Lys Leu Lys Glu Ala Leu Lys Lys Arg Asp Ile Phe Val Glu Val
535 540 545

TTA GAT AAA AAC GAC AGC TTG AAT AAA AAG GTG CGA TTA GCC GAA AAG 1736
Leu Asp Lys Asn Asp Ser Leu Asn Lys Lys Val Arg Leu Ala Glu Lys
550 555 560

AAA GAG ACT TTA ACC AGA GAG CAA GCT TTG GAG CGT TTT AAG GGC GAT 440
Lys Glu Thr Leu Thr Arg Glu Gln Ala Leu Glu Arg Phe Lys Gly Asp
115 120 125 130

GAA TTA AAG CAT GCG GTG ATG AGT AAA ATC GGT GGC GAT GCC TTT GGC 488
Glu Leu Lys His Ala Val Met Ser Lys Ile Gly Gly Asp Ala Phe Gly
135 140 145

GTG TAT CAA CAA GGC GAG TTT GAA GAT TTG TGT AAG GGG CCG CAT CTC 536
Val Tyr Gln Gln Gly Glu Phe Glu Asp Leu Cys Lys Gly Pro His Leu
150 155 160

CCA AAC ACC CGT TTT TTA AAC CAT TTT AAG CTC ACT AAA CTG GCT GGG 584
Pro Asn Thr Arg Phe Leu Asn His Phe Lys Leu Thr Lys Leu Ala Gly
165 170 175

GCT TAT TTG GGC GGC GAT GAA AAC AAT GAA ATG CTC ATT AGA ATC TAT 632
Ala Tyr Leu Gly Gly Asp Glu Asn Asn Glu Met Leu Ile Arg Ile Tyr
180 185 190

GGA ATC GCT TTT GCC ACC AAA GAG GGT TTA AAA GAC TAT CTT TTC CAA 680
Gly Ile Ala Phe Ala Thr Lys Glu Gly Leu Lys Asp Tyr Leu Phe Gln
195 200 205 210

ATA GAA GAA GCG AAA AAA CGA GAT CAC AGA AAG CTA GGC GTG GAG CTA 728
Ile Glu Glu Ala Lys Lys Arg Asp His Arg Lys Leu Gly Val Glu Leu
215 220 225

GGG CTT TTT AGC TTT GAT GAT GAG ATA GGG GCG GGC TTA CCT TTA TGG 776
Gly Leu Phe Ser Phe Asp Asp Glu Ile Gly Ala Gly Leu Pro Leu Trp
230 235 240

CTG CCT AAA GGG GCA AGG CTT AGG AAG CGC ATT GAA GAT TTA TTG AGT 824
Leu Pro Lys Gly Ala Arg Leu Arg Lys Arg Ile Glu Asp Leu Leu Ser
245 250 255

CAA GCG TTA CTT TTA AGA GGC TAT GAG CCG GTT AAA GGT CCT GAG ATT 872
Gln Ala Leu Leu Leu Arg Gly Tyr Glu Pro Val Lys Gly Pro Glu Ile
260 265 270

TTA AAG AGC GAT GTG TGG AAA ATC AGC GGG CAT TAT GAC AAC TAT AAA 920
Leu Lys Ser Asp Val Trp Lys Ile Ser Gly His Tyr Asp Asn Tyr Lys
275 280 285 290

GAA AAC ATG TAT TTC ACC ACG ATT GAT GAG CAA GAA TAT GGC ATA AAG 968
Glu Asn Met Tyr Phe Thr Thr Ile Asp Glu Gln Glu Tyr Gly Ile Lys
295 300 305

CCT ATG AAC TGC GTG GGG CAT ATT AAA GTC TAT CAA AGC GCT TTG CAC 1016
Pro Met Asn Cys Val Gly His Ile Lys Val Tyr Gln Ser Ala Leu His
310 315 320

AGC TAC AGA GAT TTG CCC TTA AGG TTT TAT GAA TAC GGC GTG GTG CAT 1064
Ser Tyr Arg Asp Leu Pro Leu Arg Phe Tyr Glu Tyr Gly Val Val His
325 330 335

370 375 380

Lys Lys Ala Glu Lys Ala Gly Tyr Gly Tyr Ser Lys Arg Met
385 390 395

(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1939 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic RNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...1886
(D) OTHER INFORMATION: 
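
As a consistency check on the feature table above, the coding-sequence location can be converted to a codon count. The sketch below is illustrative only and is not part of the listing. It assumes the location is 1-based and inclusive and that the range excludes the stop codon; the helper name `cds_codon_count` is hypothetical. For the location 51...1886 it gives 612 codons, which agrees with the length of 612 amino acids stated for SEQ ID NO: 632.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons in a 1-based, inclusive CDS location."""
    length = end - start + 1  # inclusive range, so add 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a positive multiple of 3")
    return length // 3


# Location taken from the feature table: 51...1886
print(cds_codon_count(51, 1886))  # 612
```

The same check can be applied to the other feature tables in the listing by substituting their LOCATION values.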



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 

AAGATTGCTA AATTTTAAGG TTTGAAGAAA GGAAACATAA GTTTTAAAGA ATG AGT 56
Met Ser
1

GCG GAA CTG ATT GCT GTT TAT AAA GAC GAG CAA ATA ATA GAT TTA GAG 104
Ala Glu Leu Ile Ala Val Tyr Lys Asp Glu Gln Ile Ile Asp Leu Glu
5 10 15

AGC GCG AAA GTC TTA GGG CTG AGC GAT GGG ATT AAA GCG TTA AAC GGG 152
Ser Ala Lys Val Leu Gly Leu Ser Asp Gly Ile Lys Ala Leu Asn Gly
20 25 30

ACA GAG CCG ATA TAT TTT GAT GAT TCG CCT TTG GCT TTA GAG GTG ATT 200
Thr Glu Pro Ile Tyr Phe Asp Asp Ser Pro Leu Ala Leu Glu Val Ile
35 40 45 50

AGG CAT TCA TGC GCG CAT TTG CTT GCG CAA AGC TTG AAA GCC CTT TAT 248
Arg His Ser Cys Ala His Leu Leu Ala Gln Ser Leu Lys Ala Leu Tyr
55 60 65

CCG GAC GCG AAA TTT TTT GTA GGC CCT GTG GTA GAA GAG GGG TTT TAT 296
Pro Asp Ala Lys Phe Phe Val Gly Pro Val Val Glu Glu Gly Phe Tyr
70 75 80

TAC GAT TTC AAG ACT TCT TCA AAA ATC AGC GAA GAG GAT TTG CCT AAA 344
Tyr Asp Phe Lys Thr Ser Ser Lys Ile Ser Glu Glu Asp Leu Pro Lys
85 90 95

ATT GAA GCG AAA ATG AAA GAG TTT GCG AAG TTG AAA CTC GCT ATC ACT 392
Ile Glu Ala Lys Met Lys Glu Phe Ala Lys Leu Lys Leu Ala Ile Thr
100 105 110

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630:

Met Glu Ser Val Lys Thr Gly Lys Thr Asn Lys Val Gly Lys Asn Thr
1 5 10 15

Glu Met Ala Asn Thr Lys Ala Asn Lys Glu Thr His Phe Lys Gln Val
20 25 30

Ser Ala Ile Thr Asn Ile Ile Arg Ser Val Gly Gly Phe Phe Thr Lys
35 40 45

Ile Ala Lys Arg Val Arg Gly Leu Val Lys Lys His Pro Lys Lys Ser
50 55 60

Ser Ala Ala Leu Val Val Leu Thr His Ile Ala Cys Lys Lys Ala Lys
65 70 75 80

Glu Leu Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu
85 90 95

Asn Gln Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Ala Ser
100 105 110

Leu Leu Leu Ala Ala Cys Ser Ala Gly Asp Thr Asp Lys Gln Ile Glu
115 120 125

Leu Glu Gln Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn
130 135 140

Lys Ser Gly Ile Glu Leu Glu Gln Glu Arg Gln Lys Thr Asn Lys Ser
145 150 155 160

Gly Ile Glu Leu Ala Asn Ser Gln Ile Lys Ala Glu Gln Glu Arg Gln
165 170 175

Lys Thr Glu Gln Glu Lys Gln Lys Ala Asn Lys Ser Ala Ile Glu Leu
180 185 190

Glu Gln Gln Lys Gln Lys Thr Ile Asn Thr Gln Arg Asp Leu Ile Lys
195 200 205

Glu Gln Lys Asp Phe Ile Lys Glu Thr Glu Gln Asn Cys Gln Glu Asn
210 215 220

His Asn Gln Phe Phe Ile Lys Lys Leu Gly Ile Lys Gly Gly Ile Ala
225 230 235 240

Ile Glu Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn
245 250 255

Gln Thr Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro His
260 265 270

Ser Gln Arg Gly Ser Lys Ala Gln Glu Phe Ile Ala Tyr Leu Gln Lys
275 280 285

Glu Leu Glu Phe Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln Val
290 295 300

Asn Phe Tyr Lys Pro Ser Ser Ile Ala Tyr Leu Glu Leu Asp Pro Arg
305 310 315 320

Asp Phe Lys Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile Arg
325 330 335

Ser Lys Ala Gln Ala Lys Met Leu Glu Met Arg Asp Leu Lys Pro Asp
340 345 350

Pro Gln Ala His Leu Pro Thr Ser Gln Ser Leu Leu Phe Val Gln Lys
355 360 365

Ile Phe Ala Asp Val Asn Lys Glu Ile Glu Ala Val Ala Asn Thr Glu

GAA CAA AAT TGC CAA GAA AAT CAT AAT CAA TTC TTT ATT AAA AAA TTA 785
Glu Gln Asn Cys Gln Glu Asn His Asn Gln Phe Phe Ile Lys Lys Leu
220 225 230

GGA ATT AAG GGT GGC ATT GCT ATA GAA GTA GAA GCT GAA TGC AAA ACC 833
Gly Ile Lys Gly Gly Ile Ala Ile Glu Val Glu Ala Glu Cys Lys Thr
235 240 245

CCT AAA CCT GCA AAA ACC AAT CAA ACC CCT ATC CAG CCA AAA CAC CTC 881
Pro Lys Pro Ala Lys Thr Asn Gln Thr Pro Ile Gln Pro Lys His Leu
250 255 260 265

CCA AAC TCT AAA CAA CCT CAT TCT CAA AGA GGA TCA AAA GCG CAA GAG 929
Pro Asn Ser Lys Gln Pro His Ser Gln Arg Gly Ser Lys Ala Gln Glu
270 275 280

TTT ATC GCT TAT TTG CAA AAA GAG CTA GAA TTT CTG CCC TAT TCG CAA 977
Phe Ile Ala Tyr Leu Gln Lys Glu Leu Glu Phe Leu Pro Tyr Ser Gln
285 290 295

AAA GCT ATC GCT AAA CAA GTG AAT TTC TAT AAA CCA AGT TCT ATC GCT 1025
Lys Ala Ile Ala Lys Gln Val Asn Phe Tyr Lys Pro Ser Ser Ile Ala
300 305 310

TAT TTA GAA CTA GAT CCT AGA GAT TTT AAG GTT ACA GAA GAA TGG CAA 1073
Tyr Leu Glu Leu Asp Pro Arg Asp Phe Lys Val Thr Glu Glu Trp Gln
315 320 325

AAA GAA AAT CTA AAA ATA CGC TCT AAA GCT CAA GCT AAA ATG CTT GAA 1121
Lys Glu Asn Leu Lys Ile Arg Ser Lys Ala Gln Ala Lys Met Leu Glu
330 335 340 345

ATG AGG GAT TTA AAA CCA GAC CCA CAA GCC CAC CTT CCA ACC TCT CAA 1169
Met Arg Asp Leu Lys Pro Asp Pro Gln Ala His Leu Pro Thr Ser Gln
350 355 360

AGC CTT TTG TTC GTT CAA AAA ATA TTT GCT GAT GTT AAT AAA GAA ATA 1217
Ser Leu Leu Phe Val Gln Lys Ile Phe Ala Asp Val Asn Lys Glu Ile
365 370 375

GAA GCA GTT GCT AAT ACT GAA AAG AAA GCA GAA AAA GCG GGT TAT GGT 1265
Glu Ala Val Ala Asn Thr Glu Lys Lys Ala Glu Lys Ala Gly Tyr Gly
380 385 390

TAT AGT AAA AGG ATG TAGGCATAAG AAAATAAGAA CACCATAAAA TCGTTTTTAG C 1321
Tyr Ser Lys Arg Met
395

TTCTAGGAGA CATCAGTCAG TTTCTTGCC 1350

(2) INFORMATION FOR SEQ ID NO: 630:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 398 amino acids

(B) TYPE: amino acid

Met Glu Ser Val Lys Thr Gly Lys Thr
1 5

AAT AAG GTT GGC AAG AAT ACA GAG ATG GCT AAT ACA AAG GCA AAT AAA 161
Asn Lys Val Gly Lys Asn Thr Glu Met Ala Asn Thr Lys Ala Asn Lys
10 15 20 25

GAG ACT CAT TTT AAA CAA GTG AGC GCC ATT ACA AAT ATA ATC AGA TCA 209
Glu Thr His Phe Lys Gln Val Ser Ala Ile Thr Asn Ile Ile Arg Ser
30 35 40

GTT GGT GGG TTT TTT ACA AAA ATT GCA AAG AGA GTT AGA GGA CTT GTA 257
Val Gly Gly Phe Phe Thr Lys Ile Ala Lys Arg Val Arg Gly Leu Val
45 50 55

AAA AAA CAC CCC AAG AAA AGC AGT GCG GCA TTA GTA GTA TTG ACC CAT 305
Lys Lys His Pro Lys Lys Ser Ser Ala Ala Leu Val Val Leu Thr His
60 65 70

ATT GCG TGC AAG AAA GCG AAA GAA TTA GAC GAT AAA GTC CAA GAT AAA 353
Ile Ala Cys Lys Lys Ala Lys Glu Leu Asp Asp Lys Val Gln Asp Lys
75 80 85

TCC AAA CAA GCT GAA AAA GAA AAT CAA ATC AAT TGG TGG AAA TAT TCA 401
Ser Lys Gln Ala Glu Lys Glu Asn Gln Ile Asn Trp Trp Lys Tyr Ser
90 95 100 105

GGA TTA ACA ATA GCG GCA AGT TTA TTA TTA GCC GCT TGT AGC GCT GGT 449
Gly Leu Thr Ile Ala Ala Ser Leu Leu Leu Ala Ala Cys Ser Ala Gly
110 115 120

GAT ACT GAT AAA CAG ATA GAA CTA GAA CAA GAA AAA AAG GAA GCT GAA 497
Asp Thr Asp Lys Gln Ile Glu Leu Glu Gln Glu Lys Lys Glu Ala Glu
125 130 135

AAC GCT AGG GAT AGA GCG AAC AAG AGT GGG ATA GAA CTA GAA CAA GAA 545
Asn Ala Arg Asp Arg Ala Asn Lys Ser Gly Ile Glu Leu Glu Gln Glu
140 145 150

AGA CAG AAA ACA AAC AAG AGT GGG ATA GAA CTC GCT AAT AGT CAA ATA 593
Arg Gln Lys Thr Asn Lys Ser Gly Ile Glu Leu Ala Asn Ser Gln Ile
155 160 165

AAA GCA GAA CAA GAA AGA CAA AAG ACA GAA CAA GAA AAA CAA AAA GCA 641
Lys Ala Glu Gln Glu Arg Gln Lys Thr Glu Gln Glu Lys Gln Lys Ala
170 175 180 185

AAT AAG AGT GCG ATA GAG TTA GAA CAG CAA AAA CAA AAG ACC ATT AAT 689
Asn Lys Ser Ala Ile Glu Leu Glu Gln Gln Lys Gln Lys Thr Ile Asn
190 195 200

ACA CAA AGA GAT TTG ATT AAA GAA CAG AAA GAT TTC ATT AAA GAA ACA 737
Thr Gln Arg Asp Leu Ile Lys Glu Gln Lys Asp Phe Ile Lys Glu Thr
205 210 215

Gln Asn Ala Leu Asn Asn Leu Thr Thr Pro Ile Asp Val Leu Thr Ile
65 70 75 80

Thr Ala Asn Gly Glu Pro Thr Leu Tyr Pro His Leu Leu Glu Leu Ile
85 90 95

Gln Ser Ile Lys Pro Phe Leu Lys Gly Val Lys Thr Leu Ile Leu Ser
100 105 110

Asn Gly Ser Leu Phe Tyr Glu Pro Lys Val Gln Gln Ala Leu Lys Glu
115 120 125

Phe Asp Ile Val Lys Phe Ser Leu Asp Ala Ile Asp Leu Lys Ala Phe
130 135 140

Glu Arg Val Asp Lys Pro Tyr Ser Lys Asp Ile Asn Lys Ile Leu Glu
145 150 155 160

Gly Ile Leu Arg Phe Ser Gln Ile Tyr Gln Gly Gln Leu Val Ala Glu
165 170 175

Val Leu Leu Ile Lys Gly Val Asn Asp Ser Ala Asn Asn Leu Lys Leu
180 185 190

Ile Ala Ala Phe Leu Lys Gln Ile Asn Ile Ala Arg Val Asp Leu Ser
195 200 205

Thr Ile Asp Arg Pro Ser Ser Phe Lys Ala Pro Lys Leu Ser Glu Asp
210 215 220

Glu Leu Leu Lys Cys Ser Leu Phe Phe Glu Gly Leu Cys Val Ser Leu
225 230 235 240

Pro Lys Arg Ser Ile Thr Gln Ala Lys Lys Leu Ile Ser Cys Gly Ile
245 250 255

Asp Glu Leu Leu Ala Leu Ile Ser Arg Arg Pro Leu Ser Ala Glu Glu
260 265 270

Ala Pro Leu Ile Leu Asp Ser Asn Ala Phe Lys His Leu Glu Thr Leu
275 280 285

Leu Asn His Lys Gln Ile Thr Ile Lys Lys Val Gly Ser Leu Glu Phe
290 295 300

Tyr Cys Ala Phe
305



(2) INFORMATION FOR SEQ ID NO: 629:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 87... 1280 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629:

ATCAATCTAA CTTGAGTGGA TTTTTCGTAT TAGTTTCCAT GATATAATTT TGAAAAGTAA 60

GATTGTTTTT TAAAATAAGG TTGGTA ATG GAA TCA GTA AAA ACA GGA AAA ACA 113

GCC TTT TTA AAA CAA ATC AAT ATA GCC AGA GTG GAT TTA AGC ACC ATA 680
Ala Phe Leu Lys Gln Ile Asn Ile Ala Arg Val Asp Leu Ser Thr Ile
195 200 205 210

GAC AGA CCC TCA AGC TTT AAA GCC CCT AAA TTA AGC GAA GAT GAA TTG 728
Asp Arg Pro Ser Ser Phe Lys Ala Pro Lys Leu Ser Glu Asp Glu Leu
215 220 225

TTA AAA TGC TCT TTA TTT TTT GAA GGG CTT TGC GTG AGT TTG CCT AAA 776
Leu Lys Cys Ser Leu Phe Phe Glu Gly Leu Cys Val Ser Leu Pro Lys
230 235 240

CGA TCC ATT ACT CAA GCT AAA AAA TTG ATT TCT TGC GGT ATA GAC GAA 824
Arg Ser Ile Thr Gln Ala Lys Lys Leu Ile Ser Cys Gly Ile Asp Glu
245 250 255

TTG CTC GCT TTA ATT TCC AGG CGC CCT TTA AGC GCA GAA GAA GCC CCC 872
Leu Leu Ala Leu Ile Ser Arg Arg Pro Leu Ser Ala Glu Glu Ala Pro
260 265 270

CTA ATT CTA GAT TCT AAC GCT TTT AAG CAT TTA GAA ACT TTG TTA AAC 920
Leu Ile Leu Asp Ser Asn Ala Phe Lys His Leu Glu Thr Leu Leu Asn
275 280 285 290

CAT AAG CAA ATT ACG ATT AAA AAA GTC GGC TCT TTG GAG TTT TAT TGC 968
His Lys Gln Ile Thr Ile Lys Lys Val Gly Ser Leu Glu Phe Tyr Cys
295 300 305

GCG TTT TAACCTCCAT TTGTAAGTTT TACCTTACTT TAGGGATAGC TTAAGCTTTT AA 1026
Ala Phe

A 1027
(2) INFORMATION FOR SEQ ID NO: 628:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 308 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628:

Met Ala Lys Glu Asn Pro Pro Ile Val Phe Gly Pro Val Leu Ser Arg
1 5 10 15

Arg Phe Gly Lys Ser Leu Gly Val Asp Leu Ser Pro Ser Lys Lys Gln
20 25 30

Cys Asn Tyr Asn Cys Ile Tyr Cys Glu Leu Gly Lys Ala Lys Pro Ile
35 40 45

Glu Arg Met Glu Glu Val Ile Lys Val Glu Thr Leu Ile Asn Ala Ile
50 55 60

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627:

GGCGCATGAA TGCATTAAAT GTAAAGAACG CGTGTTTTTA GAGGAAGATA ATG GCT 56 

Met Ala 
1 

AAA GAA AAT CCG CCT ATC GTT TTT GGG CCT GTT TTA TCC AGG CGT TTT 104
Lys Glu Asn Pro Pro Ile Val Phe Gly Pro Val Leu Ser Arg Arg Phe
5 10 15

GGG AAG TCT TTG GGC GTG GAT CTA TCG CCC TCT AAA AAA CAA TGC AAT 152
Gly Lys Ser Leu Gly Val Asp Leu Ser Pro Ser Lys Lys Gln Cys Asn
20 25 30

TAC AAT TGC ATT TAT TGC GAG TTG GGT AAA GCC AAG CCC ATT GAA CGC 200
Tyr Asn Cys Ile Tyr Cys Glu Leu Gly Lys Ala Lys Pro Ile Glu Arg
35 40 45 50

ATG GAA GAA GTG ATC AAA GTG GAA ACC TTG ATT AAC GCC ATT CAA AAC 248
Met Glu Glu Val Ile Lys Val Glu Thr Leu Ile Asn Ala Ile Gln Asn
55 60 65

GCC CTA AAC AAC CTC ACC ACC CCC ATT GAT GTT TTA ACC ATT ACC GCT 296
Ala Leu Asn Asn Leu Thr Thr Pro Ile Asp Val Leu Thr Ile Thr Ala
70 75 80

AAT GGC GAA CCC ACG CTA TAC CCT CAT TTA TTA GAG CTT ATC CAA AGC 344
Asn Gly Glu Pro Thr Leu Tyr Pro His Leu Leu Glu Leu Ile Gln Ser
85 90 95

ATC AAG CCT TTT TTA AAG GGC GTT AAA ACT TTG ATT TTA AGC AAT GGC 392
Ile Lys Pro Phe Leu Lys Gly Val Lys Thr Leu Ile Leu Ser Asn Gly
100 105 110

TCG CTC TTT TAT GAG CCA AAA GTC CAG CAA GCC TTA AAG GAA TTT GAC 440
Ser Leu Phe Tyr Glu Pro Lys Val Gln Gln Ala Leu Lys Glu Phe Asp
115 120 125 130

ATC GTT AAA TTT TCT TTA GAC GCT ATT GAT TTG AAA GCC TTT GAA AGA 488
Ile Val Lys Phe Ser Leu Asp Ala Ile Asp Leu Lys Ala Phe Glu Arg
135 140 145

GTG GAT AAA CCC TAT TCT AAA GAC ATT AAT AAG ATT TTA GAG GGG ATT 536
Val Asp Lys Pro Tyr Ser Lys Asp Ile Asn Lys Ile Leu Glu Gly Ile
150 155 160

TTG CGC TTT TCT CAA ATT TAT CAA GGG CAA TTG GTG GCT GAA GTG TTG 584
Leu Arg Phe Ser Gln Ile Tyr Gln Gly Gln Leu Val Ala Glu Val Leu
165 170 175

TTA ATT AAG GGC GTG AAT GAT AGC GCG AAC AAC TTA AAA CTC ATC GCT 632
Leu Ile Lys Gly Val Asn Asp Ser Ala Asn Asn Leu Lys Leu Ile Ala
180 185 190

80 85 90

AAG GAG TGG GTG AAA ATA GAA TCC CAT TTT GAG GTA AAA TCC CAA AAG 579
Lys Glu Trp Val Lys Ile Glu Ser His Phe Glu Val Lys Ser Gln Lys
95 100 105

TAT AAC GCT TAAAGTGGCT TAAAAAAAGG CTTTATTGGT GGTTTAAATC TTTGATTTT 637
Tyr Asn Ala
110

AGATCGGAT 646

(2) INFORMATION FOR SEQ ID NO: 626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:626: 

Met Ala Lys Arg Val Pro Asn Met Glu Leu Phe Val Phe Cys Glu Asp
1 5 10 15

Leu Glu Phe Thr Gln Asn Leu Asp Leu Gly Tyr Pro Phe Met Asp Met
20 25 30

Thr Thr Arg Asp Lys Glu Glu Glu Ala Tyr Trp Asp Met Leu Leu Met
35 40 45

Gln Ser Cys Gln His Gly Ile Ile Ala Asn Ser Thr Tyr Ser Trp Trp
50 55 60

Ala Ala Tyr Leu Ile Glu Asn Pro Glu Lys Ile Ile Ile Gly Pro Lys
65 70 75 80

His Trp Leu Phe Gly His Glu Asn Ile Leu Cys Lys Glu Trp Val Lys
85 90 95

Ile Glu Ser His Phe Glu Val Lys Ser Gln Lys Tyr Asn Ala
100 105 110

(2) INFORMATION FOR SEQ ID NO: 627:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 974 
(D) OTHER INFORMATION:

Gly Ser Ser Thr Thr Gly Ile Ala Ala Asn Leu Leu Lys Arg Glu Phe
225 230 235 240

Ile Gly Ile Glu Lys Glu Ser Glu Phe Ile Lys Ile Ser Met Asp Arg
245 250 255

Lys Ile Glu Leu Asp Ala Arg Tyr Lys Glu Ile Arg Ser Lys Ile Lys
260 265 270

Asp Leu Asn His Gln
275

(2) INFORMATION FOR SEQ ID NO: 625:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 646 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 259... 588 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625:

CTTGACTTAT TTTTTTGGCT ATTTCCAAGA TCCACGATAC TTTGATGCTA TATCCCCTTT 60

AATCAAGCAA ACCTTCACTC TACCCCCCCC CCCCCCGAAA ATAATAAGAA TAATAATAAA 120

AAAGAGGAAG AATATCAGTG CAAGCTTTCT TTGATTTTAG CCGCTAAAAA CAGCGTGTTT 180

GTGCATATAA GAAGAGGGGA TTATGTGGGG ATTGGCTGTC AGCTTGGTAT TGACTATCAA 240

AAAAAGGCGC TTGAGTAT ATG GCA AAG CGC GTG CCA AAC ATG GAG CTT TTT 291
Met Ala Lys Arg Val Pro Asn Met Glu Leu Phe
1 5 10

GTG TTT TGC GAA GAC TTA GAA TTC ACG CAA AAT CTT GAT CTT GGC TAC 339
Val Phe Cys Glu Asp Leu Glu Phe Thr Gln Asn Leu Asp Leu Gly Tyr
15 20 25

CCT TTT ATG GAC ATG ACC ACT AGG GAT AAA GAA GAA GAG GCG TAT TGG 387
Pro Phe Met Asp Met Thr Thr Arg Asp Lys Glu Glu Glu Ala Tyr Trp
30 35 40

GAC ATG CTG CTC ATG CAA TCT TGT CAG CAT GGC ATT ATC GCT AAT AGC 435
Asp Met Leu Leu Met Gln Ser Cys Gln His Gly Ile Ile Ala Asn Ser
45 50 55

ACT TAT AGC TGG TGG GCG GCC TAT TTG ATA GAA AAT CCA GAA AAA ATC 483
Thr Tyr Ser Trp Trp Ala Ala Tyr Leu Ile Glu Asn Pro Glu Lys Ile
60 65 70 75

ATT ATT GGC CCC AAA CAC TGG CTT TTT GGG CAT GAG AAT ATC CTT TGT 531
Ile Ile Gly Pro Lys His Trp Leu Phe Gly His Glu Asn Ile Leu Cys

ATA GAA AAA GAA AGC GAG TTT ATC AAA ATA TCC ATG GAT AGA AAA ATA 824
Ile Glu Lys Glu Ser Glu Phe Ile Lys Ile Ser Met Asp Arg Lys Ile
245 250 255

GAA TTA GAC GCT CGC TAT AAA GAA ATC CGA TCT AAA ATC AAA GAT TTA 872
Glu Leu Asp Ala Arg Tyr Lys Glu Ile Arg Ser Lys Ile Lys Asp Leu
260 265 270

AAC CAC CAA TAAAGCCTTT TTTTAAGCCA CTTTAAGCGT TATACTTTTG GGATTTTAC 930
Asn His Gln
275

CTCA 934

(2) INFORMATION FOR SEQ ID NO: 624:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 

Met Val Ile Ala His Ser Asn Glu Ile Ala Arg Pro Ile Phe Lys Ser
1 5 10 15

Gln Asp Gln Leu Phe Thr Leu Tyr Gln Gly Asp Cys Asn Glu Val Leu
20 25 30

Pro Gln Phe Glu Asn Gln Phe Asp Leu Ile Phe Ala Asp Pro Pro Tyr
35 40 45

Phe Leu Ser Asn Asp Gly Leu Ser Ile Gln Ser Gly Lys Ile Val Ser
50 55 60

Val Asn Lys Gly Asp Trp Asp Lys Glu Asp Gly Ile Asn Gly Ile Asp
65 70 75 80

Glu Phe Asn Tyr Gln Trp Ile Asn Asn Ala Lys Lys Ala Leu Lys Asp
85 90 95

Thr Gly Ser Leu Leu Ile Ser Gly Thr Tyr His Asn Ile Phe Ser Leu
100 105 110

Gly Cys Val Leu Gln Lys Leu Asp Phe Lys Ile Leu Asn Leu Ile Thr
115 120 125

Trp Gln Lys Thr Asn Pro Pro Pro Asn Phe Ser Cys Arg Tyr Leu Thr
130 135 140

His Ser Ala Glu Gln Ile Ile Trp Ala Arg Lys Ser Arg Lys His Lys
145 150 155 160

His Val Phe Asn Tyr Glu Val Leu Lys Lys Ile Asn Asn Asp Lys Gln
165 170 175

Met Arg Asp Val Trp Ser Phe Pro Ala Ile Ala Pro Trp Glu Lys Val
180 185 190

Asn Gly Lys His Pro Thr Gln Lys Pro Leu Ala Leu Leu Val Arg Leu
195 200 205

Leu Leu Met Ala Ser Asp Glu Asn Ser Leu Ile Gly Asp Pro Phe Ser
210 215 220

Gln Leu Phe Thr Leu Tyr Gln Gly Asp Cys Asn Glu Val Leu Pro Gln
20 25 30
TTT GAA AAC CAG TTT GAT TTG ATT TTT GCT GAT CCG CCT TAT TTC CTC 200
Phe Glu Asn Gln Phe Asp Leu Ile Phe Ala Asp Pro Pro Tyr Phe Leu
35 40 45 50

TCT AAT GAC GGC TTA AGC ATA CAG AGC GGT AAA ATC GTG AGC GTC AAT 248
Ser Asn Asp Gly Leu Ser Ile Gln Ser Gly Lys Ile Val Ser Val Asn
55 60 65

AAA GGC GAT TGG GAT AAA GAA GAT GGG ATT AAT GGT ATT GAT GAG TTT 296
Lys Gly Asp Trp Asp Lys Glu Asp Gly Ile Asn Gly Ile Asp Glu Phe
70 75 80

AAT TAC CAG TGG ATA AAC AAC GCT AAA AAG GCT TTA AAA GAC ACA GGA 344
Asn Tyr Gln Trp Ile Asn Asn Ala Lys Lys Ala Leu Lys Asp Thr Gly
85 90 95

AGC CTT TTA ATC AGC GGG ACT TAC CAC AAC ATC TTT TCT TTG GGG TGT 392
Ser Leu Leu Ile Ser Gly Thr Tyr His Asn Ile Phe Ser Leu Gly Cys
100 105 110

GTT TTA CAA AAA TTG GAT TTT AAG ATT TTA AAC CTC ATC ACC TGG CAA 440
Val Leu Gln Lys Leu Asp Phe Lys Ile Leu Asn Leu Ile Thr Trp Gln
115 120 125 130

AAA ACC AAC CCT CCT CCC AAT TTC AGC TGC CGT TAT TTG ACG CAT TCA 488
Lys Thr Asn Pro Pro Pro Asn Phe Ser Cys Arg Tyr Leu Thr His Ser
135 140 145

GCT GAG CAA ATC ATT TGG GCG AGA AAA AGC CGC AAA CAC AAG CAT GTT 536
Ala Glu Gln Ile Ile Trp Ala Arg Lys Ser Arg Lys His Lys His Val
150 155 160

TTT AAC TAT GAG GTT TTA AAA AAG ATC AAT AAC GAC AAG CAA ATG CGC 584
Phe Asn Tyr Glu Val Leu Lys Lys Ile Asn Asn Asp Lys Gln Met Arg
165 170 175

GAT GTG TGG AGC TTC CCA GCG ATC GCT CCT TGG GAA AAA GTT AAT GGC 632
Asp Val Trp Ser Phe Pro Ala Ile Ala Pro Trp Glu Lys Val Asn Gly
180 185 190

AAG CAC CCC ACT CAA AAA CCC CTC GCT TTA TTA GTG CGC TTG CTT TTA 680
Lys His Pro Thr Gln Lys Pro Leu Ala Leu Leu Val Arg Leu Leu Leu
195 200 205 210

ATG GCG AGC GAT GAA AAT TCT CTC ATT GGC GAT CCT TTT AGC GGG AGC 728
Met Ala Ser Asp Glu Asn Ser Leu Ile Gly Asp Pro Phe Ser Gly Ser
215 220 225

TCT ACC ACA GGC ATT GCG GCT AAT CTT TTG AAG AGG GAA TTT ATT GGC 776
Ser Thr Thr Gly Ile Ala Ala Asn Leu Leu Lys Arg Glu Phe Ile Gly
230 235 240

370 375 380

Lys Glu Phe Gly Arg Leu Lys Glu Glu Lys Gly Asp Asn Asp Asn Phe
385 390 395 400

Arg Ile Lys Arg Tyr Leu Ser Lys Tyr Thr Ile Asn Pro Ala Ile Ala
405 410 415

His Gly Ile Ser Glu Tyr Val Gly Ser Val Glu Val Gly Lys Val Ala
420 425 430

Asp Leu Val Leu Trp Ser Pro Ala Phe Phe Gly Val Lys Pro Asn Met
435 440 445

Ile Ile Lys Gly Gly Phe Ile Ala Leu Ser Gln Met Gly Asp Ala Asn
450 455 460

Ala Ser Ile Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Ala
465 470 475 480

His His Gly Lys Ala Lys Tyr Asp Ala Asn Ile Thr Phe Val Ser Gln
485 490 495

Ala Ala Tyr Asp Lys Gly Ile Lys Glu Glu Leu Gly Leu Glu Arg Gln
500 505 510

Val Leu Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Met Gln
515 520 525

Phe Asn Asp Thr Thr Ala His Ile Glu Val Asn Pro Glu Thr Tyr His
530 535 540

Val Phe Val Asp Gly Lys Glu Val Thr Ser Lys Pro Ala Asn Lys Val
545 550 555 560

Ser Leu Ala Gln Leu Phe Ser Ile Phe
565

(2) INFORMATION FOR SEQ ID NO:623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 881 
(D) OTHER INFORMATION: 
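
The three-letter residues printed beneath each nucleotide row can be reproduced by translating the coding sequence codon by codon. The sketch below is a minimal illustration, not part of the listing: it assumes the standard genetic code, the function name `translate` is hypothetical, and the input is the first eight codons of the coding sequence of SEQ ID NO: 623 (starting at position 51).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; codons are ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a DNA string (spaces allowed) into three-letter residues.

    Translation stops at the first stop codon; a trailing partial codon
    is ignored.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(THREE_LETTER[aa])
    return residues


print(" ".join(translate("ATG GTA ATC GCG CAT TCT AAT GAA")))
# Met Val Ile Ala His Ser Asn Glu
```

A check of this kind also exposes transcription errors: a stop codon such as TAG appearing where the translation line shows Tyr indicates a misread base rather than a real stop.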



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 

GAGATCTACA ATTTGAGCAA TGTAAATGAT TTTATATCAA AGGCACAAAA ATG GTA 56
Met Val
1

ATC GCG CAT TCT AAT GAA ATC GCA CGC CCC ATT TTT AAA AGC CAA GAC 104
Ile Ala His Ser Asn Glu Ile Ala Arg Pro Ile Phe Lys Ser Gln Asp
5 10 15

CAG CTT TTC ACT CTT TAT CAA GGG GAT TGT AAT GAG GTT TTG CCC CAA 152

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622:



Met 


Lys 


Lys 


He 


Ser 


Arg 


Lys 


Glu 


Tyr 


Val 


Ser 


Met 


Tyr 


Gly 


Pro 


Thr 


1 








5 










10 










15 




Thr 


Gly 


Asp 


Lys 


Val 


Arg 


Leu 


Gly 


Asp 


Thr 


Asp 


Leu 


He 


Ala 


Glu 


Val 








20 










25 










30 






Glu 


His 


Asp 


Tyr 


Thr 


He 


Tyr 


Gly 


Glu 


Glu 


Leu 


Lys 


Phe 


Gly 


Gly 


Gly 






35 










40 










45 








Lys 


Thr 


Leu 


Arg 


Glu 


Gly 


Met 


Ser 


Gin 


Ser 


Asn 


Asn 


Pro 


Ser 


Lys 


Glu 




50 










55 










60 










Glu 


Leu 


Asp 


Leu 


He 


He 


Thr 


Asn 


Ala 


Leu 


He 


Val 


Asp 


Tyr 


Thr 


Gly 


65 










70 










75 










80 


He 


Tyr 


Lys 


Ala 


Asp 


He 


Gly 


He 


Lys 


Asp 


Gly 


Lys 


He 


Ala 


Gly 


He 










85 










90 










95 




Gly 


Lys 


Gly 


Gly 


Asn 


Lys 


Asp 


Met 


Gin 


Asp 


Gly 


Val 


Lys 


Asn 


Asn 


Leu 








100 










105 










110 






Ser 


Val 


Gly 


Pro 


Ala 


Thr 


Glu 


Ala 


Leu 


Ala 


Gly 


Glu 


Gly 


Leu 


He 


Val 






115 










120 










125 








Thr 


Ala 


Gly 


Gly 


He 


Asp 


Thr 


His 


He 


His 


Phe 


He 


Ser 


Pro 


Gin 


Gin 




130 










135 










140 










He 


Pro 


Thr 


Ala 


Phe 


Ala 


Ser 


Gly 


Val 


Thr 


Thr 


Met 


He 


Gly 


Gly 


Gly 


145 










150 










155 










160 


Thr 


Gly 


Pro 


Ala 


Asp 


Gly 


Thr 


Asn 


Ala 


Thr 


Thr 


He 


Thr Pro Gly Arg
165 170 175

Arg Asn Leu Lys Trp Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn
180 185 190

Leu Gly Phe Leu Ala Lys Gly Asn Ala Ser Asn Asp Ala Ser Leu Ala
195 200 205

Asp Gln Ile Glu Ala Gly Ala Ile Gly Phe Lys Ile His Glu Asp Trp
210 215 220

Gly Thr Thr Pro Ser Ala Ile Asn His Ala Leu Asp Val Ala Asp Lys
225 230 235 240

Tyr Asp Val Gln Val Ala Ile His Thr Asp Thr Leu Asn Glu Ala Gly
245 250 255

Cys Val Glu Asp Thr Met Ala Ala Ile Ala Gly Arg Thr Met His Thr
260 265 270

Phe His Thr Glu Gly Ala Gly Gly Gly His Ala Pro Asp Ile Ile Lys
275 280 285

Val Ala Gly Glu His Asn Ile Leu Pro Ala Ser Thr Asn Pro Thr Ile
290 295 300

Pro Phe Thr Val Asn Thr Glu Ala Glu His Met Asp Met Leu Met Val
305 310 315 320

Cys His His Leu Asp Lys Ser Ile Lys Glu Asp Val Gln Phe Ala Asp
325 330 335

Ser Arg Ile Arg Pro Gln Thr Ile Ala Ala Glu Asp Thr Leu His Asp
340 345 350

Met Gly Ile Phe Ser Ile Thr Ser Ser Asp Ser Gln Ala Met Gly Arg
355 360 365

Val Gly Glu Val Ile Thr Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys
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TTT GGC CGC TTG AAA GAA GAA AAA GGC GAT AAC GAC AAC TTC AGG ATC 1256

Phe Gly Arg Leu Lys Glu Glu Lys Gly Asp Asn Asp Asn Phe Arg Ile
390 395 400

AAA CGC TAC TTG TCT AAA TAC ACC ATT AAC CCA GCG ATC GCT CAT GGG 1304
Lys Arg Tyr Leu Ser Lys Tyr Thr Ile Asn Pro Ala Ile Ala His Gly
405 410 415

ATT AGC GAG TAT GTA GGT TCT GTA GAA GTG GGC AAA GTG GCT GAC TTG 1352
Ile Ser Glu Tyr Val Gly Ser Val Glu Val Gly Lys Val Ala Asp Leu
420 425 430

GTA TTG TGG AGT CCC GCA TTC TTT GGC GTA AAA CCC AAC ATG ATC ATC 1400
Val Leu Trp Ser Pro Ala Phe Phe Gly Val Lys Pro Asn Met Ile Ile
435 440 445 450

AAA GGC GGG TTC ATT GCG TTG AGT CAA ATG GGT GAC GCG AAC GCT TCT 1448
Lys Gly Gly Phe Ile Ala Leu Ser Gln Met Gly Asp Ala Asn Ala Ser
455 460 465

ATC CCT ACC CCA CAA CCA GTT TAT TAC AGA GAA ATG TTC GCT CAT CAT 1496
Ile Pro Thr Pro Gln Pro Val Tyr Tyr Arg Glu Met Phe Ala His His
470 475 480

GGT AAA GCC AAA TAC GAT GCA AAC ATC ACT TTT GTG TCT CAA GCG GCT 1544
Gly Lys Ala Lys Tyr Asp Ala Asn Ile Thr Phe Val Ser Gln Ala Ala
485 490 495

TAT GAC AAA GGC ATT AAA GAA GAA TTA GGG CTT GAA AGA CAA GTG TTG 1592
Tyr Asp Lys Gly Ile Lys Glu Glu Leu Gly Leu Glu Arg Gln Val Leu
500 505 510

CCG GTA AAA AAT TGC AGA AAC ATC ACT AAA AAA GAC ATG CAA TTC AAC 1640
Pro Val Lys Asn Cys Arg Asn Ile Thr Lys Lys Asp Met Gln Phe Asn
515 520 525 530

GAC ACT ACC GCT CAC ATT GAA GTC AAT CCT GAA ACT TAC CAT GTG TTC 1688
Asp Thr Thr Ala His Ile Glu Val Asn Pro Glu Thr Tyr His Val Phe
535 540 545

GTG GAT GGC AAA GAA GTA ACT TCT AAA CCA GCC AAT AAA GTG AGC TTG 1736
Val Asp Gly Lys Glu Val Thr Ser Lys Pro Ala Asn Lys Val Ser Leu
550 555 560

GCG CAA CTC TTT AGC ATT TTC TAGGATTTTT TAGGAGCAAC GCTCCTTAAA TCCT 1791
Ala Gln Leu Phe Ser Ile Phe
565

TAGTTTTTAG CTCTCTGATT TTTT 1815 
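
The feature table for SEQ ID NO: 621 gives the coding sequence as positions 51 to 1757, and SEQ ID NO: 622 is listed as 569 amino acids. The two figures can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_codon_count` is hypothetical, and it assumes the LOCATION coordinates are 1-based and inclusive and that the stop codon lies outside the stated range.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of codons in a coding sequence.

    Assumes 1-based, inclusive coordinates, as in the LOCATION field.
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(
            f"CDS {start}...{end} spans {length} nt, "
            "which is not a positive multiple of 3"
        )
    return length // 3


# SEQ ID NO: 621, LOCATION 51...1757
print(cds_codon_count(51, 1757))  # 569
```

The same check applied to the other coding sequences in this part of the listing (58...852, 107...673 and 54...3608) gives 265, 189 and 1185 codons, matching the stated lengths of SEQ ID NOs: 620, 618 and 616.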
(2) INFORMATION FOR SEQ ID NO: 622: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 amino acids

(B) TYPE: amino acid 
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CCT GCT GAT GGC ACT AAT GCG ACT ACT ATC ACT CCA GGC AGA AGA AAT 584

Pro Ala Asp Gly Thr Asn Ala Thr Thr Ile Thr Pro Gly Arg Arg Asn
165 170 175

TTA AAA TGG ATG CTC AGA GCG GCT GAA GAA TAT TCT ATG AAC TTA GGT 632
Leu Lys Trp Met Leu Arg Ala Ala Glu Glu Tyr Ser Met Asn Leu Gly
180 185 190

TTC TTG GCT AAA GGT AAC GCT TCT AAC GAC GCG AGC TTA GCC GAT CAA 680
Phe Leu Ala Lys Gly Asn Ala Ser Asn Asp Ala Ser Leu Ala Asp Gln
195 200 205 210

ATT GAA GCT GGT GCG ATT GGC TTT AAA ATC CAC GAA GAC TGG GGC ACC 728
Ile Glu Ala Gly Ala Ile Gly Phe Lys Ile His Glu Asp Trp Gly Thr
215 220 225

ACT CCT TCT GCA ATC AAT CAT GCG TTA GAT GTT GCA GAC AAA TAC GAT 776
Thr Pro Ser Ala Ile Asn His Ala Leu Asp Val Ala Asp Lys Tyr Asp
230 235 240

GTG CAA GTC GCT ATC CAC ACA GAC ACT TTG AAT GAA GCC GGT TGC GTG 824
Val Gln Val Ala Ile His Thr Asp Thr Leu Asn Glu Ala Gly Cys Val
245 250 255

GAA GAC ACT ATG GCA GCT ATT GCC GGA CGC ACT ATG CAC ACT TTC CAC 872
Glu Asp Thr Met Ala Ala Ile Ala Gly Arg Thr Met His Thr Phe His
260 265 270

ACT GAA GGT GCT GGC GGC GGA CAC GCT CCT GAT ATT ATT AAA GTA GCT 920
Thr Glu Gly Ala Gly Gly Gly His Ala Pro Asp Ile Ile Lys Val Ala
275 280 285 290

GGT GAA CAC AAC ATT CTT CCC GCT TCC ACT AAC CCC ACT ATC CCT TTC 968
Gly Glu His Asn Ile Leu Pro Ala Ser Thr Asn Pro Thr Ile Pro Phe
295 300 305

ACT GTG AAT ACA GAA GCA GAA CAC ATG GAC ATG CTT ATG GTG TGC CAC 1016
Thr Val Asn Thr Glu Ala Glu His Met Asp Met Leu Met Val Cys His
310 315 320

CAC TTG GAT AAA AGC ATT AAA GAA GAT GTT CAG TTC GCT GAT TCA AGG 1064
His Leu Asp Lys Ser Ile Lys Glu Asp Val Gln Phe Ala Asp Ser Arg
325 330 335

ATC CGC CCT CAA ACC ATT GCG GCT GAA GAC ACT TTG CAT GAC ATG GGG 1112
Ile Arg Pro Gln Thr Ile Ala Ala Glu Asp Thr Leu His Asp Met Gly
340 345 350

ATT TTC TCA ATC ACC AGC TCT GAC TCT CAA GCT ATG GGT CGT GTG GGT 1160
Ile Phe Ser Ile Thr Ser Ser Asp Ser Gln Ala Met Gly Arg Val Gly
355 360 365 370

GAA GTT ATC ACT AGA ACT TGG CAA ACA GCT GAC AAA AAC AAA AAA GAA 1208
Glu Val Ile Thr Arg Thr Trp Gln Thr Ala Asp Lys Asn Lys Lys Glu
375 380 385
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...1757
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621: 

ATGGCGCTAA AAGCGATGAC AACTATGTAA AAACAATTAA GGAGTAAGAA ATG AAA 56
Met Lys
1

AAG ATT AGC AGA AAA GAA TAT GTT TCT ATG TAT GGC CCT ACT ACA GGC 104
Lys Ile Ser Arg Lys Glu Tyr Val Ser Met Tyr Gly Pro Thr Thr Gly
5 10 15

GAT AAA GTG AGA TTG GGC GAT ACA GAC TTG ATC GCT GAA GTA GAA CAT 152
Asp Lys Val Arg Leu Gly Asp Thr Asp Leu Ile Ala Glu Val Glu His
20 25 30

GAC TAC ACC ATT TAT GGC GAA GAG CTT AAA TTC GGT GGC GGT AAA ACC 200
Asp Tyr Thr Ile Tyr Gly Glu Glu Leu Lys Phe Gly Gly Gly Lys Thr
35 40 45 50

CTG AGA GAA GGC ATG AGC CAA TCC AAC AAC CCT AGC AAA GAA GAA TTG 248
Leu Arg Glu Gly Met Ser Gln Ser Asn Asn Pro Ser Lys Glu Glu Leu
55 60 65

GAT CTA ATC ATC ACT AAC GCT TTA ATC GTG GAT TAC ACC GGT ATT TAT 296
Asp Leu Ile Ile Thr Asn Ala Leu Ile Val Asp Tyr Thr Gly Ile Tyr
70 75 80

AAA GCG GAT ATT GGT ATT AAA GAT GGC AAA ATC GCT GGC ATT GGT AAA 344
Lys Ala Asp Ile Gly Ile Lys Asp Gly Lys Ile Ala Gly Ile Gly Lys
85 90 95

GGC GGT AAC AAA GAC ATG CAA GAT GGC GTT AAA AAC AAT CTT AGC GTA 392
Gly Gly Asn Lys Asp Met Gln Asp Gly Val Lys Asn Asn Leu Ser Val
100 105 110

GGT CCT GCT ACT GAA GCC TTA GCC GGT GAA GGT TTG ATC GTA ACT GCT 440
Gly Pro Ala Thr Glu Ala Leu Ala Gly Glu Gly Leu Ile Val Thr Ala
115 120 125 130

GGT GGT ATT GAC ACA CAC ATC CAC TTC ATT TCA CCC CAA CAA ATC CCT 488
Gly Gly Ile Asp Thr His Ile His Phe Ile Ser Pro Gln Gln Ile Pro
135 140 145

ACA GCT TTT GCA AGC GGT GTA ACA ACC ATG ATT GGT GGC GGA ACT GGT 536
Thr Ala Phe Ala Ser Gly Val Thr Thr Met Ile Gly Gly Gly Thr Gly
150 155 160
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(2) INFORMATION FOR SEQ ID NO: 620:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620:



Met Asn Thr Tyr Ala Gln Glu Ser Lys Leu Arg Leu Lys Thr Lys Ile
1 5 10 15

Gly Ala Asp Gly Arg Cys Val Ile Glu Asp Asn Phe Phe Thr Pro Pro
20 25 30

Phe Lys Leu Met Ala Pro Phe Tyr Pro Lys Asp Asp Leu Ala Glu Ile
35 40 45

Met Leu Leu Ala Val Ser Pro Gly Met Met Arg Gly Asp Ala Gln Asp
50 55 60

Val Gln Leu Asn Ile Gly Pro Asn Cys Lys Leu Arg Ile Thr Ser Gln
65 70 75 80

Ser Phe Glu Lys Ile His Asn Thr Glu Asp Gly Phe Ala Ser Arg Asp
85 90 95

Met His Ile Val Val Gly Glu Asn Ala Phe Leu Asp Phe Ala Pro Phe
100 105 110

Pro Leu Ile Pro Phe Glu Asn Ala His Phe Lys Gly Asn Thr Thr Ile
115 120 125

Ser Leu Arg Ser Ser Ser Gln Leu Leu Tyr Ser Glu Ile Ile Val Ala
130 135 140

Gly Arg Val Ala Arg Asn Glu Leu Phe Lys Phe Asn Arg Leu His Thr
145 150 155 160

Lys Ile Ser Ile Leu Gln Asp Glu Lys Pro Ile Tyr Tyr Asp Asn Thr
165 170 175

Ile Leu Asp Pro Lys Thr Thr Asp Leu Asn Asn Met Cys Met Phe Asp
180 185 190

Gly Tyr Thr His Tyr Leu Asn Leu Val Leu Val Asn Cys Pro Ile Glu
195 200 205

Leu Ser Gly Val Arg Glu Cys Ile Glu Glu Ser Glu Gly Val Asp Gly
210 215 220

Ala Val Ser Glu Thr Ala Ser Ser His Leu Cys Val Lys Ala Leu Ala
225 230 235 240

Lys Gly Ser Glu Pro Leu Leu His Leu Arg Glu Lys Ile Ala Arg Leu
245 250 255

Val Thr Gln Thr Thr Thr Gln Lys Val
260 265



(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1815 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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50 55 60 65 

CAA TTA AAC ATC GGT CCA AAT TGC AAG TTA AGG ATC ACT TCG CAA TCC 300
Gln Leu Asn Ile Gly Pro Asn Cys Lys Leu Arg Ile Thr Ser Gln Ser
70 75 80

TTT GAA AAA ATC CAT AAC ACT GAA GAT GGG TTT GCC AGC AGA GAC ATG 348
Phe Glu Lys Ile His Asn Thr Glu Asp Gly Phe Ala Ser Arg Asp Met
85 90 95

CAT ATT GTT GTG GGG GAA AAC GCT TTT TTA GAT TTT GCG CCT TTC CCG 396
His Ile Val Val Gly Glu Asn Ala Phe Leu Asp Phe Ala Pro Phe Pro
100 105 110

TTA ATC CCC TTT GAA AAC GCG CAT TTT AAG GGC AAC ACC ACG ATT TCT 444
Leu Ile Pro Phe Glu Asn Ala His Phe Lys Gly Asn Thr Thr Ile Ser
115 120 125

TTG CGC TCT AGC TCT CAA TTG CTC TAT AGT GAA ATC ATT GTC GCA GGG 492
Leu Arg Ser Ser Ser Gln Leu Leu Tyr Ser Glu Ile Ile Val Ala Gly
130 135 140 145

CGA GTG GCG CGC AAT GAG TTG TTT AAA TTC AAC CGC TTG CAC ACC AAA 540
Arg Val Ala Arg Asn Glu Leu Phe Lys Phe Asn Arg Leu His Thr Lys
150 155 160

ATC TCT ATT TTA CAA GAT GAG AAA CCC ATC TAT TAT GAC AAC ACG ATT 588
Ile Ser Ile Leu Gln Asp Glu Lys Pro Ile Tyr Tyr Asp Asn Thr Ile
165 170 175

TTA GAT CCC AAA ACC ACC GAC TTA AAT AAC ATG TGC ATG TTT GAT GGC 636
Leu Asp Pro Lys Thr Thr Asp Leu Asn Asn Met Cys Met Phe Asp Gly
180 185 190

TAT ACG CAT TAT TTG AAT TTG GTG CTT GTC AAT TGC CCC ATA GAG CTC 684
Tyr Thr His Tyr Leu Asn Leu Val Leu Val Asn Cys Pro Ile Glu Leu
195 200 205

TCT GGT GTG CGA GAA TGC ATT GAA GAA AGC GAA GGG GTG GAT GGG GCA 732
Ser Gly Val Arg Glu Cys Ile Glu Glu Ser Glu Gly Val Asp Gly Ala
210 215 220 225

GTG AGT GAA ACC GCT AGT TCT CAT TTA TGC GTG AAA GCT TTA GCG AAA 780
Val Ser Glu Thr Ala Ser Ser His Leu Cys Val Lys Ala Leu Ala Lys
230 235 240

GGC TCA GAA CCC TTA TTG CAT TTA AGA GAA AAA ATC GCT CGC TTG GTT 828
Gly Ser Glu Pro Leu Leu His Leu Arg Glu Lys Ile Ala Arg Leu Val
245 250 255

ACG CAA ACC ACC ACG CAA AAG GTT TGAAAGCACT TCAAAAAGAT TAAAGTCCTT 882
Thr Gln Thr Thr Thr Gln Lys Val
260 265

TAGTCTTTTT ACTCCCCCCT TTTTTTGACC CTATAAGCTG AAAGGCCTGA ATTCAGTA 940
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Lys Pro Ser Ala His Glu Ile Phe Ser Leu Cys Lys Glu Leu Lys Asn
65 70 75 80 

Ser Ile Trp Asp Arg Lys Leu Val Val Ala Leu Val Glu Ala Leu Glu
85 90 95

Gly Phe Lys Asp Trp Asn Leu Ser Leu Lys Ile Glu Asp Lys Arg Ser
100 105 110

Asn Ser Leu Gly Asn Gly Thr Lys Lys Leu Leu Thr Asn Ala Asp Leu
115 120 125

Gly Ser Asp Tyr Lys Thr Ile Val Ile Asp Ser Met Lys Thr Tyr His
130 135 140

Gln Ser Gln Gln Glu Lys Tyr Lys Arg Glu Arg Gly Glu Thr Leu Glu
145 150 155 160

Val Arg Pro Thr Thr Pro Pro Ser Tyr Gly Gly Gly Ser Ile Arg Ile
165 170 175

Ser Gly Asp Lys Lys Pro Asp Phe Asp Glu Glu Asn Phe
180 185

(2) INFORMATION FOR SEQ ID NO: 619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 58...852
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619: 

AGAAGGCTTA GACGATGTGA TCGCTTGGAT CAAGCGCAAC GCTTTATTGG AAGATTG ATG 60
Met
1

AAC ACT TAC GCT CAA GAA TCC AAG CTC AGG TTA AAA ACC AAA ATA GGG 108
Asn Thr Tyr Ala Gln Glu Ser Lys Leu Arg Leu Lys Thr Lys Ile Gly
5 10 15

GCT GAT GGG CGG TGC GTG ATT GAA GAC AAT TTT TTC ACG CCC CCC TTT 156
Ala Asp Gly Arg Cys Val Ile Glu Asp Asn Phe Phe Thr Pro Pro Phe
20 25 30

AAG CTC ATG GCG CCC TTT TAC CCT AAA GAC GAT TTA GCG GAA ATC ATG 204
Lys Leu Met Ala Pro Phe Tyr Pro Lys Asp Asp Leu Ala Glu Ile Met
35 40 45

CTT TTA GCG GTA AGC CCT GGC ATG ATG AGG GGC GAT GCG CAA GAT GTG 252
Leu Leu Ala Val Ser Pro Gly Met Met Arg Gly Asp Ala Gln Asp Val
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Ala His Glu Ile Phe Ser Leu Cys Lys Glu Leu Lys Asn Ser Ile Trp
70 75 80 

GAC AGG AAG CTT GTG GTA GCG CTA GTG GAG GCT TTA GAG GGG TTT AAG 403
Asp Arg Lys Leu Val Val Ala Leu Val Glu Ala Leu Glu Gly Phe Lys
85 90 95

GAT TGG AAT TTG TCG CTT AAA ATA GAA GAC AAG CGT TCT AAC AGC TTG 451
Asp Trp Asn Leu Ser Leu Lys Ile Glu Asp Lys Arg Ser Asn Ser Leu
100 105 110 115

GGT AAT GGC ACC AAA AAA TTG CTC ACC AAC GCT GAT TTA GGG AGC GAC 499
Gly Asn Gly Thr Lys Lys Leu Leu Thr Asn Ala Asp Leu Gly Ser Asp
120 125 130

TAT AAA ACA ATC GTG ATA GAC AGC ATG AAA ACA TAC CAC CAA AGC CAG 547
Tyr Lys Thr Ile Val Ile Asp Ser Met Lys Thr Tyr His Gln Ser Gln
135 140 145

CAA GAA AAA TAT AAA AGA GAA AGA GGC GAA ACG CTA GAG GTT CGC CCC 595
Gln Glu Lys Tyr Lys Arg Glu Arg Gly Glu Thr Leu Glu Val Arg Pro
150 155 160

ACA ACA CCC CCT AGC TAT GGG GGT GGA AGC ATT AGA ATC AGC GGC GAT 643
Thr Thr Pro Pro Ser Tyr Gly Gly Gly Ser Ile Arg Ile Ser Gly Asp
165 170 175

AAA AAG CCT GAT TTT GAT GAA GAA AAT TTT TAAAAGAAAG GACAACCGAT GAG 696
Lys Lys Pro Asp Phe Asp Glu Glu Asn Phe
180 185

CAGAGTGCAA ATGGATACCG AAGAGGTCAG GGAATTTGTA GGGCATTTAG AACGCTTTAA 756

AGAGTTACTA AGAGAGGAAG TGAACAGCTT GAGTAATCAT TTCCATAATT TAGA 810 
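
In the nucleotide listings each row of codons is printed above its translation, so a row can be checked by translating it again. The following is a minimal sketch, assuming the standard genetic code (the listing itself does not name a translation table); the names `CODON_TABLE` and `translate` are hypothetical. It translates the final coding row of SEQ ID NO: 617 together with the TAA that follows it.

```python
from itertools import product
from typing import List

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# One-letter to three-letter residue names; "***" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}

CODON_TABLE = {
    "".join(codon): THREE_LETTER[aa]
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(row: str) -> List[str]:
    """Translate a whitespace-separated row of DNA codons."""
    return [CODON_TABLE[codon] for codon in row.upper().split()]


row = "AAA AAG CCT GAT TTT GAT GAA GAA AAT TTT TAA"
print(" ".join(translate(row)))
# Lys Lys Pro Asp Phe Asp Glu Glu Asn Phe ***
```

A stop codon that appears inside a coding region while the printed translation shows a residue (for example TAG printed above Tyr, where TAC or TAT would be expected) most likely points to a character misread in the scanned text rather than to the sequence itself.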

(2) INFORMATION FOR SEQ ID NO: 618: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 189 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618: 

Met Met Thr Lys Asn Ala Tyr Ala Phe Val Val Ile Glu Glu Ser Val
1 5 10 15

Met Val Phe Lys Arg Thr Lys Asp Glu Gly Leu Met Pro Ile Phe Glu
20 25 30

Gly Phe Val Pro Leu Lys Glu Gly Phe Leu Lys Ser Phe Lys Glu Arg
35 40 45

Cys Asn Leu Glu Phe Leu Glu Asn Leu Asp Leu Leu Phe Leu Tyr Asp
50 55 60
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Ser Glu Lys Leu Asn Thr Phe Asn Arg Val Arg Tyr Leu Lys Ser Asp 
1105 1110 1115 1120

Leu Asp Leu Leu His Glu Gln Ala Ser Ala Leu Gly Met Val Leu Ala
1125 1130 1135

Thr Ala Lys Pro Cys Leu Asn Gly Arg Phe Glu Leu Leu Tyr Tyr His
1140 1145 1150

Leu Glu Arg Ser Val Ser Ile Ser Tyr His Arg Tyr Gly Asn Leu Gly
1155 1160 1165

Ser Arg Val Leu Arg Gln Pro Thr Cys His Lys Ser Cys Cys Ala Glu
1170 1175 1180

Lys
1185

(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 107...673
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 

AGCATGAAGA AGTGGATGTG AAGGTGTGCA GTATAGATTC ACAAAGCATT AAAGTGGGGC 60

TGTTTAAAGA TAACCAATTA ATCTATGAAA GCGAGGCAGA AAAATT ATG ATG ACT 115
Met Met Thr
1

AAG AAC GCG TAT GCG TTT GTT GTG ATT GAA GAA AGC GTT ATG GTG TTT 163
Lys Asn Ala Tyr Ala Phe Val Val Ile Glu Glu Ser Val Met Val Phe
5 10 15

AAA CGC ACC AAA GAT GAG GGG TTA ATG CCT ATC TTT GAA GGC TTT GTG 211
Lys Arg Thr Lys Asp Glu Gly Leu Met Pro Ile Phe Glu Gly Phe Val
20 25 30 35

CCT TTA AAA GAG GGC TTT TTG AAA AGT TTT AAA GAG CGT TGC AAT TTG 259
Pro Leu Lys Glu Gly Phe Leu Lys Ser Phe Lys Glu Arg Cys Asn Leu
40 45 50

GAA TTT TTA GAA AAT TTA GAC CTT TTG TTT TTG TAT GAC AAA CCA TCC 307
Glu Phe Leu Glu Asn Leu Asp Leu Leu Phe Leu Tyr Asp Lys Pro Ser
55 60 65

GCA CAC GAG ATC TTT TCC TTG TGC AAG GAG CTG AAA AAT TCC ATC TGG 355
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660 665 670 

Ser Val Thr Gly Tyr Lys Leu Cys Glu Cys Phe Trp Asp Ala Gly Val
675 680 685

Pro Arg Asp Ala Leu Ile Tyr Leu Pro Ser Lys Gly Ser Asp Ile Ser
690 695 700

Glu His Leu Leu Arg Asp Glu Ser Ile Gln Phe Ala Ile Leu Thr Gly
705 710 715 720

Gly Glu Asp Thr Ala Tyr Lys Met Leu Lys Ala Asn Pro Thr Leu Ala
725 730 735

Leu Ser Ala Glu Thr Gly Gly Lys Asn Ala Thr Ile Val Ser Lys Met
740 745 750

Ala Asp Arg Asp Gln Ala Ile Lys Asn Val Ile His Ser Ala Phe Ser
755 760 765

Asn Ser Gly Gln Lys Cys Ser Ala Thr Ser Leu Leu Val Leu Glu Lys
770 775 780

Glu Val Tyr Glu Asp Glu Asn Phe Lys Lys Thr Leu Ile Asp Ala Thr
785 790 795 800

Leu Ser Leu Ser Val Gly Asp Pro Phe Asp Phe Lys Asn Lys Ile Gly
805 810 815

Ala Leu Ala Asp Lys Pro Asn Glu Lys Val Ile Lys Ala Ile Asp Glu
820 825 830

Leu Lys Ser Tyr Glu Asn Tyr Glu Ile Pro Val Ser Phe Val Asn Asp
835 840 845

Asn Pro Tyr Leu Met Lys Pro Ser Ile Lys Tyr Gly Thr Lys Lys Gly
850 855 860

Asp Phe Thr His Gln Thr Glu Leu Phe Thr Pro Ile Leu Ser Val Met
865 870 875 880

Glu Ala Lys Asp Leu Asp Glu Ala Ile Glu Ile Ala Asn Ser Thr Gly
885 890 895

Tyr Gly Leu Thr Ser Ala Leu Glu Ser Leu Asp Glu Arg Glu Trp Glu
900 905 910

Tyr Tyr Leu Glu Arg Ile Glu Ala Gly Asn Ile Tyr Ile Asn Lys Pro
915 920 925

Thr Thr Gly Ala Ile Val Leu Arg Gln Pro Phe Gly Gly Val Lys Lys
930 935 940

Ser Ala Val Gly Phe Gly Arg Lys Val Gly Ile Phe Asn Tyr Ile Thr
945 950 955 960

Gln Phe Val Asn Ile Cys Gln Glu Glu Glu Asp Glu Asn Ala Leu Lys
965 970 975

Asn Pro Leu Ser Glu Ala Leu Glu Asn Leu Thr Gln Lys Gly Tyr Asp
980 985 990

Glu His Thr His Glu Leu Lys Arg Ala Ile Phe Met Ala Lys Ser Tyr
995 1000 1005

Ala Tyr His Tyr Lys His Glu Phe Ser Gln Thr Lys Asp Tyr Val Lys
1010 1015 1020

Ile Arg Gly Glu Asp Asn Leu Phe Ser Tyr Thr Lys Val Lys Ser Val
1025 1030 1035 1040

Gly Tyr Arg Ile Thr Glu Lys Asp Thr Leu Ser Asp Met Leu Gly Val
1045 1050 1055

Ala Leu Ala Cys Leu Ile Ser Gln Ile Pro Leu Thr Leu Ser Ile Glu
1060 1065 1070

Asn Glu Arg Thr Asn Lys Asp Leu Thr Phe Phe Leu Glu Cys Leu Lys
1075 1080 1085

Ala Leu Gln Ala Ser Ala Pro Ile Val Tyr Glu Ser Leu Gln Lys Phe
1090 1095 1100






Met Glu Glu Phe Arg Asp Leu Glu Leu Thr Val Glu Ser Phe Met Glu
225 230 235 240

Ser Ile Ala Lys Phe Asp Leu Asn Ala Gly Ile Val Leu Gln Ala Tyr
245 250 255

Ile Pro Asp Ser Tyr Glu Tyr Leu Lys Lys Leu His Ala Phe Ser Lys
260 265 270

Glu Arg Val Leu Lys Gly Leu Lys Pro Ile Lys Ile Arg Phe Val Lys
275 280 285

Gly Ala Asn Met Glu Ser Glu Glu Thr Ile Ala Ser Val Lys Asp Trp
290 295 300

Ala Leu Pro Thr Phe Ser Asn Lys Gln Asp Thr Asp Ser Asn Tyr Asn
305 310 315 320

Lys Met Leu Asp Phe Val Leu Glu Gly Asp Asn Tyr Lys Tyr Ile His
325 330 335

Ile Gly Ala Ala Ser His Asn Ile Phe Glu Ile Ala Tyr Val Tyr Thr
340 345 350

Arg Ile His Ala Ile Asn Asp Pro Val Val Leu Glu His Phe Ser Phe
355 360 365

Glu Met Leu Glu Gly Met Ser Leu Gln Ala Ser Gln Glu Leu Lys Glu
370 375 380

Met His Lys Leu Ile Leu Tyr Ala Pro Val Cys Asp Glu Ala His Phe
385 390 395 400

Asn Asn Ala Ile Ala Tyr Leu Val Arg Arg Leu Asp Glu Asn Thr Ser
405 410 415

Ser Asp Asn Phe Met Lys Ala Phe Phe Asn Leu Lys Val Gly Thr Ser
420 425 430

Glu Trp Lys Asp Gln Glu Gln Arg Phe Leu Asn Ser Leu Lys Gly Ile
435 440 445

Ala Thr Leu Asp Asn Ala Thr His Arg Thr Gln Asp Arg Asn Ala Lys
450 455 460

Gln Ser Gly His Thr Thr Tyr Pro Asn His Ser Phe Lys Asn Glu Ser
465 470 475 480

Asp Thr Asp Phe Ile Leu Lys Ala Asn Arg Glu Trp Ala Lys Lys Val
485 490 495

Arg Glu Lys Met Arg Asn Ala Pro Ile Leu Glu Leu Tyr Pro Glu Met
500 505 510

Asp Gly Arg Phe Glu Asp Pro Asn Leu Thr Pro Leu Glu Val Phe Asp
515 520 525

Arg Ile His His Lys Lys Ile Ala Ser Val His Leu Ala Asp Lys Glu
530 535 540

Ala Ile Leu Lys Ala Leu Glu Val Ala Lys Ser Asp Lys Ser Arg Phe
545 550 555 560

Ser Gln Lys Ser Phe Thr Glu Ile His Ala Leu Met Ser Gln Thr Ala
565 570 575

Gln Leu Phe Arg Glu Arg Arg Gly Asp Leu Ile Gly Ile Ser Ala Leu
580 585 590

Glu Val Gly Lys Thr Phe Ala Glu Thr Asp Ala Glu Val Ser Glu Ala
595 600 605

Ile Asp Phe Leu Glu Phe Tyr Pro Tyr Ser Leu Arg Val Leu Gln Glu
610 615 620

Gln Asn Thr Lys Thr Gln Phe Thr Pro Lys Gly Val Gly Val Val Ile
625 630 635 640

Ala Pro Trp Asn Phe Pro Val Gly Ile Ser Val Gly Thr Ile Ala Ala
645 650 655

Pro Leu Ala Thr Gly Asn Arg Val Ile Tyr Lys Pro Ser Ser Leu Ser
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GCT AAA CCA TGC CTA AAT GGG CGT TTT GAA TTG CTG TAT TAC CAC TTA 3512

Ala Lys Pro Cys Leu Asn Gly Arg Phe Glu Leu Leu Tyr Tyr His Leu 
1140 1145 1150 

GAG CGA TCG GTT AGC ATC TCT TAT CAC CGT TAT GGG AAT TTA GGC TCA 3560
Glu Arg Ser Val Ser Ile Ser Tyr His Arg Tyr Gly Asn Leu Gly Ser
1155 1160 1165

AGG GTT TTA AGG CAA CCC ACT TGC CAC AAA TCA TGC TGT GCT GAA AAA T 3609
Arg Val Leu Arg Gln Pro Thr Cys His Lys Ser Cys Cys Ala Glu Lys
1170 1175 1180 1185

AAATATTGTA TTAAATAAGG AGATCAAAAT GGGACATGTT GTTTTAAGTA CCCCTAT 3666

(2) INFORMATION FOR SEQ ID NO: 616: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 

Met Gln Lys Ile Ile Asp Asp Ser Leu Glu Leu Ala Lys Lys Leu Gln
1 5 10 15

Asp Ser Ile Ser Asn His Leu Ser Asp Gln Glu Lys Ala Phe His Ser
20 25 30

Lys Met Gln Lys Leu Leu Asn Asn Pro Glu Asn Lys Val Met Leu Ile
35 40 45

Glu Leu Met Asp Arg Ser Phe Arg Cys Leu Asp Asn Lys Ala Arg Phe
50 55 60

Glu Met Ile Glu His Val Leu Asp Lys Tyr Lys Ser Arg Glu Ile Phe
65 70 75 80

Ser Pro Phe Glu Lys Val Leu Leu Met Gly Phe Leu Ser Phe Gly Lys
85 90 95

Met Leu Pro Asp Met Ser Val Pro Phe Phe Val Asn Lys Ile Arg Ser
100 105 110

Asp Thr Lys Ala Met Val Leu Asp Gln Glu Glu Ser Gln Leu Lys Glu
115 120 125

Arg Ile Leu Lys Arg Lys Asn Glu Lys Ile Ile Leu Asn Val Asn Phe
130 135 140

Ile Gly Glu Glu Val Leu Gly Glu Glu Glu Ala Asn Ala Arg Phe Glu
145 150 155 160

Lys Tyr Ser Gln Ala Leu Lys Ser Asn Tyr Ile Gln Tyr Ile Ser Ile
165 170 175

Lys Ile Thr Thr Ile Phe Ser Gln Ile Asn Ile Leu Asp Phe Glu Tyr
180 185 190

Ser Lys Lys Glu Ile Val Lys Arg Leu Asp Ala Leu Tyr Ala Leu Ala
195 200 205

Leu Glu Glu Glu Lys Lys Gln Gly Met Pro Lys Phe Ile Asn Leu Asp
210 215 220
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TAT TTA GAA CGC ATT GAA GCC GGT AAT ATC TAT ATC AAC AAG CCC ACC 2840

Tyr Leu Glu Arg Ile Glu Ala Gly Asn Ile Tyr Ile Asn Lys Pro Thr
915 920 925

ACA GGA GCG ATT GTC TTG CGC CAG CCT TTT GGG GGG GTT AAA AAA TCC 2888
Thr Gly Ala Ile Val Leu Arg Gln Pro Phe Gly Gly Val Lys Lys Ser
930 935 940 945

GCT GTG GGG TTT GGG AGG AAA GTA GGC ATT TTC AAC TAT ATC ACG CAA 2936
Ala Val Gly Phe Gly Arg Lys Val Gly Ile Phe Asn Tyr Ile Thr Gln
950 955 960

TTT GTG AAT ATC TGC CAA GAA GAA GAA GAC GAA AAC GCC TTA AAA AAC 2984
Phe Val Asn Ile Cys Gln Glu Glu Glu Asp Glu Asn Ala Leu Lys Asn
965 970 975

CCC TTA AGC GAA GCC TTA GAA AAC TTA ACT CAA AAA GGC TAT GAT GAG 3032
Pro Leu Ser Glu Ala Leu Glu Asn Leu Thr Gln Lys Gly Tyr Asp Glu
980 985 990

CAT ACG CAT GAG TTG AAG CGC GCG ATT TTT ATG GCA AAA AGC TAC GCT 3080
His Thr His Glu Leu Lys Arg Ala Ile Phe Met Ala Lys Ser Tyr Ala
995 1000 1005

TAT CAT TAC AAA CAT GAA TTC AGC CAA ACT AAA GAC TAT GTC AAA ATC 3128
Tyr His Tyr Lys His Glu Phe Ser Gln Thr Lys Asp Tyr Val Lys Ile
1010 1015 1020 1025

AGA GGC GAA GAC AAC CTT TTT TCC TAC ACT AAA GTT AAA AGC GTG GGC 3176
Arg Gly Glu Asp Asn Leu Phe Ser Tyr Thr Lys Val Lys Ser Val Gly
1030 1035 1040

TAT CGC ATC ACC GAA AAG GAC ACT TTA AGC GAC ATG TTA GGC GTT GCT 3224
Tyr Arg Ile Thr Glu Lys Asp Thr Leu Ser Asp Met Leu Gly Val Ala
1045 1050 1055

TTA GCA TGT TTA ATT TCT CAA ATC CCT TTA ACG CTC AGC ATA GAA AAC 3272
Leu Ala Cys Leu Ile Ser Gln Ile Pro Leu Thr Leu Ser Ile Glu Asn
1060 1065 1070

GAA CGA ACG AAC AAA GAT TTA ACC TTT TTC TTA GAA TGC TTA AAA GCG 3320
Glu Arg Thr Asn Lys Asp Leu Thr Phe Phe Leu Glu Cys Leu Lys Ala
1075 1080 1085

CTC CAA GCA AGC GCC CCT ATT GTT TAT GAA AGC TTG CAA AAA TTT AGC 3368
Leu Gln Ala Ser Ala Pro Ile Val Tyr Glu Ser Leu Gln Lys Phe Ser
1090 1095 1100 1105

GAG AAA TTG AAT ACT TTC AAT CGT GTC CGT TAT CTC AAA AGC GAT TTG 3416
Glu Lys Leu Asn Thr Phe Asn Arg Val Arg Tyr Leu Lys Ser Asp Leu
1110 1115 1120

GAT TTA TTG CAC GAA CAA GCG AGC GCT TTA GGG ATG GTT TTA GCC ACG 3464
Asp Leu Leu His Glu Gln Ala Ser Ala Leu Gly Met Val Leu Ala Thr
1125 1130 1135
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AGA GAT GCG CTC ATT TAC TTG CCC TCT AAA GGG AGC GAT ATT AGC GAA 2168

Arg Asp Ala Leu Ile Tyr Leu Pro Ser Lys Gly Ser Asp Ile Ser Glu
690 695 700 705

CAT CTT TTA AGA GAT GAA AGC ATC CAG TTT GCC ATT TTA ACC GGG GGC 2216
His Leu Leu Arg Asp Glu Ser Ile Gln Phe Ala Ile Leu Thr Gly Gly
710 715 720

GAA GAC ACC GCT TAT AAA ATG TTA AAA GCT AAC CCC ACT TTA GCC TTG 2264
Glu Asp Thr Ala Tyr Lys Met Leu Lys Ala Asn Pro Thr Leu Ala Leu
725 730 735

AGC GCT GAA ACA GGC GGT AAA AAC GCC ACC ATT GTG AGC AAA ATG GCA 2312
Ser Ala Glu Thr Gly Gly Lys Asn Ala Thr Ile Val Ser Lys Met Ala
740 745 750

GAC AGA GAC CAG GCG ATT AAG AAT GTT ATC CAT TCA GCT TTT AGC AAT 2360
Asp Arg Asp Gln Ala Ile Lys Asn Val Ile His Ser Ala Phe Ser Asn
755 760 765

TCG GGG CAA AAA TGC TCC GCC ACT TCG CTT TTA GTA TTA GAA AAA GAA 2408
Ser Gly Gln Lys Cys Ser Ala Thr Ser Leu Leu Val Leu Glu Lys Glu
770 775 780 785

GTC TAT GAA GAT GAG AAC TTT AAA AAG ACT CTA ATA GAT GCG ACT CTA 2456
Val Tyr Glu Asp Glu Asn Phe Lys Lys Thr Leu Ile Asp Ala Thr Leu
790 795 800

AGC CTT AGC GTG GGC GAT CCT TTT GAT TTC AAA AAC AAA ATC GGC GCT 2504
Ser Leu Ser Val Gly Asp Pro Phe Asp Phe Lys Asn Lys Ile Gly Ala
805 810 815

CTA GCG GAC AAG CCT AAT GAA AAG GTC ATC AAA GCC ATA GAT GAA TTA 2552
Leu Ala Asp Lys Pro Asn Glu Lys Val Ile Lys Ala Ile Asp Glu Leu
820 825 830

AAA AGC TAT GAA AAT TAC GAA ATC CCG GTA AGC TTT GTC AAT GAT AAC 2600
Lys Ser Tyr Glu Asn Tyr Glu Ile Pro Val Ser Phe Val Asn Asp Asn
835 840 845

CCC TAT TTG ATG AAG CCA AGC ATC AAA TAC GGC ACT AAA AAA GGC GAT 2648
Pro Tyr Leu Met Lys Pro Ser Ile Lys Tyr Gly Thr Lys Lys Gly Asp
850 855 860 865

TTC ACG CAC CAA ACT GAG CTT TTT ACG CCC ATT TTA TCC GTG ATG GAA 2696
Phe Thr His Gln Thr Glu Leu Phe Thr Pro Ile Leu Ser Val Met Glu
870 875 880

GCA AAA GAT TTA GAC GAA GCG ATA GAA ATA GCC AAT TCT ACC GGT TAC 2744
Ala Lys Asp Leu Asp Glu Ala Ile Glu Ile Ala Asn Ser Thr Gly Tyr
885 890 895

GGG CTG ACT AGC GCG TTA GAG TCG TTG GAC GAA AGG GAG TGG GAA TAT 2792
Gly Leu Thr Ser Ala Leu Glu Ser Leu Asp Glu Arg Glu Trp Glu Tyr
900 905 910
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AGC GGG CAT ACC ACT TAC CCA AAC CAC TCC TTT AAA AAC GAA AGC GAT 1496

Ser Gly His Thr Thr Tyr Pro Asn His Ser Phe Lys Asn Glu Ser Asp 
470 475 480 

ACC GAT TTT ATT TTA AAA GCC AAC CGA GAA TGG GCT AAA AAA GTG CGC 1544
Thr Asp Phe Ile Leu Lys Ala Asn Arg Glu Trp Ala Lys Lys Val Arg
485 490 495

GAG AAA ATG CGT AAC GCT CCT ATT TTA GAG CTT TAC CCA GAG ATG GAT 1592
Glu Lys Met Arg Asn Ala Pro Ile Leu Glu Leu Tyr Pro Glu Met Asp
500 505 510

GGG AGG TTT GAA GAT CCT AAT CTA ACC CCT TTA GAA GTC TTT GAT AGA 1640
Gly Arg Phe Glu Asp Pro Asn Leu Thr Pro Leu Glu Val Phe Asp Arg
515 520 525

ATC CAT CAT AAA AAA ATC GCT AGC GTG CAT TTA GCG GAT AAG GAA GCG 1688
Ile His His Lys Lys Ile Ala Ser Val His Leu Ala Asp Lys Glu Ala
530 535 540 545

ATT TTA AAA GCC CTA GAA GTG GCT AAA AGC GAT AAG AGC CGT TTC AGT 1736
Ile Leu Lys Ala Leu Glu Val Ala Lys Ser Asp Lys Ser Arg Phe Ser
550 555 560

CAA AAA AGC TTT ACA GAA ATC CAT GCC TTA ATG AGT CAA ACC GCC CAG 1784
Gln Lys Ser Phe Thr Glu Ile His Ala Leu Met Ser Gln Thr Ala Gln
565 570 575

CTT TTT AGA GAA AGA AGA GGC GAT TTG ATA GGG ATT TCC GCT TTA GAA 1832
Leu Phe Arg Glu Arg Arg Gly Asp Leu Ile Gly Ile Ser Ala Leu Glu
580 585 590

GTG GGT AAG ACT TTC GCT GAA ACG GAC GCT GAA GTG AGC GAA GCC ATT 1880
Val Gly Lys Thr Phe Ala Glu Thr Asp Ala Glu Val Ser Glu Ala Ile
595 600 605

GAC TTT TTA GAG TTT TAC CCT TAC AGC TTA AGG GTG TTG CAA GAG CAA 1928
Asp Phe Leu Glu Phe Tyr Pro Tyr Ser Leu Arg Val Leu Gln Glu Gln
610 615 620 625

AAC ACA AAA ACG CAA TTC ACC CCT AAA GGC GTG GGC GTG GTC ATT GCC 1976
Asn Thr Lys Thr Gln Phe Thr Pro Lys Gly Val Gly Val Val Ile Ala
630 635 640

CCA TGG AAT TTC CCT GTG GGC ATT TCT GTA GGC ACT ATC GCT GCC CCC 2024
Pro Trp Asn Phe Pro Val Gly Ile Ser Val Gly Thr Ile Ala Ala Pro
645 650 655

CTA GCT ACG GGC AAT CGG GTG ATT TAC AAG CCC TCA AGT TTG TCT AGC 2072
Leu Ala Thr Gly Asn Arg Val Ile Tyr Lys Pro Ser Ser Leu Ser Ser
660 665 670

GTA ACG GGC TAT AAG CTT TGT GAG TGC TTT TGG GAT GCG GGC GTG CCT 2120
Val Thr Gly Tyr Lys Leu Cys Glu Cys Phe Trp Asp Ala Gly Val Pro
675 680 685
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ATC GCT AAA TTT GAT TTG AAC GCT GGT ATT GTG CTG CAA GCC TAT ATT 824

Ile Ala Lys Phe Asp Leu Asn Ala Gly Ile Val Leu Gln Ala Tyr Ile
245 250 255

CCG GAT TCT TAT GAA TAT TTG AAA AAA CTG CAC GCT TTT TCT AAA GAA 872
Pro Asp Ser Tyr Glu Tyr Leu Lys Lys Leu His Ala Phe Ser Lys Glu
260 265 270

AGG GTT TTA AAA GGG TTG AAG CCC ATT AAA ATC CGC TTT GTT AAG GGA 920
Arg Val Leu Lys Gly Leu Lys Pro Ile Lys Ile Arg Phe Val Lys Gly
275 280 285

GCG AAC ATG GAG AGC GAA GAA ACT ATC GCT TCC GTG AAA GAT TGG GCG 968
Ala Asn Met Glu Ser Glu Glu Thr Ile Ala Ser Val Lys Asp Trp Ala
290 295 300 305

TTA CCC ACA TTT TCC AAT AAG CAA GAC ACC GAT TCT AAT TAC AAT AAA 1016
Leu Pro Thr Phe Ser Asn Lys Gln Asp Thr Asp Ser Asn Tyr Asn Lys
310 315 320

ATG TTG GAT TTT GTT TTA GAG GGC GAT AAT TAT AAA TAC ATT CAT ATT 1064
Met Leu Asp Phe Val Leu Glu Gly Asp Asn Tyr Lys Tyr Ile His Ile
325 330 335

GGC GCA GCG AGT CAT AAT ATT TTT GAA ATC GCT TAT GTC TAT ACG CGT 1112
Gly Ala Ala Ser His Asn Ile Phe Glu Ile Ala Tyr Val Tyr Thr Arg
340 345 350

ATC CAT GCC ATT AAT GAT CCT GTT GTG TTA GAG CAT TTC AGC TTT GAA 1160
Ile His Ala Ile Asn Asp Pro Val Val Leu Glu His Phe Ser Phe Glu
355 360 365

ATG CTA GAG GGC ATG AGT TTG CAA GCG AGC CAG GAA CTA AAA GAG ATG 1208
Met Leu Glu Gly Met Ser Leu Gln Ala Ser Gln Glu Leu Lys Glu Met
370 375 380 385

CAC AAG CTC ATT CTT TAT GCG CCG GTG TGC GAT GAA GCG CAT TTT AAC 1256
His Lys Leu Ile Leu Tyr Ala Pro Val Cys Asp Glu Ala His Phe Asn
390 395 400

AAT GCG ATC GCT TAC TTG GTG AGG AGG TTA GAC GAA AAC ACC TCA AGC 1304
Asn Ala Ile Ala Tyr Leu Val Arg Arg Leu Asp Glu Asn Thr Ser Ser
405 410 415

GAT AAT TTC ATG AAG GCT TTC TTT AAC CTC AAA GTA GGC ACG AGC GAA 1352
Asp Asn Phe Met Lys Ala Phe Phe Asn Leu Lys Val Gly Thr Ser Glu
420 425 430

TGG AAA GAT CAA GAA CAA CGC TTT TTA AAC AGC CTT AAA GGA ATT GCC 1400
Trp Lys Asp Gln Glu Gln Arg Phe Leu Asn Ser Leu Lys Gly Ile Ala
435 440 445

ACT TTA GAC AAT GCC ACC CAT AGG ACT CAA GAT AGG AAC GCC AAA CAA 1448
Thr Leu Asp Asn Ala Thr His Arg Thr Gln Asp Arg Asn Ala Lys Gln
450 455 460 465
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Ser Ile Ser Asn His Leu Ser Asp Gln Glu Lys Ala Phe His Ser Lys
20 25 30 

ATG CAA AAG CTT TTA AAC AAC CCT GAA AAC AAA GTC ATG CTC ATA GAG 200
Met Gln Lys Leu Leu Asn Asn Pro Glu Asn Lys Val Met Leu Ile Glu
35 40 45

CTT ATG GAT CGG AGT TTC AGG TGC TTG GAC AAT AAA GCC CGC TTT GAA 248
Leu Met Asp Arg Ser Phe Arg Cys Leu Asp Asn Lys Ala Arg Phe Glu
50 55 60 65

ATG ATT GAG CAT GTT TTA GAC AAA TAC AAA AGC CGT GAG ATT TTT TCT 296
Met Ile Glu His Val Leu Asp Lys Tyr Lys Ser Arg Glu Ile Phe Ser
70 75 80

CCG TTT GAA AAA GTG CTT TTA ATG GGG TTT TTA AGC TTT GGG AAA ATG 344
Pro Phe Glu Lys Val Leu Leu Met Gly Phe Leu Ser Phe Gly Lys Met
85 90 95

CTC CCT GAT ATG AGC GTG CCT TTC TTT GTC AAT AAA ATC AGA AGC GAC 392
Leu Pro Asp Met Ser Val Pro Phe Phe Val Asn Lys Ile Arg Ser Asp
100 105 110

ACG AAA GCG ATG GTC TTG GAT CAA GAA GAG AGC CAG TTA AAA GAG CGG 440
Thr Lys Ala Met Val Leu Asp Gln Glu Glu Ser Gln Leu Lys Glu Arg
115 120 125

ATT TTA AAA AGA AAA AAT GAA AAA ATC ATT TTG AAT GTG AAT TTT ATT 488
Ile Leu Lys Arg Lys Asn Glu Lys Ile Ile Leu Asn Val Asn Phe Ile
130 135 140 145

GGC GAA GAG GTT TTA GGC GAA GAA GAA GCT AAT GCG CGT TTT GAA AAA 536
Gly Glu Glu Val Leu Gly Glu Glu Glu Ala Asn Ala Arg Phe Glu Lys
150 155 160

TAC TCT CAA GCC CTA AAA TCC AAC TAC ATC CAA TAC ATT TCC ATT AAA 584
Tyr Ser Gln Ala Leu Lys Ser Asn Tyr Ile Gln Tyr Ile Ser Ile Lys
165 170 175

ATC ACG ACG ATT TTT TCT CAA ATC AAT ATC CTT GAT TTT GAA TAC TCT 632
Ile Thr Thr Ile Phe Ser Gln Ile Asn Ile Leu Asp Phe Glu Tyr Ser
180 185 190

AAA AAA GAG ATT GTC AAA CGC CTA GAC GCT CTT TAC GCC CTG GCT TTA 680
Lys Lys Glu Ile Val Lys Arg Leu Asp Ala Leu Tyr Ala Leu Ala Leu
195 200 205

GAA GAA GAA AAA AAA CAA GGC ATG CCT AAA TTC ATC AAT TTG GAT ATG 728
Glu Glu Glu Lys Lys Gln Gly Met Pro Lys Phe Ile Asn Leu Asp Met
210 215 220 225

GAA GAA TTT AGG GAT TTA GAG CTA ACA GTG GAG TCG TTT ATG GAA TCC 776
Glu Glu Phe Arg Asp Leu Glu Leu Thr Val Glu Ser Phe Met Glu Ser
230 235 240
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625 630 635 640 

Pro Leu Asn Leu Ser Ile Lys Gln Lys Pro Ser Tyr Ile Phe Asp Lys
645 650 655

Gln Leu Pro Tyr Trp Val Ile Tyr Arg Asn Ala Phe Phe Asp Lys Val
660 665 670

Phe His Ser Met Gln Phe Gly Leu Phe Glu Val Phe Arg Asp Arg Gln
675 680 685

Ile Thr Asn Ser Val Leu Val Lys Asn Gly Ile Arg Val Ile Lys Ser
690 695 700

Arg Asn Ile Asp Glu Asn Gly Lys Ile Ile Ser Ile Glu Asn Tyr Asp
705 710 715 720

Ser Tyr Ile Gln Lys Glu Val Leu Ser Pro Phe Lys Ile Ala Ser Phe
725 730 735

Leu Asp Arg Asp Asp Val Tyr Leu Thr Pro Asn Met Thr Tyr Lys Pro
740 745 750

Arg Ile Leu Lys Lys Glu Lys Gly Tyr Val Val Asn Gly Ser Val Ala
755 760 765

Ile Leu Ile Pro Lys Asn Pro Ile Ser Leu Ser Lys Lys Gln Cys Asp
770 775 780

Tyr Ile Ser Ser Val Glu Phe Arg Asp Phe Tyr Lys Ile Ala Arg Asn
785 790 795 800

Tyr Gln Thr Arg Thr Leu Asn Ile Asp Ser Met Ser Cys Phe Trp Phe
805 810 815

Gly Ile Leu Arg Ser Ser Leu
820 



(2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3666 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 54...3608
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 

CCCGCTTAAA ACATGCTACA ATCAAGTCAA ATTCTTAAAT AAAAGGTAAG CTC ATG 56
Met
1

CAA AAA ATC ATT GAC GAT TCG CTA GAA TTA GCT AAA AAA CTG CAA GAT 104
Gln Lys Ile Ile Asp Asp Ser Leu Glu Leu Ala Lys Lys Leu Gln Asp
5 10 15

AGT ATC AGT AAC CAT TTG AGC GAT CAG GAA AAA GCG TTC CAC TCT AAA 152
Cys Lys Glu Phe Lys Asp Phe Ile Ser Ala Leu Glu Phe Phe Pro Asp
195 200 205

Phe Lys Gln Glu Lys Thr Leu Lys Glu Val Ile Gly Ser Leu Lys Pro
210 215 220

Leu Ala Trp Gly Glu Tyr Asp Asn Thr Asp Phe Tyr His Ser Phe Arg
225 230 235 240

Thr Tyr Pro Lys His Met Gln Glu Trp Ile Lys Asp Leu Lys Glu Gly
245 250 255

Gln Ser Ala Phe Glu Asn Thr Glu Leu Asn Lys Lys Pro His Arg Ile
260 265 270

Val Gly Ser Lys Ile Val Leu Asn Val Ser Lys Asn Gly Asp Lys Tyr
275 280 285

Lys Arg Gln Lys Tyr His Ser Val Ala Pro Cys Ile His Thr Arg Asn
290 295 300

Asp Gln Met Ala Ser Gln Asn Thr Ile His Pro Lys Asp Asp Arg Val
305 310 315 320

Phe Ser Ile Arg Glu Leu Met Leu Leu Met Asn Ile Pro Ser Arg Phe
325 330 335

Lys Trp Leu Asp Leu Glu Leu Gln Glu Leu Asn Ala Leu Asn Gln Gln
340 345 350

Glu Lys Glu Lys Ile Ser Lys Gln Asn Glu Met Asn Ile Arg Gln Ser
355 360 365

Ile Gly Glu Ala Val Pro Thr Ile Ile Phe Lys Gln Ile Ala Ile Lys
370 375 380

Ile Lys Asn Phe Met Ser Gln Thr His Leu Glu Pro Lys Glu Ile Ile
385 390 395 400

Arg Leu Ile Asp Val His His Leu Leu Glu Pro Gln Asn Leu Lys Arg
405 410 415

Phe Ile Leu Glu Asn Gln Asn Lys Ile Ala Arg Ala Ser Leu Val Ser
420 425 430

Leu Ala Glu Met Ser Asn Ser Lys Arg Ile Glu Lys Ser Ala Tyr Phe
435 440 445

Thr Asn Pro Phe Ile Ile Asn Glu Ile Ala Lys Leu Leu Pro Ser Phe
450 455 460

Lys Gln Glu Ser Val Thr Ile Ile Glu Pro Ser Ala Gly Cys Gly Asn
465 470 475 480

Phe Leu Ser Ala Leu Phe Lys Lys Tyr Thr Ser Val Lys Lys Val Tyr
485 490 495

Leu Lys Cys Ile Asp Ile Asp Lys Asn Ser Leu Glu Ile Leu Glu Ile
500 505 510

Leu Tyr Lys Asp Cys Ile Pro Asn Asn Phe Glu Met Glu Leu Ile Cys
515 520 525

Lys Asp Phe Leu Ala Tyr Glu Cys Gly Lys Val Asp Leu Ile Val Gly
530 535 540

Asn Pro Pro Phe Gly Lys Thr His Glu Arg Phe Lys Asp Tyr Ser Leu
545 550 555 560

Arg Leu Thr His Leu Ala Gly Ile Phe Leu Glu Lys Ser Leu Lys Leu
565 570 575

Ala Asn Phe Thr Ala Met Val Met Pro Lys Asn Leu Leu Asn Thr Lys
580 585 590

Glu Tyr Ala Glu Thr Arg Thr Lys Leu Glu Lys Lys Gly Val Gly Ala
595 600 605

Ile Leu Asp Phe Gly Glu Leu Gly Phe Lys Gly Val Leu Val Glu Thr
610 615 620

Ile Ala Ile Val Thr Gln Lys Ser Lys Glu Val Leu Ala Arg Ser Leu
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TAT GTG GTT AAT GGC TCT GTG GCT ATT TTA ATC CCT AAA AAC CCC ATA 2 417 

Tyr Val Val Asn Gly Ser Val Ala Ile Leu Ile Pro Lys Asn Pro Ile
765 770 775

TCT TTA AGC AAG AAA CAA TGC GAT TAT ATC TCT AGC GTT GAA TTT AGA 2465
Ser Leu Ser Lys Lys Gln Cys Asp Tyr Ile Ser Ser Val Glu Phe Arg
780 785 790

GAT TTT TAT AAA ATC GCT AGG AAT TAT CAA ACG CGC ACC TTA AAT ATT 2513
Asp Phe Tyr Lys Ile Ala Arg Asn Tyr Gln Thr Arg Thr Leu Asn Ile
795 800 805

GAT AGC ATG AGT TGT TTT TGG TTT GGA ATT TTA AGG AGT AGC TTA TGAAA 2563
Asp Ser Met Ser Cys Phe Trp Phe Gly Ile Leu Arg Ser Ser Leu
810 815 820

GAGCAATCAA TGATTGATTT TTTAAAACTT AGAGATTATG ACATTAG 2610

(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:614: 

Met Leu Phe Asp Gln Thr Leu Thr Tyr Ile Ser Leu Phe Ser Gly Ala
1 5 10 15

Gly Val Gly Cys Tyr Gly Leu Leu Glu Glu Gly Phe Glu Cys Val Ala
20 25 30

Thr Asn Glu Ile Leu Glu Lys Arg Leu Asn Ile Gln Arg Ile Asn Arg
35 40 45

Lys Cys Lys Leu Asp Glu Ser Tyr Ile Ser Gly Asp Ile Lys Lys Pro
50 55 60

Glu Thr Lys Glu Lys Ile Leu Lys Gln Ile Glu Phe Tyr Ser Lys Lys
65 70 75 80

Phe Gly Asn Asp Arg Val Asp Leu Val Val Ala Thr Pro Pro Cys Gln
85 90 95

Gly Met Ser Val Ala Asn His Lys Lys Lys Asn Asp Glu Ile Lys Arg
100 105 110

Asn Ser Leu Val Val Glu Ser Ile Asp Leu Ile Lys Gln Ile Lys Pro
115 120 125

Arg Phe Phe Ile Leu Glu Asn Val Pro Ser Phe Tyr Lys Thr Gly Cys
130 135 140

Ile Asp Lys Asn Asp Asn Leu Leu Glu Ile Gly Ser Met Ile Glu Gln
145 150 155 160

Asn Leu Ser Gly Asp Tyr Met Leu Tyr Asp Glu Val Ile Asn Phe Lys
165 170 175

Asn Phe Gly Ala Asn Ser Ser Arg Thr Arg Thr Leu Val Ile Gly Val
180 185 190 
GGC AAA GTG GAT TTA ATT GTG GGC AAT CCG CCT TTT GGC AAA ACG CAT 1745

Gly Lys Val Asp Leu Ile Val Gly Asn Pro Pro Phe Gly Lys Thr His
540 545 550

GAA AGA TTC AAA GAT TAT AGT TTA AGA CTC ACT CAT TTA GCA GGG ATT 1793
Glu Arg Phe Lys Asp Tyr Ser Leu Arg Leu Thr His Leu Ala Gly Ile
555 560 565

TTT TTA GAA AAG TCT TTA AAA CTA GCC AAC TTT ACA GCG ATG GTT ATG 1841
Phe Leu Glu Lys Ser Leu Lys Leu Ala Asn Phe Thr Ala Met Val Met
570 575 580

CCT AAA AAC CTT TTA AAC ACT AAA GAG TAT GCA GAA ACT AGA ACT AAG 1889
Pro Lys Asn Leu Leu Asn Thr Lys Glu Tyr Ala Glu Thr Arg Thr Lys
585 590 595 600

CTT GAA AAA AAG GGA GTA GGA GCG ATT TTA GAC TTT GGC GAG CTT GGT 1937
Leu Glu Lys Lys Gly Val Gly Ala Ile Leu Asp Phe Gly Glu Leu Gly
605 610 615

TTT AAG GGT GTT TTG GTA GAA ACA ATT GCT ATT GTT ACA CAA AAA TCA 1985
Phe Lys Gly Val Leu Val Glu Thr Ile Ala Ile Val Thr Gln Lys Ser
620 625 630

AAA GAA GTT TTA GCG CGT TCG TTA CCC CTA AAT CTA AGC ATC AAG CAA 2033
Lys Glu Val Leu Ala Arg Ser Leu Pro Leu Asn Leu Ser Ile Lys Gln
635 640 645

AAG CCA AGC TAT ATT TTT GAC AAA CAA TTG CCC TAT TGG GTT ATC TAT 2081
Lys Pro Ser Tyr Ile Phe Asp Lys Gln Leu Pro Tyr Trp Val Ile Tyr
650 655 660

CGC AAC GCT TTT TTT GAT AAG GTG TTT CAT TCC ATG CAG TTT GGT CTT 2129
Arg Asn Ala Phe Phe Asp Lys Val Phe His Ser Met Gln Phe Gly Leu
665 670 675 680

TTT GAA GTG TTT AGA GAC AGA CAA ATC ACT AAT TCT GTG TTG GTT AAA 2177
Phe Glu Val Phe Arg Asp Arg Gln Ile Thr Asn Ser Val Leu Val Lys
685 690 695

AAT GGT ATT CGT GTG ATT AAA TCT CGC AAT ATT GAT GAA AAC GGA AAG 2225
Asn Gly Ile Arg Val Ile Lys Ser Arg Asn Ile Asp Glu Asn Gly Lys
700 705 710

ATT ATT AGC ATT GAA AAT TAC GAT AGC TAC ATT CAA AAA GAG GTT TTA 2273
Ile Ile Ser Ile Glu Asn Tyr Asp Ser Tyr Ile Gln Lys Glu Val Leu
715 720 725

AGT CCG TTT AAG ATA GCT TCA TTT TTA GAC AGA GAT GAT GTC TAT TTA 2321
Ser Pro Phe Lys Ile Ala Ser Phe Leu Asp Arg Asp Asp Val Tyr Leu
730 735 740

ACC CCC AAC ATG ACC TAT AAG CCA AGG ATT TTA AAA AAA GAA AAA GGC 2369
Thr Pro Asn Met Thr Tyr Lys Pro Arg Ile Leu Lys Lys Glu Lys Gly
745 750 755 760 
Ile His Pro Lys Asp Asp Arg Val Phe Ser Ile Arg Glu Leu Met Leu
315 320 325 

TTA ATG AAT ATC CCT AGC CGT TTT AAG TGG TTA GAT TTA GAA TTA CAA 1121
Leu Met Asn Ile Pro Ser Arg Phe Lys Trp Leu Asp Leu Glu Leu Gln
330 335 340

GAA TTA AAC GCC CTT AAC CAA CAA GAA AAA GAA AAA ATC TCC AAA CAA 1169
Glu Leu Asn Ala Leu Asn Gln Gln Glu Lys Glu Lys Ile Ser Lys Gln
345 350 355 360

AAC GAA ATG AAT ATA AGA CAA AGC ATC GGT GAA GCT GTT CCA ACG ATT 1217
Asn Glu Met Asn Ile Arg Gln Ser Ile Gly Glu Ala Val Pro Thr Ile
365 370 375

ATT TTT AAG CAA ATT GCC ATA AAG ATA AAA AAT TTC ATG TCT CAA ACC 1265
Ile Phe Lys Gln Ile Ala Ile Lys Ile Lys Asn Phe Met Ser Gln Thr
380 385 390

CAC TTA GAG CCT AAA GAA ATC ATT AGG CTT ATT GAT GTG CAC CAT TTA 1313
His Leu Glu Pro Lys Glu Ile Ile Arg Leu Ile Asp Val His His Leu
395 400 405

TTA GAG CCA CAA AAT TTG AAG CGA TTT ATT TTA GAA AAT CAA AAC AAG 1361
Leu Glu Pro Gln Asn Leu Lys Arg Phe Ile Leu Glu Asn Gln Asn Lys
410 415 420

ATT GCA AGA GCG AGT TTA GTG AGT TTG GCA GAA ATG TCT AAT TCT AAA 1409
Ile Ala Arg Ala Ser Leu Val Ser Leu Ala Glu Met Ser Asn Ser Lys
425 430 435 440

CGC ATA GAA AAA AGC GCG TAT TTT ACA AAC CCT TTT ATT ATT AAT GAA 1457
Arg Ile Glu Lys Ser Ala Tyr Phe Thr Asn Pro Phe Ile Ile Asn Glu
445 450 455

ATA GCG AAG TTA TTG CCA AGC TTT AAA CAA GAG AGT GTT ACT ATT ATA 1505
Ile Ala Lys Leu Leu Pro Ser Phe Lys Gln Glu Ser Val Thr Ile Ile
460 465 470

GAG CCA AGT GCA GGG TGT GGG AAT TTC TTA AGT GCT CTT TTT AAA AAA 1553
Glu Pro Ser Ala Gly Cys Gly Asn Phe Leu Ser Ala Leu Phe Lys Lys
475 480 485

TAC ACT TCT GTT AAA AAA GTT TAT TTA AAG TGT ATA GAT ATT GAT AAA 1601
Tyr Thr Ser Val Lys Lys Val Tyr Leu Lys Cys Ile Asp Ile Asp Lys
490 495 500

AAT AGT TTA GAA ATT TTA GAG ATT TTA TAT AAA GAT TGC ATT CCT AAC 1649
Asn Ser Leu Glu Ile Leu Glu Ile Leu Tyr Lys Asp Cys Ile Pro Asn
505 510 515 520

AAT TTT GAG ATG GAA TTG ATT TGC AAA GAT TTT CTA GCC TAT GAA TGC 1697
Asn Phe Glu Met Glu Leu Ile Cys Lys Asp Phe Leu Ala Tyr Glu Cys
525 530 535 
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90 95 100 

AAG AAA AAC GAT GAG ATC AAA CGG AAT TCT TTG GTG GTT GAA AGC ATT 449
Lys Lys Asn Asp Glu Ile Lys Arg Asn Ser Leu Val Val Glu Ser Ile
105 110 115 120

GAT TTG ATC AAA CAA ATC AAA CCC AGA TTT TTT ATT TTA GAA AAT GTC 497
Asp Leu Ile Lys Gln Ile Lys Pro Arg Phe Phe Ile Leu Glu Asn Val
125 130 135

CCT AGT TTT TAT AAA ACA GGT TGT ATA GAC AAA AAT GAT AAT TTG CTA 545
Pro Ser Phe Tyr Lys Thr Gly Cys Ile Asp Lys Asn Asp Asn Leu Leu
140 145 150

GAA ATA GGA TCT ATG ATA GAG CAA AAT TTG AGT GGC GAT TAT ATG CTC 593
Glu Ile Gly Ser Met Ile Glu Gln Asn Leu Ser Gly Asp Tyr Met Leu
155 160 165

TAT GAT GAG GTA ATC AAT TTT AAA AAT TTT GGA GCT AAT TCA AGC CGA 641
Tyr Asp Glu Val Ile Asn Phe Lys Asn Phe Gly Ala Asn Ser Ser Arg
170 175 180

ACA AGA ACT TTA GTG ATA GGG GTT TGT AAA GAG TTT AAA GAT TTT ATA 689
Thr Arg Thr Leu Val Ile Gly Val Cys Lys Glu Phe Lys Asp Phe Ile
185 190 195 200

AGC GCG TTA GAA TTT TTT CCT GAT TTC AAA CAA GAA AAA ACC TTA AAA 737
Ser Ala Leu Glu Phe Phe Pro Asp Phe Lys Gln Glu Lys Thr Leu Lys
205 210 215

GAA GTG ATA GGA TCG TTA AAA CCA CTT GCT TGG GGC GAG TAT GAC AAC 785
Glu Val Ile Gly Ser Leu Lys Pro Leu Ala Trp Gly Glu Tyr Asp Asn
220 225 230 

ACG GAT TTT TAT CAT AGT TTT AGA ACT TAT CCA AAG CAT ATG CAA GAA 833
Thr Asp Phe Tyr His Ser Phe Arg Thr Tyr Pro Lys His Met Gln Glu
235 240 245 

TGG ATT AAG GAT TTA AAA GAA GGA CAA AGC GCG TTT GAG AAT ACA GAA 881
Trp Ile Lys Asp Leu Lys Glu Gly Gln Ser Ala Phe Glu Asn Thr Glu
250 255 260

TTA AAC AAA AAA CCT CAT AGA ATT GTT GGC AGT AAG ATT GTC TTA AAT 929
Leu Asn Lys Lys Pro His Arg Ile Val Gly Ser Lys Ile Val Leu Asn
265 270 275 280

GTT TCT AAA AAT GGC GAT AAA TAT AAA AGA CAA AAA TAT CAT AGC GTT 977
Val Ser Lys Asn Gly Asp Lys Tyr Lys Arg Gln Lys Tyr His Ser Val
285 290 295

GCC CCT TGC ATT CAT ACA AGA AAC GAC CAA ATG GCT AGC CAA AAC ACG 1025
Ala Pro Cys Ile His Thr Arg Asn Asp Gln Met Ala Ser Gln Asn Thr
300 305 310

ATC CAC CCC AAA GAT GAT AGA GTG TTT TCC ATT AGA GAG CTG ATG CTT 1073
370 375 380

Leu Lys Val Ile Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp
385 390 395 400

Pro Ser Trp Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys
405 410 415

Leu Lys Phe



(2) INFORMATION FOR SEQ ID NO:613: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 90...2558
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 

TAAAGAGGCT TTTGAAACCA TGCTTAAAGA AATTGAGAGC TTGAAACATT AATGCTTCAA 60

ATGAATTTGG TTTAATCCTT AATCCCCTT ATG CTC TTT GAT CAA ACC TTA ACC 113
Met Leu Phe Asp Gln Thr Leu Thr
1 5

TAT ATT TCT TTA TTT TCT GGG GCA GGA GTG GGG TGC TAT GGG CTT TTA 161
Tyr Ile Ser Leu Phe Ser Gly Ala Gly Val Gly Cys Tyr Gly Leu Leu
10 15 20

GAA GAG GGG TTT GAA TGC GTT GCT ACC AAT GAA ATT TTA GAA AAA CGC 209
Glu Glu Gly Phe Glu Cys Val Ala Thr Asn Glu Ile Leu Glu Lys Arg
25 30 35 40 

TTG AAT ATC CAA AGG ATT AAT CGC AAA TGC AAA TTA GAT GAA AGC TAC 257
Leu Asn Ile Gln Arg Ile Asn Arg Lys Cys Lys Leu Asp Glu Ser Tyr
45 50 55 

ATT AGT GGG GAC ATT AAA AAG CCA GAA ACA AAA GAA AAA ATT TTA AAG 305
Ile Ser Gly Asp Ile Lys Lys Pro Glu Thr Lys Glu Lys Ile Leu Lys
60 65 70

CAA ATT GAA TTT TAT TCT AAA AAA TTT GGT AAT GAT AGG GTT GAT TTA 353
Gln Ile Glu Phe Tyr Ser Lys Lys Phe Gly Asn Asp Arg Val Asp Leu
75 80 85

GTG GTA GCA ACC CCA CCT TGT CAA GGC ATG AGC GTA GCC AAT CAT AAG 401
Val Val Ala Thr Pro Pro Cys Gln Gly Met Ser Val Ala Asn His Lys
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 



Met Lys Glu Gln Ser Met Ile Asp Phe Leu Lys Leu Arg Asp Tyr Asp
1 5 10 15

Ile Arg Lys Thr Gln Asn Ala Arg Trp Ile Asp Gln Lys Cys Thr Pro
20 25 30

Asp Val Leu Ser Leu Val Ala Asp Cys Ile Leu Glu Phe Thr Gln Cys
35 40 45

Asn Ile Gly Lys Ser Phe Ser Ile Arg Asp Ile Trp Asp Ser Pro Tyr
50 55 60

Thr Asn Glu Asn Val Lys Met Ile Phe Ser Lys Pro Asp Leu Asn Ser
65 70 75 80

Asp Phe Ser Met His Glu Tyr Asp Lys Phe Phe Ser Gln Pro Ile Lys
85 90 95

Leu Leu Ala Tyr Ser Gly Ile Leu Phe Glu Thr Lys Thr Gly Asn Arg
100 105 110

Asn Ile Tyr Thr Ile Gln Asn Ile Glu Leu Leu Glu Tyr Leu Met Gln
115 120 125

Arg Glu Thr Asn Ala Leu Lys Phe Leu Ile Leu Tyr Ile Gln Lys Val
130 135 140

Leu Met Asp Ser Gly Ile Tyr Pro Leu Phe Asp Asn Phe Leu Gln Lys
145 150 155 160

Gln Asp Thr Glu Ser Phe Lys Gln Leu Lys Asp Gly Phe Thr His Phe
165 170 175

Thr Ile Asn Asn Thr Ala Ile Asn Asn Ala Thr Glu Cys Phe Arg Ile
180 185 190

Phe Thr Lys Ile Ile Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly
195 200 205

Thr Arg Lys Gly Tyr Leu Ser Asn Thr Ile Ile Thr Lys Asp Glu Leu
210 215 220

Asn Tyr Asn Arg Ile Asn Trp Arg Asp Ile Gly Lys Asp Lys Asn Thr
225 230 235 240

Thr Arg Gln Glu Tyr Asp Leu Ile Asn Ser Lys Arg Ile Ala Asn Ser
245 250 255

Asn Tyr Leu Ile Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp
260 265 270

Arg Phe Asn Asn Ser Leu Ser Glu Val Lys Gln Glu Lys Glu Glu Ser
275 280 285

Gln Ala Thr Gln Ile His His Ile Phe Pro Ile Gln Asp Phe Pro Ile
290 295 300

Ile Ala Asn Tyr Ile Glu Asn Leu Ile Ala Leu Thr Pro Asn Gln His
305 310 315 320

Phe Ile Tyr Ala His Pro Asn Asn Gln Thr Arg Leu Ile Asp Lys Asp
325 330 335

Phe Gln Tyr Ile Cys Leu Leu Ala Lys Thr Thr Thr Ile Leu Asn Asp
340 345 350

Thr Gln Gly Val Tyr Asp Trp Asn Asp Tyr Ile Val Val Leu Asn Met
355 360 365

Gly Leu Lys Thr Thr Ile Phe Ser Gln Val Lys Asn Glu Trp Glu Leu
CGT ATC AAT TGG CGA GAT ATA GGA AAA GAT AAA AAT ACC ACC AGA CAA 777

Arg Ile Asn Trp Arg Asp Ile Gly Lys Asp Lys Asn Thr Thr Arg Gln
230 235 240

GAA TAC GAT CTT ATA AAC TCT AAA AGG ATT GCT AAT TCT AAC TAT CTT 825
Glu Tyr Asp Leu Ile Asn Ser Lys Arg Ile Ala Asn Ser Asn Tyr Leu
245 250 255

ATT TCA AAA GCT AAG AAA GTG GTG AAA CGA TAT AAT GAT AGA TTT AAT 873
Ile Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp Arg Phe Asn
260 265 270 275

AAT TCT CTC TCT GAA GTA AAA CAA GAA AAA GAA GAG TCG CAA GCC ACA 921
Asn Ser Leu Ser Glu Val Lys Gln Glu Lys Glu Glu Ser Gln Ala Thr
280 285 290

CAA ATA CAC CAT ATT TTT CCC ATC CAA GAC TTT CCC ATT ATT GCT AAC 969
Gln Ile His His Ile Phe Pro Ile Gln Asp Phe Pro Ile Ile Ala Asn
295 300 305

TAT ATA GAG AAT CTT ATC GCA CTC ACT CCT AAT CAA CAT TTT ATT TAC 1017
Tyr Ile Glu Asn Leu Ile Ala Leu Thr Pro Asn Gln His Phe Ile Tyr
310 315 320

GCC CAC CCT AAT AAT CAA ACC CGC TTG ATT GAT AAA GAT TTT CAA TAT 1065
Ala His Pro Asn Asn Gln Thr Arg Leu Ile Asp Lys Asp Phe Gln Tyr
325 330 335

ATC TGC TTA TTA GCT AAA ACG ACC ACA ATT CTT AAT GAC ACT CAA GGC 1113
Ile Cys Leu Leu Ala Lys Thr Thr Thr Ile Leu Asn Asp Thr Gln Gly
340 345 350 355

GTA TAT GAT TGG AAT GAT TAT ATT GTT GTG TTG AAT ATG GGC CTC AAA 1161
Val Tyr Asp Trp Asn Asp Tyr Ile Val Val Leu Asn Met Gly Leu Lys
360 365 370

ACA ACT ATC TTT TCT CAA GTC AAG AAC GAA TGG GAA TTA TTA AAA GTA 1209
Thr Thr Ile Phe Ser Gln Val Lys Asn Glu Trp Glu Leu Leu Lys Val
375 380 385

ATA GAT GCT TTT TAT TTT GAT TTT AAC AAG AGC AAA GAT CCA AGT TGG 1257
Ile Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp Pro Ser Trp
390 395 400

TCA TAC TTG CTA GAT AAA AAC GAT TTA AGA GCT TTC AAG CTA AAA TTT T 1306
Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys Leu Lys Phe
405 410 415

AATAAGTTTT ATTGAAACTG GCTATAAAAA CCCGCTTGAC TTATCTTATC CTTTT 1361

(2) INFORMATION FOR SEQ ID NO: 612: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
CAA TCA ATG ATT GAT TTT TTA AAA CTT AGA GAT TAT GAC ATT AGA AAA 105
Gln Ser Met Ile Asp Phe Leu Lys Leu Arg Asp Tyr Asp Ile Arg Lys
5 10 15

ACA CAA AAT GCG CGA TGG ATA GAT CAA AAA TGC ACC CCT GAT GTG TTG 153
Thr Gln Asn Ala Arg Trp Ile Asp Gln Lys Cys Thr Pro Asp Val Leu
20 25 30 35

TCT CTT GTT GCT GAT TGT ATT TTA GAG TTT ACG CAA TGT AAT ATT GGA 201
Ser Leu Val Ala Asp Cys Ile Leu Glu Phe Thr Gln Cys Asn Ile Gly
40 45 50

AAA TCA TTT TCT ATT AGG GAT ATT TGG GAT AGC CCT TAC ACC AAT GAA 249
Lys Ser Phe Ser Ile Arg Asp Ile Trp Asp Ser Pro Tyr Thr Asn Glu
55 60 65

AAT GTT AAA ATG ATT TTT TCT AAA CCT GAT TTA AAT TCT GAC TTT TCC 297
Asn Val Lys Met Ile Phe Ser Lys Pro Asp Leu Asn Ser Asp Phe Ser
70 75 80

ATG CAT GAA TAC GAT AAG TTT TTT TCT CAG CCT ATT AAA TTA TTA GCC 345
Met His Glu Tyr Asp Lys Phe Phe Ser Gln Pro Ile Lys Leu Leu Ala
85 90 95

TAT AGC GGT ATT TTA TTT GAA ACA AAA ACT GGC AAT AGA AAT ATT TAT 393
Tyr Ser Gly Ile Leu Phe Glu Thr Lys Thr Gly Asn Arg Asn Ile Tyr
100 105 110 115

ACC ATA CAA AAC ATA GAG CTA TTA GAA TAT CTC ATG CAA AGA GAA ACA 441
Thr Ile Gln Asn Ile Glu Leu Leu Glu Tyr Leu Met Gln Arg Glu Thr
120 125 130

AAC GCT TTG AAA TTC CTT ATT TTA TAT ATT CAA AAG GTA TTA ATG GAT 489
Asn Ala Leu Lys Phe Leu Ile Leu Tyr Ile Gln Lys Val Leu Met Asp
135 140 145

AGT GGG ATT TAT CCT TTA TTT GAC AAC TTT TTA CAA AAA CAA GAC ACA 537
Ser Gly Ile Tyr Pro Leu Phe Asp Asn Phe Leu Gln Lys Gln Asp Thr
150 155 160

GAA AGT TTT AAG CAA CTA AAA GAT GGT TTC ACT CAT TTT ACT ATC AAT 585
Glu Ser Phe Lys Gln Leu Lys Asp Gly Phe Thr His Phe Thr Ile Asn
165 170 175

AAC ACA GCA ATC AAT AAC GCT ACG GAA TGT TTT AGG ATT TTT ACT AAA 633
Asn Thr Ala Ile Asn Asn Ala Thr Glu Cys Phe Arg Ile Phe Thr Lys
180 185 190 195

ATT ATC AAT CCT TTA GCT TTT TAT TAT GGT AAA AAA GGC ACA AGA AAA 681
Ile Ile Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly Thr Arg Lys
200 205 210

GGG TAT TTG TCC AAC ACT ATA ATT ACA AAA GAT GAG CTT AAT TAT AAT 729
Gly Tyr Leu Ser Asn Thr Ile Ile Thr Lys Asp Glu Leu Asn Tyr Asn
215 220 225 
Ala Lys Gly Tyr Phe Leu Glu Glu Ile Lys Glu Arg Leu Asn Ala Leu

130 135 140 

Gly Tyr Gln Leu Ser Tyr Gln Ile Leu Asn Ala Lys Asp Tyr Gly Val
145 150 155 160

Pro Gln Asn Arg Glu Arg Ala Phe Ile Val Gly Ala Ser Arg Phe Ser
165 170 175

Phe Asp Phe Asn Leu Leu Glu Pro Ser Gln Ser Val Asn Val Gln Asp
180 185 190

Ala Ile Ser Asp Leu Ala Tyr Leu Cys Ser Asn Glu Gly Ala Phe Glu
195 200 205

Ser Asp Tyr Leu Asn Pro Ile Gln Ser Ser Tyr Gln Ala Leu Met Arg
210 215 220

Lys Asp Ser Pro Lys Leu Tyr Asn His Gln Ala Thr Asn His Ser Gln
225 230 235 240

Ala Ala Leu Glu Lys Leu Lys Leu Ile Asn Lys Glu Gln Gly Lys Glu
245 250 255

Cys Leu Pro Lys Asn Leu His Gly Lys Gln Gln Phe Lys Ser Thr Trp
260 265 270

Gly Arg Leu Asn Trp Asn Lys Ile Ser Pro Thr Ile Asp Thr Arg Phe
275 280 285

Asp Thr Pro Ser Asn Gly Thr Asn Ser His Pro Glu Leu His Arg Ser
290 295 300

Ile Thr Pro Arg Glu Ala Ala Arg Ile Gln Ser Phe Ser Asp Asn Tyr
305 310 315 320

Ile Phe Tyr Gly Asn Lys Thr Ser Val Cys Lys Gln Ile Gly Asn Ala
325 330 335

Val Pro Pro Leu Leu Ala Leu Ala Leu Gly Lys Ala Ile Leu Lys Ser
340 345 350

Leu Arg Lys
355 

(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49...1305
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 

TATTGATAGC ATGAGTTGTT TTTGGTTTGG AATTTTAAGG AGTAGCTT ATG AAA GAG 57

Met Lys Glu 
1 
Gly Arg Leu Asn Trp Asn Lys Ile Ser Pro Thr Ile Asp Thr Arg Phe
275 280 285 

GAC ACT CCC AGC AAT GGC ACC AAC TCC CAC CCC GAA TTG CAC CGC TCT 971
Asp Thr Pro Ser Asn Gly Thr Asn Ser His Pro Glu Leu His Arg Ser
290 295 300

ATC ACG CCT AGA GAA GCC GCT AGG ATA CAA AGT TTT AGC GAT AAT TAT 1019
Ile Thr Pro Arg Glu Ala Ala Arg Ile Gln Ser Phe Ser Asp Asn Tyr
305 310 315 320

ATC TTT TAT GGC AAT AAA ACG AGC GTT TGT AAG CAA ATC GGT AAC GCT 1067
Ile Phe Tyr Gly Asn Lys Thr Ser Val Cys Lys Gln Ile Gly Asn Ala
325 330 335

GTG CCT CCT CTT CTA GCC CTA GCC TTA GGC AAA GCG ATC TTA AAA AGC 1115
Val Pro Pro Leu Leu Ala Leu Ala Leu Gly Lys Ala Ile Leu Lys Ser
340 345 350

TTA AGA AAA TGATACAAAT TTATCACGCT GACGCTTTTG AAATCATCAA AGACTTTTAC 1174
Leu Arg Lys
355

CAGCAAA 1181
(2) INFORMATION FOR SEQ ID NO: 610:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 



Met Asn Tyr Lys Ile Leu Asp Leu Phe Cys Gly Ala Gly Gly Phe Ser
1 5 10 15

Ala Gly Leu Glu Cys Leu Glu Glu Phe Asp Ala Leu Ile Gly Leu Asp
20 25 30

Cys Asp Lys Gln Ala Leu Ile Thr Phe Glu Asn Asn His Lys Asn Ala
35 40 45

Ile Gly Val Cys Gly Asp Ile Thr Gln Thr Glu Ile Lys Glu Lys Val
50 55 60

Ile Lys Leu Ala Lys Lys Leu Glu Ile Asn Met Ile Ile Gly Gly Pro
65 70 75 80

Pro Cys Gln Gly Phe Ser Asn Lys Gly Lys Asn Leu Gly Leu Lys Asp
85 90 95

Pro Arg Asn Phe Leu Phe Leu Glu Tyr Ile Glu Ile Val Lys Ala Ile
100 105 110

Lys Pro Glu Ile Phe Ile Ile Glu Asn Val Lys Asn Leu Ile Ser Cys
115 120 125






WO 98/43478




PCT/US98/06371



ATC AAA CTA GCT AAA AAA TTA GAA ATC AAC ATG ATC ATT GGC GGG CCT 299
Ile Lys Leu Ala Lys Lys Leu Glu Ile Asn Met Ile Ile Gly Gly Pro
65 70 75 80

CCA TGT CAA GGC TTT TCT AAT AAA GGG AAA AAT TTA GGG CTA AAA GAC 347
Pro Cys Gln Gly Phe Ser Asn Lys Gly Lys Asn Leu Gly Leu Lys Asp
85 90 95

CCT AGG AAT TTT TTA TTC TTA GAA TAT ATA GAA ATA GTC AAA GCC ATA 395
Pro Arg Asn Phe Leu Phe Leu Glu Tyr Ile Glu Ile Val Lys Ala Ile
100 105 110

AAG CCA GAA ATT TTT ATC ATT GAA AAC GTG AAA AAC CTC ATC TCT TGC 443
Lys Pro Glu Ile Phe Ile Ile Glu Asn Val Lys Asn Leu Ile Ser Cys
115 120 125

GCT AAA GGC TAT TTT TTA GAA GAA ATT AAA GAA AGG TTG AAC GCT TTA 491
Ala Lys Gly Tyr Phe Leu Glu Glu Ile Lys Glu Arg Leu Asn Ala Leu
130 135 140

GGG TAT CAA TTG AGC TAT CAA ATC CTA AAC GCT AAA GAT TAT GGC GTG 539
Gly Tyr Gln Leu Ser Tyr Gln Ile Leu Asn Ala Lys Asp Tyr Gly Val
145 150 155 160

CCT CAA AAC AGA GAG AGA GCC TTT ATT GTA GGG GCT AGT CGT TTC AGT 587
Pro Gln Asn Arg Glu Arg Ala Phe Ile Val Gly Ala Ser Arg Phe Ser
165 170 175

TTT GAT TTC AAT CTT TTA GAG CCT TCT CAA AGC GTG AAT GTT CAA GAT 635
Phe Asp Phe Asn Leu Leu Glu Pro Ser Gln Ser Val Asn Val Gln Asp
180 185 190

GCC ATA AGC GAT TTA GCC TAT CTT TGT TCT AAT GAG GGG GCG TTT GAG 683
Ala Ile Ser Asp Leu Ala Tyr Leu Cys Ser Asn Glu Gly Ala Phe Glu
195 200 205

AGC GAT TAT TTA AAC CCT ATC CAA TCA AGC TAT CAA GCT TTA ATG CGA 731
Ser Asp Tyr Leu Asn Pro Ile Gln Ser Ser Tyr Gln Ala Leu Met Arg
210 215 220

AAA GAT AGC CCT AAA TTA TAC AAC CAT CAA GCC ACC AAC CAC TCG CAA 779
Lys Asp Ser Pro Lys Leu Tyr Asn His Gln Ala Thr Asn His Ser Gln
225 230 235 240

GCC GCT TTA GAG AAA TTA AAA CTC ATT AAC AAA GAA CAA GGC AAA GAA 827
Ala Ala Leu Glu Lys Leu Lys Leu Ile Asn Lys Glu Gln Gly Lys Glu
245 250 255

TGC TTG CCT AAA AAC TTG CAT GGC AAA CAG CAA TTC AAA AGC ACA TGG 875
Cys Leu Pro Lys Asn Leu His Gly Lys Gln Gln Phe Lys Ser Thr Trp
260 265 270 

GGG CGC CTG AAT TGG AAT AAA ATC AGC CCC ACC ATA GAC ACA CGA TTT 923 
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100 105 110 

Pro Met Pro Arg Asn Ile His Arg Arg Tyr Val Gln Asp Thr Glu Phe
115 120 125

Ala Leu Trp Ala Val Lys Lys Lys Ala Lys Trp Val Phe Asn Lys Pro
130 135 140

Lys Asn Glu Lys Tyr Leu Arg Pro Leu Ile Leu Lys Ser Pro Val Val
145 150 155 160

Ser Gly Leu Glu Lys Thr Lys His Pro Thr Gln Lys Ser Leu Ala Leu
165 170 175

Met Glu Lys Ile Ile Ser Ile His Thr Asn Pro Asn Asp Ile Val Leu
180 185 190

Asp Pro Phe Met Gly Ser Gly Thr Thr Gly Leu Ala Cys Lys Asn Leu
195 200 205

Glu Arg Asn Phe Ile Gly Ile Glu Ser Glu Lys Glu Tyr Phe Gln Thr
210 215 220

Ala Lys Lys Arg Leu Asn Leu Phe
225 230 

(2) INFORMATION FOR SEQ ID NO: 609:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 60...1124
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609:

TTGCGTTTTG AAACCGATGA TTTTTCAACG CTTATTGATC GTATTTGTGA AAGCTTGAA 59

ATG AAT TAT AAA ATT TTA GAT TTA TTT TGT GGG GCT GGG GGT TTT AGC 107
Met Asn Tyr Lys Ile Leu Asp Leu Phe Cys Gly Ala Gly Gly Phe Ser
1 5 10 15

GCT GGG TTA GAG TGT TTA GAA GAG TTT GAC GCT TTA ATA GGG CTA GAT 155
Ala Gly Leu Glu Cys Leu Glu Glu Phe Asp Ala Leu Ile Gly Leu Asp
20 25 30

TGC GAT AAA CAA GCC CTA ATC ACT TTT GAA AAC AAC CAT AAA AAC GCC 203
Cys Asp Lys Gln Ala Leu Ile Thr Phe Glu Asn Asn His Lys Asn Ala
35 40 45

ATA GGC GTT TGT GGG GAC ATC ACT CAA ACC GAA ATT AAA GAA AAA GTC 251
Ile Gly Val Cys Gly Asp Ile Thr Gln Thr Glu Ile Lys Glu Lys Val
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Phe Ala Leu Trp Ala Val Lys Lys Lys Ala Lys Trp Val Phe Asn Lys 
130 135 140 

CCC AAA AAT GAA AAA TAT TTA CGG CCT TTG ATT TTA AAA AGC CCT GTG 599
Pro Lys Asn Glu Lys Tyr Leu Arg Pro Leu Ile Leu Lys Ser Pro Val
145 150 155

GTA AGC GGG CTT GAA AAA ACC AAA CAC CCC ACG CAA AAA AGC CTG GCC 647
Val Ser Gly Leu Glu Lys Thr Lys His Pro Thr Gln Lys Ser Leu Ala
160 165 170 175

TTA ATG GAA AAA ATC ATT TCC ATC CAC ACA AAC CCT AAT GAC ATC GTG 695
Leu Met Glu Lys Ile Ile Ser Ile His Thr Asn Pro Asn Asp Ile Val
180 185 190

CTA GAT CCT TTC ATG GGG AGC GGC ACC ACC GGC TTA GCG TGC AAA AAT 743
Leu Asp Pro Phe Met Gly Ser Gly Thr Thr Gly Leu Ala Cys Lys Asn
195 200 205

TTA GAA CGG AAT TTT ATC GGC ATA GAA TCA GAA AAA GAA TAT TTT CAA 791
Leu Glu Arg Asn Phe Ile Gly Ile Glu Ser Glu Lys Glu Tyr Phe Gln
210 215 220

ACC GCT AAA AAG CGT TTG AAT CTG TTT TAAAAACGCT ATTTGAATGA GATTGTG 845
Thr Ala Lys Lys Arg Leu Asn Leu Phe
225 230

TTATAGTTAT TTAAAAGGAT ATTTTGA 872

(2) INFORMATION FOR SEQ ID NO: 608:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608: 

Met Ile Gln Ile Tyr His Ala Asp Ala Phe Glu Ile Ile Lys Asp Phe
1 5 10 15

Tyr Gln Gln Asn Leu Lys Val Asp Ala Ile Ile Thr Asp Pro Pro Tyr
20 25 30

Asn Ile Ser Val Lys Asn Asn Phe Pro Thr Leu Lys Ser Ala Lys Arg
35 40 45

Gln Gly Ile Asp Phe Gly Glu Trp Asp Lys Asn Phe Lys Leu Leu Glu
50 55 60

Trp Ile Ala Arg Tyr Ala Pro Leu Val Asn Pro Asn Gly Cys Met Val
65 70 75 80

Ile Phe Cys Ser Tyr Arg Phe Ile Ser Tyr Ile Ala Asp Phe Leu Glu
85 90 95

Glu Asn Gly Phe Val Val Lys Asp Phe Ile Gln Trp Val Lys Asn Asn
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(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 123...818
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 

AGTTTTAGCG ATAATTATAT CTTTTATGGC AATAAAACGA GCGTTTGTAA GCAAATCGGT 60

AACGCTGTGC CTCCTCTTCT AGCCCTAGCC TTAGGCAAAG CGATCTTAAA AAGCTTAAGA 120

AA ATG ATA CAA ATT TAT CAC GCT GAC GCT TTT GAA ATC ATC AAA GAC 167
Met Ile Gln Ile Tyr His Ala Asp Ala Phe Glu Ile Ile Lys Asp
1 5 10 15

TTT TAC CAG CAA AAT TTA AAA GTG GAT GCG ATC ATC ACG GAC CCT CCT 215
Phe Tyr Gln Gln Asn Leu Lys Val Asp Ala Ile Ile Thr Asp Pro Pro
20 25 30

TAT AAC ATT TCG GTT AAA AAC AAT TTT CCC ACC CTA AAG AGC GCT AAA 263
Tyr Asn Ile Ser Val Lys Asn Asn Phe Pro Thr Leu Lys Ser Ala Lys
35 40 45

AGG CAA GGC ATA GAT TTT GGG GAA TGG GAT AAA AAT TTC AAG CTT TTA 311
Arg Gln Gly Ile Asp Phe Gly Glu Trp Asp Lys Asn Phe Lys Leu Leu
50 55 60

GAA TGG ATC GCA CGC TAC GCC CCC TTA GTC AAT CCA AAC GGC TGC ATG 359
Glu Trp Ile Ala Arg Tyr Ala Pro Leu Val Asn Pro Asn Gly Cys Met
65 70 75

GTT ATT TTT TGC TCT TAC AGG TTT ATA AGC TAT ATC GCT GAT TTT TTA 407
Val Ile Phe Cys Ser Tyr Arg Phe Ile Ser Tyr Ile Ala Asp Phe Leu
80 85 90 95

GAA GAA AAC GGC TTT GTG GTC AAA GAC TTT ATC CAA TGG GTT AAA AAT 455
Glu Glu Asn Gly Phe Val Val Lys Asp Phe Ile Gln Trp Val Lys Asn
100 105 110

AAT CCC ATG CCA AGA AAC ATT CAC CGG CGT TAT GTC CAA GAC ACG GAA 503
Asn Pro Met Pro Arg Asn Ile His Arg Arg Tyr Val Gln Asp Thr Glu
115 120 125

TTT GCT CTG TGG GCG GTT AAA AAG AAA GCC AAG TGG GTG TTT AAC AAA 551
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15 10 15 

GAT CAT GGC GAA AGT TTG GGC GAA GAA GCG TTC TAT TTG CAT GGC ATT 338
Asp His Gly Glu Ser Leu Gly Glu Glu Ala Phe Tyr Leu His Gly Ile
20 25 30

CCT AAA AGC ATC GCC CCC AAA GAA CAA TAC GAG ATC CCC TTT ATC GTT 386
Pro Lys Ser Ile Ala Pro Lys Glu Gln Tyr Glu Ile Pro Phe Ile Val
35 40 45

TAT GCT AAT GAG CCT TTC AAA GAA AAG CAT TCC ATC ATT CAA ACC CAA 434
Tyr Ala Asn Glu Pro Phe Lys Glu Lys His Ser Ile Ile Gln Thr Gln
50 55 60

ACC CCC ATT AAT CAA AAT GTG ATT TTC CAT AGC GTT TTA GGG GTG TTT 482
Thr Pro Ile Asn Gln Asn Val Ile Phe His Ser Val Leu Gly Val Phe
65 70 75

TTG GAT TTT AAA AAC CCA AGC GTT GTT TAT CGC CCT TCT TTA GAT CTG 530
Leu Asp Phe Lys Asn Pro Ser Val Val Tyr Arg Pro Ser Leu Asp Leu
80 85 90 95 

CTT AAA CAC AAA AAA GAG TAAAATAACA CGCATGAAAA AATTCTTATT TAAACAAA 586 
Leu Lys His Lys Lys Glu 
100 

AATTTTGTGA AAGCCTGCCC AAA 609 
(2) INFORMATION FOR SEQ ID NO: 606:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606: 

Met Leu Glu Asn Ala Lys Gln Pro Ala Leu Met Ile Tyr Leu Ser Asp
1 5 10 15

His Gly Glu Ser Leu Gly Glu Glu Ala Phe Tyr Leu His Gly Ile Pro
20 25 30

Lys Ser Ile Ala Pro Lys Glu Gln Tyr Glu Ile Pro Phe Ile Val Tyr
35 40 45

Ala Asn Glu Pro Phe Lys Glu Lys His Ser Ile Ile Gln Thr Gln Thr
50 55 60

Pro Ile Asn Gln Asn Val Ile Phe His Ser Val Leu Gly Val Phe Leu
65 70 75 80

Asp Phe Lys Asn Pro Ser Val Val Tyr Arg Pro Ser Leu Asp Leu Leu
85 90 95

Lys His Lys Lys Glu
100
Gln Asn Ala Asp Ala Leu Cys Arg Thr Leu Glu His Val Glu Lys His
195 200 205

Phe Arg Pro Tyr Leu Glu Asn Met Ala Trp Val Asn Phe Gly Gly Gly
210 215 220

His His Ile Thr Lys Ser Asp Tyr Asp Val Asn Leu Leu Ile Gln Thr
225 230 235 240

Ile Lys Asp Phe Lys Glu Arg Tyr His Asn Ile Glu Val Ile Leu Glu
245 250 255

Pro Gly Glu Ala Ile Gly Trp Gln Cys Gly Phe Leu Ile Ala Ser Val
260 265 270

Ile Asp Ile Val Gln Asn Asp Gln Glu Ile Ala Ile Leu Asp Ala Ser
275 280 285

Phe Ser Ala His Met Pro Asp Cys Leu Glu Met Pro Tyr Arg Pro Ser
290 295 300

Ile Phe Lys Val Ser Val Glu Asn Asp Glu Glu Leu Ile Glu Val Glu
305 310 315 320

Lys Gly Glu Asn Gln Gly Ala Phe Ser Tyr Phe Leu Gly Gly Pro Thr
325 330 335

Cys Leu Ala Gly Asp Phe Met Gly Ser Phe Ser Phe Glu Thr Pro Leu
340 345 350

Lys Arg Gly Asp Lys Ile Val Phe Gln Asp Met Leu His Tyr Thr Ile
355 360 365

Val Lys Asn Asn Ser Phe Asn Gly Val Pro Leu Pro Ser Leu Ala Arg
370 375 380

Leu Asp Gln Gln Gly Phe Lys Ile Leu Lys Asn Phe Ser Tyr Glu Asp
385 390 395 400

Tyr Lys Asn Arg Asn
405



(2) INFORMATION FOR SEQ ID NO: 605:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 609 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 246... 548 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

GAATCTTTAC TTTATAATTT GCCTGACCTT TTAAAAGAAC ACTCTAATGA AAATGTCTTG 60 
CTCATCTTAC ACTTGCAGGC TCGCATGGCC CAAACTACGA CAACAAAGTG CCTTTAAATT 120

TTAGGGTGTT TAAGCCTTAT TGCTCAAGCG CTGATCTGTC TTCTTGCTCC AAAGAAAGCC 180
TGATTAACGC CTATGACAAC ACCATTTTTT ACAACGACTA TCTGCTAGAT CGAAAGATCA 240

TTAGC ATG CTT GAA AAC GCC AAG CAG CCC GCC TTA ATG ATC TAT TTA AGC 290
Met Leu Glu Asn Ala Lys Gln Pro Ala Leu Met Ile Tyr Leu Ser
GAA ACG CCT TTA AAA AGG GGC GAT AAA ATC GTG TTT CAA GAC ATG CTC 1168
Glu Thr Pro Leu Lys Arg Gly Asp Lys Ile Val Phe Gln Asp Met Leu
350 355 360

CAT TAT ACG ATT GTC AAA AAC AAC TCG TTT AAT GGC GTG CCG CTC CCA 1216
His Tyr Thr Ile Val Lys Asn Asn Ser Phe Asn Gly Val Pro Leu Pro
365 370 375 380

AGC CTG GCT AGA TTG GAT CAA CAA GGG TTT AAA ATC CTT AAA AAC TTT 1264
Ser Leu Ala Arg Leu Asp Gln Gln Gly Phe Lys Ile Leu Lys Asn Phe
385 390 395

TCT TAT GAA GAC TAT AAA AAC AGA AAC TAAAGCTTTT GATTAAGGCT TTTTGGG 1318
Ser Tyr Glu Asp Tyr Lys Asn Arg Asn
400 405

GCTTGTAAAA AGNTTACGCA CAACATTCCA AC 1350

(2) INFORMATION FOR SEQ ID NO: 604:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 405 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604: 

Met Lys Lys Tyr Ser Thr Ile Pro Thr Pro Cys Tyr Val Leu Glu Ser
1 5 10 15

Glu Arg Leu Glu Lys Asn Ala Lys Ile Leu Glu Ile Val Arg Gln Gln
20 25 30

Ser Gly Ala Lys Val Leu Leu Ala Leu Lys Gly Tyr Ala Phe Trp Arg
35 40 45

Glu Phe Gly Ile Leu Arg Gln Lys Leu Asn Gly Cys Cys Ala Ser Gly
50 55 60

Leu Tyr Glu Ala Lys Leu Ala Phe Glu Glu Phe Gly Gly Arg Glu Ser
65 70 75 80

His Lys Glu Ile Cys Val Tyr Ser Pro Ala Phe Lys Glu Ala Glu Met
85 90 95

Ser Ala Ile Leu Pro Leu Ala Thr Ser Ile Ile Phe Asn Ser Phe Tyr
100 105 110

Gln Tyr Ala Thr Tyr Lys Asp Arg Ile Leu Asp Lys Asn Lys Gln Leu
115 120 125

Glu Asn Leu Gly Leu Ser Pro Ile Lys Met Gly Leu Arg Ile Asn Pro
130 135 140

Leu Tyr Ser Glu Val Thr Pro Ala Ile Tyr Asn Pro Cys Ser Lys Val
145 150 155 160

Ser Arg Leu Gly Ile Thr Pro Ser Gly Phe Glu Lys Gly Val Lys Glu
165 170 175

His Gly Leu Glu Gly Val Ser Gly Leu His Phe His Thr His Cys Glu
180 185 190
AAC AAG CAA TTA GAA AAC TTG GGC TTA AGC CCC ATT AAA ATG GGT TTG 496
Asn Lys Gln Leu Glu Asn Leu Gly Leu Ser Pro Ile Lys Met Gly Leu
125 130 135 140

AGO ATA AAC CCT CTC TAT AGC GAA GTA ACC CCA GCG ATC TAT AAC CCA 544
Arg Ile Asn Pro Leu Tyr Ser Glu Val Thr Pro Ala Ile Tyr Asn Pro
145 150 155

TGC TCT AAA GTG AGC CGG TTA GGG ATT ACG CCT AGC GGA TTT GAA AAG 592
Cys Ser Lys Val Ser Arg Leu Gly Ile Thr Pro Ser Gly Phe Glu Lys
160 165 170

GGG GTG AAA GAG CAT GGC TTA GAG GGG GTG AGC GGG TTG CAT TTC CAT 640
Gly Val Lys Glu His Gly Leu Glu Gly Val Ser Gly Leu His Phe His
175 180 185

ACG CAT TGC GAG CAA AAC GCT GAC GCT TTG TGC CGG ACT TTA GAG CAT 688
Thr His Cys Glu Gln Asn Ala Asp Ala Leu Cys Arg Thr Leu Glu His
190 195 200

GTA GAA AAG CAT TTC AGG CCC TAT TTA GAA AAC ATG GCG TGG GTG AAT 736
Val Glu Lys His Phe Arg Pro Tyr Leu Glu Asn Met Ala Trp Val Asn
205 210 215 220

TTT GGT GGG GGG CAT CAT ATC ACT AAG AGC GAT TAT GAT GTG AAT TTG 784
Phe Gly Gly Gly His His Ile Thr Lys Ser Asp Tyr Asp Val Asn Leu
225 230 235

CTC ATC CAA ACG ATT AAG GAT TTC AAA GAA CGC TAT CAT AAT ATA GAA 832
Leu Ile Gln Thr Ile Lys Asp Phe Lys Glu Arg Tyr His Asn Ile Glu
240 245 250

GTG ATT TTA GAG CCT GGG GAA GCC ATA GGG TGG CAA TGC GGG TTT TTA 880
Val Ile Leu Glu Pro Gly Glu Ala Ile Gly Trp Gln Cys Gly Phe Leu
255 260 265

ATC GCA AGC GTG ATA GAC ATC GTT CAA AAC GAT CAA GAA ATT GCG ATT 928
Ile Ala Ser Val Ile Asp Ile Val Gln Asn Asp Gln Glu Ile Ala Ile
270 275 280

CTA GAC GCT TCT TTT AGC GCT CAC ATG CCC GAT TGC TTA GAA ATG CCT 976
Leu Asp Ala Ser Phe Ser Ala His Met Pro Asp Cys Leu Glu Met Pro
285 290 295 300

TAT CGC CCT AGC ATT TTT AAA GTC TCC GTA GAA AAT GAT GAA GAG CTT 1024
Tyr Arg Pro Ser Ile Phe Lys Val Ser Val Glu Asn Asp Glu Glu Leu
305 310 315

ATT GAA GTT GAA AAG GGC GAA AAT CAA GGG GCG TTT TCT TAT TTT TTA 1072
Ile Glu Val Glu Lys Gly Glu Asn Gln Gly Ala Phe Ser Tyr Phe Leu
320 325 330

GGC GGC CCT ACT TGT TTA GCG GGG GAT TTT ATG GGG AGT TTT AGC TTT 1120
Gly Gly Pro Thr Cys Leu Ala Gly Asp Phe Met Gly Ser Phe Ser Phe
335 340 345
65 70 75 80

Lys Ala Pro Ser Leu Ala Asn Trp Phe Met Gly Ile Phe
85 90

(2) INFORMATION FOR SEQ ID NO: 603: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 77...1291
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603:

TGAAGGCCGA TGATTTCATT TCCAAGTCTA ACCCCAAAGA CATCCAGCGA GTGGTTAAGC 60

AATTTTTGGA ATTAGC ATG AAA AAA TAC AGC ACT ATC CCC ACC CCT TGC TAC 112
Met Lys Lys Tyr Ser Thr Ile Pro Thr Pro Cys Tyr
1 5 10

GTG TTA GAG AGC GAA CGC TTA GAA AAA AAC GCC AAG ATT TTA GAA ATC 160
Val Leu Glu Ser Glu Arg Leu Glu Lys Asn Ala Lys Ile Leu Glu Ile
15 20 25

GTG CGC CAA CAA AGT GGG GCA AAG GTC TTG CTT GCT TTA AAG GGG TAT 208
Val Arg Gln Gln Ser Gly Ala Lys Val Leu Leu Ala Leu Lys Gly Tyr
30 35 40

GCG TTT TGG CGT GAG TTT GGG ATT TTG AGG CAA AAA TTG AAC GGG TGT 256
Ala Phe Trp Arg Glu Phe Gly Ile Leu Arg Gln Lys Leu Asn Gly Cys
45 50 55 60

TGC GCG AGC GGT CTT TAT GAG GCT AAG CTC GCT TTT GAA GAA TTT GGG 304
Cys Ala Ser Gly Leu Tyr Glu Ala Lys Leu Ala Phe Glu Glu Phe Gly
65 70 75

GGG CGA GAG AGC CAC AAA GAA ATT TGC GTT TAT AGC CCG GCT TTC AAA 352
Gly Arg Glu Ser His Lys Glu Ile Cys Val Tyr Ser Pro Ala Phe Lys
80 85 90

GAG GCT GAA ATG AGC GCG ATT TTA CCC CTA GCG ACA AGC ATT ATT TTT 400
Glu Ala Glu Met Ser Ala Ile Leu Pro Leu Ala Thr Ser Ile Ile Phe
95 100 105

AAC TCT TTT TAC CAA TAC GCT ACC TAT AAA GAC AGG ATT TTA GAT AAA 448
Asn Ser Phe Tyr Gln Tyr Ala Thr Tyr Lys Asp Arg Ile Leu Asp Lys
110 115 120
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601:

TAAAGATAAA AACCACCCCT TTTGACCCCC TTTTAGAGGA TTTTTATGGT TTTTTTGAGA 60
TAATAAAGAC TTATAAAGTT AATTAAAATT AAACAGAAGG GTTTTG ATG TCC GCT 115
Met Ser Ala
1

CAT TTT TTA AAA ATC GTT TTT TTA GTA GGC ATG TGC GTT TCA ACT TTG 163
His Phe Leu Lys Ile Val Phe Leu Val Gly Met Cys Val Ser Ser Leu
5 10 15

TTC GCT GAA GGT TTA GAG GGG TTT TTT AAC GCC CTA GAA GCC CAG CTC 211
Phe Ala Glu Gly Leu Glu Gly Phe Phe Asn Ala Leu Glu Ala Gln Leu
20 25 30 35

AAA AGC CCC ATC GCT AAG GGG ATT TTA ATG GTG ATT TTC ATA GGG ATC 259
Lys Ser Pro Ile Ala Lys Gly Ile Leu Met Val Ile Phe Ile Gly Ile
40 45 50

GCT ATT TAT GTG TGG AGG AAT TTG GAC CGG TGG AAA GAG ATC TTA TTC 307
Ala Ile Tyr Val Trp Arg Asn Leu Asp Arg Trp Lys Glu Ile Leu Phe
55 60 65

ACG ATC CTT GGC GTG GTG TTT GGG ATT TTT TTA TTC TTT AAA GCT CCG 355
Thr Ile Leu Gly Val Val Phe Gly Ile Phe Leu Phe Phe Lys Ala Pro
70 75 80

AGT TTA GCG AAT TGG TTT ATG GGA ATT TTT TAATGATTAT CCTGTCAGCG AGC 408
Ser Leu Ala Asn Trp Phe Met Gly Ile Phe
85 90

GTGAAGAATT TGCGTGAAAT TTCGGTTAAA GAAAAATTTT TA 450

(2) INFORMATION FOR SEQ ID NO:602: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:602: 

Met Ser Ala His Phe Leu Lys Ile Val Phe Leu Val Gly Met Cys Val
1 5 10 15

Ser Ser Leu Phe Ala Glu Gly Leu Glu Gly Phe Phe Asn Ala Leu Glu
20 25 30

Ala Gln Leu Lys Ser Pro Ile Ala Lys Gly Ile Leu Met Val Ile Phe
35 40 45

Ile Gly Ile Ala Ile Tyr Val Trp Arg Asn Leu Asp Arg Trp Lys Glu
50 55 60

Ile Leu Phe Thr Ile Leu Gly Val Val Phe Gly Ile Phe Leu Phe Phe
160



(2) INFORMATION FOR SEQ ID NO: 600:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 160 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600: 



Met Gly Asn Val Asn Pro Pro Lys Thr Ser Tyr Lys Glu Ser Leu Ser
1 5 10 15

Thr Ile Ala Ser Ile Glu Tyr Ala Ile Ala His Val Gly Val Gln Asn
20 25 30

Leu Ile Ile Cys Gly His Ser Asp Cys Gly Ala Cys Gly Ser Val His
35 40 45

Leu Ile His Asp Glu Thr Thr Lys Ala Lys Thr Pro Tyr Ile Ala Asn
50 55 60

Trp Ile Gln Phe Leu Glu Pro Val Lys Glu Glu Leu Lys Asn His Pro
65 70 75 80

Gln Phe Ser Asn His Phe Ala Lys Arg Ser Trp Leu Thr Glu Arg Leu
85 90 95

Asn Ala Arg Leu Gln Leu Asn Asn Leu Leu Ser Tyr Asp Phe Ile Gln
100 105 110

Glu Lys Ala Ser Lys Asn Glu Leu Lys Ile Phe Gly Trp His Tyr Ile
115 120 125

Ile Glu Thr Gly Arg Ile Tyr Asn Tyr Asn Phe Glu Ser His Phe Phe
130 135 140

Glu Pro Ile Gly Glu Thr Ile Lys Gln Arg Lys Ser His Glu Asn Phe
145 150 155 160



(2) INFORMATION FOR SEQ ID NO: 601: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 107...385
(D) OTHER INFORMATION: 
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 250... 729 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599: 

TTTAGCTATG ATAAGCGTTT TAAAAACAAA CGAATTTTAA TCAAAATGAG ATTTAAGGGT 60

TAAAGAGTGA AAGCGTTTTT AGGAGCGTTA GAGTTTCAAG AGAATGAATA TGAAGAGCTT 120

AAAGAGCTTT ATGAGAGCTT AAAAACCAAG CAAAAGCCCC ACACTTTGTT CATTTCTTGT 180

GTGGATTCAC GAGTCGTGCC TAATTTAATC ACTGGCACCA AACCGGGCGA ATTGTATGTG 240

ATTTGCAAC ATG GGC AAT GTG AAC CCC CCT AAA ACA AGC TAT AAA GAG TCC 291
Met Gly Asn Val Asn Pro Pro Lys Thr Ser Tyr Lys Glu Ser
1 5 10

CTT TCT ACC ATT GCG AGC ATT GAA TAC GCT ATC GCG CAT GTG GGC GTT 339
Leu Ser Thr Ile Ala Ser Ile Glu Tyr Ala Ile Ala His Val Gly Val
15 20 25 30

CAA AAC TTA ATC ATT TGC GGG CAT AGC GAT TGT GGG GCT TGC GGG AGC 387
Gln Asn Leu Ile Ile Cys Gly His Ser Asp Cys Gly Ala Cys Gly Ser
35 40 45

GTT CAT TTA ATC CAT GAT GAA ACC ACC AAA GCT AAA ACC CCT TAC ATT 435
Val His Leu Ile His Asp Glu Thr Thr Lys Ala Lys Thr Pro Tyr Ile
50 55 60

GCA AAC TGG ATA CAA TTT TTA GAG CCT GTT AAA GAA GAG TTA AAA AAC 483
Ala Asn Trp Ile Gln Phe Leu Glu Pro Val Lys Glu Glu Leu Lys Asn
65 70 75

CAC CCG CAA TTC AGC AAC CAT TTC GCC AAG CGT TCA TGG CTT ACA GAG 531
His Pro Gln Phe Ser Asn His Phe Ala Lys Arg Ser Trp Leu Thr Glu
80 85 90

CGT TTG AAT GCG CGC TTG CAA CTC AAC AAC CTC TTA AGC TAT GAT TTC 579
Arg Leu Asn Ala Arg Leu Gln Leu Asn Asn Leu Leu Ser Tyr Asp Phe
95 100 105 110

ATT CAA GAG AAA GCG AGC AAG AAT GAA TTA AAA ATT TTT GGT TGG CAC 627
Ile Gln Glu Lys Ala Ser Lys Asn Glu Leu Lys Ile Phe Gly Trp His
115 120 125

TAC ATC ATA GAA ACA GGC AGG ATT TAT AAT TAT AAT TTT GAA AGC CAT 675
Tyr Ile Ile Glu Thr Gly Arg Ile Tyr Asn Tyr Asn Phe Glu Ser His
130 135 140

TTT TTT GAG CCG ATT GGA GAA ACC ATT AAA CAA AGG AAA AGT CAT GAA 723
Phe Phe Glu Pro Ile Gly Glu Thr Ile Lys Gln Arg Lys Ser His Glu
145 150 155

AAC TTC TAAAACAAAA ACCCCTAAAT CCGTTTTAAT CGCTGGGCCA TGCGTCATTG A 780
Asn Phe
Val Glu Gly Val Phe Phe Ser Ile Tyr Ile Phe Tyr Ser Gly Leu Asn
35 40 45

Glu Asp Asp Leu Asn Arg Leu Gln Glu Thr Ile Lys Pro Phe Lys His
50 55 60

Phe Ala Ala Leu Lys Cys Gln Asp Ile Ser Ala Thr Leu Asp Ser Leu
65 70 75 80

Pro Thr Ile Thr Asp Ser Ala Trp Val Asn Arg Tyr Ser Arg Met Ile
85 90 95

Leu Val Lys Tyr Leu Leu Pro Ser Leu Phe Pro Gln Tyr Ser Lys Met
100 105 110

Ile Trp Ser Asp Val Asp Val Val Phe Cys Arg Ala Phe Ala Asp Asp
115 120 125

Phe Ile Ala Leu Asp Thr Ser Glu Ser Phe His Leu Ser Gly Val Ile
130 135 140

Ser Leu Val Ser Gln Ser Val Thr Glu Gly Phe Trp Phe Cys Asn Leu
145 150 155 160

Asp Tyr Met Arg Lys His Ser Phe Thr Gln Gln Val Leu Glu Lys Phe
165 170 175

Lys Ile Gln Val Met Arg Pro Tyr Phe Lys Glu Pro Thr Leu Ile His
180 185 190

His Leu His Ala Tyr Ile Lys Glu Leu Pro Leu His Tyr Cys Val Leu
195 200 205

Pro Tyr Tyr Tyr Gln Glu Glu Leu Asp Asp Leu Arg His Lys Ala Ser
210 215 220

Leu Pro Ile Arg Phe Glu Ile Ile His Gln Asp Lys Pro Asn Glu Phe
225 230 235 240

Ile His Arg Gln Gln Ile Pro Tyr Glu Ile Ser Gln Ile Gln Asn Ile
245 250 255

Leu Ser Asn Pro Ile Ile Met His Tyr Glu Ser Asp Lys Asp Ala Leu
260 265 270

Gly Ile Tyr Asn Gly Lys Pro Trp Glu Phe Pro Leu Gly Asn Gln Tyr
275 280 285

His Leu Trp Leu Glu Met Leu Ala His Thr Pro Phe Trp Lys Asp Phe
290 295 300

Thr Leu Glu Met Gln Lys Lys Arg Ile Glu Tyr Arg Asp Ile Ala Gln
305 310 315 320

Lys Ile His Tyr Phe Ser Gln Asp Lys Arg Leu Tyr Glu Val Ser Ile
325 330 335

Arg Ser Ile Lys Val Phe Ala Ser His Tyr Tyr Asn Leu Val Val Lys
340 345 350

Glu Arg Trp Ser Lys Pro Ile Lys Thr Phe Phe Gln Lys Asn Phe Phe
355 360 365

Gln Lys Lys Phe
370

(2) INFORMATION FOR SEQ ID NO: 599:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 780 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
TTT GAA ATC ATC CAC CAA GAC AAA CCC AAT GAA TTT ATC CAT CGC CAG 777
Phe Glu Ile Ile His Gln Asp Lys Pro Asn Glu Phe Ile His Arg Gln
230 235 240

CAA ATC CCC TAT GAG ATC TCT CAA ATT CAA AAC ATT CTT TCA AAC CCT 825
Gln Ile Pro Tyr Glu Ile Ser Gln Ile Gln Asn Ile Leu Ser Asn Pro
245 250 255 260

ATT ATC ATG CAC TAT GAA TCT GAT AAA GAT GCT CTT GGA ATC TAC AAT 873
Ile Ile Met His Tyr Glu Ser Asp Lys Asp Ala Leu Gly Ile Tyr Asn
265 270 275

GGC AAA CCT TGG GAG TTC CCT TTG GGG AAT CAA TAC CAC CTG TGG TTA 921
Gly Lys Pro Trp Glu Phe Pro Leu Gly Asn Gln Tyr His Leu Trp Leu
280 285 290

GAG ATG CTT GCA CAC ACT CCA TTT TGG AAA GAC TTC ACT CTG GAA ATG 969
Glu Met Leu Ala His Thr Pro Phe Trp Lys Asp Phe Thr Leu Glu Met
295 300 305

CAA AAA AAA CGC ATA GAA TAC CGA GAT ATT GCT CAA AAA ATC CAT TAT 1017
Gln Lys Lys Arg Ile Glu Tyr Arg Asp Ile Ala Gln Lys Ile His Tyr
310 315 320

TTT TCT CAA GAT AAG CGT CTT TAT GAA GTG AGC ATA CGC TCC ATT AAG 1065
Phe Ser Gln Asp Lys Arg Leu Tyr Glu Val Ser Ile Arg Ser Ile Lys
325 330 335 340

GTT TTT GCA TCT CAT TAC TAT AAT TTA GTG GTT AAA GAA CGA TGG TCT 1113
Val Phe Ala Ser His Tyr Tyr Asn Leu Val Val Lys Glu Arg Trp Ser
345 350 355

AAA CCA ATA AAA ACT TTC TTT CAA AAA AAT TTT TTT CAA AAA AAG TTC T 1162
Lys Pro Ile Lys Thr Phe Phe Gln Lys Asn Phe Phe Gln Lys Lys Phe
360 365 370

AATTTGTTGG AATCACAAGG CGAAACCAAA TCCCTAATAC TTATGCTTTC TCAGG 1217

(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 372 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598: 

Met Gln His Glu Ile Pro Ile Ala Phe Ala Phe Asp Lys Asn Tyr Leu
1 5 10 15

Lys Thr Gly Ala Val Ala Leu Tyr Ser Leu Leu His Ala His Arg Ala
20 25 30




ATC CCT ATT GCC TTT GCC TTT GAT AAA AAC TAG CTA AAA ACA GGG GCT 105
Ile Pro Ile Ala Phe Ala Phe Asp Lys Asn Tyr Leu Lys Thr Gly Ala
5 10 15 20

GTG GCT CTC TAG TCT TTA TTG CAT GCC CAT CGT GCA GTT GAA GGG GTA 153
Val Ala Leu Tyr Ser Leu Leu His Ala His Arg Ala Val Glu Gly Val
25 30 35

TTT TTC AGT ATC TAT ATA TTC TAT AGC GGT TTG AAT GAA GAT GAT TTA 201
Phe Phe Ser Ile Tyr Ile Phe Tyr Ser Gly Leu Asn Glu Asp Asp Leu
40 45 50

AAC AGG CTC CAA GAA ACT ATC AAA CCT TTC AAA CAT TTT GCC GCT TTA 249
Asn Arg Leu Gln Glu Thr Ile Lys Pro Phe Lys His Phe Ala Ala Leu
55 60 65

AAA TGC CAA GAT ATT AGC GCC ACT CTT GAT TCT TTG CCC ACC ATC ACG 297
Lys Cys Gln Asp Ile Ser Ala Thr Leu Asp Ser Leu Pro Thr Ile Thr
70 75 80

GAT AGT GCA TGG GTT AAT CGC TAT TCT AGA ATG ATT TTG GTC AAA TAG 345
Asp Ser Ala Trp Val Asn Arg Tyr Ser Arg Met Ile Leu Val Lys Tyr
85 90 95 100

CTT CTC CCT AGT TTA TTC CCC CAA TAG AGC AAA ATG ATT TGG TCT GAT 393
Leu Leu Pro Ser Leu Phe Pro Gln Tyr Ser Lys Met Ile Trp Ser Asp
105 110 115

GTG GAT GTG GTC TTT TGC AGA GCT TTC GCT GAT GAT TTT ATC GCT TTA 441
Val Asp Val Val Phe Cys Arg Ala Phe Ala Asp Asp Phe Ile Ala Leu
120 125 130

GAG ACA AGC GAA TCT TTT CAT TTG AGT GGT GTG ATA AGT TTA GTA TCA 489
Asp Thr Ser Glu Ser Phe His Leu Ser Gly Val Ile Ser Leu Val Ser
135 140 145

CAA TCA GTT ACA GAG GGG TTT TGG TTT TGC AAT TTG GAT TAG ATG CGA 537
Gln Ser Val Thr Glu Gly Phe Trp Phe Cys Asn Leu Asp Tyr Met Arg
150 155 160

AAG CAC TCT TTC ACC CAA CAG GTC TTA GAA AAA TTT AAA ATT CAA GTA 585
Lys His Ser Phe Thr Gln Gln Val Leu Glu Lys Phe Lys Ile Gln Val
165 170 175 180

ATG CGT CCA TAT TTT AAA GAA CCT ACA TTA ATA CAC CAT TTG CAT GCT 633
Met Arg Pro Tyr Phe Lys Glu Pro Thr Leu Ile His His Leu His Ala
185 190 195

TAT ATT AAA GAA CTT CCC TTA CAC TAT TGC GTT GTG CCT TAT TAT TAT 681
Tyr Ile Lys Glu Leu Pro Leu His Tyr Cys Val Leu Pro Tyr Tyr Tyr
200 205 210

CAA GAA GAA CTT GAT GAT TTG AGA CAT AAA GCT TCC TTA CCC ATT GGG 729
Gln Glu Glu Leu Asp Asp Leu Arg His Lys Ala Ser Leu Pro Ile Arg
215 220 225
755 760 765

Gln Asn Phe Gly Leu Ala Asp Leu His Gln Leu Arg Gly Arg Val Gly
770 775 780

Arg Gly Lys Lys Glu Gly Phe Cys Tyr Phe Leu Ile Glu Asp Gln Lys
785 790 795 800

Ser Leu Asn Glu Gln Ala Leu Lys Arg Leu Leu Ala Leu Glu Lys Asn
805 810 815

Ser Tyr Leu Gly Ser Gly Glu Ser Val Ala Tyr His Asp Leu Glu Ile
820 825 830

Arg Gly Gly Gly Asn Leu Leu Gly Gln Asp Gln Ser Gly His Ile Lys
835 840 845

Asn Ile Gly Tyr Ala Leu Tyr Thr Arg Met Leu Glu Asp Ala Ile Tyr
850 855 860

Glu Leu Ser Gly Gly Lys Lys Arg Leu Glu Lys Ser Val Glu Ile Gln
865 870 875 880

Leu Gly Val Ser Ala Phe Leu Asn Pro Glu Leu Ile Ala Ser Asp Ser
885 890 895

Leu Arg Leu Asp Leu Tyr Arg Arg Leu Ser Leu Cys Glu Asn Thr Asp
900 905 910

Glu Val Gly Gln Ile His Glu Glu Ile Glu Asp Arg Phe Gly Lys Ile
915 920 925

Asp Asp Leu Ser Ala Gln Phe Leu Gln Ile Ile Thr Leu Lys Ile Leu
930 935 940

Ala Asn Gln Leu Gly Ile Ile Lys Leu Ser Asn Phe Asn Gln Asn Ile
945 950 955 960

Thr Ile Thr Tyr Ser Asp Glu Lys Lys Glu Ser Leu Lys Ala Pro Ser
965 970 975

Lys Asp Asp Asn Asp Ile Leu Glu Thr Leu Leu Lys His Leu Arg Ala
980 985 990

Gln Ile Ser Leu Lys Arg Arg
995 1

(2) INFORMATION FOR SEQ ID NO: 597: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1217 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 46... 1161 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597:

AGGCTATGAT ATAGTTTATT TTCACAAAAT CTCCAAGGAA AATTG ATG CAA CAT GAA 57
Met Gln His Glu
1
Met Glu Cys Val Ile Ala Pro Phe Val Leu Asn Phe Lys Thr Pro Asp
325 330 335

Gly Ile Phe Ile Ser Leu Asn Ser Phe Glu Arg Lys Lys Lys Arg Gln
340 345 350

Lys Ser Lys Leu Ala Leu Asn Glu Leu Asn Pro Gly Glu Trp Val Val
355 360 365

His Asp Asp Tyr Gly Val Gly Val Phe Ser Gln Leu Val Gln His Ser
370 375 380

Val Leu Gly Ser Lys Arg Asp Phe Leu Glu Ile Ala Tyr Leu Gly Glu
385 390 395 400

Asp Lys Leu Leu Leu Pro Val Glu Asn Leu His Leu Ile Ala Arg Tyr
405 410 415

Val Ala Gln Ser Asp Ser Val Pro Ala Lys Asp Arg Leu Gly Lys Gly
420 425 430

Ser Phe Leu Lys Leu Lys Ala Lys Val Arg Thr Lys Leu Leu Glu Ile
435 440 445

Ala Ser Lys Ile Ile Glu Leu Ala Ala Glu Arg Asn Leu Ile Leu Gly
450 455 460

Lys Lys Met Asp Val His Leu Ala Glu Leu Glu Val Phe Lys Ser His
465 470 475 480

Ala Gly Phe Glu Tyr Thr Ser Asp Gln Glu Lys Ala Ile Ala Glu Ile
485 490 495

Ser Lys Asp Leu Ser Ser His Arg Val Met Asp Arg Leu Leu Ser Gly
500 505 510

Asp Val Gly Phe Gly Lys Thr Glu Val Ala Met His Ala Ile Phe Cys
515 520 525

Ala Phe Leu Asn Gly Phe Gln Ser Ala Leu Val Val Pro Thr Thr Leu
530 535 540

Leu Ala His Gln His Phe Glu Thr Leu Arg Ala Arg Phe Glu Asn Phe
545 550 555 560

Gly Val Lys Val Ala Arg Leu Asp Arg Tyr Ala Ser Glu Lys Asn Lys
565 570 575

Leu Leu Lys Ala Val Glu Leu Gly Gln Val Asp Ala Leu Ile Gly Thr
580 585 590

His Ala Ile Leu Gly Ala Lys Phe Lys Asn Leu Gly Leu Val Val Val
595 600 605

Asp Glu Glu His Lys Phe Gly Val Lys Gln Lys Glu Ala Leu Lys Glu
610 615 620

Leu Ser Lys Ser Val His Phe Leu Ser Met Ser Ala Thr Pro Ile Pro
625 630 635 640

Arg Thr Leu Asn Met Ala Leu Ser Gln Ile Lys Gly Ile Ser Ser Leu
645 650 655

Lys Thr Pro Pro Thr Asp Arg Lys Pro Ser Arg Thr Phe Leu Lys Glu
660 665 670

Lys Asn Asp Glu Leu Leu Lys Glu Ile Ile Tyr Arg Glu Leu Arg Arg
675 680 685

Asn Gly Gln Ile Phe Tyr Ile His Asn His Ile Ala Ser Ile Leu Lys
690 695 700

Val Lys Thr Lys Leu Glu Asp Leu Ile Pro Lys Leu Lys Ile Ala Ile
705 710 715 720

Leu His Ser Gln Ile Asn Ala Asn Glu Ser Glu Glu Ile Met Leu Glu
725 730 735

Phe Ala Lys Gly Asn Tyr Gln Val Leu Leu Cys Thr Ser Ile Val Glu
740 745 750

Ser Gly Ile His Leu Pro Asn Ala Asn Thr Ile Ile Ile Asp Asn Ala
AAAAGATTAT TAA 3101
(2) INFORMATION FOR SEQ ID NO: 596: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596:

Met Ile Gln Ser Ser Leu Tyr Arg Ala Leu Asn Lys Gly Phe Asp Tyr
1 5 10 15

Gln Ile Leu Ala Cys Lys Asp Phe Lys Glu Ser Glu Leu Ala Lys Glu
20 25 30

Val Ile Ser Tyr Phe Lys Pro Asn Thr Lys Ala Ile Leu Phe Pro Glu
35 40 45

Phe Arg Ala Lys Lys Asn Asp Asp Leu Arg Ser Phe Phe Glu Glu Phe
50 55 60

Leu Gln Leu Leu Gly Gly Leu Arg Glu Phe Tyr Gln Ala Leu Glu Asn
65 70 75 80

Lys Gln Glu Thr Ile Ile Ile Ala Pro Ile Ser Ala Leu Leu His Pro
85 90 95

Leu Pro Lys Lys Glu Leu Leu Glu Ser Phe Lys Ile Thr Leu Leu Glu
100 105 110

Lys Tyr Asn Leu Lys Asp Leu Lys Asp Lys Leu Phe Tyr Tyr Gly Tyr
115 120 125

Glu Ile Leu Asp Leu Val Glu Val Glu Gly Glu Ala Ser Phe Arg Gly
130 135 140

Asp Ile Val Asp Ile Tyr Ala Pro Asn Ser Lys Ala Tyr Arg Leu Ser
145 150 155 160

Phe Phe Asp Thr Glu Cys Glu Ser Ile Lys Glu Phe Asp Pro Ile Thr
165 170 175

Gln Met Ser Leu Lys Glu Asp Leu Leu Glu Ile Glu Ile Pro Pro Thr
180 185 190

Leu Phe Ser Leu Asp Glu Ser Ser Tyr Lys Asp Leu Lys Thr Lys Val
195 200 205

Glu Gln Ser Pro Leu Asn Ser Phe Ser Lys Asp Leu Thr Ser Phe Gly
210 215 220

Leu Trp Phe Leu Gly Glu Lys Ala Gln Asp Leu Leu Ile Val Tyr Lys
225 230 235 240

Ser Ile Ile Ser Pro Arg Ala Leu Glu Glu Ile Gln Glu Leu Ala Ser
245 250 255

Leu Asn Glu Leu Asp Cys Glu Arg Phe Lys Phe Leu Lys Val Leu Glu
260 265 270

Asn Ala Gln Gly Tyr Glu Asp Leu Glu Ile His Ala His Ala Leu Glu
275 280 285

Gly Phe Ile Ala Leu His Ser Asn His Lys Ile Thr Leu Leu Ala Pro
290 295 300

Asn Lys Thr Ile Leu Asp Asn Ala Ile Ser Ala Leu Asp Ala Gly Asn
305 310 315 320
GGT AAA AAA GAA GGC TTT TGT TAT TTC CTC ATA GAA GAT CAA AAA AGT 2456
Gly Lys Lys Glu Gly Phe Cys Tyr Phe Leu Ile Glu Asp Gln Lys Ser
790 795 800

TTG AAT GAA GAG GCT TTA AAA CGC TTG CTC GCT TTG GAA AAA AAT TCA 2504
Leu Asn Glu Gln Ala Leu Lys Arg Leu Leu Ala Leu Glu Lys Asn Ser
805 810 815

TAT TTA GGC AGC GGG GAG AGT GTC GCT TAT CAT GAT TTA GAA ATC AGG 2552
Tyr Leu Gly Ser Gly Glu Ser Val Ala Tyr His Asp Leu Glu Ile Arg
820 825 830

GGG GGC GGG AAT TTG CTC GGG CAA GAT CAG AGC GGG CAT ATT AAA AAC 2600
Gly Gly Gly Asn Leu Leu Gly Gln Asp Gln Ser Gly His Ile Lys Asn
835 840 845

ATT GGT TAT GCA CTC TAT ACG CGC ATG CTT GAA GAC GCG ATT TAT GAA 2648
Ile Gly Tyr Ala Leu Tyr Thr Arg Met Leu Glu Asp Ala Ile Tyr Glu
850 855 860 865

TTG AGT GGG GGG AAG AAA AGG CTT GAA AAG AGC GTA GAA ATC CAA CTT 2696
Leu Ser Gly Gly Lys Lys Arg Leu Glu Lys Ser Val Glu Ile Gln Leu
870 875 880

GGC GTG AGC GCT TTT TTA AAC CCT GAA CTC ATT GCA AGC GAT AGT TTG 2744
Gly Val Ser Ala Phe Leu Asn Pro Glu Leu Ile Ala Ser Asp Ser Leu
885 890 895

AGA TTG GAT TTA TAC CGC CGT TTG AGT TTG TGT GAA AAT ACA GAT GAG 2792
Arg Leu Asp Leu Tyr Arg Arg Leu Ser Leu Cys Glu Asn Thr Asp Glu
900 905 910

GTG GGG CAA ATC CAT GAA GAA ATA GAA GAC AGG TTT GGC AAA ATA GAC 2840
Val Gly Gln Ile His Glu Glu Ile Glu Asp Arg Phe Gly Lys Ile Asp
915 920 925

GAT TTG AGC GCT CAA TTT TTG CAA ATC ATT ACG CTT AAA ATT CTA GCC 2888
Asp Leu Ser Ala Gln Phe Leu Gln Ile Ile Thr Leu Lys Ile Leu Ala
930 935 940 945

AAC CAG CTT GGC ATC ATC AAA CTT TCT AAT TTC AAT CAA AAC ATC ACC 2936
Asn Gln Leu Gly Ile Ile Lys Leu Ser Asn Phe Asn Gln Asn Ile Thr
950 955 960

ATC ACT TAT AGC GAT GAA AAG AAA GAA AGC CTG AAA GCC CCA AGC AAA 2984
Ile Thr Tyr Ser Asp Glu Lys Lys Glu Ser Leu Lys Ala Pro Ser Lys
965 970 975

GAC GAT AAC GAT ATT TTA GAA ACC CTT TTG AAA CAT TTG CGC GCT CAA 3032
Asp Asp Asn Asp Ile Leu Glu Thr Leu Leu Lys His Leu Arg Ala Gln
980 985 990

ATT TCT TTA AAG CGG CGT TAAAAGCGTT TGATTTTAGC GTTAATTTTG TTATTTTA 3088
Ile Ser Leu Lys Arg Arg
995 1
GTT AAA GTG GCT CGT TTG GAC AGG TAT GCG AGC GAA AAA AAC AAG CTT 1784
Val Lys Val Ala Arg Leu Asp Arg Tyr Ala Ser Glu Lys Asn Lys Leu
565 570 575

TTA AAG GCG GTG GAA TTA GGG CAA GTT GAT GCG CTA ATA GGC ACG CAT 1832
Leu Lys Ala Val Glu Leu Gly Gln Val Asp Ala Leu Ile Gly Thr His
580 585 590

GCG ATT TTA GGC GCG AAA TTC AAA AAC CTG GGC TTG GTG GTG GTG GAT 1880
Ala Ile Leu Gly Ala Lys Phe Lys Asn Leu Gly Leu Val Val Val Asp
595 600 605

GAA GAG CAT AAA TTT GGC GTG AAA CAA AAA GAA GCT TTA AAA GAA TTG 1928
Glu Glu His Lys Phe Gly Val Lys Gln Lys Glu Ala Leu Lys Glu Leu
610 615 620 625

AGT AAG AGC GTG CAT TTT TTA AGC ATG TCC GCT ACG CCT ATC CCG CGC 1976
Ser Lys Ser Val His Phe Leu Ser Met Ser Ala Thr Pro Ile Pro Arg
630 635 640

ACT CTA AAC ATG GCG CTC TCT CAA ATT AAG GGC ATT AGT TCT TTA AAA 2024
Thr Leu Asn Met Ala Leu Ser Gln Ile Lys Gly Ile Ser Ser Leu Lys
645 650 655

ACC CCG CCC ACA GAC AGA AAG CCC AGC CGC ACT TTT TTG AAA GAA AAG 2072
Thr Pro Pro Thr Asp Arg Lys Pro Ser Arg Thr Phe Leu Lys Glu Lys
660 665 670

AAT GAC GAA CTC TTA AAA GAG ATT ATT TAG AGA GAA TTA CGC CGT AAC 2120
Asn Asp Glu Leu Leu Lys Glu Ile Ile Tyr Arg Glu Leu Arg Arg Asn
675 680 685

GGG CAA ATT TTT TAC ATC CAT AAC CAC ATC GCT AGC ATT TTA AAA GTC 2168
Gly Gln Ile Phe Tyr Ile His Asn His Ile Ala Ser Ile Leu Lys Val
690 695 700 705

AAA ACC AAG CTA GAA GAT TTA ATC CCT AAA CTC AAA ATC GCT ATT TTG 2216
Lys Thr Lys Leu Glu Asp Leu Ile Pro Lys Leu Lys Ile Ala Ile Leu
710 715 720

CAT TCC CAG ATT AAC GCT AAT GAG AGC GAA GAA ATC ATG CTA GAG TTT 2264
His Ser Gln Ile Asn Ala Asn Glu Ser Glu Glu Ile Met Leu Glu Phe
725 730 735

GCC AAG GGA AAT TAT CAG GTT TTA TTA TGC ACT TCT ATT GTG GAA TCA 2312
Ala Lys Gly Asn Tyr Gln Val Leu Leu Cys Thr Ser Ile Val Glu Ser
740 745 750

GGG ATT CAT TTG CCT AAC GCT AAC ACG ATC ATT ATA GAT AAT GCG CAA 2360
Gly Ile His Leu Pro Asn Ala Asn Thr Ile Ile Ile Asp Asn Ala Gln
755 760 765

AAT TTC GGG CTG GCT GAT TTG CAC CAA TTG AGA GGG CGT GTG GGG AGA 2408
Asn Phe Gly Leu Ala Asp Leu His Gln Leu Arg Gly Arg Val Gly Arg
770 775 780 785
ATT TTT ATT TCG CTC AAT TCT TTT GAA AGG AAG AAA AAA CGC CAA AAA 1112
Ile Phe Ile Ser Leu Asn Ser Phe Glu Arg Lys Lys Lys Arg Gln Lys
340 345 350

TCC AAG CTC GCT TTG AAT GAG TTG AAT CCG GGC GAA TGG GTG GTG CAT 1160
Ser Lys Leu Ala Leu Asn Glu Leu Asn Pro Gly Glu Trp Val Val His
355 360 365

GAT GAT TAT GGG GTG GGC GTG TTT TCT CAA TTA GTC CAG CAC AGC GTT 1208
Asp Asp Tyr Gly Val Gly Val Phe Ser Gln Leu Val Gln His Ser Val
370 375 380 385

TTA GGG AGC AAG AGG GAT TTT TTA GAA ATC GCT TAT TTG GGC GAA GAC 1256
Leu Gly Ser Lys Arg Asp Phe Leu Glu Ile Ala Tyr Leu Gly Glu Asp
390 395 400

AAA CTG CTG TTA CCG GTA GAA AAC TTG CAT CTC ATC GCT CGC TAT GTG 1304
Lys Leu Leu Leu Pro Val Glu Asn Leu His Leu Ile Ala Arg Tyr Val
405 410 415

GCG CAA AGC GAT AGC GTG CCA GCT AAA GAC CGG CTA GGG AAA GGG AGC 1352
Ala Gln Ser Asp Ser Val Pro Ala Lys Asp Arg Leu Gly Lys Gly Ser
420 425 430

TTT CTT AAA TTA AAA GCT AAA GTC AGG ACT AAG CTT TTA GAG ATT GCT 1400
Phe Leu Lys Leu Lys Ala Lys Val Arg Thr Lys Leu Leu Glu Ile Ala
435 440 445

AGC AAG ATC ATT GAA TTA GCG GCT GAA CGC AAT TTG ATC TTG GGT AAA 1448
Ser Lys Ile Ile Glu Leu Ala Ala Glu Arg Asn Leu Ile Leu Gly Lys
450 455 460 465

AAG ATG GAT GTG CAT TTA GCG GAG TTG GAA GTC TTT AAA TCG CAT GCG 1496
Lys Met Asp Val His Leu Ala Glu Leu Glu Val Phe Lys Ser His Ala
470 475 480

GGG TTT GAA TAC ACC AGC GAT CAA GAA AAG GCT ATC GCT GAA ATT TCA 1544
Gly Phe Glu Tyr Thr Ser Asp Gln Glu Lys Ala Ile Ala Glu Ile Ser
485 490 495

AAG GAT TTA AGC TCT CAC AGG GTG ATG GAT AGA TTA TTG AGT GGG GAT 1592
Lys Asp Leu Ser Ser His Arg Val Met Asp Arg Leu Leu Ser Gly Asp
500 505 510

GTG GGT TTT GGG AAA ACA GAA GTG GCG ATG CAT GCG ATT TTT TGC GCG 1640
Val Gly Phe Gly Lys Thr Glu Val Ala Met His Ala Ile Phe Cys Ala
515 520 525

TTT TTG AAC GGC TTT CAA AGC GCT TTA GTT GTG CCT ACC ACT TTA TTA 1688
Phe Leu Asn Gly Phe Gln Ser Ala Leu Val Val Pro Thr Thr Leu Leu
530 535 540 545

GCG CAC CAG CAT TTT GAG ACT TTA AGG GCG CGT TTT GAA AAT TTT GGC 1736
Ala His Gln His Phe Glu Thr Leu Arg Ala Arg Phe Glu Asn Phe Gly
550 555 560
TAT AAC CTT AAG GAT TTG AAA GAC AAG CTC TTT TAT TAT GGC TAT GAA 440
Tyr Asn Leu Lys Asp Leu Lys Asp Lys Leu Phe Tyr Tyr Gly Tyr Glu
115 120 125

ATT TTA GAC TTA GTG GAA GTG GAA GGC GAA GCG AGC TTT AGG GGG GAT 488
Ile Leu Asp Leu Val Glu Val Glu Gly Glu Ala Ser Phe Arg Gly Asp
130 135 140 145

ATT GTG GAT ATT TAT GCG CCA AAT TCT AAA GCG TAT CGC TTG AGT TTT 536
Ile Val Asp Ile Tyr Ala Pro Asn Ser Lys Ala Tyr Arg Leu Ser Phe
150 155 160

TTT GAC ACC GAG TGT GAG AGC ATT AAG GAA TTT GAT CCC ATT ACT CAA 584
Phe Asp Thr Glu Cys Glu Ser Ile Lys Glu Phe Asp Pro Ile Thr Gln
165 170 175

ATG AGC CTT AAA GAA GAT TTG TTA GAA ATT GAA ATC CCC CCC ACG CTT 632
Met Ser Leu Lys Glu Asp Leu Leu Glu Ile Glu Ile Pro Pro Thr Leu
180 185 190

TTT AGT TTG GAC GAA TCA TCT TAT AAG GAT CTA AAA ACA AAA GTG GAA 680
Phe Ser Leu Asp Glu Ser Ser Tyr Lys Asp Leu Lys Thr Lys Val Glu
195 200 205

CAA AGC CCC TTA AAT AGC TTT TCT AAA GAT TTA ACC AGT TTT GGT TTG 728
Gln Ser Pro Leu Asn Ser Phe Ser Lys Asp Leu Thr Ser Phe Gly Leu
210 215 220 225

TGG TTT TTA GGA GAA AAA GCA CAA GAC TTA CTA ATC GTT TAT AAA AGC 776
Trp Phe Leu Gly Glu Lys Ala Gln Asp Leu Leu Ile Val Tyr Lys Ser
230 235 240

ATT ATA AGT CCT AGA GCT TTA GAA GAA ATT CAA GAA TTA GCG AGC TTA 824
Ile Ile Ser Pro Arg Ala Leu Glu Glu Ile Gln Glu Leu Ala Ser Leu
245 250 255

AAC GAA TTG GAT TGT GAG CGT TTC AAA TTT TTA AAG GTT TTA GAA AAC 872
Asn Glu Leu Asp Cys Glu Arg Phe Lys Phe Leu Lys Val Leu Glu Asn
260 265 270

GCG CAA GGC TAT GAA GAT TTA GAA ATC CAT GCG CAT GCC CTA GAA GGC 920
Ala Gln Gly Tyr Glu Asp Leu Glu Ile His Ala His Ala Leu Glu Gly
275 280 285

TTT ATC GCT TTG CAT TCA AAT CAT AAA ATC ACG CTC CTA GCC CCC AAT 968
Phe Ile Ala Leu His Ser Asn His Lys Ile Thr Leu Leu Ala Pro Asn
290 295 300 305

AAA ACG ATT TTA GAC AAC GCG ATA AGC GCG CTT GAT GCA GGC AAC ATG 1016
Lys Thr Ile Leu Asp Asn Ala Ile Ser Ala Leu Asp Ala Gly Asn Met
310 315 320

GAA TGC GTC ATC GCC CCC TTT GTG TTA AAC TTT AAA ACC CCT GAT GGG 1064
Glu Cys Val Ile Ala Pro Phe Val Leu Asn Phe Lys Thr Pro Asp Gly
325 330 335
50 55 60

Leu Ile Asn Ala Leu Asn Lys Ile Lys Lys Gly Leu Leu Val Phe
65 70 75

(2) INFORMATION FOR SEQ ID NO: 595:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3101 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 54...3050
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 

GGGGGGCGTT GTTAATCAAT GAGCAAGAAA AGAAAATTGA AAATAAATAG GGA ATG 56
Met
1

ATC CAA TCC AGC CTT TAT AGA GCC TTA AAC AAA GGC TTT GAT TAC CAA 104
Ile Gln Ser Ser Leu Tyr Arg Ala Leu Asn Lys Gly Phe Asp Tyr Gln
5 10 15

ATA CTC GCT TGT AAG GAT TTT AAA GAA TCC GAG CTC GCT AAA GAA GTC 152
Ile Leu Ala Cys Lys Asp Phe Lys Glu Ser Glu Leu Ala Lys Glu Val
20 25 30

ATA AGC TAT TTT AAG CCA AAT ACC AAA GCC ATT CTT TTC CCG GAG TTT 200
Ile Ser Tyr Phe Lys Pro Asn Thr Lys Ala Ile Leu Phe Pro Glu Phe
35 40 45

AGG GCT AAA AAA AAC GAC GAT TTG CGT TCG TTT TTT GAA GAA TTT TTA 248
Arg Ala Lys Lys Asn Asp Asp Leu Arg Ser Phe Phe Glu Glu Phe Leu
50 55 60 65

CAG CTT TTA GGG GGT TTA AGG GAG TTT TAT CAA GCC TTA GAA AAC AAG 296
Gln Leu Leu Gly Gly Leu Arg Glu Phe Tyr Gln Ala Leu Glu Asn Lys
70 75 80

CAA GAA ACT ATC ATC ATT GCC CCG ATT AGC GCG TTA TTG CAC CCT TTA 344
Gln Glu Thr Ile Ile Ile Ala Pro Ile Ser Ala Leu Leu His Pro Leu
85 90 95

CCT AAA AAA GAA CTT TTA GAA AGC TTT AAA ATC ACT CTT TTA GAA AAA 392
Pro Lys Lys Glu Leu Leu Glu Ser Phe Lys Ile Thr Leu Leu Glu Lys
100 105 110
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...287
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:593: 



TAATTTAATA GTTTAGCTAT CATGGAGCAT TCTAAATTAA AGGCGATCAC ATG TTT 56
Met Phe
1

GAA AAA ATA CGC AAG ATT TTA GCG GAT ATT GAA GAT TCG CAA AAT GAA 104
Glu Lys Ile Arg Lys Ile Leu Ala Asp Ile Glu Asp Ser Gln Asn Glu
5 10 15

ATT GAA ATG CTT TTA AAA TTA GCG AAT TTG ACT TTG GGG GAT TTT ATT 152
Ile Glu Met Leu Leu Lys Leu Ala Asn Leu Ser Leu Gly Asp Phe Ile
20 25 30

GAG ATT AAA AGA GGG AGC ATG GAC ATG CCA AAG GGC GTG AAT GAA GCG 200
Glu Ile Lys Arg Gly Ser Met Asp Met Pro Lys Gly Val Asn Glu Ala
35 40 45 50

TTT TTT ACG CAA TTA AGC GAA GAA GTG GAG CGA TTG AAG GAG CTT ATT 248
Phe Phe Thr Gln Leu Ser Glu Glu Val Glu Arg Leu Lys Glu Leu Ile
55 60 65

AAC GCT TTG AAT AAA ATC AAA AAA GGG TTA TTG GTG TTT TAAATGTGTG GG 299
Asn Ala Leu Asn Lys Ile Lys Lys Gly Leu Leu Val Phe
70 75

ATTGTAGGTT ATATAGGGGA TAGCGAGAAA AAATCCGTTC T 340

(2) INFORMATION FOR SEQ ID NO: 594:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 79 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594:



Met Phe Glu Lys Ile Arg Lys Ile Leu Ala Asp Ile Glu Asp Ser Gln
1 5 10 15

Asn Glu Ile Glu Met Leu Leu Lys Leu Ala Asn Leu Ser Leu Gly Asp
20 25 30

Phe Ile Glu Ile Lys Arg Gly Ser Met Asp Met Pro Lys Gly Val Asn
35 40 45

Glu Ala Phe Phe Thr Gln Leu Ser Glu Glu Val Glu Arg Leu Lys Glu
TTGCCCTTTT CAAAAATATA ATAGCTGTAT TTTAAAAGCT CTATCCATAG CCT GAG 56
Pro Glu
1

CTT GGT GAT TTC GCA AGA ATT GGG GTT AAT ATC CAC GCC AAA AAG GCA 104
Leu Gly Asp Phe Ala Arg Ile Gly Val Asn Ile His Ala Lys Lys Ala
5 10 15

GTT TTC AAT AAT GGA TTT TTT AAG ATT AAA AAG TTC TTT TTG GAT GTG 152
Val Phe Asn Asn Gly Phe Phe Lys Ile Lys Lys Phe Phe Leu Asp Val
20 25 30

GTG GTG GGG GTC GTT TTC GCT ATC TGG TTT TAT GTA GTT AAA GAT TTC 200
Val Val Gly Val Val Phe Ala Ile Trp Phe Tyr Val Val Lys Asp Phe
35 40 45 50

ACC CGT TGG CGT GTG GTG AAT GAT GAT TTC ATC GTT TTC TAATTTAAGA TC 251
Thr Arg Trp Arg Val Val Asn Asp Asp Phe Ile Val Phe
55 60

GTAGCGATAC AAGGAAGCAA TAAGTCCTAG CTCATAAGCA A 292

(2) INFORMATION FOR SEQ ID NO: 592:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 63 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:592: 

Pro Glu Leu Gly Asp Phe Ala Arg Ile Gly Val Asn Ile His Ala Lys
1 5 10 15

Lys Ala Val Phe Asn Asn Gly Phe Phe Lys Ile Lys Lys Phe Phe Leu
20 25 30

Asp Val Val Val Gly Val Val Phe Ala Ile Trp Phe Tyr Val Val Lys
35 40 45

Asp Phe Thr Arg Trp Arg Val Val Asn Asp Asp Phe Ile Val Phe
50 55 60

(2) INFORMATION FOR SEQ ID NO:593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 179 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590:



Met Glu Lys Leu Phe Glu Lys Ile Leu His Glu Met Arg Ser Arg Thr
1 5 10 15

Ser Phe Leu Leu Ala Phe Val Val Ser Leu Ile Val Phe Ile Phe Asn
20 25 30

Leu Lys Gly Val Phe Gln Leu Ile Phe Glu Ser Ile Phe Gln Tyr Thr
35 40 45

Gln Asn Lys Ile Leu Ser Phe Ser Leu Ser Phe Phe Phe Ile Phe Phe
50 55 60

Phe Phe Tyr Ala Ile Phe Leu Ile Phe Tyr Gln Ile Phe Leu Trp Tyr
65 70 75 80

Gly Ala Lys Lys Tyr Lys Gln Asn Gln Arg Asp Ser Glu Ile Val Tyr
85 90 95

Asn Ile Gln Lys Phe Pro Asn Glu Ile Lys Glu Glu Leu Tyr Arg Cys
100 105 110

Tyr Ser Lys Lys Gln Asn Lys Ile Leu Arg Thr Lys Lys Leu Asp Asp
115 120 125

Leu Ile Asp Tyr Leu Asp Leu Ile Gly Phe Phe Lys Ser Arg Asp Phe
130 135 140

Phe Glu Pro Thr Lys Asp Asp Tyr Ile Val Lys Pro Asp Val Leu Arg
145 150 155 160

Ala Ile Lys Lys Tyr His Lys Ile Ala Phe Lys Ser Val Tyr Trp Gln
165 170 175

Gln Asn Lys



(2) INFORMATION FOR SEQ ID NO: 591:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 292 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...239
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589:

ACCACCTCAC CGATGAAGAA ATTAAGATCA TTGAAGAGGG GCAGTGA ATG GAA AAG 56
Met Glu Lys
1

TTA TTT GAA AAG ATA TTG CAT GAA ATG AGA TCA AGA ACT TCT TTT CTG 104
Leu Phe Glu Lys Ile Leu His Glu Met Arg Ser Arg Thr Ser Phe Leu
5 10 15

CTT GCT TTT GTC GTT TCG CTT ATT GTT TTT ATT TTT AAT CTA AAA GGG 152
Leu Ala Phe Val Val Ser Leu Ile Val Phe Ile Phe Asn Leu Lys Gly
20 25 30 35

GTT TTT CAA TTG ATT TTT GAG TCT ATT TTC CAA TAC ACC CAA AAC AAA 200
Val Phe Gln Leu Ile Phe Glu Ser Ile Phe Gln Tyr Thr Gln Asn Lys
40 45 50

ATC CTT TCT TTT TCT CTT TCT TTT TTC TTT ATT TTC TTT TTC TTT TAC 248
Ile Leu Ser Phe Ser Leu Ser Phe Phe Phe Ile Phe Phe Phe Phe Tyr
55 60 65

GCT ATT TTT CTT ATT TTT TAT CAA ATA TTT CTT TGG TAT GGG GCT AAA 296
Ala Ile Phe Leu Ile Phe Tyr Gln Ile Phe Leu Trp Tyr Gly Ala Lys
70 75 80

AAA TAT AAA CAA AAT CAA AGA GAT AGT GAA ATT GTC TAT AAC ATT CAA 344
Lys Tyr Lys Gln Asn Gln Arg Asp Ser Glu Ile Val Tyr Asn Ile Gln
85 90 95

AAA TTC CCA AAT GAG ATA AAA GAA GAA CTT TAT CGT TGC TAT TCT AAA 392
Lys Phe Pro Asn Glu Ile Lys Glu Glu Leu Tyr Arg Cys Tyr Ser Lys
100 105 110 115

AAA CAA AAT AAA ATT CTT AGA ACG AAA AAA CTT GAT GAT TTG ATA GAT 440
Lys Gln Asn Lys Ile Leu Arg Thr Lys Lys Leu Asp Asp Leu Ile Asp
120 125 130

TAT CTT GAT TTA ATA GGT TTT TTT AAG TCG AGA GAT TTT TTT GAA CCG 488
Tyr Leu Asp Leu Ile Gly Phe Phe Lys Ser Arg Asp Phe Phe Glu Pro
135 140 145

ACA AAA GAC GAT TAT ATT GTC AAA CCG GAT GTC TTA AGG GCT ATA AAA 536
Thr Lys Asp Asp Tyr Ile Val Lys Pro Asp Val Leu Arg Ala Ile Lys
150 155 160

AAA TAC CAT AAA ATT GCT TTT AAA TCT GTA TAT TGG CAA CAG AAC AAA T 585
Lys Tyr His Lys Ile Ala Phe Lys Ser Val Tyr Trp Gln Gln Asn Lys
165 170 175

AAAAAGCGCT TGTTCTTTGT CTAAGAAACA CCCCCCTTTA AAAAAGGGGG 635

(2) INFORMATION FOR SEQ ID NO: 590: 
Gly Phe Leu Ile Ala Ile Leu Ala Lys Thr Met Trp Phe Val
135 140 145

TTCAACCTCA AAGAATTTAA GACCACTAAG AGTGAGCTAG CGCT 534

(2) INFORMATION FOR SEQ ID NO: 588: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

Met Asp Ala Ile Tyr Pro Tyr Val Leu Val Val His Leu Leu Cys Ala
1 5 10 15

Ile Ile Phe Ile Gly Tyr Leu Phe Phe Asp Gly Val Ile Phe Pro Asn
20 25 30

Val Lys Lys Met Phe Gly Glu Glu Phe Ala Asn Lys Ala Asn Thr Gly
35 40 45

Ile Thr Gln Arg Ala Ile Lys Ile Met Pro Leu Cys Val Leu Gly Leu
50 55 60

Val Leu Thr Gly Gly Met Met Leu Ser Gln Tyr Met Gly Gly Asp Lys
65 70 75 80

Gly Trp Cys Glu Thr Pro Phe Gln Lys Ile Leu Met Leu Lys Val Ile
85 90 95

Leu Ala Leu Ser Ile Phe Leu Leu Val Leu Phe Ser Leu Ser Cys Lys
100 105 110

Phe Leu Gly Lys Lys Asn Pro Ile Gly Lys Tyr Ile His Pro Ile Ala
115 120 125

Leu Thr Phe Gly Phe Leu Ile Ala Ile Leu Ala Lys Thr Met Trp Phe
130 135 140

Val
145

(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 48...584
(D) OTHER INFORMATION: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 534 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 47...481
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587:



AGTAGATTAA AAACTCTTTA ATTTTCCCAT GAATAAAGGA TTTTAA ATG GAT GCG 55
Met Asp Ala
1

ATT TAT CCT TAT GTG TTG GTT GTT CAT TTA TTG TGC GCC ATT ATT TTT 103
Ile Tyr Pro Tyr Val Leu Val Val His Leu Leu Cys Ala Ile Ile Phe
5 10 15

ATT GGC TAC TTG TTT TTT GAT GGG GTA ATT TTC CCT AAT GTG AAG AAA 151
Ile Gly Tyr Leu Phe Phe Asp Gly Val Ile Phe Pro Asn Val Lys Lys
20 25 30 35

ATG TTT GGC GAA GAG TTT GCC AAT AAA GCG AAT ACA GGA ATC ACT CAA 199
Met Phe Gly Glu Glu Phe Ala Asn Lys Ala Asn Thr Gly Ile Thr Gln
40 45 50

AGA GCG ATC AAA ATC ATG CCC TTA TGC GTT TTA GGG CTT GTT TTA ACA 247
Arg Ala Ile Lys Ile Met Pro Leu Cys Val Leu Gly Leu Val Leu Thr
55 60 65

GGG GGC ATG ATG CTT AGC CAA TAC ATG GGG GGC GAT AAA GGC TGG TGT 295
Gly Gly Met Met Leu Ser Gln Tyr Met Gly Gly Asp Lys Gly Trp Cys
70 75 80

GAA ACC CCT TTT CAA AAG ATA CTC ATG CTT AAA GTG ATC TTA GCG TTA 343
Glu Thr Pro Phe Gln Lys Ile Leu Met Leu Lys Val Ile Leu Ala Leu
85 90 95

AGC ATT TTT CTT TTG GTG CTT TTT TCT TTA TCG TGT AAG TTT TTG GGC 391
Ser Ile Phe Leu Leu Val Leu Phe Ser Leu Ser Cys Lys Phe Leu Gly
100 105 110 115

AAG AAA AAC CCT ATT GGT AAA TAT ATC CAC CCT ATC GCT CTA ACT TTT 439
Lys Lys Asn Pro Ile Gly Lys Tyr Ile His Pro Ile Ala Leu Thr Phe
120 125 130

GGC TTT TTA ATC GCC ATT TTA GCC AAA ACG ATG TGG TTT GTT TAAGAGCGT 490
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586:

Met Leu Asn Phe Met Thr Lys Lys Lys Asn Arg Met Gln Asp Cys Lys
1 5 10 15

Met Val Cys Lys Asn Phe Asn Arg Lys Glu Ser Val Leu Ile Ala Gln
20 25 30

Ser Leu Asp Ile Ser Lys Lys Gly Ser Val Ile Leu Gly Ala Leu Leu
35 40 45

Ser Ser Leu Trp Leu Thr Asn Pro Leu Asn Ala His Glu Lys Asn Gly
50 55 60

Ala Phe Val Gly Ile Ser Leu Glu Val Gly Arg Ala Asp Gln Lys Thr
65 70 75 80

Asn Ala Tyr Lys Asn Gly Glu Leu Phe Gln Val Pro Phe Gly Asp Val
85 90 95

Ser Ala Asn Asp Asp Gly Lys Val Pro Asp Gly Gln Thr Gly Gly Cys
100 105 110

Gln Pro Ala Ser Gly Thr Pro Gly Thr Pro Gly Tyr Thr Lys Ala Asn
115 120 125

Cys Val Val Asn Trp Thr Ser Arg Thr Met Leu Ser Thr Asn Lys Asn
130 135 140

Ile Pro Gly Arg Asn Gln Pro Met Tyr Gly Leu Gly Val Met Thr Gly
145 150 155 160

Tyr Lys His Phe Ile Gly Lys Lys Arg Trp Phe Gly Leu Arg Tyr Tyr
165 170 175

Gly Phe Phe Asp Tyr Gly His Thr Asn Phe Ser Asn Ser Arg Ala Ala
180 185 190

Asn Ala Ile Ser Pro Phe Tyr Leu Ser Asp Gln Lys Ala Asp Met Tyr
195 200 205

Thr Tyr Gly Phe Gly Thr Asp Met Leu Phe Asn Ile Ile Asp Lys Pro
210 215 220

Lys Ala Thr Ala Gly Phe Phe Leu Gly Val Asn Phe Ala Gly Asn Thr
225 230 235 240

Trp Thr Asn Asn Arg Val Gly Tyr Phe Lys Asp Gly Tyr Val Tyr Gly
245 250 255

Val Asn Thr Asp Ala Asp Ala Tyr Met Thr Asn Ala Asp Gly Thr Ile
260 265 270

Thr Cys Gly Asp Thr Thr Pro Ala Ser Cys Asn Val Gly Ile Asn Pro
275 280 285

Asn Ser Val Tyr Thr Thr Gly Lys Leu Asn Ala Lys Val Asn His Thr
290 295 300

Ile Phe Gln Phe Leu Val Asn Val Gly Ile Arg Thr Asn Ile Phe Glu
305 310 315 320

His His Gly Ile Glu Phe Gly Ile Lys Ile Pro Thr Leu Pro Asn Tyr
325 330 335

Phe Phe Lys Gly Ser Thr Thr Ile Arg Ala Lys Lys Gln Gly Pro Leu
340 345 350

Glu Asn Gly Gln Pro Thr Thr Ile Thr Gly Ala Glu Thr Asn Phe Ser
355 360 365

Leu Thr Gln Thr Leu Arg Arg Gln Tyr Ser Met Tyr Leu Arg Tyr Val
370 375 380

Tyr Thr Phe
385

(2) INFORMATION FOR SEQ ID NO: 587: 
Ala Thr Ala Gly Phe Phe Leu Gly Val Asn Phe Ala Gly Asn Thr Trp
230 235 240

ACT AAT AAT CGT GTG GGG TAT TTT AAG GAC GGG TAT GTT TAT GGC GTC 825
Thr Asn Asn Arg Val Gly Tyr Phe Lys Asp Gly Tyr Val Tyr Gly Val
245 250 255

AAT ACG GAC GCT GAC GCT TAC ATG ACT AAC GCT GAT GGC ACA ATC ACT 873
Asn Thr Asp Ala Asp Ala Tyr Met Thr Asn Ala Asp Gly Thr Ile Thr
260 265 270

TGC GGG GAC ACG ACG CCG GCG AGT TGC AAT GTG GGG ATT AAC CCT AAT 921
Cys Gly Asp Thr Thr Pro Ala Ser Cys Asn Val Gly Ile Asn Pro Asn
275 280 285

AGC GTC TAT ACC ACA GGA AAA TTG AAC GCT AAG GTG AAT CAC ACG ATT 969
Ser Val Tyr Thr Thr Gly Lys Leu Asn Ala Lys Val Asn His Thr Ile
290 295 300 305

TTC CAA TTT TTA GTG AAT GTG GGC ATT AGA ACT AAT ATT TTT GAA CAC 1017
Phe Gln Phe Leu Val Asn Val Gly Ile Arg Thr Asn Ile Phe Glu His
310 315 320

CAT GGC ATT GAG TTT GGC ATC AAA ATC CCC ACG CTC CCT AAC TAC TTT 1065
His Gly Ile Glu Phe Gly Ile Lys Ile Pro Thr Leu Pro Asn Tyr Phe
325 330 335

TTC AAA GGT TCT ACT ACC ATA AGA GCG AAA AAA CAA GGC CCG CTA GAG 1113
Phe Lys Gly Ser Thr Thr Ile Arg Ala Lys Lys Gln Gly Pro Leu Glu
340 345 350

AAT GGC CAA CCA ACC ACT ATC ACC GGA GCA GAA ACC AAT TTC AGC TTA 1161
Asn Gly Gln Pro Thr Thr Ile Thr Gly Ala Glu Thr Asn Phe Ser Leu
355 360 365

ACC CAA ACC TTA GGC CGT CAG TAT TCT ATG TAT TTG CGC TAT GTT TAT 1209
Thr Gln Thr Leu Arg Arg Gln Tyr Ser Met Tyr Leu Arg Tyr Val Tyr
370 375 380 385

ACT TTT TAAGTTTGGT AGGGTTTTTA GGCAAGGCTT AGAGATGAA 1254
Thr Phe



(2) INFORMATION FOR SEQ ID NO: 586:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 387 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
5 10 15

GTT TGT AAA AAT TTT AAT CGT AAG GAA TCT GTT TTG ATA GCT CAA TCT 153
Val Cys Lys Asn Phe Asn Arg Lys Glu Ser Val Leu Ile Ala Gln Ser
20 25 30

TTA GAT ATT TCT AAA AAA GGT TCG GTA ATT TTA GGC GCT CTT TTG AGT 201
Leu Asp Ile Ser Lys Lys Gly Ser Val Ile Leu Gly Ala Leu Leu Ser
35 40 45

TCG TTA TGG CTG ACA AAC CCC TTA AAT GCC CAT GAA AAG AAT GGC GCG 249
Ser Leu Trp Leu Thr Asn Pro Leu Asn Ala His Glu Lys Asn Gly Ala
50 55 60 65

TTT GTG GGG ATT AGC TTG GAA GTG GGT AGG GCC GAT CAA AAG ACA AAC 297
Phe Val Gly Ile Ser Leu Glu Val Gly Arg Ala Asp Gln Lys Thr Asn
70 75 80

GCT TAT AAA AAC GGC GAG TTG TTT CAA GTG CCT TTT GGC GAT GTT TCG 345
Ala Tyr Lys Asn Gly Glu Leu Phe Gln Val Pro Phe Gly Asp Val Ser
85 90 95

GCT AAT GAT GAT GGC AAA GTT CCT GAC GGG CAG ACC GGT GGC TGT CAG 393
Ala Asn Asp Asp Gly Lys Val Pro Asp Gly Gln Thr Gly Gly Cys Gln
100 105 110

CCA GCT TCA GGG ACG CCA GGA ACG CCA GGC TAC ACT AAA GCT AAC TGC 441
Pro Ala Ser Gly Thr Pro Gly Thr Pro Gly Tyr Thr Lys Ala Asn Cys
115 120 125

GTG GTC AAT TGG ACT TCG CGC ACC ATG CTT AGC ACC AAT AAA AAC ATT 489
Val Val Asn Trp Thr Ser Arg Thr Met Leu Ser Thr Asn Lys Asn Ile
130 135 140 145

CCT GGC CGT AAC CAG CCG ATG TAT GGG CTA GGC GTG ATG ACA GGC TAT 537
Pro Gly Arg Asn Gln Pro Met Tyr Gly Leu Gly Val Met Thr Gly Tyr
150 155 160

AAG CAT TTT ATC GGT AAA AAA AGA TGG TTT GGG TTG CGC TAT TAC GGC 585
Lys His Phe Ile Gly Lys Lys Arg Trp Phe Gly Leu Arg Tyr Tyr Gly
165 170 175

TTT TTT GAT TAT GGG CAT ACC AAT TTC TCT AAC TCC AGA GCC GCT AAC 633
Phe Phe Asp Tyr Gly His Thr Asn Phe Ser Asn Ser Arg Ala Ala Asn
180 185 190

GCT ATA TCG CCT TTT TAT TTG AGC GAT CAA AAA GCC GAC ATG TAT ACT 681
Ala Ile Ser Pro Phe Tyr Leu Ser Asp Gln Lys Ala Asp Met Tyr Thr
195 200 205

TAT GGT TTT GGC ACA GAC ATG CTT TTT AAC ATT ATA GAT AAG CCT AAA 729
Tyr Gly Phe Gly Thr Asp Met Leu Phe Asn Ile Ile Asp Lys Pro Lys
210 215 220 225

GCC ACG GCC GGG TTT TTT TTA GGC GTG AAT TTT GCG GGT AAC ACT TGG 777
Ile Asp Pro Phe Leu Glu Asp Asp Ala Ala Lys Ser Ile Asp Glu Ala
100 105 110

Lys Arg Leu Phe Lys Thr Leu Asn Arg Pro Asn Val Met Ile Lys Val
115 120 125

Pro Ala Ser Glu Ser Gly Ile Glu Val Val Ser Ala Leu Thr Gln Ala
130 135 140

Ser Ile Pro Val Asn Val Thr Leu Val Phe Ser Pro Lys Ile Ala Gly
145 150 155 160

Glu Ile Ala Gln Ile Leu Ala Lys Glu Ala Gln Lys Arg Ala Val Ile
165 170 175

Ser Val Phe Val Ser Arg Phe Asp Lys Glu Ile Asp Pro Leu Val Pro
180 185 190

Lys Asn Leu Gln Ala Gln Ser Gly Ile Ile Asn Ala Thr Glu Cys Tyr
195 200 205

Tyr Gln Ile Asn Gln His Ala Asn Lys Leu Thr Ser Thr Leu Phe Ala
210 215 220

Ser Thr Gly Val Lys Ser Asn Ser Leu Ala Lys Asp Tyr Tyr Ile Lys
225 230 235 240

Ala Leu Cys Phe Lys Asn Ser Ile Asn Thr Ala Pro Leu Glu Ala Leu
245 250 255

Asn Ala Tyr Leu Leu Asp Pro Asn Thr Glu Cys Gln Thr Pro Leu Lys
260 265 270

Thr Thr Glu Ile Glu Ala Phe Lys Lys Glu Leu Lys Val His Asn Ile
275 280 285

Asp Leu Glu Asn Thr Ala Gln Lys Leu Leu Lys Glu Gly Leu Ile Ala
290 295 300

Phe Lys Gln Ser Phe Glu Lys Leu Leu Ser Ser Phe
305 310 315

(2) INFORMATION FOR SEQ ID NO: 585:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1254 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 55...1215
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585:

AGCTATAATA AAATAATTAA AAAAGTAACA CTTAAGCGGA GACCCTAGAG AGTG ATG 57 

Met 
1 

CTC AAT TTT ATG ACA AAG AAG AAA AAT AGA ATG CAA GAT TGC AAA ATG 105 
Leu Asn Phe Met Thr Lys Lys Lys Asn Arg Met Gln Asp Cys Lys Met
ATT AAT CAG CAT GCC AAT AAG CTA ACA AGC ACC CTT TTT GCA TCC ACA 728
Ile Asn Gln His Ala Asn Lys Leu Thr Ser Thr Leu Phe Ala Ser Thr
215 220 225

GGC GTT AAA TCC AAT TCT TTA GCT AAA GAT TAC TAC ATT AAA GCG CTG 776
Gly Val Lys Ser Asn Ser Leu Ala Lys Asp Tyr Tyr Ile Lys Ala Leu
230 235 240

TGT TTT AAA AAC TCT ATC AAT ACA GCC CCT CTA GAG GCT TTA AAC GCT 824
Cys Phe Lys Asn Ser Ile Asn Thr Ala Pro Leu Glu Ala Leu Asn Ala
245 250 255

TAT TTG CTT GAC CCA AAC ACC GAG TGT CAA ACC CCT TTA AAG ACT ACA 872
Tyr Leu Leu Asp Pro Asn Thr Glu Cys Gln Thr Pro Leu Lys Thr Thr
260 265 270

GAA ATT GAA GCG TTT AAA AAA GAA TTA AAA GTG CAC AAC ATT GAT TTA 920
Glu Ile Glu Ala Phe Lys Lys Glu Leu Lys Val His Asn Ile Asp Leu
275 280 285 290

GAA AAC ACC GCT CAA AAA CTC CTT AAA GAA GGC TTG ATA GCG TTC AAA 968
Glu Asn Thr Ala Gln Lys Leu Leu Lys Glu Gly Leu Ile Ala Phe Lys
295 300 305

CAA TCC TTT GAA AAG CTT TTA AGC AGT TTT TGATTTTTAA GGGTTTTTTG GAT 1021
Gln Ser Phe Glu Lys Leu Leu Ser Ser Phe
310 315

AGAATAAGCC CTTATTTTAT TTTAAAGGAT 1051

(2) INFORMATION FOR SEQ ID NO: 584:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 316 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

Met Gln Glu Phe Ser Leu Trp Cys Asp Phe Ile Glu Arg Asp Phe Leu
1 5 10 15

Glu Asn Asp Phe Leu Lys Leu Ile Asn Lys Gly Ala Ile Cys Gly Ala
20 25 30

Thr Ser Asn Pro Ser Leu Phe Cys Glu Ala Ile Thr Lys Ser Ala Phe
35 40 45

Tyr Lys Asp Glu Ile Ala Lys Leu Lys Gly Lys Lys Ala Lys Glu Ile
50 55 60

Tyr Glu Thr Leu Ala Leu Lys Asp Ile Leu Gln Ala Ser Ser Ala Leu
65 70 75 80

Met Pro Leu Tyr Glu Lys Asp Pro Asn Asn Gly Tyr Ile Ser Leu Glu
85 90 95
ACCAAGAGGT CGTTAAAAGC TATTACCAAC ATTTAAAACA AGGATAAAAC ATG CAA 56
Met Gln
1

GAA TTC AGT TTG TGG TGC GAT TTT ATA GAA AGG GAT TTT TTA GAA AAC 104
Glu Phe Ser Leu Trp Cys Asp Phe Ile Glu Arg Asp Phe Leu Glu Asn
5 10 15

GAC TTT TTA AAG CTC ATT AAT AAG GGG GCT ATT TGC GGG GCA ACG AGT 152
Asp Phe Leu Lys Leu Ile Asn Lys Gly Ala Ile Cys Gly Ala Thr Ser
20 25 30

AAC CCT AGT TTG TTT TGC GAA GCG ATC ACA AAA AGC GCG TTT TAT AAA 200
Asn Pro Ser Leu Phe Cys Glu Ala Ile Thr Lys Ser Ala Phe Tyr Lys
35 40 45 50

GAT GAA ATC GCT AAA CTC AAA GGC AAA AAA GCT AAA GAA ATT TAT GAA 248
Asp Glu Ile Ala Lys Leu Lys Gly Lys Lys Ala Lys Glu Ile Tyr Glu
55 60 65

ACT CTG GCG TTA AAG GAT ATT TTA CAA GCT TCT AGC GCG TTG ATG CCT 296
Thr Leu Ala Leu Lys Asp Ile Leu Gln Ala Ser Ser Ala Leu Met Pro
70 75 80

TTA TAT GAA AAA GAC CCT AAC AAT GGC TAC ATT AGC CTA GAA ATT GAC 344
Leu Tyr Glu Lys Asp Pro Asn Asn Gly Tyr Ile Ser Leu Glu Ile Asp
85 90 95

CCT TTT TTA GAA GAT GAT GCC GCT AAA AGC ATT GAT GAA GCC AAG CGG 392
Pro Phe Leu Glu Asp Asp Ala Ala Lys Ser Ile Asp Glu Ala Lys Arg
100 105 110

TTG TTC AAA ACA TTA AAC CGC CCT AAT GTG ATG ATT AAA GTC CCA GCG 440
Leu Phe Lys Thr Leu Asn Arg Pro Asn Val Met Ile Lys Val Pro Ala
115 120 125 130

AGT GAA AGC GGG ATT GAA GTG GTT AGC GCT TTA ACT CAA GCC TCT ATT 488
Ser Glu Ser Gly Ile Glu Val Val Ser Ala Leu Thr Gln Ala Ser Ile
135 140 145

CCT GTT AAT GTA ACT TTA GTC TTT TCG CCT AAA ATT GCC GGT GAA ATC 536
Pro Val Asn Val Thr Leu Val Phe Ser Pro Lys Ile Ala Gly Glu Ile
150 155 160

GCT CAA ATC TTA GCC AAA GAA GCG CAA AAA AGA GCG GTC ATT AGC GTG 584
Ala Gln Ile Leu Ala Lys Glu Ala Gln Lys Arg Ala Val Ile Ser Val
165 170 175

TTT GTC TCA CGA TTT GAC AAA GAA ATA GAC CCT TTA GTG CCA AAA AAT 632
Phe Val Ser Arg Phe Asp Lys Glu Ile Asp Pro Leu Val Pro Lys Asn
180 185 190

TTG CAA GCT CAA AGC GGG ATT ATC AAC GCT ACC GAG TGC TAT TAT CAA 680
Leu Gln Ala Gln Ser Gly Ile Ile Asn Ala Thr Glu Cys Tyr Tyr Gln
195 200 205 210
AAA AAT AGA GCG CGA AAG CAA ATC CCT AAT ATG CAC CAT GCC GAT AAT 319
Lys Asn Arg Ala Arg Lys Gln Ile Pro Asn Met His His Ala Asp Asn
40 45 50

GTT ATC CTT AGA ACC CTT GCA ATA AGG GTA GCG CGT GAA ATG GCC TTT T 368
Val Ile Leu Arg Thr Leu Ala Ile Arg Val Ala Arg Glu Met Ala Phe
55 60 65

AAAACAATGT CTATATTTTC TTCATAGCTG TTTTCTTCAT CCAAACACAC CATGTCTTTT 428

CGTGGGGTCA TGATTTCTTT AG 450

(2) INFORMATION FOR SEQ ID NO: 582:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:582: 

Met Asp Leu Leu Phe Phe His Phe Asn Lys Asp Leu Arg Asn Gly Ser
1 5 10 15

Ala Phe Gly Asp Asp Asp His Phe Pro Asn Asp Leu Ile Glu Ile Met
20 25 30

His Phe Gly Gly Lys Asn Arg Ala Arg Lys Gln Ile Pro Asn Met His
35 40 45

His Ala Asp Asn Val Ile Leu Arg Thr Leu Ala Ile Arg Val Ala Arg
50 55 60

Glu Met Ala Phe
65

(2) INFORMATION FOR SEQ ID NO:583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...998
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 
Val Asn Ala Glu Asn Leu Asn Pro Pro Phe Thr Trp Ser Leu Gly Leu
225 230 235 240

Leu Gln Met Asp Phe Phe Tyr Arg Lys Lys Ser Trp Tyr Ala Gly Gln
245 250 255

Phe Met Arg Lys Ile Thr Pro Lys Thr Glu Ile Glu Asn Lys Ile Asp
260 265 270

Leu Arg Ile Ala Asn Tyr Phe Thr Thr Leu Leu Asn Ala Leu Lys Arg
275 280 285

Pro Leu Leu Ser Val Leu Val Arg Asp Ile Asp Lys Thr Phe Arg Glu
290 295 300

Gln Lys Ile Gln Leu Pro Leu Lys Pro Thr Ala Lys Thr Gln Thr Leu
305 310 315 320

Asp Gly Ile Asp Phe Asp Phe Met His Thr Leu Ile Asn Ala Leu Met
325 330 335

Lys Gln Thr Ile Gln Gly Val Ala Gln Tyr Cys Asp Ala Lys Ile Gln
340 345 350

Ala Thr Lys Glu Val Ile Ser Gln Glu Ala Pro Val Gln Lys Asp Ser
355 360 365

Leu Phe
370

(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 450 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 164...367
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 

AATCCAGCAT GCCCTCTAAT TCAAACACGC CCTCTTCAAG CTTGTTTATG CCCTCTTGTT 60

TTAAGTCGTA TTCGTCGCTA ATCTCGCCCA TGATCTCTTC AATGATGTCT TCCATAGTGA 120

GCAACCCGGC TGTGCCGCCG TATTCATCAA TCACCAAAGC GGT ATG GAT TTG CTC 175
Met Asp Leu Leu
1

TTT TTT CAT TTT AAT AAG GAT TTG AGA AAT GGA AGC GCT TTC GGG GAC 223
Phe Phe His Phe Asn Lys Asp Leu Arg Asn Gly Ser Ala Phe Gly Asp
5 10 15 20

GAT GAT CAT TTT CCT AAC GAT TTG ATT GAA ATC ATG CAT TTT GGG GGT 271
Asp Asp His Phe Pro Asn Asp Leu Ile Glu Ile Met His Phe Gly Gly
25 30 35
CTA ATG AAG CAA ACC ATT CAA GGC GTG GCT CAA TAC TGC GAC GCT AAA 1239
Leu Met Lys Gln Thr Ile Gln Gly Val Ala Gln Tyr Cys Asp Ala Lys
335 340 345 350

ATA CAA GCT ACA AAA GAG GTT ATC AGC CAA GAA GCG CCC GTT CAA AAA 1287
Ile Gln Ala Thr Lys Glu Val Ile Ser Gln Glu Ala Pro Val Gln Lys
355 360 365

GAC TCG TTA TTT TAAAAGGGGT TTTAAGCGCG CTAGCTTGTG TTACAATAAA CTTAA 1344
Asp Ser Leu Phe
370

AATTCGCTTG 1354

(2) INFORMATION FOR SEQ ID NO: 580:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 370 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 

Val Arg Ala Ala Thr Asn Asn Gly Ile Lys Gly Phe Ile Ile Asp Asp
1 5 10 15

Pro Thr Phe Ala Asn Lys Lys Asn Thr Leu Ser Phe Ala Gln Asp Thr
20 25 30

Phe Thr Val Phe Tyr Gln Lys Gln Pro Tyr Phe Thr Gly Asn Lys Val
35 40 45

Lys Ile Leu Lys Pro Lys Phe Ala Phe Lys Ser Pro Lys Ile Leu His
50 55 60

Ser Ile Ser Ala Ile Leu Gln Phe Ile Leu Lys Pro Leu Thr Trp Gly
65 70 75 80

Leu Gly Ser Thr Thr Glu Ser Ile Ala Glu Phe Lys Phe Ser Leu Pro
85 90 95

Leu Lys Pro Thr Ala Asn Ala Gln Thr Leu Glu Asp Ile Asp Phe Asp
100 105 110

Phe Met Glu Lys Phe Ile Ala Glu Leu Glu Gln Cys Arg Leu Ala Glu
115 120 125

Leu Glu Gln Cys Arg Leu Ala Glu Leu Gln Ala Tyr Leu Lys Ala Thr
130 135 140

Gly Leu Glu Asn Thr Thr Leu Ser Asn Asp Glu Glu Asn Ala Leu Asn
145 150 155 160

Val Phe Asn Asn Ser Gly Gly Gly Gly Gly Asn Thr Pro Cys Gly Leu
165 170 175

Thr Trp Gln Ser Phe Arg Leu Gly Asp Leu Phe Glu Ile Glu Lys Thr
180 185 190

Leu Ser Phe Asn Lys Asp Ala Leu Thr Gln Gly Glu Asp Tyr Asp Tyr
195 200 205

Ile Thr Arg Thr Ser Gln Asn Gln Gly Val Leu Gln Thr Thr Gly Phe
210 215 220
TTT GAT TTC ATG GAA AAA TTC ATA GCC GAA CTT GAG CAG TGT CGG CTC 567
Phe Asp Phe Met Glu Lys Phe Ile Ala Glu Leu Glu Gln Cys Arg Leu
115 120 125

GCC GAA CTT GAG CAG TGT CGG CTC GCC GAA CTT CAG GCT TAT TTA AAA 615
Ala Glu Leu Glu Gln Cys Arg Leu Ala Glu Leu Gln Ala Tyr Leu Lys
130 135 140

GCT ACA GGG CTA GAA AAC ACC ACC CTT TCT AAC GAT GAA GAA AAC GCC 663
Ala Thr Gly Leu Glu Asn Thr Thr Leu Ser Asn Asp Glu Glu Asn Ala
145 150 155

CTT AAT GTT TTC AAT AAT TCT GGG GGG GGG GGG GGT AAT ACC CCA TGC 711
Leu Asn Val Phe Asn Asn Ser Gly Gly Gly Gly Gly Asn Thr Pro Cys
160 165 170

GGC TTA ACA TGG CAA AGC TTT AGA TTA GGG GAT TTG TTT GAA ATT GAA 759
Gly Leu Thr Trp Gln Ser Phe Arg Leu Gly Asp Leu Phe Glu Ile Glu
175 180 185 190

AAA ACC TTA AGC TTT AAT AAA GAC GCT TTA ACG CAA GGA GAA GAT TAT 807
Lys Thr Leu Ser Phe Asn Lys Asp Ala Leu Thr Gln Gly Glu Asp Tyr
195 200 205

GAT TAT ATT ACA AGG ACT TCG CAA AAT CAA GGC GTT TTG CAA ACT ACA 855
Asp Tyr Ile Thr Arg Thr Ser Gln Asn Gln Gly Val Leu Gln Thr Thr
210 215 220

GGA TTT GTC AAT GCA GAA AAT TTA AAC CCA CCA TTT ACT TGG AGT TTA 903
Gly Phe Val Asn Ala Glu Asn Leu Asn Pro Pro Phe Thr Trp Ser Leu
225 230 235

GGG CTT TTG CAA ATG GAT TTT TTC TAT CGT AAA AAG TCA TGG TAT GCG 951
Gly Leu Leu Gln Met Asp Phe Phe Tyr Arg Lys Lys Ser Trp Tyr Ala
240 245 250

GGA CAA TTC ATG CGA AAA ATC ACA CCA AAA ACT GAA ATT GAA AAT AAA 999
Gly Gln Phe Met Arg Lys Ile Thr Pro Lys Thr Glu Ile Glu Asn Lys
255 260 265 270

ATT GAT TTA CGC ATA GCC AAC TAC TTC ACA ACG CTT TTA AAC GCC TTA 1047
Ile Asp Leu Arg Ile Ala Asn Tyr Phe Thr Thr Leu Leu Asn Ala Leu
275 280 285

AAA CGC CCT TTA TTA AGC GTA TTG GTT AGA GAT ATT GAT AAA ACT TTT 1095
Lys Arg Pro Leu Leu Ser Val Leu Val Arg Asp Ile Asp Lys Thr Phe
290 295 300

AGG GAG CAA AAA ATC CAA CTA CCC CTA AAA CCC ACC GCT AAA ACT CAA 1143
Arg Glu Gln Lys Ile Gln Leu Pro Leu Lys Pro Thr Ala Lys Thr Gln
305 310 315

ACC CTT GAT GGT ATT GAT TTT GAT TTC ATG CAC ACC CTT ATC AAC GCC 1191
Thr Leu Asp Gly Ile Asp Phe Asp Phe Met His Thr Leu Ile Asn Ala
320 325 330
Glu Lys Gly Tyr Thr Leu Gln Ala Asp Val Arg Arg Val Tyr Ala Phe
225 230 235 240

Gln Ile Ser Tyr Leu Arg Asp Phe
245 

(2) INFORMATION FOR SEQ ID NO: 579:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1354 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 190... 1299 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

AGTAAGCTTG ATTTTAAAAA ACCAAATGCC CCCAAAGCGA TAGGCCCCCT TAATAGCCAA 6 0 

CTCAACGCTA TTAAGTGGGG CGAGTTCAGA TTGGGGGATT TGTTTGAAGT GTTGTCAAGT 12 0 

AAGAAAATTT ATCATGCCAA CACGATAAAA ATCCATGACA CGCAAATAGA AAACAGCTAC 180 

CCTTATGTC GTG CGC GCT GCA ACC AAT AAT GGT ATA AAA GGC TTT ATT ATA 231 
Val Arg Ala Ala Thr Asn Asn Gly Ile Lys Gly Phe Ile Ile
1 5 10

GAT GAC CCT ACA TTT GCT AAT AAA AAA AAT ACC CTT TCG TTC GCG CAA 279 
Asp Asp Pro Thr Phe Ala Asn Lys Lys Asn Thr Leu Ser Phe Ala Gin 
15 20 25 30 

GAC ACT TTC ACT GTG TTT TAT CAA AAA CAA CCT TAT TTT ACA GGC AAT 327 
Asp Thr Phe Thr Val Phe Tyr Gin Lys Gin Pro Tyr Phe Thr Gly Asn 
35 40 45 

AAG GTT AAA ATT TTA AAA CCA AAA TTT GCT TTC AAA AGC CCT AAA ATT 3 75 

Lys Val Lys lie Leu Lys Pro Lys Phe Ala Phe Lys Ser Pro Lys lie 
50 55 60 

TTA CAT TCT ATA AGC GCG ATT TTA CAA TTT ATT TTA AAA CCC TTA ACT 42 3 

Leu His Ser lie Ser Ala lie Leu Gin Phe lie Leu Lys Pro Leu Thr 
65 70 75 

TGG GGG CTA GGC TCT ACA ACA GAA AGC ATT GCG GAG TTT AAA TTT TCT 471 
Trp Gly Leu Gly Ser Thr Thr Glu Ser lie Ala Glu Phe Lys Phe Ser 
80 85 90 

CTA CCC CTA AAA CCC ACC GCT AAC GCT CAA ACC CTT GAG GAT ATT GAT 519 
Leu Pro Leu Lys Pro Thr Ala Asn Ala Gin Thr Leu Glu Asp lie Asp 
95 100 105 110 
AAA CAC AAT AGC CTA GAA TTA GGG ATT AAA GTG CCT ACG ATC ACA CAC 678
Lys His Asn Ser Leu Glu Leu Gly Ile Lys Val Pro Thr Ile Thr His
205 210 215

CAG CTT TTC TCT CTT ACC AAC GAA AAG GGA TAC ACC TTA CAG GCT GAT 726
Gln Leu Phe Ser Leu Thr Asn Glu Lys Gly Tyr Thr Leu Gln Ala Asp
220 225 230

GTG CGT AGA GTT TAT GCG TTT CAA ATC AGT TAC TTG AGG GAT TTT TAACC 776
Val Arg Arg Val Tyr Ala Phe Gln Ile Ser Tyr Leu Arg Asp Phe
235 240 245 

CCTTTTTAGA TACAATCACG CCTGAAACTA TCCA 810 

(2) INFORMATION FOR SEQ ID NO: 578:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578:

Met Leu Lys Arg Met Ile Leu Leu Gly Ala Leu Gly Val Leu Ala Ser
1 5 10 15

Ala Glu Glu Ser Ala Ala Phe Val Gly Val Asn Tyr Gln Val Ser Met
20 25 30

Ile Gln Asn Gln Thr Lys Met Val Asn Asp Asn Gly Leu Gln Lys Pro
35 40 45

Leu Ile Lys Phe Pro Pro Tyr Ala Gly Ala Gly Phe Glu Val Gly Tyr
50 55 60

Lys Gln Phe Phe Gly Lys Lys Lys Trp Phe Gly Met Arg Tyr Tyr Gly
65 70 75 80

Phe Phe Asp Tyr Ala His Asn Arg Phe Gly Val Met Lys Lys Gly Ile
85 90 95

Pro Val Gly Asp Ser Gly Phe Ile Tyr Asn Ser Phe Ser Phe Gly Gly
100 105 110

Asn Thr Leu Thr Glu Arg Asp Ser Tyr Gln Gly Gln Tyr Tyr Val Asn
115 120 125

Leu Phe Thr Tyr Gly Val Gly Leu Asp Thr Leu Trp Asn Phe Val Asn
130 135 140

Lys Glu Asn Met Val Phe Gly Phe Val Val Gly Ile Gln Leu Ala Gly
145 150 155 160

Asp Ser Trp Ala Thr Ser Ile Ser Lys Glu Ile Ala His Tyr Ala Lys
165 170 175

His His Ser Asn Ser Ser Tyr Ser Pro Ala Asn Phe Gln Phe Leu Trp
180 185 190

Lys Phe Gly Val Arg Thr His Ile Ala Lys His Asn Ser Leu Glu Leu
195 200 205

Gly Ile Lys Val Pro Thr Ile Thr His Gln Leu Phe Ser Leu Thr Asn
210 215 220
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577:

TATCAAAAAA TAAAGGGAAA AGACTGA ATG TTG AAA AGA ATG ATA TTA TTA GGG 54
Met Leu Lys Arg Met Ile Leu Leu Gly
1 5 

GCT TTG GGT GTT TTA GCG AGC GCT GAA GAG AGT GCG GCT TTT GTG GGA 102 
Ala Leu Gly Val Leu Ala Ser Ala Glu Glu Ser Ala Ala Phe Val Gly 
10 15 20 25 

GTC AAT TAC CAG GTG AGC ATG ATA CAA AAT CAG ACT AAA ATG GTG AAT 150
Val Asn Tyr Gln Val Ser Met Ile Gln Asn Gln Thr Lys Met Val Asn
30 35 40

GAC AAC GGC TTG CAA AAG CCT TTG ATA AAG TTT CCG CCT TAC GCA GGA 198
Asp Asn Gly Leu Gln Lys Pro Leu Ile Lys Phe Pro Pro Tyr Ala Gly
45 50 55 

GCG GGT TTT GAA GTG GGC TAT AAG CAA TTT TTT GGT AAG AAA AAA TGG 24 6 

Ala Gly Phe Glu Val Gly Tyr Lys Gin Phe Phe Gly Lys Lys Lys Trp 
60 65 70 

TTT GGC ATG CGT TAT TAT GGG TTT TTT GAC TAC GCG CAC AAC CGC TTT 2 94 

Phe Gly Met Arg Tyr Tyr Gly Phe Phe Asp Tyr Ala His Asn Arg Phe 
75 80 85 

GGC GTG ATG AAA AAG GGC ATT CCG GTG GGC GAT AGT GGG TTT ATT TAC 3 42 

Gly Val Met Lys Lys Gly lie Pro Val Gly Asp Ser Gly Phe lie Tyr 
90 95 100 105 

AAT AGT TTT AGT TTT GGA GGG AAC ACT TTA ACG GAA AGG GAT TCC TAT 3 90 

Asn Ser Phe Ser Phe Gly Gly Asn Thr Leu Thr Glu Arg Asp Ser Tyr 
110 115 120 

CAG GGG CAA TAC TAT GTC AAT TTA TTC ACT TAT GGC GTG GGG TTA GAT 438 
Gin Gly Gin Tyr Tyr Val Asn Leu Phe Thr Tyr Gly Val Gly Leu Asp 
125 130 135 

ACG CTG TGG AAT TTT GTG AAT AAA GAA AAC ATG GTT TTT GGT TTT GTG 4 86 

Thr Leu Trp Asn Phe Val Asn Lys Glu Asn Met Val Phe Gly Phe Val 
140 145 150 

GTG GGG ATC CAA TTA GCG GGG GAT AGT TGG GCA ACG AGC ATC AGT AAA 53 4 

Val Gly lie Gin Leu Ala Gly Asp Ser Trp Ala Thr Ser lie Ser Lys 
155 160 165 

GAA ATC GCT CAT TAT GCA AAA CAC CAC AGC AAT TCC AGT TAT AGC CCG 5 82 

Glu lie Ala His Tyr Ala Lys His His Ser Asn Ser Ser Tyr Ser Pro 
170 175 180 185 

GCC AAT TTC CAG TTT TTA TGG AAG TTT GGG GTC CGC ACC CAT ATC GCT 63 0 

Ala Asn Phe Gin Phe Leu Trp Lys Phe Gly Val Arg Thr His lie Ala 
190 195 200 
TTG ACG CTA ACC CCT ATA AAA CGA TTT TTG GAG AAA GGA AAT AAT CAT 106
Leu Thr Leu Thr Pro Ile Lys Arg Phe Leu Glu Lys Gly Asn Asn His
5 10 15

GAG AGC TAC GGC GAT AAA AAT CTT TTC ACT CTC ATC AGC ATT AGC CCT 154
Glu Ser Tyr Gly Asp Lys Asn Leu Phe Thr Leu Ile Ser Ile Ser Pro
20 25 30 

ATT GCT TCA TGG TTG CTT GAG CAT CAA TTT AAA ACA AAT GCT ACC AGA 2 02 

lie Ala Ser Trp Leu Leu Glu His Gin Phe Lys Thr Asn Ala Thr Arg 
35 40 45 50 

GAT CAG AAC TTA CGA TTT GAA TGC GAG TTC TTT TGAAATCACG CAATGCGCTA 25 5 

Asp Gin Asn Leu Arg Phe Glu Cys Glu Phe Phe 
55 60 

AACCTTTGAC TGAAGTGAGG CTCATTAGTA TTTTG 2 90 

(2) INFORMATION FOR SEQ ID NO: 576:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 

Ile Asn Leu Thr Leu Thr Pro Ile Lys Arg Phe Leu Glu Lys Gly Asn
1 5 10 15

Asn His Glu Ser Tyr Gly Asp Lys Asn Leu Phe Thr Leu Ile Ser Ile
20 25 30

Ser Pro Ile Ala Ser Trp Leu Leu Glu His Gln Phe Lys Thr Asn Ala
35 40 45

Thr Arg Asp Gln Asn Leu Arg Phe Glu Cys Glu Phe Phe
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 577: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY : Coding Sequence 

(B) LOCATION: 28... 771 
(D) OTHER INFORMATION: 
(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:574: 

Met Phe Asp Lys Lys Leu Ser Ser Asn Asp Trp His Ile Gln Lys Val
1 5 10 15

Glu Met Asn His Gln Val Tyr Asp Ile Glu Thr Met Leu Ala Asp Ser
20 25 30

Ala Phe Arg Glu His Glu Glu Glu Gln Asp Ser Ser Leu Asn Thr Ala
35 40 45

Leu Pro Glu Asp Lys Thr Ala Ile Glu Ala Lys Glu Gln Glu Gln Lys
50 55 60

Glu Lys Arg Lys Arg Trp Tyr Glu Leu Phe Lys Lys Lys Pro Lys Pro
65 70 75 80

Lys Ser Ser Met Gly Glu Phe Val Phe Asp Gln Lys Glu Asn Arg Ile
85 90 95

Tyr Gly Lys Gly Tyr Cys Asn Arg Tyr Phe Ala Ser Tyr Val Trp Gln
100 105 110

Gly Asp Arg His Ile Gly Ile Glu Asp Ser Gly Ile Ser Arg Lys Val
115 120 125

Cys Lys Asp Glu His Leu Met Ala Phe Glu Leu Glu Phe Met Glu Asn
130 135 140

Phe Lys Gly Asn Phe Thr Val Thr Lys Gly Lys Asp Thr Leu Ile Leu
145 150 155 160

Asp Asn Gln Lys Met Lys Ile Tyr Leu Lys Thr Pro
165 170 

(2) INFORMATION FOR SEQ ID NO: 575:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 53... 235 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575:

TTGAGCTTAA GAAACATTGA TAATTTTGTG GAAAAAGGCT CTGCTTTGAT AG ATA AAT 58
Ile Asn
1 
AAAATAAGGA GGAATTGTTT GATTTTACGA TTGGCTGGAG CAAGCGTTTT AACGGCTTGT 60
GTCTTTTCGG GGTGTTTTTT TTTAAAA ATG TTT GAT AAA AAA CTT TCT AGT AAC 114 

Met Phe Asp Lys Lys Leu Ser Ser Asn 
1 5 

GAT TGG CAT ATC CAA AAA GTG GAA ATG AAC CAT CAA GTC TAT GAC ATT 162 
Asp Trp His lie Gin Lys Val Glu Met Asn His Gin Val Tyr Asp lie 
10 15 20 25 

GAA ACC ATG CTC GCT GAT AGC GCT TTT AGA GAG CAT GAA GAA GAG CAA 210 
Glu Thr Met Leu Ala Asp Ser Ala Phe Arg Glu His Glu Glu Glu Gin 
30 35 40 

GAT TCC TCT CTA AAT ACC GCT TTG CCT GAA GAT AAA ACA GCG ATT GAA 25 8 

Asp Ser Ser Leu Asn Thr Ala Leu Pro Glu Asp Lys Thr Ala lie Glu 
45 50 55 

GCC AAA GAG CAA GAG CAA AAA GAA AAA AGA AAA CGC TGG TAT GAG CTT 3 06 

Ala Lys Glu Gin Glu Gin Lys Glu Lys Arg Lys Arg Trp Tyr Glu Leu 
60 65 70 

TTT AAA AAG AAA CCA AAG CCC AAA AGC TCT ATG GGA GAG TTT GTG TTT 3 54 

Phe Lys Lys Lys Pro Lys Pro Lys Ser Ser Met Gly Glu Phe Val Phe 
75 80 85 

GAT CAA AAA GAA AAT CGT ATT TAT GGC AAA GGC TAT TGC AAC CGG TAT 4 02 

Asp Gin Lys Glu Asn Arg lie Tyr Gly Lys Gly Tyr Cys Asn Arg Tyr 
90 95 100 105 

TTT GCC AGC TAT GTA TGG CAG GGC GAT AGG CAC ATT GGG ATT GAA GAT 450 
Phe Ala Ser Tyr Val Trp Gin Gly Asp Arg His lie Gly lie Glu Asp 
110 115 120 

AGC GGG ATT TCA AGA AAA GTG TGT AAA GAT GAG CAT TTA ATG GCG TTT 4 98 

Ser Gly lie Ser Arg Lys Val Cys Lys Asp Glu His Leu Met Ala Phe 
125 130 135 

GAA TTG GAA TTT ATG GAG AAT TTT AAG GGT AAT TTT ACG GTA ACT AAG 54 6 

Glu Leu Glu Phe Met Glu Asn Phe Lys Gly Asn Phe Thr Val Thr Lys 
140 145 150 

GGC AAG GAC ACG CTC ATT TTA GAC AAC CAA AAA ATG AAA ATT TAT TTG 594 
Gly Lys Asp Thr Leu lie Leu Asp Asn Gin Lys Met Lys lie Tyr Leu 
155 160 165 

AAA ACG CCT TGAGTGGGTT TTTGATTTCA AAACAATCTA AGATCACTAA ATTAGGGAT 652 

Lys Thr Pro 

170 

TAAAAAGAAA TTTTTAA 669 

(2) INFORMATION FOR SEQ ID NO: 574: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 172 amino acids 
Met Ala Phe Trp His Lys Arg Leu Ala Val Gly Cys Cys Ile Val Leu
1 5 10 15

Phe Ser Cys Met Met Asn Ala Asn Ser Ile Gln Ile Val Arg Asp Asp
20 25 30

Pro Pro Leu Asp Pro Thr Leu Pro Ala Trp Val Tyr Ser Val Ala Leu
35 40 45

Leu Lys Val Tyr Phe Ser Asp Gly Thr Tyr Lys Glu Gly Tyr Ala Thr
50 55 60

Leu Leu Lys Asn Gly Arg Tyr Ile Ala Ser Ser Glu Thr Leu Tyr Ser
65 70 75 80

Asn Gly Leu Tyr Pro Lys Thr Ile Leu Ala Lys Met Gln Asp Ser Ser
85 90 95

Ala Lys Glu Leu Ile Cys Ile Ala Ser Leu Arg Leu Glu Ala Met Asp
100 105 110

Arg Asn Gln Gly Leu Ser Leu Leu Lys Thr Ala Asp Phe Arg Asp Asp
115 120 125

Tyr Cys His Lys Arg Glu Glu Ser Tyr Tyr His Ala Arg Ile Tyr Thr
130 135 140

Lys Tyr Ala Gln Thr Phe His Ser Asn Pro Tyr Thr Asn Gln Lys Thr
145 150 155 160

Pro Asn Ser Asp Leu Tyr Tyr Pro Ala Leu Asn Glu Gly Asn Ser Phe
165 170 175

Ser Ile Gln Ile Met Gly Ile Ser Val Ala Glu Leu Leu Lys Ser Lys
180 185 190

Lys Phe Leu Ser Leu Asp Val Ser Phe Lys Lys Gly Ser Val Leu Trp
195 200 205

Gly Gly Arg Pro Tyr Phe Ser Glu Val Gly Glu Phe Met Gly Met Ala
210 215 220

Ser Ser Thr Leu Glu Asn Gln Glu Ser Leu Val Ile Ile Pro Lys Glu
225 230 235 240

Lys Ile Val Gln Phe Leu Asn Ala Leu Lys Asn Gln Asn Ile Phe Pro
245 250 255

Asn Ile Pro



(2) INFORMATION FOR SEQ ID NO: 573:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 88... 603
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:573: 
CTG ATT TGT ATA GCT AGC CTA CGC CTT GAA GCG ATG GAT AGG AAT CAA 392
Leu Ile Cys Ile Ala Ser Leu Arg Leu Glu Ala Met Asp Arg Asn Gln
100 105 110 115 

GGG CTT TCG CTT TTA AAA ACQ GCC GAT TTT AGA GAC GAT TAT TGC CAT 44 0 

Gly Leu Ser Leu Leu Lys Thr Ala Asp Phe Arg Asp Asp Tyr Cys His 
120 125 130 

AAA AGA GAA GAG AGC TAT TAT CAT GCA AGG ATT TAC ACA AAA TAC GCT 488
Lys Arg Glu Glu Ser Tyr Tyr His Ala Arg Ile Tyr Thr Lys Tyr Ala
135 140 145 

CAA ACT TTT CAT TCA AAT CCC TAT ACC AAT CAA AAA ACA CCC AAT TCT 53 6 

Gin Thr Phe His Ser Asn Pro Tyr Thr Asn Gin Lys Thr Pro Asn Ser 
150 155 160 

GAT CTC TAC TAC CCA GCG TTA AAT GAG GGG AAT TCT TTT TCT ATA CAG 5 84 

Asp Leu Tyr Tyr Pro Ala Leu Asn Glu Gly Asn Ser Phe Ser lie Gin 
165 170 175 

ATA ATG GGC ATT TCT GTG GCT GAA CTT TTG AAA TCT AAA AAA TTC CTT 63 2 

lie Met Gly lie Ser Val Ala Glu Leu Leu Lys Ser Lys Lys Phe Leu 
180 185 190 195 

TCG CTT GAT GTT TCT TTT AAA AAG GGG AGC GTG TTG TGG GGA GGG AGG 68 0 

Ser Leu Asp Val Ser Phe Lys Lys Gly Ser Val Leu Trp Gly Gly Arg 
200 205 210 

CCT TAT TTT AGC GAA GTG GGG GAG TTT ATG GGG ATG GCT AGC AGC ACT 72 8 

Pro Tyr Phe Ser Glu Val Gly Glu Phe Met Gly Met Ala Ser Ser Thr 
215 220 225 

TTA GAA AAC CAA GAA AGT CTG GTG ATT ATC CCT AAA GAA AAG ATC GTG 776 
Leu Glu Asn Gin Glu Ser Leu Val lie lie Pro Lys Glu Lys lie Val 
230 235 240 

CAA TTT TTA AAC GCT CTA AAA AAT CAA AAT ATT TTC CCA AAC ATT CCC T 825 
Gin Phe Leu Asn Ala Leu Lys Asn Gin Asn lie Phe Pro Asn lie Pro 
245 250 255 

AAGTTTAAGC AAACCTTAAG CTTTCTATGA CTATACTTTC ATTTCCTTGT TTCAGCA 8 82 

(2) INFORMATION FOR SEQ ID NO: 572: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 
Glu Lys Met Arg Leu Ala Gly Lys Asp Ile Glu Lys Leu Val Leu Ala

260 265 270 

Arg Ala Leu Lys Leu Val Leu Glu Asp Arg Val Phe Val His Glu Asn 

275 280 285 

Lys Thr Val Val Phe 
290 

(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 48... 824 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 

CGATAGTGTT GTAGGATACT TTTGAAATTT AAGCGGTAAG TTGGATA ATG GCG TTT 56 

Met Ala Phe 
1 

TGG CAT AAA AGA TTA GCG GTT GGT TGT TGT ATC GTT TTA TTT TCA TGC 104 
Trp His Lys Arg Leu Ala Val Gly Cys Cys lie Val Leu Phe Ser Cys 
5 10 15 

ATG ATG AAC GCT AAT AGC ATT CAA ATC GTT AGA GAC GAT CCG CCC CTT 152 
Met Met Asn Ala Asn Ser lie Gin lie Val Arg Asp Asp Pro Pro Leu 
20 25 30 35 

GAT CCA ACG CTC CCT GCA TGG GTT TAT TCT GTT GCG TTA TTA AAA GTG 2 00 

Asp Pro Thr Leu Pro Ala Trp Val Tyr Ser Val Ala Leu Leu Lys Val 
40 45 50 

TAT TTT AGC GAT GGG ACT TAT AAA GAA GGC TAT GCG ACT TTG CTC AAA 24 8 

Tyr Phe Ser Asp Gly Thr Tyr Lys Glu Gly Tyr Ala Thr Leu Leu Lys 
55 60 65 

AAC GGG CGT TAT ATC GCT TCT TCT GAA ACG CTT TAT TCT AAC GGC TTA 2 96 

Asn Gly Arg Tyr lie Ala Ser Ser Glu Thr Leu Tyr Ser Asn Gly Leu 
70 75 80 

TAC CCT AAA ACG ATT TTA GCC AAA ATG CAA GAC AGC AGC GCT AAA GAG 344 
Tyr Pro Lys Thr lie Leu Ala Lys Met Gin Asp Ser Ser Ala Lys Glu 
85 90 95 
TTA GCT AGG GCT TTA AAA CTC GTT TTA GAA GAC AGA GTC TTT GTA CAT 927
Leu Ala Arg Ala Leu Lys Leu Val Leu Glu Asp Arg Val Phe Val His 
275 280 285 

GAA AAT AAA ACG GTG GTG TTT TGTU^TGCTTT TAGATTTCAG CAACCTCAAT GAAG 9 82 
Glu Asn Lys Thr Val Val Phe 
290 

AACCCTTAAA AAAC 996 
(2) INFORMATION FOR SEQ ID NO: 570:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

Met Leu Glu Phe Ile Leu Lys Ile Gln Ala Arg Asp Ser Lys Gly Leu
1 5 10 15

Val Ser Thr Ile Ser Thr Thr Ile Ala Asn Lys Gly Tyr Asn Ile Val
20 25 30

Lys Asn Asp Glu Phe Val Asp Pro Leu Lys Gln Arg Phe Phe Met Arg
35 40 45

Leu Lys Ile Gln Lys Glu Ile Lys Pro Leu Asn Thr Glu Ile Lys Glu
50 55 60

Gln Glu Glu Gln Ser Leu Lys Thr Ala Leu Phe Lys Ala Leu Glu Asn
65 70 75 80

Phe Asn Glu Leu Leu Ile Glu Val Ile Leu Thr His Lys Lys Asn Ile
85 90 95

Ile Leu Leu Ala Thr Lys Glu Ser His Cys Leu Gly Asp Leu Leu Leu
100 105 110

Arg Val Tyr Gly Gly Glu Leu Asn Ala Gln Ile Leu Gly Val Ile Ser
115 120 125

Asn His Glu Ile Leu Arg Pro Leu Val Glu Lys Phe Asp Ile Pro Tyr
130 135 140

Phe Tyr Ala Pro Cys Asp Asn Gln Val Leu His Glu Lys Glu Val Leu
145 150 155 160

Glu Ile Ile Lys Asn Leu Glu Leu Lys His Lys Val Ser Ala Asp Leu
165 170 175

Leu Val Leu Ala Lys Tyr Met Arg Ile Leu Ser His Asp Phe Thr Lys
180 185 190

Arg Tyr Glu Asn Gln Ile Leu Asn Ile His His Ser Phe Leu Pro Ala
195 200 205

Phe Ile Gly Ala Asn Pro Tyr Gln Gln Ala Phe Glu Arg Gly Val Lys
210 215 220

Val Ile Gly Ala Thr Ala His Phe Val Asn Glu Ser Leu Asp Ala Gly
225 230 235 240

Pro Ile Ile Ile Gln Asp Thr Leu Pro Ile Asn His Asn Tyr Ser Val
245 250 255 
ATG CGG TTA AAA ATC CAA AAA GAA ATC AAG CCC TTG AAT ACT GAA ATT 255
Met Arg Leu Lys Ile Gln Lys Glu Ile Lys Pro Leu Asn Thr Glu Ile
50 55 60 

AAA GAG CAA GAA GAG CAA TCC TTA AAG ACC GCT CTT TTT AAA GCC CTA 3 03 

Lys Glu Gin Glu Glu Gin Ser Leu Lys Thr Ala Leu Phe Lys Ala Leu 
65 70 75 

GAA AAC TTT AAC GAG TTA TTG ATT GAA GTC ATT TTA ACG CAT AAA AAA 351 
Glu Asn Phe Asn Glu Leu Leu lie Glu Val lie Leu Thr His Lys Lys 
80 85 90 

AAC ATC ATT CTG CTC GCT ACT AAA GAG AGC CAT TGC TTA GGG GAT TTG 3 99 

Asn lie lie Leu Leu Ala Thr Lys Glu Ser His Cys Leu Gly Asp Leu 
95 100 105 110 

CTT TTA AGG GTG TAT GGG GGG GAA TTG AAC GCT CAA ATT TTA GGC GTT 44 7 

Leu Leu Arg Val Tyr Gly Gly Glu Leu Asn Ala Gin lie Leu Gly Val 
115 120 125 

ATT TCC AAC CAC GAG ATT TTA CGC CCT TTA GTG GAA AAA TTT GAC ATC 4 95 

lie Ser Asn His Glu lie Leu Arg Pro Leu Val Glu Lys Phe Asp lie 
130 135 140 

CCT TAT TTT TAT GCG CCT TGC GAC AAT CAA GTT TTG CAT GAA AAA GAA 54 3 

Pro Tyr Phe Tyr Ala Pro Cys Asp Asn Gin Val Leu His Glu Lys Glu 
145 150 155 

GTT TTA GAA ATC ATT AAA AAC CTG GAA TTA AAG CAC AAA GTG AGT GCA 5 91 

Val Leu Glu lie lie Lys Asn Leu Glu Leu Lys His Lys Val Ser Ala 
160 165 170 

GAC TTG CTC GTT TTA GCC AAA TAC ATG CGC ATT TTA AGC CAT GAT TTT 63 9 

Asp Leu Leu Val Leu Ala Lys Tyr Met Arg lie Leu Ser His Asp Phe 
175 180 185 190 

ACG AAG CGC TAT GAA AAC CAG ATC TTA AAT ATC CAT CAT AGT TTC TTG 6 87 

Thr Lys Arg Tyr Glu Asn Gin lie Leu Asn lie His His Ser Phe Leu 
195 200 205 

CCC GCA TTC ATT GGG GCT AAT CCT TAC CAG CAA GCG TTT GAA AGG GGC 73 5 

Pro Ala Phe lie Gly Ala Asn Pro Tyr Gin Gin Ala Phe Glu Arg Gly 
210 215 220 

GTG AAA GTC ATC GGG GCC ACG GCG CAT TTT GTG AAT GAA AGC CTT GAT 7 83 

Val Lys Val lie Gly Ala Thr Ala His Phe Val Asn Glu Ser Leu Asp 
225 230 235 

GCT GGG CCG ATT ATC ATA CAA GAC ACT CTG CCC ATT AAC CAC AAT TAC 831 
Ala Gly Pro lie lie lie Gin Asp Thr Leu Pro lie Asn His Asn Tyr 
240 245 250 

AGC GTG GAA AAA ATG CGC CTA GCG GGT AAG GAT ATA GAA AAA CTG GTT 87 9 

Ser Val Glu Lys Met Arg Leu Ala Gly Lys Asp lie Glu Lys Leu Val 
255 260 265 270 
725 730 735

Phe His Thr Gln Asn Arg Ile Thr Thr Ala Gln Gly Thr Asn His Phe
740 745 750

Ile Pro Phe Ser Glu Thr Glu Val Asn Ala Lys Glu Arg Tyr Ser Ser
755 760 765

His Ala Leu Leu Glu Phe Leu Lys Gly Glu Ile Lys Glu Leu Lys Glu
770 775 780

Asn Asp Ser Leu Phe Leu Ser Ala Lys Lys Glu Asn Lys Pro Leu Lys
785 790 795 800

Phe Ser Pro Ser Ala Ser Lys Val Phe Asp Ala Ser Arg Glu Val Tyr
805 810 815

Arg Tyr Tyr His Thr Gln Asp Phe Thr Asn Arg Pro Tyr Asn Ala Asn
820 825 830

Ala Ser Leu Tyr Asp Ile Lys Glu Phe Phe Gln Gly Arg Asn Lys Gln
835 840 845

Gly Lys Leu Asn Leu Pro Ala Lys Ala Lys Asp Glu Tyr Tyr Lys Gln
850 855 860

Leu Tyr Ala Asn Leu Gln Asp Ala Leu Lys Asp Leu Ala Lys Glu Ile
865 870 875 880

Gln Pro Lys Val Tyr Glu Tyr Gly Phe Leu Arg Glu
885 890



(2) INFORMATION FOR SEQ ID NO: 569:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70... 948 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569:



AAAAGCCACG CAATCAGCCA CAAACATCAT CACACAAGCC TTTGGCTATC AATTATTAAT 60 
GAGATAAAG ATG TTA GAA TTT ATT TTA AAA ATT CAA GCT AGA GAC TCT AAA 111 
Met Leu Glu Phe Ile Leu Lys Ile Gln Ala Arg Asp Ser Lys
1 5 10



GGC TTG GTG AGC ACG ATT AGC ACC ACT ATC GCT AAC AAG GGC TAT AAC 159
Gly Leu Val Ser Thr Ile Ser Thr Thr Ile Ala Asn Lys Gly Tyr Asn
15 20 25 30

ATC GTC AAA AAC GAT GAA TTT GTT GAT CCC TTA AAA CAG CGT TTT TTC 207
Ile Val Lys Asn Asp Glu Phe Val Asp Pro Leu Lys Gln Arg Phe Phe
35 40 45 
Val Pro Ser Asp Val Arg Glu Arg Lys Gly Ala Phe Phe Thr Pro Lys
290 295 300

Ile Trp Val Glu Lys Ser Gln Glu Tyr Leu Ala Lys Ala Leu Gly Gln
305 310 315 320

Asp Tyr Gln Glu Asp Cys Ile Ile Trp Asp Cys Ala Gly Gly Thr Gly
325 330 335

Asn Leu Leu Arg Gly Leu Leu Asn Lys Ala Asn Leu Tyr Leu Ser Thr
340 345 350

Leu Asp His Asn Asp Val Ala Ile Val Lys Asp Leu Ala Ala Lys Asn
355 360 365

His Leu Lys Leu Leu Glu Asn His Val Phe Gln Phe Asp Phe Leu Asn
370 375 380

Asp Asp Phe Phe Ser Asp Lys Thr Pro Lys Ser Leu Gln Glu Ile Leu
385 390 395 400

Lys Asp Lys Glu Lys Arg Lys Lys Leu Ile Ile Tyr Ile Asn Pro Pro
405 410 415

Tyr Ala Glu Ala Gly Asn Lys Ser Lys Met Ser Gly Thr Gly Glu His
420 425 430

Lys Ala Lys Val Ala Arg Asp Asn Leu Ile Cys Glu Lys Tyr Lys Asn
435 440 445

Glu Leu Gly Lys Ala Asn Asn Glu Val Phe Ala Gln Phe Phe Met Arg
450 455 460

Ile Tyr Lys Glu Leu Asn Gly Val Ile Leu Ala Ser Phe Ser Thr Leu
465 470 475 480

Lys Asn Leu Gln Gly Ser Asn Phe Lys Lys Phe Arg Glu Ile Phe Lys
485 490 495

Ala Lys Phe Leu Glu Gly Phe Met Val Pro Ala Asp Thr Phe Asp Asn
500 505 510

Val Arg Gly Gln Phe Pro Ile Gly Phe Leu Val Trp Asp Thr Ser Ser
515 520 525

Ile Leu Pro Lys Glu Asn Pro Leu Asn Leu Gly Gly Asn Ser Lys Glu
530 535 540

Glu Lys Gln Asn Ser Asn Leu Ile Leu Asp Gln Asp Asn Leu Lys Asp
545 550 555 560

Asn Pro Leu Lys Glu Arg Phe Cys Leu Leu Asp Ile Asn Ala Pro Asn
565 570 575

Arg Lys Met Cys Ser Gln Ser Arg Thr Arg Thr Lys Gly Thr Gln Lys
580 585 590

His Ser Thr Ala Ala Pro Phe Glu Thr Pro Leu His Thr Val Ser Leu
595 600 605

Glu Ile Phe Asp Ser Phe Gly Gly Phe Leu Gly Ser Lys Lys Ile Tyr
610 615 620

Thr His Thr Ile Asp Lys Met Leu Thr Leu Ala Asp Tyr Leu Gln Lys
625 630 635 640

Phe Gln Pro Thr Lys Arg Asp Thr Ile Phe Gly Tyr Leu Asp Pro Gly
645 650 655

Arg Asn Ser Phe Gln His Gln Asn Leu Ile His Ile Ser Ile Ile Asp
660 665 670

Lys Ser Lys Gln Ser His Val Lys Tyr Phe Pro Ile Ile Ala Thr Thr
675 680 685

Ile Leu Leu Val Ser Val Phe Phe Ser Ile Arg His Cys Ile Lys Ala
690 695 700

Thr Trp Gln Asn Asp Arg Asp Gln Phe Tyr Ala Pro Tyr Asp Asp Ala
705 710 715 720

Phe Gln Asp Asp Ser Glu Phe Lys Asn Asn Cys Leu Ile Phe Met Leu
TAT GAA TAC GGG TTT TTA AGG GAG TGATTTTTAA GACAAATAAT CAAAAAGCTA 2751
Tyr Glu Tyr Gly Phe Leu Arg Glu 
885 890 



GAGCAAGCGG TATTTTTTA 2770

(2) INFORMATION FOR SEQ ID NO: 568:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 892 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:568: 



Met Leu Leu Asp Phe Ser Asn Leu Asn Glu Glu Pro Leu Lys Asn Gln
1 5 10 15

Ile Lys Ala Glu Phe Phe Lys Asp Lys Lys Phe Leu Tyr Ser Gly Asp
20 25 30

Lys Ile Asp Phe Met Leu Ser Tyr Lys His Ser Asn Ala Thr Leu Pro
35 40 45

Ile Leu Trp Gly Glu Ala Lys Arg Gly Asp Phe Asp Asp Leu Asp Lys
50 55 60

Ala Phe Thr Gln Leu Leu Leu Thr Ile Gly Lys His Arg Leu Tyr Thr
65 70 75 80

His His Thr Pro Pro Tyr Leu Cys Ala Phe Asn Ala Phe Lys Met Glu
85 90 95

Phe Ile Ala Phe Asp Asp Thr Ile Thr Ser Phe Phe Tyr Lys Ser Asp
100 105 110

Ile Asp Phe Ser Ile Thr Pro Ser Asn His Asn Thr Glu Gly Phe Lys
115 120 125

His Ala Leu Asp Ala Phe Lys Ala Met Ser Lys Ser His Lys Phe Val
130 135 140

Phe Asp Phe Lys Thr Gln Ser Gln Glu Cys Lys Glu Phe Ile Lys Asn
145 150 155 160

Arg Leu Asn Ser Ser His Leu Leu Ser Lys Ile Gln Ile Asp Lys Asn
165 170 175

Asn Phe Phe Thr Ile Tyr Gln Lys Trp Leu Glu Ile Val Lys Pro Thr
180 185 190

Ile Asp Ile Asn Trp Glu Val Ala Lys Thr Lys Asp Ile Leu Asp Ala
195 200 205

Asp Tyr Tyr Leu Ala Asp Leu Leu Ser Asp Gly Asp Lys Thr Ile Ile
210 215 220

Glu Lys Leu His Thr Ile Leu Arg Ser Ser His Tyr Lys Leu Asn Arg
225 230 235 240

Gly Val Asn Glu Leu Gly Lys Met Asp Phe Met Glu Val Gly Phe Thr
245 250 255

Asp Ser Gln Gln Ala His Gln Glu Phe Trp Ser Val Tyr Glu Arg Pro
260 265 270

Pro Lys Arg Glu Phe Gln Ala Ser Ile Leu Glu Arg Arg Asp Leu Leu
275 280 285
285 
CAA CAT CAA AAT CTA ATT CAT ATT AGC ATT ATT GAC AAA TCA AAA CAA 2073
Gln His Gln Asn Leu Ile His Ile Ser Ile Ile Asp Lys Ser Lys Gln
665 670 675 

TCG CAT GTA AAA TAT TTT CCA ATC ATT GCA ACT ACA ATT TTG TTG GTA 2121 
Ser His Val Lys Tyr Phe Pro lie lie Ala Thr Thr lie Leu Leu Val 
680 685 690 

TCT GTA TTT TTC TCC ATC CGC CAT TGC ATC AAA GCC ACA TGG CAA AAC 216 9 

Ser Val Phe Phe Ser lie Arg His Cys lie Lys Ala Thr Trp Gin Asn 
695 700 705 

GAT AGG GAT CAA TTT TAC GCC CCC TAT GAC GAT GCG TTC CAA GAC GAC 2217 
Asp Arg Asp Gin Phe Tyr Ala Pro Tyr Asp Asp Ala Phe Gin Asp Asp 
710 715 720 

AGC GAG TTT AAA AAC AAT TGT TTG ATT TTC ATG CTT TTT CAC ACC CAG 226 5 

Ser Glu Phe Lys Asn Asn Cys Leu lie Phe Met Leu Phe His Thr Gin 
725 730 735 740 

AAC CGC ATC ACT ACC GCT CAA GGG ACT AAC CAT TTT ATC CCC TTT AGC 2313 
Asn Arg lie Thr Thr Ala Gin Gly Thr Asn His Phe lie Pro Phe Ser 
745 750 755 

GAA ACT GAA GTC AAT GCC AAA GAA AGA TAT TCT AGC CAC GCT CTA TTA 23 61 

Glu Thr Glu Val Asn Ala Lys Glu Arg Tyr Ser Ser His Ala Leu Leu 
760 765 770 

GAG TTT TTA AAA GGC GAA ATC AAA GAA CTT AAA GAG AAC GAT AGC CTC 24 0 9 

Glu Phe Leu Lys Gly Glu lie Lys Glu Leu Lys Glu Asn Asp Ser Leu 
775 780 785 

TTT TTA AGT GCC AAA AAA GAA AAC AAG CCC CTG AAA TTC AGC CCG AGC 245 7 

Phe Leu Ser Ala Lys Lys Glu Asn Lys Pro Leu Lys Phe Ser Pro Ser 
790 795 800 

GCT TCA AAG GTG TTT GAC GCT AGC AGA GAG GTT TAT CGC TAT TAC CAC 25 05 

Ala Ser Lys Val Phe Asp Ala Ser Arg Glu Val Tyr Arg Tyr Tyr His 
805 810 815 820 

ACA CAA GAT TTC ACA AAC CGC CCC TAT AAC GCT AAC GCA AGC CTT TAT 2553 
Thr Gin Asp Phe Thr Asn Arg Pro Tyr Asn Ala Asn Ala Ser Leu Tyr 
825 830 835 

GAC ATC AAA GAA TTT TTT CAA GGC CGT AAC AAG CAA GGC AAA TTA AAT 2 601 

Asp lie Lys Glu Phe Phe Gin Gly Arg Asn Lys Gin Gly Lys Leu Asn 
840 845 850 

TTA CCC GCT AAA GCT AAA GAT GAA TAT TAC AAA CAG CTT TAC GCT AAT 2649
Leu Pro Ala Lys Ala Lys Asp Glu Tyr Tyr Lys Gln Leu Tyr Ala Asn
855 860 865 

TTG CAA GAC GCC CTA AAA GAT CTC GCC AAA GAA ATA CAG CCT AAA GTC 2 697 

Leu Gin Asp Ala Leu Lys Asp Leu Ala Lys Glu lie Gin Pro Lys Val 
870 875 880 
GCA CGA GAC AAT CTC ATC TGT GAA AAA TAC AAA AAT GAA TTA GGC AAG 1401
Ala Arg Asp Asn Leu Ile Cys Glu Lys Tyr Lys Asn Glu Leu Gly Lys
440 445 450

GCT AAT AAT GAA GTT TTT GCA CAA TTT TTC ATG CGT ATT TAC AAA GAA 1449
Ala Asn Asn Glu Val Phe Ala Gln Phe Phe Met Arg Ile Tyr Lys Glu
455 460 465 

TTA AAC GGT GTT ATC CTT GCA AGT TTT TCA ACT TTG AAA AAC TTG CAA 14 97 

Leu Asn Gly Val lie Leu Ala Ser Phe Ser Thr Leu Lys Asn Leu Gin 
470 475 480 

GGA TCT AAT TTT AAA AAA TTC AGA GAA ATC TTT AAA GCT AAA TTT TTA 154 5 

Gly Ser Asn Phe Lys Lys Phe Arg Glu lie Phe Lys Ala Lys Phe Leu 
485 490 495 500 

GAG GGG TTT ATG GTG CCA GCA GAC ACT TTT GAT AAT GTT AGG GGG CAA 1593 
Glu Gly Phe Met Val Pro Ala Asp Thr Phe Asp Asn Val Arg Gly Gin 
505 510 515 

TTT CCT ATC GGC TTT TTA GTG TGG GAT ACA AGC TCT ATT CTT CCT AAA 1641 
Phe Pro lie Gly Phe Leu Val Trp Asp Thr Ser Ser lie Leu Pro Lys 
520 525 530 

GAA AAC CCC CTA AAT TTA GGG GGC AAC TCT AAA GAA GAG AAA CAA AAC 16 89 

Glu Asn Pro Leu Asn Leu Gly Gly Asn Ser Lys Glu Glu Lys Gin Asn 
535 540 545 

TCC AAT TTA ATC TTA GAC CAA GAC AAT TTG AAA GAT AAT CCC TTG AAA 17 37 

Ser Asn Leu lie Leu Asp Gin Asp Asn Leu Lys Asp Asn Pro Leu Lys 
550 555 560 

GAG CGT TTT TGC CTT TTA GAC ATA AAC GCT CCT AAT AGG AAG ATG TGT 17 85 

Glu Arg Phe Cys Leu Leu Asp lie Asn Ala Pro Asn Arg Lys Met Cys 
565 570 575 580 

TCC CAA AGC AGA ACA AGA ACT AAG GGG ACA CAA AAG CAT TCT ACA GCA 1833 
Ser Gin Ser Arg Thr Arg Thr Lys Gly Thr Gin Lys His Ser Thr Ala 
585 590 595 

GCC CCC TTT GAA ACC CCT TTA CAC ACT GTT AGT TTA GAA ATA TTT GAT 18 81 

Ala Pro Phe Glu Thr Pro Leu His Thr Val Ser Leu Glu lie Phe Asp 
600 605 610 

AGT TTC GGC GGA TTT TTA GGC AGT AAA AAA ATA TAC ACT CAC ACA ATA 1929
Ser Phe Gly Gly Phe Leu Gly Ser Lys Lys Ile Tyr Thr His Thr Ile
615 620 625 

GAC AAA ATG CTT ACT TTA GCG GAT TAT TTA CAA AAG TTT CAG CCA ACA 1977 
Asp Lys Met Leu Thr Leu Ala Asp Tyr Leu Gin Lys Phe Gin Pro Thr 
630 635 640 

AAA AGA GAC ACT ATT TTT GGC TAT TTA GAT CCT GGT CGC AAT AGT TTT 2 02 5 

Lys Arg Asp Thr lie Phe Gly Tyr Leu Asp Pro Gly Arg Asn Ser Phe 
645 650 655 660 
GCG GAT TTG CTT AGC GAT GGC GAT AAA ACC ATT ATT GAG AAA TTG CAC 729
Ala Asp Leu Leu Ser Asp Gly Asp Lys Thr Ile Ile Glu Lys Leu His
215 220 225 



ACG ATT TTA AGA TCG AGC CAT TAT AAA TTG AAT AGG GGT GTG AAT GAA 7 77 

Thr lie Leu Arg Ser Ser His Tyr Lys Leu Asn Arg Gly Val Asn Glu 
230 235 240 



TTA GGC AAA ATG GAT TTT ATG GAA 
Leu Gly Lys Met Asp Phe Met Glu 
245 250 

GCC CAT CAA GAA TTT TGG AGC GTT 
Ala His Gin Glu Phe Trp Ser Val 
265 



GTT GGT TTC ACA GAC AGC CAA CAA 825 
Val Gly Phe Thr Asp Ser Gin Gin 
255 260 

TAT GAA CGA CCG CCT AAA AGA GAA 873 
Tyr Glu Arg Pro Pro Lys Arg Glu 
270 275 



TTT CAA GCC TCT ATT TTA GAG CGG CGC GAC TTG TTA GTA CCA AGC GAT 921 
Phe Gin Ala Ser lie Leu Glu Arg Arg Asp Leu Leu Val Pro Ser Asp 
280 285 290 



GTG AGA GAA AGG AAA GGG GCG TTT TTC ACC CCT AAA ATC TGG GTA GAA 969 
Val Arg Glu Arg Lys Gly Ala Phe Phe Thr Pro Lys lie Trp Val Glu 
295 300 305 



AAG AGT CAA GAA TAT TTA GCT AAA GCT TTG GGG CAA GAT TAT CAA GAG 1017 
Lys Ser Gin Glu Tyr Leu Ala Lys Ala Leu Gly Gin Asp Tyr Gin Glu 
310 315 320 



GAT TGT ATC ATT TGG GAT TGC GCT GGG GGG ACT GGG AAT TTG CTT CGA 1065 
Asp Cys lie lie Trp Asp Cys Ala Gly Gly Thr Gly Asn Leu Leu Arg 
325 330 335 340 



GGT TTA TTG AAT AAG GCT AAT TTG TAT CTA TCC ACT TTA GAT CAT AAC 1113 
Gly Leu Leu Asn Lys Ala Asn Leu Tyr Leu Ser Thr Leu Asp His Asn 
345 350 355 



GAT GTG GCA ATC GTT AAA GAT CTG GCT GCA AAA AAC CAC TTA AAA TTA 1161 
Asp Val Ala lie Val Lys Asp Leu Ala Ala Lys Asn His Leu Lys Leu 
360 365 370 



CTG GAA AAT CAT GTT TTC CAA TTT GAC TTT TTA AAC GAT GAT TTT TTC 12 09 

Leu Glu Asn His Val Phe Gin Phe Asp Phe Leu Asn Asp Asp Phe Phe 
375 380 385 



AGC GAT AAA ACG CCA AAA AGC TTG CAA GAA ATC TTA AAA GAC AAA GAG 1257 
Ser Asp Lys Thr Pro Lys Ser Leu Gin Glu lie Leu Lys Asp Lys Glu 
390 395 400 

AAA CGA AAA AAG CTC ATC ATT TAC ATC AAC CCG CCC TAT GCA GAA GCA 1305 
Lys Arg Lys Lys Leu lie lie Tyr lie Asn Pro Pro Tyr Ala Glu Ala 
405 410 415 420 



GGT AAT AAA TCT AAG ATG AGT GGC ACA GGC GAA CAT AAA GCC AAA GTG 13 5 3 

Gly Asn Lys Ser Lys Met Ser Gly Thr Gly Glu His Lys Ala Lys Val 
425 430 435 
Met Leu Leu Asp
1

TTC AGC AAC CTC AAT GAA GAA CCC TTA AAA AAC CAA ATC AAA GCC GAG 105
Phe Ser Asn Leu Asn Glu Glu Pro Leu Lys Asn Gln Ile Lys Ala Glu
5 10 15 20

TTT TTT AAG GAT AAG AAA TTC CTT TAT AGC GGG GAT AAA ATA GAT TTC 153
Phe Phe Lys Asp Lys Lys Phe Leu Tyr Ser Gly Asp Lys Ile Asp Phe
25 30 35

ATG CTA AGC TAT AAG CAT TCT AAC GCC ACC TTA CCC ATT TTA TGG GGC 201
Met Leu Ser Tyr Lys His Ser Asn Ala Thr Leu Pro Ile Leu Trp Gly
40 45 50

GAA GCT AAA AGG GGC GAT TTT GAT GAT TTG GAC AAA GCT TTC ACG CAA 249
Glu Ala Lys Arg Gly Asp Phe Asp Asp Leu Asp Lys Ala Phe Thr Gln
55 60 65

CTT CTT TTA ACC ATA GGC AAG CAC AGG CTT TAT ACC CAC CAC ACA CCA 297
Leu Leu Leu Thr Ile Gly Lys His Arg Leu Tyr Thr His His Thr Pro
70 75 80

CCT TAT TTG TGC GCT TTT AAC GCT TTT AAA ATG GAA TTT ATC GCC TTT 345
Pro Tyr Leu Cys Ala Phe Asn Ala Phe Lys Met Glu Phe Ile Ala Phe
85 90 95 100

GAT GAC ACG ATC ACA AGC TTT TTT TAT AAA AGC GAT ATA GAT TTT TCT 393
Asp Asp Thr Ile Thr Ser Phe Phe Tyr Lys Ser Asp Ile Asp Phe Ser
105 110 115

ATC ACC CCA AGC AAC CAC AAC ACA GAA GGT TTT AAA CAT GCT TTA GAC 441
Ile Thr Pro Ser Asn His Asn Thr Glu Gly Phe Lys His Ala Leu Asp
120 125 130

GCG TTT AAA GCC ATG AGC AAA TCC CAT AAA TTC GTT TTT GAC TTT AAA 489
Ala Phe Lys Ala Met Ser Lys Ser His Lys Phe Val Phe Asp Phe Lys
135 140 145

ACC CAA AGC CAA GAA TGC AAA GAA TTT ATC AAA AAC CGT TTA AAT TCT 537
Thr Gln Ser Gln Glu Cys Lys Glu Phe Ile Lys Asn Arg Leu Asn Ser
150 155 160

AGC CAT TTA CTC AGC AAA ATC CAA ATT GAC AAA AAC AAT TTC TTT ACG 585
Ser His Leu Leu Ser Lys Ile Gln Ile Asp Lys Asn Asn Phe Phe Thr
165 170 175 180

ATC TAT CAA AAG TGG CTT GAA ATT GTC AAA CCC ACC ATT GAC ATA AAT 633
Ile Tyr Gln Lys Trp Leu Glu Ile Val Lys Pro Thr Ile Asp Ile Asn
185 190 195

TGG GAG GTG GCT AAA ACT AAA GAC ATT TTA GAC GCA GAC TAT TAT TTA 681
Trp Glu Val Ala Lys Thr Lys Asp Ile Leu Asp Ala Asp Tyr Tyr Leu
200 205 210
435 440 445

Glu His Arg His Ile Ser Ile Lys Pro Asn Asp Leu Val Ile Ile Ser
450 455 460

Ala Lys Ala Ile Pro Gly Asn Glu Ala Ser Val Ser Ala Val Leu Asn
465 470 475 480

Phe Leu Ile Lys Lys Glu Ala Lys Val Ala Tyr Gln Glu Phe Asp Asn
485 490 495

Ile His Val Ser Gly His Ala Ala Gln Glu Glu Gln Lys Leu Met Leu
500 505 510

Arg Leu Ile Lys Pro Lys Phe Phe Leu Pro Val His Gly Glu Tyr Asn
515 520 525

His Val Ala Arg His Lys Gln Thr Ala Ile Ser Cys Gly Val Pro Glu
530 535 540

Lys Asn Ile Tyr Leu Met Glu Asp Gly Asp Gln Val Glu Val Gly Pro
545 550 555 560

Ala Phe Ile Lys Lys Val Gly Thr Ile Lys Ser Gly Lys Ser Tyr Val
565 570 575

Asp Asn Gln Ser Asn Leu Ser Ile Asp Thr Ser Ile Val Gln Gln Arg
580 585 590

Glu Glu Val Ala Ser Ala Gly Val Phe Val Ala Thr Ile Phe Val Asn
595 600 605

Lys Asn Lys Gln Ala Leu Leu Glu Ser Ser Gln Phe Ser Ser Leu Gly
610 615 620

Leu Val Gly Phe Lys Asp Glu Lys Pro Leu Ile Lys Glu Ile Gln Gly
625 630 635 640

Gly Leu Glu Val Leu Leu Lys Ser Ser Asn Ala Glu Ile Leu Asn Asn
645 650 655

Pro Lys Lys Leu Glu Asp His Thr Arg Asn Phe Ile Arg Lys Ala Leu
660 665 670

Phe Lys Lys Phe Arg Lys Tyr Pro Ala Ile Ile Cys His Ala His Ser
675 680 685

Phe



(2) INFORMATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2770 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 46... 2721 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 
GAAGACAGAG TCTTTGTACA TGAAAATAAA ACGGTGGTGT TTTGA ATG CTT TTA GAT 57 

-877- 



BNSDOCID: <WO 9843478A1_L> 



WO 98/43478



PCT/US98/06371 



Met Thr Asp Asn Asn Gln Asn Asn Glu Asn His Glu Asn Ser Ser Glu
1 5 10 15

Asn Ser Lys Ala Asp Glu Met Arg Ala Gly Ala Phe Glu Arg Phe Thr
20 25 30

Asn Arg Lys Lys Arg Phe Arg Glu Asn Ala Gln Lys Asn Ala Glu Tyr
35 40 45

Ser Asn His Glu Ala Ser Ser His His Lys Lys Glu His Arg Pro Asn
50 55 60

Lys Lys Pro Asn Asn His His Lys Gln Lys His Ala Lys Thr Arg Asn
65 70 75 80

Tyr Ala Gln Glu Glu Leu Asp Ser Asn Lys Val Glu Gly Val Thr Glu
85 90 95

Ile Leu His Val Asn Glu Arg Gly Thr Leu Gly Phe His Lys Glu Leu
100 105 110

Lys Lys Gly Val Glu Ala Asn Asn Lys Ile Gln Val Glu His Leu Asn
115 120 125

Pro His Tyr Lys Met Asn Leu Asn Ser Lys Ala Ser Val Lys Ile Thr
130 135 140

Pro Leu Gly Gly Leu Gly Glu Ile Gly Gly Asn Met Met Val Ile Glu
145 150 155 160

Thr Pro Lys Ser Ala Ile Val Ile Asp Ala Gly Met Ser Phe Pro Lys
165 170 175

Glu Gly Leu Phe Gly Val Asp Ile Leu Ile Pro Asp Phe Ser Tyr Leu
180 185 190

His Gln Ile Lys Asp Lys Ile Ala Gly Ile Ile Ile Thr His Ala His
195 200 205

Glu Asp His Ile Gly Ala Thr Pro Tyr Leu Phe Lys Glu Leu Gln Phe
210 215 220

Pro Leu Tyr Gly Thr Pro Leu Ser Leu Gly Leu Ile Gly Ser Lys Phe
225 230 235 240

Asp Glu His Gly Leu Lys Lys Tyr Arg Ser Tyr Phe Lys Ile Val Glu
245 250 255

Lys Arg Cys Pro Ile Ser Val Gly Glu Phe Ile Ile Glu Trp Ile His
260 265 270

Ile Thr His Ser Ile Ile Asp Ser Ser Ala Leu Ala Ile Gln Thr Lys
275 280 285

Ala Gly Thr Ile Ile His Thr Gly Asp Phe Lys Ile Asp His Thr Pro
290 295 300

Val Asp Asn Leu Pro Thr Asp Leu Tyr Arg Leu Ala His Tyr Gly Glu
305 310 315 320

Lys Gly Val Met Leu Leu Leu Ser Asp Ser Thr Asn Ser His Lys Ser
325 330 335

Gly Thr Thr Pro Ser Glu Ser Thr Ile Ala Pro Ala Phe Asp Thr Leu
340 345 350

Phe Lys Glu Ala Gln Gly Arg Val Ile Met Ser Thr Phe Ser Ser Asn
355 360 365

Ile His Arg Val Tyr Gln Ala Ile Gln Tyr Gly Ile Lys Tyr Asn Arg
370 375 380

Lys Ile Ala Val Ile Gly Arg Ser Met Glu Lys Asn Leu Asp Ile Ala
385 390 395 400

Arg Glu Leu Gly Tyr Ile His Leu Pro Tyr Gln Ser Phe Ile Glu Ala
405 410 415

Asn Glu Val Ala Lys Tyr Pro Asp Asn Glu Ile Leu Ile Val Thr Thr
420 425 430

Gly Ser Gln Gly Glu Thr Met Ser Ala Leu Tyr Arg Met Ala Thr Asp
GCG CGC CAC AAA CAA ACC GCT ATT TCT TGC GGA GTG CCT GAA AAA AAT 1690
Ala Arg His Lys Gln Thr Ala Ile Ser Cys Gly Val Pro Glu Lys Asn
535 540 545

ATC TAT TTA ATG GAG GAT GGC GAT CAG GTG GAG GTT GGC CCT GCG TTC 1738
Ile Tyr Leu Met Glu Asp Gly Asp Gln Val Glu Val Gly Pro Ala Phe
550 555 560

ATC AAA AAA GTC GGC ACG ATT AAA AGC GGG AAA AGC TAT GTG GAT AAC 1786
Ile Lys Lys Val Gly Thr Ile Lys Ser Gly Lys Ser Tyr Val Asp Asn
565 570 575

CAA AGC AAT TTG AGT ATT GAT ACA AGC ATC GTG CAA CAA AGA GAA GAA 1834
Gln Ser Asn Leu Ser Ile Asp Thr Ser Ile Val Gln Gln Arg Glu Glu
580 585 590

GTC GCT AGC GCC GGG GTG TTT GTG GCT ACG ATT TTT GTG AAT AAA AAC 1882
Val Ala Ser Ala Gly Val Phe Val Ala Thr Ile Phe Val Asn Lys Asn
595 600 605 610

AAG CAA GCG CTT TTA GAA AGC TCT CAA TTT TCC AGT TTA GGG CTT GTG 1930
Lys Gln Ala Leu Leu Glu Ser Ser Gln Phe Ser Ser Leu Gly Leu Val
615 620 625

GGT TTC AAA GAT GAA AAG CCT TTG ATC AAA GAA ATT CAA GGG GGC TTA 1978
Gly Phe Lys Asp Glu Lys Pro Leu Ile Lys Glu Ile Gln Gly Gly Leu
630 635 640

GAG GTG TTA TTG AAA TCC AGC AAC GCC GAA ATT TTG AAT AAC CCT AAA 2026
Glu Val Leu Leu Lys Ser Ser Asn Ala Glu Ile Leu Asn Asn Pro Lys
645 650 655

AAA TTA GAA GAT CAC ACT CGT AAT TTC ATC AGA AAA GCG CTC TTT AAA 2074
Lys Leu Glu Asp His Thr Arg Asn Phe Ile Arg Lys Ala Leu Phe Lys
660 665 670

AAG TTT AGA AAA TAC CCG GCT ATC ATT TGT CAT GCC CAT TCT TTT TGATT 2124
Lys Phe Arg Lys Tyr Pro Ala Ile Ile Cys His Ala His Ser Phe
675 680 685

GTAACGCTAT TGCTTCACAA GTTTTAAAAG ATGAAGCGAG CGCGC 2169
(2) INFORMATION FOR SEQ ID NO:566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:566: 
Asn Leu Pro Thr Asp Leu Tyr Arg Leu Ala His Tyr Gly Glu Lys Gly
310 315 320

GTG ATG CTT CTT TTA AGC GAT TCC ACC AAC TCC CAT AAA TCC GGG ACT 1066
Val Met Leu Leu Leu Ser Asp Ser Thr Asn Ser His Lys Ser Gly Thr
325 330 335

ACG CCG AGT GAA AGC ACC ATA GCG CCG GCT TTT GAT ACC CTT TTT AAA 1114
Thr Pro Ser Glu Ser Thr Ile Ala Pro Ala Phe Asp Thr Leu Phe Lys
340 345 350

GAA GCG CAA GGG AGG GTG ATT ATG AGC ACC TTC TCT AGC AAT ATC CAC 1162
Glu Ala Gln Gly Arg Val Ile Met Ser Thr Phe Ser Ser Asn Ile His
355 360 365 370

CGG GTC TAT CAA GCC ATA CAA TAC GGC ATT AAA TAC AAC CGC AAG ATC 1210
Arg Val Tyr Gln Ala Ile Gln Tyr Gly Ile Lys Tyr Asn Arg Lys Ile
375 380 385

GCT GTG ATC GGG CGC TCT ATG GAA AAA AAC CTA GAC ATC GCT AGA GAA 1258
Ala Val Ile Gly Arg Ser Met Glu Lys Asn Leu Asp Ile Ala Arg Glu
390 395 400

TTG GGC TAT ATC CAT TTG CCT TAT CAA TCT TTT ATT GAA GCC AAT GAA 1306
Leu Gly Tyr Ile His Leu Pro Tyr Gln Ser Phe Ile Glu Ala Asn Glu
405 410 415

GTC GCC AAA TAC CCG GAC AAT GAA ATC TTA ATC GTA ACG ACC GGC TCA 1354
Val Ala Lys Tyr Pro Asp Asn Glu Ile Leu Ile Val Thr Thr Gly Ser
420 425 430

CAA GGC GAA ACC ATG AGC GCG CTT TAT CGC ATG GCG ACT GAT GAA CAC 1402
Gln Gly Glu Thr Met Ser Ala Leu Tyr Arg Met Ala Thr Asp Glu His
435 440 445 450

CGT CAT ATT TCT ATC AAA CCC AAC GAT TTA GTC ATC ATT TCC GCT AAA 1450
Arg His Ile Ser Ile Lys Pro Asn Asp Leu Val Ile Ile Ser Ala Lys
455 460 465

GCC ATT CCT GGC AAT GAA GCG AGC GTT TCA GCG GTG TTG AAT TTC TTG 1498
Ala Ile Pro Gly Asn Glu Ala Ser Val Ser Ala Val Leu Asn Phe Leu
470 475 480

ATC AAA AAA GAA GCT AAA GTG GCT TAT CAA GAA TTT GAC AAT ATC CAT 1546
Ile Lys Lys Glu Ala Lys Val Ala Tyr Gln Glu Phe Asp Asn Ile His
485 490 495

GTG AGC GGG CAT GCC GCC CAA GAA GAG CAA AAG CTC ATG TTA AGA CTC 1594
Val Ser Gly His Ala Ala Gln Glu Glu Gln Lys Leu Met Leu Arg Leu
500 505 510

ATT AAG CCT AAG TTT TTC TTA CCC GTG CAT GGG GAA TAT AAC CAT GTC 1642
Ile Lys Pro Lys Phe Phe Leu Pro Val His Gly Glu Tyr Asn His Val
515 520 525 530
85 90 95

CAT GTG AAT GAG AGA GGG ACT TTA GGC TTT CAT AAG GAG TTA AAA AAG 394
His Val Asn Glu Arg Gly Thr Leu Gly Phe His Lys Glu Leu Lys Lys
100 105 110

GGC GTT GAA GCG AAT AAC AAG ATC CAA GTG GAG CAT TTA AAC CCG CAT 442
Gly Val Glu Ala Asn Asn Lys Ile Gln Val Glu His Leu Asn Pro His
115 120 125 130

TAT AAG ATG AAC TTA AAC TCT AAA GCG AGC GTT AAA ATC ACG CCT TTA 490
Tyr Lys Met Asn Leu Asn Ser Lys Ala Ser Val Lys Ile Thr Pro Leu
135 140 145

GGG GGC TTG GGT GAG ATT GGG GGG AAC ATG ATG GTC ATT GAA ACC CCA 538
Gly Gly Leu Gly Glu Ile Gly Gly Asn Met Met Val Ile Glu Thr Pro
150 155 160

AAA AGC GCG ATC GTG ATT GAT GCG GGC ATG AGC TTC CCT AAA GAG GGG 586
Lys Ser Ala Ile Val Ile Asp Ala Gly Met Ser Phe Pro Lys Glu Gly
165 170 175

CTC TTT GGC GTG GAT ATT TTA ATC CCG GAT TTT TCC TAC TTG CAC CAA 634
Leu Phe Gly Val Asp Ile Leu Ile Pro Asp Phe Ser Tyr Leu His Gln
180 185 190

ATC AAG GAC AAA ATC GCT GGC ATT ATC ATC ACC CAT GCC CAT GAA GAT 682
Ile Lys Asp Lys Ile Ala Gly Ile Ile Ile Thr His Ala His Glu Asp
195 200 205 210

CAC ATA GGG GCC ACG CCT TAT TTG TTT AAA GAG CTG CAA TTC CCC CTT 730
His Ile Gly Ala Thr Pro Tyr Leu Phe Lys Glu Leu Gln Phe Pro Leu
215 220 225

TAT GGC ACG CCC TTG AGT TTG GGG CTG ATT GGG AGC AAG TTT GAT GAA 778
Tyr Gly Thr Pro Leu Ser Leu Gly Leu Ile Gly Ser Lys Phe Asp Glu
230 235 240

CAT GGT TTG AAA AAA TAC CGC TCG TAT TTT AAA ATC GTA GAA AAG CGC 826
His Gly Leu Lys Lys Tyr Arg Ser Tyr Phe Lys Ile Val Glu Lys Arg
245 250 255

TGT CCC ATT AGC GTG GGC GAA TTT ATC ATT GAA TGG ATC CAC ATC ACG 874
Cys Pro Ile Ser Val Gly Glu Phe Ile Ile Glu Trp Ile His Ile Thr
260 265 270

CAT TCT ATC ATT GAC AGC AGC GCT TTA GCG ATC CAA ACT AAA GCC GGA 922
His Ser Ile Ile Asp Ser Ser Ala Leu Ala Ile Gln Thr Lys Ala Gly
275 280 285 290

ACG ATC ATC CAC ACC GGC GAT TTT AAA ATC GAT CAC ACC CCG GTG GAT 970
Thr Ile Ile His Thr Gly Asp Phe Lys Ile Asp His Thr Pro Val Asp
295 300 305

AAT TTG CCC ACG GAT TTG TAT CGT TTA GCG CAC TAT GGC GAA AAG GGG 1018
Gly Leu Arg Met Gly Asp Lys His His Thr Leu Glu Leu Ser Thr Ser
65 70 75 80

Val His Gly Asp Ala Pro Ser Cys Ser Leu Lys Lys Leu Lys Ser Cys
85 90 95

Glu Ser Ala Arg Val Leu Gln Ala Lys Ile Pro Arg Gly Ile Phe Glu
100 105 110

Ser Tyr Val Thr Trp Ser Ala Asp Tyr Val Tyr Arg Phe
115 120 125

(2) INFORMATION FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 53...2119
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565:

CTTTTAAAAG GCTAATGCCT TTTTAAAAAA TTGAAATAAA GGAATAAAAG TT ATG ACG 58
Met Thr
1

GAT AAC AAC CAA AAC AAT GAA AAC CAT GAA AAC AGC AGT GAA AAT TCA 106
Asp Asn Asn Gln Asn Asn Glu Asn His Glu Asn Ser Ser Glu Asn Ser
5 10 15

AAA GCT GAT GAG ATG CGA GCC GGA GCG TTT GAG CGC TTC ACC AAC CGC 154
Lys Ala Asp Glu Met Arg Ala Gly Ala Phe Glu Arg Phe Thr Asn Arg
20 25 30

AAA AAG CGT TTC AGA GAA AAC GCG CAA AAA AAC GCA GAG TAT TCA AAC 202
Lys Lys Arg Phe Arg Glu Asn Ala Gln Lys Asn Ala Glu Tyr Ser Asn
35 40 45 50

CAT GAA GCG TCT TCG CAC CAT AAA AAA GAG CAT CGC CCT AAC AAA AAA 250
His Glu Ala Ser Ser His His Lys Lys Glu His Arg Pro Asn Lys Lys
55 60 65

CCA AAC AAC CAC CAC AAA CAA AAA CAT GCC AAA ACA CGA AAT TAC GCC 298
Pro Asn Asn His His Lys Gln Lys His Ala Lys Thr Arg Asn Tyr Ala
70 75 80

CAA GAA GAA TTG GAT AGC AAC AAA GTA GAG GGC GTT ACG GAA ATT TTG 346
Gln Glu Glu Leu Asp Ser Asn Lys Val Glu Gly Val Thr Glu Ile Leu
TAT GAT AAC AAT GTC AAC TTA GAG CTT TTT TAT GGA TTT TTA CAC AAT 104
Tyr Asp Asn Asn Val Asn Leu Gln Leu Phe Tyr Gly Phe Leu His Asn
5 10 15

GTG TAT GAA AAT GAG AAG TTT TTC ATC GGT TAT TTT ATA GGG GCT GGG 152
Val Tyr Glu Asn Glu Lys Phe Phe Ile Gly Tyr Phe Ile Gly Ala Gly
20 25 30

CTA GGG GGT GAG AGC GTA ACA CCC AAT GTT CTT AAA GAT TTT GGT AAT 200
Leu Gly Gly Glu Ser Val Thr Pro Asn Val Leu Lys Asp Phe Gly Asn
35 40 45 50

ATG TTA GGG CAA TTA GTG CAA TTT GAG GGC TAT GGC TCA CTA GGG CTA 248
Met Leu Ala Gln Leu Val Gln Phe Gln Gly Tyr Gly Ser Leu Gly Leu
55 60 65

AGG ATG GGC GAT AAA CAC CAC ACG CTA GAA TTG AGC ACG AGC GTT CAT 296
Arg Met Gly Asp Lys His His Thr Leu Glu Leu Ser Thr Ser Val His
70 75 80

GGC GAC GCT CCT AGT TGT TCT TTA AAA AAG CTA AAG AGT TGC GAA AGT 344
Gly Asp Ala Pro Ser Cys Ser Leu Lys Lys Leu Lys Ser Cys Glu Ser
85 90 95

GCG AGG GTT TTA CAA GCA AAA ATC CCT AGG GGC ATT TTT GAA AGC TAT 392
Ala Arg Val Leu Gln Ala Lys Ile Pro Arg Gly Ile Phe Glu Ser Tyr
100 105 110

GTT ACT TGG AGC GCG GAT TAT GTT TAT CGT TTT TAAAAGTTTT TAAAAATTTA 445
Val Thr Trp Ser Ala Asp Tyr Val Tyr Arg Phe
115 120 125

ATGGCTTTGT TCCAATTGAA TAGGGGTAAT AAA 478
(2) INFORMATION FOR SEQ ID NO: 564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:564: 

Trp Trp Tyr Asp Asn Asn Val Asn Leu Gln Leu Phe Tyr Gly Phe Leu
1 5 10 15

His Asn Val Tyr Glu Asn Glu Lys Phe Phe Ile Gly Tyr Phe Ile Gly
20 25 30

Ala Gly Leu Gly Gly Glu Ser Val Thr Pro Asn Val Leu Lys Asp Phe
35 40 45

Gly Asn Met Leu Ala Gln Leu Val Gln Phe Gln Gly Tyr Gly Ser Leu
50 55 60
Thr Thr Lys Gly Lys Ala Pro Lys Lys Asp Ile Glu Lys Leu Leu Ser
35 40 45 50

TAT GGC TTG ACA ATG GCG ACA AGC TGT CAT TAATGAGAGA ACTTTTTAAA AGC 253
Tyr Gly Leu Thr Met Ala Thr Ser Cys His
55 60

GTTAGAGGGT TTTTTCGCCT TCTTAGAATG 283

(2) INFORMATION FOR SEQ ID NO:562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:562: 

Met Val Ala Gly Gly Arg Glu Val Val Phe Lys Asp Asn Asp Lys Lys
1 5 10 15

Glu Ala Lys Leu Phe Glu Tyr Gly Ile Asn Ala Val Val Leu Gly Asp
20 25 30

Tyr Leu Thr Thr Lys Gly Lys Ala Pro Lys Lys Asp Ile Glu Lys Leu
35 40 45

Leu Ser Tyr Gly Leu Thr Met Ala Thr Ser Cys His
50 55 60

(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 425 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:563: 

CACTTGTTTG GAGTGCCTAT AGGCATAATA AGTCCTGTTT CTATTTTTAA TGG TGG 56 

Trp Trp 
1 






100 105 110

Gly Trp Lys Asp Arg His Thr Glu Arg Lys Arg Leu Ser Phe Asn Thr
115 120 125

Ile Gly Ser Asn Ala Glu Tyr Asp Phe Gln Asp Ala Leu Glu His Phe
130 135 140

Asn Asp Ser Gln Ala Ser Lys Asn Lys Asn Asn Lys Asn Asn Tyr Leu
145 150 155 160

Lys Asn Ala Ile Cys Leu Lys Asp Tyr Leu Met Lys Lys Glu Ile Lys
165 170 175

Asn Ile Asn Asp Phe Ile Glu Trp Leu Tyr Ser Asn Val Lys Phe Ile
180 185 190

Thr Ile Ile Cys Pro Asn Ile Asp Lys Ala Leu Arg Ile Phe Asn Val
195 200 205

Leu Asn Ala Arg Gly Leu Pro Leu Asn Ala Thr Asp Ile Phe Lys Gly
210 215 220

Glu Leu Leu Lys His Ala Lys Glu His Glu Arg Glu Glu Phe Val Ser
225 230 235 240

Arg Trp Asn Ala Leu Ser Gln Lys Cys Ser Asp Asn Asp Leu Thr Met
245 250 255

Glu Thr Leu Phe Ser Trp Tyr Asn Leu Ser Gln Ser Gly Asn Phe
260 265 270



(2) INFORMATION FOR SEQ ID NO: 561: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 230 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 

CCCTAGAATG CGTGCTTTTG GCTAAGGAAT TTTTGCCTAA CGCTAGGCTT ATG GTG 56 

Met Val 
1 

GCT GGG GGG CGT GAA GTG GTG TTT AAA GAT AAT GAC AAA AAG GAA GCC 104 
Ala Gly Gly Arg Glu Val Val Phe Lys Asp Asn Asp Lys Lys Glu Ala 
5 10 15 

AAG CTT TTT GAA TAC GGC ATC AAT GCG GTG GTG CTA GGG GAC TAT TTG 152 
Lys Leu Phe Glu Tyr Gly Ile Asn Ala Val Val Leu Gly Asp Tyr Leu
20 25 30 

ACC ACC AAA GGC AAA GCC CCT AAA AAA GAT ATA GAA AAA CTG CTC TCT 200 
165 170 175

AAC GAT TTC ATT GAG TGG CTG TAT TCT AAT GTT AAA TTT ATC AGO ATC 633
Asn Asp Phe Ile Glu Trp Leu Tyr Ser Asn Val Lys Phe Ile Thr Ile
180 185 190

ATT TGC CCA AAC ATA GAC AAG GCA TTA AGG ATT TTT AAT GTT TTA AAC 681
Ile Cys Pro Asn Ile Asp Lys Ala Leu Arg Ile Phe Asn Val Leu Asn
195 200 205 210

GCT AGG GGT TTG CCT TTG AAT GCG ACA GAT ATT TTT AAG GGG GAA TTA 729
Ala Arg Gly Leu Pro Leu Asn Ala Thr Asp Ile Phe Lys Gly Glu Leu
215 220 225

TTA AAA CAC GCT AAA GAG CAT GAG CGA GAA GAA TTT GTG TCT CGT TGG 777
Leu Lys His Ala Lys Glu His Glu Arg Glu Glu Phe Val Ser Arg Trp
230 235 240

AAC GCC TTA AGC CAA AAA TGC TCG GAC AAT GAT TTA ACA ATG GAG ACA 825
Asn Ala Leu Ser Gln Lys Cys Ser Asp Asn Asp Leu Thr Met Glu Thr
245 250 255

TTA TTC AGT TGG TAT AAC CTA TCT CAA TCC GGT AAC TTC TAGAGACAAA AT 876
Leu Phe Ser Trp Tyr Asn Leu Ser Gln Ser Gly Asn Phe
260 265 270

GGAAAAAGAG CTCGTTACTT GGTTCAACAT ACTTAACAAA CCCCCCCTAG AATACCTTAA 936

GGGCGTAGAG GATTTTTACA ACGCTTATGG TGAGGTGTTA GAAATGCAAG ATCG 990

(2) INFORMATION FOR SEQ ID NO:560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 

Met Ala Lys Ile Glu Ser Asn Asp Ser His Leu Arg Gly Ile Leu Lys
1 5 10 15

Asp Glu Leu Tyr Tyr Gln Ile Pro Ile Tyr Gln Arg Pro Tyr Gln Trp
20 25 30

Thr Glu Glu Asn Cys Glu Lys Leu Leu Asp Asp Leu Phe Phe Asn Tyr
35 40 45

Glu Asp Asp Arg Glu Gly Asp Tyr Phe Cys Gly Ser Leu Val Leu Ile
50 55 60

Ala Ile Ser Lys Asp Ser Lys Ala Thr Thr Tyr Asp Val Val Asp Gly
65 70 75 80

Gln Gln Arg Leu Ser Thr Phe Ile Leu Leu Ala Lys Val Leu Ala Asp
85 90 95

Leu Tyr Asn Asp Cys Leu Asp Pro Lys Asn Leu Glu His Leu Gln Glu
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 52...864
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

AAAAACACTT TTTAATGTTA TAATCTATCC TAAACAATAT AAGGGGTTTT T ATG GCA 57
Met Ala
1

AAA ATT GAA AGC AAT GAT TCC CAC CTA AGA GGT ATT TTA AAA GAC GAA 105
Lys Ile Glu Ser Asn Asp Ser His Leu Arg Gly Ile Leu Lys Asp Glu
5 10 15

CTC TAG TAT CAA ATC CCC ATC TAC CAA CGC CCT TAT CAA TGG ACA GAA 153
Leu Tyr Tyr Gln Ile Pro Ile Tyr Gln Arg Pro Tyr Gln Trp Thr Glu
20 25 30

GAA AAC TGC GAA AAA CTT TTA GAC GAT TTG TTT TTT AAT TAT GAA GAT 201
Glu Asn Cys Glu Lys Leu Leu Asp Asp Leu Phe Phe Asn Tyr Glu Asp
35 40 45 50

GAC AGA GAA GGC GAT TAT TTT TGC GGC TCA TTA GTC TTA ATT GCA ATC 249
Asp Arg Glu Gly Asp Tyr Phe Cys Gly Ser Leu Val Leu Ile Ala Ile
55 60 65

AGC AAA GAT TCT AAA GCC ACA ACC TAT GAT GTT GTA GAT GGC CAG CAA 297
Ser Lys Asp Ser Lys Ala Thr Thr Tyr Asp Val Val Asp Gly Gln Gln
70 75 80

CGC TTA AGC ACT TTC ATT CTG CTT GCA AAA GTT TTA GCC GAT CTT TAT 345
Arg Leu Ser Thr Phe Ile Leu Leu Ala Lys Val Leu Ala Asp Leu Tyr
85 90 95

AAT GAT TGT TTA GAC CCT AAG AAT TTA GAA CAT TTA CAA GAG GGT TGG 393
Asn Asp Cys Leu Asp Pro Lys Asn Leu Glu His Leu Gln Glu Gly Trp
100 105 110

AAA GAT AGG CAT ACA GAA AGA AAA CGA CTG AGT TTT AAC ACT ATA GGG 441
Lys Asp Arg His Thr Glu Arg Lys Arg Leu Ser Phe Asn Thr Ile Gly
115 120 125 130

TCT AAC GCT GAA TAT GAT TTT CAA GAT GCA TTA GAA CAT TTC AAC GAC 489
Ser Asn Ala Glu Tyr Asp Phe Gln Asp Ala Leu Glu His Phe Asn Asp
135 140 145

TCT CAA GCA AGC AAG AAT AAA AAT AAT AAG AAC AAT TAC CTA AAA AAT 537
Ser Gln Ala Ser Lys Asn Lys Asn Asn Lys Asn Asn Tyr Leu Lys Asn
150 155 160

GCG ATC TGT TTA AAA GAC TAT CTC ATG AAA AAA GAG ATT AAA AAC ATT 585
Ala Ile Cys Leu Lys Asp Tyr Leu Met Lys Lys Glu Ile Lys Asn Ile
1 5 10 15

Phe Gln Gly Val Ile Phe Leu Asn Pro Lys Ile Val Ser Trp Leu Leu
20 25 30

Lys Ala Tyr Arg Met Ser Asp Asn Leu Leu His Lys Asp Ile Gln Ala
35 40 45

Leu Ile Ala Arg Leu Lys Arg Gln Asp Leu Ser Leu Gly Met Leu Glu
50 55 60

Lys Ser Leu Ser Arg Leu Ile His Asp Glu Ile Asn Leu Glu Tyr Leu
65 70 75 80

Lys Ala Cys Gly Leu Asn Phe Ile Glu Thr Ser Glu Asn Leu Ile Thr
85 90 95

Leu Lys Asn Leu Lys Thr Pro Leu Lys Asp Glu Val Phe Ser Phe Ile
100 105 110

Asp Leu Glu Thr Thr Gly Ser Cys Pro Ile Lys His Glu Ile Leu Glu
115 120 125

Ile Gly Ala Val Gln Val Lys Gly Gly Glu Ile Ile Asn Arg Phe Glu
130 135 140

Thr Leu Val Lys Val Lys Ser Val Pro Asp Tyr Ile Ala Glu Leu Thr
145 150 155 160

Gly Ile Thr Tyr Glu Asp Thr Leu Asn Ala Pro Ser Ala His Glu Ala
165 170 175

Leu Gln Glu Leu Arg Leu Phe Leu Gly Asn Ser Val Phe Val Ala His
180 185 190

Asn Ala Asn Phe Asp Tyr Asn Phe Leu Gly Arg Tyr Phe Val Glu Lys
195 200 205

Leu His Cys Pro Leu Leu Asn Leu Lys Leu Cys Thr Leu Asp Leu Ser
210 215 220

Lys Arg Ala Ile Leu Ser Met Arg Tyr Ser Leu Ser Phe Leu Lys Glu
225 230 235 240

Leu Leu Gly Phe Gly Ile Glu Val Ser His Arg Ala Tyr Ala Asp Ala
245 250 255

Leu Ala Ser Tyr Lys Leu Phe Glu Ile Cys Leu Leu Asn Leu Pro Ser
260 265 270

Tyr Ile Lys Thr Thr Met Asp Leu Ile Asp Phe Ser Lys Cys Ala Asn
275 280 285

Thr Leu Ile Lys Arg Pro Pro Lys Ala Arg Tyr Gln Glu Ile Pro Ser
290 295 300

Pro Phe Ser Leu Phe Glu Lys Thr Lys Gly Leu Phe Asn His Lys Ser
305 310 315 320

Asn Gln Leu Asn Glu Ser Cys Leu Met Gly Phe Met Gly Thr Glu Ile
325 330 335

Leu Ala Ser Leu Phe Asp Thr Phe Glu Cys Cys Leu Val Phe
340 345 350

(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
Asn Phe Asp Tyr Asn Phe Leu Gly Arg Tyr Phe Val Glu Lys Leu His
195 200 205 210

TGC CCT TTA TTG AAT TTA AAG CTT TGC ACT CTG GAT TTA TCC AAA CGA 728
Cys Pro Leu Leu Asn Leu Lys Leu Cys Thr Leu Asp Leu Ser Lys Arg
215 220 225

GCC ATT TTG TCC ATG CGT TAT TCT TTG AGC TTT TTA AAA GAG CTT TTA 776
Ala Ile Leu Ser Met Arg Tyr Ser Leu Ser Phe Leu Lys Glu Leu Leu
230 235 240

GGG TTT GGT ATA GAA GTC AGC CAT AGA GCC TAT GCG GAC GCT TTA GCG 824
Gly Phe Gly Ile Glu Val Ser His Arg Ala Tyr Ala Asp Ala Leu Ala
245 250 255

AGC TAT AAA CTC TTT GAA ATA TGC CTG TTA AAC TTG CCC AGC TAC ATC 872
Ser Tyr Lys Leu Phe Glu Ile Cys Leu Leu Asn Leu Pro Ser Tyr Ile
260 265 270

AAA ACG ACA ATG GAT TTG ATT GAT TTT TCT AAA TGT GCT AAC ACG CTA 920
Lys Thr Thr Met Asp Leu Ile Asp Phe Ser Lys Cys Ala Asn Thr Leu
275 280 285 290

ATC AAA AGA CCC CCA AAA GCC AGA TAC CAA GAG ATT CCA TCG CCA TTT 968
Ile Lys Arg Pro Pro Lys Ala Arg Tyr Gln Glu Ile Pro Ser Pro Phe
295 300 305

TCT CTT TTT GAA AAG ACA AAG GGC TTG TTC AAT CAT AAA AGC AAC CAG 1016
Ser Leu Phe Glu Lys Thr Lys Gly Leu Phe Asn His Lys Ser Asn Gln
310 315 320

TTA AAC GAA AGC TGT TTA ATG GGG TTT ATG GGG ACT GAA ATT TTA GCA 1064
Leu Asn Glu Ser Cys Leu Met Gly Phe Met Gly Thr Glu Ile Leu Ala
325 330 335

TCT CTA TTT GAT ACT TTT GAA TGT TGC CTA GTA TTT TGATTTTATC GGTTAC 1116
Ser Leu Phe Asp Thr Phe Glu Cys Cys Leu Val Phe
340 345 350

TTCGCACTCA TCGTATATCT TTTTGTATTC TTGTATG 1153
(2) INFORMATION FOR SEQ ID NO: 558:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 350 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558:

Met Ile Lys Ile Ser Leu Asn Ser Asn Lys Arg Ala Trp Met Trp Trp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:557:

GATCAAACGC TACAACCCTA GCTGTCTTTT AGAAGTGGAT GGGGGCGTGA ATG ATA 56
Met Ile
1

AAA ATA TCT TTG AAC TCC AAC AAG CGG GCG TGG ATG TGG TGG TTT CAG 104
Lys Ile Ser Leu Asn Ser Asn Lys Arg Ala Trp Met Trp Trp Phe Gln
5 10 15

GGA GTT ATA TTT TTA AAT CCA AAG ATC GTA AGC TGG CTA TTG AAG GCT 152
Gly Val Ile Phe Leu Asn Pro Lys Ile Val Ser Trp Leu Leu Lys Ala
20 25 30

TAC AGA ATG TCA GAC AAT CTC TTG CAT AAA GAC ATC CAA GCC CTA ATC 200
Tyr Arg Met Ser Asp Asn Leu Leu His Lys Asp Ile Gln Ala Leu Ile
35 40 45 50

GCT CGC TTA AAG CGC CAG GAC TTA AGC TTG GGC ATG CTA GAA AAA TCG 248
Ala Arg Leu Lys Arg Gln Asp Leu Ser Leu Gly Met Leu Glu Lys Ser
55 60 65

CTC TCT CGC CTT ATT CAT GAT GAA ATC AAT TTG GAG TAT TTG AAG GCG 296
Leu Ser Arg Leu Ile His Asp Glu Ile Asn Leu Glu Tyr Leu Lys Ala
70 75 80

TGC GGG CTC AAT TTC ATA GAA ACG AGC GAA AAT TTA ATC ACG CTC AAA 344
Cys Gly Leu Asn Phe Ile Glu Thr Ser Glu Asn Leu Ile Thr Leu Lys
85 90 95

AAC CTT AAA ACC CCC CTT AAA GAT GAG GTT TTT TCC TTT ATT GAT TTA 392
Asn Leu Lys Thr Pro Leu Lys Asp Glu Val Phe Ser Phe Ile Asp Leu
100 105 110

GAA ACC ACC GGA TCT TGC CCC ATA AAG CAT GAG ATT TTA GAA ATT GGG 440
Glu Thr Thr Gly Ser Cys Pro Ile Lys His Glu Ile Leu Glu Ile Gly
115 120 125 130

GCC GTG CAA GTG AAA GGG GGG GAA ATT ATC AAT CGT TTT GAA ACC CTT 488
Ala Val Gln Val Lys Gly Gly Glu Ile Ile Asn Arg Phe Glu Thr Leu
135 140 145

GTG AAA GTC AAA AGC GTG CCT GAT TAT ATC GCT GAG CTT ACA GGC ATC 536
Val Lys Val Lys Ser Val Pro Asp Tyr Ile Ala Glu Leu Thr Gly Ile
150 155 160

ACT TAT GAA GAC ACC CTA AAC GCC CCA AGC GCG CAT GAA GCT TTG CAA 584
Thr Tyr Glu Asp Thr Leu Asn Ala Pro Ser Ala His Glu Ala Leu Gln
165 170 175

GAA TTG CGG CTT TTT TTA GGC AAT AGC GTG TTT GTG GCC CAC AAC GCT 632
Glu Leu Arg Leu Phe Leu Gly Asn Ser Val Phe Val Ala His Asn Ala
180 185 190

AAT TTT GAT TAC AAC TTT TTG GGG CGT TAT TTT GTA GAA AAA TTG CAT 680
Met Glu Ile Arg Thr Phe Leu Glu Arg Ala Leu Lys Glu Asp Leu Gly
1 5 10 15

His Gly Asp Leu Phe Glu Arg Val Leu Glu Lys Asp Phe Lys Ala Thr
20 25 30

Ala Phe Val Arg Ala Lys Gln Glu Gly Val Phe Ser Gly Glu Lys Tyr
35 40 45

Ala Leu Glu Leu Leu Glu Met Thr Gly Ile Glu Cys Val Gln Thr Ile
50 55 60

Lys Asp Lys Glu Arg Phe Lys Pro Lys Asp Ala Leu Met Glu Ile Arg
65 70 75 80

Gly Asp Phe Ser Met Leu Leu Lys Val Glu Arg Thr Leu Leu Asn Leu
85 90 95

Leu Gln His Ser Ser Gly Ile Ala Thr Leu Thr Ser Arg Phe Val Glu
100 105 110

Ala Leu Asn Ser His Lys Val Arg Leu Leu Asp Thr Arg Lys Thr Arg
115 120 125

Pro Leu Leu Arg Ile Phe Glu Lys Tyr Ser Val Leu Asn Gly Gly Ala
130 135 140

Ser Asn His Arg Leu Gly Leu Asp Asp Ala Leu Met Leu Lys Asp Thr
145 150 155 160

His Leu Arg His Val Lys Asp Leu Lys Ser Phe Leu Thr His Ala Arg
165 170 175

Lys Asn Leu Pro Phe Thr Ala Lys Ile Glu Ile Glu Cys Glu Ser Phe
180 185 190

Glu Glu Ala Lys Asn Ala Met Asn Ala Gly Ala Asp Ile Val Met Cys
195 200 205

Asp Asn Leu Ser Val Leu Glu Thr Lys Glu Ile Ala Ala Tyr Arg Asp
210 215 220

Ala His Tyr Pro Phe Val Leu Leu Glu Ala Ser Gly Asn Ile Ser Leu
225 230 235 240

Glu Ser Ile Asn Ala Tyr Ala Lys Ser Gly Val Asp Ala Ile Ser Val
245 250 255

Gly Ala Leu Ile His Gln Ala Thr Phe Ile Asp Met His Met Lys Met
260 265 270

Ala



(2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1100
(D) OTHER INFORMATION: 
AAA GTG CGT TTG TTG GAC ACG AGA AAA ACG AGA CCC CTT TTA AGG ATC 438

Lys Val Arg Leu Leu Asp Thr Arg Lys Thr Arg Pro Leu Leu Arg Ile
120 125 130

TTT GAA AAA TAT TCC GTG CTT AAT GGG GGA GCG AGC AAC CAC CGC TTA 486

Phe Glu Lys Tyr Ser Val Leu Asn Gly Gly Ala Ser Asn His Arg Leu
135 140 145

GGG CTA GAT GAC GCT TTA ATG CTT AAA GAC ACG CAT TTA AGG CAT GTG 534
Gly Leu Asp Asp Ala Leu Met Leu Lys Asp Thr His Leu Arg His Val
150 155 160 165

AAA GAT CTC AAA AGC TTT TTA ACG CAT GCC AGA AAA AAC TTG CCT TTC 582

Lys Asp Leu Lys Ser Phe Leu Thr His Ala Arg Lys Asn Leu Pro Phe
170 175 180

ACG GCT AAA ATT GAA ATT GAA TGC GAA AGC TTT GAA GAG GCC AAA AAC 630

Thr Ala Lys Ile Glu Ile Glu Cys Glu Ser Phe Glu Glu Ala Lys Asn
185 190 195

GCC ATG AAT GCG GGA GCG GAT ATT GTG ATG TGC GAT AAT TTG AGC GTT 678

Ala Met Asn Ala Gly Ala Asp Ile Val Met Cys Asp Asn Leu Ser Val
200 205 210

TTA GAG ACT AAA GAA ATT GCC GCT TAT AGA GAT GCG CAT TAT CCC TTT 726

Leu Glu Thr Lys Glu Ile Ala Ala Tyr Arg Asp Ala His Tyr Pro Phe
215 220 225

GTT TTA CTG GAA GCG AGC GGG AAC ATT TCA CTA GAG AGC ATT AAC GCT 774
Val Leu Leu Glu Ala Ser Gly Asn Ile Ser Leu Glu Ser Ile Asn Ala
230 235 240 245

TAC GCT AAA AGC GGC GTG GAT GCC ATT AGC GTA GGG GCT TTA ATC CAT 822

Tyr Ala Lys Ser Gly Val Asp Ala Ile Ser Val Gly Ala Leu Ile His
250 255 260

CAA GCC ACT TTT ATT GAC ATG CAC ATG AAA ATG GCT TAAAGACTTT AAAAAG 874
Gln Ala Thr Phe Ile Asp Met His Met Lys Met Ala
265 270

GGGTTATTAA CATGCTAAAA GAATATTTAG AAAGCATTAA AGATCTTACG CCTGAAAAGA 934
ATGAACTCAC GCACCGCCCT TCTTTATACA ACTTGCTTAA TCAGTTAAAA AACCAT 990

(2) INFORMATION FOR SEQ ID NO: 556:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 273 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556: 
Gln Lys Glu Tyr Leu Lys His Tyr His Glu Leu Asn Pro Ser Glu Gln
225 230 235 240 

(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 40...858
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 

GGCCAAACTC GCTTTAACTA AAATGATGGA GTTATCTTA ATG GAG ATT AGA ACC 54 

Met Glu Ile Arg Thr
1 5

TTT TTA GAA CGC GCT TTA AAA GAA GAT TTA GGG CAT GGG GAT TTG TTT 102
Phe Leu Glu Arg Ala Leu Lys Glu Asp Leu Gly His Gly Asp Leu Phe
10 15 20

GAA AGG GTG TTA GAA AAA GAT TTT AAG GCC ACA GCG TTT GTT AGG GCT 150

Glu Arg Val Leu Glu Lys Asp Phe Lys Ala Thr Ala Phe Val Arg Ala
25 30 35

AAA CAA GAG GGC GTG TTT TCA GGC GAA AAA TAC GCT TTA GAG TTG TTG 198
Lys Gln Glu Gly Val Phe Ser Gly Glu Lys Tyr Ala Leu Glu Leu Leu
40 45 50

GAA ATG ACC GGC ATT GAA TGC GTT CAA ACG ATT AAG GAT AAA GAA CGC 246

Glu Met Thr Gly Ile Glu Cys Val Gln Thr Ile Lys Asp Lys Glu Arg
55 60 65

TTC AAG CCT AAA GAC GCT TTA ATG GAA ATT AGG GGG GAT TTT AGC ATG 294

Phe Lys Pro Lys Asp Ala Leu Met Glu Ile Arg Gly Asp Phe Ser Met
70 75 80 85

CTT TTA AAG GTT GAG CGC ACC CTT TTA AAC CTT TTG CAA CAC AGC AGC 342

Leu Leu Lys Val Glu Arg Thr Leu Leu Asn Leu Leu Gln His Ser Ser
90 95 100

GGG ATT GCT ACT TTA ACG AGC CGT TTT GTA GAG GCT TTA AAT TCT CAT 390

Gly Ile Ala Thr Leu Thr Ser Arg Phe Val Glu Ala Leu Asn Ser His
105 110 115 
195 200 205

CCT GAT TTT AAC TCG CCC ACC TGG TAT GAA GAA TTA CAA GAA CGC ATG 730

Pro Asp Phe Asn Ser Pro Thr Trp Tyr Glu Glu Leu Gln Glu Arg Met
210 215 220

CAA AAA GAG TAT TTA AAA CAC TAT CAT GAA TTA AAC CCT AGC GAA CAA TGA 781
Gln Lys Glu Tyr Leu Lys His Tyr His Glu Leu Asn Pro Ser Glu Gln
225 230 235 240

AGCTTTTTGA CTACGCTCCT TTGAGTTTGG CTTGGCGGGA GTTTTTGCAA AGCGAATTTA 841
AAAAGCCTTA TTTTTTAGAA ATAGAAAAAC GCTACCTAGA AGCCCTAAAA ATCCCTAAA 900

(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:554:

Met Lys Ser Asn Lys Lys Ser Asn Arg Leu Arg Ala Ile Tyr Arg Ala
1 5 10 15

Leu Val Ile Ala Ile Gly Leu Ala Val Ile Ile Val Phe Asn Tyr Phe
20 25 30

Asn Arg Lys Asn Asn Asn Ala Arg Ser Ser Arg Arg Ala Cys Ser Cys
35 40 45

Phe Phe Ser Leu Thr Gly Val Asn Leu Glu Lys Ile Gly Thr Phe Asp
50 55 60

Thr Asp Ala Lys Leu Ile Val Leu Asn His Gln Ser Leu Leu Asp Ile
65 70 75 80

Ile Tyr Leu Glu Ala Tyr His Pro Arg Asn Ile Cys Trp Ile Ala Lys
85 90 95

Lys Glu Leu Gly Glu Ile Pro Phe Tyr Gly His Ala Leu Thr Asp Thr
100 105 110

Gly Met Ile Leu Ile Asp Arg Glu Asp Lys Lys Gly Ile Val Ser Leu
115 120 125

Leu Lys Ala Cys Lys Glu Lys Leu Asp Gln Asn Arg Pro Leu Val Ile
130 135 140

Phe Pro Glu Gly Thr Arg Gly Lys Gly Gly Glu Lys Phe Leu Pro Phe
145 150 155 160

Lys Gln Gly Ala Lys Ile Ile Ala Glu Lys Phe Gln Leu Lys Ile Gln
165 170 175

Pro Met Val Leu Ile Asn Ser Ile Lys Ile Phe Asn Ser Lys Pro Leu
180 185 190

Glu Ala Tyr Lys Ala Arg Thr Arg Leu Val Met Leu Glu Ser Tyr Thr
195 200 205

Pro Asp Phe Asn Ser Pro Thr Trp Tyr Glu Glu Leu Gln Glu Arg Met
210 215 220 
CATGATGATT ATTATAAAAT CCTAACGCCG CATGAACAAA TAGGATGGGT CAAAAAAG 58

ATG AAG TCA AAT AAA AAG TCC AAT CGT TTA AGA GCG ATT TAT AGA GCT 106
Met Lys Ser Asn Lys Lys Ser Asn Arg Leu Arg Ala Ile Tyr Arg Ala
1 5 10 15

TTA GTG ATC GCT ATA GGA CTA GCT GTT ATC ATC GTT TTC AAT TAC TTT 154
Leu Val Ile Ala Ile Gly Leu Ala Val Ile Ile Val Phe Asn Tyr Phe
20 25 30

AAC CGC AAA AAC AAT AAC GCC CGC TCC AGC CGT AGG GCT TGT TCG TGC 202

Asn Arg Lys Asn Asn Asn Ala Arg Ser Ser Arg Arg Ala Cys Ser Cys
35 40 45

TTT TTT TCC CTT ACC GGG GTT AAT TTA GAA AAA ATA GGC ACT TTT GAT 250

Phe Phe Ser Leu Thr Gly Val Asn Leu Glu Lys Ile Gly Thr Phe Asp
50 55 60

ACG GAC GCT AAA CTC ATT GTC TTA AAC CAC CAA AGC TTA CTA GAC ATC 298

Thr Asp Ala Lys Leu Ile Val Leu Asn His Gln Ser Leu Leu Asp Ile
65 70 75 80

ATT TAT TTA GAA GCC TAC CAC CCT AGA AAT ATT TGC TGG ATC GCT AAA 346

Ile Tyr Leu Glu Ala Tyr His Pro Arg Asn Ile Cys Trp Ile Ala Lys
85 90 95

AAA GAG CTG GGC GAA ATC CCT TTT TAT GGG CAT GCC TTA ACG GAT ACC 394

Lys Glu Leu Gly Glu Ile Pro Phe Tyr Gly His Ala Leu Thr Asp Thr
100 105 110

GGA ATG ATT TTA ATT GAC AGA GAG GAT AAA AAG GGG ATT GTG AGC CTT 442

Gly Met Ile Leu Ile Asp Arg Glu Asp Lys Lys Gly Ile Val Ser Leu
115 120 125

TTG AAA GCG TGT AAA GAA AAA TTA GAC CAA AAC CGC CCT TTA GTG ATT 490

Leu Lys Ala Cys Lys Glu Lys Leu Asp Gln Asn Arg Pro Leu Val Ile
130 135 140

TTC CCT GAA GGC ACT AGA GGC AAA GGA GGA GAA AAA TTC CTC CCT TTC 538

Phe Pro Glu Gly Thr Arg Gly Lys Gly Gly Glu Lys Phe Leu Pro Phe
145 150 155 160

AAG CAA GGG GCT AAA ATC ATC GCC GAA AAA TTC CAG CTC AAA ATC CAA 586

Lys Gln Gly Ala Lys Ile Ile Ala Glu Lys Phe Gln Leu Lys Ile Gln
165 170 175

CCC ATG GTG TTA ATC AAT TCC ATT AAA ATC TTT AAT TCC AAG CCT CTA 634
Pro Met Val Leu Ile Asn Ser Ile Lys Ile Phe Asn Ser Lys Pro Leu
180 185 190 

GAA GCC TAT AAA GCG CGC ACC CGT TTA GTC ATG CTA GAA AGC TAT ACG 682 
Glu Ala Tyr Lys Ala Arg Thr Arg Leu Val Met Leu Glu Ser Tyr Thr 
(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 



Met Arg Val Met Ala Lys Ile Glu Leu Leu Ala Lys Phe Thr Gln Ile
1 5 10 15

Ala Leu Pro Asn Ser His Pro Leu Leu Lys Lys Val Leu Asn Tyr Ala
20 25 30

Lys Lys His Phe Ser Gln Cys His Met Leu Ser Ser Ser Leu Leu Ile
35 40 45

Leu Asn Asp Thr Glu Cys Phe Lys Lys Asn Tyr Leu Leu Asn Trp Val
50 55 60

Tyr His Ala Leu Glu Cys Val His Glu Lys Asp Ile Ser Ala His Ser
65 70 75 80

Leu Glu Glu Val Leu Gln Lys Ser His Leu Pro Ile Arg Ile Lys Ile
85 90 95

Met Ala Gln Asn Thr Leu Leu Glu Lys Ile Glu Val Lys Val Leu Thr
100 105 110

Phe Gly Ala Glu Tyr Ala Leu Phe Ile Thr Lys His Pro Ile Ala Lys
115 120 125

Arg Phe Leu Arg Gln Lys Phe Ser Gly Cys Val Phe Leu Glu Thr Gln
130 135 140

Asp Glu Leu His Ile Arg Gly Asp Ser Glu Arg Phe Trp Glu Leu Ile
145 150 155 160

Val Thr Leu Asn Glu Asn Arg Ile Val His Asn Ala Cys Leu Asp Phe
165 170 175

Ile Tyr Pro Asn Gly Phe Gly Lys Asp Ser Tyr Thr Thr Met Ala Glu
180 185 190

Arg Lys Leu Lys Glu Cys Tyr Lys Thr Leu Gly Phe Ile Lys His Glu
195 200 205

Asp Phe Ser Glu Val Lys Lys Arg Tyr Leu Glu Leu Ala Lys Thr Tyr
210 215 220

His Pro Asp Leu Cys Asp Leu Lys Glu Lys Lys Ala Leu Tyr Ala Lys
225 230 235 240

Arg Phe Ala Ile Ile Gln Glu Ala Tyr Arg His Ile Lys Lys His Ala
245 250 255



(2) INFORMATION FOR SEQ ID NO: 553: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 59...778
(D) OTHER INFORMATION: 
80 85 90

ATC AAA ATC ATG GCT CAA AAC ACG CTT TTA GAA AAG ATA GAA GTG AAA 338
Ile Lys Ile Met Ala Gln Asn Thr Leu Leu Glu Lys Ile Glu Val Lys
95 100 105

GTT TTA ACC TTT GGG GCG GAA TAT GCG CTT TTT ATC ACC AAA CAC CCT 386

Val Leu Thr Phe Gly Ala Glu Tyr Ala Leu Phe Ile Thr Lys His Pro
110 115 120 125

ATC GCC AAG CGG TTT TTA CGC CAA AAA TTT AGC GGC TGT GTG TTT TTA 434
Ile Ala Lys Arg Phe Leu Arg Gln Lys Phe Ser Gly Cys Val Phe Leu
130 135 140

GAA ACC CAA GAT GAA TTG CAT ATA AGA GGC GAT TCA GAG CGT TTT TGG 482

Glu Thr Gln Asp Glu Leu His Ile Arg Gly Asp Ser Glu Arg Phe Trp
145 150 155

GAA CTC ATT GTA ACG CTC AAT GAA AAT AGA ATC GTC CAT AAC GCA TGC 530

Glu Leu Ile Val Thr Leu Asn Glu Asn Arg Ile Val His Asn Ala Cys
160 165 170

TTA GAT TTC ATC TAC CCT AAT GGC TTT GGC AAG GAC AGC TAC ACC ACT 578

Leu Asp Phe Ile Tyr Pro Asn Gly Phe Gly Lys Asp Ser Tyr Thr Thr
175 180 185

ATG GCT GAA CGC AAA TTA AAA GAA TGC TAT AAA ACG CTA GGG TTT ATC 626

Met Ala Glu Arg Lys Leu Lys Glu Cys Tyr Lys Thr Leu Gly Phe Ile
190 195 200 205

AAG CAT GAA GAT TTC AGC GAA GTC AAA AAG CGC TAT TTA GAA TTG GCT 674
Lys His Glu Asp Phe Ser Glu Val Lys Lys Arg Tyr Leu Glu Leu Ala
210 215 220

AAA ACC TAC CAC CCT GAT TTA TGC GAT CTC AAA GAA AAA AAG GCT CTT 722
Lys Thr Tyr His Pro Asp Leu Cys Asp Leu Lys Glu Lys Lys Ala Leu
225 230 235

TAT GCC AAA CGC TTC GCT ATC ATT CAA GAG GCG TAT CGC CAC ATT AAA 770

Tyr Ala Lys Arg Phe Ala Ile Ile Gln Glu Ala Tyr Arg His Ile Lys
240 245 250 

AAA CAC GCC TAAACCCCTA AACTAGCCCT AATCGCGCTA G 810 
Lys His Ala 
255 



(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
Leu Lys Ile Pro Arg Asn Ala Arg Asp Lys Gln Thr Phe Ala Phe Arg
290 295 300

Asn Glu Gly Val Lys His Pro Glu Ser Ser Tyr Arg Gly Ser Leu Ile
305 310 315 320

Val Glu Leu Gln Val Ile Tyr Pro Lys Ser Leu Asn Lys Glu Gln Gln
325 330 335

Glu Leu Leu Glu Lys Leu His Ala Ser Phe Gly Tyr Glu Gly Glu Pro
340 345 350

His Lys Ser Val Leu Glu Thr Cys Ile Ser Lys Ile Lys Asp Trp Phe
355 360 365

Lys
365 



(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 12...779
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

CGCATGGAGA A ATG AGG GTA ATG GCC AAA ATT GAA TTG TTA GCC AAA TTC 50 
Met Arg Val Met Ala Lys Ile Glu Leu Leu Ala Lys Phe
1 5 10

ACG CAA ATC GCG CTC CCT AAC AGC CAC CCT TTA TTG AAA AAA GTT TTA 98
Thr Gln Ile Ala Leu Pro Asn Ser His Pro Leu Leu Lys Lys Val Leu
15 20 25

AAC TAC GCC AAA AAG CAT TTC AGC CAG TGC CAC ATG CTC TCT TCA TCG 146

Asn Tyr Ala Lys Lys His Phe Ser Gln Cys His Met Leu Ser Ser Ser
30 35 40 45

TTA CTC ATC TTA AAC GAC ACG GAA TGC TTT AAA AAA AAC TAC TTG CTT 194
Leu Leu Ile Leu Asn Asp Thr Glu Cys Phe Lys Lys Asn Tyr Leu Leu
50 55 60

AAT TGG GTC TAT CAT GCC CTT GAA TGC GTG CAT GAA AAA GAT ATT AGC 242

Asn Trp Val Tyr His Ala Leu Glu Cys Val His Glu Lys Asp Ile Ser
65 70 75

GCG CAT TCT TTA GAA GAG GTT TTA CAA AAA AGC CAC CTG CCC ATA CGC 290

Ala His Ser Leu Glu Glu Val Leu Gln Lys Ser His Leu Pro Ile Arg
ACC TGT ATT TCT AAA ATT AAA GAC TGG TTC AAA TAAAAGGTTG TTGATGCATG 1162
Thr Cys Ile Ser Lys Ile Lys Asp Trp Phe Lys
360 365

AGTTTCTAAA AGCTTTTAAA GACGCTTTCC CTCATACCAT TT 1204

(2) INFORMATION FOR SEQ ID NO: 550:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 369 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 



Val Glu Leu Ser Tyr Tyr Glu Ile Leu Glu Val Glu Lys His Ser Asn
1 5 10 15

Gln Glu Thr Ile Lys Lys Ser Tyr Arg Lys Leu Ala Leu Lys Tyr His
20 25 30

Pro Asp Arg Asn Ala Gly Asp Lys Glu Ala Glu Glu Lys Phe Lys Leu
35 40 45

Ile Asn Glu Ala Tyr Gly Val Leu Ser Asp Glu Lys Lys Arg Ala Leu
50 55 60

Tyr Asp Arg Tyr Gly Lys Lys Gly Leu Asn Gln Ala Gly Ala Ser Gln
65 70 75 80

Gly Asp Phe Ser Asp Phe Phe Glu Asp Leu Gly Ser Phe Phe Glu Asp
85 90 95

Ala Phe Gly Phe Gly Ala Arg Gly Ser Lys Arg Gln Lys Ser Ser Ile
100 105 110

Ala Pro Asp Tyr Leu Gln Thr Leu Glu Leu Ser Phe Lys Glu Ala Val
115 120 125

Phe Gly Cys Lys Lys Thr Ile Lys Val Gln Tyr Gln Ser Val Cys Glu
130 135 140

Ser Cys Asp Gly Thr Gly Ala Lys Asp Lys Ala Leu Glu Thr Cys Lys
145 150 155 160

Gln Cys Asn Gly Gln Gly Gln Val Phe Met Arg Gln Gly Phe Met Ser
165 170 175

Phe Ala Gln Thr Cys Gly Ala Cys Gln Gly Lys Gly Lys Ile Val Lys
180 185 190

Thr Pro Cys Gln Ala Cys Lys Gly Lys Thr Tyr Ile Leu Lys Asp Glu
195 200 205

Glu Ile Asp Ala Ile Ile Pro Glu Gly Ile Asp Asp Gln Asn Arg Met
210 215 220

Val Leu Lys Asn Lys Gly Asn Glu Tyr Glu Lys Gly Lys Arg Gly Asp
225 230 235 240

Leu Tyr Leu Glu Ala Gln Val Lys Glu Asp Glu His Phe Lys Arg Glu
245 250 255

Gly Cys Asp Leu Phe Ile Lys Ala Pro Val Phe Phe Thr Thr Ile Ala
260 265 270

Leu Gly His Thr Ile Lys Val Pro Ser Leu Lys Gly Asp Glu Leu Glu
275 280 285
Ile Lys Val Gln Tyr Gln Ser Val Cys Glu Ser Cys Asp Gly Thr Gly
135 140 145 150

GCT AAA GAC AAA GCC CTA GAG ACT TGC AAG CAA TGC AAT GGG CAG GGG 533
Ala Lys Asp Lys Ala Leu Glu Thr Cys Lys Gln Cys Asn Gly Gln Gly
155 160 165

CAG GTG TTT ATG CGT CAA GGT TTT ATG AGT TTT GCG CAA ACT TGT GGG 581
Gln Val Phe Met Arg Gln Gly Phe Met Ser Phe Ala Gln Thr Cys Gly
170 175 180

GCG TGT CAA GGC AAG GGC AAG ATC GTT AAA ACC CCA TGC CAA GCG TGC 629

Ala Cys Gln Gly Lys Gly Lys Ile Val Lys Thr Pro Cys Gln Ala Cys
185 190 195

AAG GGT AAA ACC TAT ATC CTT AAA GAT GAA GAA ATT GAT GCG ATA ATC 677
Lys Gly Lys Thr Tyr Ile Leu Lys Asp Glu Glu Ile Asp Ala Ile Ile
200 205 210

CCT GAG GGC ATT GAT GAT CAA AAC CGC ATG GTG CTT AAA AAT AAA GGC 725

Pro Glu Gly Ile Asp Asp Gln Asn Arg Met Val Leu Lys Asn Lys Gly
215 220 225 230

AAT GAA TAC GAG AAG GGA AAA AGA GGG GAT TTG TAT TTA GAA GCG CAA 773
Asn Glu Tyr Glu Lys Gly Lys Arg Gly Asp Leu Tyr Leu Glu Ala Gln
235 240 245

GTC AAA GAA GAT GAG CAT TTC AAG CGC GAA GGC TGC GAT TTA TTC ATT 821
Val Lys Glu Asp Glu His Phe Lys Arg Glu Gly Cys Asp Leu Phe Ile
250 255 260

AAA GCG CCG GTG TTT TTC ACC ACT ATC GCT TTA GGG CAT ACG ATT AAA 869
Lys Ala Pro Val Phe Phe Thr Thr Ile Ala Leu Gly His Thr Ile Lys
265 270 275

GTG CCG TCT TTA AAA GGG GAC GAA CTG GAA TTA AAA ATC CCT AGA AAC 917
Val Pro Ser Leu Lys Gly Asp Glu Leu Glu Leu Lys Ile Pro Arg Asn
280 285 290

GCC AGA GAC AAG CAG ACT TTT GCG TTT AGA AAC GAG GGC GTG AAA CAC 965
Ala Arg Asp Lys Gln Thr Phe Ala Phe Arg Asn Glu Gly Val Lys His
295 300 305 310

CCT GAA AGC TCT TAT AGA GGG AGT TTG ATC GTG GAA TTG CAA GTG ATT 1013
Pro Glu Ser Ser Tyr Arg Gly Ser Leu Ile Val Glu Leu Gln Val Ile
315 320 325

TAC CCT AAA AGT TTG AAT AAA GAG CAG CAA GAA TTG TTG GAA AAA TTG 1061

Tyr Pro Lys Ser Leu Asn Lys Glu Gln Gln Glu Leu Leu Glu Lys Leu
330 335 340

CAT GCG AGT TTT GGC TAT GAG GGC GAG CCG CAT AAA AGC GTT TTA GAA 1109

His Ala Ser Phe Gly Tyr Glu Gly Glu Pro His Lys Ser Val Leu Glu 
345 350 355 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1204 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 36...1142
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549:

AAAATAAGCG TTTTGATGCC ATTTTTGGAG CGATC GTG GAA TTG AGT TAT TAT 53 

Val Glu Leu Ser Tyr Tyr 
1 5 

GAA ATT TTA GAA GTG GAA AAA CAC AGC AAC CAA GAG ACC ATT AAA AAG 101 
Glu Ile Leu Glu Val Glu Lys His Ser Asn Gln Glu Thr Ile Lys Lys
10 15 20

TCT TAC AGA AAG CTG GCT TTA AAA TAC CAC CCA GAC AGA AAC GCC GGC 149

Ser Tyr Arg Lys Leu Ala Leu Lys Tyr His Pro Asp Arg Asn Ala Gly
25 30 35

GAT AAA GAA GCC GAA GAA AAA TTC AAG CTC ATC AAT GAA GCC TAT GGG 197
Asp Lys Glu Ala Glu Glu Lys Phe Lys Leu Ile Asn Glu Ala Tyr Gly
40 45 50

GTG TTA AGC GAT GAA AAG AAG CGG GCC TTA TAC GAC AGG TAT GGT AAA 245

Val Leu Ser Asp Glu Lys Lys Arg Ala Leu Tyr Asp Arg Tyr Gly Lys
55 60 65 70

AAA GGC TTA AAC CAA GCC GGC GCA AGC CAA GGC GAT TTT TCT GAT TTT 293

Lys Gly Leu Asn Gln Ala Gly Ala Ser Gln Gly Asp Phe Ser Asp Phe
75 80 85

TTT GAA GAT TTA GGC TCG TTT TTT GAA GAC GCT TTT GGG TTT GGC GCT 341
Phe Glu Asp Leu Gly Ser Phe Phe Glu Asp Ala Phe Gly Phe Gly Ala
90 95 100

AGG GGG AGT AAA AGG CAA AAA AGC TCT ATC GCA CCG GAT TAT TTG CAA 389

Arg Gly Ser Lys Arg Gln Lys Ser Ser Ile Ala Pro Asp Tyr Leu Gln
105 110 115

ACC CTT GAA TTG AGT TTC AAA GAA GCG GTT TTT GGC TGT AAA AAA ACC 437

Thr Leu Glu Leu Ser Phe Lys Glu Ala Val Phe Gly Cys Lys Lys Thr
120 125 130

ATT AAA GTC CAA TAC CAG AGC GTT TGT GAA AGT TGC GAT GGC ACG GGC 485
35 40 45 50

TTG ACG ACA GCG GCT CGT GTG GGC GAT TTT AAT GCG CAC CGC CAT GTT 248

Leu Thr Thr Ala Ala Arg Val Gly Asp Phe Asn Ala His Arg His Val
55 60 65

TTT GCA TAT TTG CAA AAC AAA GAA GCC ACC CAC AAG CTT GTA ACT GAA 296

Phe Ala Tyr Leu Gln Asn Lys Glu Ala Thr His Lys Leu Val Thr Glu
70 75 80

ATC GCG CCC AAA TAC GCG CAA AGG AAT GGC GGA TAC ACC AGG ATC CAA 344
Ile Ala Pro Lys Tyr Ala Gln Arg Asn Gly Gly Tyr Thr Arg Ile Gln
85 90 95

CGC ACC ACT TTT AGA AGA GGG GAC GCT TCC ACT CTA GCC ACC ATT GAA 392

Arg Thr Thr Phe Arg Arg Gly Asp Ala Ser Thr Leu Ala Thr Ile Glu
100 105 110

TTT GTA TGAAATTTGA TGAACTGCTA GCCAAGATTT AGTCTTGGTT GGTGGTTATC GC 450
Phe Val
115

T 451
(2) INFORMATION FOR SEQ ID NO: 548:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 116 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548:

Met Arg His Lys His Gly Tyr Arg Lys Leu Gly Arg Thr Ser Ser His
1 5 10 15

Arg Lys Ala Leu Leu Lys Asn Leu Ala Ile Ala Leu Ile Glu His Asn
20 25 30

Lys Ile Glu Thr Gly Ile Tyr Lys Ala Lys Glu Leu Arg Ser Tyr Ile
35 40 45

Glu Lys Leu Thr Thr Ala Ala Arg Val Gly Asp Phe Asn Ala His Arg
50 55 60

His Val Phe Ala Tyr Leu Gln Asn Lys Glu Ala Thr His Lys Leu Val
65 70 75 80

Thr Glu Ile Ala Pro Lys Tyr Ala Gln Arg Asn Gly Gly Tyr Thr Arg
85 90 95

Ile Gln Arg Thr Thr Phe Arg Arg Gly Asp Ala Ser Thr Leu Ala Thr
100 105 110

Ile Glu Phe Val
115 

(2) INFORMATION FOR SEQ ID NO: 549: 
Leu Ser Tyr Ala Tyr Val Ala Tyr Gln Trp Lys Leu Asn Ala Asp Val
65 70 75 80

Ile Leu Cys Leu Phe Leu Tyr Met Pro Val Thr Ile Tyr Gly Leu Phe
85 90 95

Ala Trp Lys Lys Thr Glu Gln His Glu Gly Val Ile Lys Ala Gln Lys
100 105 110

Leu Ser Lys Asn Trp Arg Phe Ile Leu Ile Leu Gly Val Gly Val Leu
115 120 125

Thr Cys Val Ser Ala Leu Phe Phe Lys Glu Ile Lys Thr Asn Phe Leu
130 135 140

Trp Ala Glu Ser Phe Asn Phe Val Ile Phe Ile Ile Ala Phe Ile Leu
145 150 155 160

Gln Val Leu Arg Tyr Ile Glu Asn Tyr Ala Leu Val Thr Leu Gly Asn
165 170 175

Ile Val Ser Ile Ile Val Trp Phe Cys Ile Phe Gln Ile Ser Thr Glu
180 185 190

Ser Leu Val Gln Leu Phe Thr Thr Ile Leu Tyr Leu Phe Ile Gly Leu
195 200 205

Tyr Tyr Phe Asn Arg Trp Asn Lys Ser Cys Lys Gln
210 215 220 



(2) INFORMATION FOR SEQ ID NO: 547:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 451 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...398
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547:



AGTTTAAAGA AAAGATTAGA AAAATTAGAA GATAAAGGAG GTAACGACTG ATG AGA 56 

Met Arg 
1 



CAC AAA CAC GGA TAC CGC AAG CTT GGG AGA ACC AGC TCG CAC AGA AAG 104
His Lys His Gly Tyr Arg Lys Leu Gly Arg Thr Ser Ser His Arg Lys
5 10 15

GCG TTA TTA AAG AAT TTA GCG ATC GCT TTG ATT GAG CAT AAC AAA ATT 152
Ala Leu Leu Lys Asn Leu Ala Ile Ala Leu Ile Glu His Asn Lys Ile
20 25 30

GAA ACA GGG ATT TAT AAG GCT AAG GAA TTG CGC AGT TAC ATT GAG AAA 200

Glu Thr Gly Ile Tyr Lys Ala Lys Glu Leu Arg Ser Tyr Ile Glu Lys
Lys Lys Thr Glu Gln His Glu Gly Val Ile Lys Ala Gln Lys Leu Ser
100 105 110

AAA AAT TGG CGT TTT ATA CTC ATT TTA GGC GTA GGG GTT TTA ACT TGT 442

Lys Asn Trp Arg Phe Ile Leu Ile Leu Gly Val Gly Val Leu Thr Cys
115 120 125 130

GTG AGC GCT TTG TTT TTT AAA GAG ATT AAA ACG AAT TTT TTA TGG GCA 490

Val Ser Ala Leu Phe Phe Lys Glu Ile Lys Thr Asn Phe Leu Trp Ala
135 140 145

GAG AGT TTT AAT TTC GTC ATC TTT ATT ATT GCT TTT ATT TTA CAG GTT 538

Glu Ser Phe Asn Phe Val Ile Phe Ile Ile Ala Phe Ile Leu Gln Val
150 155 160

TTG CGC TAT ATA GAA AAT TAT GCG CTA GTA ACT TTG GGG AAT ATC GTA 586

Leu Arg Tyr Ile Glu Asn Tyr Ala Leu Val Thr Leu Gly Asn Ile Val
165 170 175

TCC ATT ATC GTG TGG TTT TGT ATT TTT CAA ATT TCT ACA GAG AGC TTG 634

Ser Ile Ile Val Trp Phe Cys Ile Phe Gln Ile Ser Thr Glu Ser Leu
180 185 190

GTG CAA CTC TTC ACA ACG ATC CTA TAC CTT TTT ATT GGC TTG TAT TAT 682
Val Gln Leu Phe Thr Thr Ile Leu Tyr Leu Phe Ile Gly Leu Tyr Tyr
195 200 205 210

TTT AAC CGG TGG AAT AAG TCA TGC AAG CAG TGATTTTAGC GAATGGGGAG TTT 735
Phe Asn Arg Trp Asn Lys Ser Cys Lys Gln
215 220

CCTAAATCTC AAAAATGCTT AGACCTTTTA AAAAACGCTC CCTTTTTAAT CGCATGCGAT 795

GGGGCTGTTA CCTCA 810 

(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:546:

Met Leu Ile Thr Thr Gln Leu Ser Lys Arg Phe Tyr Ala Thr Leu Ala
1 5 10 15

Leu Ser Cys Val Phe Leu Thr Ile Thr Asn Ile Leu Val Lys Gly Ser
20 25 30

Phe Ile Asn Leu Leu Ala Gly Leu Ser Gly Val Leu Tyr Ala Phe Phe
35 40 45

Ala Gly Glu Arg Gln Thr Ile Cys Phe Val Phe Gly Leu Val Tyr Asn
50 55 60 
130 135 140

Leu Val Cys Val Phe Gly Ile Cys Ser Arg Leu Cys Ala Lys Leu Gly
145 150 155 160 

Trp 



(2) INFORMATION FOR SEQ ID NO: 545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 53...712
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 

TGAGATCAAA CCCGTAGAAC TTGTCAAGGT AATTCTTGCG TAAGGAAATA GC ATG TTA 58 

Met Leu 
1 

ATA ACC ACC CAA CTA TCC AAA CGA TTT TAC GCC ACA CTC GCT CTT TCT 106
Ile Thr Thr Gln Leu Ser Lys Arg Phe Tyr Ala Thr Leu Ala Leu Ser
5 10 15

TGC GTG TTT TTA ACC ATC ACT AAC ATT CTT GTC AAA GGC TCG TTT ATC 154
Cys Val Phe Leu Thr Ile Thr Asn Ile Leu Val Lys Gly Ser Phe Ile
20 25 30

AAT CTT TTA GCA GGG CTT AGT GGG GTT TTG TAT GCG TTT TTT GCC GGA 202

Asn Leu Leu Ala Gly Leu Ser Gly Val Leu Tyr Ala Phe Phe Ala Gly
35 40 45 50

GAA AGG CAA ACG ATT TGC TTT GTG TTT GGT CTT GTT TAT AAT TTG AGT 250
Glu Arg Gln Thr Ile Cys Phe Val Phe Gly Leu Val Tyr Asn Leu Ser
55 60 65

TAC GCT TAT GTC GCT TAT CAG TGG AAA TTA AAC GCT GAT GTG ATT TTA 298

Tyr Ala Tyr Val Ala Tyr Gln Trp Lys Leu Asn Ala Asp Val Ile Leu
70 75 80

TGC CTT TTT TTG TAT ATG CCA GTA ACG ATT TAT GGG CTG TTC GCA TGG 346

Cys Leu Phe Leu Tyr Met Pro Val Thr Ile Tyr Gly Leu Phe Ala Trp
85 90 95

AAA AAG ACA GAG CAG CAT GAA GGC GTT ATC AAG GCT CAA AAA CTT TCC 394
Cys Tyr Lys Ser Phe Gln Val Arg Tyr Pro Arg Phe Leu Lys Ala Ile
75 80 85

TTT AAG GCT AGG TTT TTT GGC GCG TTT TAT AAC GCT GTG TTA GGA TCA 342
Phe Lys Ala Arg Phe Phe Gly Ala Phe Tyr Asn Ala Val Leu Gly Ser
90 95 100

AGG CTA AGG GAT TTT TAT GTG ATG CTT TTA ACG ATA CCC TTT ATT GCC 390

Arg Leu Arg Asp Phe Tyr Val Met Leu Leu Thr Ile Pro Phe Ile Ala
105 110 115 120

GCT ATC CAT GAG GTT TCG GCG TAT TAC GGG CAT CCT AGC AAC TTC CTT 438

Ala Ile His Glu Val Ser Ala Tyr Tyr Gly His Pro Ser Asn Phe Leu
125 130 135

ATA GAG GGT TTG GTC ATT CTT GGC CTT GTG TGT GTT TTT GGG ATT TGT 486

Ile Glu Gly Leu Val Ile Leu Gly Leu Val Cys Val Phe Gly Ile Cys
140 145 150 

TCT AGG CTT TGC GCT AAA TTA GGG TGG TGATTTAACT CAAATAGCAT TAAATGG 540
Ser Arg Leu Cys Ala Lys Leu Gly Trp 
155 160 

AGGGGGGAGT AAAAAATTA 559 

(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 161 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544:

Met Asn Val Lys Asn Arg Leu Ser Asp Trp Glu Tyr Gln Trp Ala Val
1 5 10 15

Ala Leu Val Tyr Thr Ile Cys Ile Ser Ile Asn Ala Arg Ile Phe Tyr
20 25 30

Asp Ile Asp Gly Ser Ala Ser Asp Ser Ile Phe Asp Pro Lys Asn Ser
35 40 45

Tyr Tyr Met Trp Leu Val Gly Leu Ile Ala Ala Leu Leu Ser Asn Leu
50 55 60

Leu Phe Asp Pro Arg Gly Arg Asp Cys Tyr Lys Ser Phe Gln Val Arg
65 70 75 80

Tyr Pro Arg Phe Leu Lys Ala Ile Phe Lys Ala Arg Phe Phe Gly Ala
85 90 95

Phe Tyr Asn Ala Val Leu Gly Ser Arg Leu Arg Asp Phe Tyr Val Met
100 105 110

Leu Leu Thr Ile Pro Phe Ile Ala Ala Ile His Glu Val Ser Ala Tyr
115 120 125

Tyr Gly His Pro Ser Asn Phe Leu Ile Glu Gly Leu Val Ile Leu Gly
290 295 300

Pro Gly Val Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala
305 310 315 320

Val Tyr Glu Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu
325 330 335

Leu Cys Gln Lys Glu Gly Ile Ile Pro Ala Leu Glu Ser Ser His Ala
340 345 350

Leu Ala Tyr Ala Leu Lys Leu Ala Gln Lys Cys Glu Glu Glu Ser Ile
355 360 365

Ile Val Val Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val
370 375 380

Tyr Asn Ala Leu Lys Gly Gly Leu Lys
385 390

(2) INFORMATION FOR SEQ ID NO: 543:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 559 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...513
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543:

CAGGTAGCTT TGGGCGAGAA AGGAGAGAGC ATG AAT GTC AAA AAT CGT TTG AGC 54 

Met Asn Val Lys Asn Arg Leu Ser 
1 5 

GAT TGG GAA TAT CAA TGG GCA GTG GCT CTA GTC TAT ACQ ATA TGT ATC 102 
Asp Trp Glu Tyr Gln Trp Ala Val Ala Leu Val Tyr Thr Ile Cys Ile
10 15 20

TCC ATA AAC GCT AGG ATT TTT TAT GAC ATA GAT GGT TCA GCT AGC GAT 150
Ser Ile Asn Ala Arg Ile Phe Tyr Asp Ile Asp Gly Ser Ala Ser Asp
25 30 35 40

TCG ATT TTT GAC CCT AAA AAT AGC TAT TAT ATG TGG CTA GTG GGT CTA 198
Ser Ile Phe Asp Pro Lys Asn Ser Tyr Tyr Met Trp Leu Val Gly Leu
45 50 55

ATA GCG GCT TTG TTG TCT AAC CTT TTA TTT GAC CCA CGA GGT AGG GAT 246

Ile Ala Ala Leu Leu Ser Asn Leu Leu Phe Asp Pro Arg Gly Arg Asp
60 65 70

TGT TAT AAA TCT TTC CAA GTA AGA TAC CCT AGG TTT CTC AAA GCC ATT 294
Leu Lys Gly Gly Leu Lys
390

GAAAAAATGG CGTTTA 1280

(2) INFORMATION FOR SEQ ID NO: 542:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 393 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542:



Met Asn Lys Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser
1 5 10 15

Glu Leu Leu Val Pro Ala Leu Arg Glu Leu Glu Gln Ala Phe Asp Ala
20 25 30

Cys Leu Lys Asp Glu Lys Phe Gln Lys Glu Tyr Phe Arg Leu Leu Lys
35 40 45

Asp Phe Val Gly Arg Pro Ser Pro Leu Thr Leu Cys Gln Asn Ile Val
50 55 60

Ser Asn Pro Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu Ile His
65 70 75 80

Gly Gly Ala His Lys Thr Asn Gln Ala Leu Gly Gln Ala Leu Leu Ala
85 90 95

Lys Lys Met Gly Lys Thr Arg Ile Ile Ala Glu Thr Gly Ala Gly Gln
100 105 110

His Gly Val Ala Thr Ala Ile Ala Cys Ala Leu Leu Asn Leu Lys Cys
115 120 125

Val Val Phe Met Gly Ser Lys Asp Ile Lys Arg Gln Glu Met Asn Val
130 135 140

Phe Arg Met His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly
145 150 155 160

Ser Ala Thr Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala
165 170 175

Ser Ser Tyr Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro
180 185 190

His Pro Tyr Pro Thr Met Val Lys Thr Phe Gln Lys Met Ile Gly Asp
195 200 205

Glu Val Lys Ser Gln Ile Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr
210 215 220

Val Ile Ala Cys Val Gly Gly Gly Ser Asn Ala Ile Gly Ile Phe Ser
225 230 235 240

Ala Phe Leu Asn Asp Lys Glu Val Lys Leu Ile Gly Val Glu Pro Ala
245 250 255

Gly Leu Gly Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly
260 265 270

Arg Val Gly Ile Leu His Gly Asn Lys Thr Tyr Leu Leu Gln Asp Asp
275 280 285

Glu Gly Gln Ile Ala Glu Ser His Ser Ile Ser Ala Gly Leu Asp Tyr
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165 170 175 

AAG GAC ACG CAT TAT TTG CTA GGC ACA GCC GCC GGG CCA CAC CCT TAC 632 
Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro His Pro Tyr 
180 185 190 195 

CCC ACA ATG GTT AAA ACC TTT CAA AAA ATG ATA GGC GAT GAG GTT AAA 680 
Pro Thr Met Val Lys Thr Phe Gln Lys Met Ile Gly Asp Glu Val Lys 
200 205 210 

AGC CAG ATT TTA GAA AAA GAA AAC CGC TTG CCT GAT TAT GTG ATC GCA 728 
Ser Gln Ile Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr Val Ile Ala 
215 220 225 

TGC GTT GGA GGG GGG TCT AAC GCT ATA GGG ATA TTC AGC GCA TTT TTA 776 
Cys Val Gly Gly Gly Ser Asn Ala Ile Gly Ile Phe Ser Ala Phe Leu 
230 235 240 

AAC GAC AAA GAA GTT AAA CTC ATA GGC GTA GAG CCG GCG GGT TTA GGG 824 
Asn Asp Lys Glu Val Lys Leu Ile Gly Val Glu Pro Ala Gly Leu Gly 
245 250 255 

CTA GAA ACC AAT AAG CAT GGG GCG ACT TTG AAT AAG GGG CGT GTG GGG 872 
Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly Arg Val Gly 
260 265 270 275 

ATT TTG CAT GGG AAT AAA ACC TAT CTT TTA CAA GAT GAT GAA GGC CAG 920 
Ile Leu His Gly Asn Lys Thr Tyr Leu Leu Gln Asp Asp Glu Gly Gln 
280 285 290 

ATT GCA GAA AGC CAT AGC ATT AGC GCC GGG CTT GAT TAT CCA GGA GTG 968 
Ile Ala Glu Ser His Ser Ile Ser Ala Gly Leu Asp Tyr Pro Gly Val 
295 300 305 

GGG CCA GAA CAC AGC TAT TTA AAA GAA AGT GGG CGT GCG GTT TAT GAA 1016 
Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala Val Tyr Glu 
310 315 320 

AGC GCA AGC GAT GCT GAA GCG CTA GAA GCC TTC AAG TTG TTG TGC CAA 1064 
Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu Leu Cys Gln 
325 330 335 

AAA GAA GGC ATT ATC CCA GCG CTA GAA AGC TCA CAC GCC TTA GCG TAT 1112 
Lys Glu Gly Ile Ile Pro Ala Leu Glu Ser Ser His Ala Leu Ala Tyr 
340 345 350 355 

GCC TTA AAG CTC GCT CAA AAA TGC GAA GAA GAA AGC ATC ATC GTA GTG 1160 
Ala Leu Lys Leu Ala Gln Lys Cys Glu Glu Glu Ser Ile Ile Val Val 
360 365 370 

AAT TTA AGC GGC AGA GGG GAT AAG GAT TTA AGC ACC GTT TAT AAC GCT 1208 
Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val Tyr Asn Ala 
375 380 385 

TTA AAA GGA GGT TTA AAA TGAGGTATCA AAACATGTTT GAAACCTTAA AAAAACAC 1264 
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 48...1226 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 

TAAGGATAAA ATCAAGCGAT TAGCCCGAAT TTTAAGAGAG TATTAAG ATG AAT AAA 56 
Met Asn Lys 
1 

AAA GCG TAT TTT GGG GAG TTT GGA GGG AGT TTT GTT TCG GAG TTG TTA 104 
Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser Glu Leu Leu 
5 10 15 

GTG CCT GCA TTA AGA GAA TTA GAA CAG GCG TTT GAT GCG TGT TTG AAA 152 
Val Pro Ala Leu Arg Glu Leu Glu Gln Ala Phe Asp Ala Cys Leu Lys 
20 25 30 35 

GAT GAA AAA TTC CAA AAA GAA TAT TTT CGT CTT TTA AAG GAT TTT GTG 200 
Asp Glu Lys Phe Gln Lys Glu Tyr Phe Arg Leu Leu Lys Asp Phe Val 
40 45 50 

GGC CGT CCT AGC CCT TTA ACC TTG TGT CAA AAT ATC GTT TCT AAC CCT 248 
Gly Arg Pro Ser Pro Leu Thr Leu Cys Gln Asn Ile Val Ser Asn Pro 
55 60 65 

AAA GTC AAG CTT TAT TTA AAA CGA GAG GAT TTA ATC CAT GGC GGG GCG 296 
Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu Ile His Gly Gly Ala 
70 75 80 

CAT AAG ACT AAT CAA GCC TTA GGG CAA GCC CTT TTA GCG AAA AAA ATG 344 
His Lys Thr Asn Gln Ala Leu Gly Gln Ala Leu Leu Ala Lys Lys Met 
85 90 95 

GGT AAA ACA AGG ATC ATC GCT GAA ACA GGC GCC GGT CAG CAT GGC GTG 392 
Gly Lys Thr Arg Ile Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val 
100 105 110 115 

GCG ACG GCT ATC GCT TGC GCA TTA TTG AAC TTA AAA TGC GTG GTT TTT 440 
Ala Thr Ala Ile Ala Cys Ala Leu Leu Asn Leu Lys Cys Val Val Phe 
120 125 130 

ATG GGA TCT AAA GAC ATC AAG CGC CAG GAA ATG AAT GTT TTT AGA ATG 488 
Met Gly Ser Lys Asp Ile Lys Arg Gln Glu Met Asn Val Phe Arg Met 
135 140 145 

CAC TTA TTA GGC GCT GAA GTG AGA GAG GTT AAT TCA GGG AGC GCG ACG 536 
His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly Ser Ala Thr 
150 155 160 

CTT AAA GAC GCT GTG AAT GAA GCC TTA AGA GAT TGG GCG AGC AGT TAC 584 
Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala Ser Ser Tyr 
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Met Ser Ala Tyr Ile Ile Glu Thr Leu Ile Lys Ile Leu Ile Leu Val 
1 5 10 15 

Ala Val Phe Ser Ala Leu Gly Gly Phe Ala Thr Tyr Ile Glu Arg Lys 
20 25 30 

Val Leu Ala Tyr Phe Gln Arg Arg Leu Gly Pro Cys Tyr Val Gly Pro 
35 40 45 

Phe Gly Leu Leu Gln Val Ala Ala Asp Gly Ile Lys Leu Phe Thr Lys 
50 55 60 

Glu Asp Ile Ile Pro Gln Gly Ala Asn Lys Phe Ile Phe Thr Leu Ala 
65 70 75 80 

Pro Ile Ile Ala Met Val Ser Ala Phe Val Ser Met Ala Pro Ile Pro 
85 90 95 

Phe Phe Pro Asn Phe Thr Leu Phe Gly Tyr Glu Ile Lys Pro Leu Ile 
100 105 110 

Ser Asp Ile Asn Ile Gly Phe Leu Phe Phe Leu Ala Val Gly Ser Ala 
115 120 125 

Gly Ile Tyr Ala Pro Ile Leu Ala Gly Leu Ala Ser Asn Asn Lys Tyr 
130 135 140 

Ser Leu Ile Gly Ser Ala Arg Ala Thr Ile Gln Leu Leu Ser Phe Glu 
145 150 155 160 

Val Val Ser Thr Leu Thr Ile Leu Ala Pro Leu Met Val Val Gly Ser 
165 170 175 

Leu Ser Leu Val Glu Ile Asn His Tyr Gln Ser Gly Gly Phe Leu Asp 
180 185 190 

Trp Leu Val Phe Lys Gln Pro Leu Ala Phe Val Leu Phe Leu Ile Ala 
195 200 205 

Ser Tyr Ala Glu Leu Asn Arg Thr Pro Phe Asp Leu Leu Glu His Glu 
210 215 220 

Ala Glu Ile Val Ala Gly Tyr Cys Thr Glu Tyr Ser Gly Leu Lys Trp 
225 230 235 240 

Gly Met Phe Phe Leu Ala Glu Tyr Ala His Leu Phe Ala Phe Ser Phe 
245 250 255 

Val Ile Ser Ile Val Phe Phe Gly Gly Phe Asn Ala Trp Gly Phe Ile 
260 265 270 

Pro Gly Gly Ile Ala Ile Leu Ile Lys Ala Gly Phe Phe Val Phe Leu 
275 280 285 

Ser Met Trp Val Arg Ala Thr Tyr Pro His Val Arg Pro Asp Gln Leu 
290 295 300 

Met Asp Met Cys Trp Lys Ile Met Leu Pro Leu Ala Leu Leu Asn Ile 
305 310 315 320 

Val Leu Thr Gly Ile Ile Ile Leu Ile 
325 

(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
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165 170 175 

GTG GAA ATC AAT CAT TAC CAA AGC GGT GGG TTT TTA GAC TGG CTT GTG 631 
Val Glu Ile Asn His Tyr Gln Ser Gly Gly Phe Leu Asp Trp Leu Val 
180 185 190 195 

TTT AAG CAG CCT CTA GCG TTT GTT TTG TTT TTG ATC GCA AGT TAT GCC 679 
Phe Lys Gln Pro Leu Ala Phe Val Leu Phe Leu Ile Ala Ser Tyr Ala 
200 205 210 

GAA TTG AAT CGA ACC CCC TTT GAC TTG CTA GAG CAT GAA GCC GAG ATC 727 
Glu Leu Asn Arg Thr Pro Phe Asp Leu Leu Glu His Glu Ala Glu Ile 
215 220 225 

GTG GCG GGG TAT TGC ACC GAA TAC AGC GGC TTG AAA TGG GGC ATG TTC 775 
Val Ala Gly Tyr Cys Thr Glu Tyr Ser Gly Leu Lys Trp Gly Met Phe 
230 235 240 

TTT TTA GCG GAA TAC GCG CAT TTA TTC GCT TTT TCT TTT GTG ATT TCT 823 
Phe Leu Ala Glu Tyr Ala His Leu Phe Ala Phe Ser Phe Val Ile Ser 
245 250 255 

ATT GTG TTT TTT GGC GGG TTT AAC GCA TGG GGC TTT ATC CCT GGA GGC 871 
Ile Val Phe Phe Gly Gly Phe Asn Ala Trp Gly Phe Ile Pro Gly Gly 
260 265 270 275 

ATA GCG ATT TTG ATT AAA GCG GGC TTT TTT GTC TTT TTA TCC ATG TGG 919 
Ile Ala Ile Leu Ile Lys Ala Gly Phe Phe Val Phe Leu Ser Met Trp 
280 285 290 

GTT AGA GCG ACT TAT CCG CAT GTG CGC CCA GAC CAA CTG ATG GAT ATG 967 
Val Arg Ala Thr Tyr Pro His Val Arg Pro Asp Gln Leu Met Asp Met 
295 300 305 

TGC TGG AAA ATC ATG CTG CCT TTA GCG TTA TTG AAC ATT GTG CTA ACG 1015 
Cys Trp Lys Ile Met Leu Pro Leu Ala Leu Leu Asn Ile Val Leu Thr 
310 315 320 

GGC ATT ATC ATT TTA ATT TAAAGGAGGT TTTATGGCCA AACAAGAATA CAAGCAAC 1071 
Gly Ile Ile Ile Leu Ile 
325 

TTCCTAAAC 1080 

(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 47...1033 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 

TTTTTAGAGA GGGCGTGTTT GATAGCGTGG ATTTAAAGGA GCAAGC ATG AGC GCT 55 

Met Ser Ala 
1 

TAT ATC ATT GAA ACC CTG ATT AAA ATT TTG ATT TTA GTC GCT GTT TTT 103 
Tyr Ile Ile Glu Thr Leu Ile Lys Ile Leu Ile Leu Val Ala Val Phe 
5 10 15 

TCG GCT TTA GGA GGC TTT GCC ACT TAT ATT GAA AGG AAA GTG TTA GCC 151 
Ser Ala Leu Gly Gly Phe Ala Thr Tyr Ile Glu Arg Lys Val Leu Ala 
20 25 30 35 

TAT TTC CAA CGC CGT TTA GGG CCT TGT TAT GTG GGG CCT TTT GGG CTT 199 
Tyr Phe Gln Arg Arg Leu Gly Pro Cys Tyr Val Gly Pro Phe Gly Leu 
40 45 50 

TTG CAA GTC GCA GCA GAC GGC ATT AAG CTT TTC ACT AAA GAA GAC ATT 247 
Leu Gln Val Ala Ala Asp Gly Ile Lys Leu Phe Thr Lys Glu Asp Ile 
55 60 65 

ATC CCT CAA GGC GCG AAC AAA TTC ATT TTC ACG CTA GCG CCC ATT ATT 295 
Ile Pro Gln Gly Ala Asn Lys Phe Ile Phe Thr Leu Ala Pro Ile Ile 
70 75 80 

GCG ATG GTG AGT GCG TTT GTG TCC ATG GCG CCT ATC CCC TTT TTC CCT 343 
Ala Met Val Ser Ala Phe Val Ser Met Ala Pro Ile Pro Phe Phe Pro 
85 90 95 

AAT TTC ACT CTG TTT GGC TAT GAG ATC AAG CCC CTT ATT TCT GAC ATC 391 
Asn Phe Thr Leu Phe Gly Tyr Glu Ile Lys Pro Leu Ile Ser Asp Ile 
100 105 110 115 

AAC ATT GGC TTT TTG TTT TTC TTA GCC GTG GGT TCG GCA GGG ATT TAT 439 
Asn Ile Gly Phe Leu Phe Phe Leu Ala Val Gly Ser Ala Gly Ile Tyr 
120 125 130 

GCG CCT ATT TTA GCC GGG CTT GCC TCT AAT AAC AAA TAC TCT TTA ATT 487 
Ala Pro Ile Leu Ala Gly Leu Ala Ser Asn Asn Lys Tyr Ser Leu Ile 
135 140 145 

GGC TCC GCA AGA GCG ACG ATC CAA CTG CTC AGC TTT GAA GTG GTC AGC 535 
Gly Ser Ala Arg Ala Thr Ile Gln Leu Leu Ser Phe Glu Val Val Ser 
150 155 160 

ACT TTA ACC ATT CTA GCC CCC TTA ATG GTG GTA GGA TCG CTC TCT TTA 583 
Thr Leu Thr Ile Leu Ala Pro Leu Met Val Val Gly Ser Leu Ser Leu 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 



Met Lys Leu Lys Ser Phe Gly Val Phe Gly Asn Pro Ile Lys His Ser 
1 5 10 15 

Lys Ser Pro Leu Ile His Asn Ala Cys Phe Leu Thr Phe Gln Lys Glu 
20 25 30 

Leu Arg Phe Leu Gly His Tyr His Pro Ile Leu Leu Pro Leu Glu Ser 
35 40 45 

His Ile Lys Ser Glu Phe Leu His Leu Gly Leu Ser Gly Ala Asn Val 
50 55 60 

Thr Leu Pro Phe Lys Glu Arg Ala Phe Gln Val Cys Asp Lys Ile Lys 
65 70 75 80 

Gly Ile Ala Leu Glu Cys Gly Ala Val Asn Thr Leu Val Leu Glu Asn 
85 90 95 

Asp Glu Leu Val Gly Tyr Asn Thr Asp Ala Leu Gly Phe Tyr Leu Ser 
100 105 110 

Leu Lys Gln Lys Asn Tyr Gln Asn Ala Leu Ile Leu Gly Ala Gly Gly 
115 120 125 

Ser Ala Lys Ala Leu Ala Cys Glu Leu Lys Lys Gln Gly Leu Gln Val 
130 135 140 

Ser Val Leu Asn Arg Ser Ser Arg Gly Leu Asp Phe Phe Gln Arg Leu 
145 150 155 160 

Gly Cys Asp Cys Phe Met Glu Pro Pro Lys Ser Ala Phe Asp Leu Ile 
165 170 175 

Ile Asn Ala Thr Ser Ala Ser Leu His Asn Glu Leu Pro Leu Asn Lys 
180 185 190 

Glu Val Leu Lys Gly Tyr Phe Lys Glu Gly Lys Leu Ala Tyr Asp Leu 
195 200 205 

Ala Tyr Gly Phe Leu Thr Pro Phe Leu Ser Leu Ala Lys Glu Leu Lys 
210 215 220 

Thr Pro Phe Gln Asp Gly Lys Asp Met Leu Ile Tyr Gln Ala Ala Leu 
225 230 235 240 

Ser Phe Glu Lys Phe Ser Ala Ser Gln Ile Pro Tyr Ser Lys Ala Phe 
245 250 255 

Glu Val Met Arg Ser Val Phe 
260 

(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
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TTT CAA GTT TGC GAT AAA ATC AAA GGT ATC GCG CTT GAA TGC GGA GCG 354 
Phe Gln Val Cys Asp Lys Ile Lys Gly Ile Ala Leu Glu Cys Gly Ala 
75 80 85 

GTC AAT ACG CTT GTT TTA GAA AAT GAT GAG CTT GTG GGT TAC AAT ACC 402 
Val Asn Thr Leu Val Leu Glu Asn Asp Glu Leu Val Gly Tyr Asn Thr 
90 95 100 

GAC GCT TTA GGG TTT TAT CTT TCT TTA AAG CAA AAA AAC TAT CAA AAC 450 
Asp Ala Leu Gly Phe Tyr Leu Ser Leu Lys Gln Lys Asn Tyr Gln Asn 
105 110 115 120 

GCT TTG ATT TTA GGA GCT GGG GGG AGC GCT AAA GCC CTA GCG TGT GAA 498 
Ala Leu Ile Leu Gly Ala Gly Gly Ser Ala Lys Ala Leu Ala Cys Glu 
125 130 135 

TTG AAA AAA CAA GGC TTA CAA GTG AGC GTG TTG AAC CGC TCT TCT AGG 546 
Leu Lys Lys Gln Gly Leu Gln Val Ser Val Leu Asn Arg Ser Ser Arg 
140 145 150 

GGA TTG GAT TTT TTC CAA CGC CTG GGC TGT GAT TGT TTT ATG GAG CCT 594 
Gly Leu Asp Phe Phe Gln Arg Leu Gly Cys Asp Cys Phe Met Glu Pro 
155 160 165 

CCT AAA AGC GCT TTT GAT TTG ATT ATT AAC GCC ACT TCA GCG AGT TTG 642 
Pro Lys Ser Ala Phe Asp Leu Ile Ile Asn Ala Thr Ser Ala Ser Leu 
170 175 180 

CAT AAC GAA TTG CCT TTG AAT AAA GAG GTT TTG AAA GGG TAT TTT AAA 690 
His Asn Glu Leu Pro Leu Asn Lys Glu Val Leu Lys Gly Tyr Phe Lys 
185 190 195 200 

GAG GGC AAG CTC GCT TAT GAT TTG GCG TAT GGG TTT TTA ACG CCC TTT 738 
Glu Gly Lys Leu Ala Tyr Asp Leu Ala Tyr Gly Phe Leu Thr Pro Phe 
205 210 215 

TTG TCT TTA GCC AAA GAG TTA AAA ACC CCT TTT CAA GAC GGA AAA GAC 786 
Leu Ser Leu Ala Lys Glu Leu Lys Thr Pro Phe Gln Asp Gly Lys Asp 
220 225 230 

ATG CTC ATC TAT CAA GCT GCT TTA AGT TTT GAA AAA TTC AGC GCT TCT 834 
Met Leu Ile Tyr Gln Ala Ala Leu Ser Phe Glu Lys Phe Ser Ala Ser 
235 240 245 

CAA ATC CCT TAT TCA AAA GCG TTT GAA GTC ATG CGA AGT GTT TTT TGATG 884 
Gln Ile Pro Tyr Ser Lys Ala Phe Glu Val Met Arg Ser Val Phe 
250 255 260 

CAAGGGTTTT TAAGAAGCCT GTTTTTTGGG GTTAAAAAGA TCCCTAAACC ATTCGCTCCT 944 
CTAGTAGAAA AGGGCGTTTT AAAAGAAGCG CTTGAATTGA AAAAGG 990 

(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 263 amino acids 
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85 90 95 

Ile Thr Ala Asp Ser Leu Gln Phe Met Asp Lys Lys Ser Asp Asn Pro 
100 105 110 

Gln Ala Asn Ala Met Gln Asp Ser Ile Met His Glu Asn Ser Asn Asn 
115 120 125 

Ala Tyr Pro Ala Asn His Asn Ala Pro Ser Gln Asp Pro Phe Asn Gln 
130 135 140 

Ala Tyr Ala Gln Asn Ala Tyr Ala Lys Glu Asn Leu Gln Ala Gln Pro 
145 150 155 160 

Ser Lys Tyr Gln Asn Ser Val Pro Glu Ile Asn Ile Asp Glu Glu Glu 
165 170 175 

Ile Pro Phe 



(2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 91...879 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 

TCCAAAACGA TTGGGCTGAA ATTGAATTTT CTCACGAAAC AAAGGGCTAT GTGTTTTTAA 60 

AACTTTTAAA AAAGGCTGAA TGAAAGAATA ATG AAA TTA AAA TCT TTT GGG GTT 114 
Met Lys Leu Lys Ser Phe Gly Val 
1 5 

TTT GGA AAT CCC ATT AAG CAT TCC AAA TCG CCC TTA ATC CAT AAC GCT 162 
Phe Gly Asn Pro Ile Lys His Ser Lys Ser Pro Leu Ile His Asn Ala 
10 15 20 

TGT TTT TTA ACT TTT CAA AAA GAA TTA AGG TTT TTG GGG CAT TAC CAC 210 
Cys Phe Leu Thr Phe Gln Lys Glu Leu Arg Phe Leu Gly His Tyr His 
25 30 35 40 

CCC ATA TTA CTC CCT TTA GAA AGC CAC ATC AAA AGC GAG TTT TTG CAT 258 
Pro Ile Leu Leu Pro Leu Glu Ser His Ile Lys Ser Glu Phe Leu His 
45 50 55 

TTG GGA TTG AGT GGG GCT AAT GTA ACC TTA CCC TTT AAA GAA AGG GCG 306 
Leu Gly Leu Ser Gly Ala Asn Val Thr Leu Pro Phe Lys Glu Arg Ala 
60 65 70 
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GGT TCA AGC GTT TTG ATA GAA GGG CGT TTG ACT TAT GAG AGT TGG ATG 354 
Gly Ser Ser Val Leu Ile Glu Gly Arg Leu Thr Tyr Glu Ser Trp Met 
70 75 80 85 

GAT CAA ACG GGC AAA AAA AAT TCC CGC CAC ACT ATC ACA GCG GAC TCG 402 
Asp Gln Thr Gly Lys Lys Asn Ser Arg His Thr Ile Thr Ala Asp Ser 
90 95 100 

TTG CAA TTT ATG GAT AAA AAG TCA GAC AAT CCC CAA GCA AAC GCT ATG 450 
Leu Gln Phe Met Asp Lys Lys Ser Asp Asn Pro Gln Ala Asn Ala Met 
105 110 115 

CAA GAT AGT ATA ATG CAT GAG AAT TCC AAC AAC GCT TAT CCC GCT AAT 498 
Gln Asp Ser Ile Met His Glu Asn Ser Asn Asn Ala Tyr Pro Ala Asn 
120 125 130 

CAT AAC GCT CCC AGC CAA GAT CCT TTT AAC CAA GCT TAT GCG CAA AAC 546 
His Asn Ala Pro Ser Gln Asp Pro Phe Asn Gln Ala Tyr Ala Gln Asn 
135 140 145 

GCT TAC GCT AAA GAG AAT TTA CAA GCA CAG CCG TCC AAG TAT CAA AAC 594 
Ala Tyr Ala Lys Glu Asn Leu Gln Ala Gln Pro Ser Lys Tyr Gln Asn 
150 155 160 165 

AGC GTG CCT GAA ATC AAT ATT GAT GAA GAA GAA ATC CCC TTT TAAGGGTTA 645 
Ser Val Pro Glu Ile Asn Ile Asp Glu Glu Glu Ile Pro Phe 
170 175 

AAATTAAGGA GACATTATGG AAAGAAAACG CTATTCAAAA CGCTATTGCA AATACACTGA 705 
AGCTAAAATC AGCTT 720 

(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 

Met Phe Asn Lys Val Ile Met Val Gly Arg Leu Thr Arg Asn Val Glu 
1 5 10 15 

Leu Lys Tyr Leu Pro Ser Gly Ser Ala Ala Ala Thr Ile Gly Leu Ala 
20 25 30 

Thr Ser Arg Arg Phe Lys Lys Gln Asp Gly Thr Leu Gly Glu Glu Val 
35 40 45 

Cys Phe Ile Asp Ala Arg Leu Phe Gly Arg Thr Ala Glu Ile Ala Asn 
50 55 60 

Gln Tyr Leu Ser Lys Gly Ser Ser Val Leu Ile Glu Gly Arg Leu Thr 
65 70 75 80 

Tyr Glu Ser Trp Met Asp Gln Thr Gly Lys Lys Asn Ser Arg His Thr 
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740 745 750 

Val Val Glu Gln Gly Asp Ile Lys Glu Met Ile Asp Arg Leu Lys Ser 
755 760 765 

Lys His Glu Arg Leu Leu Ala Met Val Phe Lys Lys Glu Asn Glu Arg 
770 775 780 

Ile Thr Leu Ala Cys Gly Val Lys Asn Ala Pro Ile Lys Ala Asn Val 
785 790 795 800 

Trp Ala Asn Glu Val Ala Gln Ile Leu Gly Gly Lys Gly Gly Gly Arg 
805 810 815 

Gly Asp Phe Ala Ser Ala Gly Gly Lys Asp Ile Glu Asn Leu Gln Ala 
820 825 830 

Ala Leu Asn Leu Ala Lys Asn Thr Ala Leu Lys Ala Leu Glu Gly 
835 840 845 

(2) INFORMATION FOR SEQ ID NO: 535: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 100...636 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:535: 

TCGCACGCCA AAGAAAAACA CGAAAAAACC GAACACACGC ATTCTCACCA CACAGAGGAA 60 

GCAGAAAGCG TAGGATCTCA TAGCGAATAA GGGCTTATC ATG TTT AAT AAA GTG 114 
Met Phe Asn Lys Val 
1 5 

ATT ATG GTA GGG CGT TTG ACC AGG AAT GTG GAG TTG AAA TAT TTG CCT 162 
Ile Met Val Gly Arg Leu Thr Arg Asn Val Glu Leu Lys Tyr Leu Pro 
10 15 20 

AGC GGT TCG GCT GCG GCT ACA ATA GGT TTA GCC ACA AGC AGG CGT TTT 210 
Ser Gly Ser Ala Ala Ala Thr Ile Gly Leu Ala Thr Ser Arg Arg Phe 
25 30 35 

AAA AAA CAA GAC GGC ACG CTA GGC GAA GAG GTG TGC TTT ATA GAT GCG 258 
Lys Lys Gln Asp Gly Thr Leu Gly Glu Glu Val Cys Phe Ile Asp Ala 
40 45 50 

CGT TTG TTT GGG CGA ACG GCT GAA ATC GCT AAC CAG TAT TTG AGC AAG 306 
Arg Leu Phe Gly Arg Thr Ala Glu Ile Ala Asn Gln Tyr Leu Ser Lys 
55 60 65 
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Arg His Gly Tyr Leu Met Gly Leu Lys Glu Ala Phe Leu Tyr Lys Val 
305 310 315 320 

Val Gly Val Val Cys Glu Gln Phe Ala Asn Thr His Ala Tyr Leu Lys 
325 330 335 

Glu Ser Lys Glu Met Val Val Lys Glu Cys Phe Glu Glu Glu Glu His 
340 345 350 

Phe Leu Glu Thr Leu Glu Ser Gly Met Glu Leu Phe Asn Leu Ser Leu 
355 360 365 

Lys His Leu Asn Glu Asn Lys Ile Phe Asp Gly Lys Ile Ala Phe Lys 
370 375 380 

Leu Tyr Asp Thr Phe Gly Phe Pro Leu Asp Leu Thr Asn Asp Met Leu 
385 390 395 400 

Arg Ser His Gly Ala Cys Ala Asp Met Gln Gly Phe Glu Leu Cys Met 
405 410 415 

Gln Glu Gln Val Lys Arg Ser Lys Ala Ser Trp Lys Gly Lys Gln Asn 
420 425 430 

Asn Ala Asp Phe Ser Ala Ile Leu Asn Ala Tyr Ala Pro Asn Val Phe 
435 440 445 

Val Gly Tyr Glu Thr Thr Glu Cys Ser Ala Lys Val Leu Gly Phe Phe 
450 455 460 

Asp Ser Asp Phe Lys Glu Ile Thr Asp Ala Asn Pro Asn Gln Glu Val 
465 470 475 480 

Trp Val Leu Leu Glu Lys Thr Pro Phe Tyr Ala Glu Gly Gly Gly Ala 
485 490 495 

Ile Gly Asp Arg Gly Ala Leu Phe Lys Asp Asn Gly Glu Val Ala Ile 
500 505 510 

Val Leu Asp Thr Lys Asn Phe Phe Gly Leu Asn Phe Ser Leu Leu Glu 
515 520 525 

Ile Lys Lys Ala Leu Lys Lys Gly Asp Gln Val Ile Ala Gln Val Ser 
530 535 540 

Asp Glu Arg Phe Glu Ile Ala Lys His His Ser Ala Thr His Leu Leu 
545 550 555 560 

Gln Ser Ala Leu Arg Glu Val Leu Gly Ser His Val Ser Gln Ala Gly 
565 570 575 

Ser Leu Val Glu Ser Lys Arg Leu Arg Phe Asp Phe Ser His Ala Lys 
580 585 590 

Ala Leu Asn Asp Glu Glu Leu Glu Lys Val Glu Asp Leu Val Asn Ala 
595 600 605 

Gln Ile Phe Lys His Leu Asn Ser Gln Val Glu His Met Pro Leu Asn 
610 615 620 

Gln Ala Lys Asp Lys Gly Ala Leu Ala Leu Phe Ser Glu Lys Tyr Ala 
625 630 635 640 

Glu Asn Val Arg Val Val Ser Phe Lys Glu Ala Ser Ile Glu Leu Cys 
645 650 655 

Gly Gly Ile His Val Glu Asn Thr Gly Leu Ile Gly Gly Phe Arg Ile 
660 665 670 

Val Lys Glu Ser Gly Val Ser Ser Gly Val Arg Arg Ile Glu Ala Val 
675 680 685 

Cys Gly Lys Ala Phe Tyr Gln Leu Ala Lys Glu Glu Asn Lys Glu Leu 
690 695 700 

Lys Asn Ala Lys Thr Leu Leu Lys Asn Asn Asp Val Ile Ala Gly Ile 
705 710 715 720 

Asn Lys Leu Lys Glu Ser Val Lys Asn Ser Gln Lys Ala Pro Val Ser 
725 730 735 

Met Asp Leu Pro Val Glu Lys Ile His Gly Val Asn Leu Val Val Gly 
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GCTTATTTTA GGCTCTCAAT CCAGCACTAG GGCGAATCTC TTAAAAGAGC ATGGGATTAA 2725 
GTTTGAACAA AAAGCGCTCT ATTTTGATGA AGAT^GCCTA AAAACCACAG ACCCTAGGGA 2785 
GTTTG 2790 

(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 



Met Asp Ile Arg Asn Glu Phe Leu Gln Phe Phe Gln Asn Lys Gly His 
1 5 10 15 

Ala Val Tyr Pro Ser Met Pro Leu Val Pro Asn Asp Ala Thr Leu Leu 
20 25 30 

Phe Thr Asn Ala Gly Met Val Gln Phe Lys Asp Ile Phe Thr Gly Ile 
35 40 45 

Val Pro Arg Pro Ser Ile Pro Arg Ala Ala Ser Ser Gln Leu Cys Met 
50 55 60 

Arg Ala Gly Gly Lys His Asn Asp Leu Glu Asn Val Gly Tyr Thr Ala 
65 70 75 80 

Arg His His Thr Leu Phe Glu Met Leu Gly Asn Phe Ser Phe Gly Asp 
85 90 95 

Tyr Phe Lys Glu Glu Ala Ile Leu Phe Ala Trp Glu Phe Val Thr Lys 
100 105 110 

Asn Leu Gly Phe Lys Pro Lys Asp Leu Tyr Ile Ser Val His Glu Lys 
115 120 125 

Asp Asp Glu Ala Val Lys Leu Trp Glu Lys Phe Val Pro Val Asp Arg 
130 135 140 

Ile Lys Lys Met Gly Asp Lys Asp Asn Phe Trp Gln Met Gly Asp Ser 
145 150 155 160 

Gly Pro Cys Gly Pro Cys Ser Glu Ile Tyr Ile Asp Gln Gly Glu Lys 
165 170 175 

His Phe Lys Gly Ser Glu Asp Tyr Phe Gly Gly Glu Gly Asp Arg Phe 
180 185 190 

Leu Glu Ile Trp Asn Leu Val Phe Met Gln Tyr Glu Arg Ser Asn Asp 
195 200 205 

Gly Val Leu Ser Pro Leu Pro Lys Pro Ser Ile Asp Thr Gly Met Gly 
210 215 220 

Leu Glu Arg Val Gln Ala Leu Leu Glu His Lys Leu Asn Asn Phe Asp 
225 230 235 240 

Ser Ser Leu Phe Ala Pro Leu Met Glu Glu Ile Ser Glu Leu Thr Ser 
245 250 255 

Leu Asp Tyr Ala Ser Glu Phe Gln Pro Ser Phe Arg Val Val Ala Asp 
260 265 270 

His Ala Arg Ala Val Ala Phe Leu Leu Ala Gln Gly Val His Phe Asn 
275 280 285 

Lys Glu Gly Arg Gly Tyr Val Leu Arg Arg Ile Leu Arg Arg Ala Leu 
290 295 300 
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GCC AAA GAT AAG GGA GCG TTA GCG TTA TTC AGT GAA AAA TAC GCT GAA 2038 
Ala Lys Asp Lys Gly Ala Leu Ala Leu Phe Ser Glu Lys Tyr Ala Glu 
630 635 640 

AAT GTG CGG GTG GTG AGC TTT AAA GAA GCG TCC ATT GAA TTG TGT GGG 2086 
Asn Val Arg Val Val Ser Phe Lys Glu Ala Ser Ile Glu Leu Cys Gly 
645 650 655 

GGC ATT CAT GTG GAA AAT ACT GGG CTT ATT GGG GGG TTT AGG ATT GTA 2134 
Gly Ile His Val Glu Asn Thr Gly Leu Ile Gly Gly Phe Arg Ile Val 
660 665 670 

AAA GAA AGC GGG GTG AGT AGT GGG GTC AGA CGC ATT GAA GCG GTG TGC 2182 
Lys Glu Ser Gly Val Ser Ser Gly Val Arg Arg Ile Glu Ala Val Cys 
675 680 685 

GGG AAA GCC TTT TAC CAA CTG GCT AAA GAA GAA AAT AAA GAG CTT AAA 2230 
Gly Lys Ala Phe Tyr Gln Leu Ala Lys Glu Glu Asn Lys Glu Leu Lys 
690 695 700 705 

AAC GCT AAG ACT TTA TTG AAA AAT AAC GAT GTG ATC GCC GGT ATC AAT 2278 
Asn Ala Lys Thr Leu Leu Lys Asn Asn Asp Val Ile Ala Gly Ile Asn 
710 715 720 

AAG CTT AAA GAG AGC GTG AAA AAC AGC CAA AAA GCC CCC GTT TCT ATG 2326 
Lys Leu Lys Glu Ser Val Lys Asn Ser Gln Lys Ala Pro Val Ser Met 
725 730 735 

GAT TTA CCG GTT GAA AAA ATC CAT GGC GTG AAT TTG GTG GTG GGC GTA 2374 
Asp Leu Pro Val Glu Lys Ile His Gly Val Asn Leu Val Val Gly Val 
740 745 750 

GTG GAA CAA GGC GAC ATT AAA GAA ATG ATT GAC CGA TTG AAA AGT AAG 2422 
Val Glu Gln Gly Asp Ile Lys Glu Met Ile Asp Arg Leu Lys Ser Lys 
755 760 765 

CAT GAA AGA TTG CTC GCT ATG GTG TTT AAA AAA GAA AAT GAG CGA ATC 2470 
His Glu Arg Leu Leu Ala Met Val Phe Lys Lys Glu Asn Glu Arg Ile 
770 775 780 785 

ACT CTC GCA TGC GGG GTG AAA AAC GCG CCC ATA AAA GCG AAT GTG TGG 2518 
Thr Leu Ala Cys Gly Val Lys Asn Ala Pro Ile Lys Ala Asn Val Trp 
790 795 800 

GCT AAT GAA GTG GCG CAA ATT TTA GGG GGC AAA GGG GGC GGG AGA GGT 2566 
Ala Asn Glu Val Ala Gln Ile Leu Gly Gly Lys Gly Gly Gly Arg Gly 
805 810 815 

GAT TTT GCG AGC GCT GGA GGC AAG GAT ATT GAA AAT TTG CAA GCC GCA 2614 
Asp Phe Ala Ser Ala Gly Gly Lys Asp Ile Glu Asn Leu Gln Ala Ala 
820 825 830 

CTC AAT TTA GCG AAA AAT ACC GCT CTT AAA GCT TTA GAG GGA TAGCATGGA 2665 
Leu Asn Leu Ala Lys Asn Thr Ala Leu Lys Ala Leu Glu Gly 
835 840 845 
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AGT CAT GGG GCG TGT GCG GAT ATG CAA GGC TTT GAA TTG TGC ATG CAA 1366 
Ser His Gly Ala Cys Ala Asp Met Gln Gly Phe Glu Leu Cys Met Gln 
405 410 415 

GAG CAA GTG AAA CGC TCT AAA GCT TCA TGG AAA GGC AAA CAA AAC AAC 1414 
Glu Gln Val Lys Arg Ser Lys Ala Ser Trp Lys Gly Lys Gln Asn Asn 
420 425 430 

GCC GAT TTT AGC GCT ATT TTA AAC GCT TAT GCA CCT AAT GTT TTT GTG 1462 
Ala Asp Phe Ser Ala Ile Leu Asn Ala Tyr Ala Pro Asn Val Phe Val 
435 440 445 

GGG TAT GAA ACG ACA GAA TGT TCT GCT AAA GTT TTA GGG TTT TTT GAT 1510 
Gly Tyr Glu Thr Thr Glu Cys Ser Ala Lys Val Leu Gly Phe Phe Asp 
450 455 460 465 

AGC GAT TTT AAA GAA ATA ACC GAT GCA AAT CCT AAC CAA GAA GTC TGG 1558 
Ser Asp Phe Lys Glu Ile Thr Asp Ala Asn Pro Asn Gln Glu Val Trp 
470 475 480 

GTG TTG TTA GAA AAA ACC CCT TTT TAT GCA GAA GGT GGA GGG GCT ATA 1606 
Val Leu Leu Glu Lys Thr Pro Phe Tyr Ala Glu Gly Gly Gly Ala Ile 
485 490 495 

GGC GAT AGG GGC GCG CTT TTT AAA GAC AAT GGA GAA GTG GCT ATC GTG 1654 
Gly Asp Arg Gly Ala Leu Phe Lys Asp Asn Gly Glu Val Ala Ile Val 
500 505 510 

TTA GAT ACA AAA AAC TTT TTT GGG CTT AAT TTT TCA CTC CTT GAA ATC 1702 
Leu Asp Thr Lys Asn Phe Phe Gly Leu Asn Phe Ser Leu Leu Glu Ile 
515 520 525 

AAA AAA GCG CTA AAA AAA GGC GAT CAA GTG ATC GCG CAA GTG AGC GAT 1750 
Lys Lys Ala Leu Lys Lys Gly Asp Gln Val Ile Ala Gln Val Ser Asp 
530 535 540 545 

GAG CGC TTT GAA ATC GCC AAA CAC CAT AGT GCG ACT CAT TTA TTG CAG 1798 
Glu Arg Phe Glu Ile Ala Lys His His Ser Ala Thr His Leu Leu Gln 
550 555 560 

AGC GCT TTA AGA GAA GTT TTA GGC TCG CAT GTG AGT CAA GCG GGG AGT 1846 
Ser Ala Leu Arg Glu Val Leu Gly Ser His Val Ser Gln Ala Gly Ser 
565 570 575 

TTA GTG GAA TCC AAG CGA TTG CGC TTT GAT TTC TCG CAT GCT AAA GCG 1894 
Leu Val Glu Ser Lys Arg Leu Arg Phe Asp Phe Ser His Ala Lys Ala 
580 585 590 

CTC AAT GAT GAA GAG CTA GAA AAA GTA GAA GAT TTA GTC AAC GCT CAA 1942 
Leu Asn Asp Glu Glu Leu Glu Lys Val Glu Asp Leu Val Asn Ala Gln 
595 600 605 

ATT TTC AAG CAC CTA AAT AGC CAG GTG GAG CAT ATG CCT TTA AAC CAA 1990 
Ile Phe Lys His Leu Asn Ser Gln Val Glu His Met Pro Leu Asn Gln 
610 615 620 625 
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TTT AAG GGG AGC GAG GAT TAT TTT GGG GGC GAG GGC GAT AGG TTT TTA 694 
Phe Lys Gly Ser Glu Asp Tyr Phe Gly Gly Glu Gly Asp Arg Phe Leu 
180 185 190 

GAA ATT TGG AAT CTG GTG TTC ATG CAA TAC GAA CGC TCT AAT GAT GGC 742 
Glu Ile Trp Asn Leu Val Phe Met Gln Tyr Glu Arg Ser Asn Asp Gly 
195 200 205 

GTT TTA TCC CCC TTG CCA AAG CCT AGC ATT GAT ACA GGC ATG GGA TTA 790 
Val Leu Ser Pro Leu Pro Lys Pro Ser Ile Asp Thr Gly Met Gly Leu 
210 215 220 225 

GAA AGG GTG CAA GCG CTA TTA GAA CAT AAG CTC AAT AAT TTT GAT TCT 838 
Glu Arg Val Gln Ala Leu Leu Glu His Lys Leu Asn Asn Phe Asp Ser 
230 235 240 

TCA TTA TTT GCG CCC CTA ATG GAA GAA ATC AGC GAG CTT ACA AGC CTA 886 
Ser Leu Phe Ala Pro Leu Met Glu Glu Ile Ser Glu Leu Thr Ser Leu 
245 250 255 

GAT TAT GCG AGC GAG TTC CAG CCA AGC TTT AGG GTA GTG GCC GAT CAC 934 
Asp Tyr Ala Ser Glu Phe Gln Pro Ser Phe Arg Val Val Ala Asp His 
260 265 270 

GCA AGA GCG GTA GCA TTC TTG CTC GCT CAA GGG GTG CAT TTC AAT AAG 982 
Ala Arg Ala Val Ala Phe Leu Leu Ala Gln Gly Val His Phe Asn Lys 
275 280 285 

GAA GGC CGT GGC TAT GTT TTA AGG CGC ATT TTA AGG CGA GCC TTA AGG 1030 
Glu Gly Arg Gly Tyr Val Leu Arg Arg Ile Leu Arg Arg Ala Leu Arg 
290 295 300 305 

CAT GGG TAT TTA ATG GGC TTG AAA GAA GCG TTT TTA TAC AAA GTC GTG 1078 
His Gly Tyr Leu Met Gly Leu Lys Glu Ala Phe Leu Tyr Lys Val Val 
310 315 320 

GGC GTG GTG TGC GAG CAA TTT GCT AAC ACG CAT GCG TAT TTG AAA GAG 1126 
Gly Val Val Cys Glu Gln Phe Ala Asn Thr His Ala Tyr Leu Lys Glu 
325 330 335 

TCT AAA GAA ATG GTG GTA AAA GAA TGT TTT GAA GAA GAA GAG CAC TTT 1174 
Ser Lys Glu Met Val Val Lys Glu Cys Phe Glu Glu Glu Glu His Phe 
340 345 350 

TTA GAG ACT TTG GAA TCG GGC ATG GAA TTG TTT AAC TTG TCT TTA AAG 1222 
Leu Glu Thr Leu Glu Ser Gly Met Glu Leu Phe Asn Leu Ser Leu Lys 
355 360 365 

CAT TTG AAT GAA AAT AAA ATC TTT GAT GGC AAG ATC GCT TTC AAG CTT 1270 
His Leu Asn Glu Asn Lys Ile Phe Asp Gly Lys Ile Ala Phe Lys Leu 
370 375 380 385 

TAT GAC ACT TTT GGT TTC CCT TTG GAT TTA ACA AAC GAC ATG TTA AGA 1318 
Tyr Asp Thr Phe Gly Phe Pro Leu Asp Leu Thr Asn Asp Met Leu Arg 
390 395 400 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:533:

TTCTCCTTTA ATTTTGGATG TTTAAAGTAT AATAAACTAT CTTTTTAAAA AAATAACTTA 60

AAAGAGCTAT AAAATAGCCT TAAAATACGC GATAAAACAA CAAAAAGGAA TACCC ATG 118
Met
1

GAT ATT CGC AAC GAA TTT TTA CAA TTT TTT CAA AAT AAA GGG CAT GCC 166
Asp Ile Arg Asn Glu Phe Leu Gln Phe Phe Gln Asn Lys Gly His Ala
5 10 15

GTT TAT CCT AGC ATG CCT TTA GTG CCT AAT GAC GCT ACC TTG CTT TTT 214
Val Tyr Pro Ser Met Pro Leu Val Pro Asn Asp Ala Thr Leu Leu Phe
20 25 30

ACC AAT GCC GGC ATG GTG CAA TTT AAA GAT ATT TTT ACC GGG ATT GTG 262
Thr Asn Ala Gly Met Val Gln Phe Lys Asp Ile Phe Thr Gly Ile Val
35 40 45

CCA CGC CCT AGC ATT CCT AGA GCG GCA AGC TCG CAA TTG TGC ATG CGC 310
Pro Arg Pro Ser Ile Pro Arg Ala Ala Ser Ser Gln Leu Cys Met Arg
50 55 60 65

GCA GGC GGC AAG CAT AAC GAT TTG GAA AAT GTC GGT TAT ACC GCA AGG 358
Ala Gly Gly Lys His Asn Asp Leu Glu Asn Val Gly Tyr Thr Ala Arg
70 75 80

CAC CAC ACG CTT TTT GAA ATG CTA GGG AAT TTC TCT TTT GGG GAT TAT 406
His His Thr Leu Phe Glu Met Leu Gly Asn Phe Ser Phe Gly Asp Tyr
85 90 95

TTC AAA GAA GAA GCG ATC TTG TTT GCG TGG GAA TTT GTA ACC AAG AAT 454
Phe Lys Glu Glu Ala Ile Leu Phe Ala Trp Glu Phe Val Thr Lys Asn
100 105 110

TTA GGG TTT AAG CCT AAA GAT TTA TAC ATC AGC GTG CAT GAA AAG GAC 502
Leu Gly Phe Lys Pro Lys Asp Leu Tyr Ile Ser Val His Glu Lys Asp
115 120 125

GAT GAA GCC GTT AAA TTA TGG GAA AAG TTT GTG CCT GTT GAT AGG ATT 550
Asp Glu Ala Val Lys Leu Trp Glu Lys Phe Val Pro Val Asp Arg Ile
130 135 140 145

AAA AAA ATG GGC GAT AAA GAT AAT TTT TGG CAA ATG GGC GAT AGC GGG 598
Lys Lys Met Gly Asp Lys Asp Asn Phe Trp Gln Met Gly Asp Ser Gly
150 155 160

CCT TGC GGG CCT TGC AGT GAA ATT TAC ATT GAT CAG GGC GAA AAA CAC 646
Pro Cys Gly Pro Cys Ser Glu Ile Tyr Ile Asp Gln Gly Glu Lys His
165 170 175
Lys Asp Asp Glu Ser Phe Phe Ser Cys Ala Gly Val Leu Ala Arg His
65 70 75 80

Tyr Asn Glu Phe Phe Ser Asn Ser Arg Ala Asp Phe Ser Leu Ser Ala
85 90 95

Tyr Leu Lys Glu Arg Gly Val Leu Gly Val Cys Gly Val Asp Thr Arg
100 105 110

Ser Leu Ile Lys Thr Leu Arg His His Gly Cys Leu Met Met Val Ala
115 120 125

Ser Thr Ile Glu His Asp Lys Asn Lys Leu Glu Glu Ile Leu Lys Asn
130 135 140

Ala Pro Lys Ile Ser His Ser Pro Leu Val Ser Ser Val Ser Thr Pro
145 150 155 160

Lys Ile Thr Thr His Gln Arg Ala Thr Phe Asp Phe Lys Thr Leu Asp
165 170 175

Tyr Lys Pro Phe Asp Glu Lys Thr Ser His Lys Ile Ile Ala Val Leu
180 185 190

Asp Phe Gly Ala Lys Gly Asn Ile Leu Asn Glu Leu Gln Asn Val Gly
195 200 205

Leu Lys Ala Leu Ile Tyr Pro His His Thr Lys Ala Ser Glu Leu Ile
210 215 220

Lys Ala Tyr Glu Lys Lys Glu Ile Ser Gly Ile Phe Leu Ser Asn Gly
225 230 235 240

Pro Gly Asp Pro Leu Ser Leu Gln Gln Glu Ile Gly Glu Ile Lys Gln
245 250 255

Leu Ile Asn Ala Lys Ile Pro Met Leu Gly Ile Cys Leu Gly His Gln
260 265 270

Leu Leu Ser Ile Ala Gln Gly Tyr Pro Thr Tyr Lys Leu Lys Phe Gly
275 280 285

His His Gly Ser Asn His Pro Val Lys Asn Leu Lys Thr Asn Ala Val
290 295 300

Glu Ile Thr Ala Gln Asn His Asn Tyr Cys Val Pro Glu Asp Ile Glu
305 310 315 320

Glu Ile Ala Ile Ile Thr His Arg Asn Leu Phe Asp Asn Thr Ile Glu
325 330 335

Gly Val Arg Tyr Lys Asn Ala Pro Ile Ile Ser Val Gln His His Pro
340 345 350

Glu Ser Ser Pro Gly Pro Lys Glu Ser His Tyr Ile Phe Lys Glu Phe
355 360 365

Val Glu Leu Leu Lys Asp Phe
370 375

(2) INFORMATION FOR SEQ ID NO: 533: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2790 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 116...2656
(D) OTHER INFORMATION:
ATT AAC GCT AAA ATC CCC ATG CTT GGC ATT TGC TTA GGG CAT CAA TTG 873
Ile Asn Ala Lys Ile Pro Met Leu Gly Ile Cys Leu Gly His Gln Leu
260 265 270

CTC TCT ATC GCT CAA GGC TAC CCT ACT TAC AAG CTC AAA TTT GGT CAT 921
Leu Ser Ile Ala Gln Gly Tyr Pro Thr Tyr Lys Leu Lys Phe Gly His
275 280 285

CAT GGG AGC AAC CAC CCC GTT AAA AAC CTA AAA ACA AAC GCC GTA GAA 969
His Gly Ser Asn His Pro Val Lys Asn Leu Lys Thr Asn Ala Val Glu
290 295 300 305

ATC ACC GCG CAA AAC CAC AAC TAT TGC GTC CCT GAA GAC ATT GAA GAA 1017
Ile Thr Ala Gln Asn His Asn Tyr Cys Val Pro Glu Asp Ile Glu Glu
310 315 320

ATC GCC ATT ATC ACG CAC CGC AAT CTT TTT GAC AAC ACC ATT GAG GGC 1065
Ile Ala Ile Ile Thr His Arg Asn Leu Phe Asp Asn Thr Ile Glu Gly
325 330 335

GTG CGT TAT AAA AAC GCT CCC ATT ATC TCT GTC CAG CAC CAC CCA GAA 1113
Val Arg Tyr Lys Asn Ala Pro Ile Ile Ser Val Gln His His Pro Glu
340 345 350

ACT AGC CCA GGT CCT AAA GAG AGC CAC TAT ATT TTT AAA GAA TTT GTG 1161
Ser Ser Pro Gly Pro Lys Glu Ser His Tyr Ile Phe Lys Glu Phe Val
355 360 365

GAA TTG TTA AAG GAT TTT TAGGGGTTTT TAAAACAGCG CTTATAGAGA CTGAAAAG 1217
Glu Leu Leu Lys Asp Phe
370 375

CGCTTTAAAA ATAGATTTAA ATCTTTTTAT CAAAAAATCT CGC 1260

(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 

Met Val Ser Leu Tyr Leu Glu Asn Gly Leu Phe Leu Gln Ala Gln Ser
1 5 10 15

Phe Gly Ala Ser Gly Thr Gln Ala Gly Glu Leu Val Phe Asn Thr Ser
20 25 30

Met Ser Gly Tyr Gln Glu Val Ile Ser Asp Pro Ser Tyr Lys Gly Gln
35 40 45

Phe Val Val Phe Ser Met Pro Glu Ile Gly Val Val Gly Ala Asn Ser
50 55 60
AGC GGT TAT CAA GAA GTC ATT AGC GAC CCT AGC TAT AAG GGG CAA TTT 201
Ser Gly Tyr Gln Glu Val Ile Ser Asp Pro Ser Tyr Lys Gly Gln Phe
35 40 45

GTG GTT TTT AGC ATG CCT GAG ATT GGG GTT GTG GGT GCT AAT TCT AAA 249
Val Val Phe Ser Met Pro Glu Ile Gly Val Val Gly Ala Asn Ser Lys
50 55 60 65

GAT GAT GAA TCC TTT TTT TCA TGC GCA GGG GTT TTA GCG CGC CAT TAC 297
Asp Asp Glu Ser Phe Phe Ser Cys Ala Gly Val Leu Ala Arg His Tyr
70 75 80

AAC GAA TTT TTT TCT AAC TCA AGG GCG GAT TTT AGC TTG AGC GCT TAT 345
Asn Glu Phe Phe Ser Asn Ser Arg Ala Asp Phe Ser Leu Ser Ala Tyr
85 90 95

TTG AAA GAG CGT GGC GTT TTA GGG GTT TGT GGC GTT GAT ACT AGG AGT 393
Leu Lys Glu Arg Gly Val Leu Gly Val Cys Gly Val Asp Thr Arg Ser
100 105 110

TTG ATT AAA ACC TTA CGC CAT CAT GGG TGC TTA ATG ATG GTC GCT TCC 441
Leu Ile Lys Thr Leu Arg His His Gly Cys Leu Met Met Val Ala Ser
115 120 125

ACG ATA GAG CAT GAC AAA AAC AAG CTT GAA GAA ATT TTA AAA AAC GCT 489
Thr Ile Glu His Asp Lys Asn Lys Leu Glu Glu Ile Leu Lys Asn Ala
130 135 140 145

CCT AAA ATT TCT CAC TCC CCC CTA GTG TCT AGC GTT TCT ACG CCA AAA 537
Pro Lys Ile Ser His Ser Pro Leu Val Ser Ser Val Ser Thr Pro Lys
150 155 160

ATA ACC ACG CAC CAG CGT GCG ACT TTT GAT TTC AAA ACC CTA GAT TAC 585
Ile Thr Thr His Gln Arg Ala Thr Phe Asp Phe Lys Thr Leu Asp Tyr
165 170 175

AAG CCT TTT GAT GAA AAA ACC TCT CAT AAA ATT ATC GCG GTG TTA GAC 633
Lys Pro Phe Asp Glu Lys Thr Ser His Lys Ile Ile Ala Val Leu Asp
180 185 190

TTT GGG GCT AAG GGC AAT ATT TTA AAC GAG CTT CAA AAT GTG GGG TTA 681
Phe Gly Ala Lys Gly Asn Ile Leu Asn Glu Leu Gln Asn Val Gly Leu
195 200 205

AAA GCC CTT ATT TAC CCG CAC CAC ACT AAA GCT AGC GAG CTG ATT AAA 729
Lys Ala Leu Ile Tyr Pro His His Thr Lys Ala Ser Glu Leu Ile Lys
210 215 220 225

GCC TAT GAA AAA AAA GAA ATT AGC GGG ATT TTC CTC TCT AAC GGG CCG 777
Ala Tyr Glu Lys Lys Glu Ile Ser Gly Ile Phe Leu Ser Asn Gly Pro
230 235 240

GGC GAT CCT TTA AGC TTG CAG CAA GAA ATT GGC GAA ATC AAA CAA CTC 825
Gly Asp Pro Leu Ser Leu Gln Gln Glu Ile Gly Glu Ile Lys Gln Leu
245 250 255
(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:

Gln Cys Gly Glu Arg Asp Phe Arg Gly Leu Phe Phe Gln Ala Ile Arg
1 5 10 15

Val Ser Asn Tyr Ala Arg Asn Tyr Gln Val Lys His Asn Leu Ala Tyr
20 25 30

Trp Gly Ala Lys Asp Tyr Leu Gly Cys Gly Ala Gly Ala Val Gly Cys
35 40 45

Val Ala Asn Glu Arg Phe Phe Ala Lys Lys Leu Ile Glu Asn Tyr Ile
50 55 60

Lys Asp Pro Leu Gln Arg Gln Val Glu Thr Leu Asn Lys Gln Asp Lys
65 70 75 80

Arg Leu Glu Lys Leu Phe Leu Gly Leu Arg Cys Val Leu Gly Val Glu
85 90 95

Leu Ser Phe Leu Asp Glu Asn Lys Val Lys Phe Leu Ile Glu Glu Asn
100 105 110

Lys Ala Phe Ile Lys Asn Asn Arg Leu Ile Ala Ser Asp Phe Phe Met
115 120 125

Ala Asp Glu Met Ala Leu Trp Leu Leu
130 135

(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 55...1179
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

GTTGGTGTTT GAAAGGCTCT ATGAAGT^GA ATTAAGGCGC AAGGGTTTTT TATA ATG 57 

Met 
1 

GTC TCT CTC TAT TTA GAA AAC GGG CTT TTT TTG CAA GCG CAA AGT TTT 105
Val Ser Leu Tyr Leu Glu Asn Gly Leu Phe Leu Gln Ala Gln Ser Phe
5 10 15

GGG GCT AGC GGC ACG CAA GCG GGC GAG CTT GTT TTT AAC ACT TCT ATG 153
Gly Ala Ser Gly Thr Gln Ala Gly Glu Leu Val Phe Asn Thr Ser Met
20 25 30
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...411
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:

CAA TGT GGT GAG AGA GAT TTT AGA GGG CTT TTC TTT CAA GCA ATA CGA 48
Gln Cys Gly Glu Arg Asp Phe Arg Gly Leu Phe Phe Gln Ala Ile Arg
1 5 10 15

GTG TCT AAT TAC GCT AGA AAT TAT CAA GTC AAA CAC AAC TTG GCT TAC 96
Val Ser Asn Tyr Ala Arg Asn Tyr Gln Val Lys His Asn Leu Ala Tyr
20 25 30

TGG GGG GCT AAA GAT TAT TTA GGG TGC GGG GCT GGG GCT GTG GGC TGC 144
Trp Gly Ala Lys Asp Tyr Leu Gly Cys Gly Ala Gly Ala Val Gly Cys
35 40 45

GTG GCG AAT GAG CGC TTT TTT GCA AAA AAA CTC ATA GAA AAC TAC ATC 192
Val Ala Asn Glu Arg Phe Phe Ala Lys Lys Leu Ile Glu Asn Tyr Ile
50 55 60

AAA GAC CCC CTA CAA CGC CAA GTT GAG ACG CTT AAT AAA CAA GAC AAA 240
Lys Asp Pro Leu Gln Arg Gln Val Glu Thr Leu Asn Lys Gln Asp Lys
65 70 75 80

CGC TTA GAA AAG CTG TTT TTA GGC TTG AGG TGC GTG CTT GGG GTT GAG 288
Arg Leu Glu Lys Leu Phe Leu Gly Leu Arg Cys Val Leu Gly Val Glu
85 90 95

CTT AGT TTC TTA GAT GAA AAT AAA GTA AAG TTT TTG ATT GAA GAG AAC 336
Leu Ser Phe Leu Asp Glu Asn Lys Val Lys Phe Leu Ile Glu Glu Asn
100 105 110

AAG GCT TTC ATT AAA AAT AAC CGC TTG ATA GCG AGC GAT TTT TTC ATG 384
Lys Ala Phe Ile Lys Asn Asn Arg Leu Ile Ala Ser Asp Phe Phe Met
115 120 125

GCC GAT GAA ATG GCT TTG TGG CTG TTA TGATTGTAGG CTTTGCTTCA ATCAAGC 438
Ala Asp Glu Met Ala Leu Trp Leu Leu
130 135

GTTAATAAAA CGCTAGAAAG CGTTTTTTAA TGAATGCCTA CAAAATTTTT AGCCAAAAAC 498

CAACCAA 505

(2) INFORMATION FOR SEQ ID NO: 530: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
Ala Thr Lys Pro Leu Ser Ser Tyr Leu Arg Glu Ile Phe
195 200 205

TCATTCAAAA TATCGTATTT CATCACAAGC TTAGTGCCTT CATTGTCATA AGCGGCT 740

(2) INFORMATION FOR SEQ ID NO: 528:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

Met Phe Leu Gly Asn Pro His Ser Ser Ile Leu Arg Leu Gly Pro Thr
1 5 10 15

Thr Ile Thr Leu Leu Pro Glu Lys Ser Thr Arg Leu Pro Lys Arg Phe
20 25 30

Cys Leu Lys Arg Pro Cys Leu Pro Leu Met Ile Ser Leu Ser Asp Phe
35 40 45

Lys Gly Arg Leu Leu Ala Pro Leu Thr Ala Leu Val Leu Arg Pro Leu
50 55 60

Ser Lys Ser Thr Ser Thr Ala Ser Cys Asn Ile Leu Phe Ser Leu Arg
65 70 75 80

Thr Met Ile Ser Gly Ala Pro Ser Ser Ile Lys Arg Phe Lys Arg Trp
85 90 95

Leu Arg Leu Met Thr Arg Arg Tyr Asn Ser Phe Thr Ser Leu Thr Ala
100 105 110

Asn Phe Pro Pro Ser Ser Ala Thr Lys Gly Leu Lys Ser Gly Gly Asn
115 120 125

Thr Gly Lys Thr Val Ser Ile Ile His Ser Gly Leu Leu Pro Glu Phe
130 135 140

Lys Lys Leu Ser Thr Thr Phe Lys Arg Leu Met Ser Phe Phe Phe Phe
145 150 155 160

Ala Ser Glu Leu Val Ser Phe Thr Ser Ser Phe Lys Leu Cys Asn Lys
165 170 175

Val Ile Lys Ser Ile Ser Ser Asn Lys Ser Leu Ile Ala Ser Pro Pro
180 185 190

Ile Cys Ala Thr Lys Pro Leu Ser Ser Tyr Leu Arg Glu Ile Phe
195 200 205

(2) INFORMATION FOR SEQ ID NO: 529:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527:

CTCTCTCTTC AAGCTTGGAT AACAAATGCG GTTTGAAGAG TTCTAACGCC ATG TTT 56
Met Phe
1

TTA GGC AAC CCG CAT TCA TCC ATT TTG AGA TTA GGC CCA ACC ACA ATC 104
Leu Gly Asn Pro His Ser Ser Ile Leu Arg Leu Gly Pro Thr Thr Ile
5 10 15

ACG CTT CTG CCT GAA AAA TCC ACG CGC TTA CCT AAA AGG TTT TGC CTG 152
Thr Leu Leu Pro Glu Lys Ser Thr Arg Leu Pro Lys Arg Phe Cys Leu
20 25 30

AAA CGC CCC TGC TTG CCT TTA ATG ATT TCA CTG AGC GAT TTT AAA GGG 200
Lys Arg Pro Cys Leu Pro Leu Met Ile Ser Leu Ser Asp Phe Lys Gly
35 40 45 50

CGT TTG TTA GCC CCT TTA ACC GCA TTA GTG CTG CGG CCG TTA TCA AAA 248
Arg Leu Leu Ala Pro Leu Thr Ala Leu Val Leu Arg Pro Leu Ser Lys
55 60 65

AGC ACA TCC ACG GCT TCT TGC AAC ATC CTT TTT TCA TTG CGC ACA ATG 296
Ser Thr Ser Thr Ala Ser Cys Asn Ile Leu Phe Ser Leu Arg Thr Met
70 75 80

ATT TCT GGC GCT CCA AGC TCC ATT AAG CGT TTC AAG CGT TGG TTA CGA 344
Ile Ser Gly Ala Pro Ser Ser Ile Lys Arg Phe Lys Arg Trp Leu Arg
85 90 95

TTG ATG ACA CGA CGA TAC AAT TCA TTC ACA TCG CTG ACT GCA AAC TTC 392
Leu Met Thr Arg Arg Tyr Asn Ser Phe Thr Ser Leu Thr Ala Asn Phe
100 105 110

CCG CCA TCT AGC GCG ACT AAA GGC CTT AAA TCC GGT GGC AAT ACC GGT 440
Pro Pro Ser Ser Ala Thr Lys Gly Leu Lys Ser Gly Gly Asn Thr Gly
115 120 125 130

AAA ACC GTG AGC ATC ATC CAT TCA GGC CTA TTA CCA GAA TTT AAA AAG 488
Lys Thr Val Ser Ile Ile His Ser Gly Leu Leu Pro Glu Phe Lys Lys
135 140 145

CTT TCT ACC ACT TTC AAA CGC TTA ATG AGT TTT TTC TTT TTC GCA TCA 536
Leu Ser Thr Thr Phe Lys Arg Leu Met Ser Phe Phe Phe Phe Ala Ser
150 155 160

G;^ TTG GTG TCT TTC ACT TCT TCT TTC AAA CTC TGC AAT AAG GTG ATC 584 
Glu Leu Val Ser Phe Thr Ser Ser Phe Lys Leu Cys Asn Lys Val Ile
165 170 175 

AAA TCA ATT TCT TCT AAC AAA TCC TTG ATC GCT TCA CCG CCC ATT TGC 632 
Lys Ser Ile Ser Ser Asn Lys Ser Leu Ile Ala Ser Pro Pro Ile Cys
180 185 190 

GCT ACA AAG CCC CTG TCT TCG TAT CTT CGT GAG ATA TTT TGATACTGCT CT 683 
CCT AGC AAT ATC GCT AAG GAA ATT GTA GAA AAG CGC AAA GGC TGATTTAAT 491
Pro Ser Asn Ile Ala Lys Glu Ile Val Glu Lys Arg Lys Gly
135 140

TATAACGCTC TCTTATTTTT AGGGGGTGTT ATAGGTGCTG TTTA 535

(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

Met Gln Asn Gly Val Leu Ala Gly Tyr Pro Val Val Asp Phe Lys Val
1 5 10 15

Thr Leu Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser Glu Met Ala
20 25 30

Phe Lys Ile Ala Gly Ser Met Ala Phe Lys Glu Ala Ser Arg Ala Ala
35 40 45

Asn Pro Val Leu Leu Glu Pro Met Met Lys Val Glu Val Glu Val Pro
50 55 60

Glu Glu Tyr Met Gly Asp Val Ile Gly Asp Leu Asn Arg Arg Arg Gly
65 70 75 80

Gln Ile Asn Ser Met Asp Asp Arg Leu Gly Leu Lys Ile Val Asn Ala
85 90 95

Phe Val Pro Leu Val Glu Met Phe Gly Tyr Ser Thr Asp Leu Arg Ser
100 105 110

Ala Thr Gln Gly Arg Gly Thr Tyr Ser Met Glu Phe Asp His Tyr Gly
115 120 125

Glu Val Pro Ser Asn Ile Ala Lys Glu Ile Val Glu Lys Arg Lys Gly
130 135 140

(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...671
(D) OTHER INFORMATION: 
(2) INFORMATION FOR SEQ ID NO:525:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 535 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...482
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525:

GTGATCCCTA AAGAATATAT CCTGCGGTGG ATAAGGGTAT CCAAGAAGCG ATG CAA 56
Met Gln
1

AAT GGC GTT TTG GCA GGC TAT CCG GTG GTG GAT TTT AAA GTT ACC CTT 104
Asn Gly Val Leu Ala Gly Tyr Pro Val Val Asp Phe Lys Val Thr Leu
5 10 15

TAT GAT GGG AGC TAC CAT GAT GTG GAT TCT TCA GAA ATG GCG TTT AAA 152
Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser Glu Met Ala Phe Lys
20 25 30

ATC GCT GGC TCT ATG GCG TTT AAA GAA GCG AGT CGC GCG GCT AAC CCG 200
Ile Ala Gly Ser Met Ala Phe Lys Glu Ala Ser Arg Ala Ala Asn Pro
35 40 45 50

GTT TTA CTA GAG CCT ATG ATG AAA GTG GAA GTG GAA GTC CCT GAA GAA 248
Val Leu Leu Glu Pro Met Met Lys Val Glu Val Glu Val Pro Glu Glu
55 60 65

TAC ATG GGC GAT GTG ATT GGC GAT TTG AAT AGA AGA AGA GGG CAA ATC 296
Tyr Met Gly Asp Val Ile Gly Asp Leu Asn Arg Arg Arg Gly Gln Ile
70 75 80

AAT TCT ATG GAC GAT AGA TTA GGC TTG AAA ATC GTG AAC GCT TTT GTG 344
Asn Ser Met Asp Asp Arg Leu Gly Leu Lys Ile Val Asn Ala Phe Val
85 90 95

CCG TTG GTG GAA ATG TTT GGC TAT TCT ACG GAT TTA CGA TCA GCC ACC 392
Pro Leu Val Glu Met Phe Gly Tyr Ser Thr Asp Leu Arg Ser Ala Thr
100 105 110

CAA GGG CGT GGG ACT TAC TCT ATG GAG TTT GAT CAT TAT GGC GAA GTG 440
Gln Gly Arg Gly Thr Tyr Ser Met Glu Phe Asp His Tyr Gly Glu Val
115 120 125 130
CTC TCA AAG CCT TTT GTT CGT TCA AAA AAT ATT TGC TGATGTTAAT AAAGAA 937
Leu Ser Lys Pro Phe Val Arg Ser Lys Asn Ile Cys
260 265 

ATAGAAGCAG TTGCTAATAC TGAAAAGAAA GCAGAAAAAG MGGGTTATGG TTATAGTAAA 997 
AGGATGTAGG CATAAGAAAA TAAGAAC 1024 

(2) INFORMATION FOR SEQ ID NO:524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:524: 

Met Lys Arg Val Arg Glu Leu Val Lys Lys His Pro Glu Lys Ser Ser
1 5 10 15

Val Ala Leu Val Val Leu Thr His Ala Ala Cys Lys Lys Ala Lys Glu
20 25 30

Leu Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu Asn
35 40 45

Gln Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Thr Ser Leu
50 55 60

Leu Leu Ala Ala Cys Ser Val Gly Asp Ile Asp Lys Gln Ile Glu Leu
65 70 75 80

Glu Gln Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn Lys
85 90 95

Ser Gly Ile Glu Leu Glu Gln Glu Lys Gln Lys Thr Ile Lys Glu Gln
100 105 110

Lys Asp Leu Val Lys Lys Ala Glu Gln Asn Cys Gln Glu Asn His Gly
115 120 125

Gln Phe Phe Met Lys Lys Leu Gly Ile Lys Gly Gly Ile Ala Ile Glu
130 135 140

Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn Gln Thr
145 150 155 160

Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro His Ser Gln
165 170 175

Arg Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr Leu Gln Lys Glu Leu
180 185 190

Glu Ser Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln Val Asn Phe
195 200 205

Tyr Arg Pro Ser Ser Val Ala Tyr Leu Glu Leu Asp Pro Arg Asp Phe
210 215 220

Lys Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile Arg Ser Lys
225 230 235 240

Ala Gln Ala Lys Met Leu Gly Asn Glu Lys Pro Thr Ser Pro Pro Phe
245 250 255

Asn Leu Ser Lys Pro Phe Val Arg Ser Lys Asn Ile Cys
260 265
GAC GAT AAA GTC CAG GAT AAA TCC AAA CAA GCT GAA AAA GAA AAT CAA 261
Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu Asn Gln
35 40 45

ATC AAT TGG TGG AAA TAT TCA GGA TTA ACA ATA GCG ACA AGT TTA TTA 309
Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Thr Ser Leu Leu
50 55 60 65

TTA GCC GCT TGT AGT GTT GGT GAT ATT GAT AAA CAG ATA GAG TTA GAA 357
Leu Ala Ala Cys Ser Val Gly Asp Ile Asp Lys Gln Ile Glu Leu Glu
70 75 80

CAA GAA AAA AAG GAA GCT GAA AAC GCT AGG GAT AGA GCG AAC AAG AGT 405
Gln Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn Lys Ser
85 90 95

GGG ATA GAA CTG GAA CAG GAA AAA CAA AAG ACC ATT AAA GAA CAA AAA 453
Gly Ile Glu Leu Glu Gln Glu Lys Gln Lys Thr Ile Lys Glu Gln Lys
100 105 110

GAT TTA GTT AAA AAA GCA GAA CAA AAT TGC CAA GAA AAT CAT GGC CAA 501
Asp Leu Val Lys Lys Ala Glu Gln Asn Cys Gln Glu Asn His Gly Gln
115 120 125

TTC TTT ATG AAA AAA TTA GGA ATT AAG GGT GGC ATT GCT ATA GAA GTA 549
Phe Phe Met Lys Lys Leu Gly Ile Lys Gly Gly Ile Ala Ile Glu Val
130 135 140 145

GAA GCT GAA TGC AAA ACC CCT AAA CCT GCA AAA ACC AAT CAA ACC CCT 597
Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn Gln Thr Pro
150 155 160

ATC CAG CCA AAA CAC CTC CCC AAC TCT AAA CAA CCC CAC TCT CAA AGA 645
Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro His Ser Gln Arg
165 170 175

GGA TCA AAA GCG CAA GAG CTT ATC GCT TAT TTG CAA AAA GAG TTA GAA 693
Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr Leu Gln Lys Glu Leu Glu
180 185 190

TCT CTG CCC TAT TCA CAA AAA GCT ATC GCT AAA CAA GTG AAT TTT TAC 741
Ser Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln Val Asn Phe Tyr
195 200 205

AGG CCA AGT TCT GTC GCT TAT TTA GAA CTA GAC CCT AGA GAT TTT AAG 789
Arg Pro Ser Ser Val Ala Tyr Leu Glu Leu Asp Pro Arg Asp Phe Lys
210 215 220 225

GTT ACA GAA GAA TGG CAA AAA GAA AAT CTA AAA ATA CGC TCT AAA GCT 837
Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile Arg Ser Lys Ala
230 235 240

CAA GCT AAA ATG CTT GGA AAT GAG AAA CCC ACA AGC CCA CCT TTC AAC 885
Gln Ala Lys Met Leu Gly Asn Glu Lys Pro Thr Ser Pro Pro Phe Asn
245 250 255
210 215 220

Ser Leu Val Ala Ile Val Val Trp Leu Lys Ile Glu Met Val Leu Gly
225 230 235 240

Ala Phe Leu Ala Gly Leu Val Val Ser Thr Phe Phe Pro His Lys Ser
245 250 255

Glu Leu Ile His Lys Leu Asn Asp Val Gly Phe Gly Phe Phe Val Pro
260 265 270

Leu Phe Phe Ile His Val Gly Ser Thr Leu Asp Leu Lys Leu Val Phe
275 280 285

Leu Asn Pro His Leu Ile Leu Gln Gly Ile Leu Ile Val Ile Ala Met
290 295 300

Leu Ser Leu His Leu Ile Thr Ser Thr Leu Leu Trp Arg Lys Tyr Phe
305 310 315 320

Lys Glu Ala Lys His Leu Phe Ser Phe Ala Leu Gly Ala Ser Met Pro
325 330 335

Leu Thr Phe Leu Val Thr Thr Ala Ala Val Gly Leu Lys Ala Gln Ala
340 345 350

Ile Ser Gln Asn Thr Tyr Tyr Ala Leu Leu Met Ala Ala Ile Phe Glu
355 360 365

Gly Val Leu Phe Thr Ile Ala Ile Lys Ile Leu Asn Lys Lys Ala
370 375 380

(2) INFORMATION FOR SEQ ID NO:523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1024 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 115...921
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:523: 

AGTTGGCAAA AACGCAGAGA CAGTAACGCA AAGGCAAATA AAGAGACTCA TTTTAAACAA 60 
GCGAATGCCA TTACAAATAT AATCAGATCA GTTGGTGGGT TTTTTACAAA GATT ATG 117 

Met 
1 

AAG AGA GTT AGA GAA CTT GTA AAA AAA CAT CCC GAG AAA AGC AGT GTG 165 
Lys Arg Val Arg Glu Leu Val Lys Lys His Pro Glu Lys Ser Ser Val 
5 10 15 

GCA TTA GTA GTA TTA ACC CAT GCT GCA TGC AAG AAA GCG AAA GAA TTG 213 
Ala Leu Val Val Leu Thr His Ala Ala Cys Lys Lys Ala Lys Glu Leu 
20 25 30 
GCA GTA GGC TTA AAA GCG CAA GCG ATC TCA CAA AAC ACC TAC TAC GCA 1168
Ala Val Gly Leu Lys Ala Gln Ala Ile Ser Gln Asn Thr Tyr Tyr Ala
345 350 355 360

TTG CTC ATG GCG GCT ATT TTT GAA GGG GTA TTA TTC ACG ATT GCG ATC 1216
Leu Leu Met Ala Ala Ile Phe Glu Gly Val Leu Phe Thr Ile Ala Ile
365 370 375

AAA ATA CTC AAC AAA AAA GCT TGAATGAAAG CTTAAGCGTC TAAATATTTA GCGT 1271
Lys Ile Leu Asn Lys Lys Ala
380

CGCTAAAGCT GTTCGCTTGA ACATTATTGA ACGCATTCTC TAAGCTATCA AAGAAACGAG 1331

GGTGCAAGTT TTGCATTTCT TTTAAGAAA 1360

(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

Met His Ala Glu Phe Phe Thr Phe Ala Leu Ile Met Leu Leu Ile Val
1 5 10 15

Met Ala Pro Tyr Met Ser Arg Ile Ser Arg Leu Pro Ile Thr Val Val
20 25 30

Glu Ile Leu Phe Gly Ser Val Gly Ala Tyr Val Gly Phe Ile Glu Pro
35 40 45

Thr Lys Gly Phe Glu Ile Met Ser Glu Ile Gly Phe Leu Phe Leu Met
50 55 60

Phe Leu Cys Gly Leu Glu Val Glu Ile Tyr Leu Phe Lys Lys Leu Gly
65 70 75 80

Val Ser Leu Leu Lys Arg Ile Phe Ala Tyr Leu Leu Ile Leu Tyr Thr
85 90 95

Leu Ser Phe Ile Leu Thr Phe Ser Leu Asn Leu Glu Pro Ile Phe Met
100 105 110

Val Ile Phe Pro Ile Ile Ser Leu Gly Met Ile Met Thr Leu Val Lys
115 120 125

Asp Tyr Arg Lys Glu Ile Leu Trp Leu Asp Leu Val Leu Lys Val Gly
130 135 140

Val Ile Gly Glu Leu Leu Ser Ile Phe Gly Leu Val Val Val Asp Gly
145 150 155 160

Val Tyr Ser His Gly Leu Gly Met Asp Leu Ile Lys Asp Leu Gly Ile
165 170 175

Leu Ile Val Phe Leu Ile Leu Ile Ile Val Ala Phe Gln Ile Phe Lys
180 185 190

Thr Leu Phe Trp Trp Phe Pro His Leu Lys Leu Phe Val Met Pro Lys
195 200 205

Ser Ser Gln Phe Asn Gln Asp Val Arg Phe Ser Leu Met Leu Phe Phe
GGC ATG ATC ATG ACT TTA GTC AAA GAT TAT CGT AAA GAG ATT TTG TGG 496
Gly Met Ile Met Thr Leu Val Lys Asp Tyr Arg Lys Glu Ile Leu Trp
125 130 135

CTT GAT TTG GTT TTG AAA GTG GGC GTT ATT GGG GAA TTG TTA AGC ATT 544
Leu Asp Leu Val Leu Lys Val Gly Val Ile Gly Glu Leu Leu Ser Ile
140 145 150

TTT GGT TTG GTG GTC GTG GAT GGG GTG TAT TCG CAT GGT TTG GGC ATG 592
Phe Gly Leu Val Val Val Asp Gly Val Tyr Ser His Gly Leu Gly Met
155 160 165

GAT TTG ATT AAA GAT TTA GGC ATT CTC ATT GTT TTT TTA ATC TTA ATT 640
Asp Leu Ile Lys Asp Leu Gly Ile Leu Ile Val Phe Leu Ile Leu Ile
170 175 180

ATC GTG GCG TTT CAA ATC TTT AAG ACT TTG TTT TGG TGG TTC CCG CAT 688
Ile Val Ala Phe Gln Ile Phe Lys Thr Leu Phe Trp Trp Phe Pro His
185 190 195 200

TTA AAG CTT TTT GTG ATG CCT AAA AGC AGT CAG TTT AAC CAA GAT GTG 736
Leu Lys Leu Phe Val Met Pro Lys Ser Ser Gln Phe Asn Gln Asp Val
205 210 215

CGT TTT TCG CTC ATG CTC TTT TTT TCC TTA GTT GCG ATC GTG GTG TGG 784
Arg Phe Ser Leu Met Leu Phe Phe Ser Leu Val Ala Ile Val Val Trp
220 225 230

CTC AAA ATA GAA ATG GTT TTA GGG GCG TTT CTA GCA GGG TTA GTC GTT 832
Leu Lys Ile Glu Met Val Leu Gly Ala Phe Leu Ala Gly Leu Val Val
235 240 245

TCT ACT TTT TTC CCT CAT AAA TCA GAA TTG ATC CAC AAG CTC AAT GAT 880
Ser Thr Phe Phe Pro His Lys Ser Glu Leu Ile His Lys Leu Asn Asp
250 255 260

GTG GGT TTT GGG TTT TTT GTG CCT TTG TTT TTC ATC CAT GTA GGC TCT 928
Val Gly Phe Gly Phe Phe Val Pro Leu Phe Phe Ile His Val Gly Ser
265 270 275 280

ACT TTA GAC TTA AAA TTA GTG TTT TTA AAC CCG CAT TTG ATT CTC CAA 976
Thr Leu Asp Leu Lys Leu Val Phe Leu Asn Pro His Leu Ile Leu Gln
285 290 295

GGG ATA TTG ATT GTC ATA GCG ATG TTG AGT TTG CAC TTG ATC ACT TCA 1024
Gly Ile Leu Ile Val Ile Ala Met Leu Ser Leu His Leu Ile Thr Ser
300 305 310

ACC TTA TTG TGG CGC AAA TAC TTT AAA GAG GCT AAG CAT TTA TTT TCA 1072
Thr Leu Leu Trp Arg Lys Tyr Phe Lys Glu Ala Lys His Leu Phe Ser
315 320 325

TTC GCT TTA GGG GCT TCT ATG CCT TTA ACT TTT TTA GTA ACC ACC GCA 1120
Phe Ala Leu Gly Ala Ser Met Pro Leu Thr Phe Leu Val Thr Thr Ala
330 335 340
435 440

(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 89...1237
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 

GAAATCAAAA AAGGGGTTTT TACCACTTAT AAAGAGAAAA ACCCTATCGC TTTGAAAACT 60

TAACCAATAA ATAAGGTAAA ATTAAGAC ATG CAT GCA GAA TTT TTC ACT TTC 112
Met His Ala Glu Phe Phe Thr Phe
1 5

GCG CTC ATC ATG CTT TTA ATT GTG ATG GCC CCT TAT ATG TCT AGA ATC 160
Ala Leu Ile Met Leu Leu Ile Val Met Ala Pro Tyr Met Ser Arg Ile
10 15 20

TCT CGT TTG CCT ATC ACG GTT GTG GAG ATT TTA TTT GGA TCT GTT GGG 208
Ser Arg Leu Pro Ile Thr Val Val Glu Ile Leu Phe Gly Ser Val Gly
25 30 35 40

GCG TAT GTG GGT TTT ATT GAG CCT ACT AAG GGC TTT GAA ATC ATG TCC 256
Ala Tyr Val Gly Phe Ile Glu Pro Thr Lys Gly Phe Glu Ile Met Ser
45 50 55

GAA ATT GGC TTT TTG TTT TTA ATG TTT TTA TGC GGT TTG GAA GTG GAG 304
Glu Ile Gly Phe Leu Phe Leu Met Phe Leu Cys Gly Leu Glu Val Glu
60 65 70

ATT TAT CTG TTC AAA AAA TTA GGG GTT TCT CTT TTA AAA CGC ATT TTT 352
Ile Tyr Leu Phe Lys Lys Leu Gly Val Ser Leu Leu Lys Arg Ile Phe
75 80 85

GCT TAT CTG TTG ATT TTA TAC ACG CTT TCG TTT ATT CTT ACT TTT AGC 400
Ala Tyr Leu Leu Ile Leu Tyr Thr Leu Ser Phe Ile Leu Thr Phe Ser
90 95 100

CTT AAT TTA GAG CCT ATT TTT ATG GTG ATT TTC CCT ATT ATT AGT TTG 448
Leu Asn Leu Glu Pro Ile Phe Met Val Ile Phe Pro Ile Ile Ser Leu
105 110 115 120
Met Phe Lys Lys Ile Phe Pro Leu Ala Leu Val Ser Ser Leu Arg Phe
1 5 10 15

Leu Gly Leu Phe Ile Val Leu Pro Val Ile Ser Leu Tyr Ala Asp Ser
20 25 30

Phe His Ser Ser Ser Pro Leu Leu Val Gly Leu Ala Val Gly Gly Ala
35 40 45

Tyr Leu Thr Gln Ile Val Phe Gln Thr Pro Met Gly Ile Leu Ser Asp
50 55 60

Lys Ile Gly Arg Lys Val Val Val Met Val Cys Leu Leu Leu Phe Leu
65 70 75 80

Ala Gly Ser Leu Val Cys Phe Ile Ala Asn Asp Ile Val Trp Leu Val
85 90 95

Ile Gly Arg Phe Ile Gln Gly Met Gly Ala Leu Gly Gly Val Ile Ser
100 105 110

Ala Met Val Ala Asp Glu Val Lys Glu Glu Glu Arg Thr Lys Ala Met
115 120 125

Ala Ile Met Gly Ala Phe Ile Phe Ile Ser Phe Thr Ile Ser Met Ala
130 135 140

Ile Gly Pro Gly Val Val Ala Phe Leu Gly Gly Ala Lys Trp Leu Phe
145 150 155 160

Leu Leu Thr Ala Ile Leu Thr Leu Leu Ser Leu Leu Met Leu Leu Lys
165 170 175

Val Lys Asp Ala Pro Lys Ile Ser Tyr Gln Ile Lys Asn Ile Lys Ala
180 185 190

Tyr Gln Pro Asn Ser Lys Ala Leu Tyr Leu Leu Tyr Leu Ser Ser Phe
195 200 205

Phe Glu Lys Ala Phe Met Thr Leu Ile Phe Val Leu Ile Pro Leu Ala
210 215 220

Leu Val Asn Glu Phe His Lys Asp Glu Ser Phe Leu Ile Leu Val Tyr
225 230 235 240

Val Pro Gly Ala Leu Leu Gly Val Leu Ser Met Gly Ile Ala Ser Val
245 250 255

Met Ala Glu Lys Tyr Asn Lys Pro Lys Gly Val Met Leu Ser Gly Val
260 265 270

Leu Leu Phe Ile Val Ser Tyr Leu Cys Leu Phe Leu Ala Asp Ser Ser
275 280 285

Phe Leu Gly Lys Tyr Leu Trp Leu Phe Ile Val Gly Val Ala Phe Phe
290 295 300

Phe Ile Gly Phe Ala Thr Leu Glu Pro Ile Met Gln Ser Leu Ala Ser
305 310 315 320

Lys Phe Ala Lys Val His Glu Lys Gly Lys Val Leu Gly Gln Phe Thr
325 330 335

Thr Phe Gly Tyr Leu Gly Ser Phe Val Gly Gly Val Ser Gly Gly Leu
340 345 350

Ser Tyr His His Leu Gly Val Ser Asn Thr Ser Leu Ile Val Val Ala
355 360 365

Leu Gly Leu Ile Trp Gly Leu Ser Leu Phe Leu Leu Asn Asn Pro Ser
370 375 380

Lys Gln Lys Asn Val Tyr Phe Pro Leu Asp Ala Tyr Asn Glu Glu Gln
385 390 395 400

Phe Glu Thr Leu Glu Asp Lys Ile Ile Glu Trp Tyr Val Asn Ile Ser
405 410 415

Glu Glu Ile Ile Ile Val Lys Tyr Asn Ser Asp His Ile Ser Glu Glu
420 425 430

Glu Ile Ile His Leu Ala Gln Asn Phe Arg Lys




WO 98/43478 PCT/US98/06371

TTA GCC GAC TCT AGC TTT TTA GGG AAA TAT TTA TGG CTT TTT ATT GTT 97 5 

Leu Ala Asp Ser Ser Phe Leu Gly Lys Tyr Leu Trp Leu Phe lie Val 
285 290 295 

GGG GTG GCG TTT TTC TTT ATT GGT TTT GCC ACC TTA GAG CCT ATC ATG 1023
Gly Val Ala Phe Phe Phe Ile Gly Phe Ala Thr Leu Glu Pro Ile Met
300 305 310 315 

CAA TCT TTA GCG TCT AAA TTC GCC AAA GTG CAT GAA AAA GGC AAG GTT 1071 
Gin Ser Leu Ala Ser Lys Phe Ala Lys Val His Glu Lys Gly Lys Val 
320 325 330 

TTA GGG CAA TTC ACT ACT TTT GGC TAT TTA GGG AGC TTT GTT GGG GGC 1119 
Leu Gly Gin Phe Thr Thr Phe Gly Tyr Leu Gly Ser Phe Val Gly Gly 
335 340 345 

GTG AGC GGG GGG TTG AGC TAC CAT CAT TTA GGC GTT TCT AAC ACA AGC 1167
Val Ser Gly Gly Leu Ser Tyr His His Leu Gly Val Ser Asn Thr Ser 
350 355 360 

TTG ATC GTT GTA GCT TTA GGG CTT ATT TGG GGG CTA TCG CTC TTT TTA 1215 
Leu He Val Val Ala Leu Gly Leu lie Trp Gly Leu Ser Leu Phe Leu 
365 370 375 

CTC AAC AAC CCT TCC AAG CAA AAA AAT GTC TAT TTC CCC TTA GAC GCT 1263 
Leu Asn Asn Pro Ser Lys Gin Lys Asn Val Tyr Phe Pro Leu Asp Ala 
380 385 390 395 

TAC AAT GAG GAA CAA TTT GAA ACT TTA GAG GAT AAA ATC ATT GAA TGG 1311 
Tyr Asn Glu Glu Gin Phe Glu Thr Leu Glu Asp Lys He He Glu Trp 
400 405 410 

TAT GTT AAT ATT AGC GAA GAA ATC ATT ATT GTG AAA TAT AAT TCC GAT 1359
Tyr Val Asn Ile Ser Glu Glu Ile Ile Ile Val Lys Tyr Asn Ser Asp
415 420 425

CAC ATT AGC GAA GAA GAA ATC ATT CAC TTA GCG CAA AAC TTT AGA AAA T 1408
His Ile Ser Glu Glu Glu Ile Ile His Leu Ala Gln Asn Phe Arg Lys
430 435 440

AAAACAATTA AGGATCAAAA ATGGCCTATG AA 1440 

(2) INFORMATION FOR SEQ ID NO: 520:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 443 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 
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GGC ATT CTT AGC GAT AAG ATA GGC CGT AAA GTG GTG GTT ATG GTG TGC 3 03 

Gly lie Leu Ser Asp Lys lie Gly Arg Lys Val Val Val Met Val Cys 
60 65 70 75 



TTG CTG TTG TTT TTA GCC GGC TCG TTA GTG TGC TTT ATA GCG AAT GAT 3 51 

Leu Leu Leu Phe Leu Ala Gly Ser Leu Val Cys Phe lie Ala Asn Asp 
80 85 90 



ATT GTT TGG CTC GTT ATA GGG CGC TTC ATT CAA GGC ATG GGG GCT TTA 3 99 

lie Val Trp Leu Val lie Gly Arg Phe He Gin Gly Met Gly Ala Leu 
95 100 105 



GGG GGG GTT ATT AGT GCG ATG GTG GCG GAT GAA GTG AAA GAA GAA GAG 44 7 

Gly Gly Val He Ser Ala Met Val Ala Asp Glu Val Lys Glu Glu Glu 
110 115 120 



CGC ACC AAA GCC ATG GCC ATC ATG GGA GCG TTT ATT TTC ATT AGC TTC 4 95 

Arg Thr Lys Ala Met Ala He Met Gly Ala Phe He Phe He Ser Phe 
125 130 135 



ACT ATA AGC ATG GCG ATT GGC CCT GGG GTT GTA GCG TTT TTG GGG GGG 54 3 

Thr He Ser Met Ala He Gly Pro Gly Val Val Ala Phe Leu Gly Gly 
140 145 150 155 



GCA AAA TGG CTC TTT TTA CTC ACG GCG ATC TTA ACT TTA TTG AGT TTA 591 
Ala Lys Trp Leu Phe Leu Leu Thr Ala He Leu Thr Leu Leu Ser Leu 
160 165 170 



TTG ATG CTT TTA AAA GTC AAA GAC GCC CCT AAA ATT TCT TAC CAG ATC 63 9 

Leu Met Leu Leu Lys Val Lys Asp Ala Pro Lys He Ser Tyr Gin He 
175 180 185 



A;^ AAC ATA AAA GCT TAC CAA CCC AAC TCT AAA GCC TTG TAT CTT TTG 687 
Lys Asn He Lys Ala Tyr Gin Pro Asn Ser Lys Ala Leu Tyr Leu Leu 
190 195 200 



TAT CTA AGC TCT TTT TTT GAA AAA GCG TTC ATG ACG CTT ATT TTT GTG 73 5 

Tyr Leu Ser Ser Phe Phe Glu Lys Ala Phe Met Thr Leu He Phe Val 
205 210 215 



CTG ATC CCT TTA GCC TTA GTG AAT GAA TTT CAT AAA GAT GAA AGC TTT 7 83 

Leu He Pro Leu Ala Leu Val Asn Glu Phe His Lys Asp Glu Ser Phe 
220 225 230 235 



TTA ATC TTG GTG TAT GTG CCT GGA GCC TTA TTA GGG GTC TTA AGC ATG 831 
Leu He Leu Val Tyr Val Pro Gly Ala Leu Leu Gly Val Leu Ser Met 
240 245 250 



GGA ATA GCG AGC GTT ATG GCT GAA AAA TAC AAC AAG CCT AAA GGA GTG 87 9 

Gly He Ala Ser Val Met Ala Glu Lys Tyr Asn Lys Pro Lys Gly Val 
255 260 265 



ATG CTT TCT GGC GTA TTA TTG TTT ATT GTG AGT TAT TTG TGC TTG TTT 92 7 

Met Leu Ser Gly Val Leu Leu Phe He Val Ser Tyr Leu Cys Leu Phe 
270 275 280 
Lys Val Gly Leu Lys Asp Ile Ile Met Ala Thr Gly Ala Ser Thr Asp
100 105 110

Ser Lys Thr Asn Arg Val Arg Phe Leu Asn His Asp Leu Ser Ala Thr
115 120 125

Pro Asp Phe Glu Leu Ser Leu Arg Ala Tyr Gln Thr Ala Lys Arg Leu
130 135 140

Gly Ile Asp Leu Lys Val Gly Asn Val Phe Ser Ser Asp Phe Phe Tyr
145 150 155 160

Ser Phe Glu Thr His Ala Phe Asp Leu Met Ala Lys Tyr Asn His Leu
165 170 175

Ala Ile Glu Met Glu Ala Ala Gly Leu Tyr Ala Thr Ala Met Glu Leu
180 185 190

Asn Ala Lys Ala Leu Cys Leu Cys Ser Val Ser Asp His Leu Ile Thr
195 200 205

Lys Glu Ala Leu Ser Pro Lys Glu Arg Val Glu Ser Phe Asp Asn Met
210 215 220

Ile Ile Leu Ala Leu Glu Met Met Ser
225 230



(2) INFORMATION FOR SEQ ID NO: 519: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 79...1407
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 



TACATTAAAC TTTTATTATC AATCCTACTT CCTCAAATTG ATGACAAAAG TTGGGCATTT 60 
TCTTGTAAAA TAACCCGC ATG TTT AAG AAA ATT TTT CCA TTA GCG TTA GTG 111 

Met Phe Lys Lys Ile Phe Pro Leu Ala Leu Val
1 5 10



TCA TCG TTG CGG TTT TTG GGG CTT TTT ATT GTT TTG CCG GTC ATT AGT 159
Ser Ser Leu Arg Phe Leu Gly Leu Phe Ile Val Leu Pro Val Ile Ser
15 20 25 



TTG TAT GCG GAT AGT TTC CAT TCA AGC AGT CCC TTA CTC GTG GGG TTG 2 07 

Leu Tyr Ala Asp Ser Phe His Ser Ser Ser Pro Leu Leu Val Gly Leu 
30 35 40 



GCT GTG GGC GGA GCG TAT CTT ACG CAA ATT GTT TTT CAA ACC CCC ATG 255 
Ala Val Gly Gly Ala Tyr Leu Thr Gin He Val Phe Gin Thr Pro Met 
45 50 55 
ACG CCT GAT TTT GAA TTG AGT TTA AGA GCG TAT CAA ACA GCA AAG CGT 494
Thr Pro Asp Phe Glu Leu Ser Leu Arg Ala Tyr Gln Thr Ala Lys Arg
130 135 140 

TTG GGT ATT GAT TTG AAA GTG GGC AAT GTT TTT TCA AGC GAT TTT TTC 54 2 

Leu Gly lie Asp Leu Lys Val Gly Asn Val Phe Ser Ser Asp Phe Phe 
145 150 155 

TAT TCT TTT GAA ACG CAT GCC TTT GAT TTA ATG GCT AAA TAC AAC CAC 590 
Tyr Ser Phe Glu Thr His Ala Phe Asp Leu Met Ala Lys Tyr Asn His 
160 165 170 175 

TTG GCT ATT GAA ATG GAA GCG GCG GGG TTA TAC GCC ACG GCG ATG GAA 63 8 

Leu Ala lie Glu Met Glu Ala Ala Gly Leu Tyr Ala Thr Ala Met Glu 
180 185 190 

CTA AAC GCT AAG GCT TTA TGC TTA TGC TCA GTC TCA GAT CAC TTA ATC 686 
Leu Asn Ala Lys Ala Leu Cys Leu Cys Ser Val Ser Asp His Leu lie 
195 200 205 

ACT AAA GAA GCC TTA AGC CCT AAA GAA AGG GTA GAA AGC TTT GAT AAC 734 
Thr Lys Glu Ala Leu Ser Pro Lys Glu Arg Val Glu Ser Phe Asp Asn 
210 215 220 

ATG ATA ATT TTG GCT TTG GAG ATG ATG AGT TAGCCTTTTT TGCCCCCATA AGT 787 
Met lie lie Leu Ala Leu Glu Met Met Ser 
225 230 

TAAGGATAAA ATTTAAAGGA AAACCCTTAA AGCTAAAAGC CTTAAGGGAA CTTTGG 843 

(2) INFORMATION FOR SEQ ID NO:518: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

Met Thr Pro His Ile Asn Ala Lys Ile Gly Asp Phe Tyr Pro Gln Cys
1 5 10 15

Leu Leu Cys Gly Asp Pro Leu Arg Val Ser Tyr Ile Ala Lys Lys Phe
20 25 30

Leu Gln Asp Ala Lys Glu Ile Thr Asn Val Arg Asn Met Leu Gly Phe
35 40 45

Ser Gly Lys Tyr Lys Gly Arg Gly Ile Ser Leu Met Gly His Gly Met
50 55 60

Gly Ile Ala Ser Cys Thr Ile Tyr Val Thr Glu Leu Ile Lys Thr Tyr
65 70 75 80

Gln Val Lys Glu Leu Leu Arg Ile Gly Thr Cys Gly Ala Ile Ser Pro
85 90 95
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435 

(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 66...764
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:517: 

GCTCACTTTT TGGGATTAAG CCCCCTAGAT TATGGCAAAA ACTTATTAAA CTTTAAAGGA 60

CAACC ATG ACC CCT CAC ATC AAC GCC AAA ATC GGC GAT TTT TAT CCT CAA 110
Met Thr Pro His Ile Asn Ala Lys Ile Gly Asp Phe Tyr Pro Gln
1 5 10 15

TGC CTT TTA TGC GGC GAT CCC TTA AGG GTG AGC TAC ATT GCA AAA AAA 15 8 

Cys Leu Leu Cys Gly Asp Pro Leu Arg Val Ser Tyr lie Ala Lys Lys 
20 25 30 

TTC TTA CAA GAC GCC AAA GAG ATC ACG AAT GTG CGT AAC ATG CTA GGC 2 06 

Phe Leu Gin Asp Ala Lys Glu lie Thr Asn Val Arg Asn Met Leu Gly 
35 40 45 

TTT AGC GGG AAG TAT AAG GGT AGG GGG ATT TCT TTA ATG GGG CAT GGC 2 54 

Phe Ser Gly Lys Tyr Lys Gly Arg Gly lie Ser Leu Met Gly His Gly 
50 55 60 

ATG GGC ATT GCG TCA TGC ACG ATT TAT GTA ACC GAA CTC ATT AAA ACC 3 02 

Met Gly lie Ala Ser Cys Thr lie Tyr Val Thr Glu Leu lie Lys Thr 
65 70 75 

TAT CAG GTT AAA GAG CTT TTA AGG ATT GGC ACT TGC GGG GCG ATT AGC 350 
Tyr Gin Val Lys Glu Leu Leu Arg lie Gly Thr Cys Gly Ala lie Ser 
80 85 90 95 

CCA AAA GTT GGC CTG AAA GAC ATT ATC ATG GCG ACG GGG GCT TCA ACG 3 98 

Pro Lys Val Gly Leu Lys Asp lie lie Met Ala Thr Gly Ala Ser Thr 
100 105 110 

GAT TCT AAA ACC AAT CGG GTG CGT TTT TTA AAC CAC GAT TTG AGC GCA 44 6 

Asp Ser Lys Thr Asn Arg Val Arg Phe Leu Asn His Asp Leu Ser Ala 
115 120 125 
Met Gly Phe Phe Lys Leu Lys Glu His Asn Thr Asn Ile Ala Thr Glu
1 5 10 15

Phe Arg Ala Gly Leu Thr Thr Phe Ile Thr Met Ile Tyr Ile Val Pro
20 25 30

Leu Asn Ala Leu Ile Leu Ser Gln Ala Asn Met Pro Tyr Glu Ala Leu
35 40 45

Leu Ser Ala Thr Ala Ile Ile Thr Ile Leu Ser Ser Val Phe Asn Gly
50 55 60

Leu Trp Ala Asn Thr Pro Ile Ala Met Ser Val Gly Leu Gly Leu Ser
65 70 75 80

Ala Tyr Phe Ser Phe Gly Leu Val Gln Gly Leu Lys Leu Pro Trp Gln
85 90 95

Ser Ala Leu Gly Ile Val Ala Leu Ser Gly Ala Ile Phe Val Ile Leu
100 105 110

Ser Phe Thr Lys Phe Arg Ser Trp Val Met Arg Ser Ile Pro Ser Asp
115 120 125

Leu Arg Arg Ala Val Ser Ala Gly Ile Gly Ala Phe Ile Ala Phe Ile
130 135 140

Gly Leu Lys Glu Met His Ile Val Val Thr His Xaa Ala Thr Leu Val
145 150 155 160

Thr Leu Gly Asp Phe Gly Asp Pro His Val Leu Leu Gly Val Val Gly
165 170 175

Ile Ile Leu Thr Phe Ala Leu Tyr Thr Leu Lys Ile Arg Gly Ser Phe
180 185 190

Ile Ile Ala Val Leu Ile Thr Ser Ile Leu Ala Trp Val Leu Lys Leu
195 200 205

Ala Pro Tyr Pro Ser Glu Phe Phe Ser Met Pro Ala Ser Ile Gly Pro
210 215 220

Ile Ala Phe Gln Leu Asp Phe Lys Gly Ile Phe Phe Asp Ala Ser Gly
225 230 235 240

Ala Phe Thr Leu Ala Leu Val Pro Val Ile Ile Thr Phe Phe Val Thr
245 250 255

Asp Leu Phe Asp Ser Leu Gly Thr Leu Ala Gly Ile Gly His Lys Thr
260 265 270

Asp Phe Phe Asn Asp Glu Glu Lys Asn Lys Glu Leu Glu Lys Thr Leu
275 280 285

Glu Ala Asp Ala Val Ala Ser Leu Gly Ser Ala Val Val Gly Val Ser
290 295 300

Thr Thr Thr Ala Phe Ile Glu Ser Ala Ser Gly Val Glu Glu Gly Gly
305 310 315 320

Arg Thr Gly Leu Thr Ala Val Phe Thr Gly Leu Phe Phe Val Leu Thr
325 330 335

Leu Phe Cys Leu Pro Leu Leu Lys Ala Ile Pro Ser Asn Ala Ile Tyr
340 345 350

Pro Val Leu Val Val Val Gly Val Leu Met Phe Ser Val Leu Glu Gly
355 360 365

Val Asn Phe Lys Asp Met Ala Ile Ser Val Ser Thr Phe Leu Thr Val
370 375 380

Val Met Met Pro Leu Thr Phe Ser Ile Ala Asp Gly Leu Ala Phe Gly
385 390 395 400

Phe Leu Ser Tyr Ser Ile Ile Lys Leu Val Gln Lys Asp Phe Lys Ala
405 410 415

Leu Asn Ser Gly Ile Ile Ile Leu Cys Ile Ile Ser Val Ser Val Phe
420 425 430

Ile Phe Arg




AAA AAC AAG GAA TTG GAA AAG ACT TTG GAA GCG GAT GCG GTG GCT TCT 918 
Lys Asn Lys Glu Leu Glu Lys Thr Leu Glu Ala Asp Ala Val Ala Ser 
280 285 290 295 

TTA GGG AGC GCG GTG GTG GGC GTT TCT ACT ACG ACC GCT TTT ATA GAG 966 
Leu Gly Ser Ala Val Val Gly Val Ser Thr Thr Thr Ala Phe lie Glu 
300 305 310 

AGC GCG AGT GGG GTT GAA GAG GGG GGC CGC ACA GGG CTT ACA GCG GTT 1014 
Ser Ala Ser Gly Val Glu Glu Gly Gly Arg Thr Gly Leu Thr Ala Val 
315 320 325 

TTT ACC GGA TTA TTT TTT GTT TTA ACG CTC TTT TGC TTG CCT CTT TTA 1062 
Phe Thr Gly Leu Phe Phe Val Leu Thr Leu Phe Cys Leu Pro Leu Leu 
330 335 340 

AAA GCT ATT CCT AGC AAT GCG ATT TAT CCG GTG CTG GTG GTA GTA GGG 1110 
Lys Ala lie Pro Ser Asn Ala lie Tyr Pro Val Leu Val Val Val Gly 
345 350 355 

GTT TTG ATG TTT AGC GTG TTA GAG GGG GTG AAT TTT AAA GAC ATG GCC 115 8 

Val Leu Met Phe Ser Val Leu Glu Gly Val Asn Phe Lys Asp Met Ala 
360 365 370 375 

ATT AGC GTT TCC ACT TTT TTA ACC GTG GTG ATG ATG CCC TTA ACC TTC 12 06 

lie Ser Val Ser Thr Phe Leu Thr Val Val Met Met Pro Leu Thr Phe 
380 385 390 

TCC ATT GCC GAT GGC TTA GCC TTT GGC TTT TTG TCT TAT AGT ATC ATC 1254 
Ser lie Ala Asp Gly Leu Ala Phe Gly Phe Leu Ser Tyr Ser lie lie 
395 400 405 

AAA TTG GTT CAA AAA GAC TTC AAA GCA CTC AAT TCA GGC ATT ATC ATT 13 02 

Lys Leu Val Gin Lys Asp Phe Lys Ala Leu Asn Ser Gly He He He 
410 415 420 

CTC TGC ATC ATT TCT GTT TCT GTA TTT ATC TTT CGT TAAGCTCTTT TTAAGG 1354 
Leu Cys He He Ser Val Ser Val Phe He Phe Arg 
425 430 435 

GGCTTTGCAT TTTTTACTCA TTTCATGCCT CTTTTTCTTT ATTTAGACAG ATTATTATCT 1414 
TAAAATAATT GTAATATCAT TATTAT 1440

(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 435 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 
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ACT ATC TTA TCG AGC GTG TTT AAC GGA TTG TGG GCA AAC ACC CCT ATC 24 6 

Thr lie Leu Ser Ser Val Phe Asn Gly Leu Trp Ala Asn Thr Pro lie 
60 65 70 

GCT ATG AGC GTG GGC TTA GGG CTG TCA GCT TAT TTT AGC TTC GGG TTG 2 94 

Ala Met Ser Val Gly Leu Gly Leu Ser Ala Tyr Phe Ser Phe Gly Leu 
75 80 85 

GTT CAA GGG TTA AAA CTC CCT TGG CAG AGC GCT TTA GGC ATC GTA GCG 34 2 

Val Gin Gly Leu Lys Leu Pro Trp Gin Ser Ala Leu Gly lie Val Ala 
90 95 100 

CTC TCG GGA GCG ATT TTT GTG ATT TTG TCT TTC ACT AAA TTT AGA AGT 3 90 

Leu Ser Gly Ala lie Phe Val lie Leu Ser Phe Thr Lys Phe Arg Ser 
105 110 115 

TGG GTC ATG CGA AGC ATT CCT AGC GAT TTA AGG CGT GCG GTG AGT GCG 43 8 

Trp Val Met Arg Ser lie Pro Ser Asp Leu Arg Arg Ala Val Ser Ala 
120 125 130 135 

GGG ATA GGG GCT TTT ATC GCG TTT ATT GGC CTT AAA GAA ATG CAT ATC 4 86 

Gly lie Gly Ala Phe lie Ala Phe lie Gly Leu Lys Glu Met His lie 
140 145 150 

GTC GTT ACC CAT AAR GCT ACG CTT GTA ACC TTA GGC GAT TTT GGC GAT 534 
Val Val Thr His Xaa Ala Thr Leu Val Thr Leu Gly Asp Phe Gly Asp 
155 160 165 

CCG CAT GTG TTA TTG GGG GTT GTG GGG ATC ATT CTA ACT TTC GCG CTC 5 82 

Pro His Val Leu Leu Gly Val Val Gly lie lie Leu Thr Phe Ala Leu 
170 175 180 

TAC ACG CTC AAA ATC AGG GGT TCT TTC ATT ATA GCG GTC TTA ATC ACT 63 0 

Tyr Thr Leu Lys lie Arg Gly Ser Phe lie lie Ala Val Leu lie Thr 
185 190 195 

TCC ATT CTC GCA TGG GTT TTA AAG CTA GCC CCT TAC CCT AGC GAG TTT 67 8 

Ser lie Leu Ala Trp Val Leu Lys Leu Ala Pro Tyr Pro Ser Glu Phe 
200 205 210 215 

TTT TCC ATG CCC GCT AGC ATT GGC CCT ATC GCC TTT CAA TTA GAC TTT 72 6 

Phe Ser Met Pro Ala Ser lie Gly Pro lie Ala Phe Gin Leu Asp Phe 
220 225 230 

AAG GGC ATT TTT TTT GAT GCG AGT GGG GCT TTC ACT TTA GCG TTA GTG 774 
Lys Gly lie Phe Phe Asp Ala Ser Gly Ala Phe Thr Leu Ala Leu Val 
235 240 245 

CCA GTT ATT ATC ACT TTT TTT GTA ACC GAT TTG TTT GAT TCT TTA GGC 822 
Pro Val lie lie Thr Phe Phe Val Thr Asp Leu Phe Asp Ser Leu Gly 
250 255 260 

ACG CTT GCA GGG ATT GGC CAC AAG ACT GAT TTT TTC AAT GAT GAA GAA 87 0 

Thr Leu Ala Gly lie Gly His Lys Thr Asp Phe Phe Asn Asp Glu Glu 
265 270 275 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 



Ile Ser Ser Lys Arg Leu Glu Ile Ser Arg Leu Leu Glu Arg Lys Asn
1 5 10 15

Glu Arg Ser Phe Leu Ala Glu Lys Tyr His Lys Ile Pro Thr Asn Lys
20 25 30

Arg Lys Phe Lys Glu Arg Ser Ile Ile Ser Val Cys Glu Ile Ser Asn
35 40 45

Pro Val Ala His Lys Gly Leu Lys Arg Ile Lys Asn Pro Asn Thr Ile
50 55 60

Leu Thr Thr Asn Ile His Gln Leu Pro Lys Pro Ile Ala Thr Arg Leu
65 70 75 80

Phe Asn Ser Ser Ser Asn Thr Gly Ile Cys Ala Cys Asn Cys Ser Phe
85 90 95

Ser Ala Cys Phe Ser Phe Cys Asn Cys Phe Ala Asn Ala Ser Asn Ser
100 105 110


(2) INFORMATION FOR SEQ ID NO: 515: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 34...1338
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:515:

TAATCTATAC CAAATAGTAA GGAGTTATCT AAC ATG GGG TTT TTC AAG CTT AAA 54 

Met Gly Phe Phe Lys Leu Lys 
1 5 

GAA CAC AAC ACT AAC ATT GCC ACC GAG TTT AGA GCG GGT TTA ACG ACC 102
Glu His Asn Thr Asn Ile Ala Thr Glu Phe Arg Ala Gly Leu Thr Thr
10 15 20

TTT ATC ACC ATG ATT TAC ATC GTG CCC TTA AAC GCT CTT ATC CTT TCT 150
Phe Ile Thr Met Ile Tyr Ile Val Pro Leu Asn Ala Leu Ile Leu Ser
25 30 35

CAA GCC AAC ATG CCT TAT GAA GCC CTT TTA AGT GCA ACG GCC ATT ATC 198
Gln Ala Asn Met Pro Tyr Glu Ala Leu Leu Ser Ala Thr Ala Ile Ile
40 45 50 55
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1...336 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:513: 

ATC TCT TCT AAG CGT TTG GAG ATT TCA AGA CTT TTG GAG CGT AAA AAT 48
Ile Ser Ser Lys Arg Leu Glu Ile Ser Arg Leu Leu Glu Arg Lys Asn
1 5 10 15

GAA CGC AGT TTT TTA GCC GAA AAA TAC CAC AAA ATC CCC ACA AAT AAA 96
Glu Arg Ser Phe Leu Ala Glu Lys Tyr His Lys Ile Pro Thr Asn Lys
20 25 30

AGG AAA TTT AAA GAA CGC TCT ATA ATA TCT GTT TGT GAA ATA TCC AAT 144
Arg Lys Phe Lys Glu Arg Ser Ile Ile Ser Val Cys Glu Ile Ser Asn
35 40 45

CCA GTA GCG CAC AAA GGG CTT AAA AGG ATC AAA AAC CCT AAC ACC ATT 192
Pro Val Ala His Lys Gly Leu Lys Arg Ile Lys Asn Pro Asn Thr Ile
50 55 60

TTA ACT ACA AAC ATT CAT CAA CTC CCT AAA CCC ATA GCC ACA CGC TTG 240
Leu Thr Thr Asn Ile His Gln Leu Pro Lys Pro Ile Ala Thr Arg Leu
65 70 75 80

TTT AAC TCG TCT TCA AAT ACC GGC ATT TGC GCT TGC AAC TGC TCT TTT 288
Phe Asn Ser Ser Ser Asn Thr Gly Ile Cys Ala Cys Asn Cys Ser Phe
85 90 95

AGC GCT TGC TTT TCA TTT TGT AAT TGC TTC GCA AAC GCT TCA AAC TCT T 337
Ser Ala Cys Phe Ser Phe Cys Asn Cys Phe Ala Asn Ala Ser Asn Ser
100 105 110

GA 339 
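
The coding-sequence LOCATION of each nucleotide entry can be cross-checked against the LENGTH of its companion protein entry: an inclusive range start...end should cover a whole number of codons, one per residue. The sketch below is illustrative only and is not part of the filed listing. The helper name `codon_count` is hypothetical, the locations and lengths are the values read from the feature tables in this part of the listing, and it assumes the range is 1-based, inclusive, and excludes the stop codon.

```python
def codon_count(start: int, end: int) -> int:
    """Number of complete codons in an inclusive, 1-based location start...end."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"location {start}...{end} is not a whole number of codons")
    return length // 3


# (nucleotide SEQ ID NO, CDS start, CDS end, protein SEQ ID NO, protein length)
# Values as read from the feature tables; treat them as transcription assumptions.
ENTRIES = [
    (509, 35, 1735, 510, 567),
    (511, 76, 630, 512, 185),
    (513, 1, 336, 514, 112),
    (515, 34, 1338, 516, 435),
    (517, 66, 764, 518, 233),
    (519, 79, 1407, 520, 443),
]

for nt_id, start, end, aa_id, aa_len in ENTRIES:
    codons = codon_count(start, end)
    status = "consistent" if codons == aa_len else "MISMATCH"
    print(f"SEQ ID NO:{nt_id} -> SEQ ID NO:{aa_id}: {codons} codons, {aa_len} residues ({status})")
```

A mismatch in such a check would most likely point to a digit damaged in transcription (for example a split number such as "13 3 8") rather than to an error in the sequence itself.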

(2) INFORMATION FOR SEQ ID NO: 514:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
GTT TTA GGT TTT AGG GAA GTG AGC GAA GAA GAA ATT TTA GCT TCA CAC 543
Val Leu Gly Phe Arg Glu Val Ser Glu Glu Glu Ile Leu Ala Ser His
145 150 155

CAT GGC GGT GGG ACA GGT TGC TGT GGC GGT CAT GGG GGT CAT GGC GGA 591
His Gly Gly Gly Thr Gly Cys Cys Gly Gly His Gly Gly His Gly Gly
160 165 170

AAG AAA GGT GGG GGT TGT GGT TGC TCA TGT TCG CAT GGG TAGTAAGGTA TA 642
Lys Lys Gly Gly Gly Cys Gly Cys Ser Cys Ser His Gly
175 180 185

GGAGTATTTA AAAGGCAAGG TCATGAATAG TTCTAATCTC AAAAATTGGC TATTCCCTAC 702

CATTTGCTTT TTTTTAT 719 
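
Each codon row in the nucleotide entries is paired with a residue row, so the pairing can be checked by translating the codons. The sketch below is illustrative only and is not part of the filed listing. It assumes the standard genetic code (the listing does not name a translation table), and the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical. It translates the first twelve codons of the coding sequence of SEQ ID NO: 511 (positions 76 to 111).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# One-letter to three-letter residue names; "End" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "End",
}


def translate(dna: str) -> list[str]:
    """Translate a DNA string codon by codon into three-letter residue names."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons]


first_codons = "ATG CAA AAC CAT GAT TTA GAG TCA ATC AAA CAA GCC"
print(" ".join(translate(first_codons)))
# prints: Met Gln Asn His Asp Leu Glu Ser Ile Lys Gln Ala
```

The same check helps separate genuine residues from transcription damage: a residue row reading "lie", "He" or "Gin" beneath ATT/ATC/ATA or CAA/CAG codons can only be Ile or Gln.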

(2) INFORMATION FOR SEQ ID NO: 512:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 

Met Gln Asn His Asp Leu Glu Ser Ile Lys Gln Ala Ala Leu Ile Glu
1 5 10 15

Tyr Glu Val Arg Glu Gln Gly Ser Ser Ile Val Leu Asp Ser Asn Ile
20 25 30

Ser Lys Glu Pro Leu Glu Phe Ile Ile Gly Thr Asn Gln Ile Ile Ala
35 40 45

Gly Leu Glu Lys Ala Val Leu Lys Ala Gln Ile Gly Glu Trp Glu Glu
50 55 60

Val Val Ile Ala Pro Glu Glu Ala Tyr Gly Val Tyr Glu Ser Ser Tyr
65 70 75 80

Leu Gln Glu Val Pro Arg Asp Gln Phe Glu Gly Ile Glu Leu Glu Lys
85 90 95

Gly Met Ser Val Phe Gly Gln Thr Glu Asp Asn Gln Thr Ile Gln Ala
100 105 110

Ile Ile Lys Asp Phe Ser Ala Thr His Val Met Val Asp Tyr Asn His
115 120 125

Pro Leu Ala Gly Lys Thr Leu Ala Phe Arg Phe Lys Val Leu Gly Phe
130 135 140

Arg Glu Val Ser Glu Glu Glu Ile Leu Ala Ser His His Gly Gly Gly
145 150 155 160

Thr Gly Cys Cys Gly Gly His Gly Gly His Gly Gly Lys Lys Gly Gly
165 170 175

Gly Cys Gly Cys Ser Cys Ser His Gly
180 185

(2) INFORMATION FOR SEQ ID NO: 513:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 76...630
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 

TGCAACACTT GTATCCTTCA AGCGAACAAG CTAAAATGGC GCAAAAAATC TTAGAAAACA 60

AGGAGAAACA CCACC ATG CAA AAC CAT GAT TTA GAG TCA ATC AAA CAA GCC 111
Met Gln Asn His Asp Leu Glu Ser Ile Lys Gln Ala
1 5 10

GCT TTG ATT GAA TAT GAA GTG AGA GAA CAA GGC TCT AGT ATT GTG CTA 159
Ala Leu Ile Glu Tyr Glu Val Arg Glu Gln Gly Ser Ser Ile Val Leu
15 20 25

GAC AGC AAT ATT TCC AAA GAG CCT TTA GAG TTT ATT ATA GGC ACT AAT 207
Asp Ser Asn Ile Ser Lys Glu Pro Leu Glu Phe Ile Ile Gly Thr Asn
30 35 40

CAA ATC ATA GCA GGG TTA GAA AAG GCG GTA TTA AAG GCT CAA ATT GGC 255
Gln Ile Ile Ala Gly Leu Glu Lys Ala Val Leu Lys Ala Gln Ile Gly
45 50 55 60

GAG TGG GAA GAG GTT GTC ATC GCC CCA GAG GAA GCT TAT GGG GTT TAT 303
Glu Trp Glu Glu Val Val Ile Ala Pro Glu Glu Ala Tyr Gly Val Tyr
65 70 75

GAA AGC AGC TAT TTG CAA GAA GTC CCT AGA GAT CAA TTT GAA GGC ATT 351
Glu Ser Ser Tyr Leu Gln Glu Val Pro Arg Asp Gln Phe Glu Gly Ile
80 85 90

GAA TTA GAA AAA GGC ATG AGC GTT TTT GGG CAA ACT GAA GAC AAT CAA 399
Glu Leu Glu Lys Gly Met Ser Val Phe Gly Gln Thr Glu Asp Asn Gln
95 100 105

ACC ATT CAA GCC ATT ATC AAA GAC TTT AGC GCT ACG CAT GTG ATG GTG 447
Thr Ile Gln Ala Ile Ile Lys Asp Phe Ser Ala Thr His Val Met Val
110 115 120

GAT TAT AAC CAC CCG TTA GCC GGG AAA ACT TTA GCG TTT CGT TTC AAG 495
Asp Tyr Asn His Pro Leu Ala Gly Lys Thr Leu Ala Phe Arg Phe Lys
125 130 135 140
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145 150 155 160 

Pro Ala lie Lys Leu Ala Glu Asn Gly Tyr Ala He Ser Gin Arg Gin 

165 170 175 

Ala Glu Thr Leu Lys Glu Ala Arg Glu Arg Phe Leu Lys Tyr Ser Ser 

180 185 190 

Ser Lys Lys Tyr Phe Phe Lys Lys Gly His Leu Asp Tyr Gin Glu Gly 

195 200 205 

Asp Leu Phe Val Gin Lys Asp Leu Ala Lys Thr Leu Asn Gin He Lys 

210 215 220 

Thr Leu Gly Ala Lys Gly Phe Tyr Gin Gly Gin Val Ala Glu Leu He 
225 230 235 240 

Glu Lys Asp Met Lys Lys Asn Gly Gly He He Thr Lys Glu Asp Leu 

245 250 255 

Ala Ser Tyr Asn Val Lys Trp Arg Lys Pro Val Val Gly Ser Tyr Arg 

260 265 270 

Gly Tyr Lys He He Ser Met Ser Pro Pro Ser Ser Gly Gly Thr His 

275 280 285 

Leu He Gin He Leu Asn Val Met Glu Asn Ala Asp Leu Ser Ala Leu 

290 295 300 

Gly Tyr Gly Ala Ser Lys Asn He His He Ala Ala Glu Ala Met Arg 
305 310 315 320 

Gin Ala Tyr Ala Asp Arg Ser Val Tyr Met Gly Asp Ala Asp Phe Val 

325 330 335 

Ser Val Pro Val Asp Lys Leu He Asn Lys Ala Tyr Ala Lys Lys He 

340 345 350 

Phe Asp Thr He Gin Pro Asp Thr Val Thr Pro Ser Ser Gin He Lys 

355 360 365 

Pro Gly Met Gly Gin Leu His Glu Gly Ser Asn Thr Thr His Tyr Ser 

370 375 380 

Val Ala Asp Arg Trp Gly Asn Ala Val Ser Val Thr Tyr Thr He Asn 
385 390 395 400 

Ala Ser Tyr Gly Ser Ala Ala Ser He Asp Gly Ala Gly Phe Leu Leu 

405 410 415 

Asn Asn Glu Met Asp Asp Phe Ser lie Lys Pro Gly Asn Pro Asn Leu 

420 425 430 

Tyr Gly Leu Val Gly Gly Asp Ala Asn Ala He Glu Ala Asn Lys Arg 

435 440 445 

Pro Leu Ser Ser Met Ser Pro Thr He Val Leu Lys Asn Asn Lys Val 

450 455 460 

Phe Leu Val Val Gly Ser Pro Gly Gly Ser Arg He He Thr Thr Val 
465 470 475 480 

Leu Gin Val He Ser Asn Val He Asp Tyr Asn Met Asn He Ser Glu 

485 490 495 

Ala Val Ser Ala Pro Arg Phe His Met Gin Trp Leu Pro Asp Glu Leu 

500 505 510 

Arg He Glu Lys Phe Gly Met Pro Ala Asp Val Lys Asp Asn Leu Thr 

515 520 525 

Lys Met Gly Tyr Gin He Val Thr Lys Pro Val Met Gly Asp Val Asn 

530 535 540 

Ala He Gin Val Leu Pro Lys Thr Lys Gly Ser Val Phe Tyr Gly Ser 
545 550 555 560 

Thr Asp Pro Arg Lys Glu Phe 
565 

(2) INFORMATION FOR SEQ ID NO: 511:
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ATT GAT TAT AAT ATG AAT ATT TCT GAA GCG GTT TCA GCC CCA AGA TTT 154 3 
lie Asp Tyr Asn Met Asn lie Ser Glu Ala Val Ser Ala Pro Arg Phe 
490 495 500 

CAC ATG CAA TGG CTC CCT GAT GAA TTA AGG ATT GAA AAG TTT GGC ATG 1591 
His Met Gin Trp Leu Pro Asp Glu Leu Arg lie Glu Lys Phe Gly Met 
505 510 515 

CCC GCT GAT GTG AAA GAC AAC CTC ACT AAA ATG GGC TAT CAA ATC GTT 163 9 
Pro Ala Asp Val Lys Asp Asn Leu Thr Lys Met Gly Tyr Gin lie Val 
520 525 530 535 

ACT AAG CCG GTC ATG GGT GAT GTG AAT GCG ATC CAA GTT TTG CCT AAA 1687 
Thr Lys Pro Val Met Gly Asp Val Asn Ala lie Gin Val Leu Pro Lys 
540 545 550 

ACT AAA GGG AGC GTT TTC TAT GGT TCA ACG GAT CCA AGG AAA GAA TTT T 1736 
Thr Lys Gly Ser Val Phe Tyr Gly Ser Thr Asp Pro Arg Lys Glu Phe 
555 560 565 

AATTCTTTGT CATATACAGG TTTTTAATCC TATTTAGCCT TATTTTTTGG GATGGAGGGG 1796
GGCTTTTTAG CGAGAAAATC TTAAATTTAG TTTTAAAATT CATA 1840

(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:510: 

Met Arg Arg Ser Phe Leu Lys Thr lie Gly Leu Gly Val lie Ala Leu 

15 10 15 

Phe Leu Gly Leu Leu Asn Pro Leu Ser Ala Ala Ser Tyr Pro Pro lie 

20 25 30 

Lys Asn Thr Lys Val Gly Leu Ala Leu Ser Ser His Pro Leu Ala Ser 

35 40 45 

Glu lie Gly Gin Lys Val Leu Glu Glu Gly Gly Asn Ala lie Asp Ala 

50 55 60 

Ala Val Ala lie Gly Phe Ala Leu Ala Val Val His Pro Ala Ala Gly 
65 70 75 80 

Asn lie Gly Gly Gly Gly Phe Ala Val lie His Leu Ala Asn Gly Glu 

85 90 95 

Asn Val Ala Leu Asp Phe Arg Glu Lys Ala Pro Leu Lys Ala Thr Lys 

100 105 110 

Asn Met Phe Leu Asp Lys Gin Gly Asn Val Val Pro Lys Leu Ser Glu 

115 120 125 

Asp Gly Tyr Leu Ala Ala Gly Val Pro Gly Thr Val Ala Gly Met Glu 

130 135 140 

Ala Met Leu Lys Lys Tyr Gly Thr Lys Lys Leu Ser Gin Leu lie Asp 
CGC AAA CCC GTG GTA GGG AGT TAT CGT GGG TAT AAG ATC ATT TCT ATG 871
Arg Lys Pro Val Val Gly Ser Tyr Arg Gly Tyr Lys Ile Ile Ser Met
265 270 275 

TCG CCG CCA AGT TCG GGA GGC ACG CAT TTG ATC CAG ATT TTA AAT GTC 919 
Ser Pro Pro Ser Ser Gly Gly Thr His Leu lie Gin lie Leu Asn Val 
280 285 290 295 

ATG GAA AAT GCG GAT TTA AGC GCC CTT GGG TAT GGG GCT TCT AAG AAT 967 
Met Glu Asn Ala Asp Leu Ser Ala Leu Gly Tyr Gly Ala Ser Lys Asn 
300 305 310 

ATC CAT ATC GCT GCC GAA GCG ATG CGT CAG GCT TAT GCG GAT AGA TCG 1015 
He His He Ala Ala Glu Ala Met Arg Gin Ala Tyr Ala Asp Arg Ser 
315 320 325 

GTT TAT ATG GGA GAC GCT GAT TTT GTT TCG GTG CCG GTG GAT AAA TTG 10 63 

Val Tyr Met Gly Asp Ala Asp Phe Val Ser Val Pro Val Asp Lys Leu 
330 335 340 

ATT AAT AAA GCG TAT GCC AAA AAG ATT TTT GAC ACT ATC CAG CCA GAT 1111 
He Asn Lys Ala Tyr Ala Lys Lys He Phe Asp Thr He Gin Pro Asp 
345 350 355 

ACG GTT ACG CCA AGC TCT CAA ATC AAA CCA GGA ATG GGG CAG TTG CAT 1159 
Thr Val Thr Pro Ser Ser Gin He Lys Pro Gly Met Gly Gin Leu His 
360 365 370 375 

GAG GGG AGC AAT ACC ACG CAT TAT TCT GTA GCG GAC AGG TGG GGG AAT 12 07 

Glu Gly Ser Asn Thr Thr His Tyr Ser Val Ala Asp Arg Trp Gly Asn 
380 385 390 

GCA GTC AGC GTT ACT TAC ACC ATT AAC GCT TCT TAT GGA AGC GCT GCC 1255 
Ala Val Ser Val Thr Tyr Thr He Asn Ala Ser Tyr Gly Ser Ala Ala 
395 400 405 

AGT ATT GAT GGG GCA GGA TTT TTA TTG AAC AAT GAA ATG GAT GAT TTT 13 03 

Ser He Asp Gly Ala Gly Phe Leu Leu Asn Asn Glu Met Asp Asp Phe 
410 415 420 

TCC ATC AAG CCA GGG AAT CCC AAT CTC TAT GGT TTA GTA GGG GGC GAT 13 51 

Ser He Lys Pro Gly Asn Pro Asn Leu Tyr Gly Leu Val Gly Gly Asp 
425 430 435 

GCG AAT GCG ATT GAA GCC AAT AAG CGC CCT TTA AGC TCC ATG TCG CCT 13 99 

Ala Asn Ala He Glu Ala Asn Lys Arg Pro Leu Ser Ser Met Ser Pro 
440 445 450 455 

ACG ATT GTG TTG AAA AAC AAT AAG GTT TTT TTG GTG GTG GGA AGC CCT 144 7 

Thr He Val Leu Lys Asn Asn Lys Val Phe Leu Val Val Gly Ser Pro 
460 465 470 

GGA GGG TCT AGG ATT ATC ACT ACG GTG CTG CAA GTG ATT TCT AAT GTC 14 95 

Gly Gly Ser Arg He He Thr Thr Val Leu Gin Val He Ser Asn Val 
475 480 485 
GCC CTT TCT AGC CAC CCG CTA GCT AGT GAG ATC GGG CAA AAG GTT TTA 199
Ala Leu Ser Ser His Pro Leu Ala Ser Glu Ile Gly Gln Lys Val Leu
40 45 50 55 

GAA GAG GGA GGT AAT GCG ATT GAT GCG GCT GTA GCG ATA GGT TTT GCT 2 47 

Glu Glu Gly Gly Asn Ala He Asp Ala Ala Val Ala He Gly Phe Ala 
60 65 70 

CTT GCG GTT GTC CAT CCG GCA GCA GGC AAT ATT GGT GGA GGC GGT TTT 2 95 

Leu Ala Val Val His Pro Ala Ala Gly Asn He Gly Gly Gly Gly Phe 
75 80 85 

GCG GTT ATC CAT TTG GCT AAT GGT GAA AAT GTT GCG TTA GAT TTT AGA 34 3 

Ala Val He His Leu Ala Asn Gly Glu Asn Val Ala Leu Asp Phe Arg 
90 95 100 

GAA AAA GCC CCC TTA AAA GCC ACT AAA AAC ATG TTT TTA GAC AAG CAA 3 91 

Glu Lys Ala Pro Leu Lys Ala Thr Lys Asn Met Phe Leu Asp Lys Gin 
105 110 115 

GGC AAT GTA GTC CCT AAA CTC AGC GAA GAT GGC TAT TTG GCG GCC GGG 43 9 

Gly Asn Val Val Pro Lys Leu Ser Glu Asp Gly Tyr Leu Ala Ala Gly 
120 125 130 135 

GTT CCT GGA ACG GTG GCA GGC ATG GAA GCG ATG CTG AAA AAA TAC GGC 4 87 

Val Pro Gly Thr Val Ala Gly Met Glu Ala Met Leu Lys Lys Tyr Gly 
140 145 150 

ACT AAA AAA CTA TCG CAA CTC ATT GAT CCT GCC ATT AAA TTG GCT GAA 535 
Thr Lys Lys Leu Ser Gin Leu He Asp Pro Ala He Lys Leu Ala Glu 
155 160 165 

AAT GGT TAT GCG ATT TCA CAA AGA CAA GCA GAA ACC CTA AAG GAA GCA 58 3 

Asn Gly Tyr Ala He Ser Gin Arg Gin Ala Glu Thr Leu Lys Glu Ala 
170 175 180 

AGG GAG CGG TTT TTA AAA TAC AGT TCT AGC AAA AAG TAT TTT TTT AAA 631 
Arg Glu Arg Phe Leu Lys Tyr Ser Ser Ser Lys Lys Tyr Phe Phe Lys 
185 190 195 

AAA GGC CAT CTT GAT TAT CAA GAA GGG GAT TTG TTT GTC CAA AAA GAT 67 9 

Lys Gly His Leu Asp Tyr Gin Glu Gly Asp Leu Phe Val Gin Lys Asp 
200 205 210 215 

TTA GCC AAG ACT TTG AAT CAA ATC AAA ACG CTA GGC GCT AAA GGC TTT 72 7 

Leu Ala Lys Thr Leu Asn Gin He Lys Thr Leu Gly Ala Lys Gly Phe 
220 225 230 

TAT CAA GGG CAA GTC GCT GAG CTT ATT GAG AAA GAC ATG AAA AAA AAT 775 
Tyr Gin Gly Gin Val Ala Glu Leu He Glu Lys Asp Met Lys Lys Asn 
235 240 245 

GGA GGG ATT ATC ACT AAA GAA GAT TTA GCC AGT TAC AAT GTG AAA TGG 82 3 

Gly Gly He He Thr Lys Glu Asp Leu Ala Ser Tyr Asn Val Lys Trp 
250 255 260 
100 105 110

Trp Val Gly Lys His Glu Glu Thr Lys Trp Phe Gly Phe Arg Trp Gly
115 120 125

Leu Phe Tyr Asp Leu Ser Ala Ser Leu Tyr Gly Gln Lys Glu Ser Gln
130 135 140

Ser Val Ile Ile Ser Thr Tyr Gly Thr Tyr Met Asp Leu Leu Leu Asn
145 150 155 160

Ala Tyr Asn Gly Asp Lys Phe Phe Ala Gly Phe Asn Leu Gly Ile Ala
165 170 175

Phe Ala Gly Val Tyr Asp Lys Val Ser Asp Ala Leu Leu Tyr Gln Ala
180 185 190

Leu Leu Leu Asp Thr Phe Gly Gly Lys Val Asp Pro Asn Gly Phe Gln
195 200 205

Phe Leu Val Asn Leu Gly Val Arg Leu Gly Asn Lys His Asn Gln Phe
210 215 220

Gly Phe Gly Ile Lys Ile Pro Thr Tyr Tyr Phe Asn His Tyr Tyr Ser
225 230 235 240

Met Asn Asn Ile Ser Asn Asn Ser Glu Asp Val Leu Lys Val Leu Arg
245 250 255

Phe Leu Glu Tyr Gly Ile Asn Ser Leu Leu Tyr Gln Val Asp Phe Arg
260 265 270

Arg Asn Tyr Ser Val Tyr Phe Asn Tyr Thr Tyr Ile Phe
275 280 285



(2) INFORMATION FOR SEQ ID NO: 509:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1840 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 35...1735
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509:



AAATTTTAAG CGTAACTGAT TGAAAGGAAA ACAG ATG AGA CGG AGT TTT TTG AAA 55 

Met Arg Arg Ser Phe Leu Lys 
1 5 



ACG ATT GGC TTG GGT GTG ATA GCA CTC TTT TTG GGT TTG TTA AAC CCT 103
Thr Ile Gly Leu Gly Val Ile Ala Leu Phe Leu Gly Leu Leu Asn Pro
10 15 20

TTG AGT GCG GCG AGT TAC CCC CCC ATT AAA AAC ACT AAA GTA GGC TTA 151
Leu Ser Ala Ala Ser Tyr Pro Pro Ile Lys Asn Thr Lys Val Gly Leu
25 30 35 
180 185 190

CTT CTT TTA GAC ACT TTT GGC GGG AAA GTG GAT CCA AAT GGC TTC CAG 624
Leu Leu Leu Asp Thr Phe Gly Gly Lys Val Asp Pro Asn Gly Phe Gln
195 200 205

TTT TTG GTA AAT TTA GGG GTT CGT TTA GGG AAT AAG CAC AAC CAA TTT 672
Phe Leu Val Asn Leu Gly Val Arg Leu Gly Asn Lys His Asn Gln Phe
210 215 220

GGC TTT GGG ATT AAA ATC CCT ACT TAT TAT TTT AAC CAT TAT TAT TCC 720
Gly Phe Gly Ile Lys Ile Pro Thr Tyr Tyr Phe Asn His Tyr Tyr Ser
225 230 235 240

ATG AAT AAC ATT AGC AAT AAT AGT GAA GAT GTC CTC AAA GTT TTA CGA 768
Met Asn Asn Ile Ser Asn Asn Ser Glu Asp Val Leu Lys Val Leu Arg
245 250 255

TTT TTA GAA TAC GGG ATC AAC AGC TTG TTA TAC CAA GTT GAT TTC AGG 816
Phe Leu Glu Tyr Gly Ile Asn Ser Leu Leu Tyr Gln Val Asp Phe Arg
260 265 270

CGC AAT TAC TCG GTT TAT TTC AAC TAC ACT TAT ATT TTT TAAGCGATAG CG 867
Arg Asn Tyr Ser Val Tyr Phe Asn Tyr Thr Tyr Ile Phe
275 280 285

TTTAAAGCGT TCTTAATTGA GCGATTTCGT CTCTCAAACG CATCGCTTCT TCAAAATCCA 927

AATTCTTCGT GCATTCTCGC A 948

(2) INFORMATION FOR SEQ ID NO: 508:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 285 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:508: 

Lys Ile Glu Val Leu Arg Gly Phe Leu Lys Arg Ala Leu Tyr Leu Ile
1 5 10 15

Leu Gly Leu Phe Tyr Thr Leu Asn Ala Glu Ser Phe Lys Asp Val Leu
20 25 30

Thr Lys Val Asp Tyr Thr Phe Phe Asn Lys Lys Val Val Ser Pro Ile
35 40 45

Lys Arg Tyr Ala Asp Arg Ser Ala Phe Tyr Leu Gly Leu Gly Tyr Gln
50 55 60

Leu Gly Ser Ile Gln His Asn Ser Ser Asn Leu Asn Leu Ser Gln Gln
65 70 75 80

Phe Asn Lys Ser Gln Ile Ile Phe Ser Asp Ser Leu Ser Pro Val Phe
85 90 95

Lys Asn Ser Tyr Val Ser Asn Gly Leu Gly Val Gln Val Gly Tyr Lys
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...855
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 

AAA ATT GAA GTT TTA AGG GGT TTT TTG AAA CGA GCG TTA TAC TTA ATT 48
Lys Ile Glu Val Leu Arg Gly Phe Leu Lys Arg Ala Leu Tyr Leu Ile
1 5 10 15

TTA GGG CTT TTT TAC ACG CTT AAT GCA GAG AGC TTT AAA GAT GTT TTG 96
Leu Gly Leu Phe Tyr Thr Leu Asn Ala Glu Ser Phe Lys Asp Val Leu
20 25 30

ACT AAA GTG GAT TAC ACT TTT TTT AAT AAA AAG GTG GTT TCG CCC ATC 144
Thr Lys Val Asp Tyr Thr Phe Phe Asn Lys Lys Val Val Ser Pro Ile
35 40 45

AAA CGC TAT GCG GAT AGA TCG GCG TTT TAT CTG GGG CTT GGG TAT CAA 192
Lys Arg Tyr Ala Asp Arg Ser Ala Phe Tyr Leu Gly Leu Gly Tyr Gln
50 55 60

TTA GGG AGC ATT CAG CAC AAC TCT AGC AAC TTG AAT TTA TCC CAG CAA 240
Leu Gly Ser Ile Gln His Asn Ser Ser Asn Leu Asn Leu Ser Gln Gln
65 70 75 80

TTC AAT AAG AGT CAG ATT ATT TTC AGC GAT AGT CTA AGC CCT GTT TTT 288
Phe Asn Lys Ser Gln Ile Ile Phe Ser Asp Ser Leu Ser Pro Val Phe
85 90 95

AAA AAT TCG TAT GTG TCT AAT GGC CTT GGC GTG CAA GTG GGC TAT AAG 336
Lys Asn Ser Tyr Val Ser Asn Gly Leu Gly Val Gln Val Gly Tyr Lys
100 105 110

TGG GTG GGT AAG CAT GAA GAG ACG AAA TGG TTT GGC TTC AGG TGG GGG 384
Trp Val Gly Lys His Glu Glu Thr Lys Trp Phe Gly Phe Arg Trp Gly
115 120 125

CTG TTT TAT GAT TTG AGC GCC TCT CTT TAT GGC CAA AAA GAA TCA CAG 432
Leu Phe Tyr Asp Leu Ser Ala Ser Leu Tyr Gly Gln Lys Glu Ser Gln
130 135 140

TCT GTC ATC ATT TCC ACT TAC GGC ACT TAT ATG GAT TTA TTA TTG AAC 480
Ser Val Ile Ile Ser Thr Tyr Gly Thr Tyr Met Asp Leu Leu Leu Asn
145 150 155 160

GCT TAT AAT GGG GAT AAG TTT TTT GCT GGG TTC AAT CTG GGG ATT GCT 528
Ala Tyr Asn Gly Asp Lys Phe Phe Ala Gly Phe Asn Leu Gly Ile Ala
165 170 175

TTT GCT GGA GTG TAT GAC AAA GTG AGC GAT GCG TTA TTG TAT CAA GCC 576
Phe Ala Gly Val Tyr Asp Lys Val Ser Asp Ala Leu Leu Tyr Gln Ala
65 70 75 80

Arg Val Ser Thr Ala Thr Ser Ser Gln Gly Leu Ala Leu Met Val Glu
85 90 95

Val Leu Tyr Gln Ala Ser Gly Met Arg Leu Pro Ile Val Leu Asn Leu
100 105 110

Val Asn Arg Ala Leu Ala Ala Pro Leu Asn Ile His Gly Asp His Ser
115 120 125

Asp Met Tyr Leu Ser Arg Asp Ser Gly Trp Ile Ser Leu Cys Thr Cys
130 135 140

Asn Pro Gln Glu Ala Tyr Asp Phe Thr Leu Met Ala Phe Arg Ile Ala
145 150 155 160

Glu His Gln Lys Val Arg Val Pro Thr Ile Val Asn Gln Asp Gly Phe
165 170 175

Leu Cys Ser His Thr Val Gln Asn Val Arg Pro Leu Ser Asp Ala Val
180 185 190

Ala Tyr Gln Phe Val Gly Glu Tyr Gln Thr Lys His Ser Leu Leu Asp
195 200 205

Phe Asp Lys Pro Val Ser Tyr Gly Ala Gln Ala Glu Glu Glu Trp His
210 215 220

Tyr Glu His Lys Ala Gln Leu His His Ala Ile Met Ser Ala Ser Ser
225 230 235 240

Val Ile Glu Glu Val Phe Asn Asp Phe Ala Lys Leu Thr Gly Arg Gln
245 250 255

Tyr His Leu Thr Lys Thr Phe Gln Leu Glu Asp Ala Glu Ile Ala Ile
260 265 270

Phe Ala Leu Gly Thr Thr Tyr Glu Ser Ala Ile Val Ala Ala Lys Glu
275 280 285

Met Arg Lys Lys Gly Ile Lys Ala Gly Val Ala Thr Ile His Ser Leu
290 295 300

Arg Pro Phe Pro Tyr Glu Arg Leu Gly Gln Asp Leu Lys Asn Leu Lys
305 310 315 320

Ala Leu Ala Ile Leu Asp Lys Ser Ser Pro Ala Gly Thr Met Gly Ala
325 330 335

Met Phe Asn Glu Val Thr Ser Ala Val Tyr Gln Thr Gln Gly Thr Lys
340 345 350

His Pro Val Val Ser Asn Tyr Ile Tyr Gly Leu Gly Glu Arg Asp Met
355 360 365

Thr Ile Ala His Leu Cys Glu Ile Phe Glu Glu Ile Asn Glu Asp Ala
370 375 380

Leu Lys Gly Thr Leu Thr His Pro Thr Gln Gln Phe Val Gly Leu His
385 390 395 400

Gly Pro Lys Met Ser Phe Phe
405



(2) INFORMATION FOR SEQ ID NO: 507: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
Val Ala Ala Lys Glu Met Arg Lys Lys Gly Ile Lys Ala Gly Val Ala
285 290 295

ACC ATC CAT TCC TTG CGC CCC TTC CCT TAT GAA AGA TTA GGG GAG GAT 1022
Thr Ile His Ser Leu Arg Pro Phe Pro Tyr Glu Arg Leu Gly Gln Asp
300 305 310 315

TTG AAA AAT CTT AAA GCT TTA GCG ATT TTA GAC AAG AGC TCT CCA GCG 1070
Leu Lys Asn Leu Lys Ala Leu Ala Ile Leu Asp Lys Ser Ser Pro Ala
320 325 330

GGC ACT ATG GGG GCG ATG TTT AAT GAA GTA ACG AGC GCG GTG TAT CAA 1118
Gly Thr Met Gly Ala Met Phe Asn Glu Val Thr Ser Ala Val Tyr Gln
335 340 345

ACG CAA GGG ACT AAA CAC CCC GTG GTG TCT AAC TAC ATT TAT GGT TTA 1166
Thr Gln Gly Thr Lys His Pro Val Val Ser Asn Tyr Ile Tyr Gly Leu
350 355 360

GGC GAA AGG GAT ATG ACG ATC GCG CAT TTA TGC GAA ATT TTT GAA GAA 1214
Gly Glu Arg Asp Met Thr Ile Ala His Leu Cys Glu Ile Phe Glu Glu
365 370 375

ATC AAT GAA GAC GCT CTT AAA GGC ACG CTC ACG CAC CCT ACC CAA CAA 1262
Ile Asn Glu Asp Ala Leu Lys Gly Thr Leu Thr His Pro Thr Gln Gln
380 385 390 395

TTC GTA GGC TTG CAC GGC CCT AAA ATG AGC TTT TTT TAAAAAGGAA ATATCA 1314
Phe Val Gly Leu His Gly Pro Lys Met Ser Phe Phe
400 405

TGGTAAT^GA AGTCAAAACA CTCAAAGGTT TTAGC 1349

(2) INFORMATION FOR SEQ ID NO: 506:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 407 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

Met Ala Lys Ser Ile Glu Leu Gln Glu Ile Glu Val Trp Asp Gly Asn
1 5 10 15

Thr Ala Ser Ser Asn Ala Leu Arg Gln Ala Gln Ile Asp Val Ile Ala
20 25 30

Ala Tyr Pro Ile Thr Pro Ser Thr Pro Ile Val Gln Asn Tyr Gly Ser
35 40 45

Phe Lys Asp Asn Gly Tyr Val Asp Gly Glu Phe Val Leu Val Glu Ser
50 55 60

Glu His Ala Ala Met Ser Ala Cys Val Gly Ala Ala Ala Ala Gly Gly
60 65 70 75

GCC GCA GCT GGA GGG AGA GTC AGC ACT GCG ACT AGC TCT CAA GGT TTG 350
Ala Ala Ala Gly Gly Arg Val Ser Thr Ala Thr Ser Ser Gln Gly Leu
80 85 90

GCG TTA ATG GTA GAG GTT TTA TAC CAG GCT TCT GGA ATG CGT TTG CCT 398
Ala Leu Met Val Glu Val Leu Tyr Gln Ala Ser Gly Met Arg Leu Pro
95 100 105

ATC GTT TTG AAT TTA GTC AAT CGT GCT TTA GCA GCC CCT TTG AAT ATC 446
Ile Val Leu Asn Leu Val Asn Arg Ala Leu Ala Ala Pro Leu Asn Ile
110 115 120

CAT GGC GAT CAT TCT GAT ATG TAT TTA AGC AGG GAT TCT GGT TGG ATA 494
His Gly Asp His Ser Asp Met Tyr Leu Ser Arg Asp Ser Gly Trp Ile
125 130 135

AGT TTA TGC ACA TGC AAC CCC CAA GAA GCT TAT GAT TTC ACT TTA ATG 542
Ser Leu Cys Thr Cys Asn Pro Gln Glu Ala Tyr Asp Phe Thr Leu Met
140 145 150 155

GCG TTT AGA ATC GCA GAG CAT CAA AAG GTG CGC GTG CCT ACT ATT GTC 590
Ala Phe Arg Ile Ala Glu His Gln Lys Val Arg Val Pro Thr Ile Val
160 165 170

AAT CAA GAC GGG TTT TTA TGC TCG CAC ACC GTG CAA AAT GTC CGC CCT 638
Asn Gln Asp Gly Phe Leu Cys Ser His Thr Val Gln Asn Val Arg Pro
175 180 185

TTG AGC GAT GCA GTG GCT TAC CAA TTC GTG GGC GAA TAC CAA ACC AAG 686
Leu Ser Asp Ala Val Ala Tyr Gln Phe Val Gly Glu Tyr Gln Thr Lys
190 195 200

CAT TCC CTT TTG GAT TTT GAT AAA CCG GTA AGC TAT GGC GCG CAA GCT 734
His Ser Leu Leu Asp Phe Asp Lys Pro Val Ser Tyr Gly Ala Gln Ala
205 210 215

GAA GAA GAA TGG CAT TAT GAG CAT AAA GCC CAA CTC CAC CAT GCC ATC 782
Glu Glu Glu Trp His Tyr Glu His Lys Ala Gln Leu His His Ala Ile
220 225 230 235

ATG AGC GCG TCT TCT GTG ATT GAA GAA GTG TTC AAT GAT TTC GCT AAA 830
Met Ser Ala Ser Ser Val Ile Glu Glu Val Phe Asn Asp Phe Ala Lys
240 245 250

CTC ACA GGC AGG CAA TAC CAT TTA ACC AAA ACT TTC CAG CTA GAA GAC 878
Leu Thr Gly Arg Gln Tyr His Leu Thr Lys Thr Phe Gln Leu Glu Asp
255 260 265

GCT GAA ATC GCT ATC TTT GCG TTA GGC ACT ACT TAT GAA TCA GCG ATC 926
Ala Glu Ile Ala Ile Phe Ala Leu Gly Thr Thr Tyr Glu Ser Ala Ile
270 275 280

GTA GCG GCT AAA GAA ATG CGT AAA AAA GGC ATT AAG GCC GGC GTG GCT 974
115 120 125

Lys Lys Glu His Tyr Lys Pro Ile Lys Asn Ile Arg Lys Ser Lys Trp
130 135 140

Tyr Ala Tyr Leu Glu Ala Leu Gly Leu Glu Lys Phe Phe Gln Tyr Thr
145 150 155 160

Trp Asp Thr Lys Lys Lys Asn Asn Lys Lys Lys Leu Ile Ile Asp Asp
165 170 175

Lys Asp Gly Asp Glu Ile Glu Ile Lys Asp Gln Tyr Lys Gly Asp Tyr
180 185 190

Glu Glu Leu Lys Lys Val Leu Asp Leu Asn Ser Lys Ser Leu Ile Pro
195 200 205

Leu Lys Asn Phe Leu Gly Gln Phe Ala Glu Asn Asn Gln Lys Thr Asn
210 215 220

Pro Lys Ile Phe
225

(2) INFORMATION FOR SEQ ID NO:505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1349 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 78...1298
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 

AAGAACAAAT TGAGCCTGCT ACCGCTCTCA CTCAATGGCC GCAAAAACAA GAAAAGAAAA 60 
AATCATAAGG AAAAAAT ATG GCA AAA AGT ATT GAA TTG CAA GAG ATA GAA 110 

Met Ala Lys Ser Ile Glu Leu Gln Glu Ile Glu
1 5 10

GTG TGG GAT GGC T^T ACC GCT AGT TCT AAC GCT TTA AGA CAG GCT CAA 158
Val Trp Asp Gly Asn Thr Ala Ser Ser Asn Ala Leu Arg Gln Ala Gln
15 20 25

ATT GAT GTC ATC GCA GCC TAT CCT ATC ACC CCA TCA ACQ CCC ATT GTG 206
Ile Asp Val Ile Ala Ala Tyr Pro Ile Thr Pro Ser Thr Pro Ile Val
30 35 40

CAA AAT TAT GGC TCG TTT AAG GAT AAT GGC TAT GTT GAT GGC GAA TTC 254
Gln Asn Tyr Gly Ser Phe Lys Asp Asn Gly Tyr Val Asp Gly Glu Phe
45 50 55

GTT TTA GTG GAA TCT GAG CAT GCC GCC ATG AGC GCA TGC GTG GGA GCT 302
Val Leu Val Glu Ser Glu His Ala Ala Met Ser Ala Cys Val Gly Ala
AAA AAC ATA AGA AAA AGT AAG TGG TAT GCC TAT TTA GAA GCG CTT GGA 486
Lys Asn Ile Arg Lys Ser Lys Trp Tyr Ala Tyr Leu Glu Ala Leu Gly
140 145 150

TTA GAA AAA TTT TTC CAA TAC ACA TGG GAG ACA AAG AAA AAG AAT AAT 534
Leu Glu Lys Phe Phe Gln Tyr Thr Trp Asp Thr Lys Lys Lys Asn Asn
155 160 165

AAA AAA AAG CTT ATC ATT GAG GAT AAA GAT GGA GAT GAG ATT GAG ATA 582
Lys Lys Lys Leu Ile Ile Asp Asp Lys Asp Gly Asp Glu Ile Glu Ile
170 175 180

AAA GAT CAA TAT AAA GGA GAT TAT GAA GAA CTA AAA AAA GTT CTT GAT 630
Lys Asp Gln Tyr Lys Gly Asp Tyr Glu Glu Leu Lys Lys Val Leu Asp
185 190 195 200

CTT AAC TCA AAA TCT CTT ATT CCC CTT AAA AAT TTT TTA GGG CAA TTT 678
Leu Asn Ser Lys Ser Leu Ile Pro Leu Lys Asn Phe Leu Gly Gln Phe
205 210 215

GCA GAA AAT AAT CAA AAA ACA AAT CCT AAA ATT TTC TAATTTAACA AAATAT 730
Ala Glu Asn Asn Gln Lys Thr Asn Pro Lys Ile Phe
220 225

ATTACAATTA CCAAAAAAGT ATTATTTTTC TTTAAAGGTG CGCTGTGAAA TTGTGGTTTC 790
CTTATTTTTT AGCGATTGTG TTCTTGCATG CATTGGGTTT 830

(2) INFORMATION FOR SEQ ID NO: 504:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 228 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504:

Met Ala Asp Lys Glu Ile Leu Ile Phe Val Glu Gly Pro Ser Asp Lys
1 5 10 15

Val Phe Leu Glu Val Tyr Leu Tyr Phe Leu Glu Arg Phe Pro Ile Lys
20 25 30

Asn Phe Lys Val Gln Asn Val Asp Gly Lys Asp Asn Leu Ser Lys Arg
35 40 45

Leu Leu Glu Ile Glu Lys Tyr Asp Lys Thr Leu Ile Ile Phe Asp Ala
50 55 60

Asp Lys Asp Tyr Glu Ser Asn Lys Lys Glu Ile Leu Lys Ile Val Ser
65 70 75 80

Glu Ser Lys Gln Thr Ile Ser Glu Glu Gln Ile Phe Leu Phe Pro Asn
85 90 95

Asn Gln Asp Asp Gly Asp Leu Glu Thr Leu Leu Leu Lys Ile Ala Asn
100 105 110

His Lys Glu Phe Ile Asn Cys Phe Glu Ser Tyr Leu Asp Cys Ile Lys
(2) INFORMATION FOR SEQ ID NO: 503:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...714
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:503: 

AAAGCACTAT ATTAAGATTA GGATCATTTA ATG GCA GAT AAA GAA ATA CTG ATT 54 

Met Ala Asp Lys Glu Ile Leu Ile
1 5

TTT GTA GAA GGT CCA AGC GAT AAG GTG TTT TTA GAA GTT TAT CTG TAT 102
Phe Val Glu Gly Pro Ser Asp Lys Val Phe Leu Glu Val Tyr Leu Tyr
10 15 20

TTT CTA GAA AGA TTT CCA ATC AAA AAC TTT AAA GTG CAA AAT GTA GAT 150
Phe Leu Glu Arg Phe Pro Ile Lys Asn Phe Lys Val Gln Asn Val Asp
25 30 35 40

GGA AAA GAT AAC CTG TCT AAA CGA TTG CTT GAA ATT GAA AAA TAC GAT 198
Gly Lys Asp Asn Leu Ser Lys Arg Leu Leu Glu Ile Glu Lys Tyr Asp
45 50 55

AAA ACA CTT ATC ATT TTT GAT GCG GAT AAA GAC TAT GAG ACT AAT AAA 246
Lys Thr Leu Ile Ile Phe Asp Ala Asp Lys Asp Tyr Glu Ser Asn Lys
60 65 70

AAA GAG ATT TTA AAA ATT GTA TCA GAA TCG AAA CAA ACT ATT TCA GAA 294
Lys Glu Ile Leu Lys Ile Val Ser Glu Ser Lys Gln Thr Ile Ser Glu
75 80 85

GAA CAA ATT TTT TTA TTT CCT AAT AAT CAA GAT GAT GGC GAT TTA GAA 342
Glu Gln Ile Phe Leu Phe Pro Asn Asn Gln Asp Asp Gly Asp Leu Glu
90 95 100

ACC CTA TTA TTA AAG ATT GCT AAC CAC AAA GAG TTC ATA AAT TGT TTT 390
Thr Leu Leu Leu Lys Ile Ala Asn His Lys Glu Phe Ile Asn Cys Phe
105 110 115 120

GAA AGC TAT TTG GAT TGT ATT AAA AAG AAA GAA CAT TAC AAA CCG ATT 438
Glu Ser Tyr Leu Asp Cys Ile Lys Lys Lys Glu His Tyr Lys Pro Ile
125 130 135 
Thr Tyr Phe Arg Ala Ile Gly Lys Phe Gly Val Gln Phe Arg Thr Ile
140 145 150

GTT TTG TAT CAT AAG GTG GAT GTA GAA ATT GGC ATG AAA ATC TTT CTA 534
Val Leu Tyr His Lys Val Asp Val Glu Ile Gly Met Lys Ile Phe Leu
155 160 165

ACT CCT GAA AGG CGC AGT TTG TTT GAA AGG AGC TTT TTG TTT TTT GTT 582
Thr Pro Glu Arg Arg Ser Leu Phe Glu Arg Ser Phe Leu Phe Phe Val
170 175 180

TCG CAT TCG TGG CAT TTT TAAATGGCGG AGAGAGAGGG ATTCGAACCC TCGAAGGC 638
Ser His Ser Trp His Phe
185 190

TTGCACCTTA CACGCGT 655

(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

Met Ser Phe Leu Asn Ile Leu Asn Ala Glu Asn Leu Ser Tyr Met Ser
1 5 10 15

Ser Ser Tyr Gln Ile Gly Thr Val Phe Met Arg Pro Leu Asn Thr Asn
20 25 30

Lys Leu Leu Gln Gly Ala Ser Ile Leu Gln Gly Tyr Glu Val Asn Pro
35 40 45

Lys Asn Asp Trp Ala Tyr Ser Arg Tyr Tyr Phe Phe Ile Asp Tyr Gly
50 55 60

Asn Val Leu Phe Asn Asn Asp Ser Thr Leu Gln Ala Asn Met Phe Thr
65 70 75 80

Tyr Gly Val Gly Gly Asp Phe Met Val Ala Tyr Ala Lys Asn Pro Ile
85 90 95

Asn Arg Trp Ala Phe Phe Phe Gly Leu Gln Leu Ala Ala Asn Thr Trp
100 105 110

Ile Leu Asn Asn Lys Val Lys Asp Leu Val Val Asn Thr Trp Asp Ser
115 120 125

Leu Lys Asp Phe Asn Phe His Asn Thr Tyr Phe Arg Ala Ile Gly Lys
130 135 140

Phe Gly Val Gln Phe Arg Thr Ile Val Leu Tyr His Lys Val Asp Val
145 150 155 160

Glu Ile Gly Met Lys Ile Phe Leu Thr Pro Glu Arg Arg Ser Leu Phe
165 170 175

Glu Arg Ser Phe Leu Phe Phe Val Ser His Ser Trp His Phe 
180 185 190 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 655 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 31...600
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 

GTGCATTATT TAAGAATTTT AATACTGAGT ATG AGT TTT TTA AAT ATT TTA AAT 54 

Met Ser Phe Leu Asn Ile Leu Asn
1 5

GCT GAA AAT TTG AGT TAT ATG TCT TCT TCT TAT CAA ATA GGC ACG GTG 102
Ala Glu Asn Leu Ser Tyr Met Ser Ser Ser Tyr Gln Ile Gly Thr Val
10 15 20

TTT ATG CGC CCT TTA AAC ACC AAC AAG CTT TTA CAA GGG GCT TCA ATC 150
Phe Met Arg Pro Leu Asn Thr Asn Lys Leu Leu Gln Gly Ala Ser Ile
25 30 35 40

CTT CAA GGC TAT GAA GTG AAT CCT AAA AAC GAT TGG GCT TAT TCT AGG 198
Leu Gln Gly Tyr Glu Val Asn Pro Lys Asn Asp Trp Ala Tyr Ser Arg
45 50 55

TAT TAT TTC TTT ATA GAT TAT GGC AAT GTG CTT TTT AAT AAT GAC TCT 246
Tyr Tyr Phe Phe Ile Asp Tyr Gly Asn Val Leu Phe Asn Asn Asp Ser
60 65 70

ACT TTA CAA GCG AAC ATG TTC ACT TAT GGG GTG GGA GGG GAT TTT ATG 294
Thr Leu Gln Ala Asn Met Phe Thr Tyr Gly Val Gly Gly Asp Phe Met
75 80 85

GTC GCC TAC GCT AAA AAC CCT ATC AAC CGC TGG GCT TTT TTC TTT GGC 342
Val Ala Tyr Ala Lys Asn Pro Ile Asn Arg Trp Ala Phe Phe Phe Gly
90 95 100

TTG CAA CTG GCC GCT AAC ACA TGG ATA CTC AAC AAT AAA GTC AAA GAT 390
Leu Gln Leu Ala Ala Asn Thr Trp Ile Leu Asn Asn Lys Val Lys Asp
105 110 115 120

TTG GTG GTG AAT ACT TGG GAT TCA TTA AAA GAT TTC AAT TTT CAC AAC 438
Leu Val Val Asn Thr Trp Asp Ser Leu Lys Asp Phe Asn Phe His Asn
125 130 135

ACT TAT TTC AGG GCT ATT GGG AAG TTT GGG GTG CAG TTT CGC ACG ATC 486
AAT AAC ACC AAA GAG AAT ACA AAG AGC GAG TCG TAGTTGAAAG AATTTGATTT 889
Asn Asn Thr Lys Glu Asn Thr Lys Ser Glu Ser
245 250

AGAAAGCTAT GATTATTATT TGCCTAAGGA ATTGATCGCA AGCTACCCCG TTTTGCCCAA 949

AGAAAAGGCT AAATTACTCG TCTATGAAAG GCGTTCGCAA 989

(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500:

Met Phe Glu Asp Leu Lys Pro His Leu Gln Glu Leu Arg Lys Arg Leu
1 5 10 15

Met Val Ser Val Gly Thr Ile Leu Val Ala Phe Leu Gly Cys Phe His
20 25 30

Phe Trp Lys Ser Ile Phe Glu Phe Val Lys Asn Ser Tyr Lys Gly Thr
35 40 45

Leu Ile Gln Leu Ser Pro Ile Glu Gly Val Met Val Ala Val Lys Ile
50 55 60

Ser Phe Ser Ala Ala Ile Val Ile Ser Met Pro Ile Ile Phe Trp Gln
65 70 75 80

Leu Trp Leu Phe Ile Ala Pro Gly Leu Tyr Lys Asn Glu Lys Lys Val
85 90 95

Ile Leu Pro Phe Val Phe Phe Gly Ser Gly Met Phe Leu Ile Gly Ala
100 105 110

Ala Phe Ser Tyr Tyr Val Val Phe Pro Phe Ile Ile Glu Tyr Leu Ala
115 120 125

Thr Phe Gly Ser Asp Val Phe Ala Ala Asn Ile Ser Ala Ser Ser Tyr
130 135 140

Val Ser Phe Phe Thr Arg Leu Ile Leu Gly Phe Gly Val Ala Phe Glu
145 150 155 160

Leu Pro Val Leu Ala Tyr Phe Leu Ala Lys Val Gly Leu Ile Thr Asp
165 170 175

Ala Ser Leu Lys Ala Tyr Phe Lys Tyr Ala Ile Val Val Ile Phe Ile
180 185 190

Val Ala Ala Ile Ile Thr Pro Pro Asp Val Val Ser Gln Ile Phe Met
195 200 205

Ala Leu Pro Leu Val Gly Leu Tyr Gly Leu Ser Ile Leu Ile Ala Lys
210 215 220

Met Val Asn Pro Ala Pro Lys Asp Asn Glu Asn Asn Asn Glu Asn Asn 
225 230 235 240 

Asn Glu Asn Asn Thr Lys Glu Asn Thr Lys Ser Glu Ser 
245 250 

(2) INFORMATION FOR SEQ ID NO: 501:
TCT GTA GGA ACQ ATT CTA GTG GCG TTT TTG GGG TGC TTT CAT TTT TGG 212
Ser Val Gly Thr Ile Leu Val Ala Phe Leu Gly Cys Phe His Phe Trp
20 25 30

AAA AGT ATT TTT GAA TTT GTT AAA AAT TCC TAT AAA GGC ACG CTC ATT 260
Lys Ser Ile Phe Glu Phe Val Lys Asn Ser Tyr Lys Gly Thr Leu Ile
35 40 45 50

GAG CTC TCC CCT ATT GAA GGG GTC ATG GTA GCG GTT AAA ATC AGT TTT 308
Gln Leu Ser Pro Ile Glu Gly Val Met Val Ala Val Lys Ile Ser Phe
55 60 65

TCA GCC GCT ATC GTC ATT TCC ATG CCC ATT ATT TTT TGG CAA TTA TGG 356
Ser Ala Ala Ile Val Ile Ser Met Pro Ile Ile Phe Trp Gln Leu Trp
70 75 80

CTC TTT ATC GCT CCA GGG CTT TAC AAG AAT GAA AAA AAA GTG ATT TTG 404
Leu Phe Ile Ala Pro Gly Leu Tyr Lys Asn Glu Lys Lys Val Ile Leu
85 90 95

CCT TTT GTG TTT TTT GGG AGT GGG ATG TTT TTG ATT GGG GCG GCG TTT 452
Pro Phe Val Phe Phe Gly Ser Gly Met Phe Leu Ile Gly Ala Ala Phe
100 105 110

TCT TAT TAT GTG GTG TTC CCT TTC ATT ATT GAA TAC TTA GCC ACT TTT 500
Ser Tyr Tyr Val Val Phe Pro Phe Ile Ile Glu Tyr Leu Ala Thr Phe
115 120 125 130

GGG AGC GAT GTG TTT GCG GCT AAT ATT TCT GCG TCC AGT TAC GTG AGC 548
Gly Ser Asp Val Phe Ala Ala Asn Ile Ser Ala Ser Ser Tyr Val Ser
135 140 145

TTT TTC ACG CGC TTG ATT TTA GGC TTT GGC GTG GCG TTT GAA TTG CCT 596
Phe Phe Thr Arg Leu Ile Leu Gly Phe Gly Val Ala Phe Glu Leu Pro
150 155 160

GTT TTG GCG TAT TTT TTG GCT AAA GTG GGC TTG ATT ACT GAT GCG AGC 644
Val Leu Ala Tyr Phe Leu Ala Lys Val Gly Leu Ile Thr Asp Ala Ser
165 170 175

TTG AAA GCG TAT TTT AAA TAC GCT ATT GTA GTG ATT TTT ATT GTA GCA 692
Leu Lys Ala Tyr Phe Lys Tyr Ala Ile Val Val Ile Phe Ile Val Ala
180 185 190

GCC ATT ATC ACT CCC CCT GAT GTG GTG AGT CAA ATC TTT ATG GCG TTG 740
Ala Ile Ile Thr Pro Pro Asp Val Val Ser Gln Ile Phe Met Ala Leu
195 200 205 210

CCC TTA GTG GGG CTT TAT GGG CTT TCT ATT TTA ATC GCC AAA ATG GTC 788
Pro Leu Val Gly Leu Tyr Gly Leu Ser Ile Leu Ile Ala Lys Met Val
215 220 225 

AAT CCG GCT CCC AAA GAT AAC GAA AAT AAC AAC GAA AAT AAT AAC GAA 836 
Asn Pro Ala Pro Lys Asp Asn Glu Asn Asn Asn Glu Asn Asn Asn Glu 
230 235 240 
130 135 140

Thr Ile Lys Ile Asp Leu Pro Pro Phe Thr Leu Ile Gly Ala Thr Thr
145 150 155 160

Arg Ala Gly Met Leu Ser Asn Pro Leu Arg Asp Arg Phe Gly Met Ser
165 170 175

Phe Arg Met Gln Phe Tyr Asn Pro Ser Glu Leu Ala Leu Ile Ile Lys
180 185 190

Lys Ala Ala Val Lys Leu Asn Gln Asp Ile Lys Gln Glu Ser Ala Asp
195 200 205

Glu Ile Ala Lys Arg Ser Arg Gly Thr Pro Arg Ile Ala Leu Arg Leu
210 215 220

Leu Lys Arg Val Arg Asp Phe Ala Leu Val Lys Asn Ser Ser Leu Met
225 230 235 240

Asp Leu Asn Ile Thr Leu His Ala Leu Asn Glu Leu Gly Val Asn Glu
245 250 255

Leu Gly Phe Asp Glu Ala Asp Leu Ala Tyr Leu Ser Leu Leu Ala Asn
260 265 270

Ala Gln Gly Lys Pro Val Gly Leu Asn Thr Ile Ala Ala Ser Met Arg
275 280 285

Glu Asp Glu Gly Thr Ile Glu Asp Val Ile Glu Pro Phe Leu Leu Ala
290 295 300

Asn Gly Tyr Leu Glu Arg Thr Ala Lys Gly Arg Ile Ala Thr Pro Lys
305 310 315 320

Thr His Glu Leu Leu Lys Ile Pro Thr Leu Asn Pro Gln Thr Leu Phe
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 499:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 989 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 111...869
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:

AGTTTCCAAT GAAGAAGCCT TAAACAAAGA AGTTTCAAGC GATGAATCCC CTAAAGAAGT 60 
CCAATTAGCA ACCGATAACA ACACCAAAGA ACACGACAAA GAAAAAGAGA ATG TTT 116 

Met Phe 
1 

GAA GAT TTA AAA CCG CAT TTA CAG GAA TTA AGA AAG CGT TTG ATG GTT 164 
Glu Asp Leu Lys Pro His Leu Gln Glu Leu Arg Lys Arg Leu Met Val
5 10 15 
Leu Asn Glu Leu Gly Val Asn Glu Leu Gly Phe Asp Glu Ala Asp Leu
250 255 260

GCG TAT TTA TCT TTG TTG GCT AAC GCT CAA GGA AAG CCG GTG GGT TTG 928
Ala Tyr Leu Ser Leu Leu Ala Asn Ala Gln Gly Lys Pro Val Gly Leu
265 270 275 280

AAC ACG ATT GCA GCA TCT ATG AGA GAA GAT GAA GGC ACG ATT GAA GAG 976
Asn Thr Ile Ala Ala Ser Met Arg Glu Asp Glu Gly Thr Ile Glu Asp
285 290 295

GTG ATT GAG CCT TTT TTA CTC GCT AAT GGT TAT TTA GAG CGC ACC GCT 1024
Val Ile Glu Pro Phe Leu Leu Ala Asn Gly Tyr Leu Glu Arg Thr Ala
300 305 310

AAA GGC AGA ATC GCC ACG CCT AAA ACC CAT GAG CTC TTA AAA ATC CCC 1072
Lys Gly Arg Ile Ala Thr Pro Lys Thr His Glu Leu Leu Lys Ile Pro
315 320 325

ACT TTA AAC CCC CAA ACT TTA TTT TAATCTTGTT TAGAAAGAAA ATTACACTAC 1126
Thr Leu Asn Pro Gln Thr Leu Phe
330 335 

AATAACGATA AAATTTTAAA GGGTGTAAAA GTAGATTGTT ATGTT 1171 

(2) INFORMATION FOR SEQ ID NO: 498:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 336 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:

Met Lys Glu Arg Ile Val Asn Leu Glu Thr Leu Asp Phe Glu Ile Ser
1 5 10 15

Gln Glu Val Ser Leu Arg Pro Ser Leu Trp Glu Asp Phe Ile Gly Gln
20 25 30

Glu Lys Ile Lys Ser Asn Leu Gln Ile Ser Ile Cys Ala Ala Lys Lys
35 40 45

Arg Gln Glu Ser Leu Asp His Met Leu Phe Phe Gly Pro Pro Gly Leu
50 55 60

Gly Lys Thr Ser Ile Ser His Ile Ile Ala Lys Glu Met Glu Thr Asn
65 70 75 80

Ile Lys Ile Thr Ala Ala Pro Met Ile Glu Lys Ser Gly Asp Leu Ala
85 90 95

Ala Ile Leu Thr Asn Leu Gln Ala Lys Asp Ile Leu Phe Ile Asp Glu
100 105 110

Ile His Arg Leu Ser Pro Ala Ile Glu Glu Val Leu Tyr Pro Ala Met
115 120 125

Glu Asp Phe Arg Leu Asp Ile Ile Ile Gly Ser Gly Pro Ala Ala Gln
25 30 35 40

ATT TCT ATT TGC GCG GCT AAA AAA CGC CAA GAA AGT TTG GAT CAC ATG 256
Ile Ser Ile Cys Ala Ala Lys Lys Arg Gln Glu Ser Leu Asp His Met
45 50 55

CTT TTT TTT GGC CCG CCC GGT TTG GGT AAA ACT TCA ATC AGC CAT ATC 304
Leu Phe Phe Gly Pro Pro Gly Leu Gly Lys Thr Ser Ile Ser His Ile
60 65 70

ATC GCT AAA GAA ATG GAA ACC AAT ATC AAG ATC ACC GCC GCT CCC ATG 352
Ile Ala Lys Glu Met Glu Thr Asn Ile Lys Ile Thr Ala Ala Pro Met
75 80 85

ATA GAA AAA AGC GGT GAT TTA GCC GCC ATT TTG ACC AAT TTG CAA GCT 400
Ile Glu Lys Ser Gly Asp Leu Ala Ala Ile Leu Thr Asn Leu Gln Ala
90 95 100

AAA GAC ATT CTT TTT ATT GAT GAA ATC CAC CGG CTC AGC CCA GCG ATT 448
Lys Asp Ile Leu Phe Ile Asp Glu Ile His Arg Leu Ser Pro Ala Ile
105 110 115 120

GAA GAG GTT TTA TAC CCG GCG ATG GAA GAT TTT AGG TTG GAT ATT ATC 496
Glu Glu Val Leu Tyr Pro Ala Met Glu Asp Phe Arg Leu Asp Ile Ile
125 130 135

ATA GGC TCA GGC CCA GCG GCT CAA ACC ATT AAA ATT GAT TTA CCC CCT 544
Ile Gly Ser Gly Pro Ala Ala Gln Thr Ile Lys Ile Asp Leu Pro Pro
140 145 150

TTC ACT CTC ATC GGC GCT ACC ACC AGA GCC GGA ATG CTC TCT AAC CCC 592
Phe Thr Leu Ile Gly Ala Thr Thr Arg Ala Gly Met Leu Ser Asn Pro
155 160 165

TTA AGA GAC AGA TTT GGC ATG AGT TTT AGA ATG CAA TTT TAT AAC CCT 640
Leu Arg Asp Arg Phe Gly Met Ser Phe Arg Met Gln Phe Tyr Asn Pro
170 175 180

AGC GAA CTG GCC CTC ATC ATT AAA AAA GCT GCC GTT AAA CTC AAC CAA 688
Ser Glu Leu Ala Leu Ile Ile Lys Lys Ala Ala Val Lys Leu Asn Gln
185 190 195 200

GAC ATC AAA CAA GAA AGT GCT GAT GAA ATC GCT AAA AGG AGT AGA GGC 736
Asp Ile Lys Gln Glu Ser Ala Asp Glu Ile Ala Lys Arg Ser Arg Gly
205 210 215

ACG CCA AGG ATC GCT TTA AGG CTT TTA AAA AGG GTG CGC GAT TTT GCG 784
Thr Pro Arg Ile Ala Leu Arg Leu Leu Lys Arg Val Arg Asp Phe Ala
220 225 230

CTA GTC AAA AAT TCA AGC TTG ATG GAT TTA AAC ATC ACT TTG CAT GCT 832
Leu Val Lys Asn Ser Ser Leu Met Asp Leu Asn Ile Thr Leu His Ala
235 240 245 

TTG AAT GAA TTA GGC GTG AAT GAA TTA GGC TTT GAT GAA GCG GAT TTG 880 
50 55 60

Met Val Arg Thr Asn Ser Ser Ser Ile Thr Gly Asn Ile Leu Asn Tyr
65 70 75 80

Leu Phe Ala Tyr Gly Leu Arg Phe Gly Tyr Gln Thr Phe Arg Pro Ser
85 90 95

Phe Phe Ala Arg Leu Val Lys Pro Asn Ile Ile Gly Arg Arg Ile Tyr
100 105 110

Ile Gln Tyr Tyr Gly Gly Ala Pro Lys Lys Ala Gly Phe Gly Asp Val
115 120 125

Gly Phe Gln Ser Val Met Leu Asn Gly Asp Phe Leu Leu Asp Phe Pro
130 135 140

Leu Pro Phe Val Gly Lys Tyr Leu Tyr Met Gly Gly Tyr Met Gly Leu
145 150 155 160

Gly Leu Gly Val Val Ala His Gly Val Asn Tyr Thr Ala Glu Trp Gly
165 170 175

Met Ser Phe Asn Ala Gly Leu Ala Leu Thr Val Leu Glu Lys Asn Arg
180 185 190

Ile Glu Phe Gly Phe Lys Ile Leu Asn Asn Phe Pro Phe Leu Gln Ser
195 200 205

Asn Ser Ser Lys Glu Thr Trp Trp Gly Ala Met Ala Asn Ile Gly Tyr
210 215 220

Gln Tyr Val Phe
225 

(2) INFORMATION FOR SEQ ID NO:497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 89...1096
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:

GGAAAGAATT GATTCAAAAC GCTATCAAAC AATACGCTGA TGATGTGAAA AAGGGAAACT 60 
TCCCTAACGA ATTAGAAAGT TATCATTA ATG AAA GAA CGG ATA GTC AAT TTA 112 

Met Lys Glu Arg Ile Val Asn Leu
1 5

GAA ACT TTG GAT TTT GAA ATT TCT CAA GAA GTG AGT TTG CGC CCT AGT 160
Glu Thr Leu Asp Phe Glu Ile Ser Gln Glu Val Ser Leu Arg Pro Ser
10 15 20

CTT TGG GAA GAT TTT ATC GGT CAA GAA AAG ATT AAA AGC AAT TTG CAA 208
Leu Trp Glu Asp Phe Ile Gly Gln Glu Lys Ile Lys Ser Asn Leu Gln
AGG CGC ATT TAT ATC CAA TAT TAT GGA GGA GCT CCT AAA AAA GCG GGC 447
Arg Arg Ile Tyr Ile Gln Tyr Tyr Gly Gly Ala Pro Lys Lys Ala Gly
110 115 120

TTT GGG GAT GTA GGG TTT CAA TCG GTT ATG CTG AAT GGG GAT TTT TTA 495
Phe Gly Asp Val Gly Phe Gln Ser Val Met Leu Asn Gly Asp Phe Leu
125 130 135 140

TTG GAT TTT CCT TTG CCT TTT GTG GGG AAA TAC CTT TAT ATG GGG GGT 543
Leu Asp Phe Pro Leu Pro Phe Val Gly Lys Tyr Leu Tyr Met Gly Gly
145 150 155

TAT ATG GGT TTA GGT TTG GGG GTT GTA GCG CAT GGG GTG AAT TAC ACG 591
Tyr Met Gly Leu Gly Leu Gly Val Val Ala His Gly Val Asn Tyr Thr
160 165 170

GCG GAA TGG GGG ATG TCT TTT AAC GCA GGA TTG GCT CTA ACG GTA TTA 639
Ala Glu Trp Gly Met Ser Phe Asn Ala Gly Leu Ala Leu Thr Val Leu
175 180 185

GAA AAA AAC CGC ATT GAA TTT GGA TTT AAA ATT TTG AAT AAT TTC CCT 687
Glu Lys Asn Arg Ile Glu Phe Gly Phe Lys Ile Leu Asn Asn Phe Pro
190 195 200

TTT TTG CAA TCT AAT TCT TCA AAA GAG ACT TGG TGG GGA GCT ATG GCA 735
Phe Leu Gln Ser Asn Ser Ser Lys Glu Thr Trp Trp Gly Ala Met Ala
205 210 215 220

AAC ATT GGG TAT CAA TAT GTG TTC TAAAAAAATA AGAAATCTCA TTTTATGCTT 789
Asn Ile Gly Tyr Gln Tyr Val Phe
225

TGGTTTTATG TTGGGCTTGC ACGCTGAAGA AAATACGACT GAAGGAAATA TGACTGAAGA 849

AAATATCTCT AAAGACGCTC 869 
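
As a cross-check on the entries in this listing, the coding-sequence LOCATION of a nucleic acid entry should span exactly three bases for each residue of the protein entry it encodes. The sketch below is not part of the original listing. It applies that check to coordinates and lengths quoted in the surrounding entries. The function and variable names are illustrative assumptions, and the pairing of each nucleic acid entry with the next-numbered protein entry is inferred from the layout of the listing.

```python
# Illustrative consistency check (not part of the original listing).
# Each tuple holds: nucleic acid SEQ ID NO, CDS start, CDS end,
# protein SEQ ID NO, stated protein length in amino acids.
# All values are copied from the entries in this listing.
ENTRIES = [
    (487, 68, 1624, 488, 519),
    (489, 230, 1390, 490, 387),
    (491, 39, 902, 492, 288),
    (493, 113, 1285, 494, 391),
    (495, 76, 759, 496, 228),
]


def cds_codons(start, end):
    """Return the number of codons in a 1-based, inclusive CDS range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


def check_entries(entries):
    """Map each nucleic acid SEQ ID NO to True when its codon count
    equals the stated length of the paired protein entry."""
    return {
        na_id: cds_codons(start, end) == aa_len
        for na_id, start, end, _prot_id, aa_len in entries
    }


if __name__ == "__main__":
    for seq_id, ok in check_entries(ENTRIES).items():
        status = "consistent" if ok else "MISMATCH"
        print(f"SEQ ID NO:{seq_id}: {status}")
```

In these entries the LOCATION range ends at the last sense codon, so the stop codon that follows it is not counted.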



(2) INFORMATION FOR SEQ ID NO:496:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 228 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:496:



Met Gly Phe Tyr Ala Gly Leu Asn Ala Leu Asp Tyr Asp Thr Ile Asp
1 5 10 15

Pro Lys Tyr Tyr Lys Tyr Ile Lys Tyr Tyr Lys Ala Tyr Glu Asp Lys
20 25 30

Glu Val Glu Glu Leu Ile Arg Asp Leu Lys Arg Ala Asn Ala Lys Ser
35 40 45

Gly Leu Ile Leu Gly Ile Asn Thr Gly Phe Phe Tyr Asn His Glu Ile
His Asp Tyr Asn Gly Tyr Val Lys Gln Asp Tyr Leu Glu Lys Pro Leu
355 360 365

Asp Leu Val Ser Ser Gln Gly Ile Gly Phe Glu Lys Thr Leu Leu Glu
370 375 380

Trp Leu Asp Lys Ile Arg Asn
385 390

(2) INFORMATION FOR SEQ ID NO:495:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 869 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 76...759
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:495:

AAGAAACTTT TAACAAACAA TTCAAGGGAT TTGGCGATTT TGGGTCTTAA AAAATATGCT 60
ATTTTATGGT CTTTA ATG GGG TTT TAT GCA GGA TTG AAC GCG CTT GAT TAT 111
Met Gly Phe Tyr Ala Gly Leu Asn Ala Leu Asp Tyr
1 5 10

GAC ACC ATA GAC CCA AAA TAC TAC AAG TAT ATC AAG TAT TAT AAA GCC 159
Asp Thr Ile Asp Pro Lys Tyr Tyr Lys Tyr Ile Lys Tyr Tyr Lys Ala
15 20 25

TAT GAG GAT AAA GAA GTT GAA GAA TTG ATC AGA GAC TTA AAA AGG GCG 207
Tyr Glu Asp Lys Glu Val Glu Glu Leu Ile Arg Asp Leu Lys Arg Ala
30 35 40

AAC GCT AAA AGC GGG CTT ATT TTA GGG ATC AAT ACC GGG TTT TTT TAC 255
Asn Ala Lys Ser Gly Leu Ile Leu Gly Ile Asn Thr Gly Phe Phe Tyr
45 50 55 60

AAT CAT GAA ATC ATG GTT AGA ACT AAT AGC TCT AGC ATC ACG GGG AAT 303
Asn His Glu Ile Met Val Arg Thr Asn Ser Ser Ser Ile Thr Gly Asn
65 70 75

ATT TTA AAT TAT TTG TTC GCT TAC GGC TTG CGT TTT GGC TAT CAA ACT 351
Ile Leu Asn Tyr Leu Phe Ala Tyr Gly Leu Arg Phe Gly Tyr Gln Thr
80 85 90

TTC AGG CCG TCG TTT TTT GCG CGC TTG GTC AAG CCA AAT ATC ATT GGC 399
Phe Arg Pro Ser Phe Phe Ala Arg Leu Val Lys Pro Asn Ile Ile Gly
95 100 105
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 391 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:494:

Met Ser Leu Thr Ser Leu Leu Asn Pro Lys Ser Leu Glu Asp Phe Leu
1 5 10 15

Gly Gln Glu His Leu Val Gly Lys Asp Ala Pro Leu Phe Lys Ala Leu
20 25 30

Gln Ser Lys His Phe Pro His Ala Phe Phe Tyr Gly Pro Pro Gly Val
35 40 45

Gly Lys Thr Ser Leu Ala Gln Ile Ile Ala Tyr Met Leu Glu Arg Pro
50 55 60

Ile Leu Leu Phe Asn Ala Thr Asp Phe Lys Leu Glu Asp Leu Arg Leu
65 70 75 80

Lys Leu Lys Asn Tyr Gln Asn Thr Leu Leu Lys Pro Val Val Phe Ile
85 90 95

Asp Glu Thr His Arg Leu Asn Lys Thr Gln Gln Glu Phe Leu Leu Pro
100 105 110

Ile Met Glu Lys Asp His Ala Leu Ile Leu Gly Ala Ser Thr Gln Asp
115 120 125

Pro Asn Tyr Ser Leu Ser His Ala Ile Arg Ser Arg Ser Phe Ile Phe
130 135 140

Glu Leu Thr Pro Leu Asn Lys Ser Asp Leu Asp Arg Leu Cys Ala Lys
145 150 155 160

Ala Leu Thr Leu Leu Lys Lys Gln Ile Glu Pro Gly Ala Lys Thr Tyr
165 170 175

Leu Leu Asn Asn Ser Ala Gly Asp Ala Arg Ala Leu Leu Asn Leu Leu
180 185 190

Asp Leu Ser Ala Lys Ile Glu Asp Pro Ile Thr Leu Lys Thr Leu Gln
195 200 205

Ser Leu Arg Pro His Ser Leu Asn Asp Gly Ser Tyr Ser Asp Asp Thr
210 215 220

His Tyr Asn Leu Thr Ser Ala Leu Ile Lys Ser Leu Arg Gly Ser Asp
225 230 235 240

Glu Asn Ala Ser Ile Tyr Tyr Leu Ala Arg Leu Ile Ala Gly Gly Glu
245 250 255

Asn Pro Glu Phe Ile Ala Arg Arg Leu Val Ile Phe Ala Ser Glu Asp
260 265 270

Ile Gly Asn Ala Asn Pro Asn Ala Leu Asn Leu Ala Ala Ser Cys Leu
275 280 285

Phe Ala Val Lys Gln Ile Gly Tyr Pro Glu Ala Arg Ile Ile Leu Ser
290 295 300

Gln Cys Val Ile Tyr Leu Ala Cys Ser Pro Lys Ser Asn Thr Ala Tyr
305 310 315 320

Arg Ala Ile Asn Gln Ala Leu Asp Cys Val Gln Lys Gly Ser Leu Tyr
325 330 335

Pro Ile Pro Lys His Leu Leu Pro Asn Ala Lys Asp Tyr Leu Tyr Pro
340 345 350
AGC GCT AAA ATA GAA GAT CCT ATC ACT TTA AAA ACG CTA CAA TCC TTA 742
Ser Ala Lys Ile Glu Asp Pro Ile Thr Leu Lys Thr Leu Gln Ser Leu
195 200 205 210

CGG CCT CAT AGC CTA AAT GAT GGA TCT TAT AGC GAT GAT ACG CAT TAT 790
Arg Pro His Ser Leu Asn Asp Gly Ser Tyr Ser Asp Asp Thr His Tyr
215 220 225

AAC CTT ACT AGC GCG TTA ATC AAA TCT TTA AGA GGG AGC GAT GAA AAC 838
Asn Leu Thr Ser Ala Leu Ile Lys Ser Leu Arg Gly Ser Asp Glu Asn
230 235 240

GCT TCC ATC TAT TAT CTG GCG CGC TTG ATT GCT GGC GGG GAA AAC CCG 886
Ala Ser Ile Tyr Tyr Leu Ala Arg Leu Ile Ala Gly Gly Glu Asn Pro
245 250 255

GAA TTT ATC GCC AGA AGG CTG GTG ATT TTT GCG AGC GAA GAT ATT GGT 934
Glu Phe Ile Ala Arg Arg Leu Val Ile Phe Ala Ser Glu Asp Ile Gly
260 265 270

AAC GCT AAC CCG AAC GCC CTT AAT TTA GCC GCT TCT TGT TTG TTT GCA 982
Asn Ala Asn Pro Asn Ala Leu Asn Leu Ala Ala Ser Cys Leu Phe Ala
275 280 285 290

GTC AAA CAA ATC GGC TAC CCT GAA GCG CGC ATC ATT TTA AGC CAA TGC 1030
Val Lys Gln Ile Gly Tyr Pro Glu Ala Arg Ile Ile Leu Ser Gln Cys
295 300 305

GTG ATT TAT CTG GCT TGT TCG CCC AAG TCT AAC ACG GCT TAT AGA GCG 1078
Val Ile Tyr Leu Ala Cys Ser Pro Lys Ser Asn Thr Ala Tyr Arg Ala
310 315 320

ATC AAT CAG GCT TTG GAT TGC GTT CAA AAA GGC TCA CTC TAC CCT ATT 1126
Ile Asn Gln Ala Leu Asp Cys Val Gln Lys Gly Ser Leu Tyr Pro Ile
325 330 335

CCT AAA CAC CTG CTG CCT AAC GCT AAA GAT TAC CTT TAC CCG CAT GAT 1174
Pro Lys His Leu Leu Pro Asn Ala Lys Asp Tyr Leu Tyr Pro His Asp
340 345 350

TAT AAC GGC TAT GTC AAA CAA GAT TAT TTG GAA AAA CCC CTA GAT TTG 1222
Tyr Asn Gly Tyr Val Lys Gln Asp Tyr Leu Glu Lys Pro Leu Asp Leu
355 360 365 370

GTT TCT TCT CAA GGC ATA GGA TTT GAA AAA ACC CTT TTA GAA TGG CTT 1270
Val Ser Ser Gln Gly Ile Gly Phe Glu Lys Thr Leu Leu Glu Trp Leu
375 380 385

GAT AAG ATA AGA AAT TGATCTTATA AGTTACATTA AAATGCGACA ATGGTAATAA A 1326
Asp Lys Ile Arg Asn
390

AAATCAATAT TTTTGGATTG AATT 1350
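
The interleaved rows in these entries pair each codon of the coding sequence with its three-letter amino acid. The sketch below is not part of the original listing. It shows one way to reproduce that pairing with the standard genetic code, which is useful for spotting transcription errors in either row. The names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative assumptions, and `Ter` is used here to mark a stop codon.

```python
# Illustrative translation helper (not part of the original listing).
# Standard genetic code, laid out in the conventional T, C, A, G order
# for the first, second and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# One-letter to three-letter amino acid codes; "*" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna):
    """Translate a DNA string (whitespace ignored) codon by codon
    into a list of three-letter amino acid codes."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]]
            for i in range(0, len(dna), 3)]


if __name__ == "__main__":
    # Last five codons of the SEQ ID NO:493 coding sequence plus the
    # stop codon that follows them in the listing.
    print(" ".join(translate("GAT AAG ATA AGA AAT TGA")))
    # prints: Asp Lys Ile Arg Asn Ter
```

Comparing the translated row with the printed amino acid row flags cases where a codon such as TAC has been misread as TAG.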

(2) INFORMATION FOR SEQ ID NO:494:
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:493:

ACCTGAATAA AGAGTTGCAA GACGCTCTGC ACAAACACTC TAAAAATACC AAAACCCCAA 60
CGAAAAATTT AAACACCCCT ACGAATTTTT ACGAATTGAT TTTATTTAAA AA ATG AGC 118
Met Ser
1

CTG ACT TCG CTT TTA AAC CCA AAA AGC CTA GAA GAT TTT TTA GGC CAA 166
Leu Thr Ser Leu Leu Asn Pro Lys Ser Leu Glu Asp Phe Leu Gly Gln
5 10 15

GAG CAT TTA GTA GGG AAA GAC GCC CCC TTA TTT AAA GCC CTA CAA TCC 214
Glu His Leu Val Gly Lys Asp Ala Pro Leu Phe Lys Ala Leu Gln Ser
20 25 30

AAA CAC TTC CCC CAT GCC TTT TTC TAT GGC CCT CCT GGC GTG GGT AAA 262
Lys His Phe Pro His Ala Phe Phe Tyr Gly Pro Pro Gly Val Gly Lys
35 40 45 50

ACA AGC CTG GCT CAA ATC ATC GCC TAT ATG CTA GAG CGC CCC ATT CTT 310
Thr Ser Leu Ala Gln Ile Ile Ala Tyr Met Leu Glu Arg Pro Ile Leu
55 60 65

TTA TTC AAT GCG ACG GAT TTT AAA TTA GAG GAT TTG CGC CTT AAG CTT 358
Leu Phe Asn Ala Thr Asp Phe Lys Leu Glu Asp Leu Arg Leu Lys Leu
70 75 80

AAA AAT TAC CAA AAT ACC CTT TTA AAA CCC GTT GTT TTT ATT GAT GAA 406
Lys Asn Tyr Gln Asn Thr Leu Leu Lys Pro Val Val Phe Ile Asp Glu
85 90 95

ACC CAC AGA TTG AAT AAA ACC CAA CAA GAA TTT TTA CTC CCC ATT ATG 454
Thr His Arg Leu Asn Lys Thr Gln Gln Glu Phe Leu Leu Pro Ile Met
100 105 110

GAA AAA GAT CAC GCT TTA ATT TTA GGG GCT AGC ACG CAA GAT CCT AAT 502
Glu Lys Asp His Ala Leu Ile Leu Gly Ala Ser Thr Gln Asp Pro Asn
115 120 125 130

TAC AGC CTA AGC CAT GCG ATC CGA TCA AGA AGT TTT ATT TTT GAA TTA 550
Tyr Ser Leu Ser His Ala Ile Arg Ser Arg Ser Phe Ile Phe Glu Leu
135 140 145

ACC CCC CTA AAC AAG AGC GAT TTA GAC AGG CTT TGC GCT AAA GCT TTA 598
Thr Pro Leu Asn Lys Ser Asp Leu Asp Arg Leu Cys Ala Lys Ala Leu
150 155 160

ACA TTG CTC AAA AAA CAA ATA GAG CCT GGC GCT AAA ACC TAT CTT TTA 646
Thr Leu Leu Lys Lys Gln Ile Glu Pro Gly Ala Lys Thr Tyr Leu Leu
165 170 175

AAC AAC AGC GCT GGC GAC GCT AGA GCG TTA TTA AAC CTT TTA GAT TTG 694
Asn Asn Ser Ala Gly Asp Ala Arg Ala Leu Leu Asn Leu Leu Asp Leu
180 185 190
Met Ser Lys Ser Leu Tyr Gln Thr Leu Asn Val Ser Glu Asn Ala Ser
1 5 10 15

Gln Asp Glu Ile Lys Lys Ser Tyr Arg Arg Leu Ala Arg Gln Tyr His
20 25 30

Pro Asp Leu Asn Lys Thr Lys Glu Ala Glu Glu Lys Phe Lys Glu Ile
35 40 45

Asn Ala Ala Tyr Glu Ile Leu Ser Asp Glu Glu Lys Arg Arg Gln Tyr
50 55 60

Asp Gln Phe Gly Asp Asn Met Phe Gly Gly Gln Asn Phe Ser Asp Phe
65 70 75 80

Ala Arg Ser Arg Gly Pro Ser Glu Asp Leu Asp Asp Ile Leu Ser Ser
85 90 95

Ile Phe Gly Lys Gly Gly Phe Ser Gln Arg Phe Ser Gln Asn Ser Gln
100 105 110

Gly Phe Ser Gly Phe Asn Phe Ser Asn Phe Ala Pro Glu Asn Leu Asp
115 120 125

Ile Thr Ala Ala Leu Asn Val Ser Val Leu Asp Thr Leu Leu Gly Asn
130 135 140

Lys Lys Gln Val Ser Ile Asn Asn Glu Thr Phe Ser Leu Lys Ile Pro
145 150 155 160

Ile Gly Val Glu Glu Gly Glu Lys Ile Arg Val Arg Asn Lys Gly Lys
165 170 175

Thr Gly Arg Thr Thr Arg Gly Asp Leu Leu Leu Glu Ile His Ile Glu
180 185 190

Glu Asp Glu Met Tyr Arg Arg Glu Lys Asp Asp Ile Thr Gln Ile Phe
195 200 205

Asp Leu Pro Leu Lys Thr Ala Leu Phe Gly Gly Lys Ile Glu Ile Ala
210 215 220

Thr Trp His Lys Thr Leu Thr Leu Thr Ile Pro Pro Asn Thr Lys Ala
225 230 235 240

Met Gln Lys Phe Arg Ile Lys Glu Lys Gly Ile Lys Asn Arg Lys Thr
245 250 255

Ser His Val Gly Asp Leu Tyr Leu Gln Ala Arg Leu Ile Leu Pro Lys
260 265 270

Thr Glu Thr Leu Ser Asn Glu Leu Lys Ala Leu Leu Glu Lys Glu Leu
275 280 285



(2) INFORMATION FOR SEQ ID NO:493:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1350 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 113...1285
(D) OTHER INFORMATION:
GTC TCT GTT TTA GAC ACC CTT TTA GGC AAT AAA AAA CAA GTG AGC ATC 488
Val Ser Val Leu Asp Thr Leu Leu Gly Asn Lys Lys Gln Val Ser Ile
135 140 145 150

AAT AAT GAG ACT TTT AGC CTT AAA ATC CCT ATT GGC GTG GAA GAG GGC 536
Asn Asn Glu Thr Phe Ser Leu Lys Ile Pro Ile Gly Val Glu Glu Gly
155 160 165

GAA AAG ATT AGG GTT CGC AAC AAG GGG AAA ACG GGG CGA ACG ACT AGG 584
Glu Lys Ile Arg Val Arg Asn Lys Gly Lys Thr Gly Arg Thr Thr Arg
170 175 180

GGC GAT TTG CTC TTA GAG ATC CAT ATT GAA GAA GAT GAA ATG TAT AGG 632
Gly Asp Leu Leu Leu Glu Ile His Ile Glu Glu Asp Glu Met Tyr Arg
185 190 195

CGC GAG AAA GAT GAT ATT ACC CAA ATC TTT GAT TTA CCC TTA AAA ACG 680
Arg Glu Lys Asp Asp Ile Thr Gln Ile Phe Asp Leu Pro Leu Lys Thr
200 205 210

GCT CTT TTT GGA GGG AAA ATT GAA ATC GCT ACT TGG CAT AAA ACC TTA 728
Ala Leu Phe Gly Gly Lys Ile Glu Ile Ala Thr Trp His Lys Thr Leu
215 220 225 230

ACC CTA ACC ATT CCC CCT AAC ACC AAA GCG ATG CAA AAA TTC CGC ATT 776
Thr Leu Thr Ile Pro Pro Asn Thr Lys Ala Met Gln Lys Phe Arg Ile
235 240 245

AAA GAA AAA GGG ATC AAA AAC AGA AAA ACT TCG CAT GTG GGG GAT TTG 824
Lys Glu Lys Gly Ile Lys Asn Arg Lys Thr Ser His Val Gly Asp Leu
250 255 260

TAT TTG CAG GCT CGT TTG ATT TTG CCT AAA ACT GAA ACG CTT TCT AAT 872
Tyr Leu Gln Ala Arg Leu Ile Leu Pro Lys Thr Glu Thr Leu Ser Asn
265 270 275

GAG TTG AAA GCG TTA TTA GAA AAA GAA TTG TAAGGAGGAA TCGTGTGCGA TTA 925
Glu Leu Lys Ala Leu Leu Glu Lys Glu Leu
280 285

TGATGAACCG CTTTATTTAA TCAGCGTCGT GGCTAAAATC TTAGGCGTGC ACCCTCAAAC 985
CTTGC 990

(2) INFORMATION FOR SEQ ID NO:492:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 288 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:492:
(2) INFORMATION FOR SEQ ID NO:491:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 990 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 39...902
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:491:

ATGATAGTAT TCTAATAAAA CTATTTTAAA GGTGATCC ATG AGT AAG AGT TTA TAC 56
Met Ser Lys Ser Leu Tyr
1 5

CAA ACT TTA AAT GTG AGC GAA AAC GCC AGC CAA GAT GAA ATC AAA AAA 104
Gln Thr Leu Asn Val Ser Glu Asn Ala Ser Gln Asp Glu Ile Lys Lys
10 15 20

TCC TAC CGC CGT TTA GCC CGA CAA TAC CAC CCG GAT TTG AAT AAA ACC 152
Ser Tyr Arg Arg Leu Ala Arg Gln Tyr His Pro Asp Leu Asn Lys Thr
25 30 35

AAA GAA GCC GAA GAG AAA TTC AAA GAA ATC AAC GCC GCT TAT GAA ATT 200
Lys Glu Ala Glu Glu Lys Phe Lys Glu Ile Asn Ala Ala Tyr Glu Ile
40 45 50

TTG AGC GAT GAA GAA AAA CGC CGC CAA TAC GAT CAG TTT GGC GAT AAC 248
Leu Ser Asp Glu Glu Lys Arg Arg Gln Tyr Asp Gln Phe Gly Asp Asn
55 60 65 70

ATG TTT GGC GGG CAG AAT TTC AGC GAT TTT GCC AGA AGC CGT GGT CCT 296
Met Phe Gly Gly Gln Asn Phe Ser Asp Phe Ala Arg Ser Arg Gly Pro
75 80 85

AGT GAA GAT TTA GAC GAT ATT TTA AGC TCT ATT TTT GGG AAA GGA GGC 344
Ser Glu Asp Leu Asp Asp Ile Leu Ser Ser Ile Phe Gly Lys Gly Gly
90 95 100

TTT TCG CAA AGA TTT TCT CAA AAC TCG CAA GGC TTT TCT GGC TTT AAT 392
Phe Ser Gln Arg Phe Ser Gln Asn Ser Gln Gly Phe Ser Gly Phe Asn
105 110 115

TTT TCC AAT TTC GCC CCT GAA AAT TTA GAC ATA ACC GCC GCT TTA AAT 440
Phe Ser Asn Phe Ala Pro Glu Asn Leu Asp Ile Thr Ala Ala Leu Asn
120 125 130
(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:490:



Met Ile Pro Lys Glu Arg Met Glu Arg Ala Leu Gly Ser Gly Val Ile
1 5 10 15

Ile Ser Lys Asp Gly Tyr Ile Val Thr Asn Asn His Val Ile Asp Gly
20 25 30

Ala Asp Lys Ile Lys Val Thr Ile Pro Gly Ser Asn Lys Glu Tyr Ser
35 40 45

Ala Thr Leu Val Gly Thr Asp Ser Glu Ser Asp Leu Ala Val Ile Arg
50 55 60

Ile Thr Lys Asp Asn Leu Pro Thr Ile Lys Phe Ser Asp Ser Asn Asp
65 70 75 80

Ile Ser Val Gly Asp Leu Val Phe Ala Ile Gly Asn Pro Phe Gly Val
85 90 95

Gly Glu Ser Val Thr Gln Gly Ile Val Ser Ala Leu Asn Lys Ser Gly
100 105 110

Ile Gly Ile Asn Ser Tyr Glu Asn Phe Ile Gln Thr Asp Ala Ser Ile
115 120 125

Asn Pro Gly Asn Ser Gly Gly Ala Leu Ile Asp Ser Arg Gly Gly Leu
130 135 140

Val Gly Ile Asn Thr Ala Ile Ile Ser Lys Thr Gly Gly Asn His Gly
145 150 155 160

Ile Gly Phe Ala Ile Pro Ser Asn Met Val Lys Asp Thr Val Thr Gln
165 170 175

Leu Ile Lys Thr Gly Lys Ile Glu Arg Gly Tyr Leu Gly Val Gly Leu
180 185 190

Gln Asp Leu Ser Gly Asp Leu Gln Asn Ser Tyr Asp Asn Lys Glu Gly
195 200 205

Ala Val Val Ile Ser Val Glu Lys Asp Ser Pro Ala Lys Lys Ala Gly
210 215 220

Ile Leu Val Trp Asp Leu Ile Thr Glu Val Asn Gly Lys Lys Val Lys
225 230 235 240

Asn Thr Asn Glu Leu Arg Asn Leu Ile Gly Ser Met Leu Pro Asn Gln
245 250 255

Arg Val Thr Leu Lys Val Ile Arg Asp Lys Lys Glu Arg Ala Phe Thr
260 265 270

Leu Thr Leu Ala Glu Arg Lys Asn Pro Asn Lys Lys Glu Thr Ile Ser
275 280 285

Ala Gln Asn Gly Ala Gln Gly Gln Leu Asn Gly Leu Gln Val Glu Asp
290 295 300

Leu Thr Gln Glu Thr Lys Arg Ser Met Arg Leu Ser Asp Asp Val Gln
305 310 315 320

Gly Val Leu Val Ser Gln Val Asn Glu Asn Ser Pro Ala Glu Gln Ala
325 330 335

Gly Phe Arg Gln Gly Asn Ile Ile Thr Lys Ile Glu Glu Val Glu Val
340 345 350

Lys Ser Val Ala Asp Phe Asn His Ala Leu Glu Lys Tyr Lys Gly Lys
355 360 365

Pro Lys Arg Phe Leu Val Leu Asp Leu Asn Gln Gly Tyr Arg Ile Ile
370 375 380

Leu Val Lys
385
ATT AGC GTA GAA AAA GAC TCT CCG GCT AAA AAA GCA GGG ATT TTG GTG 910
Ile Ser Val Glu Lys Asp Ser Pro Ala Lys Lys Ala Gly Ile Leu Val
215 220 225

TGG GAT TTG ATC ACC GAA GTC AAT GGG AAA AAG GTT AAA AAC ACG AAT 958
Trp Asp Leu Ile Thr Glu Val Asn Gly Lys Lys Val Lys Asn Thr Asn
230 235 240

GAG TTA AGA AAT CTA ATC GGG TCC ATG CTA CCC AAT CAA AGA GTA ACC 1006
Glu Leu Arg Asn Leu Ile Gly Ser Met Leu Pro Asn Gln Arg Val Thr
245 250 255

TTA AAA GTC ATT AGA GAC AAA AAA GAA CGC GCT TTC ACC CTC ACT CTA 1054
Leu Lys Val Ile Arg Asp Lys Lys Glu Arg Ala Phe Thr Leu Thr Leu
260 265 270 275

GCT GAA AGG AAA AAC CCT AAC AAA AAA GAA ACC ATT TCT GCT CAA AAC 1102
Ala Glu Arg Lys Asn Pro Asn Lys Lys Glu Thr Ile Ser Ala Gln Asn
280 285 290 

GGC GCG CAA GGC CAA TTG AAC GGG CTT CAA GTA GAA GAT TTA ACT CAA 1150
Gly Ala Gln Gly Gln Leu Asn Gly Leu Gln Val Glu Asp Leu Thr Gln
295 300 305

GAA ACC AAA AGG TCT ATG CGT TTG AGC GAT GAT GTT CAA GGG GTT TTA 1198
Glu Thr Lys Arg Ser Met Arg Leu Ser Asp Asp Val Gln Gly Val Leu
310 315 320

GTC TCT CAA GTG AAT GAA AAT TCC CCA GCA GAG CAA GCC GGA TTT AGG 1246
Val Ser Gln Val Asn Glu Asn Ser Pro Ala Glu Gln Ala Gly Phe Arg
325 330 335

CAA GGT AAC ATT ATC ACA AAA ATT GAA GAG GTT GAA GTT AAA AGC GTT 1294
Gln Gly Asn Ile Ile Thr Lys Ile Glu Glu Val Glu Val Lys Ser Val
340 345 350 355

GCG GAT TTT AAC CAT GCT TTA GAA AAG TAT AAA GGC AAA CCC AAA CGA 1342
Ala Asp Phe Asn His Ala Leu Glu Lys Tyr Lys Gly Lys Pro Lys Arg
360 365 370

TTC TTA GTT TTA GAC TTG AAT CAA GGT TAT AGG ATC ATT TTG GTG AAA T 1391
Phe Leu Val Leu Asp Leu Asn Gln Gly Tyr Arg Ile Ile Leu Val Lys
375 380 385

GATAGGGGTG GGTCGTTAGT CGCATGTCTT TGATTAGAGT GAATGGGGAA GCTTTTAAAC 1451
TCTCTTTAGA AAGTTTAGAA GAAGACCCTT TTGAAACTAA AGAAACGCTA GAAACGCTTA 1511
TCAAACAAAC GAGCGTTG 1529

(2) INFORMATION FOR SEQ ID NO:490:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 387 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
Met Ile Pro
1

AAA GAA AGA ATG GAA AGG GCT TTA GGC AGC GGC GTA ATC ATT TCT AAA 286
Lys Glu Arg Met Glu Arg Ala Leu Gly Ser Gly Val Ile Ile Ser Lys
5 10 15

GAC GGC TAT ATT GTA ACT AAT AAC CAT GTG ATT GAT GGC GCG GAT AAG 334
Asp Gly Tyr Ile Val Thr Asn Asn His Val Ile Asp Gly Ala Asp Lys
20 25 30 35

ATT AAA GTT ACC ATT CCA GGG AGC AAT AAA GAA TAT TCC GCC ACT CTA 382
Ile Lys Val Thr Ile Pro Gly Ser Asn Lys Glu Tyr Ser Ala Thr Leu
40 45 50

GTA GGC ACC GAT TCT GAA AGC GAT TTA GCG GTG ATT CGC ATC ACT AAA 430
Val Gly Thr Asp Ser Glu Ser Asp Leu Ala Val Ile Arg Ile Thr Lys
55 60 65

GAC AAT CTG CCC ACG ATC AAA TTC TCT GAT TCT AAT GAT ATT TCA GTG 478
Asp Asn Leu Pro Thr Ile Lys Phe Ser Asp Ser Asn Asp Ile Ser Val
70 75 80

GGC GAT TTG GTT TTT GCG ATT GGT AAC CCT TTT GGC GTG GGC GAA AGC 526
Gly Asp Leu Val Phe Ala Ile Gly Asn Pro Phe Gly Val Gly Glu Ser
85 90 95

GTT ACG CAA GGC ATT GTT TCA GCG CTC AAT AAA AGC GGG ATT GGG ATC 574
Val Thr Gln Gly Ile Val Ser Ala Leu Asn Lys Ser Gly Ile Gly Ile
100 105 110 115

AAC AGC TAT GAG AAT TTC ATT CAA ACA GAC GCT TCC ATC AAT CCT GGA 622
Asn Ser Tyr Glu Asn Phe Ile Gln Thr Asp Ala Ser Ile Asn Pro Gly
120 125 130

AAT TCC GGC GGC GCT TTA ATT GAT AGC CGT GGA GGG TTA GTG GGG ATT 670
Asn Ser Gly Gly Ala Leu Ile Asp Ser Arg Gly Gly Leu Val Gly Ile
135 140 145

AAT ACC GCT ATT ATC TCT AAA ACT GGG GGC AAC CAC GGC ATT GGC TTT 718
Asn Thr Ala Ile Ile Ser Lys Thr Gly Gly Asn His Gly Ile Gly Phe
150 155 160

GCC ATC CCT TCT AAC ATG GTT AAA GAT ACT GTA ACC CAA CTC ATC AAA 766
Ala Ile Pro Ser Asn Met Val Lys Asp Thr Val Thr Gln Leu Ile Lys
165 170 175

ACC GGT AAG ATT GAA AGA GGT TAC TTG GGC GTG GGC TTG CAA GAT TTG 814
Thr Gly Lys Ile Glu Arg Gly Tyr Leu Gly Val Gly Leu Gln Asp Leu
180 185 190 195

AGT GGC GAT TTG CAA AAT TCT TAT GAC AAC AAA GAA GGG GCG GTA GTC 862
Ser Gly Asp Leu Gln Asn Ser Tyr Asp Asn Lys Glu Gly Ala Val Val
200 205 210
Lys Ala Thr Leu Trp Arg Ile Val Phe Phe Phe Leu Gly Ser Val Phe
290 295 300

Val Ile Ser Val Phe Leu Pro Met Asn Asp Ser Ser Ile Thr Gln Ser
305 310 315 320

Pro Phe Val Ser Val Leu Glu Arg Ile Asn Leu Pro Phe Ile Gly Met
325 330 335

Gly Ile Pro Tyr Val Ala Asp Ile Met Asn Ala Val Ile Ile Thr Ala
340 345 350

Met Phe Ser Thr Ala Asn Ser Gly Leu Tyr Gly Ala Ser Arg Met Ile
355 360 365

Tyr Gly Leu Ser Lys Gln Lys Met Phe Phe Lys Val Phe Ser Gln Leu
370 375 380

Asn Arg Gln Gly Thr Pro Thr Tyr Ala Met Phe Phe Ser Leu Ser Phe
385 390 395 400

Ser Leu Ile Gly Leu Leu Val Gln Ile Tyr Ala Lys Glu Asn Val Val
405 410 415

Glu Ala Leu Ile Asn Val Ile Ser Phe Thr Val Ile Ile Val Trp Val
420 425 430

Ser Val Ser Val Ser Gln Tyr Ser Phe Arg Lys Gln Tyr Leu Lys Ala
435 440 445

Gly His Ser Leu Glu Asp Leu Pro Tyr Lys Ala Pro Phe Leu Pro Phe
450 455 460

Leu Gln Leu Ile Gly Ile Thr Gly Cys Ala Ile Gly Val Ile Gly Ser
465 470 475 480

Ala Met Asp Lys Asp Gln Arg Ile Gly Met Ile Leu Thr Ile Val Phe
485 490 495

Ala Val Ile Cys Tyr Ile Gly Tyr Tyr Phe Thr Gln Lys Ala Asn Glu
500 505 510

Asn Asn Lys Lys Asp Leu Ile



(2) INFORMATION FOR SEQ ID NO:489:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1529 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 230...1390
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:489:

TGTGTTTAAT GACCCCTTTT TCCAACAATT TTTTGGGGAT TTGGGTGGC ATG ATT CCT 238
515

TTAAGCTTGA ATGCGGGCAA TATCCAAATC CAGAGCATGC CCAAAGTTAA AGAGCGAGTG 60
AGTGTCCCCT CTAAAGACGA TACGGATCTA TTCTTACCAC GATTCTATTA AGGACTCTAT 120
TAAGGCGGTG GTGAATATCT CCACTGAAAA GAAGATTAAA AACAATTTTA TAGGTGGCGG 180
AAT GAA AAT AAC AAA AAA GAT TTG ATA TAATCTTTTC TTAATTTTGA AGTTTAG 1651
Asn Glu Asn Asn Lys Lys Asp Leu Ile
515

CAAATTTTAA GGAAGTAACC ATGATGAAAA AAACCCTTTT TATCTCTTTG GCTTTAGC 1709

(2) INFORMATION FOR SEQ ID NO:488:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 519 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:488:

Met Leu Leu Ile Val Phe Phe Lys Phe Tyr Phe Gln Tyr Ser Ile Lys
1 5 10 15

Lys Lys Ser Phe Tyr Phe Ile Phe Val Ile Ile Gln Ala Ile Phe Ile
20 25 30

Phe Asn Leu Arg Arg Cys Arg Met Asp Asn Gln Lys Ile Thr His Gln
35 40 45

Asn Ile Thr Gln Lys Gln Gly Glu Leu Lys Arg Asp Met Lys Met Arg
50 55 60

His Leu Leu Met Ile Ala Phe Gly Gly Ala Ile Gly Thr Gly Leu Phe
65 70 75 80

Val Gly Thr Gly Gly Asn Ile Ala Ser Ala Gly Pro Leu Gly Thr Leu
85 90 95

Ile Ala Tyr Cys Phe Gly Gly Leu Val Val Tyr Cys Ile Met Leu Ser
100 105 110

Leu Gly Glu Leu Ala Ser Val Tyr Pro Thr Thr Gly Ser Phe Gly Asp
115 120 125

Tyr Ala Ala Lys Phe Ile Gly Pro Gly Thr Gly Tyr Met Val Phe Trp
130 135 140

Met Tyr Trp Leu Gly Trp Val Ile Thr Val Ala Leu Glu Tyr Ile Ala
145 150 155 160

Ile Gly Met Leu Met Gln Arg Trp Phe Ala Asp Ile Pro Ile His Tyr
165 170 175

Trp Val Ile Leu Cys Ile Ala Leu Val Phe Leu Leu Asn Phe Phe Ser
180 185 190

Val Lys Ile Phe Ala Glu Gly Glu Phe Phe Phe Ser Leu Ile Lys Val
195 200 205

Leu Ala Val Ile Ala Phe Ile Gly Ile Gly Ala Ile Gly Ile Ile Tyr
210 215 220

Gln Ile Tyr Ser His Gly Phe Gly Ser Ile Phe Asp Asn Phe His Phe
225 230 235 240

Gly Asp Lys Gly Phe Phe Pro Asn Gly Ser Ala Ala Val Phe Ser Ala
245 250 255

Met Leu Ala Val Ile Phe Ala Phe Thr Gly Thr Glu Val Ile Gly Val
260 265 270

Ala Val Gly Glu Thr Lys Asn Ala Ser Glu Val Met Pro Lys Ala Ile
275 280 285
GCG ATT AAA GCG ACC TTG TGG CGG ATT GTC TTT TTC TTT TTA GGC TCT 973
Ala Ile Lys Ala Thr Leu Trp Arg Ile Val Phe Phe Phe Leu Gly Ser
290 295 300

GTG TTT GTC ATT TCT GTT TTT TTA CCC ATG AAT GAT TCT TCT ATC ACG 1021
Val Phe Val Ile Ser Val Phe Leu Pro Met Asn Asp Ser Ser Ile Thr
305 310 315

CAA AGC CCT TTT GTG AGC GTT TTA GAA CGC ATT AAT TTG CCC TTT ATT 1069
Gln Ser Pro Phe Val Ser Val Leu Glu Arg Ile Asn Leu Pro Phe Ile
320 325 330

GGC ATG GGT ATC CCT TAT GTG GCT GAT ATA ATG AAC GCT GTT ATC ATT 1117
Gly Met Gly Ile Pro Tyr Val Ala Asp Ile Met Asn Ala Val Ile Ile
335 340 345 350

ACG GCG ATG TTT TCT ACC GCT AAT TCA GGG CTT TAT GGA GCG AGC CGC 1165
Thr Ala Met Phe Ser Thr Ala Asn Ser Gly Leu Tyr Gly Ala Ser Arg
355 360 365

ATG ATT TAT GGG CTG TCC AAA CAA AAG ATG TTT TTT AAG GTT TTT TCC 1213
Met Ile Tyr Gly Leu Ser Lys Gln Lys Met Phe Phe Lys Val Phe Ser
370 375 380

CAA CTC AAC CGA CAA GGC ACG CCC ACT TAT GCG ATG TTT TTT TCC CTT 1261
Gln Leu Asn Arg Gln Gly Thr Pro Thr Tyr Ala Met Phe Phe Ser Leu
385 390 395 

TCT TTT TCT CTC ATA GGG CTT TTA GTC CAA ATT TAT GCC AAA GAA AAT 1309
Ser Phe Ser Leu Ile Gly Leu Leu Val Gln Ile Tyr Ala Lys Glu Asn
400 405 410

GTC GTG GAA GCT TTG ATT AAT GTG ATC AGT TTC ACG GTG ATT ATT GTG 1357
Val Val Glu Ala Leu Ile Asn Val Ile Ser Phe Thr Val Ile Ile Val
415 420 425 430

TGG GTT AGC GTG TCT GTT TCG CAA TAT TCT TTC CGC AAG CAA TAC TTA 1405
Trp Val Ser Val Ser Val Ser Gln Tyr Ser Phe Arg Lys Gln Tyr Leu
435 440 445

AAA GCC GGG CAT TCT TTA GAG GAT TTG CCT TAT AAA GCC CCC TTT CTA 1453
Lys Ala Gly His Ser Leu Glu Asp Leu Pro Tyr Lys Ala Pro Phe Leu
450 455 460

CCC TTT TTG CAA CTC ATA GGG ATC ACT GGG TGT GCC ATC GGC GTG ATT 1501
Pro Phe Leu Gln Leu Ile Gly Ile Thr Gly Cys Ala Ile Gly Val Ile
465 470 475

GGT TCG GCT ATG GAT AAG GAT CAA CGC ATT GGG ATG ATT TTA ACG ATT 1549
Gly Ser Ala Met Asp Lys Asp Gln Arg Ile Gly Met Ile Leu Thr Ile
480 485 490

GTT TTC GCT GTT ATT TGT TAC ATT GGA TAC TAT TTT ACA CAA AAA GCT 1597
Val Phe Ala Val Ile Cys Tyr Ile Gly Tyr Tyr Phe Thr Gln Lys Ala
495 500 505 510
ATG CGC CAT CTC TTA ATG ATT GCA TTT GGA GGA GCG ATC GGC ACA GGG 301
Met Arg His Leu Leu Met Ile Ala Phe Gly Gly Ala Ile Gly Thr Gly
65 70 75

CTT TTT GTA GGC ACT GGG GGT AAT ATT GCG AGC GCT GGC CCT TTA GGG 349
Leu Phe Val Gly Thr Gly Gly Asn Ile Ala Ser Ala Gly Pro Leu Gly
80 85 90

ACC TTG ATC GCT TAT TGT TTT GGA GGG CTT GTG GTC TAT TGT ATC ATG 397
Thr Leu Ile Ala Tyr Cys Phe Gly Gly Leu Val Val Tyr Cys Ile Met
95 100 105 110

CTC TCT TTA GGC GAA TTG GCT AGC GTT TAT CCC ACT ACA GGA AGT TTT 445
Leu Ser Leu Gly Glu Leu Ala Ser Val Tyr Pro Thr Thr Gly Ser Phe
115 120 125

GGG GAT TAT GCG GCT AAA TTC ATA GGC CCT GGC ACG GGC TAT ATG GTT 493
Gly Asp Tyr Ala Ala Lys Phe Ile Gly Pro Gly Thr Gly Tyr Met Val
130 135 140

TTT TGG ATG TAT TGG CTT GGC TGG GTG ATC ACG GTG GCG TTA GAA TAC 541
Phe Trp Met Tyr Trp Leu Gly Trp Val Ile Thr Val Ala Leu Glu Tyr
145 150 155

ATC GCT ATA GGC ATG CTC ATG CAA CGC TGG TTT GCG GAT ATT CCC ATC 589
Ile Ala Ile Gly Met Leu Met Gln Arg Trp Phe Ala Asp Ile Pro Ile
160 165 170

CAT TAT TGG GTT ATT TTA TGC ATT GCG TTA GTT TTT TTA TTG AAC TTT 637
His Tyr Trp Val Ile Leu Cys Ile Ala Leu Val Phe Leu Leu Asn Phe
175 180 185 190

TTT TCG GTT AAA ATT TTT GCC GAG GGC GAG TTT TTC TTT AGC CTG ATT 685
Phe Ser Val Lys Ile Phe Ala Glu Gly Glu Phe Phe Phe Ser Leu Ile
195 200 205

AAA GTT TTA GCG GTG ATC GCT TTT ATA GGC ATT GGC GCG ATT GGG ATT 733
Lys Val Leu Ala Val Ile Ala Phe Ile Gly Ile Gly Ala Ile Gly Ile
210 215 220

ATT TAT CAA ATC TAT TCG CAT GGG TTT GGT TCT ATT TTT GAT AAT TTC 781
Ile Tyr Gln Ile Tyr Ser His Gly Phe Gly Ser Ile Phe Asp Asn Phe
225 230 235

CAT TTT GGC GAT AAG GGG TTT TTC CCT AAT GGG AGC GCA GCG GTT TTT 829
His Phe Gly Asp Lys Gly Phe Phe Pro Asn Gly Ser Ala Ala Val Phe
240 245 250

AGC GCG ATG CTC GCT GTT ATT TTT GCT TTC ACT GGC ACA GAG GTG ATT 877
Ser Ala Met Leu Ala Val Ile Phe Ala Phe Thr Gly Thr Glu Val Ile
255 260 265 270

GGG GTG GCT GTG GGA GAG ACT AAA AAC GCT AGC GAA GTG ATG CCC AAA 925
Gly Val Ala Val Gly Glu Thr Lys Asn Ala Ser Glu Val Met Pro Lys
275 280 285
Glu Val Pro Asn Ile Lys Ala Ile Lys Glu Ala Ser Gly Ser Leu Lys
165 170 175

Arg Val Thr Glu Leu His Tyr Tyr Glu Lys Asp Phe Lys Ile Phe Ser
180 185 190

Gly Glu Asp Ser Leu Asn His Ser Ile Met Phe Ser Gly Gly Cys Gly
195 200 205

Val Ile Ser Val Thr Gly Asn Leu Met Pro Asn Leu Ile Ser Gln Met
210 215 220

Val Asn Cys Ala Leu Lys Gln Lys Tyr Gln Gln Ala Leu Glu Ile Gln
225 230 235 240

Asn Lys Leu Phe Cys Leu His Gln Ala Leu Phe Val Glu Thr Asn Pro
245 250 255

Ile Pro Ile Lys Met Ala Met His Leu Ala Gly Leu Ile Glu Asn Pro
260 265 270

Ser Tyr Arg Leu Pro Leu Val Ala Pro Ser Lys Glu Thr Ile Gln Leu
275 280 285

Leu Glu Lys Thr Leu Gln Gln Tyr Glu Val Ile Ala
290 295 300

(2) INFORMATION FOR SEQ ID NO:487:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1709 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 68...1624
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:487:



AAAAAACCTT AAAACGCTTA AAAAACTATC GCTTGATCAA GCTCTTTTAC TATTTAATCT 60 
TTAAAAA ATG CTT TTG ATC GTT TTT TTT AAA TTT TAT TTT CAA TAT TCA 109
Met Leu Leu Ile Val Phe Phe Lys Phe Tyr Phe Gln Tyr Ser
1 5 10

ATT AAA AAA AAA TCA TTT TAT TTT ATT TTT GTT ATA ATT CAA GCT ATT 157
Ile Lys Lys Lys Ser Phe Tyr Phe Ile Phe Val Ile Ile Gln Ala Ile
15 20 25 30

TTT ATT TTC AAT CTA AGG AGG TGT CGC ATG GAC AAT CAA AAG ATA ACG 205
Phe Ile Phe Asn Leu Arg Arg Cys Arg Met Asp Asn Gln Lys Ile Thr
35 40 45

CAT CAA AAT ATC ACG CAA AAA CAA GGC GAG CTT AAA AGA GAC ATG AAA 253
His Gln Asn Ile Thr Gln Lys Gln Gly Glu Leu Lys Arg Asp Met Lys
50 55 60
AAA CAA AAA TAC CAA CAA GCC CTA GAA ATC CAA AAT AAG CTT TTT TGT 835
Lys Gln Lys Tyr Gln Gln Ala Leu Glu Ile Gln Asn Lys Leu Phe Cys
230 235 240 245

TTG CAC CAA GCC CTT TTT GTA GAA ACA AAT CCC ATC CCT ATT AAA ATG 883
Leu His Gln Ala Leu Phe Val Glu Thr Asn Pro Ile Pro Ile Lys Met
250 255 260

GCT ATG CAT TTA GCC GGC TTG ATT GAA AAC CCA AGC TAC AGA CTG CCT 931
Ala Met His Leu Ala Gly Leu Ile Glu Asn Pro Ser Tyr Arg Leu Pro
265 270 275

TTA GTG GCC CCA AGC AAA GAA ACG ATT CAA CTT TTA GAA AAA ACT TTA 979
Leu Val Ala Pro Ser Lys Glu Thr Ile Gln Leu Leu Glu Lys Thr Leu
280 285 290

CAA CAA TAT GAG GTA ATT GCA TGAATGGTTC CAATCACATG AAAAATAAAA CCCT 1034
Gln Gln Tyr Glu Val Ile Ala
295 300

AGTGATCAGC GGCGCGACTA GAGGGATTGG CAAGGCGATA TTGTAC 1080

(2) INFORMATION FOR SEQ ID NO: 486:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 300 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:486:

Met Gln Phe His Ser Ser Ser Ala Leu Ile Thr Pro Phe Lys Lys Asp
1 5 10 15

Leu Ser Val Asp Glu Ala Ala Tyr Glu Thr Leu Ile Lys Arg Gln Ile
20 25 30

Phe Gln Gly Met Asp Ala Cys Val Pro Val Gly Thr Thr Gly Glu Ser
35 40 45

Ala Thr Leu Thr His Lys Glu His Met Arg Cys Ile Glu Ile Ala Ile
50 55 60

Glu Thr Cys Lys Asn Thr Lys Thr Pro Ser Asn Ser Arg Met Lys Val
65 70 75 80

Leu Ala Gly Val Gly Ser Asn Ala Thr Ser Glu Ser Leu Ser Leu Ala
85 90 95

Lys Phe Ala Gln Lys Ile Gly Ala Asp Ala Ile Leu Cys Val Ser Pro
100 105 110

Tyr Tyr Asn Arg Pro Thr Gln Gln Gly Leu Phe Glu His Tyr Lys Thr
115 120 125

Ile Ala Gln Ser Val Glu Ile Pro Val Met Leu Tyr Asp Val Pro Ser
130 135 140

Arg Thr Gly Val Ser Ile Glu Val Pro Thr Ala Leu Lys Leu Phe Arg
145 150 155 160
Ser Ser Ala Leu Ile Thr Pro Phe Lys Lys Asp Leu Ser Val Asp Glu
10 15 20

GCC GCT TAT GAA ACC TTG ATC AAG CGC CAA ATT TTT CAG GGC ATG GAG 211
Ala Ala Tyr Glu Thr Leu Ile Lys Arg Gln Ile Phe Gln Gly Met Asp
25 30 35

GCA TGC GTG CCT GTT GGC ACG ACA GGA GAA TCC GCC ACG CTC ACC CAC 259
Ala Cys Val Pro Val Gly Thr Thr Gly Glu Ser Ala Thr Leu Thr His
40 45 50

AAA GAG CAC ATG CGT TGC ATT GAA ATC GCC ATA GAA ACT TGC AAA AAC 307
Lys Glu His Met Arg Cys Ile Glu Ile Ala Ile Glu Thr Cys Lys Asn
55 60 65

ACT AAA ACG CCC TCA AAT TCG CGC ATG AAA GTG TTA GCC GGC GTG GGC 355
Thr Lys Thr Pro Ser Asn Ser Arg Met Lys Val Leu Ala Gly Val Gly
70 75 80 85

AGT AAC GCC ACG AGC GAG TCC CTT TCT TTA GCA AAG TTC GCT CAA AAA 403
Ser Asn Ala Thr Ser Glu Ser Leu Ser Leu Ala Lys Phe Ala Gln Lys
90 95 100

ATC GGC GCG GAT GCG ATT TTA TGC GTA AGC CCT TAT TAT AAC CGC CCC 451
Ile Gly Ala Asp Ala Ile Leu Cys Val Ser Pro Tyr Tyr Asn Arg Pro
105 110 115

ACC CAA CAA GGC TTG TTT GAA CAT TAT AAA ACC ATC GCT CAA TCG GTG 499
Thr Gln Gln Gly Leu Phe Glu His Tyr Lys Thr Ile Ala Gln Ser Val
120 125 130

GAA ATC CCT GTC ATG CTT TAT GAT GTG CCA AGT AGA ACA GGC GTG TCT 547
Glu Ile Pro Val Met Leu Tyr Asp Val Pro Ser Arg Thr Gly Val Ser
135 140 145

ATT GAA GTT CCA ACC GCC CTC AAA CTC TTT AGA GAA GTC CCT AAC ATT 595
Ile Glu Val Pro Thr Ala Leu Lys Leu Phe Arg Glu Val Pro Asn Ile
150 155 160 165

AAA GCC ATT AAA GAA GCG TCT GGC TCT TTG AAA AGG GTA ACA GAA TTG 643
Lys Ala Ile Lys Glu Ala Ser Gly Ser Leu Lys Arg Val Thr Glu Leu
170 175 180

CAT TAT TAT GAA AAA GAT TTT AAA ATT TTT AGT GGC GAA GAT TCG CTC 691
His Tyr Tyr Glu Lys Asp Phe Lys Ile Phe Ser Gly Glu Asp Ser Leu
185 190 195

AAC CAC TCC ATC ATG TTT TCA GGG GGG TGC GGC GTG ATT TCA GTG ACC 739
Asn His Ser Ile Met Phe Ser Gly Gly Cys Gly Val Ile Ser Val Thr
200 205 210

GGT AAT TTA ATG CCC AAT CTG ATT TCA CAA ATG GTC AAT TGC GCG CTC 787
Gly Asn Leu Met Pro Asn Leu Ile Ser Gln Met Val Asn Cys Ala Leu
215 220 225
(A) LENGTH: 138 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:484:

Met Lys Lys Ile Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu
1 5 10 15

Met His Val His Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe
20 25 30

Asn Lys Glu Val Ile Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys
35 40 45

Lys Asp Phe Glu Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His
50 55 60

Val His Leu Leu Ile Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu
65 70 75 80

Val Asn Ser Leu Lys Gly Val Ser Ser Arg Leu Thr Arg Gln His His
85 90 95

Phe Lys Ser Val Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro
100 105 110

Ser Tyr Phe Ala Gly Ser Cys Gly Asp Ala Pro Leu Glu Met Ile Lys
115 120 125

Gln Tyr Ile Gln Asp Gln Glu Thr Pro His
130 135



(2) INFORMATION FOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 101... 1000 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:485: 



TGGATTTAAA AGTGAGCGAT TTGGTGCGTG TGGCCAATGA ATATTTTAAA GACACCCAAT 60 
CAACCACCGT GTTTTTGAAA CCTTAAAAGA GCCTTATAAC ATG CAA TTT CAT TCA 115 

Met Gln Phe His Ser
1 5 

TCT AGC GCG TTG ATT ACG CCT TTT AAA AAA GAT TTG AGC GTT GAT GAG 163 
(B) LOCATION: 105...518
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:483: 

GTTGCAACTT ATCTTGTTGT TCTTTAGTGG GATACAAGCG GAATTTAAAA CCCTTATTGA 60 
CTTTCATAGA AAGTATTTTA ACCTCTTTTT GTTAAAATAG GTCT ATG AAA AAA ATT 116 

Met Lys Lys Ile
1

GAT GAT ATG AGA CAC GGA AGA CAT TGT GTT TTT TTA ATG CAT GTG CAT 164
Asp Asp Met Arg His Gly Arg His Cys Val Phe Leu Met His Val His
5 10 15 20

TTT GTA TTT GTT ACT AAA TAC AGG CGT TCA GCA TTC AAT AAG GAA GTG 212
Phe Val Phe Val Thr Lys Tyr Arg Arg Ser Ala Phe Asn Lys Glu Val
25 30 35

ATA GAT TTT TTA GGA TCG GTG TTT GCC AAA GTG TGT AAG GAC TTT GAG 260
Ile Asp Phe Leu Gly Ser Val Phe Ala Lys Val Cys Lys Asp Phe Glu
40 45 50

AGC GAA TTG GTA GAA TTT GAT GGG GAG AGC GAT CAT GTG CAT TTG CTT 308
Ser Glu Leu Val Glu Phe Asp Gly Glu Ser Asp His Val His Leu Leu
55 60 65

ATC AAC TAC CCT CCA AAA GTG AGC GTG AGT AAG TTA GTT AAT TCT TTA 356
Ile Asn Tyr Pro Pro Lys Val Ser Val Ser Lys Leu Val Asn Ser Leu
70 75 80

AAA GGC GTT AGC AGT CGT TTG ACT AGA CAA CAC CAT TTC AAA AGC GTT 404
Lys Gly Val Ser Ser Arg Leu Thr Arg Gln His His Phe Lys Ser Val
85 90 95 100

GAA GCT AGT TTG TGG GGG AAG CAT TTA TGG TCG CCT AGT TAT TTC GCT 452
Glu Ala Ser Leu Trp Gly Lys His Leu Trp Ser Pro Ser Tyr Phe Ala
105 110 115

GGG AGT TGT GGG GAC GCG CCT TTA GAG ATG ATT AAG CAA TAC ATA CAA 500
Gly Ser Cys Gly Asp Ala Pro Leu Glu Met Ile Lys Gln Tyr Ile Gln
120 125 130

GAT CAA GAA ACA CCG CAT TAAATTAGCT AACTTTGATT TTTAAGTAGA ACGCGCTA 556
Asp Gln Glu Thr Pro His
135 

AAAAGCGAAT GGATCTAAGT GAAACAATGT TCAAATAGCC TAACGGCTAA ACGCTTACAT 616 
CTCCGCCCTA AAGG 630 

(2) INFORMATION FOR SEQ ID NO:484:

(i) SEQUENCE CHARACTERISTICS: 
Glu Lys Phe Thr Pro Pro Ile Asn Glu Lys Glu Leu Asn Arg Gln Leu
65 70 75 80

Lys Glu Val Arg Trp Phe Asn Lys Glu Thr Pro Thr Ser Lys Asn Thr
85 90 95

Tyr Lys Lys Ile Gln Lys Leu Ala Val Tyr Lys Ser Pro Leu Ile Lys
100 105 110

Asp Tyr Leu Tyr Thr Ile Lys Lys Leu Phe Ala Thr Gln Lys Lys Ile
115 120 125

Ile Asp Leu Glu Lys Asn Tyr Lys Asp Leu Arg Ala Leu Lys Glu Glu
130 135 140

Phe Ser Lys Asp Leu Glu Thr Asp Leu Ser His Ser Lys Lys Arg Phe
145 150 155 160

Glu Leu Tyr Thr Arg Leu Lys Ser Met Ser Lys Val Phe Ile Ser Lys
165 170 175

Ser Ile Val Lys Asn Leu Glu Lys Ile Ala Leu Asp Phe Lys Ser Asp
180 185 190

Arg His Ser Ile Ser Gln Arg Ala Phe Glu Phe Phe Lys Tyr Met Asn
195 200 205

Tyr Gln Asn Leu Ser Leu Thr Asp Lys Gly Asn Met Phe Leu Val Ala
210 215 220

Lys Phe Phe Lys Asp Ser Ala Leu Leu Val Asn Ile Ala Arg Phe Glu
225 230 235 240

Met Lys Lys Ile Asp Asp Ser Val Lys Asn Ser Asn Pro Gln Asp Asn
245 250 255

Leu Leu Asp Lys Gln Val Trp Leu Asn Leu Leu Glu His Leu Lys Arg
260 265 270

Leu Glu Glu Glu Asn Tyr Cys Phe Ala Lys Lys Arg Lys Glu Phe Leu
275 280 285

Glu Thr Arg Ala Met Glu Leu Ser Lys Asp Leu Lys Phe Leu Thr Gln
290 295 300

Ala Asn Glu Asn Asp Leu Pro Ile Tyr Glu Arg Gly Gln Arg Asp Lys
305 310 315 320

Ile Ile Lys Arg Cys Glu Lys Ser Leu Asn Phe Leu Gln Lys Glu Leu
325 330 335

Gln Cys Phe Lys Thr Leu Leu Lys Ser Ala Ser Ile Ala Leu Glu Asn
340 345 350

Leu Gln Asn Asn His Gln Ile Thr Ala Val Thr Gln Asp Thr Gln Glu
355 360 365

Asn Thr Asn Ala Leu Lys Asn Thr Thr Gln Asp Phe Asn Lys Thr Thr
370 375 380

Asn Glu Pro Thr Asn Pro Asn Asn Asn Tyr Gly Met Asp Phe
385 390 395



(2) INFORMATION FOR SEQ ID NO: 483:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
CTT GAA GAG GAA AAT TAT TGT TTT GCT AAG AAA CGA AAA GAA TTC TTA 925
Leu Glu Glu Glu Asn Tyr Cys Phe Ala Lys Lys Arg Lys Glu Phe Leu
275 280 285

GAG ACT AGA GCG ATG GAG CTA TCA AAA GAT TTA AAA TTT TTA ACA CAG 973
Glu Thr Arg Ala Met Glu Leu Ser Lys Asp Leu Lys Phe Leu Thr Gln
290 295 300

GCT AAT GAA AAT GAT TTG CCC ATT TAT GAA AGA GGG CAA AGG GAT AAA 1021
Ala Asn Glu Asn Asp Leu Pro Ile Tyr Glu Arg Gly Gln Arg Asp Lys
305 310 315 320

ATC ATT AAA CGC TGT GAA AAA TCG CTT AAC TTT TTG CAG AAA GAA TTA 1069
Ile Ile Lys Arg Cys Glu Lys Ser Leu Asn Phe Leu Gln Lys Glu Leu
325 330 335

CAA TGC TTT AAA ACC TTA TTG AAA AGT GCA AGT ATA GCT TTA GAA AAC 1117
Gln Cys Phe Lys Thr Leu Leu Lys Ser Ala Ser Ile Ala Leu Glu Asn
340 345 350

TTG CAA AAT AAC CAT CAA ATC ACA GCC GTT ACA CAA GAC ACG CAA GAA 1165
Leu Gln Asn Asn His Gln Ile Thr Ala Val Thr Gln Asp Thr Gln Glu
355 360 365

AAC ACA AAC GCG CTC AAA AAT ACT ACT CAA GAT TTT AAC AAA ACT ACC 1213
Asn Thr Asn Ala Leu Lys Asn Thr Thr Gln Asp Phe Asn Lys Thr Thr
370 375 380

AAT GAA CCA ACA AAC CCT AAC AAT AAC TAT GGA ATG GAT TTT TAAAACCAT 1264
Asn Glu Pro Thr Asn Pro Asn Asn Asn Tyr Gly Met Asp Phe
385 390 395

CCATAAAATT AAATAACTTT ACTTAGCGTA TTTTTT 1300

(2) INFORMATION FOR SEQ ID NO:482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:482:

Arg His Arg Asn Arg Ala Arg Cys Thr Arg Lys Gln Ala Arg Asn Asn
1 5 10 15

Lys Ser Phe Arg His Glu Asn Tyr Phe Tyr Lys Val Leu Gly Ser Ala
20 25 30

Thr Ser Gln Ile Glu Ser Leu Lys Lys Arg Glu Asn Ala Leu Phe Asp
35 40 45

His Leu Asp Ser Leu Lys Ser Leu Leu Glu Lys Thr His Trp Glu Lys
50 55 60
His Leu Asp Ser Leu Lys Ser Leu Leu Glu Lys Thr His Trp Glu Lys
50 55 60

GAA AAA TTC ACG CCC CCA ATA AAT GAA AAA GAA CTT AAT AGG CAA CTT 301
Glu Lys Phe Thr Pro Pro Ile Asn Glu Lys Glu Leu Asn Arg Gln Leu
65 70 75 80

AAA GAA GTG AGA TGG TTC AAT AAA GAA ACT CCA ACT TCT AAA AAC ACT 349
Lys Glu Val Arg Trp Phe Asn Lys Glu Thr Pro Thr Ser Lys Asn Thr
85 90 95

TAT AAG AAA ATT CAA AAA TTA GCT GTT TAT AAA AGC CCT TTA ATA AAA 397
Tyr Lys Lys Ile Gln Lys Leu Ala Val Tyr Lys Ser Pro Leu Ile Lys
100 105 110

GAT TAT CTT TAT ACC ATT AAA AAA CTT TTT GCC ACA CAA AAA AAG ATT 445
Asp Tyr Leu Tyr Thr Ile Lys Lys Leu Phe Ala Thr Gln Lys Lys Ile
115 120 125

ATA GAT TTA GAA AAA AAT TAT AAA GAT TTA AGA GCC TTA AAG GAA GAA 493
Ile Asp Leu Glu Lys Asn Tyr Lys Asp Leu Arg Ala Leu Lys Glu Glu
130 135 140

TTT AGC AAA GAT TTA GAA ACT GAT TTA TCC CAT TCA AAA AAA CGC TTT 541
Phe Ser Lys Asp Leu Glu Thr Asp Leu Ser His Ser Lys Lys Arg Phe
145 150 155 160

GAA CTT TAC ACT AGA CTA AAG AGC ATG AGC AAA GTT TTT ATA AGC AAA 589
Glu Leu Tyr Thr Arg Leu Lys Ser Met Ser Lys Val Phe Ile Ser Lys
165 170 175

AGC ATT GTT AAA AAT TTA GAA AAA ATT GCT TTA GAT TTT AAA AGC GAT 637
Ser Ile Val Lys Asn Leu Glu Lys Ile Ala Leu Asp Phe Lys Ser Asp
180 185 190

AGA CAT AGT ATT TCG CAA AGA GCT TTT GAA TTT TTT AAG TAT ATG AAT 685
Arg His Ser Ile Ser Gln Arg Ala Phe Glu Phe Phe Lys Tyr Met Asn
195 200 205

TAT CAA AAT TTA AGC TTG ACT GAT AAA GGC AAT ATG TTT TTA GTG GCT 733
Tyr Gln Asn Leu Ser Leu Thr Asp Lys Gly Asn Met Phe Leu Val Ala
210 215 220

AAG TTT TTT AAA GAT AGT GCT TTA CTT GTT AAT ATT GCT AGG TTT GAA 781
Lys Phe Phe Lys Asp Ser Ala Leu Leu Val Asn Ile Ala Arg Phe Glu
225 230 235 240

ATG AAA AAG ATA GAT GAT AGT GTT AAA AAT TCT AAC CCA CAA GAC AAT 829
Met Lys Lys Ile Asp Asp Ser Val Lys Asn Ser Asn Pro Gln Asp Asn
245 250 255

TTA TTA GAC AAA CAA GTT TGG CTC AAT CTT TTA GAG CAT TTA AAA AGA 877
Leu Leu Asp Lys Gln Val Trp Leu Asn Leu Leu Glu His Leu Lys Arg
260 265 270
(A) LENGTH: 71 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480:

Met Gly Lys Met Lys Gln Glu Thr Ala Ile Asp Tyr Glu Lys Leu Ala
1 5 10 15

Asn His Trp Asn Asn Asn Asp Glu Asn Ser Glu Ala Leu Asn Ala Phe
20 25 30

Ala Asp Ala Tyr Leu Tyr Lys His Glu Lys Lys Ser Gln Lys Ile Arg
35 40 45

Ala Ile Glu Ile Ser Ser Leu Asn Lys Ala Cys Met Gly Glu Phe Tyr
50 55 60

His Lys Asn Pro Lys Leu Phe
65 70



(2) INFORMATION FOR SEQ ID NO: 481:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1300 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 62... 1255 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

TTGCTAACTC AAAAGGCGAA TTGCAATATT CTAACACGCC TAATATTTAT AAGGCGATTA 60 
A AGA CAT AGA AAC AGA GCT AGA TGC ACT AGA AAA CAG GCT AGA AAC AAT 109
Arg His Arg Asn Arg Ala Arg Cys Thr Arg Lys Gln Ala Arg Asn Asn
1 5 10 15

AAG AGT TTT AGG CAT GAA AAC TAT TTT TAT AAA GTT TTG GGT AGT GCA 157
Lys Ser Phe Arg His Glu Asn Tyr Phe Tyr Lys Val Leu Gly Ser Ala
20 25 30

ACT TCT CAA ATA GAA AGT TTG AAA AAA AGA GAA AAT GCC CTA TTT GAT 205
Thr Ser Gln Ile Glu Ser Leu Lys Lys Arg Glu Asn Ala Leu Phe Asp
35 40 45

CAT TTA GAT AGT CTA AAA AGT TTA TTA GAA AAA ACA CAT TGG GAA AAA 253 
Ser Leu Ala Val Phe Lys Lys Thr Leu Lys Lys Gly Leu Leu Leu Arg
385 390 395 400

Pro Leu Asn Asn Thr Ile Tyr Leu Met Pro Pro Tyr Ile Ile Thr His
405 410 415

Glu Glu Val Lys Lys Ala Val Ala Gly Leu Val Glu Ile Leu Asp Glu
420 425 430

Leu Arg Lys Gly
435

(2) INFORMATION FOR SEQ ID NO: 479:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 360 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 82... 294 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 

AAAGCTAAAC AAGAGCAAGA GAAACGCATG GCATAAGAGC CATGCGCTTG GCAGTGAAAC 60
AAAAGAAAAA AAGGAAACAA C ATG GGA AAA ATG AAA CAA GAA ACA GCG ATT 111
Met Gly Lys Met Lys Gln Glu Thr Ala Ile
1 5 10

GAC TAT GAA AAA TTA GCG AAT CAT TGG AAT AAT AAT GAT GAA AAC AGC 159
Asp Tyr Glu Lys Leu Ala Asn His Trp Asn Asn Asn Asp Glu Asn Ser
15 20 25

GAA GCA CTA AAC GCT TTT GCA GAC GCT TAC CTT TAT AAA CAT GAG AAA 207
Glu Ala Leu Asn Ala Phe Ala Asp Ala Tyr Leu Tyr Lys His Glu Lys
30 35 40

AAG AGT CAA AAG ATT CGG GCA ATA GAG ATA AGT TCT CTA AAC AAA GCC 255
Lys Ser Gln Lys Ile Arg Ala Ile Glu Ile Ser Ser Leu Asn Lys Ala
45 50 55

TGC ATG GGA GAA TTT TAC CAC AAA AAC CCA AAA TTA TTT TAATAACGAT CG 306
Cys Met Gly Glu Phe Tyr His Lys Asn Pro Lys Leu Phe
60 65 70

CTCCAAGGAA CCAACGCCCC ATGACCTCAA GAAAAGAGAA TAGCTTGAAT CGGT 360

(2) INFORMATION FOR SEQ ID NO: 480:

(i) SEQUENCE CHARACTERISTICS: 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478:

Met Asn Phe Gln Glu Asn Leu Ala Ala Leu Asp Leu Glu Tyr Leu Trp
1 5 10 15

His Pro Cys Ser Gln Met Gln Glu His Gln Asn Phe Pro Ile Ile Pro
20 25 30

Ile Lys Lys Ala Gln Gly Ile Tyr Leu Tyr Asp Phe Asn Asp Asn Ala
35 40 45

Tyr Met Asp Leu Ile Ser Ser Trp Trp Val Asn Leu Phe Gly His Asn
50 55 60

Asn Ala Tyr Ile Ser Gln Gln Leu Lys Asn Gln Ile Asp Asp Leu Glu
65 70 75 80

His Val Leu Leu Ala Ser Phe Ser His Lys Pro Ile Ile Thr Leu Ser
85 90 95

Gln Arg Leu Cys Gln Leu Thr His Met Asp Lys Cys Phe Tyr Ala Asp
100 105 110

Asn Gly Ser Ser Cys Val Glu Ile Ala Leu Lys Met Ser Tyr His Ala
115 120 125

His Phe Leu Lys Asn Gln Thr Arg Arg Lys Lys Leu Phe Leu Ser Leu
130 135 140

Ser Asn Ser Tyr His Gly Glu Thr Leu Gly Ala Leu Ser Val Gly Asp
145 150 155 160

Val Lys Leu Tyr Lys Asp Thr Tyr Thr Pro Leu Leu Leu Lys Asn Leu
165 170 175

Thr Thr Pro Val Pro Lys Asn Asp His Glu Ile Glu Asn Ser Leu Asn
180 185 190

Ala Leu Lys Arg Leu Leu Asp Lys His Ser Glu Glu Ile Cys Ala Phe
195 200 205

Ile Ala Glu Pro Leu Leu Gln Cys Ala Gly Asn Met His Ile Tyr Ser
210 215 220

Ala Arg Tyr Leu Lys Gln Ala Val Leu Leu Cys Lys Gln Lys Asn Ile
225 230 235 240

His Ile Ile Phe Asp Glu Ile Ala Thr Gly Phe Gly Arg Thr Gly Ser
245 250 255

Met Phe Ala Tyr Glu Gln Cys Glu Ile Lys Pro Asp Phe Leu Cys Leu
260 265 270

Ser Lys Gly Ile Ser Gly Gly Tyr Leu Pro Leu Ser Ala Leu Leu Thr
275 280 285

His Asn Glu Ile Tyr Asn Gln Phe Tyr Ala Pro Tyr Glu Glu Asn Lys
290 295 300

Ala Phe Leu His Ser His Ser Tyr Thr Gly Asn Ala Leu Ala Cys Ala
305 310 315 320

Cys Ala Asn Ala Thr Leu Asp Ile Phe Glu Lys Glu Asn Val Ile Glu
325 330 335

Lys Asn Lys Ala Leu Ser Gly Phe Ile Phe Asn Thr Leu Gln Asn Ala
340 345 350

Leu Lys Pro Leu Met Glu Gln Gln Val Val Ser Asp Leu Arg His Leu
355 360 365

Gly Met Val Phe Ala Phe Glu Val Phe Ile Gln Thr Lys Glu Arg Leu
370 375 380



WO 98/43478

PCT/US98/06371



Thr Gly Ser Met Phe Ala Tyr Glu Gln Cys Glu Ile Lys Pro Asp Phe
255 260 265

TTA TGC TTG TCT AAG GGG ATT AGT GGG GGG TAT TTG CCT TTA AGC GCA 926
Leu Cys Leu Ser Lys Gly Ile Ser Gly Gly Tyr Leu Pro Leu Ser Ala
270 275 280 285

CTA TTA ACC CAT AAT GAA ATC TAT AAC CAA TTT TAC GCC CCC TAT GAA 974
Leu Leu Thr His Asn Glu Ile Tyr Asn Gln Phe Tyr Ala Pro Tyr Glu
290 295 300

GAA AAT AAA GCG TTT TTG CAT TCG CAC AGC TAC ACA GGA AAC GCT TTG 1022
Glu Asn Lys Ala Phe Leu His Ser His Ser Tyr Thr Gly Asn Ala Leu
305 310 315

GCA TGC GCA TGC GCG AAC GCT ACG CTG GAT ATT TTT GAA AAA GAA AAT 1070
Ala Cys Ala Cys Ala Asn Ala Thr Leu Asp Ile Phe Glu Lys Glu Asn
320 325 330

GTT ATT GAA AAG AAC AAG GCT TTA AGC GGG TTT ATT TTT AAT ACG CTC 1118
Val Ile Glu Lys Asn Lys Ala Leu Ser Gly Phe Ile Phe Asn Thr Leu
335 340 345

CAA AAC GCA TTA AAA CCC TTG ATG GAG CAA CAA GTG GTG TCT GAT TTA 1166
Gln Asn Ala Leu Lys Pro Leu Met Glu Gln Gln Val Val Ser Asp Leu
350 355 360 365

AGG CAT TTG GGC ATG GTC TTT GCC TTT GAA GTC TTT ATT CAA ACC AAA 1214
Arg His Leu Gly Met Val Phe Ala Phe Glu Val Phe Ile Gln Thr Lys
370 375 380

GAG CGT TTG AGT TTG GCG GTT TTT AAA AAA ACT CTA AAA AAA GGC CTG 1262
Glu Arg Leu Ser Leu Ala Val Phe Lys Lys Thr Leu Lys Lys Gly Leu
385 390 395

TTA TTA CGC CCT TTA AAC AAC ACC ATT TAC CTC ATG CCC CCT TAC ATT 1310
Leu Leu Arg Pro Leu Asn Asn Thr Ile Tyr Leu Met Pro Pro Tyr Ile
400 405 410

ATC ACG CAT GAA GAA GTC AAA AAG GCG GTT GCG GGG CTA GTG GAA ATT 1358
Ile Thr His Glu Glu Val Lys Lys Ala Val Ala Gly Leu Val Glu Ile
415 420 425

CTT GAT GAG TTA AGA AAA GGC TGAAAGCGTT TTTAAAAATA AAATAGTAAA AAGC 1413
Leu Asp Glu Leu Arg Lys Gly
430 435

TTGAATTGGA TCAAGCGATA GCTTTTT 1440

(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
30 35 40 45

GAT AAC GCT TAC ATG GAT TTG ATC AGC TCA TGG TGG GTG AAT CTT TTT 254
Asp Asn Ala Tyr Met Asp Leu Ile Ser Ser Trp Trp Val Asn Leu Phe
50 55 60

GGG CAT AAT AAC GCC TAC ATC AGC CAG CAA CTC AAA AAT CAA ATT GAT 302
Gly His Asn Asn Ala Tyr Ile Ser Gln Gln Leu Lys Asn Gln Ile Asp
65 70 75

GAT TTA GAG CAT GTC CTT TTG GCT TCT TTT AGC CAT AAG CCC ATT ATC 350
Asp Leu Glu His Val Leu Leu Ala Ser Phe Ser His Lys Pro Ile Ile
80 85 90

ACG CTC TCT CAA AGG CTT TGC CAG CTC ACT CAT ATG GAT AAA TGC TTT 398
Thr Leu Ser Gln Arg Leu Cys Gln Leu Thr His Met Asp Lys Cys Phe
95 100 105

TAT GCG GAT AAC GGC TCA TCT TGT GTT GAA ATC GCT TTG AAA ATG AGC 446
Tyr Ala Asp Asn Gly Ser Ser Cys Val Glu Ile Ala Leu Lys Met Ser
110 115 120 125

TAT CAC GCC CAT TTT TTA AAG AAT CAA ACG CGC CGC AAA AAG CTT TTT 494
Tyr His Ala His Phe Leu Lys Asn Gln Thr Arg Arg Lys Lys Leu Phe
130 135 140

TTA TCG CTC TCT AAT TCC TAT CAT GGC GAG ACT TTG GGA GCG TTA AGC 542
Leu Ser Leu Ser Asn Ser Tyr His Gly Glu Thr Leu Gly Ala Leu Ser
145 150 155

GTG GGC GAT GTG AAA CTT TAT AAA GAC ACT TAC ACC CCT TTA TTG CTC 590
Val Gly Asp Val Lys Leu Tyr Lys Asp Thr Tyr Thr Pro Leu Leu Leu
160 165 170

AAA AAT CTC ACC ACA CCT GTG CCT AAA AAC GAC CAT GAA ATA GAA AAT 638
Lys Asn Leu Thr Thr Pro Val Pro Lys Asn Asp His Glu Ile Glu Asn
175 180 185

AGT TTG AAC GCT TTA AAG CGT TTG TTA GAC AAG CAT AGT GAA GAA ATT 686
Ser Leu Asn Ala Leu Lys Arg Leu Leu Asp Lys His Ser Glu Glu Ile
190 195 200 205

TGC GCT TTC ATT GCA GAG CCT CTT TTG CAA TGC GCA GGG AAT ATG CAT 734
Cys Ala Phe Ile Ala Glu Pro Leu Leu Gln Cys Ala Gly Asn Met His
210 215 220

ATT TAT AGC GCA AGA TAT TTA AAA CAA GCC GTT TTA TTG TGC AAG CAA 782
Ile Tyr Ser Ala Arg Tyr Leu Lys Gln Ala Val Leu Leu Cys Lys Gln
225 230 235

AAA AAC ATC CAC ATT ATT TTT GAT GAA ATC GCT ACC GGG TTT GGG CGC 830
Lys Asn Ile His Ile Ile Phe Asp Glu Ile Ala Thr Gly Phe Gly Arg
240 245 250

ACA GGG AGC ATG TTT GCT TAT GAA CAA TGC GAA ATT AAG CCG GAT TTT 878






305 310 315 320 

Tyr Ala His Val Thr Phe Phe Ile Asn Gly Gly Val Glu Thr Pro Phe
325 330 335

Lys Asn Glu Asn Arg Val Leu Ile Gln Ser Pro Lys Val Thr Thr Tyr
340 345 350

Asp Leu Lys Pro Glu Met Ser Ala Lys Glu Val Thr Leu Ala Val Leu
355 360 365

Glu Gln Met Lys Leu Gly Thr Asp Leu Ile Ile Val Asn Phe Ala Asn
370 375 380

Gly Asp Met Val Gly His Thr Gly Asn Phe Glu Ala Ser Val Lys Ala
385 390 395 400

Val Glu Ala Val Asp Ala Cys Leu Gly Glu Ile Leu Ser Leu Ala Lys
405 410 415

Lys Leu Asp Tyr Ala Met Leu Leu Thr Ser Asp His Gly Asn Cys Glu
420 425 430

Arg Met Lys Asp Glu Asn Gln Asn Pro Leu Thr Asn His Thr Ala Gly
435 440 445

Ser Val Tyr Cys Phe Val Leu Gly Asp Gly Val Lys Ser Ile Lys Asn
450 455 460

Gly Ala Leu Asn Asn Ile Ala Ser Ser Val Leu Lys Leu Met Gly Leu
465 470 475 480

Lys Ala Pro Ala Thr Met Asp Glu Pro Leu Phe
485 490

(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 72...1379
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 

ATAGTAAAAT CAAATAACCT TATTTTAACC AAAGGTTATT AAAATTATCC TTATTATAGA 60 
GAGTTTTTAA C ATG AAT TTT CAA GAA AAT TTA GCC GCT TTG GAT TTG GAG 110 
Met Asn Phe Gln Glu Asn Leu Ala Ala Leu Asp Leu Glu
1 5 10

TAT CTT TGG CAC CCT TGT TCG CAA ATG CAA GAG CAT CAA AAT TTC CCC 158
Tyr Leu Trp His Pro Cys Ser Gln Met Gln Glu His Gln Asn Phe Pro
15 20 25

ATT ATC CCC ATT AAA AAG GCT CAA GGG ATT TAC CTC TAT GAT TTT AAT 206
Ile Ile Pro Ile Lys Lys Ala Gln Gly Ile Tyr Leu Tyr Asp Phe Asn




AATGCAAATT GATGACGCAT TATTGCAACG CTTGGAAAAA TTGAGCATGC TAGAGATTAA 1614 
AGATGA 1620 

(2) INFORMATION FOR SEQ ID NO: 476:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 491 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

Met Ala Gln Lys Thr Leu Leu Ile Ile Thr Asp Gly Ile Gly Tyr Arg
1 5 10 15

Lys Asp Ser Asp His Asn Ala Phe Phe His Ala Lys Lys Pro Thr Tyr
20 25 30

Asp Leu Met Phe Lys Thr Leu Pro Tyr Ser Leu Ile Asp Thr His Gly
35 40 45

Leu Ser Val Gly Leu Pro Lys Gly Gln Met Gly Asn Ser Glu Val Gly
50 55 60

His Met Cys Ile Gly Ala Gly Arg Val Leu Tyr Gln Asp Leu Val Lys
65 70 75 80

Ile Ser Leu Ser Leu Gln Asn Asp Glu Leu Lys Asn Asn Pro Ala Phe
85 90 95

Leu Asn Thr Ile Gln Lys Ser Pro Val Val His Leu Met Gly Leu Met
100 105 110

Ser Asp Gly Gly Val His Ser His Ile Glu His Phe Ile Ala Leu Ala
115 120 125

Leu Glu Cys Glu Lys Ser His Lys Lys Val Cys Leu His Leu Ile Thr
130 135 140

Asp Gly Arg Asp Val Ala Pro Lys Ser Ala Leu Thr Tyr Leu Lys Gln
145 150 155 160

Met Gln Asn Ile Cys Asn Glu Ser Ile Gln Ile Ala Thr Ile Ser Gly
165 170 175

Arg Phe Tyr Ala Met Asp Arg Asp Lys Arg Phe Glu Arg Ile Glu Leu
180 185 190

Ala Tyr His Ser Leu Met Gly Leu Asn His Thr Pro Leu Ser Pro Ser
195 200 205

Glu Tyr Ile Gln Ser Gln Tyr Asp Lys Asn Ile Thr Asp Glu Phe Ile
210 215 220

Met Pro Ala Cys Phe Lys Asn Tyr Cys Gly Met Gln Asp Asp Glu Ser
225 230 235 240

Phe Ile Phe Ile Asn Phe Arg Asn Asp Arg Ala Arg Glu Ile Val Ser
245 250 255

Ala Leu Gly Gln Lys Gln Phe Ser Gly Phe Lys Arg Gln Val Phe Lys
260 265 270

Lys Leu His Ile Ala Thr Met Thr Pro Tyr Asp Asn Thr Phe Pro Tyr
275 280 285

Pro Val Leu Phe Pro Lys Glu Ser Val Gln Asn Thr Leu Ala Glu Val
290 295 300

Val Ser Gln His Asn Leu Thr Gln Ser His Ile Ala Glu Thr Glu Lys
Lys Lys Leu His Ile Ala Thr Met Thr Pro Tyr Asp Asn Thr Phe Pro
275 280 285

TAC CCT GTT TTA TTC CCC AAA GAA AGC GTT CAA AAC ACG CTC GCT GAA 974
Tyr Pro Val Leu Phe Pro Lys Glu Ser Val Gln Asn Thr Leu Ala Glu
290 295 300

GTG GTC TCT CAA CAC AAC CTG ACC CAA AGC CAT ATC GCT GAA ACT GAA 1022
Val Val Ser Gln His Asn Leu Thr Gln Ser His Ile Ala Glu Thr Glu
305 310 315

AAA TAC GCG CAT GTA ACC TTT TTC ATC AAT GGC GGA GTG GAG ACG CCT 1070
Lys Tyr Ala His Val Thr Phe Phe Ile Asn Gly Gly Val Glu Thr Pro
320 325 330 335

TTT AAA AAT GAA AAC CGG GTG CTT ATC CAA AGC CCT AAA GTT ACC ACT 1118
Phe Lys Asn Glu Asn Arg Val Leu Ile Gln Ser Pro Lys Val Thr Thr
340 345 350

TAT GAC TTA AAG CCT GAA ATG AGC GCT AAA GAA GTA ACC CTT GCG GTG 1166
Tyr Asp Leu Lys Pro Glu Met Ser Ala Lys Glu Val Thr Leu Ala Val
355 360 365

TTA GAG CAA ATG AAA CTA GGC ACG GAT TTG ATC ATT GTG AAT TTT GCT 1214
Leu Glu Gln Met Lys Leu Gly Thr Asp Leu Ile Ile Val Asn Phe Ala
370 375 380

AAT GGC GAT ATG GTA GGG CAT ACG GGG AAT TTT GAA GCG AGC GTC AAA 1262
Asn Gly Asp Met Val Gly His Thr Gly Asn Phe Glu Ala Ser Val Lys
385 390 395

GCG GTG GAA GCA GTG GAT GCA TGT TTA GGG GAA ATC CTT TCA CTG GCT 1310
Ala Val Glu Ala Val Asp Ala Cys Leu Gly Glu Ile Leu Ser Leu Ala
400 405 410 415

AAA AAA TTG GAT TAC GCC ATG CTT TTA ACC AGC GAT CAT GGG AAT TGC 1358
Lys Lys Leu Asp Tyr Ala Met Leu Leu Thr Ser Asp His Gly Asn Cys
420 425 430

GAG CGC ATG AAA GAC GAA AAC CAA AAC CCC TTA ACC AAC CAC ACC GCC 1406
Glu Arg Met Lys Asp Glu Asn Gln Asn Pro Leu Thr Asn His Thr Ala
435 440 445

GGG AGC GTG TAT TGC TTT GTT TTA GGG GAT GGA GTC AAA TCC ATA AAA 1454
Gly Ser Val Tyr Cys Phe Val Leu Gly Asp Gly Val Lys Ser Ile Lys
450 455 460

AAC GGA GCC TTA AAC AAT ATC GCT AGC AGC GTG TTA AAA CTC ATG GGC 1502
Asn Gly Ala Leu Asn Asn Ile Ala Ser Ser Val Leu Lys Leu Met Gly
465 470 475

CTT AAA GCC CCA GCA ACG ATG GAC GAA CCC CTA TTT TAAACTAAAG GAAAAG 1554
Leu Lys Ala Pro Ala Thr Met Asp Glu Pro Leu Phe
480 485 490
50 55 60

GGG CAT ATG TGT ATT GGG GCT GGT AGG GTG CTC TAT CAG GAT TTA GTC 302
Gly His Met Cys Ile Gly Ala Gly Arg Val Leu Tyr Gln Asp Leu Val
65 70 75

AAA ATT TCT TTA AGC CTT CAA AAC GAT GAA TTA AAA AAC AAC CCC GCT 350
Lys Ile Ser Leu Ser Leu Gln Asn Asp Glu Leu Lys Asn Asn Pro Ala
80 85 90 95

TTT TTA AAC ACG ATC CAA AAA AGC CCT GTG GTG CAT CTT ATG GGT TTA 398
Phe Leu Asn Thr Ile Gln Lys Ser Pro Val Val His Leu Met Gly Leu
100 105 110

ATG AGC GAT GGA GGC GTG CAT TCA CAC ATT GAG CAT TTT ATC GCT CTG 446
Met Ser Asp Gly Gly Val His Ser His Ile Glu His Phe Ile Ala Leu
115 120 125

GCT TTA GAG TGT GAA AAA TCC CAT AAA AAA GTC TGT CTG CAT TTA ATC 494
Ala Leu Glu Cys Glu Lys Ser His Lys Lys Val Cys Leu His Leu Ile
130 135 140

ACC GAT GGG CGC GAT GTC GCT CCT AAA AGC GCT TTA ACT TAT TTA AAA 542
Thr Asp Gly Arg Asp Val Ala Pro Lys Ser Ala Leu Thr Tyr Leu Lys
145 150 155

CAA ATG CAA AAT ATC TGC AAT GAA AGC ATT CAA ATC GCT ACC ATA AGC 590
Gln Met Gln Asn Ile Cys Asn Glu Ser Ile Gln Ile Ala Thr Ile Ser
160 165 170 175

GGT CGT TTT TAT GCC ATG GAT AGG GAT AAG CGC TTT GAA AGG ATT GAG 638
Gly Arg Phe Tyr Ala Met Asp Arg Asp Lys Arg Phe Glu Arg Ile Glu
180 185 190

CTT GCG TAT CAT AGC TTA ATG GGG CTT AAT CAC ACG CCT TTA AGC CCT 686
Leu Ala Tyr His Ser Leu Met Gly Leu Asn His Thr Pro Leu Ser Pro
195 200 205

AGC GAG TAT ATC CAA AGC CAG TAT GAT AAA AAT ATC ACC GAT GAA TTT 734
Ser Glu Tyr Ile Gln Ser Gln Tyr Asp Lys Asn Ile Thr Asp Glu Phe
210 215 220

ATC ATG CCC GCT TGT TTT AAA AAT TAT TGC GGC ATG CAA GAT GAT GAG 782
Ile Met Pro Ala Cys Phe Lys Asn Tyr Cys Gly Met Gln Asp Asp Glu
225 230 235

AGT TTT ATT TTT ATC AAT TTC AGG AAT GAT AGG GCT AGA GAA ATC GTG 830
Ser Phe Ile Phe Ile Asn Phe Arg Asn Asp Arg Ala Arg Glu Ile Val
240 245 250 255

AGC GCT TTA GGC CAA AAA CAA TTC AGT GGC TTT AAG CGC CAA GTT TTT 878
Ser Ala Leu Gly Gln Lys Gln Phe Ser Gly Phe Lys Arg Gln Val Phe
260 265 270

AAA AAA CTC CAT ATC GCT ACC ATG ACG CCT TAT GAT AAC ACT TTC CCC 926
145 150 155 160

Gly Leu Glu Arg Leu Ala Met Tyr Val Gln Lys Val Glu Asn Ile Leu
165 170 175

Glu Ile Glu Trp Ala Lys Lys Asn His Asp Ser Val Asn Tyr Ala Gln
180 185 190

Val His Leu Glu Ser Glu Tyr Glu Phe Ser Lys Tyr His Phe Glu Thr
195 200 205

Ala Ser Val Lys Arg Leu Leu Glu Met Phe Lys Asn Ala Gln Ala Glu
210 215 220

Ala Leu His Cys Leu Glu Asn Lys Leu Pro Leu Pro Ala Tyr Asp Phe
225 230 235 240

Val Met Leu Cys Ser His Phe Phe Asn Ile Leu Asp Ala Arg Lys Ala
245 250 255

Ile Ser Val Ala Glu Arg Gln Asn Tyr Ile Leu Gln Ile Arg Asp Leu
260 265 270

Ala Lys Gly Cys Ala Leu Leu Tyr Lys Glu Gln Glu Glu Glu Arg Glu
275 280 285

Glu Arg Leu Lys Asn Ala Leu Thr Lys Ala Glu Asn Gly Val Ser
290 295 300



(2) INFORMATION FOR SEQ ID NO:475:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1620 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...1538
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:475:

GGCAAAGGAA TGCGATTTTA GCTAACCACA TCACTCTCAT GCAAGAGCTT TAAAAAGTCC 60
TAAAA ATG GCG CAA AAA ACT CTT TTG ATT ATC ACT GAT GGC ATT GGG TAT 110
Met Ala Gln Lys Thr Leu Leu Ile Ile Thr Asp Gly Ile Gly Tyr
1 5 10 15

CGT AAA GAT AGC GAT CAT AAC GCT TTC TTC CAT GCC AAA AAA CCC ACT 158
Arg Lys Asp Ser Asp His Asn Ala Phe Phe His Ala Lys Lys Pro Thr
20 25 30

TAT GAT TTG ATG TTT AAA ACC TTG CCT TAT AGC CTG ATT GAT ACG CAT 206
Tyr Asp Leu Met Phe Lys Thr Leu Pro Tyr Ser Leu Ile Asp Thr His
35 40 45

GGC TTG AGC GTG GGC TTA CCT AAG GGG CAA ATG GGA AAT TCT GAA GTG 254
Gly Leu Ser Val Gly Leu Pro Lys Gly Gln Met Gly Asn Ser Glu Val
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CCC TTG CCG GCT TAT GAT TTT GTG ATG TTA TGC TCG CAT TTT TTC AAT 83 2 

Pro Leu Pro Ala Tyr Asp Phe Val Met Leu Cys Ser His Phe Phe Asn
235 240 245

ATT TTA GAC GCC AGA AAA GCG ATT TCG GTG GCT GAA AGG CAA AAT TAT 880
Ile Leu Asp Ala Arg Lys Ala Ile Ser Val Ala Glu Arg Gln Asn Tyr
250 255 260 265

ATT TTA CAA ATC AGG GAT TTA GCC AAA GGG TGT GCG CTT CTT TAT AAA 928
Ile Leu Gln Ile Arg Asp Leu Ala Lys Gly Cys Ala Leu Leu Tyr Lys
270 275 280

GAA CAA GAA GAA GAG AGG GAA GAG CGT TTA AAA AAC GCT TTA ACA AAG 976
Glu Gln Glu Glu Glu Arg Glu Glu Arg Leu Lys Asn Ala Leu Thr Lys
285 290 295

GCT GAA AAT GGC GTT AGT TAAGGAAGTG TTGGTAGTTT TGAATCGCCT TTCGCCTT 1032 
Ala Glu Asn Gly Val Ser 
300 

TTGAACTCCA AGAATCATGG GATAATAGCG GGTTGAATGT GGGGAGTGAA AATAGTGAAT 1092 
TTAGCGAGAT TG 1104 

(2) INFORMATION FOR SEQ ID NO:474:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 303 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:474:

Met Gln Asp Phe Ser Ser Leu Leu Leu Lys Leu Gln Glu Tyr Trp Lys
1 5 10 15

Asn Gln Gly Cys Leu Val Ile Gln Pro Tyr Asp Ile Pro Ala Gly Ala
20 25 30

Gly Thr Phe His Pro Ala Thr Leu Leu Arg Ser Leu Asp Lys Lys Pro
35 40 45

Trp Asn Val Ala Tyr Val Ala Pro Ser Arg Arg Pro Thr Asp Gly Arg
50 55 60

Tyr Gly Glu Asn Pro Asn Arg Leu Gly Ser Tyr Tyr Gln Phe Gln Val
65 70 75 80

Val Ile Lys Pro Ser Pro Ser Asn Ile Gln Glu Leu Tyr Leu Lys Ser
85 90 95

Leu Glu Val Leu Gly Ile Asn Leu Asn Glu His Asp Ile Arg Phe Val
100 105 110

Glu Asp Asn Trp Glu Ser Pro Thr Leu Gly Ala Trp Gly Leu Gly Trp
115 120 125

Glu Val Trp Leu Asp Gly Met Glu Val Thr Gln Phe Thr Tyr Phe Gln
130 135 140

Gln Val Gly Gly Ile Ala Cys Ser Pro Ile Pro Val Glu Ile Thr Tyr
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AAA CTA CAA GAG TAT TGG AAG J^T CAA GGC TGT TTG GTG ATC GAG CCT 160 
Lys Leu Gln Glu Tyr Trp Lys Asn Gln Gly Cys Leu Val Ile Gln Pro
10 15 20 25

TAT GAT ATT CCT GCA GGA GCT GGG ACA TTC CAT CCG GCC ACG CTT TTA 208
Tyr Asp Ile Pro Ala Gly Ala Gly Thr Phe His Pro Ala Thr Leu Leu
30 35 40

AGG AGT TTG GAT AAA AAG CCG TGG AAT GTG GCG TAT GTC GCG CCC TCT 256
Arg Ser Leu Asp Lys Lys Pro Trp Asn Val Ala Tyr Val Ala Pro Ser
45 50 55

AGA AGG CCT ACT GAT GGG CGC TAT GGG GAA AAC CCT AAC CGC TTG GGG 304
Arg Arg Pro Thr Asp Gly Arg Tyr Gly Glu Asn Pro Asn Arg Leu Gly
60 65 70

AGT TAT TAC CAA TTC CAA GTA GTC ATC AAG CCC AGC CCT TCT AAT ATC 352
Ser Tyr Tyr Gln Phe Gln Val Val Ile Lys Pro Ser Pro Ser Asn Ile
75 80 85

CAG GAA CTC TAT TTA AAA AGC TTA GAA GTG TTA GGG ATA AAC CTT AAT 400
Gln Glu Leu Tyr Leu Lys Ser Leu Glu Val Leu Gly Ile Asn Leu Asn
90 95 100 105

GAG CAT GAT ATA CGA TTT GTA GAA GAC AAT TGG GAG AGT CCG ACT TTA 448
Glu His Asp Ile Arg Phe Val Glu Asp Asn Trp Glu Ser Pro Thr Leu
110 115 120

GGG GCA TGG GGG CTT GGC TGG GAA GTG TGG CTT GAT GGC ATG GAA GTT 496
Gly Ala Trp Gly Leu Gly Trp Glu Val Trp Leu Asp Gly Met Glu Val
125 130 135

ACG CAA TTC ACT TAT TTC CAG CAA GTG GGG GGC ATT GCT TGT AGC CCT 544
Thr Gln Phe Thr Tyr Phe Gln Gln Val Gly Gly Ile Ala Cys Ser Pro
140 145 150



ATT CCT GTA GAG ATC ACT TAC GGC TTA GAA AGA TTA GCG ATG TAT GTG 592
Ile Pro Val Glu Ile Thr Tyr Gly Leu Glu Arg Leu Ala Met Tyr Val
155 160 165

CAA AAA GTG GAA AAT ATC CTA GAG ATT GAA TGG GCT AAA AAA AAT CAT 640
Gln Lys Val Glu Asn Ile Leu Glu Ile Glu Trp Ala Lys Lys Asn His
170 175 180 185

GAC AGC GTG AAT TAC GCA CAA GTG CAT TTG GAA AGC GAA TAC GAA TTC 688
Asp Ser Val Asn Tyr Ala Gln Val His Leu Glu Ser Glu Tyr Glu Phe
190 195 200

AGC AAG TAT CAT TTT GAA ACA GCG AGC GTG AAA CGG CTA TTA GAA ATG 736
Ser Lys Tyr His Phe Glu Thr Ala Ser Val Lys Arg Leu Leu Glu Met
205 210 215

TTT AAA AAC GCT CAA GCC GAA GCC TTG CAT TGC TTG GAA AAC AAG CTC 784
Phe Lys Asn Ala Gln Ala Glu Ala Leu His Cys Leu Glu Asn Lys Leu
220 225 230
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50 55 60 

Trp Ala Glu Leu Gly Ile Val Leu Gly Ala Arg Ile Gly Tyr Ile Leu
65 70 75 80

Ile Tyr Glu Pro Asn Ser Gly Tyr Tyr Leu Thr His Phe Trp Gln Ile
85 90 95

Phe Asn Pro Phe Asp Ser His Gly Asn Phe Val Gly Ile Arg Gly Met
100 105 110

Ser Tyr His Gly Gly Leu Val Gly Phe Leu Ile Ala Ser Tyr Leu Tyr
115 120 125

Ser Arg Lys Asp Leu Lys Lys Leu Leu Ile Tyr Leu Asp Leu Ile Ala
130 135 140

Ile Ser Leu Pro Leu Gly Tyr Val Phe Gly Arg Ile Gly Asn Phe Leu
145 150 155 160

Asn Gln Glu Leu Val Gly Arg Ile Val Pro Lys Asp Ser His Leu Gly
165 170 175

Gln Ile Ile Gly Ile Met Val Asp Asn Glu Leu Arg Tyr Pro Ser Gln
180 185 190

Leu Ile Glu Ala Phe Leu Glu Gly Val Ile Val Phe Leu Met Val Met
195 200 205

Trp Ala Lys Lys His Thr Lys Thr His Gly Leu Leu Ile Val Val Tyr
210 215 220

Gly Leu Gly Tyr Ser Leu Met Arg Phe Ile Ala Glu Phe Tyr Arg Glu
225 230 235 240

Pro Asp Ser Gln Met Gly Val Tyr Phe Leu Asn Leu Ser Met Gly Gln
245 250 255

Ile Leu Ser Leu Phe Met Val Ile Val Ser Leu Gly Ile Leu Leu Tyr
260 265 270

Ala Thr Lys Asn Ser Lys Lys Ile Lys Glu Asn Gln
275 280

(2) INFORMATION FOR SEQ ID NO:473:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1104 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 86...994
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:473:

AGCGAATTAG CCTTGCTTTT AAAGGGTAAG AGTGTGCTAG AGAGCATGAA CGATTTGATC 60
AGACGCGCTT AAAAGGAAAG AGAGC ATG CAA GAT TTT TCA AGT TTA TTA TTA 112
Met Gln Asp Phe Ser Ser Leu Leu Leu
1 5
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GGG AAT TTT TTA AAC CAG GAG CTT GTG GGA AGA ATT GTC CCC AAA GAC 591 
Gly Asn Phe Leu Asn Gln Glu Leu Val Gly Arg Ile Val Pro Lys Asp
160 165 170

AGC CAT TTA GGG CAA ATC ATA GGC ATT ATG GTG GAT AAT GAG TTG CGT 639
Ser His Leu Gly Gln Ile Ile Gly Ile Met Val Asp Asn Glu Leu Arg
175 180 185

TAT CCC AGC CAA TTG ATT GAA GCG TTT TTA GAG GGG GTT ATC GTG TTT 687
Tyr Pro Ser Gln Leu Ile Glu Ala Phe Leu Glu Gly Val Ile Val Phe
190 195 200

TTA ATG GTA ATG TGG GCT AAA AAA CAC ACC AAA ACG CAT GGG TTG CTG 735
Leu Met Val Met Trp Ala Lys Lys His Thr Lys Thr His Gly Leu Leu
205 210 215 220

ATT GTG GTT TAT GGT TTG GGG TAT TCC TTG ATG CGC TTT ATT GCG GAA 783
Ile Val Val Tyr Gly Leu Gly Tyr Ser Leu Met Arg Phe Ile Ala Glu
225 230 235

TTT TAC AGA GAG CCG GAC AGC CAA ATG GGG GTT TAT TTT TTA AAT TTG 831
Phe Tyr Arg Glu Pro Asp Ser Gln Met Gly Val Tyr Phe Leu Asn Leu
240 245 250

AGC ATG GGG CAG ATT TTA AGC TTA TTT ATG GTA ATT GTT TCG TTA GGG 879
Ser Met Gly Gln Ile Leu Ser Leu Phe Met Val Ile Val Ser Leu Gly
255 260 265

ATT TTA TTG TAT GCT ACA AAA AAT TCT AAA AAA ATA AAG GAA AAT CAA T 928
Ile Leu Leu Tyr Ala Thr Lys Asn Ser Lys Lys Ile Lys Glu Asn Gln
270 275 280

GAAATTTTTG GATCAAGAAA AAAGAAGACA ATTATTAAAC GAGCGCCATT CTTGCAAGAT 988 
GTTTGATAGC C 999 

(2) INFORMATION FOR SEQ ID NO:472:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 284 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:472:

Met Asn Ala Trp Asn Thr Ile Tyr Asp Gln Phe Asn Pro Ile Ala Phe
1 5 10 15

Ser Leu Gly Ser Ile Glu Val His Trp Tyr Gly Leu Ala Tyr Ala Cys
20 25 30

Ala Ile Val Thr Ala Phe Tyr Met Ala Leu Arg Met Ile Gln Lys Asp
35 40 45

Pro Lys Arg Phe Pro Ile Glu Arg Lys Glu Phe Glu Ser Tyr Phe Leu
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...927
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:471:

GCACATAAAA TCGCGCTACT AGGGTATGAA TTTGAAGCGA TCGCTCCTAA AGAATTTGAA 60
ATTTAAGGAT TGATC ATG AAC GCT TGG AAT ACG ATT TAT GAT CAA TTT AAC 111
Met Asn Ala Trp Asn Thr Ile Tyr Asp Gln Phe Asn
1 5 10

CCT ATC GCT TTT AGT CTT GGC AGT ATT GAA GTG CAT TGG TAT GGT TTG 159
Pro Ile Ala Phe Ser Leu Gly Ser Ile Glu Val His Trp Tyr Gly Leu
15 20 25

GCG TAT GCG TGT GCG ATT GTT ACC GCT TTT TAT ATG GCG TTA AGA ATG 207
Ala Tyr Ala Cys Ala Ile Val Thr Ala Phe Tyr Met Ala Leu Arg Met
30 35 40

ATC CAA AAA GAC CCC AAG CGA TTC CCC ATT GAA AGG AAG GAA TTT GAG 255
Ile Gln Lys Asp Pro Lys Arg Phe Pro Ile Glu Arg Lys Glu Phe Glu
45 50 55 60

AGT TAT TTT TTA TGG GCG GAG CTT GGC ATT GTG CTA GGG GCA AGG ATA 303
Ser Tyr Phe Leu Trp Ala Glu Leu Gly Ile Val Leu Gly Ala Arg Ile
65 70 75

GGA TAC ATT CTT ATT TAT GAG CCT AAT TCT GGC TAT TAT TTG ACG CAT 351
Gly Tyr Ile Leu Ile Tyr Glu Pro Asn Ser Gly Tyr Tyr Leu Thr His
80 85 90

TTT TGG CAA ATC TTT AAC CCT TTT GAT AGC CAT GGG AAT TTT GTA GGC 399
Phe Trp Gln Ile Phe Asn Pro Phe Asp Ser His Gly Asn Phe Val Gly
95 100 105

ATT CGT GGG ATG AGC TAT CAT GGG GGG TTG GTG GGG TTT TTG ATC GCT 447
Ile Arg Gly Met Ser Tyr His Gly Gly Leu Val Gly Phe Leu Ile Ala
110 115 120

TCG TAT CTT TAT AGC CGT AAG GAT TTG AAA AAG CTT TTG ATT TAT TTG 495
Ser Tyr Leu Tyr Ser Arg Lys Asp Leu Lys Lys Leu Leu Ile Tyr Leu
125 130 135 140

GAT TTG ATT GCG ATC AGC CTG CCT TTA GGG TAT GTT TTT GGG AGG ATT 543
Asp Leu Ile Ala Ile Ser Leu Pro Leu Gly Tyr Val Phe Gly Arg Ile
145 150 155
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185 190 195 

CTG AAT TTA GGG TTT TGAAAGGTTA AAAATGAAAT TTAAATTTTT GAATATGGAT A 747
Leu Asn Leu Gly Phe
200

ATGAAAGCGG TTTTATTTTG ATTGAAAAAG AATTGAAACG ATTAAACATT CTCGCTCAAG 807
TCA 810

(2) INFORMATION FOR SEQ ID NO:470:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 204 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:470:

Met Gln Gly Phe Leu Leu Gln Thr Gln Ser Ile Arg Asp Glu Asp Leu
1 5 10 15

Ile Val Arg Val Leu Thr Lys Asn Gln Leu Lys Thr Leu Tyr Arg Phe
20 25 30

Tyr Gly Lys Arg His Ser Val Leu Asn Val Gly Arg Lys Ile Asp Phe
35 40 45

Glu Glu Glu Asn Asp Asp Lys Phe Leu Pro Lys Leu Arg Asn Ile Leu
50 55 60

His Leu Gly Tyr Ile Trp Glu Arg Glu Met Glu Arg Leu Phe Phe Trp
65 70 75 80

Gln Arg Phe Cys Ala Leu Leu Phe Arg His Leu Glu Gly Val His Ser
85 90 95

Leu Asp Ser Val Tyr Phe Asp Thr Leu Asp Asp Gly Ala Asn Lys Leu
100 105 110

Ala Lys Gln His Pro Leu Arg Val Ile Leu Glu Met Tyr Ala Thr Leu
115 120 125

Leu Asn Phe Glu Gly Arg Leu Gln Ser Tyr Asn Ser Cys Phe Leu Cys
130 135 140

Asp Ala Lys Leu Glu Arg Ser Val Ala Leu Ala Gln Gly Phe Ile Leu
145 150 155 160

Ala His Pro Ser Cys Leu Lys Ala Lys Ser Leu Asn Leu Glu Lys Ile
165 170 175

Gln Ala Phe Phe Arg Thr Gln Ser Thr Ile Asp Leu Glu Thr Glu Glu
180 185 190

Val Glu Glu Leu Trp Arg Thr Leu Asn Leu Gly Phe
195 200

(2) INFORMATION FOR SEQ ID NO:471:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 999 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 9: 

TCTGCAAATC CCATGTTTTT CCTTAATTTT GCAGTTTTGT TAGGATTGTA TCCAAGTTTT 60 
GCTTATAATA AACAAAATTA GCTTAAGAGT AGTG ATG CAA GGG TTT CTT TTA CAA 115
Met Gln Gly Phe Leu Leu Gln
1 5

ACA CAA AGC ATA AGA GAT GAA GAT TTG ATC GTG CGC GTT TTA ACC AAA 163
Thr Gln Ser Ile Arg Asp Glu Asp Leu Ile Val Arg Val Leu Thr Lys
10 15 20

AAC CAG CTC AAA ACC CTC TAT CGT TTC TAT GGC AAA CGC CAT AGC GTG 211
Asn Gln Leu Lys Thr Leu Tyr Arg Phe Tyr Gly Lys Arg His Ser Val
25 30 35

CTG AAT GTG GGG CGT AAA ATT GAT TTT GAA GAA GAA AAC GAT GAT AAG 259
Leu Asn Val Gly Arg Lys Ile Asp Phe Glu Glu Glu Asn Asp Asp Lys
40 45 50 55

TTT TTA CCC AAG TTA AGG AAT ATT TTG CAT TTA GGC TAT ATT TGG GAA 307
Phe Leu Pro Lys Leu Arg Asn Ile Leu His Leu Gly Tyr Ile Trp Glu
60 65 70

AGA GAA ATG GAG CGC TTG TTT TTT TGG CAA CGC TTT TGC GCT CTC TTG 355
Arg Glu Met Glu Arg Leu Phe Phe Trp Gln Arg Phe Cys Ala Leu Leu
75 80 85

TTT AGG CAT TTA GAA GGC GTG CAT TCT TTA GAT AGC GTC TAT TTT GAC 403
Phe Arg His Leu Glu Gly Val His Ser Leu Asp Ser Val Tyr Phe Asp
90 95 100

ACT TTA GAT GAT GGG GCT AAC AAA CTC GCC AAA CAG CAC CCC TTA AGA 451
Thr Leu Asp Asp Gly Ala Asn Lys Leu Ala Lys Gln His Pro Leu Arg
105 110 115

GTG ATT TTA GAA ATG TAT GCA ACG CTT TTG AAT TTT GAA GGG CGC TTG 499
Val Ile Leu Glu Met Tyr Ala Thr Leu Leu Asn Phe Glu Gly Arg Leu
120 125 130 135

CAA AGT TAC AAT TCT TGT TTT TTA TGC GAT GCA AAA TTA GAG CGT TCT 547
Gln Ser Tyr Asn Ser Cys Phe Leu Cys Asp Ala Lys Leu Glu Arg Ser
140 145 150

GTC GCT TTA GCG CAA GGG TTT ATT CTA GCG CAC CCC TCT TGT TTG AAA 595
Val Ala Leu Ala Gln Gly Phe Ile Leu Ala His Pro Ser Cys Leu Lys
155 160 165

GCT AAA AGC CTA AAT TTA GAA AAA ATC CAA GCT TTT TTT CGC ACT CAA 643
Ala Lys Ser Leu Asn Leu Glu Lys Ile Gln Ala Phe Phe Arg Thr Gln
170 175 180

AGC ACG ATT GAT TTA GAA ACA GAA GAA GTA GAA GAA TTA TGG CGC ACG 691
Ser Thr Ile Asp Leu Glu Thr Glu Glu Val Glu Glu Leu Trp Arg Thr
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Met Gly Pile Ala 
Asp Phe Phe Lys Asn Phe Lys Ile Asn Lys Leu Arg
1 5 10 15

Thr Ala Pro Ser Lys Glu Glu Gln Pro Ser His Trp Val Lys Cys Pro
20 25 30

Lys Cys Tyr Ala Leu Met Tyr His Lys Glu Val Phe Ser Lys Tyr Ser
35 40 45

Val Cys Leu Lys Cys His Tyr His Phe Arg Met Lys Ala Ala Glu Arg
50 55 60

Ile Glu Phe Leu Cys Asp Val Gly Ser Phe Glu Glu Phe Asp Lys His
65 70 75 80

Leu Arg Pro Asn Asp Pro Leu Asn Phe Val Asp Lys Glu Ser Tyr Lys
85 90 95

Gln Arg Ile Lys Lys Tyr Glu Lys Arg Thr Asn Arg Pro Ser Ser Val
100 105 110

Ile Ser Gly Glu Ala Lys Ile Asn Arg Met Pro Leu Gln Ile Val Val
115 120 125

Phe Asp Phe Ser Phe Met Gly Gly Ser Leu Gly Ser Val Glu Gly Glu
130 135 140

Lys Ile Val Arg Ala Ile Asn Arg Ala Val Ala Lys Arg Glu Ala Leu
145 150 155 160

Leu Ile Val Ser Ala Ser Gly Gly Ala Arg Met Gln Glu Ser Thr Tyr
165 170 175

Ser Leu Met Gln Met Ala Lys Thr Ser Ala Ala Leu Asn Arg Leu Ser
180 185 190

Glu Ala Lys Leu Pro Phe Ile Ser Leu Leu Ser Asp Pro Thr Tyr Gly
195 200 205

Gly Val Ser Ala Ser Phe Ala Phe Leu Gly Asp Leu Ile Ile Ala Glu
210 215 220

Pro Gly Ala Met Ile Gly Phe Ala Gly Pro Arg Val Ile Lys Gln Thr
225 230 235 240

Ile Gly Ala Asp Leu Pro Glu Gly Phe Gln Thr Ala Glu Phe Leu Leu
245 250 255

Glu His Gly Leu Ile Asp Met Ile Val His Arg Lys Asp Leu Lys Lys
260 265 270

Thr Leu Ser Asp Leu Ile Ala Met Met Thr His Lys Thr Ser Lys Ile
275 280 285

Phe



(2) INFORMATION FOR SEQ ID NO:469:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 810 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 95...706
(D) OTHER INFORMATION:
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AGC TTT ATG GGG GGG AGT TTA GGC TCT GTG GAG GGC GAA AAG ATC GTA 550 
Ser Phe Met Gly Gly Ser Leu Gly Ser Val Glu Gly Glu Lys Ile Val
135 140 145

AGA GCA ATC AAT CGC GCG GTC GCT AAA AGA GAA GCG TTA TTG ATT GTT 598
Arg Ala Ile Asn Arg Ala Val Ala Lys Arg Glu Ala Leu Leu Ile Val
150 155 160

TCA GCG AGT GGG GGG GCT AGG ATG CAA GAA TCC ACT TAT TCG CTC ATG 646
Ser Ala Ser Gly Gly Ala Arg Met Gln Glu Ser Thr Tyr Ser Leu Met
165 170 175

CAA ATG GCT AAA ACG AGC GCG GCT TTG AAC CGA TTG AGT GAG GCC AAA 694
Gln Met Ala Lys Thr Ser Ala Ala Leu Asn Arg Leu Ser Glu Ala Lys
180 185 190 195

CTC CCT TTC ATT TCG CTC TTA AGC GAT CCC ACT TAT GGG GGC GTT AGC 742
Leu Pro Phe Ile Ser Leu Leu Ser Asp Pro Thr Tyr Gly Gly Val Ser
200 205 210

GCA TCT TTT GCT TTT TTA GGG GAT CTC ATT ATC GCA GAG CCA GGG GCG 790
Ala Ser Phe Ala Phe Leu Gly Asp Leu Ile Ile Ala Glu Pro Gly Ala
215 220 225

ATG ATA GGC TTT GCG GGG CCT AGG GTG ATT AAG CAA ACT ATA GGG GCG 838
Met Ile Gly Phe Ala Gly Pro Arg Val Ile Lys Gln Thr Ile Gly Ala
230 235 240

GAT TTG CCT GAG GGC TTT CAA ACA GCG GAA TTT TTA TTA GAG CAT GGC 886
Asp Leu Pro Glu Gly Phe Gln Thr Ala Glu Phe Leu Leu Glu His Gly
245 250 255

TTG ATT GAT ATG ATT GTG CAC AGG AAG GAT TTG AAG AAG ACT TTG AGC 934
Leu Ile Asp Met Ile Val His Arg Lys Asp Leu Lys Lys Thr Leu Ser
260 265 270 275

GAT CTC ATC GCT ATG ATG ACG CAT AAG ACT TCA AAG ATT TTT TAAAGTTTT 985
Asp Leu Ile Ala Met Met Thr His Lys Thr Ser Lys Ile Phe
280 285

AACATTGATG CGTTGCGTGG TGTATTCTAT CGCTAAAAGT TCGCCTTTAG AGTTAGTGAA 1045
AATCTATC 1053

(2) INFORMATION FOR SEQ ID NO:468:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 289 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:468:
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(i) SEQUENCE CPIARACTERISTICS : 

(A) LENGTH: 1053 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 110...976
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:467:

CTTTGTGTTT GTAAAAGAAA CCCTTGCATC ACTACTCTTA AGCTAATTTT GTTTATTATA 60
AGCAAAACTT GGATACAATC CTAACAAAAC TGCAAAATTA AGGAAAAAC ATG GGA TTT 118
Met Gly Phe
1

GCA GAT TTC TTT AAA AAT TTT AAG ATC AAT AAA TTG CGG ACA GCG CCA 166
Ala Asp Phe Phe Lys Asn Phe Lys Ile Asn Lys Leu Arg Thr Ala Pro
5 10 15

AGT AAG GAA GAA CAG CCA AGC CAT TGG GTG AAA TGC CCT AAA TGT TAT 214
Ser Lys Glu Glu Gln Pro Ser His Trp Val Lys Cys Pro Lys Cys Tyr
20 25 30 35

GCG TTA ATG TAT CAT AAA GAA GTG TTT AGT AAA TAC AGC GTG TGT TTG 262
Ala Leu Met Tyr His Lys Glu Val Phe Ser Lys Tyr Ser Val Cys Leu
40 45 50

AAA TGC CAT TAC CAT TTC CGC ATG AAA GCG GCT GAA AGG ATT GAA TTT 310
Lys Cys His Tyr His Phe Arg Met Lys Ala Ala Glu Arg Ile Glu Phe
55 60 65

TTA TGC GAT GTG GGG AGT TTT GAA GAG TTT GAC AAG CAT TTA CGG CCT 358
Leu Cys Asp Val Gly Ser Phe Glu Glu Phe Asp Lys His Leu Arg Pro
70 75 80

AAT GAT CCT TTA AAT TTC GTG GAT AAA GAG AGC TAT AAA CAA CGC ATT 406
Asn Asp Pro Leu Asn Phe Val Asp Lys Glu Ser Tyr Lys Gln Arg Ile
85 90 95

AAA AAA TAC GAA AAA AGG ACT AAC CGC CCA AGC TCA GTG ATC AGC GGT 454
Lys Lys Tyr Glu Lys Arg Thr Asn Arg Pro Ser Ser Val Ile Ser Gly
100 105 110 115

GAG GCT AAA ATC AAC CGC ATG CCT TTG CAG ATC GTG GTG TTT GAT TTT 502
Glu Ala Lys Ile Asn Arg Met Pro Leu Gln Ile Val Val Phe Asp Phe
120 125 130
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AAA ATG TTA GAA AAT CAT CTT AAT ATT AAT TTT TTT ATC GCT GGA GCG 4 03 

Lys Met Leu Glu Asn His Leu Asn Ile Asn Phe Phe Ile Ala Gly Ala
90 95 100

TAT GGG TTT GAA GAA AAT TTT TTA AAG GAT TGT CAA GCT TGG AGT TTG 451
Tyr Gly Phe Glu Glu Asn Phe Leu Lys Asp Cys Gln Ala Trp Ser Leu
105 110 115

AGC GAG ATG ACT TTT AGC CAT GAA GTG GCT AAA ATT GTC TTA TGC GAG 499
Ser Glu Met Thr Phe Ser His Glu Val Ala Lys Ile Val Leu Cys Glu
120 125 130

CAA ATC TAT AGG GCT TTA AGC ATT ATT TTT AAG CAT CCA TAC CAT AAA T 548
Gln Ile Tyr Arg Ala Leu Ser Ile Ile Phe Lys His Pro Tyr His Lys
135 140 145 150

AGGAGGTGCG CATGCGTTTT TACATTATCT TTACATTTTT GTTTATTGTG GGTTTTGGTG 608 
TGTTTGTTTA TAGTATTG 626 

(2) INFORMATION FOR SEQ ID NO:466:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 150 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:466:

Met Arg Cys Val Val Tyr Ser Ile Ala Lys Ser Ser Pro Leu Glu Leu
1 5 10 15

Val Lys Ile Tyr Gln Lys Gln Cys Arg Gln Phe Asp Cys Glu Leu Glu
20 25 30

Leu Val Asp Leu Phe Pro Lys Asn Thr Ala Asn Ala Gln Lys Val Ser
35 40 45

Lys Lys Leu Ala Gln Lys Ser Tyr Ser Leu Ala Phe Glu Pro Tyr Leu
50 55 60

Asn Pro Lys Ala Lys Asn Ile Ala Leu His Pro Lys Ala Gln Arg Gly
65 70 75 80

Asp Ser Phe Ala Phe Ser Lys Met Leu Glu Asn His Leu Asn Ile Asn
85 90 95

Phe Phe Ile Ala Gly Ala Tyr Gly Phe Glu Glu Asn Phe Leu Lys Asp
100 105 110

Cys Gln Ala Trp Ser Leu Ser Glu Met Thr Phe Ser His Glu Val Ala
115 120 125

Lys Ile Val Leu Cys Glu Gln Ile Tyr Arg Ala Leu Ser Ile Ile Phe
130 135 140

Lys His Pro Tyr His Lys
145 150

(2) INFORMATION FOR SEQ ID NO:467: 
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Lys Gly Pro Leu 
Glu Asn Arg His His His Ser Phe Thr Lys Asn Tyr
100 105 110

Glu Lys Ala Val Asn Gly Cys Gln Lys Tyr Phe His Ile Lys Leu Pro
115 120 125

Glu Gly Ala Pro Ser Asn Phe Lys Ser Gly Ser Tyr Met Ala Thr Met
130 135 140

Val Val Arg Phe
145



(2) INFORMATION FOR SEQ ID NO:465:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 626 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 98...547
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:465:

ATGATTGTGC ACAGGAAGGA TTTGAAGAAG ACTTTGAGCG ATCTCATCGC TATGATGACG 60
CATAAGACTT CAAAGATTTT TTAAAGTTTT AACATTG ATG CGT TGC GTG GTG TAT 115
Met Arg Cys Val Val Tyr
1 5

TCT ATC GCT AAA AGT TCG CCT TTA GAG TTA GTG AAA ATC TAT CAA AAG 163
Ser Ile Ala Lys Ser Ser Pro Leu Glu Leu Val Lys Ile Tyr Gln Lys
10 15 20

CAA TGC AGG CAA TTT GAT TGC GAG CTG GAA TTG GTG GAT TTA TTC CCT 211
Gln Cys Arg Gln Phe Asp Cys Glu Leu Glu Leu Val Asp Leu Phe Pro
25 30 35

AAA AAT ACC GCC AAC GCT CAA AAA GTT TCT AAA AAA CTG GCT CAA AAA 259
Lys Asn Thr Ala Asn Ala Gln Lys Val Ser Lys Lys Leu Ala Gln Lys
40 45 50

AGC TAC TCT CTA GCT TTT GAG CCG TAT TTA AAC CCT AAG GCA AAA AAT 307
Ser Tyr Ser Leu Ala Phe Glu Pro Tyr Leu Asn Pro Lys Ala Lys Asn
55 60 65 70

ATC GCC TTA CAC CCT AAA GCT CAA AGG GGC GAT AGC TTT GCG TTT AGT 355
Ile Ala Leu His Pro Lys Ala Gln Arg Gly Asp Ser Phe Ala Phe Ser
75 80 85
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AAA CTT TCA GTC TAT TCG GCT GGA AGT GGG ATG ATT GGG ATT GAT ATT 255 
Lys Leu Ser Val Tyr Ser Ala Gly Ser Gly Met Ile Gly Ile Asp Ile
45 50 55

GAC AAA CGG ACA TTT TAT AAG CGA GCG TTC GCT TTC ACG ATG AAA TCG 303
Asp Lys Arg Thr Phe Tyr Lys Arg Ala Phe Ala Phe Thr Met Lys Ser
60 65 70

TTG TTC GGT GAA AAC TTG CTT TTG TTT GTC AAA TTA AAG CAT TCT GCG 351
Leu Phe Gly Glu Asn Leu Leu Leu Phe Val Lys Leu Lys His Ser Ala
75 80 85 90

TTG ACG AGC AAA CAC ATG AAA GGG CCT TTA GAA AAC CGC CAT CAC CAT 399
Leu Thr Ser Lys His Met Lys Gly Pro Leu Glu Asn Arg His His His
95 100 105

TCT TTC ACT AAA AAT TAT GAA AAA GCG GTT AAT GGT TGT CAA AAG TAT 447
Ser Phe Thr Lys Asn Tyr Glu Lys Ala Val Asn Gly Cys Gln Lys Tyr
110 115 120

TTC CAT ATT AAA TTG CCT GAA GGC GCT CCT AGC AAC TTC AAA TCA GGT 495
Phe His Ile Lys Leu Pro Glu Gly Ala Pro Ser Asn Phe Lys Ser Gly
125 130 135

TCA TAC ATG GCC ACT ATG GTG GTG CGT TTT TAAAGCGTTA TTTGGGGTAT TCT 548
Ser Tyr Met Ala Thr Met Val Val Arg Phe
140 145

TTAATACCCT TATCGTCTTT TAAAATACCA TCTTTTAAAA GCACAAATTT ATTTTTTAGC 608
CCTTTTTTAA ATCTTCTTAA A 629

(2) INFORMATION FOR SEQ ID NO:464:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 148 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:464:

Ile Ser Leu Ser Val Lys His Thr Asp Asn Val Ile Ser Leu Arg Lys
1 5 10 15

Glu Asn Gly Val Arg Thr Leu Ile Ser Leu Gly Ile Leu Leu Ser Val
20 25 30

Leu Ser Gly Asp Asp Leu Lys Leu Tyr Ser Lys Leu Ser Val Tyr Ser
35 40 45

Ala Gly Ser Gly Met Ile Gly Ile Asp Ile Asp Lys Arg Thr Phe Tyr
50 55 60

Lys Arg Ala Phe Ala Phe Thr Met Lys Ser Leu Phe Gly Glu Asn Leu
65 70 75 80

Leu Leu Phe Val Lys Leu Lys His Ser Ala Leu Thr Ser Lys His Met
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Val Gly Pro Ala lie Leu Val Leu Thr Leu Ala Trp Ala lie Gly Pro 
305 310 315 320 

Val Ile Arg Asp Asp Ala Gln Thr Gly Leu Tyr Leu Ala Asn Ile Ser
325 330 335

Lys Gly Phe Leu Asn Asn Gly Gly Gly Val Tyr Met Pro Leu Ile Phe
340 345 350

Phe Leu Ile Ser Gly Phe Ile Ala Phe Ser Thr Gly Thr Ser Trp Gly
355 360 365

Ala Phe Ala Ile Met Leu Pro Ile Gly Ala Gly Met Ala Ser Glu Ser
370 375 380

Asp Ile Ile Leu Ile Val Ser Ala Ile Leu Ser Gly Ala Val Tyr Gly
385 390 395 400

Asp His Thr Ser Pro Ile Ser Asp Thr Thr Ile Leu Ser Ala Thr Gly
405 410 415

Ala Gly Cys Ser Val Gln Ser His Phe Ile Thr Gln Leu Pro Tyr Ala
420 425 430

Thr Ile Ala Met Leu Cys Ser Ala Val Ser Leu Gly Val Ala Ser Phe
435 440 445

Met Tyr Ser Arg Ser Leu Ala Leu Leu Ile Gly Val Ala Leu Leu Val
450 455 460

Gly Val Phe Tyr Leu Leu Lys Lys Phe Tyr Gly Glu Asn Leu Lys Thr
465 470 475 480

(2) INFORMATION FOR SEQ ID NO:463:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 629 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 82...525
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:463:

CAAATCTCTA AAGAGTAACG CTTTTTAAAA AAATACATTT TTTTTAATTT TTTAATCAAT 60
CATTAAGGTG TTTTAAGTTA A ATT TCC TTA TCT GTT AAA CAT ACG GAT AAT 111
Ile Ser Leu Ser Val Lys His Thr Asp Asn
1 5 10

GTT ATA TCT TTA AGG AAA GAA AAT GGG GTT AGG ACA CTA ATA AGT TTA 159
Val Ile Ser Leu Arg Lys Glu Asn Gly Val Arg Thr Leu Ile Ser Leu
15 20 25

GGG ATT TTG TTA AGC GTT TTG AGT GGC GAT GAT CTG AAG TTG TAT TCA 207
Gly Ile Leu Leu Ser Val Leu Ser Gly Asp Asp Leu Lys Leu Tyr Ser
30 35 40






AATATTGATT GAAGAAGCTT AAAAATCCCA TTTTTTAAAA TTAAAATAAG GTTTTATCGA 1621 
TCCCTATTTG ACTCAAAAAG AGTCTTATTC CATTATCAAT CAATTAAAAA AGGTTATTCA 1681 
AAAATAACCA TACAATTATA AAAATCTTC 1710 

(2) INFORMATION FOR SEQ ID NO:462:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:462:

Met Val Val Phe Thr Lys Arg Val Ala Leu Ser Leu Phe Val Gly Ile
1 5 10 15

Leu Val Ser Ala Val Leu Met His Ser Leu His Leu Ser Gln Leu Val
20 25 30

Glu Tyr Ile Tyr His Lys Ile Thr Ser Val Phe Tyr Thr Tyr Glu Pro
35 40 45

Glu Lys Gly Leu Asn Phe Asn Leu Ser Asn Leu Tyr Val Phe Gly Phe
50 55 60

Leu Ile Phe Leu Gly Val Leu Ser Gln Val Ile Leu Lys Ser Gly Ser
65 70 75 80

Val Gln Asn Phe Val Lys Lys Ala Lys Lys Tyr Ser Lys Asn Ala Lys
85 90 95

Thr Pro Glu Phe Ile Ala Phe Phe Ser Gly Ile Ile Ile Phe Val Asp
100 105 110

Asp Tyr Phe Asn Ala Leu Thr Val Gly Gln Ile Ser Lys Ser Leu Asn
115 120 125

Asp Ala His Asn Ser Thr Arg Glu Arg Leu Ala Tyr Ile Ile Asp Ser
130 135 140

Thr Ser Ala Pro Val Cys Leu Leu Val Pro Ile Ser Ser Trp Gly Ala
145 150 155 160

Tyr Ile Met Gly Ile Met Asn Asn Asp Ser Ser Pro Leu Leu Lys Asp
165 170 175

Ser Phe Ser Val Leu Val Gln Ser Leu Ser Ser Asn Tyr Tyr Ala Ile
180 185 190

Phe Ala Leu Ile Ala Val Phe Leu Thr Ile Leu Trp Gln Ile Asn Leu
195 200 205

Pro Ser Met Arg Lys Tyr Gln Asn Ile Gly Val Lys Asp Phe Tyr Ser
210 215 220

Glu Gln Glu Glu Ser Ser Ser Lys Leu Ala Pro Leu Ser Leu Leu Pro
225 230 235 240

Leu Ser Ile Leu Leu Leu Ile Val Ser Ile Ser Ser Leu Leu Phe Tyr
245 250 255

Thr Gly Val Ile Leu Lys Asn Thr Asp Ala Ser Phe Ser Leu Phe Tyr
260 265 270

Gly Gly Leu Phe Ser Leu Ile Val Thr Tyr Leu Leu Ala Tyr Lys Phe
275 280 285

Leu Glu Lys Gly Ser Phe Phe Lys Leu Met Leu Asp Gly Phe Lys Ser
290 295 300
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ACA GGA GTG ATT TTA AAA AAC ACT GAT GCG AGT TTT TCG CTC TTT TAT 93 5 

Thr Gly Val Ile Leu Lys Asn Thr Asp Ala Ser Phe Ser Leu Phe Tyr
260 265 270

GGA GGG TTG TTT TCG CTC ATC GTT ACT TAT CTT TTA GCT TAT AAG TTT 983
Gly Gly Leu Phe Ser Leu Ile Val Thr Tyr Leu Leu Ala Tyr Lys Phe
275 280 285

TTA GAA AAA GGG AGC TTT TTT AAA CTC ATG TTG GAT GGC TTT AAG AGT 1031
Leu Glu Lys Gly Ser Phe Phe Lys Leu Met Leu Asp Gly Phe Lys Ser
290 295 300

GTG GGG CCG GCG ATA CTA GTC TTA ACG CTC GCT TGG GCT ATC GGG CCT 1079
Val Gly Pro Ala Ile Leu Val Leu Thr Leu Ala Trp Ala Ile Gly Pro
305 310 315 320

GTG ATT AGA GAT GAC GCT CAA ACA GGG CTT TAC TTG GCT AAC ATC AGC 1127
Val Ile Arg Asp Asp Ala Gln Thr Gly Leu Tyr Leu Ala Asn Ile Ser
325 330 335

AAG GGG TTT TTA AAT AAT GGA GGA GGC GTG TAT ATG CCT TTA ATC TTT 1175
Lys Gly Phe Leu Asn Asn Gly Gly Gly Val Tyr Met Pro Leu Ile Phe
340 345 350

TTT TTA ATC TCT GGG TTT ATC GCT TTT TCT ACC GGC ACA AGC TGG GGA 1223
Phe Leu Ile Ser Gly Phe Ile Ala Phe Ser Thr Gly Thr Ser Trp Gly
355 360 365

GCG TTT GCG ATC ATG CTT CCC ATT GGA GCG GGC ATG GCT AGT GAA AGC 1271
Ala Phe Ala Ile Met Leu Pro Ile Gly Ala Gly Met Ala Ser Glu Ser
370 375 380

GAT ATT ATT TTG ATT GTT TCA GCG ATT CTC TCA GGC GCG GTT TAT GGC 1319
Asp Ile Ile Leu Ile Val Ser Ala Ile Leu Ser Gly Ala Val Tyr Gly
385 390 395 400

GAT CAC ACA AGC CCT ATT TCT GAC ACG ACT ATA CTA TCG GCT ACG GGG 1367
Asp His Thr Ser Pro Ile Ser Asp Thr Thr Ile Leu Ser Ala Thr Gly
405 410 415

GCA GGG TGT TCG GTG CAA AGC CAT TTT ATC ACG CAA CTC CCT TAT GCG 1415
Ala Gly Cys Ser Val Gln Ser His Phe Ile Thr Gln Leu Pro Tyr Ala
420 425 430

ACC ATT GCG ATG CTT TGC AGC GCG GTG AGT TTG GGG GTG GCA AGC TTT 1463
Thr Ile Ala Met Leu Cys Ser Ala Val Ser Leu Gly Val Ala Ser Phe
435 440 445

ATG TAT TCG CGT TCG CTC GCT CTT TTA ATC GGT GTG GCT TTG CTT GTG 1511
Met Tyr Ser Arg Ser Leu Ala Leu Leu Ile Gly Val Ala Leu Leu Val
450 455 460

GGG GTG TTT TAT CTT TTA AAA AAA TTT TAT GGT GAA AAT CTA AAA ACT TG 1561
Gly Val Phe Tyr Leu Leu Lys Lys Phe Tyr Gly Glu Asn Leu Lys Thr
465 470 475 480
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GAA TAT ATT TAT CAT AAA ATC ACT TCC GTT TTT TAG ACT TAG GAG CCA 2 63 

Glu Tyr lie Tyr His Lys lie Thr Ser Val Phe Tyr Thr Tyr Glu Pro 
35 40 45 

GAA AAG GGG CTT AAT TTC AAT CTT TCC AAC CTC TAT GTT TTT GGG TTT 311 
Glu Lys Gly Leu Asn Phe Asn Leu Ser Asn Leu Tyr Val Phe Gly Phe 
50 55 60 

TTA ATC TTT TTA GGC GTC TTA AGC CAA GTG ATT TTA AAA TCC GGT AGC 359 
Leu Ile Phe Leu Gly Val Leu Ser Gln Val Ile Leu Lys Ser Gly Ser
65 70 75 80

GTG CAA AAC TTT GTC AAA AAA GCT AAA AAA TAC TCA AAA AAC GCT AAA 407
Val Gln Asn Phe Val Lys Lys Ala Lys Lys Tyr Ser Lys Asn Ala Lys
85 90 95

ACT CCC GAA TTT ATC GCC TTT TTT TCA GGT ATC ATT ATT TTT GTA GAT 455
Thr Pro Glu Phe Ile Ala Phe Phe Ser Gly Ile Ile Ile Phe Val Asp
100 105 110

GAT TAT TTT AAC GCC CTA ACC GTG GGG CAA ATC TCA AAG TCT TTA AAC 503
Asp Tyr Phe Asn Ala Leu Thr Val Gly Gln Ile Ser Lys Ser Leu Asn
115 120 125

GAC GCT CAT AAC TCC ACA CGA GAG CGC TTG GCT TAT ATT ATA GAC TCC 551
Asp Ala His Asn Ser Thr Arg Glu Arg Leu Ala Tyr Ile Ile Asp Ser
130 135 140 

ACT TCA GCG CCG GTG TGC TTG CTA GTC CCC ATT TCT AGT TGG GGG GCG 599 
Thr Ser Ala Pro Val Cys Leu Leu Val Pro Ile Ser Ser Trp Gly Ala
145 150 155 160

TAT ATT ATG GGG ATC ATG AAT AAC GAC AGC TCG CCC TTA TTA AAA GAT 647
Tyr Ile Met Gly Ile Met Asn Asn Asp Ser Ser Pro Leu Leu Lys Asp
165 170 175

AGT TTT TCG GTG CTT GTG CAA AGC TTA AGC AGT AAT TAT TAT GCC ATT 695
Ser Phe Ser Val Leu Val Gln Ser Leu Ser Ser Asn Tyr Tyr Ala Ile
180 185 190

TTT GCA CTC ATT GCA GTC TTT CTC ACC ATT TTA TGG CAA ATC AAC CTC 743
Phe Ala Leu Ile Ala Val Phe Leu Thr Ile Leu Trp Gln Ile Asn Leu
195 200 205

CCT AGC ATG AGA AAG TAT CAA AAC ATA GGC GTG AAG GAT TTT TAT AGC 791
Pro Ser Met Arg Lys Tyr Gln Asn Ile Gly Val Lys Asp Phe Tyr Ser
210 215 220

GAA CAA GAA GAA AGC TCT TCA AAA CTA GCC CCC TTG AGT TTG TTA CCC 839
Glu Gln Glu Glu Ser Ser Ser Lys Leu Ala Pro Leu Ser Leu Leu Pro
225 230 235 240

CTT TCT ATT TTA TTG TTG ATT GTG TCC ATT TCA TCA TTG CTT TTT TAT 887
Leu Ser Ile Leu Leu Leu Ile Val Ser Ile Ser Ser Leu Leu Phe Tyr
245 250 255 
Asp Leu Ala Pro Asn Tyr Lys Asp Phe Asp Phe Ile Leu Thr Gly Gly
85 90 95

Glu Pro Ser Leu Tyr Phe Asn Asn Pro Ile Leu Ile Ser Val Leu Glu
100 105 110

His Phe Tyr Arg Gln Lys Ile Pro Leu Cys Val Glu Ser Asn Gly Ser
115 120 125

Ile Phe Phe Glu Phe Ser Pro Ile Leu Lys Glu Leu His Phe Thr Leu
130 135 140

Ser Val Lys Leu Ser Phe Ser Leu Glu Glu Glu Ser Lys Arg Ile His
145 150 155 160

Leu Lys Ala Leu Gln Asn Ile Leu Asn Asn Ala Lys Ser Ala His Phe
165 170 175

Lys Phe Val Leu Glu Ser Gln Asn Ala Ala Gln Ser Ile Ile Glu Ile
180 185 190

Gln Ser Leu Leu Lys Gln Leu Ser Leu Lys Asn Asn Glu Ile Phe Leu
195 200 205

Met Pro Leu Gly Thr Asn Asn Asn Glu Leu Asp Lys Asn Leu Lys Thr
210 215 220

Leu Ala Pro Leu Ala Ile Lys His Gly Phe Arg Leu Ser Asp Arg Leu
225 230 235 240

His Ile Arg Leu Trp Asp Asn Gln Lys Gly Phe
245 250 



(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1710 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 120...1559
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461:

TAAAGAATTT TGTGJ^TATT GATTGTCTCT TTTAATTGAA ATTTAAAGAT TAGTTTAAAG 60

GATTTTATTC GGTGGGATTG TCAGCATCAA GCCTCATTGT TCCTATTAGC GTTATTTTA 119 



ATG GTG GTT TTT ACT AAA AGA GTC GCA CTC TCG TTA TTT GTG GGC ATT 167 
Met Val Val Phe Thr Lys Arg Val Ala Leu Ser Leu Phe Val Gly Ile
1 5 10 15

TTA GTG AGC GCT GTT TTA ATG CAT TCG TTA CAC CTT TCC CAA CTC GTA 215
Leu Val Ser Ala Val Leu Met His Ser Leu His Leu Ser Gln Leu Val
20 25 30 
145 150 155

AAG CGG ATC CAT CTT AAA GCC TTA CAA AAT ATC TTA AAT AAC GCT AAA 591
Lys Arg Ile His Leu Lys Ala Leu Gln Asn Ile Leu Asn Asn Ala Lys
160 165 170

AGC GCG CAT TTT AAA TTT GTT TTA GAG AGC CAA AAC GCC GCT CAA TCT 639
Ser Ala His Phe Lys Phe Val Leu Glu Ser Gln Asn Ala Ala Gln Ser
175 180 185

ATT ATA GAA ATT CAA AGC CTC TTG AAA CAA CTC TCC TTA AAA AAT AAT 687
Ile Ile Glu Ile Gln Ser Leu Leu Lys Gln Leu Ser Leu Lys Asn Asn
190 195 200

GAA ATC TTT TTA ATG CCT TTA GGC ACA AAT AAC AAC GAG CTA GAC AAA 735
Glu Ile Phe Leu Met Pro Leu Gly Thr Asn Asn Asn Glu Leu Asp Lys
205 210 215 220

AAT CTA AAA ACC CTA GCC CCC CTA GCC ATA AAG CAT GGT TTC AGG CTA 783
Asn Leu Lys Thr Leu Ala Pro Leu Ala Ile Lys His Gly Phe Arg Leu
225 230 235

AGC GAT AGG CTT CAT ATC CGC TTG TGG GAT AAT CAA AAA GGG TTT TAAAA 833
Ser Asp Arg Leu His Ile Arg Leu Trp Asp Asn Gln Lys Gly Phe
240 245 250 

AGTTAATCAT GACCATCAAA GTTTTTTCGC CCAAATACCC CACTGAATTA GAAGAATTTT 893

ATGCTGAGCG TATCGCTGAC AACCCTTTAG GGTTTATCCA ACGCTTGGAT CTTTTGCCTA 953 

GTATTAGCGG GTTCGTTCAA AAATTGCGCG AGCATGGCGG GGAATTTTTT GAAATGAGAG 1013 

AGGGTAACAA GCTCATTGGG ATTTGTGGGC TTAATCCTAT CAATCAAACA GAAGCCGAGC 1073

TGTGCAA 1080 

(2) INFORMATION FOR SEQ ID NO: 460:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 251 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460:

Met Lys Leu Pro Val Val Glu Ser Phe Phe Ser Leu Gln Gly Glu Gly
1 5 10 15

Lys Arg Ile Gly Lys Pro Ser Leu Phe Leu Arg Leu Gly Gly Cys Asn
20 25 30

Leu Ser Cys Lys Gly Phe Asn Cys Lys Thr Leu Leu Asn Asp Glu Ile
35 40 45

Leu Thr Gly Cys Asp Ser Leu Tyr Ala Val His Pro Lys Phe Lys Thr
50 55 60

Ser Trp Asp Tyr Tyr Asn Glu Pro Lys Pro Leu Ile Glu Arg Leu Glu
65 70 75 80
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 76...828
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

CTTTTTCAAT AAAATCATCA GCGATGAAAA ACAATGCTTT TTCCACGCTA AACCCTTACA 60 
CCAGATCCCT TAAAA ATG AAA CTC CCG GTC GTT GAG AGC TTT TTT TCC TTA 111 
Met Lys Leu Pro Val Val Glu Ser Phe Phe Ser Leu 
1 5 10

CAA GGT GAA GGA AAA AGG ATA GGC AAG CCC AGT CTT TTT TTG CGC TTA 159
Gln Gly Glu Gly Lys Arg Ile Gly Lys Pro Ser Leu Phe Leu Arg Leu
15 20 25

GGG GGG TGT AAC CTT TCA TGC AAG GGC TTT AAT TGT AAA ACC TTA TTG 207
Gly Gly Cys Asn Leu Ser Cys Lys Gly Phe Asn Cys Lys Thr Leu Leu
30 35 40

AAT GAT GAA ATC CTA ACA GGT TGC GAC AGC TTG TAT GCG GTG CAT CCC 255
Asn Asp Glu Ile Leu Thr Gly Cys Asp Ser Leu Tyr Ala Val His Pro
45 50 55 60

AAA TTC AAA ACA TCT TGG GAT TAT TAT AAT GAG CCT AAG CCC TTG ATT 303
Lys Phe Lys Thr Ser Trp Asp Tyr Tyr Asn Glu Pro Lys Pro Leu Ile
65 70 75

GAA CGA TTA GAG GAT TTA GCC CCT AAT TAT AAG GAT TTT GAT TTC ATT 351
Glu Arg Leu Glu Asp Leu Ala Pro Asn Tyr Lys Asp Phe Asp Phe Ile
80 85 90

CTT ACA GGC GGG GAG CCA AGC TTG TAT TTC AAT AAC CCT ATT TTA ATC 399
Leu Thr Gly Gly Glu Pro Ser Leu Tyr Phe Asn Asn Pro Ile Leu Ile
95 100 105

AGC GTT TTA GAG CAT TTT TAT CGC CAA AAA ATC CCT TTA TGT GTA GAG 447
Ser Val Leu Glu His Phe Tyr Arg Gln Lys Ile Pro Leu Cys Val Glu
110 115 120

AGT AAT GGT TCT ATT TTT TTT GAA TTT AGC CCT ATT TTA AAA GAA TTG 495
Ser Asn Gly Ser Ile Phe Phe Glu Phe Ser Pro Ile Leu Lys Glu Leu
125 130 135 140

CAT TTC ACT CTA AGC GTC AAA CTC TCT TTT TCT TTA GAG GAA GAA AGC 543
His Phe Thr Leu Ser Val Lys Leu Ser Phe Ser Leu Glu Glu Glu Ser
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:

Met Asn Leu Asn Phe Met Pro Leu Leu His Ala Tyr Asn His Ala Ser
1 5 10 15

Ile Asp Phe His Phe Asn Ser Ser Ala Arg Asp Phe Cys Val His Glu
20 25 30

Val Pro Leu Tyr Glu Phe Ser Asn Thr Gly Glu His Ala Val Ile Gln
35 40 45

Val Arg Lys Ser Gly Leu Ser Thr Leu Glu Met Leu Gln Ile Phe Ser
50 55 60

Gln Ile Leu Gly Val Arg Ile Ala Glu Leu Gly Tyr Ala Gly Leu Lys
65 70 75 80

Asp Lys Asn Ala Leu Thr Thr Gln Phe Ile Ser Leu Pro Lys Lys Tyr
85 90 95

Ala Pro Leu Leu Glu Lys Asn Thr Ser Asn Phe Gln Glu Lys Asn Leu
100 105 110

Lys Ile Leu Ser Leu Asn Tyr His His Asn Lys Ile Lys Leu Gly His
115 120 125

Leu Lys Gly Asn Arg Phe Phe Met Arg Phe Lys Lys Met Thr Pro Leu
130 135 140

Asn Ala Gln Lys Thr Lys Gln Val Leu Glu Gln Ile Ala Gln Phe Gly
145 150 155 160

Met Pro Asn Tyr Phe Gly Ser Gln Arg Phe Gly Lys Phe Asn Asp Asn
165 170 175

His Gln Glu Gly Leu Lys Ile Leu Gln Asn Gln Thr Lys Phe Ala His
180 185 190

Gln Lys Leu Asn Ala Phe Leu Ile Ser Ser Tyr Gln Ser Tyr Leu Phe
195 200 205

Asn Ala Leu Leu Ser Lys Arg Leu Glu Ile Ser Lys Ile Ile Ser Ala
210 215 220

Phe Ser Val Lys Glu Asn Leu Glu Phe Phe Lys Gln Lys Asn Leu Ser
225 230 235 240

Val Asp Ser Asp Thr Leu Lys Thr Leu Lys Asn Gln Ala His Pro Phe
245 250 255

Lys Ile Leu Glu Gly Asp Val Met Cys His Tyr Pro Tyr Gly Lys Phe
260 265 270

Phe Asp Ala Leu Glu Leu Glu Lys Glu Gly Glu Arg Phe Leu Lys Lys
275 280 285

Glu Val Ala Pro Thr Gly Leu Leu Asp Gly Lys Lys Ala Leu Tyr Ala
290 295 300

Lys Asn Leu Ser Leu Glu Ile Glu Lys Glu Phe Gln His Asn Leu Leu
305 310 315 320

Ser Ser His Ala Lys Thr Leu Gly Ser Arg Arg Phe Phe Trp Val Phe
325 330 335

Val Glu Asn Val Thr Ser Gln Tyr Val Lys Glu Lys Ala Gln Phe Glu
340 345 350

Leu Gly Phe Tyr Leu Pro Lys Gly Ser Tyr Ala Ser Ala Leu Leu Lys
355 360 365

Glu Ile Lys His Glu Lys Gly Glu Asn Asn Asp Glu Phe
370 375 380



(2) INFORMATION FOR SEQ ID NO: 459: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1080 base pairs 
Phe Ser Val Lys Glu Asn Leu Glu Phe Phe Lys Gln Lys Asn Leu Ser
225 230 235 240

GTT GAT TCA GAC ACT CTA AAA ACC CTT AAA AAC CAA GCC CAC CCC TTT 827
Val Asp Ser Asp Thr Leu Lys Thr Leu Lys Asn Gln Ala His Pro Phe
245 250 255

AAA ATC TTA GAA GGC GAT GTG ATG TGC CAT TAC CCT TAT GGG AAG TTT 875
Lys Ile Leu Glu Gly Asp Val Met Cys His Tyr Pro Tyr Gly Lys Phe
260 265 270

TTT GAC GCT TTA GAA TTA GAA AAA GAG GGC GAA AGG TTT TTG AAA AAA 923
Phe Asp Ala Leu Glu Leu Glu Lys Glu Gly Glu Arg Phe Leu Lys Lys
275 280 285

GAA GTT GCG CCT ACG GGG TTA CTA GAC GGC AAA AAA GCT CTT TAT GCA 971
Glu Val Ala Pro Thr Gly Leu Leu Asp Gly Lys Lys Ala Leu Tyr Ala
290 295 300

AAA AAT TTG AGT TTA GAA ATT GAA AAA GAA TTC CAG CAT AAC CTT TTA 1019
Lys Asn Leu Ser Leu Glu Ile Glu Lys Glu Phe Gln His Asn Leu Leu
305 310 315 320

AGT AGC CAT GCT AAA ACG CTA GGC TCT AGG CGG TTT TTT TGG GTG TTT 1067
Ser Ser His Ala Lys Thr Leu Gly Ser Arg Arg Phe Phe Trp Val Phe
325 330 335

GTA GAA AAT GTA ACT TCT CAA TAC GTG AAA GAA AAA GCG CAA TTT GAA 1115
Val Glu Asn Val Thr Ser Gln Tyr Val Lys Glu Lys Ala Gln Phe Glu
340 345 350

TTG GGA TTT TAC TTG CCT AAA GGG AGT TAT GCG AGC GCG TTG CTC A7^ 1163
Leu Gly Phe Tyr Leu Pro Lys Gly Ser Tyr Ala Ser Ala Leu Leu Lys
355 360 365

GAA ATC AAG CAT GAG AAA GGA GAA AAT AAT GAC GAA TTT TGAAAAGATT ATC 1215
Glu Ile Lys His Glu Lys Gly Glu Asn Asn Asp Glu Phe
370 375 380

GCGCAAAACA GGATCAAAAC GAACGCGGTT TTAGCGACTT ATTGCGTGAT TTTTGCTTTT 1275
ATCGGGTTGT TGGTGGATGT CATTAGAATT AATGCTAATG ATTTAGGAAT AGCTCTTTTT 1335
AAACTCATGA CTTTT 1350

(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
1 5 10 15

ATT GAT TTT CAT TTC AAT TCT AGT GCT AGG GAT TTT TGC GTG CAT GAA 155
Ile Asp Phe His Phe Asn Ser Ser Ala Arg Asp Phe Cys Val His Glu
20 25 30

GTG CCT TTG TAT GAA TTT AGT AAC ACG GGC GAA CAT GCC GTT ATT CAA 203
Val Pro Leu Tyr Glu Phe Ser Asn Thr Gly Glu His Ala Val Ile Gln
35 40 45

GTG AGG AAA AGC GGT TTA AGC ACT TTA GAA ATG CTT CAG ATT TTT TCT 251
Val Arg Lys Ser Gly Leu Ser Thr Leu Glu Met Leu Gln Ile Phe Ser
50 55 60

CAA ATT TTA GGG GTA AGA ATC GCT GAA TTG GGT TAT GCG GGC TTG AAA 299
Gln Ile Leu Gly Val Arg Ile Ala Glu Leu Gly Tyr Ala Gly Leu Lys
65 70 75 80

GAT AAA AAC GCG CTG ACG ACT CAA TTC ATC TCA CTC CCT AAA AAA TAC 347
Asp Lys Asn Ala Leu Thr Thr Gln Phe Ile Ser Leu Pro Lys Lys Tyr
85 90 95

GCC CCT TTA TTA GAA AAA AAT ACG AGC AAC TTT CAA GAA AAA AAC CTT 395
Ala Pro Leu Leu Glu Lys Asn Thr Ser Asn Phe Gln Glu Lys Asn Leu
100 105 110

AAA ATC CTG TCT TTG AAT TAC CAC CAC AAT AAA ATC AAA TTG GGG CAT 443
Lys Ile Leu Ser Leu Asn Tyr His His Asn Lys Ile Lys Leu Gly His
115 120 125

TTG AAA GGG AAT CGC TTT TTT ATG CGT TTT AAA AAA ATG ACC CCT CTA 491
Leu Lys Gly Asn Arg Phe Phe Met Arg Phe Lys Lys Met Thr Pro Leu
130 135 140

AAC GCT CAA AAA ACA AAG CAG GTT TTA GAA CAA ATC GCG CAG TTT GGA 539
Asn Ala Gln Lys Thr Lys Gln Val Leu Glu Gln Ile Ala Gln Phe Gly
145 150 155 160

ATG CCT AAT TAT TTT GGC TCG CAA CGC TTT GGG AAG TTC AAT GAC AAC 587
Met Pro Asn Tyr Phe Gly Ser Gln Arg Phe Gly Lys Phe Asn Asp Asn
165 170 175

CAC CAA GAG GGT TTA AAA ATC TTA CAA AAT CAA ACG AAA TTC GCC CAT 635
His Gln Glu Gly Leu Lys Ile Leu Gln Asn Gln Thr Lys Phe Ala His
180 185 190

CAA AAA TTA AAC GCT TTT TTA ATT TCA AGC TAT CAA AGT TAT TTG TTT 683
Gln Lys Leu Asn Ala Phe Leu Ile Ser Ser Tyr Gln Ser Tyr Leu Phe
195 200 205

AAC GCG CTT TTA AGC AAA CGA TTA GAA ATC AGT AAA ATC ATT AGC GCT 731
Asn Ala Leu Leu Ser Lys Arg Leu Glu Ile Ser Lys Ile Ile Ser Ala
210 215 220

TTT AGT GTC AAA GAA AAT TTA GAA TTT TTT AAA CAA AAA AAT TTA AGC 779
1 5 10 15

Ala Ala Gly Phe Val Gln Leu Ile Ser Asn Tyr Ile Asn Gln Phe Ser
20 25 30

Ser Thr Leu Phe Val Thr Asn Leu Pro Ala Gln Asp Ile Ile Tyr Val
35 40 45

Pro Gly Tyr Glu Val Ser Gly Thr Ala Lys Tyr Lys Gly Phe Ser Leu
50 55 60

Gly Leu Ser Val Ala Arg Ser Trp Pro Ser Leu Lys Gly Arg Leu Ile
65 70 75 80

Ala Asp Val Tyr Glu Leu Ala Ala Thr Thr Gly Asn Val Phe Ile Leu
85 90 95

Thr Ala Ser Tyr Lys Ile Pro Arg Thr Gly Leu Ser Ile Thr Trp Leu
100 105 110

Ser Arg Phe Val Thr Asp Leu Ser Tyr Cys Ser Tyr Ser Pro Tyr Arg
115 120 125

Asn Gly Pro Thr Asp Ile Asp Arg Arg Pro Ser Asn Cys Pro Lys Thr
130 135 140

Pro Gly Ile Phe His Val His Lys Pro Gly Tyr Gly Val Ser Ser Phe
145 150 155 160

Phe Val Thr Tyr Lys Pro Thr Tyr Lys Lys Leu Lys Gly Leu Ser Leu
165 170 175

Asn Ala Val Phe Asn Asn Val Phe Asn Gln Gln Tyr Ile Asp Gln Ala
180 185 190

Ser Pro Val Met Ser Pro Asp Glu Pro Asn Gln Asp Lys Tyr Ala Arg
195 200 205

Gly Met Ala Glu Pro Gly Phe Asn Ala Arg Phe Glu Ile Ser Tyr Lys
210 215 220

Phe
225

(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 60...1202
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457:

AATTTTGAAA ACATTGACTC AGTTTCGCTC TCAAGGGCGT TTAATTCAAG GATCAAAGC 59

ATG AAT TTA AAT TTT ATG CCC CTA TTG CAT GCT TAT AAC CAT GCG AGC 107
Met Asn Leu Asn Phe Met Pro Leu Leu His Ala Tyr Asn His Ala Ser
Lys Gly Arg Leu Ile Ala Asp Val Tyr Glu Leu Ala Ala Thr Thr Gly
80 85 90

AAT GTG TTT ATT TTG ACG GCA AGT TAT AAA ATC CCA CGC ACT GGT CTT 399
Asn Val Phe Ile Leu Thr Ala Ser Tyr Lys Ile Pro Arg Thr Gly Leu
95 100 105

AGC ATC ACT TGG CTT TCA CGC TTC GTT ACG GAT TTG AGT TAT TGC TCT 447
Ser Ile Thr Trp Leu Ser Arg Phe Val Thr Asp Leu Ser Tyr Cys Ser
110 115 120

TAT AGC CCT TAT CGT AAC GGC CCT ACG GAT ATT GAC AGA CGG CCT AGT 495
Tyr Ser Pro Tyr Arg Asn Gly Pro Thr Asp Ile Asp Arg Arg Pro Ser
125 130 135

AAT TGC CCT AAA ACG CCC GGG ATT TTT CAT GTT CAT AAA CCC GGT TAT 543
Asn Cys Pro Lys Thr Pro Gly Ile Phe His Val His Lys Pro Gly Tyr
140 145 150 155

GGG GTG AGC AGT TTT TTT GTA ACC TAC AAA CCC ACC TAT AAG AAG CTT 591
Gly Val Ser Ser Phe Phe Val Thr Tyr Lys Pro Thr Tyr Lys Lys Leu
160 165 170

AAA GGG TTG AGC TTG AAT GCG GTG TTT AAC AAT GTT TTT AAC CAA CAA 639
Lys Gly Leu Ser Leu Asn Ala Val Phe Asn Asn Val Phe Asn Gln Gln
175 180 185

TAT ATT GAT CAA GCA AGC CCG GTG ATG AGC CCT GAT GAA CCC AAT CAA 687
Tyr Ile Asp Gln Ala Ser Pro Val Met Ser Pro Asp Glu Pro Asn Gln
190 195 200

GAC AAA TAC GCA AGA GGC ATG GCA GAG CCT GGC TTT AAC GCT AGA TTT 735
Asp Lys Tyr Ala Arg Gly Met Ala Glu Pro Gly Phe Asn Ala Arg Phe
205 210 215

GAA ATT TCC TAT AAG TTT TAATAATGGA TCTAAAAATA AGGATTTCAT GGGTAGCG 791
Glu Ile Ser Tyr Lys Phe
220 225

GATCTAATCA AAAATAAAAC ATTCTTTAGA 821
(2) INFORMATION FOR SEQ ID NO: 456:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 225 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:
Met Trp Asn Phe Asn Thr Glu Tyr Ser Ser Gln Tyr Phe Asp Phe Arg
Gln Thr Ser Leu Glu Tyr Leu Lys Ala Met His Ser His Ile Leu Lys
130 135 140

Lys Glu Ala Pro Phe Ala Val Gly Leu Phe Ala Gly Glu Thr Leu Glu
145 150 155 160

Lys Pro His Phe Leu Ser Met Ser Leu Cys Lys Gln Gln Cys Glu Leu
165 170 175

Glu Ala Asp Leu Ile Glu Ser Val Leu Gln Ile Lys Ser Glu Ile Ile
180 185 190

Thr Pro Leu Ala Phe Gln Arg Gly Leu Glu Lys Lys Ala Lys Lys Gln
195 200 205

Ile Lys Lys Val Val Leu Pro Glu Ser Glu Lys Met Lys Gly Phe
210 215 220

(2) INFORMATION FOR SEQ ID NO: 455:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 79...753
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455:

TATGCCTGGA GGTTTGGTGT GGATGCGTCA AGATAATTTG CGNCTACAAC CGCAATTTAA 60 

AGCCAGAAAT TGGGCAAA ATG TGG AAT TTT AAC ACC GAA TAC AGC AGT CAG 111
Met Trp Asn Phe Asn Thr Glu Tyr Ser Ser Gln
1 5 10

TAT TTT GAT TTT AGA GCC GCC GGT TTT GTC CAA TTG ATT TCT AAT TAC 159
Tyr Phe Asp Phe Arg Ala Ala Gly Phe Val Gln Leu Ile Ser Asn Tyr
15 20 25

ATC AAT CAA TTT TCT TCA ACG CTT TTT GTA ACC AAC TTG CCC GCA CAA 207
Ile Asn Gln Phe Ser Ser Thr Leu Phe Val Thr Asn Leu Pro Ala Gln
30 35 40

GAT ATT ATT TAT GTG CCT GGT TAT GAA GTT TCA GGG ACG GCT AAA TAC 255
Asp Ile Ile Tyr Val Pro Gly Tyr Glu Val Ser Gly Thr Ala Lys Tyr
45 50 55

AAG GGC TTT TCT TTA GGC TTG AGC GTG GCG CGA TCA TGG CCT TCT TTA 303
Lys Gly Phe Ser Leu Gly Leu Ser Val Ala Arg Ser Trp Pro Ser Leu
60 65 70 75

AAG GGG CGT TTG ATC GCT GAT GTG TAT GAA TTG GCG GCC ACG ACA GGC 351
AGT TTG GAA TAT TTG AAA GCC ATG CAC TCT CAT ATT CTC AAA AAA GAA 548
Ser Leu Glu Tyr Leu Lys Ala Met His Ser His Ile Leu Lys Lys Glu
135 140 145

GCC CCT TTC GCT GTA GGG TTA TTT GCG GGC GAA ACG CTT GAA AAA CCA 596
Ala Pro Phe Ala Val Gly Leu Phe Ala Gly Glu Thr Leu Glu Lys Pro
150 155 160

CAT TTT TTA AGC ATG TCT CTT TGC AAG CAA CAA TGC GAA TTA GAA GCG 644
His Phe Leu Ser Met Ser Leu Cys Lys Gln Gln Cys Glu Leu Glu Ala
165 170 175

GAT CTG ATT GAA AGC GTG TTG CAA ATA AAA AGC GAG ATT ATT ACC CCT 692
Asp Leu Ile Glu Ser Val Leu Gln Ile Lys Ser Glu Ile Ile Thr Pro
180 185 190

TTA GCC TTT CAA AGG GGT TTG GAA AAA AAG GCT AAA AAA CAG ATT AAA 740
Leu Ala Phe Gln Arg Gly Leu Glu Lys Lys Ala Lys Lys Gln Ile Lys
195 200 205 210

AAA GTG GTT TTA CCA GAG AGC GAA AAG ATG AAA GGA TTT TGAAAGCTGC AC 791
Lys Val Val Leu Pro Glu Ser Glu Lys Met Lys Gly Phe
215 220 

ATCGTTTGAA TTTAATGGGC GCGGTAGGAT TGATCTTATT AGGCGATAAA GAA 844 

(2) INFORMATION FOR SEQ ID NO: 454:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 223 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454:

Met Gln Gly Leu Trp Ile Tyr Pro Glu Asp Thr Glu Val Leu Gly Val
1 5 10 15

Ala Cys Lys Ser Leu Leu Lys Ala Leu Thr Pro Arg Tyr Gln Lys Val
20 25 30

Ala Leu Phe Ser Pro Ile Ser Gly Gly Cys Glu Ser Leu Glu Glu Cys
35 40 45

Glu Ser Leu Asn Pro Leu Glu Phe His Ser Ala Ile Ser Lys Gln Lys
50 55 60

Ala Leu Glu Leu Ala Ser Thr Ala Gln Glu Glu Leu Leu Phe Glu Thr
65 70 75 80

Ile Leu Lys Arg Tyr Asp Glu Leu Gln Ser Thr His Asp Phe Val Ile
85 90 95

Asn Leu Gly Cys Ala Pro Lys Phe Phe Leu Asn Ala Pro Leu Asp Leu
100 105 110

Asn Thr Ile Leu Ala Lys His Leu Asn Ala Ser Val Val Ala Val Ala
115 120 125
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 844 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 111...779
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

GTCGTTATTC GCGCTTAATG AGAACAGAGG TTTTTAAAAC TATGGTTTCG TTTAGGTTTA 60

ATTAAATTTC GCTACAATTA AATAAAAACG ATAATTTTAG AGAGATTGGC ATG CAA 116
Met Gln
1

GGT TTA TGG ATT TAT CCA GAG GAT ACA GAA GTT TTA GGG GTT GCT TGT 164
Gly Leu Trp Ile Tyr Pro Glu Asp Thr Glu Val Leu Gly Val Ala Cys
5 10 15

AAG AGC CTT TTA AAA GCA CTA ACG CCA CGC TAT CAA AAA GTC GCC TTG 212
Lys Ser Leu Leu Lys Ala Leu Thr Pro Arg Tyr Gln Lys Val Ala Leu
20 25 30

TTT TCG CCC ATT AGT GGA GGG TGT GAG AGC TTG GAG GAG TGC GAG AGC 260
Phe Ser Pro Ile Ser Gly Gly Cys Glu Ser Leu Glu Glu Cys Glu Ser
35 40 45 50

TTG AAC CCT TTA GAA TTT CAT AGT GCG ATA AGC AAA CAA AAG GCT TTA 308
Leu Asn Pro Leu Glu Phe His Ser Ala Ile Ser Lys Gln Lys Ala Leu
55 60 65

GAG CTT GCG AGC ACC GCT CAA GAA GAG TTA CTA TTT GAA ACG ATT CTC 356
Glu Leu Ala Ser Thr Ala Gln Glu Glu Leu Leu Phe Glu Thr Ile Leu
70 75 80

AAA CGC TAT GAT GAA TTA CAA TCC ACG CAT GAT TTT GTC ATT AAT TTG 404
Lys Arg Tyr Asp Glu Leu Gln Ser Thr His Asp Phe Val Ile Asn Leu
85 90 95

GGG TGT GCG CCG AAG TTT TTC TTA AAC GCT CCT TTA GAT TTA AAC ACC 452
Gly Cys Ala Pro Lys Phe Phe Leu Asn Ala Pro Leu Asp Leu Asn Thr
100 105 110

ATT TTA GCC AAG CAT TTA AAC GCT TCT GTT GTG GCT GTC GCG CAA ACG 500
Ile Leu Ala Lys His Leu Asn Ala Ser Val Val Ala Val Ala Gln Thr
115 120 125 130
AGATGTAGGT AACAAAGAGA CAGATTTGAT TGTTGAGGAT TTTTCTAGTT ACAGCAATGA 60
AAGAAAAAGG GCTTTAG GTG TTG AAG CTC AAT CTT AAA AAA TCT TTT CAA 110
Val Leu Lys Leu Asn Leu Lys Lys Ser Phe Gln
1 5 10

AAA GAT TTT GAT AAA TTG CTT TTG AAT GGG TTT GAT GAT AGC GTT TTG 158
Lys Asp Phe Asp Lys Leu Leu Leu Asn Gly Phe Asp Asp Ser Val Leu
15 20 25

AAT GAA GTC ATT CTA ACC TTA AGA AAA AAA GAA CCG CTA GAT CCA CAA 206
Asn Glu Val Ile Leu Thr Leu Arg Lys Lys Glu Pro Leu Asp Pro Gln
30 35 40

TTT CAA GAT CAT GCC TTA AAG GGA AAG TGG AAA CCT TTT AGG GAA TGC 254
Phe Gln Asp His Ala Leu Lys Gly Lys Trp Lys Pro Phe Arg Glu Cys
45 50 55

CAC ATT AAG CCT GAT GTT TTG CTT GTG TAT TTA GTG AAA GAT GAT GAA 302
His Ile Lys Pro Asp Val Leu Leu Val Tyr Leu Val Lys Asp Asp Glu
60 65 70 75

CTG ATT TTG TTA AGG TTA GGC AGT CAT AGC GAG CTG TTT TAATCCACCC AC 353
Leu Ile Leu Leu Arg Leu Gly Ser His Ser Glu Leu Phe
80 85

ACCCCTTATA ACGCTTAAAC CAAATCGCTT GCGCTATAAT GAACTGATAT TATATTTTAA 413
AAGGAATAAA CA 425



(2) INFORMATION FOR SEQ ID NO: 452:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:

Val Leu Lys Leu Asn Leu Lys Lys Ser Phe Gln Lys Asp Phe Asp Lys
1 5 10 15

Leu Leu Leu Asn Gly Phe Asp Asp Ser Val Leu Asn Glu Val Ile Leu
20 25 30

Thr Leu Arg Lys Lys Glu Pro Leu Asp Pro Gln Phe Gln Asp His Ala
35 40 45

Leu Lys Gly Lys Trp Lys Pro Phe Arg Glu Cys His Ile Lys Pro Asp
50 55 60

Val Leu Leu Val Tyr Leu Val Lys Asp Asp Glu Leu Ile Leu Leu Arg
65 70 75 80

Leu Gly Ser His Ser Glu Leu Phe
85



(2) INFORMATION FOR SEQ ID NO:453: 
65 70 75 80

GTA AGG TTA GGC AGT CAT AGC GAG CTG TTT TGAACCGCCC ACACCCCTTA TAAC 353 
Val Arg Leu Gly Ser His Ser Glu Leu Phe 
85 90 

GCTTAAACCA ACTACCCCCC TTTTTTAGGG ATAAATTTAG GGTTGAAACA CCGCTTA 410 

(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450:

Met Leu Thr Ile Glu Thr Ser Lys Lys Phe Asp Lys Asp Leu Lys Ile
1 5 10 15

Leu Val Lys Asn Gly Phe Asp Leu Lys Leu Leu Tyr Lys Val Val Gly
20 25 30

Asn Leu Ala Thr Glu Gln Pro Leu Ala Pro Lys Tyr Lys Asp His Pro
35 40 45

Leu Lys Gly Gly Leu Lys Asp Phe Arg Glu Cys His Leu Lys Pro Asp
50 55 60

Leu Leu Leu Val Tyr Gln Ile Lys Lys Gln Glu Asn Thr Leu Phe Leu
65 70 75 80

Val Arg Leu Gly Ser His Ser Glu Leu Phe
85 90

(2) INFORMATION FOR SEQ ID NO: 451:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 425 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 78...341
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451:
100 105 110

Glu Lys Leu Asp Gln Asn Pro Lys Asn Lys Ala Leu Lys Gly Glu Ile
115 120 125

Leu Ala Asn Leu Lys Phe Ile Glu Glu Leu Leu Gly Ile Gly Phe Lys
130 135 140

Asp Pro Ser Ala Tyr Phe Gln Leu Gly Val Ser Glu Ser Glu Lys Gln
145 150 155 160

Glu Ile Glu Asn Lys Ile Glu Glu Arg Lys Arg Ala Lys Glu Arg Lys
165 170 175

Asp Phe Leu Lys Ala Asp Ser Ile Arg Glu Glu Leu Leu Lys Gln Lys
180 185 190

Ile Ala Leu Met Asp Thr Pro Gln Gly Thr Ile Trp Glu Lys Phe Phe
195 200 205

(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 410 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 60...329
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:
TTTTAAAAGC ATAGAGGATT TTAAGAAGCA TTGTGAAAAC TTATAAATAA GATTTAAAA 59

ATG CTG ACG ATT GAA ACC AGT AAA AAA TTT GAT AAG GAT CTT AAA ATT 107
Met Leu Thr Ile Glu Thr Ser Lys Lys Phe Asp Lys Asp Leu Lys Ile
1 5 10 15

CTT GTT AAA AAC GGG TTT GAT TTA AAG CTT TTG TAT AAA GTG GTT GGA 155
Leu Val Lys Asn Gly Phe Asp Leu Lys Leu Leu Tyr Lys Val Val Gly
20 25 30

AAT TTA GCC ACA GAG CAA CCC CTA GCT CCC AAA TAC AAA GAC CAC CCA 203
Asn Leu Ala Thr Glu Gln Pro Leu Ala Pro Lys Tyr Lys Asp His Pro
35 40 45

CTC AAA GGC GGT TTA AAA GAT TTT AGG GAA TGC CAC TTA AAA CCG GAT 251
Leu Lys Gly Gly Leu Lys Asp Phe Arg Glu Cys His Leu Lys Pro Asp
50 55 60

TTA TTG CTT GTC TAT CAA ATT AAA AAA CAA GAA AAC ACC CTC TTT TTA 299
Leu Leu Leu Val Tyr Gln Ile Lys Lys Gln Glu Asn Thr Leu Phe Leu
Lys Ala Leu Ser Val Leu Glu Ser Met Leu Ser Ser Thr Asn Glu Lys
100 105 110

TTG GAT CAA AAC CCT AAA AAC AAG GCT TTA AAG GGC GAA ATT TTA GCG 440
Leu Asp Gln Asn Pro Lys Asn Lys Ala Leu Lys Gly Glu Ile Leu Ala
115 120 125 130

AAT TTG AAA TTC ATA GAA GAA CTG CTT GGC ATC GGG TTT AAA GAC CCT 488
Asn Leu Lys Phe Ile Glu Glu Leu Leu Gly Ile Gly Phe Lys Asp Pro
135 140 145

AGC GCC TAT TTC CAA TTA GGC GTG AGT GAA AGC GAA AAA CAA GAA ATT 536
Ser Ala Tyr Phe Gln Leu Gly Val Ser Glu Ser Glu Lys Gln Glu Ile
150 155 160

GAA AAC AAG ATA GAA GAA AGA AAA CGC GCC AAA GAG CGA AAA GAT TTT 584
Glu Asn Lys Ile Glu Glu Arg Lys Arg Ala Lys Glu Arg Lys Asp Phe
165 170 175

TTA AAA GCC GAT AGC ATC AGA GAA GAG CTT TTG AAA CAA AAA ATC GCT 632
Leu Lys Ala Asp Ser Ile Arg Glu Glu Leu Leu Lys Gln Lys Ile Ala
180 185 190

TTG ATG GAC ACC CCA CAA GGC ACG ATC TGG GAG AAG TTT TTT TAAACACCT 683
Leu Met Asp Thr Pro Gln Gly Thr Ile Trp Glu Lys Phe Phe
195 200 205

CCAATTTTAC CTTTTTACAC ATTCTAGCAA CAACTTTCAG CATT 727



(2) INFORMATION FOR SEQ ID NO: 448: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:

Met His Asn Gly Phe Val Asn Ile Asn Asn Glu Lys Met Ser Lys Ser
1 5 10 15

Leu Gly Asn Ser Phe Phe Val Lys Asp Ala Leu Lys Asn Tyr Asp Gly
20 25 30

Glu Ile Leu Arg Asn Tyr Leu Leu Gly Val His Tyr Arg Ser Val Leu
35 40 45

Asn Phe Asn Glu Glu Asp Leu Leu Val Ser Lys Lys Arg Leu Asp Lys
50 55 60

Ile Tyr Arg Leu Lys Gln Arg Val Leu Gly Thr Leu Gly Gly Ile Asn
65 70 75 80

Pro Asn Phe Lys Lys Glu Ile Leu Glu Cys Met Gln Asp Asp Leu Asn
85 90 95

Val Ser Lys Ala Leu Ser Val Leu Glu Ser Met Leu Ser Ser Thr Asn




WO 98/43478 PCT/US98/06371

465 470 475 480 

Lys Ala Leu Glu Lys His Gln Lys Met Met Lys Asp Met His Gly Lys

485 490 495 

Asp Met His His Thr Lys Lys Lys Lys 
500 505 

(2) INFORMATION FOR SEQ ID NO: 447:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 727 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...674
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:

GAAGCGTGCC AAACCCGTTG CCCTTTGGCG TGGAGCTTGC CAAATATTGG ATG CAT 56 

Met His 

1 

AAC GGC TTT GTG AAT ATC AAT AAC GAA AAA ATG TCT AAA AGT TTG GGG 104 
Asn Gly Phe Val Asn Ile Asn Asn Glu Lys Met Ser Lys Ser Leu Gly
5 10 15

AAT AGC TTT TTT GTT AAA GAC GCT CTC AAA AAC TAT GAT GGC GAA ATT 152
Asn Ser Phe Phe Val Lys Asp Ala Leu Lys Asn Tyr Asp Gly Glu Ile
20 25 30

TTG CGC AAT TAC TTA CTA GGG GTG CAT TAT CGC TCT GTT TTG AAT TTC 200
Leu Arg Asn Tyr Leu Leu Gly Val His Tyr Arg Ser Val Leu Asn Phe
35 40 45 50

AAT GAA GAA GAC TTG TTA GTG AGT AAA AAA CGC TTG GAT AAA ATC TAT 248
Asn Glu Glu Asp Leu Leu Val Ser Lys Lys Arg Leu Asp Lys Ile Tyr
55 60 65

CGT TTA AAA CAG CGC GTT TTA GGG ACT CTT GGA GGA ATA AAT CCA AAC 296
Arg Leu Lys Gln Arg Val Leu Gly Thr Leu Gly Gly Ile Asn Pro Asn
70 75 80

TTT AAA AAA GAA ATT TTA GAG TGC ATG CAA GAT GAT TTA AAC GTT TCT 344
Phe Lys Lys Glu Ile Leu Glu Cys Met Gln Asp Asp Leu Asn Val Ser
85 90 95

AAA GCG TTG AGC GTT TTA GAA AGC ATG CTT TCT TCC ACT AAT GAA AAA 392
Leu Gln Ser Thr Trp Phe Leu Glu Lys Leu Ala Ala Phe Asp Arg Glu
35 40 45

Arg Ile Pro Glu Arg Val Val His Ala Lys Gly Ser Gly Ala Tyr Gly
50 55 60

Thr Phe Thr Val Thr Lys Asp Ile Thr Lys Tyr Thr Lys Ala Lys Ile
65 70 75 80

Phe Ser Lys Val Gly Lys Lys Thr Glu Cys Phe Phe Arg Phe Ser Thr
85 90 95

Val Ala Gly Glu Arg Gly Ser Ala Asp Ala Val Arg Asp Pro Arg Gly
100 105 110

Phe Ala Met Lys Tyr Tyr Thr Glu Glu Gly Asn Trp Asp Leu Val Gly
115 120 125

Asn Asn Thr Pro Val Phe Phe Ile Arg Asp Ala Ile Lys Phe Pro Asp
130 135 140

Phe Ile His Thr Gln Lys Arg Asp Pro Gln Thr Asn Leu Pro Asn His
145 150 155 160

Asp Met Val Trp Asp Phe Trp Ser Asn Val Pro Glu Ser Leu Tyr Gln
165 170 175

Val Thr Trp Val Met Ser Asp Arg Gly Ile Pro Lys Ser Phe Arg His
180 185 190

Met Asp Gly Phe Gly Ser His Thr Phe Ser Leu Ile Asn Ala Lys Gly
195 200 205

Glu Arg Phe Trp Val Lys Phe His Phe His Thr Met Gln Gly Val Lys
210 215 220

His Leu Thr Asn Glu Glu Ala Ala Glu Val Arg Lys Tyr Asp Pro Asp
225 230 235 240

Ser Asn Gln Arg Asp Leu Phe Asn Ala Ile Ala Arg Gly Asp Phe Pro
245 250 255

Lys Trp Lys Leu Ser Ile Gln Val Met Pro Glu Glu Asp Ala Lys Lys
260 265 270

Tyr Arg Phe His Pro Phe Asp Val Thr Lys Ile Trp Tyr Leu Gln Asp
275 280 285

Tyr Pro Leu Met Glu Val Gly Ile Val Glu Leu Asn Lys Asn Pro Glu
290 295 300

Asn Tyr Phe Ala Glu Val Glu Gln Ala Ala Phe Ser Pro Ala Asn Val
305 310 315 320

Val Pro Gly Ile Gly Tyr Ser Pro Asp Arg Met Leu Gln Gly Arg Leu
325 330 335

Phe Ser Tyr Gly Asp Thr His Arg Tyr Arg Leu Gly Val Asn Tyr Pro
340 345 350

Gln Ile Pro Val Asn Lys Pro Arg Cys Pro Phe His Ser Ser Ser Arg
355 360 365

Asp Gly Tyr Met Gln Asn Gly Tyr Tyr Gly Ser Leu Gln Asn Tyr Thr
370 375 380

Pro Ser Ser Leu Pro Gly Tyr Lys Glu Asp Lys Ser Ala Arg Asp Pro
385 390 395 400

Lys Phe Asn Leu Ala His Ile Glu Lys Glu Phe Glu Val Trp Asn Trp
405 410 415

Asp Tyr Arg Ala Asp Asp Ser Asp Tyr Tyr Thr Gln Pro Gly Asp Tyr
420 425 430

Tyr Arg Ser Leu Pro Ala Asp Glu Lys Glu Arg Leu His Asp Thr Ile
435 440 445

Gly Glu Ser Leu Ala His Val Thr His Lys Glu Ile Val Asp Lys Gln
450 455 460

Leu Glu His Phe Lys Lys Ala Asp Pro Lys Tyr Ala Glu Gly Val Lys
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TCT TCT AGC AGA GAT GOT TAG ATG CAA AAC GGA TAG TAG GGC TCT TTA 1334 
Ser Ser Ser Arg Asp Gly Tyr Met Gin Asn Gly Tyr Tyr Gly Ser Leu 
365 370 375 380 

CAA AAC TAT ACG CCT AGC TCA TTG CCT GGC TAT AAA GAA GAT AAG AGC 1382
Gln Asn Tyr Thr Pro Ser Ser Leu Pro Gly Tyr Lys Glu Asp Lys Ser
385 390 395

GCG AGA GAT CCT AAG TTC AAC TTA GCT CAT ATT GAG AAA GAG TTT GAA 1430
Ala Arg Asp Pro Lys Phe Asn Leu Ala His Ile Glu Lys Glu Phe Glu
400 405 410

GTG TGG AAT TGG GAT TAG AGA GCT GAT GAT AGC GAT TAG TAG ACC CAA 1478
Val Trp Asn Trp Asp Tyr Arg Ala Asp Asp Ser Asp Tyr Tyr Thr Gln
415 420 425

CCA GGT GAT TAG TAG CGC TCA TTG CCA GCT GAT GAA AAA GAA AGG TTG 1526
Pro Gly Asp Tyr Tyr Arg Ser Leu Pro Ala Asp Glu Lys Glu Arg Leu
430 435 440

CAT GAG ACT ATT GGA GAG TCT TTA GCT CAT GTT ACC CAT AAG GAA ATT 1574
His Asp Thr Ile Gly Glu Ser Leu Ala His Val Thr His Lys Glu Ile
445 450 455 460

GTG GAT AAA CAA TTG GAG CAT TTC AAG AAA GCT GAC CCC AAA TAG GCT 1622
Val Asp Lys Gln Leu Glu His Phe Lys Lys Ala Asp Pro Lys Tyr Ala
465 470 475

GAG GGA GTT AAA AAA GCT CTT GAA AAA CAC CAA AAA ATG ATG AAA GAC 1670
Glu Gly Val Lys Lys Ala Leu Glu Lys His Gln Lys Met Met Lys Asp
480 485 490

ATG CAT GGA AAA GAC ATG CAC CAC ACA AAA AAG AAA AAG TAACCCTTTT CT 1721
Met His Gly Lys Asp Met His His Thr Lys Lys Lys Lys
495 500 505

TTAAGCGTTC TTATTTTTTA GGAACGCTTT GTCTTTCAAA ATTTAGGTTT TTGGATACT 1780

(2) INFORMATION FOR SEQ ID NO: 446:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 505 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:

Met Val Asn Lys Asp Val Lys Gln Thr Thr Ala Phe Gly Ala Pro Val
1 5 10 15

Trp Asp Asp Asn Asn Val Ile Thr Ala Gly Pro Arg Gly Pro Val Leu
20 25 30
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Lys Phe Pro Asp Phe lie His Thr Gin Lys Arg Asp Pro Gin Thr Asn 
145 150 155 

TTG CCT AAC CAT GAC ATG GTA TGG GAT TTT TGG AGT AAT GTT CCT GAA 710 
Leu Pro Asn His Asp Met Val Trp Asp Phe Trp Ser Asn Val Pro Glu 
160 165 170 

AGC TTA TAG CAA GTA ACA TGG GTT ATG AGC GAT AGG GGT ATT CCT AAA 758 
Ser Leu Tyr Gin Val Thr Trp Val Met Ser Asp Arg Gly lie Pro Lys 
175 180 185 

TCT TTC CGC CAC ATG GAT GGT TTT GGC AGC CAC ACT TTC AGT CTT ATC 806
Ser Phe Arg His Met Asp Gly Phe Gly Ser His Thr Phe Ser Leu Ile
190 195 200

AAC GCG AAA GGC GAA CGC TTT TGG GTG AAA TTC CAC TTT CAC ACC ATG 854 
Asn Ala Lys Gly Glu Arg Phe Trp Val Lys Phe His Phe His Thr Met 
205 210 215 220 

CAA GGC GTT AAG CAT TTG ACT AAC GAA GAA GCC GCA GAA GTT AGG AAG 902 
Gin Gly Val Lys His Leu Thr Asn Glu Glu Ala Ala Glu Val Arg Lys 
225 230 235 

TAT GAT CCG GAT TCC AAT CAA AGG GAT TTA TTC AAT GCG ATC GCT AGA 950
Tyr Asp Pro Asp Ser Asn Gln Arg Asp Leu Phe Asn Ala Ile Ala Arg
240 245 250

GGG GAT TTC CCA AAA TGG AAA TTA AGC ATT CAA GTG ATG CCA GAA GAA 998
Gly Asp Phe Pro Lys Trp Lys Leu Ser Ile Gln Val Met Pro Glu Glu
255 260 265

GAT GCT AAG AAG TAT CGA TTC CAT CCG TTT GAT GTA ACT AAA ATT TGG 1046
Asp Ala Lys Lys Tyr Arg Phe His Pro Phe Asp Val Thr Lys Ile Trp
270 275 280 

TAT CTC CAA GAT TAT CCA TTG ATG GAA GTG GGC ATT GTG GAG TTG AAT 1094 
Tyr Leu Gin Asp Tyr Pro Leu Met Glu Val Gly lie Val Glu Leu Asn 
285 290 295 300 

AAA AAT CCT GAA AAC TAT TTC GCA GAA GTG GAA CAA GCG GCA TTC AGT 1142 
Lys Asn Pro Glu Asn Tyr Phe Ala Glu Val Glu Gin Ala Ala Phe Ser 
305 310 315 

CCG GCT AAT GTC GTT CCT GGA ATT GGC TAT AGC CCT GAT AGG ATG TTA 1190 
Pro Ala Asn Val Val Pro Gly lie Gly Tyr Ser Pro Asp Arg Met Leu 
320 325 330 

CAA GGG CGC TTG TTC TCT TAT GGA GAC ACA CAC CGC TAC CGC TTA GGC 1238
Gln Gly Arg Leu Phe Ser Tyr Gly Asp Thr His Arg Tyr Arg Leu Gly
335 340 345

GTT AAT TAT CCT CAA ATA CCG GTT AAT AAA CCA AGA TGC CCA TTC CAC 1286
Val Asn Tyr Pro Gln Ile Pro Val Asn Lys Pro Arg Cys Pro Phe His
350 355 360 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 195... 1709 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

TTGTTTTTAA TCTTTCTTAT TTTCATTAAT TGTTACGAAT AGAAATACTT AAGGGGGTTT 60 

TTCATTCTTA AAAAAAGGAT TTTTTAAGGA AATTGAATCT TGTTAGTCTT TGTATAACAA 120

ATTATGTGAT AATCACCACA AGTAATCGGC TTAGTGTCAT ATTACGAAGA TTTAAGATCA 180

TAAAAGGAAA AAAG ATG GTT AAT AAA GAT GTG AAA CAA ACC ACT GCT TTT 230
Met Val Asn Lys Asp Val Lys Gln Thr Thr Ala Phe
1 5 10

GGC GCT CCC GTT TGG GAT GAC AAC AAT GTG ATT ACQ GCC GGC CCT AGA 278 
Gly Ala Pro Val Trp Asp Asp Asn Asn Val lie Thr Ala Gly Pro Arg 
15 20 25 

GGT CCT GTT TTA TTA CAA AGC ACT TGG TTT TTG GAA AAG TTA GCG GCG 326 
Gly Pro Val Leu Leu Gin Ser Thr Trp Phe Leu Glu Lys Leu Ala Ala 
30 35 40 

TTT GAC AGA GAA AGA ATC CCT GAA AGG GTG GTG CAT GCT AAA GGA AGC 374 
Phe Asp Arg Glu Arg lie Pro Glu Arg Val Val His Ala Lys Gly Ser 
45 50 55 60 

GGA GCT TAT GGC ACT TTC ACT GTG ACT AAA GAC ATC ACT AAA TAC ACT 422 
Gly Ala Tyr Gly Thr Phe Thr Val Thr Lys Asp lie Thr Lys Tyr Thr 
65 70 75 

AAA GCG AAA ATT TTC TCT AAA GTG GGC AAA AAA ACC GAA TGC TTC TTC 470
Lys Ala Lys Ile Phe Ser Lys Val Gly Lys Lys Thr Glu Cys Phe Phe
80 85 90 

AGA TTT TCT ACT GTG GCT GGT GAA AGA GGC AGT GCG GAT GCG GTG AGA 518 
Arg Phe Ser Thr Val Ala Gly Glu Arg Gly Ser Ala Asp Ala Val Arg 
95 100 105 

GAC CCT AGA GGT TTT GCG ATG AAG TAT TAC ACT GAA GAA GGT AAC TGG 566 
Asp Pro Arg Gly Phe Ala Met Lys Tyr Tyr Thr Glu Glu Gly Asn Trp 
110 115 120 

GAT TTA GTG GGG AAC AAC ACG CCT GTT TTC TTT ATC CGT GAT GCG ATC 614 
Asp Leu Val Gly Asn Asn Thr Pro Val Phe Phe Ile Arg Asp Ala Ile
125 130 135 140 

AAA TTC CCT GAT TTC ATC CAC ACT CAA AAA CGA GAT CCT CAA ACC AAT 662 
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205 



210 



215 



GGG ATA CTA GAA TGC AAA TGATGCACAA TTTGAGTTTT TTGGGCATGT TTTTAGCC 794 
Gly lie Leu Glu Cys Lys 
220 



(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 



Met Pro Ala Arg Gln Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys
1 5 10 15

Asp Ile Gly Asn Thr Arg Ile His Phe Ala Gln Asn Tyr Gln Leu Phe
20 25 30

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly Ile Gln Lys Glu Ile
35 40 45

Phe Tyr Ile Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys
50 55 60

Tyr Pro Asn Ala Lys Asn Ile Ala Gly Phe Phe His Leu Glu Thr Asp
65 70 75 80

Tyr Val Gly Leu Gly Ile Asp Arg Gln Met Ala Cys Leu Ala Val Asn
85 90 95

Asn Gly Val Val Val Asp Ala Gly Ser Ala Ile Thr Ile Asp Leu Ile
100 105 110

Lys Glu Gly Lys His Leu Gly Gly Cys Ile Leu Pro Gly Leu Ala Gln
115 120 125

Tyr Ile His Ala Tyr Lys Lys Ser Ala Lys Ile Leu Glu Gln Pro Phe
130 135 140

Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala
145 150 155 160

Val Asn Tyr Gly Met Val Leu Ser Val Ile Ala Cys Ile Gln His Leu
165 170 175

Ala Lys Asn Gln Lys Ile Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu
180 185 190

Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp
195 200 205

Gly Met Glu Ile Ala Leu Lys Lys Ala Gly Ile Leu Glu Cys Lys
210 215 220



GCTTTGAGCA TGTCTTTAGG GCATTGTGTG GGCATGT 831

(2) INFORMATION FOR SEQ ID NO: 445:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1780 base pairs 

(B) TYPE: nucleic acid 
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AACAAAGAGC GAGAGAGCAT CAAGAAAGAG ATGAAAAAGA GCTTGAAGAA AGAAGAAAAG 60 
CTTTAGAAAT GAATAAGAAG TAGGCCT ATG CCA GCT AGG CAA TCT TTT ACA GAT 114 

Met Pro Ala Arg Gin Ser Phe Thr Asp 
1 5 

TTG AAA AAC CTG GTT TTG TGC GAT ATA GGC AAC ACG CGT ATC CAT TTT 162 
Leu Lys Asn Leu Val Leu Cys Asp lie Gly Asn Thr Arg lie His Phe 
10 15 20 25 

GCA CAA AAC TAT CAG CTC TTT TCA AGC GCT AAA GAA GAT TTA AAG CGT 210 
Ala Gin Asn Tyr Gin Leu Phe Ser Ser Ala Lys Glu Asp Leu Lys Arg 
30 35 40 

TTG GGT ATT CAA AAG GAA ATT TTT TAC ATT AGC GTG AAT GAA GAA AAT 258
Leu Gly Ile Gln Lys Glu Ile Phe Tyr Ile Ser Val Asn Glu Glu Asn
45 50 55

GAA AAA GCC CTT TTG AAT TGT TAC CCT AAC GCT AAA AAT ATT GCA GGG 306
Glu Lys Ala Leu Leu Asn Cys Tyr Pro Asn Ala Lys Asn Ile Ala Gly
60 65 70

TTT TTT CAT TTA GAA ACC GAC TAT GTA GGG CTT GGG ATA GAC CGG CAA 354
Phe Phe His Leu Glu Thr Asp Tyr Val Gly Leu Gly Ile Asp Arg Gln
75 80 85

ATG GCG TGT CTG GCG GTA AAT AAT GGC GTG GTG GTG GAT GCC GGG AGT 402
Met Ala Cys Leu Ala Val Asn Asn Gly Val Val Val Asp Ala Gly Ser
90 95 100 105

GCG ATT ACG ATA GAT TTA ATC AAA GAG GGC AAG CAT TTA GGA GGG TGT 450
Ala Ile Thr Ile Asp Leu Ile Lys Glu Gly Lys His Leu Gly Gly Cys
110 115 120

ATT TTA CCC GGT TTA GCC CAA TAT ATT CAT GCG TAT AAA AAA AGC GCT 498
Ile Leu Pro Gly Leu Ala Gln Tyr Ile His Ala Tyr Lys Lys Ser Ala
125 130 135

AAA ATT TTA GAG CAA CCT TTC AAG GCC TTA GAT TCT TTA GAA GTT TTA 546
Lys Ile Leu Glu Gln Pro Phe Lys Ala Leu Asp Ser Leu Glu Val Leu
140 145 150

CCT AAA AGC ACT AGA GAC GCT GTG AAT TAC GGC ATG GTT TTG AGC GTC 594 
Pro Lys Ser Thr Arg Asp Ala Val Asn Tyr Gly Met Val Leu Ser Val 
155 160 165 

ATT GCT TGT ATC CAG CAT TTA GCC AAA AAT CAA AAA ATC TAT CTT TGT 642 
lie Ala Cys lie Gin His Leu Ala Lys Asn Gin Lys lie Tyr Leu Cys 
170 175 180 185 

GGG GGC GAT GCG AAG TAT TTG AGC GCG TTT TTA CCC CAT TCT GTT TGC 690 
Gly Gly Asp Ala Lys Tyr Leu Ser Ala Phe Leu Pro His Ser Val Cys 
190 195 200 

AAG GAG CGT TTG GTT TTT GAC GGG ATG GAA ATC GCT CTT AAA AAA GCA 738
Lys Glu Arg Leu Val Phe Asp Gly Met Glu Ile Ala Leu Lys Lys Ala
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442:



Met Pro Cys Ile Ile Lys Thr Thr Pro Leu Ser Met Ile Ile Lys Glu
1 5 10 15

Cys Ile Arg Met Asn Thr Asn Lys Ala Leu Phe Leu Asp Arg Asp Gly
20 25 30

Ile Ile Asn Ile Asp Lys Gly Tyr Val Ser Gln Lys Glu Asp Phe Glu
35 40 45

Phe Gln Lys Gly Ile Phe Glu Leu Leu Lys His Ala Lys Ser Leu Gly
50 55 60

Tyr Lys Leu Leu Leu Ile Thr Asn Gln Ser Gly Ile Asn Arg Gly Tyr
65 70 75 80

Tyr Thr Leu Lys Asp Phe Glu Gln Leu Thr Gln Tyr Leu Gln Glu Ser
85 90 95

Leu Phe Lys Glu Leu Gly Phe Asn Leu Asp Gly Ile Tyr Phe Cys Arg
100 105 110

His Ala Pro Glu Glu Asn Cys Ala Cys Arg Lys Pro Lys Pro Ser Leu
115 120 125

Ile Leu Gln Ala Ala Lys Glu His Gln Ile Cys Leu Glu Gln Ser Phe
130 135 140

Met Ile Gly Asp Lys Glu Ser Asp Met Leu Ala Gly Leu Asn Ala Lys
145 150 155 160

Val Lys Asn Asn Leu Leu Leu Ile Gln Asn Pro Leu Lys Thr Pro His
165 170 175

Ser Trp Ile Gln Cys Lys Asp Phe Lys Glu Met Ile Asp Leu Ile Lys
180 185 190

(2) INFORMATION FOR SEQ ID NO: 443:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 88... 756 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:443: 
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ACA CCA CTC AGC ATG ATC ATA AAG GAG TGC ATT CGC ATG AAC ACT AAC 221 
Thr Pro Leu Ser Met lie lie Lys Glu Cys lie Arg Met Asn Thr Asn 
10 15 20 

AAA GCC CTT TTT TTG GAC AGA GAC GGC ATT ATC AAT ATT GAT AAA GGC 269 
Lys Ala Leu Phe Leu Asp Arg Asp Gly lie lie Asn lie Asp Lys Gly 
25 30 35 

TAT GTG AGT CAA AAA GAA GAT TTT GAG TTT CAA AAA GGG ATT TTT GAA 317 
Tyr Val Ser Gin Lys Glu Asp Phe Glu Phe Gin Lys Gly lie Phe Glu 
40 45 50 55 

TTG CTA AAG CAT GCG AAA TCT TTA GGC TAC AAA CTG CTT TTA ATC ACC 365
Leu Leu Lys His Ala Lys Ser Leu Gly Tyr Lys Leu Leu Leu Ile Thr
60 65 70 

AAC CAA TCT GGG ATC AAC CGA GGC TAT TAC ACC CTT AAA GAT TTT GAA 413 
Asn Gin Ser Gly lie Asn Arg Gly Tyr Tyr Thr Leu Lys Asp Phe Glu 
75 80 85 

C7^ CTC ACC CAA TAC CTC CAA GAA AGC TTG TTC AAA GAA TTA GGT TTT 461
Gln Leu Thr Gln Tyr Leu Gln Glu Ser Leu Phe Lys Glu Leu Gly Phe
90 95 100

AAT CTG GAT GGC ATC TAT TTT TGC AGG CAC GCC CCA GAA GAA AAT TGC 509
Asn Leu Asp Gly Ile Tyr Phe Cys Arg His Ala Pro Glu Glu Asn Cys
105 110 115 

GCT TGC AGG AAG CCA AAG CCT TCT TTG ATT TTG CAA GCT GCT AAA GAG 557 
Ala Cys Arg Lys Pro Lys Pro Ser Leu lie Leu Gin Ala Ala Lys Glu 
120 125 130 135 

CAT CAA ATT TGC TTG GAG CAA TCT TTT ATG ATA GGC GAT AAA GAG AGC 605
His Gln Ile Cys Leu Glu Gln Ser Phe Met Ile Gly Asp Lys Glu Ser
140 145 150 

GAC ATG TTA GCC GGC TTG AAC GCT AAA GTT AAA AAT AAC CTT TTG CTC 653 
Asp Met Leu Ala Gly Leu Asn Ala Lys Val Lys Asn Asn Leu Leu Leu 
155 160 165 

ATT CAA AAC CCT TTA AAA ACT CCT CAT TCT TGG ATA CAA TGT AAA GAT 701 
Ile Gln Asn Pro Leu Lys Thr Pro His Ser Trp Ile Gln Cys Lys Asp
170 175 180 

TTT AAA GAG ATG ATA GAT CTA ATC AAA TAAGGACAAG AATGCGTTAT ATTGATG 755
Phe Lys Glu Met Ile Asp Leu Ile Lys
185 190 

ATGAATTAGA AAATCAAACG ATTTTAATCA CCGGTGGGGC TGGCTTTGTA GGCAGTAATC 815 
TAGCCTT 822 

(2) INFORMATION FOR SEQ ID NO:442: 
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Phe Lys Gly Gin Cys Ala 
85 

TATTATTTGT GGGGGAAGAG CGAACGGCTT T 444 
(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:440: 



Met Lys Ala Gln Lys Ser Gly Val Tyr Asn Val Gly Tyr Ser Gln Ala
1 5 10 15

Arg Ser Tyr Asn Glu Ile Val Ser Ile Leu Lys Glu His Leu Gly Asp
20 25 30

Phe Lys Val Ser Tyr Ile Lys Asn Pro Tyr Ala Phe Phe Gln Lys His
35 40 45

Thr Gln Ala His Ile Glu Pro Ala Ile Leu Asp Leu Asp Tyr Thr Pro
50 55 60

Leu Tyr Asp Leu Glu Ser Gly Ile Lys Asp Tyr Leu Pro His Ile His
65 70 75 80

Ala Ile Phe Lys Gly Gln Cys Ala
85



(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 153 ... 728 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

CAGCATGCCG TTGTTTTTAA TGGGGATTTT TTTAAGCAAA ATTTCCGTTT CTTACAGGAA 60 

ATTTTTCAAT CTTTTGTCTA AAATTTTAAT GGGGGTTTTT GGGCTTTATA TCCTTTATAT 12 0 

GGGGATCATG CTCATTAACC ACAAAATGCC TC ATG CCA TGC ATC ATC AAA ACA 173 

Met Pro Cys Ile Ile Lys Thr
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Ser Ala Lys Asn Glu 
270 

Tyr Asp Pro Asn His 
285 

Lys Lys Lys Glu Glu 
300 



(2) INFORMATION FOR SEQ ID NO: 439:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 112...375
(D) OTHER INFORMATION: 



Glu Lys lie Asp Tyr 
260 

Lys Phe Ser Gly Phe 
275 

Pro Gin Arg Tyr Lys 
290 

Leu Tyr Pro Asn Pro 
305 



Thr Val Ala Ser Glu Leu 
265 

Leu Lys Ser Val Val Asp 
280 

Asp Glu Phe Asp Glu Val 
295 

Ser Leu Ser Phe 
310 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 

AACTCGCCTT AGGCGCGATG GCGTTTAAGG AAGTCAAGCT TTTTGAATTT GGCGAGCAAT 60 
TAAGGGATTT TGTACTATAT TGAAGATGTG ATCCAAGCGA GCGTGAAAGC G ATG AAG 117 

Met Lys 
1 

GCT CAA AAA AGC GGG GTT TAT AAT GTG GGT TAT TCC CAA GCC AGA AGT 165
Ala Gln Lys Ser Gly Val Tyr Asn Val Gly Tyr Ser Gln Ala Arg Ser
5 10 15

TAT AAT GAA ATC GTT AGC ATT TTA AAA GAG CAT TTA GGG GAT TTT AAA 213
Tyr Asn Glu Ile Val Ser Ile Leu Lys Glu His Leu Gly Asp Phe Lys
20 25 30

GTG AGT TAT ATC AAA AAC CCT TAT GCT TTC TTC CAA AAG CAC ACC CAA 261
Val Ser Tyr Ile Lys Asn Pro Tyr Ala Phe Phe Gln Lys His Thr Gln
35 40 45 50

GCA CAC ATT GAG CCT GCT ATT TTG GAT TTG GAT TAC ACC CCT TTA TAC 309
Ala His Ile Glu Pro Ala Ile Leu Asp Leu Asp Tyr Thr Pro Leu Tyr
55 60 65

GAT TTG GAA AGC GGC ATT AAA GAT TAT TTG CCC CAT ATC CAT GCG ATT 357
Asp Leu Glu Ser Gly Ile Lys Asp Tyr Leu Pro His Ile His Ala Ile
70 75 80

TTT AAA GGA CAG TGC GCA TGAAAAAAAT CTTAGTCATA GGCGATCTGA TCGCTGAT 413
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CCA AAC CAC CCA CAA CGC TAT AAA GAT GAA TTT GAT GAG GTT AAA AAG 97 3 

Pro Asn His Pro Gin Arg Tyr Lys Asp Glu Phe Asp Glu Val Lys Lys 
290 295 300 



AAA GAG GAG TTA TAC CCT AAC CCA TCG CTT TCT TTT TAAAAATGAG ATTTTA 1025 
Lys Glu Glu Leu Tyr Pro Asn Pro Ser Leu Ser Phe 
305 310 

AAAAACGCTT TAAGTGTTTT TGTAAAAAAT AGCAAAGAGC TTGATTTTAA TCA 1078



(2) INFORMATION FOR SEQ ID NO: 438: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 



Met Lys Pro Gln Asp Ile Glu Ile Val Gln Ser Val Leu Glu Ile Thr
1 5 10 15

Gly Pro Ile Lys Pro Thr Glu Val Tyr Asp Lys Ala Lys Glu Leu Phe
20 25 30

Glu Lys Gly Glu Ile Thr Asn Met Phe Asp Cys Gly Gly Lys Thr Pro
35 40 45

His Gln Ser Val Ser Ser Tyr Ile Tyr Thr Ala Leu Asn Lys Gly Glu
50 55 60

Glu Leu Pro Phe Lys Lys Val Gln Glu Asn Pro Thr Leu Ile Ala Leu
65 70 75 80

Lys Asp Ala Ala Lys Glu Leu Gly Leu Asp Ala Gln Lys Ile Ser Ala
85 90 95

Pro Ser Ser Lys Ile Ala His Glu Arg Asp Leu His Pro Phe Leu Thr
100 105 110

Tyr Met Ala Ile Asn Asn Glu Asn Leu Lys Cys Tyr Thr Lys Thr Ile
115 120 125

Phe His Glu Glu Ser Ser Lys Ser Ile Lys Gly Met Asp Arg Trp Leu
130 135 140

Tyr Pro Asp Met Val Gly Val Arg Phe Leu His Ala Glu Leu Ser Asn
145 150 155 160

Glu Asn Leu Ile Ala Phe Ser Lys Lys Phe Asp Thr Leu Pro Ile Lys
165 170 175

Leu Val Ser Phe Glu Leu Lys Lys Glu Ile Ser Val His Asn Cys Arg
180 185 190

Glu Cys Tyr Phe Gln Ala Ile Ser Asn Ser Ser Trp Ala Asn Glu Gly
195 200 205

Tyr Leu Val Gly Arg His Ile Asp Thr His Asn Pro Gln Leu Met Asp
210 215 220

Leu Leu Lys Arg Leu His Ala Ser Phe Gly Ile Gly Val Ile Asp Leu
225 230 235 240

Arg Thr Asn Glu Asp Lys Ser Ala Ile Leu Leu Asn Ala Lys Tyr Lys
245 250 255
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Lys Gly Glu Glu Leu Pro Phe Lys Lys Val Gin Glu Asn Pro Thr Leu 
65 70 75 

ATC GCT TTA AAA GAG GCG GCT AAA GAG CTA GGT TTA GAC GCT CAA AAA 349
Ile Ala Leu Lys Asp Ala Ala Lys Glu Leu Gly Leu Asp Ala Gln Lys
80 85 90

ATA AGC GCT CCA AGC TCT AAA ATC GCG CAT GAA AGG GAT TTG CAC CCC 397
Ile Ser Ala Pro Ser Ser Lys Ile Ala His Glu Arg Asp Leu His Pro
95 100 105

TTT TTA ACC TAC ATG GCT ATT AAT AAC GAA AAT TTG AAA TGC TAG ACG 445
Phe Leu Thr Tyr Met Ala Ile Asn Asn Glu Asn Leu Lys Cys Tyr Thr
110 115 120 125

AAA ACC ATT TTT CAT GAA GAG AGT TCA AAA TCA ATA AAA GGC ATG GAC 493
Lys Thr Ile Phe His Glu Glu Ser Ser Lys Ser Ile Lys Gly Met Asp
130 135 140

AGG TGG CTT TAT CCG GAC ATG GTG GGG GTT AGG TTT TTG CAC GCT GAA 541 
Arg Trp Leu Tyr Pro Asp Met Val Gly Val Arg Phe Leu His Ala Glu 
145 150 155 

TTA TCT AAT GAA AAT TTA ATC GCT TTT TCT AAG AAA TTT GAC ACT TTA 589
Leu Ser Asn Glu Asn Leu Ile Ala Phe Ser Lys Lys Phe Asp Thr Leu
160 165 170 

CCC ATT AAA CTG GTG AGC TTT GAA TTG AAA AAA GAA ATC AGC GTG CAT 637 
Pro lie Lys Leu Val Ser Phe Glu Leu Lys Lys Glu lie Ser Val His 
175 180 185 

AAT TGC AGG GAG TGT TAC TTT CAA GCG ATT TCC AAC AGC TCG TGG GCT 685 
Asn Cys Arg Glu Cys Tyr Phe Gin Ala lie Ser Asn Ser Ser Trp Ala 
190 195 200 205 

AAT GAA GGG TAT TTA GTG GGC CGT CAT ATT GAT ACG CAC AAT CCT CAA 733
Asn Glu Gly Tyr Leu Val Gly Arg His Ile Asp Thr His Asn Pro Gln
210 215 220 

CTC ATG GAT TTG TTG AAG CGT TTG CAT GCG AGT TTT GGG ATT GGC GTG 781 
Leu Met Asp Leu Leu Lys Arg Leu His Ala Ser Phe Gly lie Gly Val 
225 230 235 

ATT GAT TTA AGA ACT AAT GAG GAT AAA AGC GCT ATT TTA TTG AAC GCT 829
Ile Asp Leu Arg Thr Asn Glu Asp Lys Ser Ala Ile Leu Leu Asn Ala
240 245 250 

AAA TAC AAA GAA AAG ATT GAT TAC ACC GTG GCT TCA GAG CTT AGC GCG 877 
Lys Tyr Lys Glu Lys lie Asp Tyr Thr Val Ala Ser Glu Leu Ser Ala 
255 260 265 

AAA AAT GAA AAA TTC AGC GGT TTT TTA AAG AGC GTT GTG GAT TAT GAC 925 
Lys Asn Glu Lys Phe Ser Gly Phe Leu Lys Ser Val Val Asp Tyr Asp 
270 275 280 285 
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Lys lie Ala His Ala 
115 

Ala Cys Ala Ser Pro 
130 

Leu Asn Ser Pro lie 
145 

Asn His lie lie Leu 
165 

Lys Gly Asp Gly Ala 
180 

Ala Gin Thr Leu Leu 
195 

Val Met Gly Gly Ala 
210 

Ser Asn Thr Phe 
225 



Leu Ala Asp Asn lie Val 
120 

Lys lie Leu Ala Pro Ser 
135 

Thr Gin Ser Asn Leu Lys 
150 155 
Asp Thr Lys Asn Ala Leu 
170 

Met Ala Glu Pro Leu Glu 
185 

Lys Asp Ala Tyr Phe Glu 
200 

Ser lie Glu Lys lie Asp 
215 



Ser Ala Thr Phe Leu 
125 

Met Asn Thr Asn Met 
140 

Arg Leu Lys Asp Ser 
160 

Leu Ala Cys Asp Thr 
175 

lie Leu Phe Lys Ala 
190 

Asn Arg Glu Val lie 
205 

Ser Val Arg Thr lie 
220 



(2) INFORMATION FOR SEQ ID NO: 437:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 71... 1009 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:437: 

TCCCAAGCCC TAAAGGCGCT TCAATCATTG ATTTTAAAGG CAGTTATGT^ GAGTATTTGG 60

CGAGCAAAAA ATG AAA CCG CAA GAC ATT GAA ATC GTT CAA AGC GTT TTA 109
Met Lys Pro Gln Asp Ile Glu Ile Val Gln Ser Val Leu
1 5 10

GAG ATT ACA GGA CCG ATT AAG CCT ACT GAA GTG TAT GAT AAA GCC AAA 157 
Glu lie Thr Gly Pro lie Lys Pro Thr Glu Val Tyr Asp Lys Ala Lys 
15 20 25 

GAG CTT TTT GAA AAA GGT GAG ATT ACA AAC ATG TTT GAT TGT GGG GGC 205
Glu Leu Phe Glu Lys Gly Glu Ile Thr Asn Met Phe Asp Cys Gly Gly
30 35 40 45

AAA ACC CCG CAC CAG AGC GTT AGT TCT TAT ATT TAT ACA GCC TTA AAC 253
Lys Thr Pro His Gln Ser Val Ser Ser Tyr Ile Tyr Thr Ala Leu Asn
50 55 60 

AAG GGC GAA GAA CTG CCT TTT AAA AAA GTG CAA GAA AAC CCA ACC TTA 301 
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135 140 145 

TCC CCT ATC ACT CAA AGT AAT TTA AAA CGC TTG AAA GAT TCC AAC CAC 536 
Ser Pro lie Thr Gin Ser Asn Leu Lys Arg Leu Lys Asp Ser Asn His 
150 155 160 

ATT ATT TTA GAG ACC AAA AAC GCC CTT TTA GCA TGC GAC ACT AAA GGC 584 
lie lie Leu Asp Thr Lys Asn Ala Leu Leu Ala Cys Asp Thr Lys Gly 
165 170 175 

GAT GGG GCG ATG GCT GAG CCT TTA GAA ATC CTT TTT AAA GCC GCT CAA 632 
Asp Gly Ala Met Ala Glu Pro Leu Glu lie Leu Phe Lys Ala Ala Gin 
180 185 190 

ACG CTC CTA AAA GAC GCT TAT TTT GAA AAC AGA GAA GTC ATA GTC ATG 680
Thr Leu Leu Lys Asp Ala Tyr Phe Glu Asn Arg Glu Val Ile Val Met
195 200 205 210 

GGC GGC GCG AGT ATA GAA AAG ATT GAC AGC GTT CGA ACG ATT AGC AAT 728 
Gly Gly Ala Ser lie Glu Lys lie Asp Ser Val Arg Thr lie Ser Asn 
215 220 225 

ACT TTC TAGCGGGATT CAAGCGAGCG CTTTAGCTTT GGCGTTATAT TTTAAGGGAG CC 786
Thr Phe 



A 787 
(2) INFORMATION FOR SEQ ID NO: 436:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 228 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:



Met Asn Phe Leu Glu Asp Leu Phe Tyr Pro Leu Arg Leu Leu Glu Asn
1 5 10 15

Lys Arg Val Leu Leu Leu Val Ser Gly Ser Ile Ala Ala Tyr Lys Ser
20 25 30

Leu Glu Leu Val Arg Leu Leu Phe Lys Ser Gly Ala Ser Ile Gln Val
35 40 45

Val Met Ser Lys Gly Ala Lys Lys Phe Ile Lys Pro Leu Ser Phe Glu
50 55 60

Ala Leu Ser His His Lys Val Leu His Asp Arg Asn Glu Lys Trp Tyr
65 70 75 80

Tyr Asn His Gln Asn Ala Leu His His Asn His Ile Ala Cys Ala Ala
85 90 95

Asn Ala Asp Leu Leu Ile Phe Ala Pro Leu Ser Thr Asn Ser Leu Ser
100 105 110
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(A) LENGTH: 7 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...734
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

AGCCATAATA ACAACCAATG GCTAGAGCCT GATGATTTGT TGAAATTATT ATG AAT 56 

Met Asn 
1 

TTT TTA GAA GAT TTG TTT TAC CCC TTA AGA TTG TTA GAA AAC AAG CGT 104 
Phe Leu Glu Asp Leu Phe Tyr Pro Leu Arg Leu Leu Glu Asn Lys Arg 
5 10 15 

GTT TTA TTG CTC GTG AGC GGT TCT ATT GCA GCG TAT AAA TCC CTA GAA 152 
Val Leu Leu Leu Val Ser Gly Ser lie Ala Ala Tyr Lys Ser Leu Glu 
20 25 30 

TTA GTG CGC TTG TTG TTT AAA AGC GGG GCT AGT ATC CAA GTG GTG ATG 200
Leu Val Arg Leu Leu Phe Lys Ser Gly Ala Ser Ile Gln Val Val Met
35 40 45 50

AGT AAG GGT GCG AAA AAA TTC ATC AAA CCC TTA AGT TTT GAA GCT TTG 248
Ser Lys Gly Ala Lys Lys Phe Ile Lys Pro Leu Ser Phe Glu Ala Leu
55 60 65

AGC CAC CAT AAA GTC TTG CAT GAT CGT AAT GAA AAA TGG TAT TAC AAC 296
Ser His His Lys Val Leu His Asp Arg Asn Glu Lys Trp Tyr Tyr Asn
70 75 80

CAC CAA AAC GCC TTA CAC CAT AAC CAC ATC GCA TGC GCT GCT AAT GCT 344 
His Gin Asn Ala Leu His His Asn His lie Ala Cys Ala Ala Asn Ala 
85 90 95 

GAT TTG CTC ATC TTT GCC CCT TTA AGC ACT AAC AGC TTG TCT AAA ATC 392 
Asp Leu Leu lie Phe Ala Pro Leu Ser Thr Asn Ser Leu Ser Lys lie 
100 105 110 

GCT CAC GCT TTA GCG GAT AAT ATC GTA AGC GCG ACT TTT TTA GCT TGC 440
Ala His Ala Leu Ala Asp Asn Ile Val Ser Ala Thr Phe Leu Ala Cys
115 120 125 130

GCT TCC CCT AAA ATC CTA GCC CCT AGC ATG AAC ACT AAC ATG CTC AAT 488
Ala Ser Pro Lys Ile Leu Ala Pro Ser Met Asn Thr Asn Met Leu Asn
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205 210 215 



GGG GCT GTT TTA ACC GAT GAA GGG CAT TAAGCAATAA CATTCTTGTT TGGCTTT 790
Gly Ala Val Leu Thr Asp Glu Gly His 
220 225 

AATATTGTTT TTTAAAACTT TGTTTTATGG TAAAGCTTTT A 831 

(2) INFORMATION FOR SEQ ID NO: 434:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:434: 



Met Glu His Arg Val Phe Thr Ile Ala Asn Phe Phe Ser Ser Asn His
1 5 10 15

Asp Phe Ile Thr Gly Phe Phe Val Val Leu Thr Ala Val Leu Met Phe
20 25 30

Leu Ile Ser Leu Gly Ala Ser Arg Lys Met Gln Met Val Pro Met Gly
35 40 45

Leu Gln Asn Val Tyr Glu Ser Ile Ile Ser Ala Ile Leu Ser Val Ala
50 55 60

Lys Asp Ile Ile Gly Glu Glu Leu Ala Arg Lys Tyr Phe Pro Leu Ala
65 70 75 80

Gly Thr Ile Ala Leu Tyr Val Phe Phe Ser Asn Met Ile Gly Ile Ile
85 90 95

Pro Gly Phe Glu Ser Pro Thr Ala Ser Trp Ser Phe Thr Leu Val Leu
100 105 110

Ala Leu Ile Val Phe Phe Tyr Tyr His Phe Glu Gly Ile Arg Val Gln
115 120 125

Gly Phe Phe Lys Tyr Phe Ala His Phe Ala Gly Pro Val Lys Trp Leu
130 135 140

Ala Pro Phe Met Phe Pro Ile Glu Ile Ile Ser His Phe Ser Arg Ile
145 150 155 160

Val Ser Leu Ser Phe Arg Leu Phe Gly Asn Ile Lys Gly Asp Asp Met
165 170 175

Phe Leu Leu Ile Met Leu Leu Leu Val Pro Trp Ala Val Pro Val Ala
180 185 190

Pro Phe Met Val Leu Phe Phe Met Gly Ile Leu Gln Ala Phe Val Phe
195 200 205

Met Ile Leu Thr Tyr Val Tyr Leu Ala Gly Ala Val Leu Thr Asp Glu
210 215 220

Gly His
225



(2) INFORMATION FOR SEQ ID NO: 435:



(i) SEQUENCE CHARACTERISTICS: 
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CGTTAAAAAG CTTGGTTTAA TGAGTTTTTA AAACTTAATT GCTACAATTA AGGAAATTTT 60 
CAATAAGGCT TAGAGAAACT TTTTC ATG GAA CAC AGA GTA TTT ACT ATT GCT 112 

Met Glu His Arg Val Phe Thr He Ala 
1 5 

AAT TTT TTT AGC TCC AAT CAT GAT TTT ATC ACC GGG TTT TTT GTC GTT 160 
Asn Phe Phe Ser Ser Asn His Asp Phe He Thr Gly Phe Phe Val Val 
10 15 20 25 

TTG ACA GCG GTT TTG ATG TTT TTA ATC TCG CTT GGC GCG TCG CGC AAA 208 
Leu Thr Ala Val Leu Met Phe Leu He Ser Leu Gly Ala Ser Arg Lys 
30 35 40 

ATG CAG ATG GTA CCT ATG GGT TTG CAG AAT GTG TAT GAG AGC ATC ATT 256 
Met Gin Met Val Pro Met Gly Leu Gin Asn Val Tyr Glu Ser He He 
45 50 55 

AGC GCG ATT TTG AGC GTG GCT AAG GAT ATT ATA GGC GAA GAA TTA GCC 304
Ser Ala Ile Leu Ser Val Ala Lys Asp Ile Ile Gly Glu Glu Leu Ala
60 65 70

CGC AAA TAG TTC CCC CTA GCT GGC ACG ATC GCT TTG TAT GTC TTT TTT 352
Arg Lys Tyr Phe Pro Leu Ala Gly Thr Ile Ala Leu Tyr Val Phe Phe
75 80 85

TCT AAC ATG ATA GGC ATC ATT CCT GGC TTT GAA TCC CCT ACG GCT AGC 400
Ser Asn Met Ile Gly Ile Ile Pro Gly Phe Glu Ser Pro Thr Ala Ser
90 95 100 105

TGG AGC TTT ACG CTG GTT TTA GCG CTG ATT GTG TTT TTT TAT TAC CAT 448
Trp Ser Phe Thr Leu Val Leu Ala Leu Ile Val Phe Phe Tyr Tyr His
110 115 120

TTT GAA GGC ATT AGA GTG CAG GGC TTT TTT AAG TAT TTC GCT CAT TTT 496
Phe Glu Gly Ile Arg Val Gln Gly Phe Phe Lys Tyr Phe Ala His Phe
125 130 135

GCA GGT CCT GTG AAA TGG CTC GCC CCT TTC ATG TTC CCT ATT GAG ATC 544 
Ala Gly Pro Val Lys Trp Leu Ala Pro Phe Met Phe Pro He Glu He 
140 145 150 

ATC TCG CAT TTT TCT AGG ATC GTG TCT TTA TCG TTT CGT TTG TTT GGG 592 
He Ser His Phe Ser Arg He Val Ser Leu Ser Phe Arg Leu Phe Gly 
155 160 165 

AAT ATC AAG GGC GAT GAC ATG TTC TTG CTC ATC ATG CTT TTA TTA GTG 64 0 

Asn He Lys Gly Asp Asp Met Phe Leu Leu He Met Leu Leu Leu Val 
170 175 180 185 

CCT TGG GCG GTT CCT GTA GCG CCT TTT ATG GTG TTG TTT TTC ATG GGG 688 
Pro Trp Ala Val Pro Val Ala Pro Phe Met Val Leu Phe Phe Met Gly 
190 195 200 

ATT TTA CAA GCT TTT GTT TTT ATG ATC CTC ACT TAT GTG TAT TTG GCA 736 
He Leu Gin Ala Phe Val Phe Met He Leu Thr Tyr Val Tyr Leu Ala 
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CTT ATT GTG AAA AAT GGT AAA TTT GAG CTT TTA GAA AAT ATC ACT TTT T 393 
Leu Ile Val Lys Asn Gly Lys Phe Glu Leu Leu Glu Asn Ile Thr Phe 
100 105 110 

AGATTTTTAC AGAAAGTAAA TGCGATTTCA TTAACATTCT TAAGCTAATA TA 445 



(2) INFORMATION FOR SEQ ID NO: 432: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 

Met Arg Phe Leu Asn Asn Lys His Arg Glu Lys Gly Leu Lys Ala Glu 
1 5 10 15 

Glu Glu Ala Cys Gly Phe Leu Lys Ser Leu Gly Phe Glu Met Val Glu 
20 25 30 

Arg Asn Phe Phe Ser Gln Phe Gly Glu Ile Asp Ile Ile Ala Leu Lys 
35 40 45 

Lys Gly Val Leu His Phe Ile Glu Val Lys Ser Gly Glu Asn Phe Asp 
50 55 60 

Pro Ile Tyr Ala Ile Thr Pro Ser Lys Leu Lys Lys Met Ile Lys Thr 
65 70 75 80 

Ile Arg Cys Tyr Leu Ser Gln Lys Asp Pro Asn Ser Asp Phe Cys Ile 
85 90 95 

Asp Ala Leu Ile Val Lys Asn Gly Lys Phe Glu Leu Leu Glu Asn Ile 
100 105 110 

Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 433: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic RNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 86... 763 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:433: 
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Leu Val Ala lie Thr lie Gin Ala Cys 

20 25 
Asn Glu Lys Pro Ala Lys Lys Thr Ser 

35 40 
Gin Thr Pro Thr Asn Ser Thr Thr Pro 
50 55 



PCT/US98/06371 

Gly Tyr Lys Ala Pro Pro Phe 
30 

Asn Ser Ser Asn Ser Ser Met 
45 

Glu Phe Leu Asn Gin Pro 
60 



(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 392 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 

ATTTACAAAG CGTGTTGGAT ACCCCCAAGA TGATTCGTTT GGAAAATTGA ATG CGC 56 

Met Arg 
1 

TTT TTG AAC AAC AAA CAT AGA GAA AAG GGC TTA AAG GCT GAA GAA GAA 104 
Phe Leu Asn Asn Lys His Arg Glu Lys Gly Leu Lys Ala Glu Glu Glu 
5 10 15 

GCT TGC GGA TTT TTA AAA TCG TTA GGT TTT GAA ATG GTG GAG AGG AAC 152 
Ala Cys Gly Phe Leu Lys Ser Leu Gly Phe Glu Met Val Glu Arg Asn 
20 25 30 

TTT TTT TCA CAA TTT GGC GAA ATT GAT ATT ATC GCT TTG AAA AAA GGG 200 
Phe Phe Ser Gln Phe Gly Glu Ile Asp Ile Ile Ala Leu Lys Lys Gly 
35 40 45 50 

GTT TTG CAT TTC ATT GAA GTC AAA AGC GGG GAA AAT TTT GAT CCC ATT 248 
Val Leu His Phe Ile Glu Val Lys Ser Gly Glu Asn Phe Asp Pro Ile 
55 60 65 

TAT GCG ATC ACG CCG AGC AAA TTA AAA AAG ATG ATT AAA ACG ATC CGC 296 
Tyr Ala Ile Thr Pro Ser Lys Leu Lys Lys Met Ile Lys Thr Ile Arg 
70 75 80 

TGT TAT TTG TCC CAA AAA GAT CCC AAT AGC GAT TTT TGC ATA GAC GCT 344 
Cys Tyr Leu Ser Gln Lys Asp Pro Asn Ser Asp Phe Cys Ile Asp Ala 
85 90 95 
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(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 82... 270 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 

ACTTAAAGGC ATAAAAACCT TAAGCTTTTT GAGTTTCAAA AGGGTTTCAA GCTTTTTATA 60 
AGACTTTTTT TGAATGAGTA A GGA GAA AAT ATT TTG TTC CAT AAA CTG ATC 111 

Gly Glu Asn Ile Leu Phe His Lys Leu Ile 
1 5 10 

TTA ACA TGC TTT TTA GCG CTT GTA GCA ATA ACC ATT CAA GCT TGC GGT 159 
Leu Thr Cys Phe Leu Ala Leu Val Ala Ile Thr Ile Gln Ala Cys Gly 
15 20 25 

TAT AAA GCC CCT CCA TTC AAT GAA AAA CCC GCT AAA AAA ACT TCA AAC 207 
Tyr Lys Ala Pro Pro Phe Asn Glu Lys Pro Ala Lys Lys Thr Ser Asn 
30 35 40 

AGC TCT AAT TCT TCT ATG CAA ACG CCC ACC AAC AGC ACC ACG CCA GAA 255 
Ser Ser Asn Ser Ser Met Gln Thr Pro Thr Asn Ser Thr Thr Pro Glu 
45 50 55 

TTT TTA AAT CAG CCT TAAAATCACT GCTCTTGTTT AAGGGCTTTG ATTTCTAGGG T 311 
Phe Leu Asn Gln Pro 
60 

TTTTGTGGCT AACTTTTGAN STTCGCTTTC ATCATGCGTT ACCATAATG 360 

(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:430: 

Gly Glu Asn Ile Leu Phe His Lys Leu Ile Leu Thr Cys Phe Leu Ala 
1 5 10 15 
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5 10 15 

ATC TAT GCC AAT CAT TTA AAA TTA TTG GAT AAT GAA ATC AGT GAA AAA 152 
Ile Tyr Ala Asn His Leu Lys Leu Leu Asp Asn Glu Ile Ser Glu Lys 
20 25 30 

GAC ATT TTT AAT AAA GCC ATC AAT CAA AAA CGA ATT CAA ATG GCT CTT 200 
Asp Ile Phe Asn Lys Ala Ile Asn Gln Lys Arg Ile Gln Met Ala Leu 
35 40 45 50 

AAT CTC ATC TTT AAG CTT GTT TTT GCC TTT GTT AGT AAC CAC TTC TTC 248 
Asn Leu Ile Phe Lys Leu Val Phe Ala Phe Val Ser Asn His Phe Phe 
55 60 65 

CAC GCT TTT AGA CGA CAG AAT CTC TAT AAT CGT GTC TTT AAT CGC TGT 296 
His Ala Phe Arg Arg Gln Asn Leu Tyr Asn Arg Val Phe Asn Arg Cys 
70 75 80 

GTC TTT AAC CTT GAC TTC ATT CAA AAG CTT TTC ATT ACT CAA TTC TAACG 346 
Val Phe Asn Leu Asp Phe Ile Gln Lys Leu Phe Ile Thr Gln Phe 
85 90 95 

AAATAGAAGC CTTAAGGTAG CGTCTGCCAT TTTGAGAGAC CAGATTCA 394 

(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:428: 

Lys Leu His Phe Lys Ala Phe His Ala Leu Phe Tyr Pro Ser Asn Arg 

1 5 10 15 

Asp Asn Ile Tyr Ala Asn His Leu Lys Leu Leu Asp Asn Glu Ile Ser 

20 25 30 

Glu Lys Asp Ile Phe Asn Lys Ala Ile Asn Gln Lys Arg Ile Gln Met 

35 40 45 

Ala Leu Asn Leu Ile Phe Lys Leu Val Phe Ala Phe Val Ser Asn His 

50 55 60 

Phe Phe His Ala Phe Arg Arg Gln Asn Leu Tyr Asn Arg Val Phe Asn 
65 70 75 80 

Arg Cys Val Phe Asn Leu Asp Phe Ile Gln Lys Leu Phe Ile Thr Gln 
85 90 95 

Phe 



(2) INFORMATION FOR SEQ ID NO: 429: 
(i) SEQUENCE CHARACTERISTICS: 
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Leu lie Ala Lys Asp Gly Gly Val Leu Ala Arg Thr Gly His Thr Glu 

130 135 140 

Ala Ser Val Asp Leu Cys Lys Leu Ala Gly Leu Lys Pro Val Ser Val 
145 150 155 160 

lie Cys Glu lie Met Lys Glu Asp Gly Ser Met Ala Arg Arg Gly Asp 

165 170 175 

Lys Phe Leu Ser Asp Phe Ala Leu Lys His Asn Leu Lys Thr Leu Tyr 

180 185 190 

Val Ser Asp Leu lie Ser Tyr Arg Leu Glu Asn Glu Ser Leu Leu Lys 

195 200 205 

Met Phe Cys Gin Glu Glu Arg Glu Phe Leu Lys His Gin Thr Gin Cys 

210 215 220 

Tyr Thr Phe Leu Asp His Gin Gin Lys Asn His Tyr Ala Phe Lys Phe 
225 230 235 240 

Lys Gly Ala Lys Thr His Asp Leu Ala Pro Leu Val Arg Phe His Pro 

245 250 255 

lie Lys Glu Asp Phe Asp Phe Leu Thr Thr Asp Ala Phe Glu Val Phe 

260 265 270 

Phe Lys Ala Leu Glu Tyr Leu Lys His Glu Gly Gly Tyr Leu lie Phe 

275 280 285 

Met Asn Thr His Ser Lys Glu Asn Asn Val Val Lys Asp Phe Gly lie 

290 295 300 

Gly Ala Leu Val Leu Lys Asn Leu Gly lie Lys Asp Phe Arg Leu Leu 
305 310 315 320 

Ser Ser Cys Glu Asp Arg Gin Tyr Lys Ala Leu Ser Gly Phe Gly Leu 

325 330 335 

Lys Leu Val Glu Thr lie Ser Leu 
340 

(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 341 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:427: 
GATTTATCCT TGCATAAAAC AATCTCGCTT GGCGATAAAA AACGCTCTAA AAA CTT 56 
Lys Leu 
1 

CAT TTT AAA GCG TTT CAC GCA CTT TTC TAT CCT AGC AAT AGA GAC AAT 104 
His Phe Lys Ala Phe His Ala Leu Phe Tyr Pro Ser Asn Arg Asp Asn 
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CGT TTC CAC CCT ATC AAA GAG GAT TTT GAT TTT TTA ACG ACT GAT GCG 878 
Arg Phe His Pro Ile Lys Glu Asp Phe Asp Phe Leu Thr Thr Asp Ala 
255 260 265 

TTT GAA GTG TTT TTT AAA GCG TTA GAA TAT TTA AAG CAC GAA GGG GGC 926 
Phe Glu Val Phe Phe Lys Ala Leu Glu Tyr Leu Lys His Glu Gly Gly 
270 275 280 

TAT TTG ATC TTT ATG AAC ACC CAT TCT AAA GAA AAC AAT GTC GTT AAA 974 
Tyr Leu Ile Phe Met Asn Thr His Ser Lys Glu Asn Asn Val Val Lys 
285 290 295 300 

GAT TTT GGG ATC GGG GCG TTG GTG TTA AAA AAT TTG GGG ATA AAG GAT 1022 
Asp Phe Gly Ile Gly Ala Leu Val Leu Lys Asn Leu Gly Ile Lys Asp 
305 310 315 

TTC AGG CTC TTA AGC TCT TGT GAA GAC AGG CAG TAT AAG GCT TTG AGC 1070 
Phe Arg Leu Leu Ser Ser Cys Glu Asp Arg Gln Tyr Lys Ala Leu Ser 
320 325 330 

GGG TTT GGG CTT AAG CTT GTA GAA ACG ATT AGC CTT TAAGAGGCTC GTTAAG 1122 
Gly Phe Gly Leu Lys Leu Val Glu Thr Ile Ser Leu 
335 340 

TTTTATTGAA TGTGTTGTAA TGTTTTTAAG GTATAATAAA CTCTTTTTAA 1172 

(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:426: 

Met Ile Leu Lys Arg Val Thr Glu Ala Leu Glu Ala Tyr Lys Asn Gly 

1 5 10 15 

Glu Met Leu Ile Val Met Asp Asp Glu Asp Arg Glu Asn Glu Gly Asp 

20 25 30 

Leu Val Leu Ala Gly Ile Phe Ser Thr Pro Glu Lys Ile Asn Phe Met 

35 40 45 

Ala Thr His Ala Arg Gly Leu Ile Cys Val Ser Leu Thr Lys Asp Leu 

50 55 60 

Ala Lys Lys Phe Glu Leu Pro Pro Met Val Ser Val Asn Asp Ser Asn 
65 70 75 80 

His Glu Thr Ala Phe Thr Val Ser Ile Asp Ala Lys Glu Ala Arg Thr 

85 90 95 

Gly Ile Ser Ala Phe Glu Arg His Leu Thr Ile Glu Leu Leu Cys Lys 

100 105 110 

Asp Thr Thr Lys Pro Ser Asp Phe Val Arg Pro Gly His Ile Phe Pro 
115 120 125 
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Asn Glu Gly Asp Leu Val Leu Ala Gly lie Phe Ser Thr Pro Glu Lys 
30 35 40 

ATC AAT TTC ATG GCC ACG CAT GCT AGG GGG TTG ATT TGC GTG TCT TTG 254 
Ile Asn Phe Met Ala Thr His Ala Arg Gly Leu Ile Cys Val Ser Leu 
45 50 55 60 

ACC AAA GAT TTA GCG AAA AAA TTT GAA TTA CCC CCT ATG GTT AGC GTG 302 
Thr Lys Asp Leu Ala Lys Lys Phe Glu Leu Pro Pro Met Val Ser Val 
65 70 75 

AAT GAT TCT AAC CAT GAG ACC GCT TTC ACG GTT TCC ATT GAC GCT AAA 350 
Asn Asp Ser Asn His Glu Thr Ala Phe Thr Val Ser Ile Asp Ala Lys 
80 85 90 

GAA GCC AGA ACC GGG ATT TCT GCT TTT GAA AGG CAT TTA ACG ATT GAA 398 
Glu Ala Arg Thr Gly Ile Ser Ala Phe Glu Arg His Leu Thr Ile Glu 
95 100 105 

TTA TTG TGT AAA GAC ACC ACC AAA CCG AGC GAT TTT GTG CGC CCG GGG 446 
Leu Leu Cys Lys Asp Thr Thr Lys Pro Ser Asp Phe Val Arg Pro Gly 
110 115 120 

CAT ATT TTC CCT TTG ATC GCC AAA GAC GGG GGC GTG TTA GCG CGC ACG 494 
His Ile Phe Pro Leu Ile Ala Lys Asp Gly Gly Val Leu Ala Arg Thr 
125 130 135 140 

GGC CAT ACT GAA GCG AGC GTG GAT TTG TGC AAA TTA GCT GGA TTA AAG 542 
Gly His Thr Glu Ala Ser Val Asp Leu Cys Lys Leu Ala Gly Leu Lys 
145 150 155 

CCC GTG AGC GTG ATT TGT GAA ATC ATG AAA GAA GAT GGC TCT ATG GCG 590 
Pro Val Ser Val Ile Cys Glu Ile Met Lys Glu Asp Gly Ser Met Ala 
160 165 170 

AGA AGG GGG GAT AAA TTT TTG AGC GAT TTC GCC CTC AAA CAT AAC CTT 638 
Arg Arg Gly Asp Lys Phe Leu Ser Asp Phe Ala Leu Lys His Asn Leu 
175 180 185 

AAA ACT CTC TAT GTC TCT GAT TTG ATT AGC TAT CGT TTG GAA AAT GAA 686 
Lys Thr Leu Tyr Val Ser Asp Leu Ile Ser Tyr Arg Leu Glu Asn Glu 
190 195 200 

AGT TTG CTG AAA ATG TTT TGT CAA GAA GAA AGG GAA TTT TTA AAA CAC 734 
Ser Leu Leu Lys Met Phe Cys Gln Glu Glu Arg Glu Phe Leu Lys His 
205 210 215 220 

CAA ACG CAA TGC TAC ACT TTT TTA GAT CAC CAG CAA AAA AAC CAT TAC 782 
Gln Thr Gln Cys Tyr Thr Phe Leu Asp His Gln Gln Lys Asn His Tyr 
225 230 235 

GCT TTT AAG TTT AAA GGC GCA AAA ACC CAT GAT TTA GCC CCT TTA GTG 830 
Ala Phe Lys Phe Lys Gly Ala Lys Thr His Asp Leu Ala Pro Leu Val 
240 245 250 
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Glu Asn His Ser Lys Tyr Arg Asp lie Pro Phe Ala Glu Val Ser Leu 

100 105 110 

lie Tyr Gly Tyr Lys Gin Phe Phe Pro Lys Lys Glu Arg Tyr Gly Phe 

115 120 125 

Arg Phe Tyr Val Ser Leu Asp Tyr Ala Tyr Gly Phe Phe Leu Lys Asn 

130 135 140 

Lys Gly Val Leu Gly Asp Ser Leu Arg Glu Ser Ser Gin lie Pro Lys 
145 150 155 160 

Ser Tyr Arg Glu Lys Leu Gin Arg Lys Glu Thr Phe lie Asn Ala lie 

165 170 175 

Phe Tyr Gly Ala Gly Ala Asp Phe Leu Tyr Lys Arg Ala Phe Gly Thr 

180 185 190 

Leu lie Leu Gly Met Asn Phe Val Gly Glu Thr Trp Phe Tyr Glu Thr 

195 200 205 

Lys He Phe Lys Lys Trp Ala Lys Asp Pro Leu Ser Val Tyr His Pro 

210 215 220 

Tyr Met Phe Gin Val Met Leu Asn Val Gly Tyr Arg Tyr Arg Phe Ser 
225 230 235 240 

Arg Tyr Lys Asn Trp Ala He Glu Leu Gly Ala Arg He Pro Phe Leu 

245 250 255 

Thr Asn Asp Tyr Phe Lys Thr Pro Leu Tyr Thr Leu His Phe Lys Arg 

260 265 270 

Asn He Ser Val Tyr Leu Thr Ser Thr Tyr Asp Phe 
275 280 

(2) INFORMATION FOR SEQ ID NO:425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 75... 1106 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 

TAATCGTTAT TAAACATGCT ATATTTCTTT TTTCTATAAA ACTCAATATT ATTGAATAAA 60 

ACTAGGGAGT TAGA ATG ATC TTA AAA CGA GTT ACT GAA GCT TTA GAA GCG 110 
Met Ile Leu Lys Arg Val Thr Glu Ala Leu Glu Ala 
1 5 10 

TAT AAA AAT GGC GAA ATG CTC ATT GTT ATG GAC GAT GAA GAC AGA GAA 158 
Tyr Lys Asn Gly Glu Met Leu Ile Val Met Asp Asp Glu Asp Arg Glu 
15 20 25 

AAT GAG GGG GAT TTG GTT TTA GCT GGG ATT TTT TCT ACC CCT GAG AAA 206 
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TTT TAT GGC GCG GGA GCT GAC TTT TTA TAG AAA CGC GCT TTT GGA ACG 5 76 

Phe Tyr Gly Ala Gly Ala Asp Phe Leu Tyr Lys Arg Ala Phe Gly Thr 
180 185 190 

CTG ATT TTA GGG ATG AAT TTC GTG GGA GAA ACC TGG TTT TAT GAA ACA 624 
Leu Ile Leu Gly Met Asn Phe Val Gly Glu Thr Trp Phe Tyr Glu Thr 
195 200 205 

AAG ATT TTT AAA AAG TGG GCT AAA GAT CCT TTG AGC GTT TAT CAC CCT 672 
Lys Ile Phe Lys Lys Trp Ala Lys Asp Pro Leu Ser Val Tyr His Pro 
210 215 220 

TAC ATG TTT CAA GTG ATG TTG AAT GTG GGG TAT CGT TAC CGC TTT TCA 720 
Tyr Met Phe Gln Val Met Leu Asn Val Gly Tyr Arg Tyr Arg Phe Ser 
225 230 235 240 

AGG TAT AAG AAT TGG GCG ATA GAA TTG GGT GCG CGC ATC CCT TTT TTA 768 
Arg Tyr Lys Asn Trp Ala Ile Glu Leu Gly Ala Arg Ile Pro Phe Leu 
245 250 255 

ACC AAT GAT TAT TTT AAA ACC CCT TTA TAC ACC CTT CAT TTC AAG CGC 816 
Thr Asn Asp Tyr Phe Lys Thr Pro Leu Tyr Thr Leu His Phe Lys Arg 
260 265 270 

AAT ATT TCT GTC TAT CTC ACT TCA ACT TAT GAC TTT TAGTTTTTTA AATTTT 868 
Asn Ile Ser Val Tyr Leu Thr Ser Thr Tyr Asp Phe 
275 280 

TGAAAACTAG AATTAAAACC GCTTTTTATA AACTGG 904 

(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

Arg Ile Lys Lys Glu Arg Leu Asn Lys Leu Leu Lys Arg Gly Phe Leu 

1 5 10 15 

Ala Phe Phe Leu Ser Val Tyr Leu Arg Ala Asp Asp Leu Val Thr Tyr 

20 25 30 

Thr Ile Ile Lys Glu Lys Asp Leu Gly Tyr Gln Arg Phe Leu Ala Lys 

35 40 45 

Lys Cys Leu Arg Gly Lys Thr His Pro Pro Cys Phe Thr Lys Pro Lys 

50 55 60 

Lys Pro Lys Arg Lys Leu Phe Asn Ile Asp Lys Ser Ser His Tyr Tyr 
65 70 75 80 

Gly Thr Ser Val Val Gln Met Ser Trp Leu Gln Ser Arg Glu Lys Phe 
85 90 95 




WO 98/43478 



PCT/US98/06371 



(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1 . . . 852 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:423: 

CGC ATA AAA AAA GAA CGC TTG AAC AAA CTG CTT AAA AGG GGG TTT TTA 48 
Arg Ile Lys Lys Glu Arg Leu Asn Lys Leu Leu Lys Arg Gly Phe Leu 
1 5 10 15 

GCG TTC TTT TTG AGC GTG TAT TTA AGG GCT GAT GAT TTG GTT ACT TAC 96 
Ala Phe Phe Leu Ser Val Tyr Leu Arg Ala Asp Asp Leu Val Thr Tyr 
20 25 30 

ACC ATC ATC AAA GAA AAA GAT CTA GGA TAC CAG CGG TTT TTA GCC AAG 144 
Thr Ile Ile Lys Glu Lys Asp Leu Gly Tyr Gln Arg Phe Leu Ala Lys 
35 40 45 

AAG TGT TTA AGG GGT AAA ACC CAC CCT CCG TGT TTT ACT AAG CCT AAA 192 
Lys Cys Leu Arg Gly Lys Thr His Pro Pro Cys Phe Thr Lys Pro Lys 
50 55 60 

AAG CCT AAA AGA AAA CTT TTT AAT ATA GAC AAA AGC TCC CAC TAT TAT 240 
Lys Pro Lys Arg Lys Leu Phe Asn Ile Asp Lys Ser Ser His Tyr Tyr 
65 70 75 80 

GGC ACA AGC GTG GTG CAA ATG TCA TGG CTA CAG AGT AGG GAA AAA TTT 288 
Gly Thr Ser Val Val Gln Met Ser Trp Leu Gln Ser Arg Glu Lys Phe 
85 90 95 

GAA AAC CAT TCA AAA TAC CGA GAC ATT CCT TTT GCT GAA GTC AGT TTG 336 
Glu Asn His Ser Lys Tyr Arg Asp Ile Pro Phe Ala Glu Val Ser Leu 
100 105 110 

ATT TAT GGC TAT AAA CAA TTT TTT CCT AAA AAA GAG CGC TAC GGC TTC 384 
Ile Tyr Gly Tyr Lys Gln Phe Phe Pro Lys Lys Glu Arg Tyr Gly Phe 
115 120 125 

CGT TTT TAT GTC TCT TTG GAT TAC GCT TAT GGG TTT TTT CTT AAA AAT 432 
Arg Phe Tyr Val Ser Leu Asp Tyr Ala Tyr Gly Phe Phe Leu Lys Asn 
130 135 140 

AAG GGC GTG TTG GGC GAT AGT TTG AGG GAG AGT TCG CAA ATC CCT AAA 480 
Lys Gly Val Leu Gly Asp Ser Leu Arg Glu Ser Ser Gln Ile Pro Lys 
145 150 155 160 

AGC TAT AGA GAA AAA TTG CAA AGA AAA GAG ACT TTT ATT AAC GCT ATT 528 
Ser Tyr Arg Glu Lys Leu Gln Arg Lys Glu Thr Phe Ile Asn Ala Ile 
165 170 175 
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AAA GTG TTA GAG AAA AAC GTG AAG TGATTGCATG GCGTTATTAG AGATTATCCA 693 
Lys Val Leu Gln Lys Asn Val Lys 
190 195 

TTACCCTTCT AAAATCTTAA GAACGATTTC TAAAGAGGTC GTT 736 

(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

Met Met Gly Tyr Ile Pro Tyr Val Ile Glu Asn Thr Asp Arg Gly Glu 

1 5 10 15 

Arg Ser Tyr Asp Ile Tyr Ser Arg Leu Leu Lys Asp Arg Ile Val Leu 

20 25 30 

Leu Ser Gly Glu Ile Asn Asp Ser Val Ala Ser Ser Ile Val Ala Gln 

35 40 45 

Leu Leu Phe Leu Glu Ala Glu Asp Pro Glu Lys Asp Ile Gly Leu Tyr 

50 55 60 

Ile Asn Ser Pro Gly Gly Val Ile Thr Ser Gly Leu Ser Ile Tyr Asp 
65 70 75 80 

Thr Met Asn Phe Ile Arg Pro Asp Val Ser Thr Ile Cys Ile Gly Gln 

85 90 95 

Ala Ala Ser Met Gly Ala Phe Leu Leu Ser Cys Gly Ala Lys Gly Lys 

100 105 110 

Arg Phe Ser Leu Pro His Ser Arg Ile Met Ile His Gln Pro Leu Gly 

115 120 125 

Gly Ala Gln Gly Gln Ala Ser Asp Ile Glu Ile Ile Ser Asn Glu Ile 

130 135 140 

Leu Arg Leu Lys Gly Leu Met Asn Ser Ile Leu Ala Gln Asn Ser Gly 
145 150 155 160 

Gln Ser Leu Glu Gln Ile Ala Lys Asp Thr Asp Arg Asp Phe Tyr Met 

165 170 175 

Ser Ala Lys Glu Ala Lys Glu Tyr Gly Leu Ile Asp Lys Val Leu Gln 

180 185 190 

Lys Asn Val Lys 
195 

(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

GCAACAAGAA ATTTTAGAGA AAATGAGACC CAACGCTCAA AAAATTCAAG CGGGTTAAAC 60 
GGCTAAAAAG GAGAG ATG ATG GGA TAC ATT CCT TAT GTA ATA GAG AAT ACC 111 
Met Met Gly Tyr Ile Pro Tyr Val Ile Glu Asn Thr 
1 5 10 

GAT CGT GGG GAG CGT AGC TAT GAT ATT TAC TCG CGC CTT TTA AAG GAT 159 
Asp Arg Gly Glu Arg Ser Tyr Asp Ile Tyr Ser Arg Leu Leu Lys Asp 
15 20 25 

CGC ATT GTT TTA TTG AGC GGT GAA ATT AAT GAC AGC GTG GCG TCT TCT 207 
Arg Ile Val Leu Leu Ser Gly Glu Ile Asn Asp Ser Val Ala Ser Ser 
30 35 40 

ATC GTG GCC CAA CTC TTG TTT TTG GAA GCT GAA GAC CCT GAA AAA GAC 255 
Ile Val Ala Gln Leu Leu Phe Leu Glu Ala Glu Asp Pro Glu Lys Asp 
45 50 55 60 

ATT GGT TTG TAT ATC AAT TCT CCC GGT GGG GTG ATA ACA AGC GGT CTT 303 
Ile Gly Leu Tyr Ile Asn Ser Pro Gly Gly Val Ile Thr Ser Gly Leu 
65 70 75 

AGT ATT TAT GAC ACC ATG AAT TTT ATC CGC CCT GAT GTT TCC ACG ATT 351 
Ser Ile Tyr Asp Thr Met Asn Phe Ile Arg Pro Asp Val Ser Thr Ile 
80 85 90 

TGC ATC GGT CAA GCG GCT TCT ATG GGG GCG TTT TTA CTG AGC TGT GGG 399 
Cys Ile Gly Gln Ala Ala Ser Met Gly Ala Phe Leu Leu Ser Cys Gly 
95 100 105 

GCT AAG GGC AAG CGC TTT TCG CTA CCC CAT TCA AGG ATT ATG ATC CAC 447 
Ala Lys Gly Lys Arg Phe Ser Leu Pro His Ser Arg Ile Met Ile His 
110 115 120 

CAG CCT TTA GGG GGG GCT CAA GGG CAA GCG AGC GAT ATT GAA ATC ATT 495 
Gln Pro Leu Gly Gly Ala Gln Gly Gln Ala Ser Asp Ile Glu Ile Ile 
125 130 135 140 

TCT AAT GAG ATT CTC AGG CTT AAA GGC TTG ATG AAT TCT ATT CTA GCT 543 
Ser Asn Glu Ile Leu Arg Leu Lys Gly Leu Met Asn Ser Ile Leu Ala 
145 150 155 

CAA AAC TCA GGG CAG AGT TTG GAG CAA ATC GCT AAA GAC ACG GAC AGG 591 
Gln Asn Ser Gly Gln Ser Leu Glu Gln Ile Ala Lys Asp Thr Asp Arg 
160 165 170 

GAT TTT TAT ATG AGT GCT AAA GAA GCT AAA GAG TAT GGC TTG ATT GAT 639 
Asp Phe Tyr Met Ser Ala Lys Glu Ala Lys Glu Tyr Gly Leu Ile Asp 
175 180 185 
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(2) INFORMATION FOR SEQ ID NO: 42 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 



Met Leu Pro Phe Asp Val Phe Ile Lys Ser Tyr Pro Thr Pro Cys Tyr 

1 5 10 15 

Phe Lys Gln Phe Leu Arg Leu Lys Lys Thr Tyr Pro Ser Lys Leu Asn 

20 25 30 

Glu Ser Phe Leu Phe Arg Arg Ile Asp Ala Gly Phe Ile Ser Ser Ile 

35 40 45 

Ala Gly Tyr Pro Phe Ala Leu His Ser His Ser Leu Gly Ile Val Ala 

50 55 60 

Tyr Lys Glu Val Leu Ser Val Leu Val Val Asp Thr Lys Asn Ala Phe 
65 70 75 80 

Asp Lys Glu Ser Ala Ser Ser Asn Ala Leu Ser Gln Ala Leu Gly Leu 

85 90 95 

Lys Gly Glu Val Leu Ile Gly Asn Lys Ala Leu Gln Phe Tyr Tyr Ser 

100 105 110 

Asn Pro Lys Lys Asp Phe Ile Asp Leu Ala Ala Leu Trp Tyr Glu Lys 

115 120 125 

Lys Arg Leu Pro Phe Val Phe Gly Arg Leu Cys Tyr Tyr Gln Asn Lys 

130 135 140 

Asp Phe Tyr Lys Arg Leu Ser Leu Ala Phe Lys His Gln Lys Thr Lys 
145 150 155 160 

Ile Pro Tyr Tyr Ile Leu Lys Glu Ala Ala Leu Lys Thr Asn Leu Lys 

165 170 175 

Arg Gln Asp Ile Leu Asn Tyr Leu Gln Lys Ile Tyr Tyr Thr Leu Gly 

180 185 190 

Lys Lys Glu Gln Leu Gly Leu Lys Ala Phe Tyr Arg Glu Leu Leu Phe 

195 200 205 

Lys Arg Ile Gln Lys Pro Lys Arg Phe 

210 215 



(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 736 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 76... 663 
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10 15 20 

AAA AAA ACC TAC CCC TCC AAA CTC AAT GAG AGT TTT TTA TTC AGG CGT 211 
Lys Lys Thr Tyr Pro Ser Lys Leu Asn Glu Ser Phe Leu Phe Arg Arg 
25 30 35 

ATT GAT GCG GGG TTT ATT TCT TCT ATC GCC GGC TAT CCA TTC GCT CTT 259 
Ile Asp Ala Gly Phe Ile Ser Ser Ile Ala Gly Tyr Pro Phe Ala Leu 
40 45 50 55 

CAT TCC CAT TCT CTA GGC ATT GTC GCT TAT AAG GAA GTT TTA AGC GTG 307 
His Ser His Ser Leu Gly Ile Val Ala Tyr Lys Glu Val Leu Ser Val 
60 65 70 

CTG GTT GTG GAT ACA AAA AAC GCT TTT GAT AAA GAA AGC GCT TCT TCA 355 
Leu Val Val Asp Thr Lys Asn Ala Phe Asp Lys Glu Ser Ala Ser Ser 
75 80 85 

AAC GCC CTC TCT CAA GCG CTA GGG TTA AAG GGC GAA GTG TTA ATC GGC 403 
Asn Ala Leu Ser Gln Ala Leu Gly Leu Lys Gly Glu Val Leu Ile Gly 
90 95 100 

AAT AAA GCA CTG CAG TTT TAT TAT TCC AAC CCT AAA AAA GAT TTT ATA 451 
Asn Lys Ala Leu Gln Phe Tyr Tyr Ser Asn Pro Lys Lys Asp Phe Ile 
105 110 115 

GAT TTA GCC GCT CTT TGG TAT GAA AAA AAA CGC TTG CCG TTT GTT TTT 499 
Asp Leu Ala Ala Leu Trp Tyr Glu Lys Lys Arg Leu Pro Phe Val Phe 
120 125 130 135 

GGG CGT TTG TGT TAT TAC CAA AAC AAG GAT TTT TAC AAG CGC TTG TCT 547 
Gly Arg Leu Cys Tyr Tyr Gln Asn Lys Asp Phe Tyr Lys Arg Leu Ser 
140 145 150 

TTA GCT TTC AAA CAT CAA AAA ACA AAA ATC CCT TAC TAC ATC CTT AAA 595 
Leu Ala Phe Lys His Gln Lys Thr Lys Ile Pro Tyr Tyr Ile Leu Lys 
155 160 165 

GAA GCC GCT TTA AAA ACC AAC TTA AAA CGC CAA GAT ATT TTA AAT TAC 643 
Glu Ala Ala Leu Lys Thr Asn Leu Lys Arg Gln Asp Ile Leu Asn Tyr 
170 175 180 

TTG CAA AAA ATT TAC TAC ACT TTA GGC AAA AAG GAG CAA TTA GGT CTT 691 
Leu Gln Lys Ile Tyr Tyr Thr Leu Gly Lys Lys Glu Gln Leu Gly Leu 
185 190 195 

AAA GCG TTC TAT CGT GAA TTG TTA TTC AAA CGC ATT CAA AAA CCC AAG 739 
Lys Ala Phe Tyr Arg Glu Leu Leu Phe Lys Arg Ile Gln Lys Pro Lys 
200 205 210 215 

CGT TTT TAGTGATTCG CTGAGAATGT AGGCTTAAAA TTCAGAAAGG GTGTTTTTAA GC 797 
Arg Phe 

AAGATTAGGT TACAATCACA AGTTTTATTA 827 
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35 40 45 

Val Tyr Asp Phe Ile Val Lys Leu Asp Tyr Leu Gly Leu Ile Gly Val 

50 55 60 

Ser Leu Ile Tyr Leu Leu Val Leu Ile Leu Lys Pro Phe Thr Leu Thr 
65 70 75 80 

Arg Gln Lys Cys Ala Cys Val Gly Ile Leu Cys Leu Ser Phe Tyr Ala 

85 90 95 

Trp Asn Phe Pro Val Lys Asp Ser Leu Met Val Leu Tyr Leu Phe Tyr 

100 105 110 

Phe Ala Leu Leu Ala Thr Leu Leu Trp Arg Phe Leu Gly Ala Ser Met 

115 120 125 

Lys Gln Ser Phe Leu Pro Ser Met Asn Ile Cys Ile Val Trp Val Phe 

130 135 140 

Ala Ser Ser Leu Gln Ser Phe Arg Phe Leu Ser Val Ser Asp Cys Val 
145 150 155 160 

Asp Phe Ser Leu Phe Thr Leu Ala Leu Ile Leu Leu Ile Leu Val Leu 

165 170 175 

Ile Tyr Cys Lys Arg Leu Phe Gly Leu Tyr Glu Tyr Ala Asn Thr Leu 

180 185 190 

Ile Leu Ile Val Gly Leu Ser Val Val Val Leu Cys Ser Ser Met Phe 

195 200 205 

Ile Gln Thr Lys Glu Tyr Tyr Gly Met Arg Leu Gly Phe Tyr Phe Leu 

210 215 220 

Gly Leu Leu Gly Trp Leu Leu Glu Tyr Val His Asn Thr Leu Arg Arg 
225 230 235 240 

Leu Glu His Gln Ile 
245 

(2) INFORMATION FOR SEQ ID NO:419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 95... 745 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

CTTTTTACTT TTAGTTAAGT TGTAAGAAAC TTTAGCTACC ATGCGATACA AAAAAGGATT 60 
TTAAGTGCGT TTTGGTAAAA TTGATTATTT GAAC ATG CTC CCT TTT GAT GTG TTT 115 

Met Leu Pro Phe Asp Val Phe 
1 5 

ATC AAA TCC TAC CCC ACC CCT TGT TAT TTC AAA CAA TTC TTA CGG CTT 163 
Ile Lys Ser Tyr Pro Thr Pro Cys Tyr Phe Lys Gln Phe Leu Arg Leu 
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Leu Ala Thr Leu Leu Trp Arg Phe Leu Gly Ala Ser Met Lys Gin Ser 
120 125 130 

TTC TTG CCC TCT ATG AAT ATT TGC ATC GTG TGG GTT TTT GCT TCT TCT 489 
Phe Leu Pro Ser Met Asn Ile Cys Ile Val Trp Val Phe Ala Ser Ser 
135 140 145 

TTA CAG AGT TTT AGG TTT TTA AGC GTG TCT GAT TGC GTG GAT TTT TCC 537 
Leu Gln Ser Phe Arg Phe Leu Ser Val Ser Asp Cys Val Asp Phe Ser 
150 155 160 

CTT TTT ACA CTC GCG CTT ATT TTA TTG ATA CTG GTT TTA ATC TAT TGC 585 
Leu Phe Thr Leu Ala Leu Ile Leu Leu Ile Leu Val Leu Ile Tyr Cys 
165 170 175 

AAA CGC CTT TTT GGG TTG TAT GAA TAC GCT AAC ACG CTC ATT TTG ATC 633 
Lys Arg Leu Phe Gly Leu Tyr Glu Tyr Ala Asn Thr Leu Ile Leu Ile 
180 185 190 195 

GTG GGG CTT AGC GTG GTG GTG CTA TGC TCT AGC ATG TTC ATT CAA ACT 681 
Val Gly Leu Ser Val Val Val Leu Cys Ser Ser Met Phe Ile Gln Thr 
200 205 210 

AAA GAA TAC TAT GGC ATG CGA TTG GGT TTT TAT TTT TTA GGC CTG TTA 729 
Lys Glu Tyr Tyr Gly Met Arg Leu Gly Phe Tyr Phe Leu Gly Leu Leu 
215 220 225 

GGG TGG CTT TTA GAA TAT GTG CAT AAC ACT TTA AGG CGT TTG GAA CAT 777 
Gly Trp Leu Leu Glu Tyr Val His Asn Thr Leu Arg Arg Leu Glu His 
230 235 240 

CAA ATT TAAAGCTCAA ATAGGAATAG CTAAAGCCTT TTGATTGAGT GTTTTTTTAG GG 835 
Gln Ile 
245 

CTTAAAAGCG GGTTTA 851 

(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

Met Asn Phe Tyr Gln Lys Ile Tyr Thr His Lys Val Val Phe Ser Ser 

1 5 10 15 

Leu Phe Phe Leu Leu Phe Leu Phe Asn Val Glu Thr Leu Leu Leu Ser 

20 25 30 

His Phe Ser Asp Asp Phe Ser Gln Leu Phe Phe Leu Phe Glu Asn His 
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180 185 190 

(2) INFORMATION FOR SEQ ID NO:417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 783 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

ATTGATAGTT TCTTCAGCAA GAATGATTAG GCAATGATTA GGTTGTAG ATG AAT TTT 57 

Met Asn Phe 
1 

TAT CAA AAA ATA TAC ACT CAT AAA GTC GTT TTT TCT TCA TTG TTT TTT 105 
Tyr Gln Lys Ile Tyr Thr His Lys Val Val Phe Ser Ser Leu Phe Phe 
5 10 15 

TTG TTG TTT TTG TTC AAT GTG GAA ACT TTG TTG CTT TCG CAT TTC AGC 153 
Leu Leu Phe Leu Phe Asn Val Glu Thr Leu Leu Leu Ser His Phe Ser 
20 25 30 35 

GAT GAT TTT TCG CAA TTG TTT TTT TTG TTT GAA AAC CAT GTT TAT GAT 201 
Asp Asp Phe Ser Gln Leu Phe Phe Leu Phe Glu Asn His Val Tyr Asp 
40 45 50 

TTC ATT GTC AAA TTA GAT TAT TTG GGG CTA ATA GGC GTT TCT TTA ATT 249 
Phe Ile Val Lys Leu Asp Tyr Leu Gly Leu Ile Gly Val Ser Leu Ile 
55 60 65 

TAT CTG CTT GTG CTT ATT CTA AAG CCT TTC ACC CTC ACG CGC CAA AAA 297 
Tyr Leu Leu Val Leu Ile Leu Lys Pro Phe Thr Leu Thr Arg Gln Lys 
70 75 80 

TGC GCT TGC GTA GGG ATA TTA TGC CTT TCT TTC TAC GCT TGG AAT TTT 345 
Cys Ala Cys Val Gly Ile Leu Cys Leu Ser Phe Tyr Ala Trp Asn Phe 
85 90 95 

CCT GTT AAA GAT TCT TTA ATG GTG CTT TAT CTT TTC TAT TTT GCG CTG 393 

Pro Val Lys Asp Ser Leu Met Val Leu Tyr Leu Phe Tyr Phe Ala Leu 
100 105 110 115 

TTA GCG ACT TTA TTG TGG CGT TTT TTA GGG GCT AGC ATG AAG CAA TCT 441 
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TTA GAA AAT TTA GGC TCA ATG ACT TTA GGG GCA AAG AGC CAG CCA AAT 54 0 

Leu Glu Asn Leu Gly Ser Met Thr Leu Gly Ala Lys Ser Gln Pro Asn
145 150 155 160

TGC AAG ACA TTA GAT AAA AAC GCC ACC ACC ACT AAA ATG ATT AAA ATC 588
Cys Lys Thr Leu Asp Lys Asn Ala Thr Thr Thr Lys Met Ile Lys Ile
165 170 175

GGT AAA AGC AAT AAA AAA GTG TCT TTA GCC AGT TGG TTA AAC AGC TCT T 637
Gly Lys Ser Asn Lys Lys Val Ser Leu Ala Ser Trp Leu Asn Ser Ser
180 185 190

GAACGCTTTC TTTACTGAAA TCTAGGGAAA AATCTTTCAA CACATGGCGA TACATTTCGC 697

TAAAGCCATC CACCCACCAT ATAAAAAAAA CAAAAATACT AATTAGCCCG GCCAATAACC 757

CCAAAACCCC CACCACTTCC ATGCTCTTAG GCACATTGCC TTCTTCTCTG GCTTTTTGGA 817

TTTTTTTCGC GCTAGGGAGT TCGGTTTTTT CTTCTTCAGC CATTGGCCCT CTTTTTTAAA 877

ATTTGAACGG CTAATTCATA GTCTTTTAGG GTGTTTAGGT TTAAAAATTC TTCTTCTTTG 937

TCAAAATT 945



(2) INFORMATION FOR SEQ ID NO: 416: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:416:



Val Gly Leu Val Thr Thr Thr Leu Ala Leu Gly Ile Ser Cys Ile Ile
1 5 10 15

Leu Phe Val Ala Phe Phe Ser Ile Ile Trp Arg Ile Leu Ala Leu Ile
20 25 30

Ser Gly Phe Pro Ser Cys Cys Leu Tyr Ser Ser Leu Thr Ser Cys Leu
35 40 45

Val Ile Phe Lys Glu Leu Val Tyr Trp Arg Arg Leu Ile Ala Lys Ser
50 55 60

Ile Lys Ala Lys Thr Lys Asn Lys Asn Lys Ser Glu Glu Met Ser His
65 70 75 80

Asn Ala Leu Phe Leu Asn His Asn Asn Trp Pro Cys Lys Phe Lys Arg
85 90 95

Ala Ala Trp Phe Asn Ser Pro Lys Asn Lys Glu Lys Met Lys Asn Pro
100 105 110

Arg Lys Lys Ala Lys Lys Thr Phe Lys Val Ile Lys Leu Pro Ser Arg
115 120 125

Ser Phe Phe Lys Glu Lys Arg Phe Leu Thr Pro Leu Ile Gly Leu Ile
130 135 140

Leu Glu Asn Leu Gly Ser Met Thr Leu Gly Ala Lys Ser Gln Pro Asn
145 150 155 160

Cys Lys Thr Leu Asp Lys Asn Ala Thr Thr Thr Lys Met Ile Lys Ile
165 170 175

Gly Lys Ser Asn Lys Lys Val Ser Leu Ala Ser Trp Leu Asn Ser Ser
180 185 190
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 61...636
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415: 

TTAGCCACTA CCACAGGCAC AGGGTGTTCT TCATCAAATT TGAGAGCGAC GGCGTAATGG 60 
GTG GGG TTA GTA ACC ACQ ACA TTG GCT TTA GGG ATT TCT TGC ATC ATT 10 8 

Val Gly Leu Val Thr Thr Thr Leu Ala Leu Gly Ile Ser Cys Ile Ile
1 5 10 15

TTA TTC GTG GCG TTT TTT AGC ATC ATT TGG CGG ATT TTG GCT TTG ATT 156
Leu Phe Val Ala Phe Phe Ser Ile Ile Trp Arg Ile Leu Ala Leu Ile
20 25 30

TCT GGG TTC CCT TCT TGC TGT TTG TAT TCG TCC TTA ACT TCT TGT TTA 204
Ser Gly Phe Pro Ser Cys Cys Leu Tyr Ser Ser Leu Thr Ser Cys Leu
35 40 45

GTC ATT TTT AAA GAG TTG GTG TAT TGG CGG CGT TTG ATC GCT AAA TCT 252
Val Ile Phe Lys Glu Leu Val Tyr Trp Arg Arg Leu Ile Ala Lys Ser
50 55 60

ATA AAA GCC AAG ACA AAA AAT AAA AAT AAA AGC GAA GAA ATG AGC CAT 300
Ile Lys Ala Lys Thr Lys Asn Lys Asn Lys Ser Glu Glu Met Ser His
65 70 75 80

AAC GCC TTA TTT TTA AAC CAC AAC AAC TGG CCT TGT AAA TTC AAA AGA 348
Asn Ala Leu Phe Leu Asn His Asn Asn Trp Pro Cys Lys Phe Lys Arg
85 90 95

GCC GCA TGG TTT AAT TCC CCT AAA AAC AAA GAA AAG ATG AAA AAC CCC 396
Ala Ala Trp Phe Asn Ser Pro Lys Asn Lys Glu Lys Met Lys Asn Pro
100 105 110

AGA AAA AAA GCT AAA AAA ACT TTT AAG GTG ATC AAA CTC CCA TCA AGG 444
Arg Lys Lys Ala Lys Lys Thr Phe Lys Val Ile Lys Leu Pro Ser Arg
115 120 125

AGC TTT TTT AAA GAA AAA AGG TTT TTG ACG CCA TTG ATA GGG TTG ATT 492
Ser Phe Phe Lys Glu Lys Arg Phe Leu Thr Pro Leu Ile Gly Leu Ile
130 135 140
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(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:414:



Met Leu Glu Asn Ser Ser Ile Trp Ser Asn Pro Ala Phe Val Ala Ile
1 5 10 15

Ile Cys Met Cys Val Leu Ser Leu Leu Arg Leu Asn Val Met Leu Ser
20 25 30

Met Ile Ser Ala Thr Leu Ile Ala Gly Leu Met Gly Gly Leu Gly Ile
35 40 45

Thr Glu Ser Phe Asn Ala Met Ile Asp Gly Met Lys Gly Asn Leu Asn
50 55 60

Ile Ala Leu Ser Tyr Ile Leu Leu Gly Ala Leu Ala Val Ala Ile Ala
65 70 75 80

Lys Ser Asn Leu Ile Lys Val Ala Leu Ser Lys Leu Ile Gly Leu Met
85 90 95

Asp Tyr Lys Arg Ser Thr Phe Cys Phe Leu Ile Ala Phe Ile Ala Cys
100 105 110

Phe Ser Gln Asn Leu Val Pro Val His Ile Ala Phe Ile Pro Ile Leu
115 120 125

Ile Pro Pro Leu Leu His Leu Met Asn Arg Leu Glu Leu Asp Arg Arg
130 135 140

Ala Val Ala Cys Ala Leu Thr Phe Gly Leu Gln Ala Pro Tyr Leu Val
145 150 155 160

Leu Pro Val Gly Phe Gly Leu Ile Phe Gln Thr Thr Ile Leu Glu Gln
165 170 175

Leu Lys Ala Asn Gly Val Ser Thr Thr Ile Ala Gln Ile Thr Gly Val
180 185 190

Met Trp Ile Ala Gly Leu Ala Met Val Val Gly Leu Leu Val Ala Val
195 200 205

Leu Thr Leu Tyr Lys Lys Pro Arg His Tyr Lys Glu Lys Ser Phe Asn
210 215 220

Ile Glu Asn Tyr Ala Ser Leu Gln Leu Asn Tyr His Asp Tyr Leu Thr
225 230 235 240

Phe Ile Gly Ile Val Val Ala Phe Val Ile Gln Leu Ala Thr Asp Ser
245 250 255

Met Pro Leu Ala Ala Phe Leu Ala Leu Ala Ile Ile Leu Leu Gly Arg
260 265 270

Gly Ile Lys Phe Lys Glu Thr Asp Ser Leu Met Asp Asp Ser Val Lys
275 280 285

Met Met Ala Phe Ile Ala Phe Val Met Leu Val Ala Ser Gly Phe Gly
290 295 300

Glu Val Leu Gln Lys Val His Ala Ile Glu Gly Leu Val Asn Ala Ile
305 310 315 320

Thr Ser Val Val Gln Gly Lys Leu Leu Gly Ala Phe Leu Met Leu Val
325 330 335

Val Gly Leu Phe Ile Thr Met Gly Ile Gly Thr Ser Phe Gly Thr Ile
340 345 350

Pro Ile Ile Ala Val Phe Tyr Val Pro Leu Cys Ala Lys Leu Gly Phe
355 360 365

Ser Val Glu Ser Thr Ile Leu Leu Ile Ala
370 375



(2) INFORMATION FOR SEQ ID NO:415:
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CTA TAG AAA AAA CCC AGG CAC TAG AAA GAG AAA TCT TTT AAT ATA GAA 728 
Leu Tyr Lys Lys Pro Arg His Tyr Lys Glu Lys Ser Phe Asn Ile Glu
215 220 225

AAT TAG GCC TCG CTT GAA TTA AAC TAG CAT GAG TAG TTG AGT TTT ATA 776
Asn Tyr Ala Ser Leu Gln Leu Asn Tyr His Asp Tyr Leu Thr Phe Ile
230 235 240

GGG ATT GTG GTA GGG TTT GTG ATG GAA TTA GGG AGG GAT TGG ATG CCC 824
Gly Ile Val Val Ala Phe Val Ile Gln Leu Ala Thr Asp Ser Met Pro
245 250 255

TTA GCC GCC TTT TTA GGG TTA GGG ATG ATG TTA TTA GGG CGT GGG ATT 872
Leu Ala Ala Phe Leu Ala Leu Ala Ile Ile Leu Leu Gly Arg Gly Ile
260 265 270

AAG TTT AAA GAA AGA GAG TGG GTT ATG GAT GAT AGG GTG AAA ATG ATG 920
Lys Phe Lys Glu Thr Asp Ser Leu Met Asp Asp Ser Val Lys Met Met
275 280 285 290

GGG TTT ATG GCT TTT GTG ATG TTG GTG GGT AGG GGG TTT GGA GAA GTG 968
Ala Phe Ile Ala Phe Val Met Leu Val Ala Ser Gly Phe Gly Glu Val
295 300 305

TTG GAA AAA GTG GAT GGG ATA GAG GGG TTA GTG AAT GCG ATT AGA AGG 1016
Leu Gln Lys Val His Ala Ile Glu Gly Leu Val Asn Ala Ile Thr Ser
310 315 320

GTA GTG GAA GGG AAG GTT TTA GGG GCT TTT TTA ATG CTT GTT GTA GGG 1064
Val Val Gln Gly Lys Leu Leu Gly Ala Phe Leu Met Leu Val Val Gly
325 330 335

CTT TTT ATG ACT ATG GGG ATA GGG AGT TCT TTT GGG ACT ATT GCT ATG 1112
Leu Phe Ile Thr Met Gly Ile Gly Thr Ser Phe Gly Thr Ile Pro Ile
340 345 350

ATG GCT GTG TTT TAT GTG GCT TTA TGG GGG AAA TTA GGT TTT AGG GTA 1160
Ile Ala Val Phe Tyr Val Pro Leu Cys Ala Lys Leu Gly Phe Ser Val
355 360 365 370

GAA TCT AGG ATT TTA GTG ATG GGA TAGGGGCAGG TTTAGGGGAT GCAGGGTCAG 1214
Glu Ser Thr Ile Leu Leu Ile Ala
375

GGGCTAGCGA TAGGACGATG GGG 1237 
(2) INFORMATION FOR SEQ ID NO:414:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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AAAAATTCTA TCATTTTTGC GCGGTATTTT CATTTTAACA AGGAGCAAAA ATG CTA 56 

Met Leu 
1 

GAA AAT AGC TCT ATA TGG AGC AAT CCT GCC TTT GTG GCT ATC ATT TGC 104
Glu Asn Ser Ser Ile Trp Ser Asn Pro Ala Phe Val Ala Ile Ile Cys
5 10 15

ATG TGC GTT CTT AGC CTT TTA AGG CTC AAT GTC ATG CTT TCT ATG ATT 152
Met Cys Val Leu Ser Leu Leu Arg Leu Asn Val Met Leu Ser Met Ile
20 25 30

AGT GCG ACT CTC ATA GCA GGA CTT ATG GGA GGG CTT GGG ATC ACG GAG 200
Ser Ala Thr Leu Ile Ala Gly Leu Met Gly Gly Leu Gly Ile Thr Glu
35 40 45 50

AGT TTT AAT GCA ATG ATA GAC GGC ATG AAA GGC AAT TTG AAC ATC GCT 248
Ser Phe Asn Ala Met Ile Asp Gly Met Lys Gly Asn Leu Asn Ile Ala
55 60 65

TTA AGC TAG ATC CTT TTA GGG GCT TTA GCG GTA GCG ATC GCT AAA AGC 296
Leu Ser Tyr Ile Leu Leu Gly Ala Leu Ala Val Ala Ile Ala Lys Ser
70 75 80

AAT CTC ATT AAA GTC GCT TTG AGT AAA TTA ATA GGT TTA ATG GAT TAG 344
Asn Leu Ile Lys Val Ala Leu Ser Lys Leu Ile Gly Leu Met Asp Tyr
85 90 95

AAG CGA TCC ACT TTT TGC TTT TTG ATC GCT TTC ATC GCA TGC TTT TCG 392
Lys Arg Ser Thr Phe Cys Phe Leu Ile Ala Phe Ile Ala Cys Phe Ser
100 105 110

CAA AAT TTA GTG CCG GTG CAT ATC GCT TTT ATC CCT ATT TTA ATC CCC 440
Gln Asn Leu Val Pro Val His Ile Ala Phe Ile Pro Ile Leu Ile Pro
115 120 125 130

CCT CTT TTG CAT TTA ATG AAC CGG CTA GAA TTG GAT AGA AGA GCG GTC 488
Pro Leu Leu His Leu Met Asn Arg Leu Glu Leu Asp Arg Arg Ala Val
135 140 145

GCT TGC GCT TTA ACC TTT GGC TTG CAA GCC CCC TAC TTG GTG CTT CCT 536
Ala Cys Ala Leu Thr Phe Gly Leu Gln Ala Pro Tyr Leu Val Leu Pro
150 155 160

GTA GGG TTT GGC TTG ATT TTT CAA ACC ACC ATT TTA GAG CAA TTA AAA 584
Val Gly Phe Gly Leu Ile Phe Gln Thr Thr Ile Leu Glu Gln Leu Lys
165 170 175

GCT AAT GGC GTT AGC ACC ACC ATA GCG CAA ATC ACA GGA GTG ATG TGG 632
Ala Asn Gly Val Ser Thr Thr Ile Ala Gln Ile Thr Gly Val Met Trp
180 185 190

ATA GCG GGG TTA GCG ATG GTC GTT GGA CTG CTT GTT GCT GTA TTA ACG 680
Ile Ala Gly Leu Ala Met Val Val Gly Leu Leu Val Ala Val Leu Thr
195 200 205 210
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15 10 15 

Met Val Trp Ile Pro Thr Thr Phe Leu Ala Cys Met Leu Thr Ala Phe
20 25 30

Ile Met Gly Phe Thr Asp Leu Lys Leu Asp Ser Asp Pro His Tyr Leu
35 40 45

Glu Arg Leu Lys Ala Gly Lys Ile Ser Pro Pro Lys Ile Lys Glu Glu
50 55 60

Lys Glu Thr Ser Lys Asn Ala Lys Leu Ser Leu Trp Ile Phe Ile Gly
65 70 75 80

Gly Val Val Ala Ile Val Phe Tyr Ala Ser Ala Ile Ser Lys Asn Ile
85 90 95

Ala Phe Val Ser Pro Val Val Leu Gly Arg Asp His Ala Ile Val Ser
100 105 110

Phe Met Leu Ser Val Ala Thr Leu Ile Val Leu Phe Cys Lys Ile Asn
115 120 125

Ala Asn Glu Ile Ala His Ser Ser Val Phe Lys Ser Gly Met Gln Ala
130 135 140

Cys Val Cys Val Leu Gly Val Ala Trp Leu Gly Asp Thr Phe Val Ser
145 150 155 160

Asn His Ile Asp Glu Ile Lys Arg Tyr Ala Ser Phe Leu Ile Ala Asp
165 170 175

Tyr Pro Phe Leu Leu Ala Val Ala Leu Phe Leu Ala Ser Met Leu Leu
180 185 190

Tyr Ser Gln Ala Ala Thr Ser Lys Ala Leu Ile Pro Ser Val Ile Thr
195 200 205

Ala Leu Gly Ile Ser Ala Asn His Thr Glu His Leu Tyr Ile Ile Val
210 215 220

Ala Ser Phe Ala Ser Val Ser Ala Leu Phe Val Leu Pro Thr Tyr Pro
225 230 235 240

Thr Leu Leu Gly Ala Ile Ala Met Asp Asn Thr Gly Thr Thr Lys Met
245 250 255

Gly Arg Tyr Val Phe Asp His Ala Phe Leu Ile Pro Gly Val Leu Val
260 265 270

Val Ser Leu Ser Val Ala Leu Gly Phe Val Val Ala Pro Leu Val Leu
275 280 285

(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1184 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 
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Glu lie Ala His Ser Ser Val Phe Lys Ser Gly Met Gin Ala Cys Val 
135 140 145 

TGC GTG TTG GGC GTG GCG TGG TTG GGC GAT ACT TTT GTG AGC AAT CAT 536
Cys Val Leu Gly Val Ala Trp Leu Gly Asp Thr Phe Val Ser Asn His
150 155 160

ATA GAT GAG ATC AAA CGA TAC GCT TCT TTT TTG ATC GCA GAT TAT CCG 584
Ile Asp Glu Ile Lys Arg Tyr Ala Ser Phe Leu Ile Ala Asp Tyr Pro
165 170 175

TTT TTA TTA GCC GTA GCG CTC TTT TTG GCT TCC ATG CTT TTG TAT TCG 632
Phe Leu Leu Ala Val Ala Leu Phe Leu Ala Ser Met Leu Leu Tyr Ser
180 185 190

CAA GCC GCC ACC TCT AAA GCG CTC ATC CCA AGC GTG ATC ACA GCC TTA 680
Gln Ala Ala Thr Ser Lys Ala Leu Ile Pro Ser Val Ile Thr Ala Leu
195 200 205 210

GGC ATT AGC GCT AAT CAT ACG GAG CAT TTG TAT ATT ATC GTG GCT TCG 728
Gly Ile Ser Ala Asn His Thr Glu His Leu Tyr Ile Ile Val Ala Ser
215 220 225

TTT GCG AGC GTT TCG GCG TTG TTT GTG TTA CCC ACT TAC CCC ACT TTA 776
Phe Ala Ser Val Ser Ala Leu Phe Val Leu Pro Thr Tyr Pro Thr Leu
230 235 240

CTA GGA GCG ATC GCT ATG GAT AAC ACC GGC ACC ACT AAA ATG GGC CGT 824
Leu Gly Ala Ile Ala Met Asp Asn Thr Gly Thr Thr Lys Met Gly Arg
245 250 255

TAT GTG TTT GAT CAT GCG TTT TTG ATC CCT GGG GTT TTA GTC GTG TCT 872
Tyr Val Phe Asp His Ala Phe Leu Ile Pro Gly Val Leu Val Val Ser
260 265 270

TTG AGC GTA GCG TTA GGG TTT GTT GTC GCG CCG TTA GTT TTG TAGATTTTA 923
Leu Ser Val Ala Leu Gly Phe Val Val Ala Pro Leu Val Leu
275 280 285

TCACCAACGA TAAAAAGCTT GGCGTTGCGA TTTTTCTAT^ CCCC 967 
(2) INFORMATION FOR SEQ ID NO:412:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:412:
Met Ser Gly Ile Leu Glu Pro Leu Gly Ala Asn Tyr Leu Thr Leu Leu
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...914
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 

TAGTCAAGTC GCTATTACTG CAAGCCCGGT GAGCGCAGCG GTNGGTGTTT ATG AGC 56 

Met Ser 
1 

GGC ATT TTA GAG CCT TTA GGA GCA AAT TAG TTG ACC CTT TTA ATG GTT 104
Gly Ile Leu Glu Pro Leu Gly Ala Asn Tyr Leu Thr Leu Leu Met Val
5 10 15

TGG ATC CCT ACG ACT TTT TTA GCA TGC ATG CTC ACG GCA TTT ATT ATG 152
Trp Ile Pro Thr Thr Phe Leu Ala Cys Met Leu Thr Ala Phe Ile Met
20 25 30

GGT TTT ACT GAT TTG AAA TTA GAC AGC GAT CCG CAT TAT TTA GAG CGC 200
Gly Phe Thr Asp Leu Lys Leu Asp Ser Asp Pro His Tyr Leu Glu Arg
35 40 45 50

TTG AAA GCG GGC AAA ATC TCG CCC CCT AAA ATC AAA GAA GAA AAA GAA 248
Leu Lys Ala Gly Lys Ile Ser Pro Pro Lys Ile Lys Glu Glu Lys Glu
55 60 65

ACC TCA AAA AAC GCG AAA TTA TCG TTA TGG ATT TTT ATC GGT GGG GTT 296
Thr Ser Lys Asn Ala Lys Leu Ser Leu Trp Ile Phe Ile Gly Gly Val
70 75 80

GTA GCG ATC GTT TTT TAT GCG AGC GCG ATT TCT AAA AAT ATC GCT TTT 344
Val Ala Ile Val Phe Tyr Ala Ser Ala Ile Ser Lys Asn Ile Ala Phe
85 90 95

GTT AGC CCG GTG GTT TTA GGC AGA GAT CAC GCG ATT GTG TCT TTC ATG 392
Val Ser Pro Val Val Leu Gly Arg Asp His Ala Ile Val Ser Phe Met
100 105 110

CTA AGC GTG GCG ACT TTA ATT GTG CTT TTT TGC AAA ATT AAC GCT AAT 440
Leu Ser Val Ala Thr Leu Ile Val Leu Phe Cys Lys Ile Asn Ala Asn
115 120 125 130

GAA ATC GCT CAT TCA AGC GTG TTT AAA TCC GGC ATG CAA GCG TGC GTG 488
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52... 165 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:409:

AAGGTTTTTA GCGAAATAAG CCCCCACGCC AAACACGCCC ATGCTCATTA G CAC AAT 57
His Asn
1

ATC GCA CAT AAA GCA CAA AGC GCA AAT CAA AAA CAC ATA ATT CCT AGC 105
Ile Ala His Lys Ala Gln Ser Ala Asn Gln Lys His Ile Ile Pro Ser
5 10 15

CAT CCC CCT TTC CAC AAT AAA CAA GGA TTG CGC CCC CAC CGC CGC ACA 153
His Pro Pro Phe His Asn Lys Gln Gly Leu Arg Pro His Arg Arg Thr
20 25 30

CAA AGA AAT CGC TAAACCAAAA CCTTCTATAA AAACCACAAA CATCTTGCTT AATCC 210
Gln Arg Asn Arg
35

TTTCACTC 218
(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:410:

His Asn Ile Ala His Lys Ala Gln Ser Ala Asn Gln Lys His Ile Ile
1 5 10 15

Pro Ser His Pro Pro Phe His Asn Lys Gln Gly Leu Arg Pro His Arg
20 25 30

Arg Thr Gln Arg Asn Arg
35 
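
The correspondence between the coding sequence annotated in SEQ ID NO:409 (LOCATION: 52...165) and the 38 residues of SEQ ID NO:410 can be checked by translating the codons with the standard genetic code. The following Python sketch is illustrative only and is not part of the sequence listing. The names `CODON_TABLE`, `THREE_LETTER`, `translate` and `CDS_409` are hypothetical, and the codons are transcribed by hand from positions 52 to 165 of SEQ ID NO:409, so any transcription slip would be an error of this sketch rather than of the listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter amino acid codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(codons):
    """Translate a list of DNA codons into three-letter residue names."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons]


# Codons at positions 52..165 of SEQ ID NO:409 (hand-transcribed assumption).
CDS_409 = (
    "CAC AAT "
    "ATC GCA CAT AAA GCA CAA AGC GCA AAT CAA AAA CAC ATA ATT CCT AGC "
    "CAT CCC CCT TTC CAC AAT AAA CAA GGA TTG CGC CCC CAC CGC CGC ACA "
    "CAA AGA AAT CGC"
).split()

protein = translate(CDS_409)
print(len(protein))
print(" ".join(protein))
```

The printed residues can be compared line by line with SEQ ID NO:410. A codon that translates to `Ter` inside an annotated coding range would suggest a character-recognition error in the nucleotide row rather than a genuine stop.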

(2) INFORMATION FOR SEQ ID NO: 411: 
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35 40 45 50 

GCG TTT TTA GCT CTG TTA GCG CCT AAA ATC TAT TTA AGC AAT AAT ATC 248
Ala Phe Leu Ala Leu Leu Ala Pro Lys Ile Tyr Leu Ser Asn Asn Ile
55 60 65

TAT TAT ATT AGC CGT AAA ATC AAC ACC CTA GAA GAT CAA AAA CGC CTG 296
Tyr Tyr Ile Ser Arg Lys Ile Asn Thr Leu Glu Asp Gln Lys Arg Leu
70 75 80

CTT TTA GAA GAG CAA CAA ATC CTA AAA AAC GAA TTA GAA AAA GAG CGT 344
Leu Leu Glu Glu Gln Gln Ile Leu Lys Asn Glu Leu Glu Lys Glu Arg
85 90 95

TTT AAA TAC TAG ATA GAA AAT AGT GAA AAT ATT GGC GAT ATT GCG TTT T 393
Phe Lys Tyr Tyr Ile Glu Asn Ser Glu Asn Ile Gly Asp Ile Ala Phe
100 105 110

AAGTGAAAAA CCCCCTATCC CCTTAAGAGA GCTTAATTAA TGGTCAATCA TT 445

(2) INFORMATION FOR SEQ ID NO:408:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:408: 

Met Asn Ile Lys Thr His Ser Ser Asn Glu Lys Glu Arg Phe Val Arg
1 5 10 15

Ile Glu Glu Asp Glu Lys Lys Gly Leu Phe Ala Gly Thr Ala Asn Glu
20 25 30

Asn Ser His Gly Leu Ser Leu Met Ala Leu Ile Gly Val Leu Val Phe
35 40 45

Gly Gly Ala Phe Leu Ala Leu Leu Ala Pro Lys Ile Tyr Leu Ser Asn
50 55 60

Asn Ile Tyr Tyr Ile Ser Arg Lys Ile Asn Thr Leu Glu Asp Gln Lys
65 70 75 80

Arg Leu Leu Leu Glu Glu Gln Gln Ile Leu Lys Asn Glu Leu Glu Lys
85 90 95

Glu Arg Phe Lys Tyr Tyr Ile Glu Asn Ser Glu Asn Ile Gly Asp Ile
100 105 110

Ala Phe
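
Each nucleotide record in this listing annotates a coding range under (ix) FEATURE, and the paired amino acid record states a length. If the range is read as 1-based and inclusive, it should span exactly three nucleotides per residue. The sketch below illustrates that check under this assumption; `cds_residue_count` and `RECORDS` are hypothetical names, and the values are copied from the LOCATION and LENGTH fields of SEQ ID NOS: 405 to 410.

```python
def cds_residue_count(start: int, end: int) -> int:
    """Return the number of codons in a 1-based, inclusive coding range."""
    span = end - start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return span // 3


# (nucleotide SEQ ID NO, CDS start, CDS end, stated length of paired protein)
RECORDS = [
    (405, 51, 1061, 337),  # paired with SEQ ID NO:406
    (407, 51, 392, 114),   # paired with SEQ ID NO:408
    (409, 52, 165, 38),    # paired with SEQ ID NO:410
]

for seq_id, start, end, stated in RECORDS:
    computed = cds_residue_count(start, end)
    status = "consistent" if computed == stated else "MISMATCH"
    print(f"SEQ ID NO:{seq_id}: {computed} codons, {stated} residues stated, {status}")
```

A mismatch would most likely point to a digit lost or split during text extraction (for example a LOCATION or LENGTH value broken by a stray space) rather than to an error in the underlying sequence.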



(2) INFORMATION FOR SEQ ID NO:409:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 base pairs 

(B) TYPE: nucleic acid 
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Ser Ser Lys Asp 
Ala Lys Ala Ile Glu Glu Leu Gln Asp Lys Leu Arg
195 200 205

Phe Ser Ala Pro Lys Leu Gly Val Ser Ile Gln Ala His His Leu Lys
210 215 220

Asn Leu Leu Glu Ala Phe Tyr His Gln Asn Lys Glu Ser Leu Gly Phe
225 230 235 240

Phe Ser Pro Tyr Phe Ser Leu Arg Ser Gln Thr Pro Ser Val Ser Tyr
245 250 255

Glu Ser Ala Leu Ala Ser Leu Glu Asn Tyr Phe Met Ala Leu Phe Gln
260 265 270

Ser His Phe Lys Asp Asp Thr Ala Leu Gln Gln Asn Phe Lys Gly Leu
275 280 285

Leu Gln Ala Phe Val Ser Met Ala Lys Asp Lys Arg Ser Gln Ile Ala
290 295 300

Leu Asn Ala Gln Ala Lys Asp Asn Ala Lys Leu Thr Phe Asn Ala Leu
305 310 315 320

Leu Glu Ser Leu Ser Val Asn Phe Phe Gln Ser Tyr Lys Ile Ser His
325 330 335

Glu



(2) INFORMATION FOR SEQ ID NO:407:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 392 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:407: 

AAAATGAGGG TGTTTCAATT CAAGCCATGA GTAAGTTATA TGAGTAAGCC ATG AAT 56 

Met Asn 
1 

ATC AAA ACC CAT TCT TCA AAT GAA AAA GAA CGC TTT GTG CGC ATA GAA 104
Ile Lys Thr His Ser Ser Asn Glu Lys Glu Arg Phe Val Arg Ile Glu
5 10 15

GAG GAC GAA AAG AAA GGA TTA TTT GCT GGA ACT GCA AAT GAA AAT TCG 152
Glu Asp Glu Lys Lys Gly Leu Phe Ala Gly Thr Ala Asn Glu Asn Ser
20 25 30

CAC GGC CTT TCT TTA ATG GCT TTA ATA GGG GTA TTG GTT TTT GGG GGC 200
His Gly Leu Ser Leu Met Ala Leu Ile Gly Val Leu Val Phe Gly Gly
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TTT T^AA GAC GAT ACC GCA CTC CAA CAG AAT TTT AAA GGA TTG TTG CAA 920 
Phe Lys Asp Asp Thr Ala Leu Gln Gln Asn Phe Lys Gly Leu Leu Gln
275 280 285 290

GCC TTT GTT TCT ATG GCT AAA GAC AAA CGA TCC CAA ATC GCT CTT AAC 968
Ala Phe Val Ser Met Ala Lys Asp Lys Arg Ser Gln Ile Ala Leu Asn
295 300 305

GCC CAA GCT AAA GAC AAC GCC AAG CTA ACT TTT AAC GCC TTG TTA GAA 1016
Ala Gln Ala Lys Asp Asn Ala Lys Leu Thr Phe Asn Ala Leu Leu Glu
310 315 320

AGC CTT AGC GTG AAT TTC TTT CAA TCT TAC AAA ATA AGC CAT GAG TGATT 1066
Ser Leu Ser Val Asn Phe Phe Gln Ser Tyr Lys Ile Ser His Glu
325 330 335 

TCGAAGTCCC CCCCAAAGCT AAAGGGTTTA AACGCCTTTT TAAAGCCC 1114 

(2) INFORMATION FOR SEQ ID NO:406: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Met Ser Trp Met Val Val Ile Gly Ala Leu Ile Cys Met Leu Leu Gly
1 5 10 15

Val Phe Ile Phe Phe Thr Ser Met Ser Val Lys Lys Phe Leu Ser Ala
20 25 30

Tyr Leu Asn Ala Tyr Leu Asp Gln Arg Pro His Ile Lys Gly Met Gly
35 40 45

Ile Ala Gly Thr Pro Phe Glu Cys Glu Gly Phe Phe Lys Ile Ala Cys
50 55 60

Val Ser Lys Glu Leu Ser Phe Leu Asp Ser Gln Asn Ser Pro Ile Val
65 70 75 80

Asn Phe Lys Asn Leu Ser Ile Lys Leu Arg Ser Leu Asp Lys Ser Ser
85 90 95

Leu Thr Leu Ser Val His Ser Gln Ile Lys Ser Pro Ile Leu Glu Gln
100 105 110

Asp Met Gln Gln Lys Ile Ser Gln Ile Pro Leu Lys Asp Leu Asn Ala
115 120 125

Leu Leu Glu Lys Met Lys Pro Thr Arg Leu Asn Cys Ser Leu Thr Phe
130 135 140

Asn Ala Leu Asp Glu Lys Thr Leu Asn Asp Asn Leu Lys Cys Asp Leu
145 150 155 160

Thr Asn Ala Glu Asn Ile Leu Ala Tyr Thr Phe Phe Gln Glu Gly Leu
165 170 175

Met Glu Ala Gln Glu Asn Leu Ser Leu Lys Asn Ile Phe Lys Thr Leu
180 185 190

WO 98/43478



PCT/US98/06371 



GGC ACT CCC TTT GAA TGC GAA GGG TTT TTT AAA ATC GCA TGC GTT TCT 248
Gly Thr Pro Phe Glu Cys Glu Gly Phe Phe Lys Ile Ala Cys Val Ser
55 60 65

AAA GAG CTC AGT TTT TTA GAC TCT CAA AAC TCC CCT ATT GTG AAT TTT 296
Lys Glu Leu Ser Phe Leu Asp Ser Gln Asn Ser Pro Ile Val Asn Phe
70 75 80

AAA AAT TTG AGT ATT AAG CTC CGT TCT TTA GAT AAA AGC TCT CTT ACT 344
Lys Asn Leu Ser Ile Lys Leu Arg Ser Leu Asp Lys Ser Ser Leu Thr
85 90 95

CTT TCT GTC CAT TCT CAA ATC AAA TCC CCT ATT TTA GAA CAA GAT ATG 392
Leu Ser Val His Ser Gln Ile Lys Ser Pro Ile Leu Glu Gln Asp Met
100 105 110

CAG CAA AAA ATC AGC CAA ATC CCC CTA AAA GAC TTG AAT GCC TTA TTA 440
Gln Gln Lys Ile Ser Gln Ile Pro Leu Lys Asp Leu Asn Ala Leu Leu
115 120 125 130

GAA AAA ATG AAA CCC ACG CGC TTG AAT TGC TCT TTA ACA TTC AAC GCT 488
Glu Lys Met Lys Pro Thr Arg Leu Asn Cys Ser Leu Thr Phe Asn Ala
135 140 145

CTA GAT GAA AAA ACC TTA AAC GAC AAC TTA AAA TGC GAT TTG ACT AAT 536
Leu Asp Glu Lys Thr Leu Asn Asp Asn Leu Lys Cys Asp Leu Thr Asn
150 155 160

GCG GAA AAT ATC CTT GCT TAC ACT TTT TTT CAA GAG GGT TTA ATG GAG 584
Ala Glu Asn Ile Leu Ala Tyr Thr Phe Phe Gln Glu Gly Leu Met Glu
165 170 175

GCT CAA GAA AAT CTA TCC CTT AAA AAT ATT TTT AAA ACC TTG AGT TCT 632
Ala Gln Glu Asn Leu Ser Leu Lys Asn Ile Phe Lys Thr Leu Ser Ser
180 185 190

AAA GAC GCT AAA GCC ATA GAA GAG TTG CAA GAC AAA CTG CGT TTT TCA 680
Lys Asp Ala Lys Ala Ile Glu Glu Leu Gln Asp Lys Leu Arg Phe Ser
195 200 205 210

GCG CCA AAG TTG GGC GTT TCT ATC CAA GCG CAC CAT CTT AAA AAC CTT 728
Ala Pro Lys Leu Gly Val Ser Ile Gln Ala His His Leu Lys Asn Leu
215 220 225

TTG GAA GCC TTT TAT CAC CAA AAT AAA GAG AGT TTG GGC TTT TTT TCC 776
Leu Glu Ala Phe Tyr His Gln Asn Lys Glu Ser Leu Gly Phe Phe Ser
230 235 240

CCT TAT TTT AGT TTG CGA TCT CAA ACC CCT AGC GTC TCT TAT GAA AGC 824
Pro Tyr Phe Ser Leu Arg Ser Gln Thr Pro Ser Val Ser Tyr Glu Ser
245 250 255

GCG TTA GCT TCT TTA GAA AAC TAT TTT ATG GCT TTG TTC CAA TCC CAT 872
Ala Leu Ala Ser Leu Glu Asn Tyr Phe Met Ala Leu Phe Gln Ser His
260 265 270
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Tyr Ala Gly lie Met Gly Met Met Ala Ser His Ser Val Gly Gly His 

325 330 335 

Ser Leu Ala Gln Met Leu Ser Leu Ala Phe Thr His Ile Ala Asn Glu
340 345 350

Lys Thr Phe Val Leu Met Thr Phe Leu Ser Ala Gly Ile Val Asn Ile
355 360 365

Phe Ile Pro Ser Gly Gly Gly Gln Trp Ala Ile Gln Ala Pro Ile Met
370 375 380

Leu Pro Ala Gly Gln Ser Leu Gly Val Asp Pro Gly Val Val Ser Met
385 390 395 400

Ala Ile Ala Trp Gly Asp Ala Trp Thr Asn Met Ile Gln Pro Phe Trp
405 410 415

Ala Leu Pro Ala Leu Ala Ile Ala Gly Leu Gly Ala Lys Asp Ile Met
420 425 430

Gly Tyr Cys Val Leu Thr Leu Ile Phe Val Gly Leu Val Val Cys Gly
435 440 445

Val Phe Tyr Phe Leu Val
450 

(2) INFORMATION FOR SEQ ID NO:405:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1061 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:405: 

TATCACTATA CTTTCAATAA AATAAGGGGT TTTTTTTGAC TAAAAAATTC ATG TCT 56
Met Ser
1

TGG ATG GTG GTT ATT GGG GCT TTA ATT TGC ATG CTT TTA GGG GTG TTT 104
Trp Met Val Val Ile Gly Ala Leu Ile Cys Met Leu Leu Gly Val Phe
5 10 15

ATC TTC TTC ACT AGC ATG TCG GTT AAA AAA TTT TTA AGC GCT TAT CTT 152
Ile Phe Phe Thr Ser Met Ser Val Lys Lys Phe Leu Ser Ala Tyr Leu
20 25 30

AAC GCT TAT TTG GAT CAA CGC CCC CAT ATT AAG GGC ATG GGG ATT GCA 200
Asn Ala Tyr Leu Asp Gln Arg Pro His Ile Lys Gly Met Gly Ile Ala
35 40 45 50
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TTTTCTCT 14 65 

(2) INFORMATION FOR SEQ ID NO:404:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:404:



Met Phe Leu Leu Arg His Leu Thr Ser Ala Cys Val Phe Leu Ala Ser
1 5 10 15

Lys Cys Leu Pro Asp Ser Phe Val Leu Val Ala Leu Leu Ser Phe Val
20 25 30

Val Phe Val Leu Val Tyr Cys Leu Thr Gly Gln Asp Ala Phe Ser Val
35 40 45

Ile Ser Ser Trp Gly Asn Gly Ala Trp Thr Leu Leu Gly Phe Ser Met
50 55 60

Gln Met Ala Leu Ile Leu Val Leu Gly Gln Ala Leu Ala Asn Ala Lys
65 70 75 80

Leu Val Gln Lys Leu Leu Lys Tyr Leu Ala Ser Leu Pro Lys Gly Tyr
85 90 95

Tyr Thr Ala Leu Trp Leu Val Thr Phe Leu Ser Leu Ile Ala Asn Trp
100 105 110

Ile Asn Trp Gly Phe Gly Leu Val Ile Ser Ala Ile Phe Ala Lys Glu
115 120 125

Ile Ala Lys Asn Val Lys Gly Val Asp Tyr Arg Leu Leu Ile Ala Ser
130 135 140

Ala Tyr Ser Gly Phe Val Ile Trp His Gly Gly Leu Ser Gly Ser Ile
145 150 155 160

Pro Leu Ser Val Ala Thr Gln Asn Glu Asn Leu Ser Lys Ile Ser Ala
165 170 175

Gly Val Ile Glu Lys Ala Ile Pro Ile Ser Gln Thr Ile Phe Ser Ser
180 185 190

Tyr Asn Leu Ile Ile Ile Gly Ile Ile Leu Val Gly Leu Pro Phe Leu
195 200 205

Met Ala Met Ile His Pro Lys Lys Glu Glu Ile Val Glu Ile Asp Ser
210 215 220

Lys Leu Leu Lys Asp Glu Tyr Lys Glu Ile Glu Leu Ile Ser His Gln
225 230 235 240

Gln Asp Lys Thr Ile Ala His Phe Leu Glu Asn Ser Ala Leu Leu Ser
245 250 255

Tyr Leu Leu Val Phe Leu Gly Phe Gly Tyr Leu Gly Val Tyr Phe Phe
260 265 270

Lys Gly Gly Gly Ile Ser Leu Asn Ile Val Asn Thr Ile Phe Leu Phe
275 280 285

Leu Gly Ile Leu Leu His Lys Thr Pro Leu Ala Tyr Val Lys Ala Ile
290 295 300

Asp Arg Ser Ala Xaa Ser Val Ala Gly Ile Leu Leu Gln Phe Pro Phe
305 310 315 320
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Lys Thr lie Ala His Phe beu Glu Asn Ser Ala Leu Leu Ser Tyr Leu 
245 250 255 

TTG GTT TTT TTG GGT TTT GGG TAT CTT GGT GTT TAT TTT TTT AAA GGG 872
Leu Val Phe Leu Gly Phe Gly Tyr Leu Gly Val Tyr Phe Phe Lys Gly
260 265 270

GGA GGG ATT AGT TTA AAC ATT GTC AAT ACG ATT TTC CTT TTT TTA GGG 920
Gly Gly Ile Ser Leu Asn Ile Val Asn Thr Ile Phe Leu Phe Leu Gly
275 280 285 290

ATT TTA CTG CAT AAA ACC CCT TTA GCT TAT GTG AAA GCG ATC GAT CGT 968
Ile Leu Leu His Lys Thr Pro Leu Ala Tyr Val Lys Ala Ile Asp Arg
295 300 305

TCC GCT ANG AGC GTG GCT GGG ATT TTA TTG CAA TTC CCT TTT TAC GCT 1016
Ser Ala Xaa Ser Val Ala Gly Ile Leu Leu Gln Phe Pro Phe Tyr Ala
310 315 320

GGG ATT ATG GGG ATG ATG GCA AGC CAT AGC GTG GGG GGT CAT TCT TTA 1064
Gly Ile Met Gly Met Met Ala Ser His Ser Val Gly Gly His Ser Leu
325 330 335

GCG CAA ATG CTT TCT TTA GCT TTC ACG CAC ATC GCT AAT GAA AAA ACT 1112
Ala Gln Met Leu Ser Leu Ala Phe Thr His Ile Ala Asn Glu Lys Thr
340 345 350

TTC GTG CTC ATG ACT TTT TTG AGC GCA GGG ATT GTC AAT ATT TTT ATT 1160
Phe Val Leu Met Thr Phe Leu Ser Ala Gly Ile Val Asn Ile Phe Ile
355 360 365 370

CCG TCT GGC GGA GGG CAA TGG GCG ATT CAA GCT CCT ATC ATG CTT CCG 1208
Pro Ser Gly Gly Gly Gln Trp Ala Ile Gln Ala Pro Ile Met Leu Pro
375 380 385

GCT GGG CAA AGC TTA GGG GTG GAT CCG GGA GTG GTT TCT ATG GCT ATC 1256
Ala Gly Gln Ser Leu Gly Val Asp Pro Gly Val Val Ser Met Ala Ile
390 395 400

GCT TGG GGA GAT GCT TGG ACG AAT ATG ATA CAG CCT TTT TGG GCT TTG 1304
Ala Trp Gly Asp Ala Trp Thr Asn Met Ile Gln Pro Phe Trp Ala Leu
405 410 415

CCC GCT TTA GCC ATT GCG GGT TTG GGC GCT AAA GAT ATT ATG GGC TAT 1352
Pro Ala Leu Ala Ile Ala Gly Leu Gly Ala Lys Asp Ile Met Gly Tyr
420 425 430

TGC GTT TTG ACT TTA ATT TTT GTA GGC TTA GTC GTG TGT GGG GTG TTT 1400
Cys Val Leu Thr Leu Ile Phe Val Gly Leu Val Val Cys Gly Val Phe
435 440 445 450

TAT TTT TTA GTG TGAGTTTTTT ATGCCTAAAA CCATGCTCTT TTCAATGGGG TAAGG 1457
Tyr Phe Leu Val
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20 25 30 

GTT CTT GTT TAT TGC TTG ACA GGG CAA GAC GCT TTT TCT GTC ATT TCT 200
Val Leu Val Tyr Cys Leu Thr Gly Gln Asp Ala Phe Ser Val Ile Ser
35 40 45 50

AGT TGG GGG AAT GGC GCT TGG ACG CTT TTA GGT TTT TCT ATG CAA ATG 248
Ser Trp Gly Asn Gly Ala Trp Thr Leu Leu Gly Phe Ser Met Gln Met
55 60 65

GCC CTT ATT TTG GTG TTG GGT CAG GCT CTG GCT AAC GCT AAA TTA GTC 296
Ala Leu Ile Leu Val Leu Gly Gln Ala Leu Ala Asn Ala Lys Leu Val
70 75 80

CAA AAG CTT TTA AAA TAT CTA GCG TCT TTA CCT AAA GGG TAT TAT ACG 344
Gln Lys Leu Leu Lys Tyr Leu Ala Ser Leu Pro Lys Gly Tyr Tyr Thr
85 90 95

GCT TTA TGG TTG GTT ACT TTT TTA TCG TTA ATC GCT AAT TGG ATC AAC 392
Ala Leu Trp Leu Val Thr Phe Leu Ser Leu Ile Ala Asn Trp Ile Asn
100 105 110

TGG GGT TTT GGC TTG GTG ATT AGT GCG ATT TTT GCA AAA GAG ATC GCC 440
Trp Gly Phe Gly Leu Val Ile Ser Ala Ile Phe Ala Lys Glu Ile Ala
115 120 125 130

AAA AAT GTT AAG GGG GTG GAT TAC AGG CTG CTC ATT GCT AGC GCT TAT 488
Lys Asn Val Lys Gly Val Asp Tyr Arg Leu Leu Ile Ala Ser Ala Tyr
135 140 145

TCG GGT TTT GTC ATC TGG CAT GGG GGT TTA TCA GGC TCT ATC CCT TTA 536
Ser Gly Phe Val Ile Trp His Gly Gly Leu Ser Gly Ser Ile Pro Leu
150 155 160

AGC GTT GCC ACC CAA AAT GAA AAT CTA TCC AAA ATA AGC GCT GGG GTG 584
Ser Val Ala Thr Gln Asn Glu Asn Leu Ser Lys Ile Ser Ala Gly Val
165 170 175

ATT GAA AAA GCT ATC CCT ATC AGT CAG ACG ATT TTT TCT TCT TAT AAT 632
Ile Glu Lys Ala Ile Pro Ile Ser Gln Thr Ile Phe Ser Ser Tyr Asn
180 185 190

TTA ATC ATT ATA GGG ATC ATT CTT GTA GGG TTA CCC TTT TTA ATG GCA 680
Leu Ile Ile Ile Gly Ile Ile Leu Val Gly Leu Pro Phe Leu Met Ala
195 200 205 210

ATG ATC CAC CCT AAA AAA GAA GPJK ATC GTT GAG ATT GAT TCA AAG CTT 72 8 

Met lie His Pro Lys Lys Glu Glu lie Val Glu He Asp Ser Lys Leu 
215 220 225 

TTA AAA GAC GAG TAC AAA GAG ATT GAA CTC ATT AGC CAC CAA CAA GAC 776 
Leu Lys Asp Glu Tyr Lys Glu He Glu Leu He Ser His Gin Gin Asp 
230 235 240 

AAA ACG ATC GCG CAT TTT TTG GAA AAC AGC GCT TTG CTT TCT TAT CTT 824 
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lie His Asp Ser Val Gly Val Leu Asp Glu Tyr Phe Ala Gin Ser Glu 

35 40 45

Phe Phe Thr Asn Ile Glu Asp Phe Glu Lys His Leu Glu Gln Ser Lys
50 55 60

Asp Met Gly Lys Glu Ile Asn Tyr Leu Ser Val Cys Thr Pro Thr His
65 70 75 80

Thr His Phe Asp His Ile Arg Phe Gly Leu Arg Asn Gly Met His Val
85 90 95

Ile Cys Glu Lys Pro Leu Val Leu Asp Pro Gly Glu Ile Gln Glu Leu
100 105 110

Lys Asp Leu Glu Val Lys His Gln Lys Arg Val Phe Ser Leu Leu Pro
115 120 125

Leu Arg Leu His Cys Asp Thr Leu Ala Leu Lys Glu Lys Ile Lys Ser
130 135 140

Glu Leu Asp Lys Asn Pro Ser Lys Val Phe Asp Ile Thr Leu Thr Tyr
145 150 155 160

Ile Ser Val Gln Gly Lys Trp Tyr Phe Ser Ser Trp Arg Ala Asp Val
165 170 175

Asn Arg Ser Gly Gly Leu Ala Thr Gln Met Gly Val Asn Ile Phe Asp
180 185 190

Thr Leu Ile Tyr Leu Phe Gly Ser Val Lys Asp Lys Val Ile Asn Lys
195 200 205

Glu Glu Pro Asp Cys Val Gly Gly Tyr Ser Phe
210 215

(2) INFORMATION FOR SEQ ID NO:403:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1465 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1412
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403:

GACAGAAGCT GAATTTGAGG TGCGCTTATA GCTTGTAAAA GGGGGTGTTT ATG TTT 56
Met Phe
1

TTA TTA AGG CAT TTG ACT TCA GCG TGC GTG TTT TTG GCG TCT AAA TGT 104 
Leu Leu Arg His Leu Thr Ser Ala Cys Val Phe Leu Ala Ser Lys Cys 
5 10 15 

TTG CCG GAC TCC TTT GTC TTG GTC GCT CTT TTA TCG TTT GTC GTG TTT 152 
Leu Pro Asp Ser Phe Val Leu Val Ala Leu Leu Ser Phe Val Val Phe 
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TTT GAT CAC ATC CGT TTC GGG TTA AGA AAC GGC ATG CAT GTG ATT TGT 34 6 

Phe Asp His Ile Arg Phe Gly Leu Arg Asn Gly Met His Val Ile Cys
85 90 95

GAA AAA CCC TTA GTT TTA GAC CCT GGC GAA ATA CAA GAA TTG AAA GAT 394
Glu Lys Pro Leu Val Leu Asp Pro Gly Glu Ile Gln Glu Leu Lys Asp
100 105 110

TTA GAG GTG AAA CAC CAA AAA AGG GTG TTT AGT CTT TTA CCC TTG CGC 442
Leu Glu Val Lys His Gln Lys Arg Val Phe Ser Leu Leu Pro Leu Arg
115 120 125 130

TTG CAT TGC GAC ACG CTG GCT TTG AAA GAA AAA ATT AAG AGC GAA TTA 490
Leu His Cys Asp Thr Leu Ala Leu Lys Glu Lys Ile Lys Ser Glu Leu
135 140 145

GAC AAA AAC CCT AGC AAG GTG TTT GAC ATC ACG CTC ACT TAT ATC AGC 538
Asp Lys Asn Pro Ser Lys Val Phe Asp Ile Thr Leu Thr Tyr Ile Ser
150 155 160

GTT CAA GGG AAA TGG TAT TTT TCT TCA TGG CGA GCG GAT GTG AAT AGG 586
Val Gln Gly Lys Trp Tyr Phe Ser Ser Trp Arg Ala Asp Val Asn Arg
165 170 175

AGC GGA GGG TTA GCC ACT CAA ATG GGG GTG AAT ATT TTT GAC ACT TTA 634
Ser Gly Gly Leu Ala Thr Gln Met Gly Val Asn Ile Phe Asp Thr Leu
180 185 190

ATC TAT TTG TTT GGA AGC GTT AAA GAC AAG GTT ATC AAT AAA GAA GAG 682
Ile Tyr Leu Phe Gly Ser Val Lys Asp Lys Val Ile Asn Lys Glu Glu
195 200 205 210

CCT GAT TGC GTA GGG GGA TAC TCT TTT TAGAGCATGC CAAAATAAGA TGGTTTT 736
Pro Asp Cys Val Gly Gly Tyr Ser Phe
215

TTTCCATCAA TCCAGAACAC ATGGGAG 763

(2) INFORMATION FOR SEQ ID NO:402:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:402:

Met Leu Phe Ala Met Ile Gly Ser Gly Gly Phe Ile Ala Pro Lys His
1 5 10 15

Leu Gln Ala Ile Arg Asp Thr Gly His Phe Leu Asp Cys Ser Phe Asp
20 25 30




WO 98/43478



PCT/US98/06371 



Thr Gln Gly Val Tyr Asp Trp Asn Asp Tyr Ile Val Val Leu Asn Met
355 360 365

Gly Leu Lys Thr Thr Ile Phe Ser Gln Val Lys Asn Glu Trp Glu Leu
370 375 380

Leu Lys Val Ile Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp
385 390 395 400

Pro Ser Trp Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys
405 410 415

Leu Lys Phe



(2) INFORMATION FOR SEQ ID NO:401:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 763 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 53...709
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:401:

AAATAAGCTA ATCTTTGCTA AAATGAGATT TAAAATTATT TAAGGAAGAT GA ATG CTT 58
Met Leu
1

TTT GCG ATG ATT GGT TCA GGG GGG TTT ATC GCT CCC AAG CAC TTG CAA 106
Phe Ala Met Ile Gly Ser Gly Gly Phe Ile Ala Pro Lys His Leu Gln
5 10 15

GCG ATT AGA GAT ACA GGG CAT TTT TTG GAT TGC TCT TTT GAT ATT CAT 154
Ala Ile Arg Asp Thr Gly His Phe Leu Asp Cys Ser Phe Asp Ile His
20 25 30

GAT AGC GTG GGG GTT TTA GAT GAG TAT TTC GCG CAA TCA GAG TTT TTT 202
Asp Ser Val Gly Val Leu Asp Glu Tyr Phe Ala Gln Ser Glu Phe Phe
35 40 45 50

ACG AAT ATT GAA GAT TTT GAA AAG CAT TTA GAG CAA TCT AAG GAT ATG 250
Thr Asn Ile Glu Asp Phe Glu Lys His Leu Glu Gln Ser Lys Asp Met
55 60 65

GGT AAA GAA ATC AAC TAT TTG AGT GTT TGC ACG CCT ACG CAC ACG CAT 298
Gly Lys Glu Ile Asn Tyr Leu Ser Val Cys Thr Pro Thr His Thr His
70 75 80
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:400:



Met Lys Glu Gln Ser Met Ile Asp Phe Leu Lys Leu Arg Asp Tyr Asp
1 5 10 15

Ile Arg Lys Thr Gln Asn Ala Arg Trp Ile Asp Gln Lys Cys Thr Pro
20 25 30

Asp Val Leu Ser Leu Val Ala Asp Cys Ile Leu Glu Phe Thr Gln Cys
35 40 45

Asn Ile Gly Lys Ser Phe Ser Ile Arg Asp Ile Trp Asp Ser Pro Tyr
50 55 60

Thr Asn Glu Asn Val Lys Met Ile Phe Ser Lys Pro Asp Leu Asn Ser
65 70 75 80

Asp Phe Ser Met His Glu Tyr Asp Lys Phe Phe Ser Gln Pro Ile Lys
85 90 95

Leu Leu Ala Tyr Ser Gly Ile Leu Phe Glu Thr Lys Thr Gly Asn Arg
100 105 110

Asn Ile Tyr Thr Ile Gln Asn Ile Glu Leu Leu Glu Tyr Leu Met Gln
115 120 125

Arg Glu Thr Asn Ala Leu Lys Phe Leu Ile Leu Tyr Ile Gln Lys Val
130 135 140

Leu Met Asp Ser Gly Ile Tyr Pro Leu Phe Asp Asn Phe Leu Gln Lys
145 150 155 160

Gln Asp Thr Glu Ser Phe Lys Gln Leu Lys Asp Gly Phe Thr His Phe
165 170 175

Thr Ile Asn Asn Thr Ala Ile Asn Asn Ala Thr Glu Cys Phe Arg Ile
180 185 190

Phe Thr Lys Ile Ile Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly
195 200 205

Thr Arg Lys Gly Tyr Leu Ser Asn Thr Ile Ile Thr Lys Asp Glu Leu
210 215 220

Asn Tyr Asn Arg Ile Asn Trp Arg Asp Ile Gly Lys Asp Lys Asn Thr
225 230 235 240

Thr Arg Gln Glu Tyr Asp Leu Ile Asn Ser Lys Arg Ile Ala Asn Ser
245 250 255

Asn Tyr Leu Ile Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp
260 265 270

Arg Phe Asn Asn Ser Leu Ser Glu Val Lys Gln Glu Lys Glu Glu Ser
275 280 285

Gln Ala Thr Gln Ile His His Ile Phe Pro Ile Gln Asp Phe Pro Ile
290 295 300

Ile Ala Asn Tyr Ile Glu Asn Leu Ile Ala Leu Thr Pro Asn Gln His
305 310 315 320

Phe Ile Tyr Ala His Pro Asn Asn Gln Thr Arg Leu Ile Asp Lys Asp
325 330 335

Phe Gln Tyr Ile Cys Leu Leu Ala Lys Thr Thr Thr Ile Leu Asn Asp
340 345 350
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GGG TAT TTG TCC AAC ACT ATA ATT ACA AAA GAT GAG CTT AAT TAT AAT 72 9 

Gly Tyr Leu Ser Asn Thr Ile Ile Thr Lys Asp Glu Leu Asn Tyr Asn
215 220 225

CGT ATC AAT TGG CGA GAT ATA GGA AAA GAT AAA AAT ACC ACC AGA CAA 777
Arg Ile Asn Trp Arg Asp Ile Gly Lys Asp Lys Asn Thr Thr Arg Gln
230 235 240

GAA TAC GAT CTT ATA AAC TCT AAA AGG ATT GCT AAT TCT AAC TAT CTT 825
Glu Tyr Asp Leu Ile Asn Ser Lys Arg Ile Ala Asn Ser Asn Tyr Leu
245 250 255

ATT TCA AAA GCT AAG AAA GTG GTG AAA CGA TAT AAT GAT AGA TTT AAT 873
Ile Ser Lys Ala Lys Lys Val Val Lys Arg Tyr Asn Asp Arg Phe Asn
260 265 270 275

AAT TCT CTC TCT GAA GTA AAA CAA GAA AAA GAA GAG TCG CAA GCC ACA 921
Asn Ser Leu Ser Glu Val Lys Gln Glu Lys Glu Glu Ser Gln Ala Thr
280 285 290

CAA ATA CAC CAT ATT TTT CCC ATC CAA GAC TTT CCC ATT ATT GCT AAC 969
Gln Ile His His Ile Phe Pro Ile Gln Asp Phe Pro Ile Ile Ala Asn
295 300 305

TAT ATA GAG AAT CTT ATC GCA CTC ACT CCT AAT CAA CAT TTT ATT TAC 1017
Tyr Ile Glu Asn Leu Ile Ala Leu Thr Pro Asn Gln His Phe Ile Tyr
310 315 320

GCC CAC CCT AAT AAT CAA ACC CGC TTG ATT GAT AAA GAT TTT CAA TAT 1065
Ala His Pro Asn Asn Gln Thr Arg Leu Ile Asp Lys Asp Phe Gln Tyr
325 330 335

ATC TGC TTA TTA GCT AAA ACG ACC ACA ATT CTT AAT GAC ACT CAA GGC 1113
Ile Cys Leu Leu Ala Lys Thr Thr Thr Ile Leu Asn Asp Thr Gln Gly
340 345 350 355

GTA TAT GAT TGG AAT GAT TAT ATT GTT GTG TTG AAT ATG GGC CTC AAA 1161
Val Tyr Asp Trp Asn Asp Tyr Ile Val Val Leu Asn Met Gly Leu Lys
360 365 370

ACA ACT ATC TTT TCT CAA GTC AAG AAC GAA TGG GAA TTA TTA AAA GTA 1209
Thr Thr Ile Phe Ser Gln Val Lys Asn Glu Trp Glu Leu Leu Lys Val
375 380 385

ATA GAT GCT TTT TAT TTT GAT TTT AAC AAG AGC AAA GAT CCA AGT TGG 1257
Ile Asp Ala Phe Tyr Phe Asp Phe Asn Lys Ser Lys Asp Pro Ser Trp
390 395 400

TCA TAC TTG CTA GAT AAA AAC GAT TTA AGA GCT TTC AAG CTA AAA TTT T 1306
Ser Tyr Leu Leu Asp Lys Asn Asp Leu Arg Ala Phe Lys Leu Lys Phe
405 410 415

AATAAGTTTT ATTGAAACTG GCTATAAAAA CCCGCTTGAC TTATCTTATC CTTTT 1361

(2) INFORMATION FOR SEQ ID NO:400:
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TATTGATAGC ATGAGTTGTT TTTGGTTTGG AATTTTAAGG AGTAGCTT ATG AAA GAG 57 

Met Lys Glu 
1 

CAA TCA ATG ATT GAT TTT TTA AAA CTT AGA GAT TAT GAG ATT AGA AAA 105 
Gln Ser Met Ile Asp Phe Leu Lys Leu Arg Asp Tyr Asp Ile Arg Lys
5 10 15

ACA CAA AAT GCG CGA TGG ATA GAT CAA AAA TGC ACC CCT GAT GTG TTG 153
Thr Gln Asn Ala Arg Trp Ile Asp Gln Lys Cys Thr Pro Asp Val Leu
20 25 30 35

TCT CTT GTT GCT GAT TGT ATT TTA GAG TTT ACG CAA TGT AAT ATT GGA 201
Ser Leu Val Ala Asp Cys Ile Leu Glu Phe Thr Gln Cys Asn Ile Gly
40 45 50

AAA TCA TTT TCT ATT AGG GAT ATT TGG GAT AGC CCT TAC ACC AAT GAA 249
Lys Ser Phe Ser Ile Arg Asp Ile Trp Asp Ser Pro Tyr Thr Asn Glu
55 60 65

AAT GTT AAA ATG ATT TTT TCT AAA CCT GAT TTA AAT TCT GAC TTT TCC 297
Asn Val Lys Met Ile Phe Ser Lys Pro Asp Leu Asn Ser Asp Phe Ser
70 75 80

ATG CAT GAA TAC GAT AAG TTT TTT TCT CAG CCT ATT AAA TTA TTA GCC 345
Met His Glu Tyr Asp Lys Phe Phe Ser Gln Pro Ile Lys Leu Leu Ala
85 90 95

TAT AGC GGT ATT TTA TTT GAA ACA AAA ACT GGC AAT AGA AAT ATT TAT 393
Tyr Ser Gly Ile Leu Phe Glu Thr Lys Thr Gly Asn Arg Asn Ile Tyr
100 105 110 115

ACC ATA CAA AAC ATA GAG CTA TTA GAA TAT CTC ATG CAA AGA GAA ACA 441
Thr Ile Gln Asn Ile Glu Leu Leu Glu Tyr Leu Met Gln Arg Glu Thr
120 125 130

AAC GCT TTG AAA TTC CTT ATT TTA TAT ATT CAA AAG GTA TTA ATG GAT 489
Asn Ala Leu Lys Phe Leu Ile Leu Tyr Ile Gln Lys Val Leu Met Asp
135 140 145

AGT GGG ATT TAT CCT TTA TTT GAC AAC TTT TTA CAA AAA CAA GAC ACA 537
Ser Gly Ile Tyr Pro Leu Phe Asp Asn Phe Leu Gln Lys Gln Asp Thr
150 155 160

GAA AGT TTT AAG CAA CTA AAA GAT GGT TTC ACT CAT TTT ACT ATC AAT 585
Glu Ser Phe Lys Gln Leu Lys Asp Gly Phe Thr His Phe Thr Ile Asn
165 170 175

AAC ACA GCA ATC AAT AAC GCT ACG GAA TGT TTT AGG ATT TTT ACT AAA 633
Asn Thr Ala Ile Asn Asn Ala Thr Glu Cys Phe Arg Ile Phe Thr Lys
180 185 190 195

ATT ATC AAT CCT TTA GCT TTT TAT TAT GGT AAA AAA GGC ACA AGA AAA 681
Ile Ile Asn Pro Leu Ala Phe Tyr Tyr Gly Lys Lys Gly Thr Arg Lys
200 205 210
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15 10 15 

Cys Val Lys Cys Ala Lys Cys Val Pro Gly Cys Thr Ile Tyr Arg Ile
20 25 30

His Lys Asp Glu Ala Thr Ser Pro Arg Gly Phe Leu Asp Leu Met Arg
35 40 45

Leu Asn Ala Gln Asn Lys Leu Gln Leu Asp Thr Asn Leu Lys His Leu
50 55 60

Leu Glu Thr Cys Phe Leu Cys Thr Ala Cys Val Glu Ile Cys Pro Phe
65 70 75 80

His Leu Pro Ile Asp Thr Leu Ile Glu Lys Ala Arg Glu Lys Ile Ala
85 90 95

Gln Lys His Gly Ile Ala Trp Tyr Lys Lys Ser Tyr Phe Ser Leu Leu
100 105 110

Lys Asn Arg Lys Lys Met Asp Arg Val Phe Ser Thr Ala His Phe Leu
115 120 125

Ala Pro Cys Val Phe Lys Gln Val Gly Asp Ser Leu Glu Pro Arg Ala
130 135 140

Val Phe Lys Gly Leu Phe Lys Arg Phe Lys Lys Ser Ala Leu Pro Pro
145 150 155 160

Leu Asn Gln Lys Ser Phe Leu Gln Lys His Ala Glu Met Lys Leu Leu
165 170 175

Glu Asn Pro Ile Gln Lys Val Ala Ile Phe Ile Gly Cys Leu Ser Asn
180 185 190

Tyr His Tyr Gln Gln Val Gly Glu Ser Leu Leu Tyr Ile Leu Glu Lys
195 200 205

Leu Asn Ile Gln Ala Ile Ile Pro Lys Gln Glu Cys Cys Ser Ala Pro
210 215 220

Ala Tyr Phe Thr Gly Asp Lys Asp Thr Thr Leu Phe Leu Val Lys Lys
225 230 235 240

Asn Ile Glu Trp Phe Glu Ser Tyr Leu Asp Lys Val Asp Ala Ile Ile
245 250 255

Val Pro Glu Ala Thr Cys Ala Thr Cys Ser Ser Thr Ile Ile Thr Arg
260 265 270

Cys Phe Trp Ala Lys Lys Ile Arg Ile Cys Met
275 280

(2) INFORMATION FOR SEQ ID NO:399:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1361 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 49...1305
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:399:
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Cys Val Phe Lys Gin Val Gly Asp Ser Leu Glu Pro Arg Ala Val Phe 
135 140 145 

AAA GGT TTG TTC AAA CGC TTT AAA AAA AGC GCG CTG CCT CCT TTA AAT 536
Lys Gly Leu Phe Lys Arg Phe Lys Lys Ser Ala Leu Pro Pro Leu Asn
150 155 160

CAA AAA AGT TTT TTA CAA AAG CAT GCA GAA ATG AAG CTT TTA GAA AAC 584
Gln Lys Ser Phe Leu Gln Lys His Ala Glu Met Lys Leu Leu Glu Asn
165 170 175

CCC ATT CAA AAA GTG GCC ATT TTT ATA GGG TGC TTG AGC AAT TAC CAT 632
Pro Ile Gln Lys Val Ala Ile Phe Ile Gly Cys Leu Ser Asn Tyr His
180 185 190

TAC CAG CAA GTG GGG GAA AGC TTG TTG TAT ATT TTA GAA AAA CTC AAC 680
Tyr Gln Gln Val Gly Glu Ser Leu Leu Tyr Ile Leu Glu Lys Leu Asn
195 200 205 210

ATT CAA GCG ATC ATC CCT AAG CAA GAA TGC TGC TCA GCG CCT GCG TAT 728
Ile Gln Ala Ile Ile Pro Lys Gln Glu Cys Cys Ser Ala Pro Ala Tyr
215 220 225

TTT ACC GGC GAT AAA GAC ACC ACG CTT TTT TTA GTG AAA AAA AAC ATA 776
Phe Thr Gly Asp Lys Asp Thr Thr Leu Phe Leu Val Lys Lys Asn Ile
230 235 240

GAA TGG TTT GAA AGC TAT TTA GAT AAA GTG GAT GCG ATC ATT GTG CCT 824
Glu Trp Phe Glu Ser Tyr Leu Asp Lys Val Asp Ala Ile Ile Val Pro
245 250 255

GAA GCC ACA TGC GCT ACA TGC TCA TCA ACG ATT ATT ACA AGG TGT TTT 872
Glu Ala Thr Cys Ala Thr Cys Ser Ser Thr Ile Ile Thr Arg Cys Phe
260 265 270

TGG GCG AAA AAG ATA AGG ATT TGT ATG TGAAGCGCTT GGAAAAAATC ACGCCTA 926
Trp Ala Lys Lys Ile Arg Ile Cys Met
275 280

AAATCTATCT GGCGAGCGTG TTTTTA 952
(2) INFORMATION FOR SEQ ID NO:398:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 283 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:398:
Met Asn Glu Asn Ile Asn Glu Asn Ile Phe Glu Glu Val Gly Asp Ala
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 952 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...899
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:397: 

GAGTTTGACG CGTATTTGCG TGGGGGCGAA AAACATTTCA GTAAAACGCT ATG AAT 56 

Met Asn 
1 

GAA AAT ATT AAT GAA AAT ATT TTT GAA GAA GTA GGG GAC GCT TGC GTT 104 
Glu Asn Ile Asn Glu Asn Ile Phe Glu Glu Val Gly Asp Ala Cys Val
5 10 15

AAA TGC GCT AAG TGC GTG CCA GGC TGC ACC ATA TAC CGC ATT CAT AAA 152
Lys Cys Ala Lys Cys Val Pro Gly Cys Thr Ile Tyr Arg Ile His Lys
20 25 30

GAC GAG GCG ACT TCG CCT AGA GGC TTT TTA GAT TTG ATG CGC TTA AAC 200
Asp Glu Ala Thr Ser Pro Arg Gly Phe Leu Asp Leu Met Arg Leu Asn
35 40 45 50

GCT CAA AAC AAG CTC CAA TTA GAC ACG AAT TTA AAA CAC CTT TTA GAA 248
Ala Gln Asn Lys Leu Gln Leu Asp Thr Asn Leu Lys His Leu Leu Glu
55 60 65

ACT TGC TTT TTA TGC ACC GCT TGC GTG GAA ATT TGC CCT TTT CAT TTG 296
Thr Cys Phe Leu Cys Thr Ala Cys Val Glu Ile Cys Pro Phe His Leu
70 75 80

CCC ATA GAC ACC TTA ATA GAA AAA GCC AGA GAA AAA ATC GCT CAA AAG 344
Pro Ile Asp Thr Leu Ile Glu Lys Ala Arg Glu Lys Ile Ala Gln Lys
85 90 95

CAT GGC ATC GCT TGG TAT AAA AAA TCC TAT TTT TCC CTT TTA AAA AAC 392
His Gly Ile Ala Trp Tyr Lys Lys Ser Tyr Phe Ser Leu Leu Lys Asn
100 105 110

CGC AAA AAA ATG GAT AGG GTG TTT TCA ACT GCG CAT TTT TTA GCC CCT 440
Arg Lys Lys Met Asp Arg Val Phe Ser Thr Ala His Phe Leu Ala Pro
115 120 125 130

TGC GTT TTC AAG CAA GTA GGG GAT AGT TTA GAG CCT AGG GCG GTG TTT 488
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AAT TTA CGC CAA AAA GAT GGG AAT TTG CCT AAT ATC AAC ACC TTA AAC 44 0 

Asn Leu Arg Gln Lys Asp Gly Asn Leu Pro Asn Ile Asn Thr Leu Asn
115 120 125 130

TTT GTC AAA CAA ATC AAA AAA GAA CAC CCT AAT TTA TTC TTT AAT TTT 488
Phe Val Lys Gln Ile Lys Lys Glu His Pro Asn Leu Phe Phe Asn Phe
135 140 145

GAC AAC ATG TTC AAA CAA CCC CCT TTT AAT GAG AAT AAT TTT GAA AAT 536
Asp Asn Met Phe Lys Gln Pro Pro Phe Asn Glu Asn Asn Phe Glu Asn
150 155 160

TTT GAC AAT AGC GAT GAG GAA AAT TTT TAATGCAAAC CATTGATTTT GAAAAAT 590
Phe Asp Asn Ser Asp Glu Glu Asn Phe
165 170

TTTCACAATA TTCCAAGCCC GGCCCA 616

(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:396: 

Val Glu Lys Leu Pro Lys Lys Arg Val Ser Lys Thr Lys Ser Gln Lys
1 5 10 15

Leu Ile His Ser Leu Thr Thr Gln Lys Asn Arg Ala Phe Leu Lys Lys
20 25 30

Ile Ser Ala Asn Glu Met Leu Leu Glu Leu Glu Lys Gly Ala Phe Lys
35 40 45

Lys Asn Glu Ala Tyr Phe Ile Ser Asp Glu Glu Asp Lys Asn Tyr Val
50 55 60

Leu Val Pro Asp Asn Val Ile Ser Leu Leu Ala Glu Asn Ala Arg Lys
65 70 75 80

Ala Phe Glu Ala Arg Leu Arg Ala Glu Leu Glu Arg Asp Ile Ile Thr
85 90 95

Gln Ala Pro Ile Asp Phe Glu Asp Val Arg Glu Val Ser Leu Gln Leu
100 105 110

Leu Glu Asn Leu Arg Gln Lys Asp Gly Asn Leu Pro Asn Ile Asn Thr
115 120 125

Leu Asn Phe Val Lys Gln Ile Lys Lys Glu His Pro Asn Leu Phe Phe
130 135 140

Asn Phe Asp Asn Met Phe Lys Gln Pro Pro Phe Asn Glu Asn Asn Phe
145 150 155 160

Glu Asn Phe Asp Asn Ser Asp Glu Glu Asn Phe
165 170

(2) INFORMATION FOR SEQ ID NO:397:
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Leu Tyx Asn Phe lie Gin Lys Phe 
355 360 

(2) INFORMATION FOR SEQ ID NO:395:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 616 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...563
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:395:

TAATGAGAAT TAAACGAAAT TGGATACAAT CAGCTTAAAA AGGATATAAA GTG GAA 56 

Val Glu 
1 

AAA TTA CCT AAA AAA CGA GTT TCT AAA ACC AAA TCA CAA AAA CTT ATC 104 
Lys Leu Pro Lys Lys Arg Val Ser Lys Thr Lys Ser Gln Lys Leu Ile
5 10 15

CAT AGC TTA ACC ACC CAA AAA AAC AGA GCC TTT CTC AAA AAA ATC AGC 152
His Ser Leu Thr Thr Gln Lys Asn Arg Ala Phe Leu Lys Lys Ile Ser
20 25 30

GCT AAT GAA ATG CTT TTA GAA TTA GAA AAA GGG GCG TTT AAA AAA AAT 200
Ala Asn Glu Met Leu Leu Glu Leu Glu Lys Gly Ala Phe Lys Lys Asn
35 40 45 50

GAA GCT TAT TTT ATT TCT GAT GAA GAA GAT AAA AAT TAT GTT TTG GTG 248
Glu Ala Tyr Phe Ile Ser Asp Glu Glu Asp Lys Asn Tyr Val Leu Val
55 60 65

CCA GAT AAC GTG ATC TCT CTT TTG GCA GAA AAC GCC AGA AAG GCT TTT 296
Pro Asp Asn Val Ile Ser Leu Leu Ala Glu Asn Ala Arg Lys Ala Phe
70 75 80

GAA GCC AGG CTT AGG GCG GAA TTA GAA AGG GAT ATT ATC ACC CAA GCG 344
Glu Ala Arg Leu Arg Ala Glu Leu Glu Arg Asp Ile Ile Thr Gln Ala
85 90 95

CCG ATT GAT TTT GAA GAC GTG CGC GAA GTT TCC TTG CAA CTA TTG GAA 392
Pro Ile Asp Phe Glu Asp Val Arg Glu Val Ser Leu Gln Leu Leu Glu
100 105 110
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:394: 



Met Asp Phe Leu Glu Lys Val Leu Asp Asn Gln Val Thr Glu Ser Lys
1 5 10 15

Glu Leu Val Arg Leu Tyr Asp Tyr Asp Leu Tyr Thr Leu Gly Glu Val
20 25 30

Ala Asp Arg Met Arg Gln Asn Met His Gln Lys Ile Val Tyr Phe Asn
35 40 45

Val Asn Arg His Leu Asn Pro Ser Asn Ile Cys Ala Asp Ala Cys Lys
50 55 60

Phe Cys Ala Phe Ser Ala His Arg Lys Asn Pro Asn Pro Tyr Glu Met
65 70 75 80

Ser Leu Glu Glu Ile Leu Glu Lys Val Lys Asn Ser Tyr Asn Lys Gly
85 90 95

Ile Lys Glu Val His Ile Val Ser Ala His Asn Pro Asn Tyr Ser Tyr
100 105 110

Glu Trp Tyr Leu Lys Val Phe Glu Thr Ile Lys Gln Glu Met Pro Asn
115 120 125

Leu His Leu Lys Ala Met Thr Ala Ala Glu Val His Phe Leu Ser Val
130 135 140

Lys Phe Asn Lys Pro Phe Glu Leu Val Leu Glu Asp Met Leu Lys Ala
145 150 155 160

Gly Val Asp Ser Met Pro Gly Gly Gly Ala Glu Ile Phe Asp Glu Glu
165 170 175

Ile Arg Arg Lys Ile Cys Asn Gly Lys Val Gly Ser Ser Arg Trp Leu
180 185 190

Glu Ile His Ala Tyr Trp His Lys Leu Gly Lys Met Ser Asn Ala Thr
195 200 205

Met Leu Phe Gly His Ile Glu Asn Lys Ile His Arg Ile Asp His Met
210 215 220

Leu Arg Ile Lys Lys Ile Gln Ser Pro Lys Asn Gln Val Glu Asn Lys
225 230 235 240

Glu Gly Gly Phe Asn Ala Phe Ile Pro Leu Leu Tyr Gln Lys Glu Asn
245 250 255

Asn Tyr Leu Asn Val Glu Lys Ser Pro Ser Ala Ile Glu Ile Leu Lys
260 265 270

Thr Ile Ala Ile Ser Arg Ile Leu Leu Asn Asn Ile Pro His Ile Lys
275 280 285

Ala Tyr Trp Ala Thr Leu Gly Leu Asn Leu Ala Leu Val Ala Gln Glu
290 295 300

Phe Gly Ala Asn Asp Leu Asp Gly Thr Ile Glu Ile Glu Ser Ile Gln
305 310 315 320

Ser Ala Ala Gly Ala Lys Ser Arg His Gly Leu Glu Lys Glu Asp Leu
325 330 335

Ile Phe Lys Ile Lys Asp Ala Gly Phe Val Ala Val Glu Arg Asp Ser
340 345 350
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GAT TCC ATG CCT GOT GGG GGG GCG GAG ATT TTT GAT GAA GAA ATC AGG 5 84 

Asp Ser Met Pro Gly Gly Gly Ala Glu Ile Phe Asp Glu Glu Ile Arg
165 170 175

CGT AAA ATC TGT AAT GGT AAG GTG GGA TCT TCT CGG TGG TTA GAA ATC 632
Arg Lys Ile Cys Asn Gly Lys Val Gly Ser Ser Arg Trp Leu Glu Ile
180 185 190

CAT GCT TAT TGG CAC AAA TTA GGC AAA ATG AGT AAC GCT ACC ATG CTT 680
His Ala Tyr Trp His Lys Leu Gly Lys Met Ser Asn Ala Thr Met Leu
195 200 205 210

TTT GGG CAT ATT GAA AAT AAA ATC CAT CGC ATC GAT CAC ATG CTA AGA 728
Phe Gly His Ile Glu Asn Lys Ile His Arg Ile Asp His Met Leu Arg
215 220 225

ATC AAA AAA ATC CAA AGC CCT AAA AAT CAA GTA GAA AAC AAA GAA GGG 776
Ile Lys Lys Ile Gln Ser Pro Lys Asn Gln Val Glu Asn Lys Glu Gly
230 235 240

GGT TTT AAC GCT TTT ATC CCC TTG TTG TAT CAA AAA GAA AAC AAT TAT 824
Gly Phe Asn Ala Phe Ile Pro Leu Leu Tyr Gln Lys Glu Asn Asn Tyr
245 250 255

TTG AAT GTG GAA AAA TCC CCC AGT GCG ATA GAA ATC TTA AAA ACC ATC 872
Leu Asn Val Glu Lys Ser Pro Ser Ala Ile Glu Ile Leu Lys Thr Ile
260 265 270

GCC ATA TCT CGC ATT CTT TTA AAC AAT ATC CCT CAC ATT AAA GCT TAT 920
Ala Ile Ser Arg Ile Leu Leu Asn Asn Ile Pro His Ile Lys Ala Tyr
275 280 285 290

TGG GCG ACT TTG GGC TTG AAT TTG GCT TTA GTG GCT CAA GAA TTT GGC 968
Trp Ala Thr Leu Gly Leu Asn Leu Ala Leu Val Ala Gln Glu Phe Gly
295 300 305

GCT AAC GAT TTA GAC GGC ACG ATA GAG ATA GAG AGC ATT CAA AGC GCG 1016
Ala Asn Asp Leu Asp Gly Thr Ile Glu Ile Glu Ser Ile Gln Ser Ala
310 315 320

GCA GGC GCA AAG AGC CGG CAT GGT TTA GAA AAA GAA GAT TTG ATA TTT 1064
Ala Gly Ala Lys Ser Arg His Gly Leu Glu Lys Glu Asp Leu Ile Phe
325 330 335

AAA ATC AAG GAC GCT GGT TTT GTT GCG GTA GAA AGG GAT AGT TTG TAT 1112
Lys Ile Lys Asp Ala Gly Phe Val Ala Val Glu Arg Asp Ser Leu Tyr
340 345 350

AAT TTT ATA CAG AAA TTT TAATAATTTT TAGCGTTTTT AAGAATGATT AGTTATAA 1168
Asn Phe Ile Gln Lys Phe
355 360

TAACGCTACT AACAA 1183

(2) INFORMATION FOR SEQ ID NO:394:
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1130
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:393:

ATCTAAAAAT GCTACAATGT TTATCTTTAA AACGAAAGGG CAATTTAACC ATG GAC 56
Met Asp
1

TTT TTA GAA AAA GTA TTA GAC AAT CAA GTT ACT GAA AGT AAA GAA TTG 104
Phe Leu Glu Lys Val Leu Asp Asn Gln Val Thr Glu Ser Lys Glu Leu
5 10 15

GTC AGG CTT TAT GAT TAT GAT TTA TAC ACG CTA GGG GAA GTA GCG GAT 152
Val Arg Leu Tyr Asp Tyr Asp Leu Tyr Thr Leu Gly Glu Val Ala Asp
20 25 30

CGC ATG CGC CAA AAC ATG CAC CAA AAA ATC GTG TAT TTT AAT GTC AAT 200
Arg Met Arg Gln Asn Met His Gln Lys Ile Val Tyr Phe Asn Val Asn
35 40 45 50

AGG CAT TTA AAC CCT AGC AAT ATT TGC GCG GAC GCT TGC AAA TTT TGC 248
Arg His Leu Asn Pro Ser Asn Ile Cys Ala Asp Ala Cys Lys Phe Cys
55 60 65

GCT TTT TCA GCC CAC AGA AAA AAC CCA AAC CCT TAT GAA ATG AGC TTA 296
Ala Phe Ser Ala His Arg Lys Asn Pro Asn Pro Tyr Glu Met Ser Leu
70 75 80

GAA GAA ATC CTA GAA AAG GTT AAA AAC TCC TAC AAC AAG GGG ATT AAA 344
Glu Glu Ile Leu Glu Lys Val Lys Asn Ser Tyr Asn Lys Gly Ile Lys
85 90 95

GAA GTC CAT ATC GTG AGC GCT CAT AAC CCT AAT TAC TCC TAT GAA TGG 392
Glu Val His Ile Val Ser Ala His Asn Pro Asn Tyr Ser Tyr Glu Trp
100 105 110

TAT TTA AAG GTG TTT GAA ACC ATC AAG CAA GAA ATG CCT AAC TTG CAT 440
Tyr Leu Lys Val Phe Glu Thr Ile Lys Gln Glu Met Pro Asn Leu His
115 120 125 130

TTA AAG GCC ATG ACC GCT GCA GAA GTG CAT TTT TTA AGC GTT AAA TTC 488
Leu Lys Ala Met Thr Ala Ala Glu Val His Phe Leu Ser Val Lys Phe
135 140 145

AAC AAA CCT TTT GAA TTG GTG CTA GAA GAC ATG CTC AAA GCC GGG GTG 536
Asn Lys Pro Phe Glu Leu Val Leu Glu Asp Met Leu Lys Ala Gly Val
150 155 160
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TTA GAA GAC GCT ATT AAT TTT GGG CAA GAA TTA GAA AAC GCT AAC CCT 3 92 

Leu Glu Asp Ala Ile Asn Phe Gly Gln Glu Leu Glu Asn Ala Asn Pro
100 105 110

AAT TAC CAA CAA AAA ATC GCT AAC GCT ACC GGC TTA GCC CTT AAG AAT 440
Asn Tyr Gln Gln Lys Ile Ala Asn Ala Thr Gly Leu Ala Leu Lys Asn
115 120 125 130

AAA AAA GAA AAA GGA TAGAATTGGA TTTTTTAGAG ATTGTAGGAC AAGTCCCTTT A 496
Lys Lys Glu Lys Gly
135

AAAGGAGAGG TA 508

(2) INFORMATION FOR SEQ ID NO:392:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 135 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:392:



Met Gly Asn Leu Thr Tyr Tyr Ala Tyr Met Tyr Leu Ile Leu Phe Val
1 5 10 15

Cys Leu Leu Pro Val Leu Leu Met Gly Leu Val Trp Arg Leu Thr Arg
20 25 30

Pro Pro Leu Lys Gln Asn Ile Pro Asn Lys Ser Leu Ser Leu Glu Asn
35 40 45

Leu Asn Glu Gln Ile Lys Asn Leu Lys Ser Val Pro Ala Leu Glu Lys
50 55 60

Leu Lys Asn Asp Phe Asn Glu Arg Phe Lys Ile Cys Pro Lys Asp Lys
65 70 75 80

Glu Thr Leu Trp Leu Glu Thr Ile Gln Lys Leu Val Ala Ser Glu Phe
85 90 95

Phe Glu Leu Glu Asp Ala Ile Asn Phe Gly Gln Glu Leu Glu Asn Ala
100 105 110

Asn Pro Asn Tyr Gln Gln Lys Ile Ala Asn Ala Thr Gly Leu Ala Leu
115 120 125

Lys Asn Lys Lys Glu Lys Gly
130 135



(2) INFORMATION FOR SEQ ID NO:393:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1183 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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130 

135 140

Leu Lys Ala Ile Glu Thr Gln Leu Asn Ala Trp Gly Val Lys Met Gln
145 150 155 160

Arg Thr Asp His Ile Ala Leu Glu Cys Ile Ala Glu Leu Ser Tyr Lys
165 170 175

Gly Phe



(2) INFORMATION FOR SEQ ID NO:391:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...455
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:391:

ATTGAAGCGA GTAACGCTTA TTATAAAAAA CGCTTATAAA TCTTATCAAC ATG GGC 56
Met Gly
1

AAT TTG ACT TAT TAC GCT TAC ATG TAT TTG ATC CTC TTT GTA TGC TTG 104
Asn Leu Thr Tyr Tyr Ala Tyr Met Tyr Leu Ile Leu Phe Val Cys Leu
5 10 15

CTG CCT GTG TTA TTA ATG GGG CTT GTT TGG AGG CTT ACT CGC CCC CCC 152
Leu Pro Val Leu Leu Met Gly Leu Val Trp Arg Leu Thr Arg Pro Pro
20 25 30

TTA AAG CAA AAT ATT CCT AAT AAA AGC CTC TCT TTA GAA AAT TTA AAC 200
Leu Lys Gln Asn Ile Pro Asn Lys Ser Leu Ser Leu Glu Asn Leu Asn
35 40 45 50

GAA CAA ATC AAA AAC CTT AAA AGC GTA CCA GCT TTA GAA AAA CTG AAA 248
Glu Gln Ile Lys Asn Leu Lys Ser Val Pro Ala Leu Glu Lys Leu Lys
55 60 65

AAC GAC TTC AAT GAG CGT TTT AAA ATT TGC CCC AAA GAT AAA GAA ACT 296
Asn Asp Phe Asn Glu Arg Phe Lys Ile Cys Pro Lys Asp Lys Glu Thr
70 75 80

CTG TGG TTA GAA ACG ATC CAA AAA TTA GTC GCT TCA GAA TTT TTT GAA 344
Leu Trp Leu Glu Thr Ile Gln Lys Leu Val Ala Ser Glu Phe Phe Glu
85 90 95
85 90 95 
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95 100 105 

TTT TTA GGG GAT TTG CCT AAA ACT TTT ATC GTG GGG CTT GTG CCT TTT 448
Phe Leu Gly Asp Leu Pro Lys Thr Phe Ile Val Gly Leu Val Pro Phe
110 115 120

GTG ATA GGG AGC GAG ACC ACT TTC AAG CTT TCA AGC AAA ATT TTA AAC 496
Val Ile Gly Ser Glu Thr Thr Phe Lys Leu Ser Ser Lys Ile Leu Asn
125 130 135

GCT TTA GAA ACC GCC TTA AAA GCC ATA GAA ACC CAA CTC AAC GCA TGG 544
Ala Leu Glu Thr Ala Leu Lys Ala Ile Glu Thr Gln Leu Asn Ala Trp
140 145 150 155

GGG GTT AAA ATG CAA CGC ACC GAT CAT ATC GCT TTA GAA TGT ATC GCT 592
Gly Val Lys Met Gln Arg Thr Asp His Ile Ala Leu Glu Cys Ile Ala
160 165 170

GAA CTT TCT TAT AAG GGT TTT TGAATTGGTT TTTGTTTTTC TTTTTAAATG CGTT 647
Glu Leu Ser Tyr Lys Gly Phe
175

AATGAAGAAA CAAGCCTGAA TTTTACGCCC CTTTTAGAGC GAATGGCATG CAATTTGCAA 707

GCGCGTTTTT ATA 720

(2) INFORMATION FOR SEQ ID NO: 390:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 178 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:

Met Ser Gln Lys Ile Leu Ile Leu Gly Ile Gly Asn Ile Leu Phe Gly
1 5 10 15

Asp Glu Gly Ile Gly Val His Leu Ala His Tyr Leu Lys Lys Asn Phe
20 25 30

Ser Phe Phe Pro Ser Val Asp Ile Ile Asp Gly Gly Thr Met Ala Gln
35 40 45

Gln Leu Ile Pro Leu Ile Thr Ser Tyr Glu Lys Val Leu Ile Leu Asp
50 55 60

Cys Val Ser Ala Glu Gly Val Glu Ile Gly Ser Val Tyr Ala Phe Asp
65 70 75 80

Phe Lys Asp Ala Pro Lys Glu Ile Thr Trp Ala Gly Ser Ala His Glu
85 90 95

Val Glu Met Leu His Thr Leu Arg Leu Thr Glu Phe Leu Gly Asp Leu
100 105 110

Pro Lys Thr Phe Ile Val Gly Leu Val Pro Phe Val Ile Gly Ser Glu
115 120 125

Thr Thr Phe Lys Leu Ser Ser Lys Ile Leu Asn Ala Leu Glu Thr Ala
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530 535 540

Ile His Ser Phe Asp Pro Cys Ile Ala Cys Ser Val His Val Met Asp
545 550 555 560

Phe Lys Gly Gln Ser Leu Asn Glu Phe Lys Val Glu Pro Asn Phe Ala
565 570 575

Lys Phe



(2) INFORMATION FOR SEQ ID NO: 389:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 720 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 80...613
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:

ATATGGTGTT TTTCCATTCT ATCAGGTATG AAAGTTCGGG GGCGGATTCT ATGATTAATG 60
GCTATGGTTA TACCAAAGA ATG AGT CAA AAA ATC CTA ATT CTA GGT ATT GGC 112
Met Ser Gln Lys Ile Leu Ile Leu Gly Ile Gly
1 5 10

AAT ATC CTT TTT GGC GAT GAA GGG ATT GGG GTG CAT TTA GCC CAC TAC 160
Asn Ile Leu Phe Gly Asp Glu Gly Ile Gly Val His Leu Ala His Tyr
15 20 25

CTC AAA AAA AAT TTT TCT TTT TTC CCT AGC GTG GAT ATT ATA GAT GGG 208
Leu Lys Lys Asn Phe Ser Phe Phe Pro Ser Val Asp Ile Ile Asp Gly
30 35 40

GGG ACA ATG GCC CAG CAG CTC ATT CCT TTA ATC ACT TCG TAT GAA AAG 256
Gly Thr Met Ala Gln Gln Leu Ile Pro Leu Ile Thr Ser Tyr Glu Lys
45 50 55

GTT TTG ATT TTG GAT TGC GTG AGC GCT GAA GGC GTT GAG ATA GGA TCA 304
Val Leu Ile Leu Asp Cys Val Ser Ala Glu Gly Val Glu Ile Gly Ser
60 65 70 75

GTC TAT GCT TTT GAT TTT AAG GAC GCT CCT AAA GAA ATC ACA TGG GCT 352
Val Tyr Ala Phe Asp Phe Lys Asp Ala Pro Lys Glu Ile Thr Trp Ala
80 85 90

GGG AGC GCT CAT GAA GTG GAA ATG CTA CAC ACT TTA AGG CTC ACG GAG 400
Gly Ser Ala His Glu Val Glu Met Leu His Thr Leu Arg Leu Thr Glu
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Asn Met Ala Leu Leu Phe His Asp His Val Val His Phe Tyr Thr Leu
100 105 110

His Gly Leu Asp Trp Cys Asp Ile Met Ser Ala Leu Lys Ala Asp Pro
115 120 125

Ile Gln Ala Ala Lys Leu Ser Phe Lys Tyr Ser Pro Tyr Pro Ile Asn
130 135 140

Thr Gly Ala Gly Glu Leu Lys Ala Val Gln Lys Arg Leu Ser Asp Phe
145 150 155 160

Ala Lys Ser Gly Ser Leu Gly Pro Phe Ser Asn Gly Tyr Tyr Gly His
165 170 175

Lys Thr Tyr Arg Leu Ser Pro Glu Gln Asn Leu Ile Val Leu Ser His
180 185 190

Tyr Leu Lys Leu Leu Glu Ile Gln Arg Glu Ala Ala Lys Met Thr Ala
195 200 205

Ile Phe Gly Ala Lys Gln Pro His Pro Gln Ser Leu Thr Val Gly Gly
210 215 220

Val Thr Ser Val Met Asp Ile Leu Asp Pro Thr Arg Leu Ala Glu Trp
225 230 235 240

Lys Ser Lys Phe Glu Val Val Ala Asn Phe Ile Asn His Ala Tyr Tyr
245 250 255

Pro Asp Leu Val Met Ala Gly Glu Met Phe Ala Asn Glu Gln Ser Val
260 265 270

Ile Lys Gly Cys Gly Leu Arg Asn Phe Ile Ala Tyr Glu Glu Val Leu
275 280 285

Leu Gly Arg Asp Lys Tyr Leu Leu Ser Ser Gly Val Val Leu Asp Gly
290 295 300

Asp Ile Ser Lys Leu His Pro Ile Asp Glu Ser Leu Ile Lys Glu Glu
305 310 315 320

Val Thr His Ser Trp Tyr Gln Tyr Glu Asp Thr Lys Glu Val Gln Leu
325 330 335

His Pro Tyr Asp Gly Gln Thr Asn Pro His Tyr Thr Gly Leu Lys Asp
340 345 350

Gly Glu Ser Val Gly Ile Glu Asn Lys Ile Ile Pro Ala Lys Val Leu
355 360 365

Asp Thr Lys Asn Lys Tyr Ser Trp Ile Lys Ser Pro Arg Tyr Asp Ser
370 375 380

Lys Pro Met Glu Val Gly Pro Leu Ser Ser Val Val Val Gly Leu Ala
385 390 395 400

Ala Lys Asn Pro Tyr Val Thr Glu Val Ala Thr Lys Phe Leu Lys Asp
405 410 415

Thr Lys Leu Pro Leu Glu Ala Leu Phe Ser Thr Leu Gly Arg Thr Ala
420 425 430

Ala Arg Cys Ile Glu Ala Lys Thr Ile Ala Asp Asn Gly Leu Leu Ala
435 440 445

Phe Asp Ala Leu Val Glu Asn Leu Lys Ser Asp Gln Ser Thr Cys Ala
450 455 460

Pro Tyr His Ile Asp Lys Asn Gln Glu Tyr Lys Gly Arg Tyr Ile Gly
465 470 475 480

Gln Val Pro Arg Gly Met Leu Ser His Trp Val Arg Ile Lys Asn Gly
485 490 495

Val Val Glu Asn Tyr Gln Ala Val Val Pro Ser Thr Trp Asn Ala Gly
500 505 510

Pro Arg Asp Ser Gln Asn Gln Arg Gly Ala Tyr Glu Met Ser Leu Ile
515 520 525

Gly Thr Lys Ile Ala Asp Leu Thr Gln Pro Leu Glu Ile Ile Arg Thr
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CAC ATT GAT AAA AAT CAA GAA TAT AAA GGG CGC TAC ATT GGT CAA GTG 1496
His Ile Asp Lys Asn Gln Glu Tyr Lys Gly Arg Tyr Ile Gly Gln Val
470 475 480

CCA AGG GGC ATG CTA AGC CAT TGG GTG CGT ATT AAA AAC GGC GTG GTG 1544
Pro Arg Gly Met Leu Ser His Trp Val Arg Ile Lys Asn Gly Val Val
485 490 495

GAA AAT TAT CAA GCG GTG GTG CCT TCT ACT TGG AAT GCA GGG CCT AGA 1592
Glu Asn Tyr Gln Ala Val Val Pro Ser Thr Trp Asn Ala Gly Pro Arg
500 505 510

GAT TCT CAA AAT CAA AGG GGG GCT TAT GAA ATG AGC TTG ATT GGC ACT 1640
Asp Ser Gln Asn Gln Arg Gly Ala Tyr Glu Met Ser Leu Ile Gly Thr
515 520 525 530

AAA ATC GCT GAT TTA ACC CAG CCT TTA GAA ATC ATT AGG ACT ATC CAT 1688
Lys Ile Ala Asp Leu Thr Gln Pro Leu Glu Ile Ile Arg Thr Ile His
535 540 545

TCT TTT GAC CCA TGC ATC GCA TGC TCG GTG CAT GTG ATG GAT TTT AAA 1736
Ser Phe Asp Pro Cys Ile Ala Cys Ser Val His Val Met Asp Phe Lys
550 555 560

GGG CAG TCT TTA AAC GAG TTT AAA GTA GAG CCT AAT TTC GCT AAA TTC T 1785
Gly Gln Ser Leu Asn Glu Phe Lys Val Glu Pro Asn Phe Ala Lys Phe
565 570 575

AAAAAGGGTT ACGCATGGAT AAAATGAATA AGGTCGTTTT ACACAAAGAA TA 1837

(2) INFORMATION FOR SEQ ID NO: 388:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 578 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388:

Met Ser Lys Lys Ile Val Val Asp Pro Ile Thr Arg Ile Glu Gly His
1 5 10 15

Leu Arg Ile Glu Val Ile Val Asp Asp Asp Asn Val Ile Thr Asp Ala
20 25 30

Phe Ser Ser Ser Thr Leu Phe Arg Gly Leu Glu Thr Ile Ile Lys Gly
35 40 45

Arg Asp Pro Arg Asp Ala Gly Phe Ile Ala Gln Arg Ile Cys Gly Val
50 55 60

Cys Thr Tyr Ser His Tyr Lys Ala Gly Ile Thr Ala Val Glu Asn Ala
65 70 75 80

Leu Gly Ile Thr Pro Pro Leu Asn Ala Gln Leu Val Arg Ser Leu Met
85 90 95
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AAG TTT GAA GTG GTG GCC AAT TTC ATC AAC CAT GCT TAC TAC CCT GAT 824
Lys Phe Glu Val Val Ala Asn Phe Ile Asn His Ala Tyr Tyr Pro Asp
245 250 255

TTG GTG ATG GCA GGC GAA ATG TTC GCT AAC GAA CAA TCC GTT ATC AAA 872
Leu Val Met Ala Gly Glu Met Phe Ala Asn Glu Gln Ser Val Ile Lys
260 265 270

GGC TGT GGC TTA AGG AAT TTT ATC GCT TAT GAA GAA GTG CTG CTT GGG 920
Gly Cys Gly Leu Arg Asn Phe Ile Ala Tyr Glu Glu Val Leu Leu Gly
275 280 285 290

AGG GAT AAA TAC CTT TTG AGT AGT GGG GTG GTG CTT GAT GGG GAT ATT 968
Arg Asp Lys Tyr Leu Leu Ser Ser Gly Val Val Leu Asp Gly Asp Ile
295 300 305

TCT AAA TTA CAC CCC ATT GAT GAA AGT TTG ATT AAA GAA GAA GTT ACG 1016
Ser Lys Leu His Pro Ile Asp Glu Ser Leu Ile Lys Glu Glu Val Thr
310 315 320

CAT TCT TGG TAT CAA TAC GAA GAC ACT AAA GAA GTG CAA CTC CAC CCT 1064
His Ser Trp Tyr Gln Tyr Glu Asp Thr Lys Glu Val Gln Leu His Pro
325 330 335

TAT GAC GGG CAA ACG AAC CCG CAT TAT ACC GGT TTA AAA GAC GGC GAG 1112
Tyr Asp Gly Gln Thr Asn Pro His Tyr Thr Gly Leu Lys Asp Gly Glu
340 345 350

AGC GTG GGG ATT GAA AAT AAA ATC ATC CCT GCT AAA GTG CTT GAC ACT 1160
Ser Val Gly Ile Glu Asn Lys Ile Ile Pro Ala Lys Val Leu Asp Thr
355 360 365 370

AAA AAT AAA TAT TCT TGG ATA AAA TCG CCC AGA TAC GAT AGT AAG CCC 1208
Lys Asn Lys Tyr Ser Trp Ile Lys Ser Pro Arg Tyr Asp Ser Lys Pro
375 380 385

ATG GAA GTA GGT CCT TTA AGT TCC GTA GTG GTA GGT TTA GCG GCG AAA 1256
Met Glu Val Gly Pro Leu Ser Ser Val Val Val Gly Leu Ala Ala Lys
390 395 400

AAC CCT TAT GTT ACT GAA GTG GCT ACG AAG TTT TTA AAA GAC ACT AAA 1304
Asn Pro Tyr Val Thr Glu Val Ala Thr Lys Phe Leu Lys Asp Thr Lys
405 410 415

CTG CCT TTA GAG GCG TTG TTT TCA ACG CTT GGG CGA ACA GCT GCA AGG 1352
Leu Pro Leu Glu Ala Leu Phe Ser Thr Leu Gly Arg Thr Ala Ala Arg
420 425 430

TGT ATT GAA GCT AAA ACG ATC GCT GAT AAT GGC CTT TTG GCG TTT GAT 1400
Cys Ile Glu Ala Lys Thr Ile Ala Asp Asn Gly Leu Leu Ala Phe Asp
435 440 445 450

GCG TTA GTG GAA AAT CTA AAA AGC GAT CAA AGC ACT TGT GCT CCT TAT 1448
Ala Leu Val Glu Asn Leu Lys Ser Asp Gln Ser Thr Cys Ala Pro Tyr
455 460 465









Ile Glu Val Ile Val Asp Asp Asp Asn Val Ile Thr Asp Ala Phe Ser
20 25 30

TCT TCT ACG CTT TTT AGG GGG CTA GAA ACC ATT ATT AAA GGC AGA GAT 200
Ser Ser Thr Leu Phe Arg Gly Leu Glu Thr Ile Ile Lys Gly Arg Asp
35 40 45 50

CCA CGA GAT GCA GGC TTC ATC GCT CAA AGG ATT TGC GGG GTA TGC ACT 248
Pro Arg Asp Ala Gly Phe Ile Ala Gln Arg Ile Cys Gly Val Cys Thr
55 60 65

TAT TCG CAT TAT AAG GCC GGT ATC ACG GCG GTA GAA AAC GCT CTA GGC 296
Tyr Ser His Tyr Lys Ala Gly Ile Thr Ala Val Glu Asn Ala Leu Gly
70 75 80

ATC ACT CCC CCA TTA AAC GCG CAA TTG GTG CGA TCT TTG ATG AAC ATG 344
Ile Thr Pro Pro Leu Asn Ala Gln Leu Val Arg Ser Leu Met Asn Met
85 90 95

GCG CTG CTT TTT CAT GAC CAT GTG GTG CAT TTC TAT ACT TTG CAT GGG 392
Ala Leu Leu Phe His Asp His Val Val His Phe Tyr Thr Leu His Gly
100 105 110

CTT GAT TGG TGC GAT ATC ATG AGC GCT TTA AAA GCC GAT CCC ATT CAA 440
Leu Asp Trp Cys Asp Ile Met Ser Ala Leu Lys Ala Asp Pro Ile Gln
115 120 125 130

GCG GCA AAA CTT TCT TTC AAA TAC AGC CCT TAC CCT ATT AAT ACC GGT 488
Ala Ala Lys Leu Ser Phe Lys Tyr Ser Pro Tyr Pro Ile Asn Thr Gly
135 140 145

GCC GGT GAA TTA AAA GCG GTT CAA AAA CGC TTG AGC GAT TTC GCT AAA 536
Ala Gly Glu Leu Lys Ala Val Gln Lys Arg Leu Ser Asp Phe Ala Lys
150 155 160

AGC GGA TCT TTG GGG CCT TTC AGT AAC GGC TAT TAC GGG CAT AAA ACT 584
Ser Gly Ser Leu Gly Pro Phe Ser Asn Gly Tyr Tyr Gly His Lys Thr
165 170 175

TAT CGT TTA AGT CCG GAG CAA AAT TTA ATC GTC TTA AGC CAC TAC CTC 632
Tyr Arg Leu Ser Pro Glu Gln Asn Leu Ile Val Leu Ser His Tyr Leu
180 185 190

AAG CTT TTA GAA ATC CAA AGG GAA GCG GCG AAA ATG ACC GCT ATT TTT 680
Lys Leu Leu Glu Ile Gln Arg Glu Ala Ala Lys Met Thr Ala Ile Phe
195 200 205 210

GGG GCC AAA CAG CCT CAC CCA CAA AGC CTA ACG GTG GGG GGT GTT ACG 728
Gly Ala Lys Gln Pro His Pro Gln Ser Leu Thr Val Gly Gly Val Thr
215 220 225

AGT GTT ATG GAT ATA TTG GAT CCG ACG AGA TTG GCT GAA TGG AAG AGC 776
Ser Val Met Asp Ile Leu Asp Pro Thr Arg Leu Ala Glu Trp Lys Ser
230 235 240
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

Met Val Met Arg Pro Gly Val Leu Gln Val Gly Ala Met Tyr Ala Asn
1 5 10 15

Gly Val Gly Ile Gln Thr Asn Arg Leu Lys Ala Ala Arg Tyr Tyr Glu
20 25 30

Trp Val Ala Ala Gly Ala Met Arg Pro Leu Ala Arg Ile Trp Leu Arg
35 40 45

Cys Met Lys Thr Arg Lys Met Arg Ile Gln Thr Ile Lys Lys Thr Leu
50 55 60

Cys Asn Cys Met Arg Trp Leu Val Lys Gly Gly Ile Cys Ser His Ala
65 70 75 80

Ile Ile Trp Gly Gly Cys Leu Leu Thr Glu Val Gly Ser Gln Lys Ile
85 90 95

Ile Thr Lys Arg
100

(2) INFORMATION FOR SEQ ID NO: 387:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1837 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1784
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:

CGCATGCGCT CCTTTCTAAA GCGATCAAAA ACAAAGAGTA AGGGATTAAC ATG TCA 56
Met Ser
1

AAA AAA ATC GTA GTC GAT CCT ATC ACT AGG ATT GAG GGG CAT TTA AGG 104
Lys Lys Ile Val Val Asp Pro Ile Thr Arg Ile Glu Gly His Leu Arg
5 10 15

ATT GAA GTG ATC GTA GAT GAT GAT AAC GTG ATC ACT GAT GCG TTT TCT 152 
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(2) INFORMATION FOR SEQ ID NO: 385:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 403 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...350
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:

AAAGGGCGAT TATCAAGGGG CTTTCAAGCT TTTTTCCCAA TCGTGCGATA ATG GTA 56
Met Val
1

ATG CGG CCG GGT GTT TTG CAA GTG GGG GCG ATG TAT GCT AAT GGG GTA 104
Met Arg Pro Gly Val Leu Gln Val Gly Ala Met Tyr Ala Asn Gly Val
5 10 15

GGG ATC CAA ACC AAC AGA TTA AAA GCC GCT CGC TAT TAT GAA TGG GTT 152
Gly Ile Gln Thr Asn Arg Leu Lys Ala Ala Arg Tyr Tyr Glu Trp Val
20 25 30

GCA GCG GGG GCG ATG CGA CCG CTT GCG CGA ATC TGG CTC AGA TGT ATG 200
Ala Ala Gly Ala Met Arg Pro Leu Ala Arg Ile Trp Leu Arg Cys Met
35 40 45 50

AAA ACA AGA AAA ATG CGG ATT CAA ACG ATA AAG AAA ACG CTT TGC AAT 248
Lys Thr Arg Lys Met Arg Ile Gln Thr Ile Lys Lys Thr Leu Cys Asn
55 60 65

TGT ATG CGG TGG CTT GTC AAG GGG GGG ATA TGC TCG CAT GCA ATA ATT 296
Cys Met Arg Trp Leu Val Lys Gly Gly Ile Cys Ser His Ala Ile Ile
70 75 80

TGG GGT GGA TGT TTG CTA ACG GAA GTG GGG TCC CAA AAG ATT ATT ACA 344
Trp Gly Gly Cys Leu Leu Thr Glu Val Gly Ser Gln Lys Ile Ile Thr
85 90 95

AAG CGA TAAGTTATTA TAAATTTTCA TGCGAGAATG GGAATGATAT GGGGTGTTAT AA 402
Lys Arg 
100 



403 



(2) INFORMATION FOR SEQ ID NO: 386: 
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Leu Met Arg Leu Asp Lys Phe Lys Ile Lys Lys Ala Gln Asn Leu Leu
130 135 140

GAC GCT ATT TTA AAA AGC AAA AAC CCT CCC TTA TGG CGT TTG ATT AAC 600
Asp Ala Ile Leu Lys Ser Lys Asn Pro Pro Leu Trp Arg Leu Ile Asn
145 150 155

GCT TTA GGG ATT GAG CAT ATT GGT AAG GGA GCG AGT AAA ACG CTG GCC 648
Ala Leu Gly Ile Glu His Ile Gly Lys Gly Ala Ser Lys Thr Leu Ala
160 165 170 175

AAA TAC GGC TTA AAT GTG TTA GAA AAA AGC GAA SCG AGT TTT TAGAAATGG 699
Lys Tyr Gly Leu Asn Val Leu Glu Lys Ser Glu Xaa Ser Phe
180 185

AAGGCTTTGG GGTGGAAATG GCGCGCTCTT TAGTCAATTT TTATGCGAGC AATCAA 755

(2) INFORMATION FOR SEQ ID NO: 384:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 189 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:

Met Ile Asn Arg Ala Thr Leu His Asn Tyr Ser Glu Ile Glu Lys Lys
1 5 10 15

Asn Ile Met Leu Ser Asp Arg Val Val Val Ile Arg Ser Gly Asp Val
20 25 30

Ile Pro Lys Ile Ile Lys Pro Leu Glu Ser Tyr Arg Asp Gly Ser Gln
35 40 45

His Lys Ile Glu Arg Pro Lys Val Cys Pro Ile Cys Ser His Glu Leu
50 55 60

Leu Cys Glu Glu Ile Phe Thr Tyr Cys Gln Asn Leu Asn Cys Pro Ala
65 70 75 80

Arg Leu Lys Glu Ser Leu Ile His Phe Ala Ser Lys Asp Ala Leu Asn
85 90 95

Ile Gln Gly Leu Gly Asp Lys Val Ile Glu Gln Leu Phe Glu Glu Lys
100 105 110

Leu Ile Phe Asn Ala Leu Asp Leu Tyr Ala Leu Lys Leu Glu Asp Leu
115 120 125

Met Arg Leu Asp Lys Phe Lys Ile Lys Lys Ala Gln Asn Leu Leu Asp
130 135 140

Ala Ile Leu Lys Ser Lys Asn Pro Pro Leu Trp Arg Leu Ile Asn Ala
145 150 155 160

Leu Gly Ile Glu His Ile Gly Lys Gly Ala Ser Lys Thr Leu Ala Lys
165 170 175

Tyr Gly Leu Asn Val Leu Glu Lys Ser Glu Xaa Ser Phe
180 185
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(2) INFORMATION FOR SEQ ID NO: 383:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 755 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 124...690
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:

TTGCGCTTAT AAATTCCCGG CTTTAGAAAA ACACACCAAA ATTGTAGGAG TCATTAACCA 60

AGTGGGGCGC ASSGGGCGAT CACACCGGTC GCTCTTTTAG AGCCTGTGGA AATTGCTGGA 120

GCT ATG ATT AAT AGA GCG ACC TTA CAC AAT TAT TCT GAA ATT GAA AAA 168
Met Ile Asn Arg Ala Thr Leu His Asn Tyr Ser Glu Ile Glu Lys
1 5 10 15

AAG AAT ATC ATG CTC AGT GAT AGG GTC GTT GTC ATT AGA AGC GGC GAT 216
Lys Asn Ile Met Leu Ser Asp Arg Val Val Val Ile Arg Ser Gly Asp
20 25 30

GTG ATC CCT AAA ATC ATC AAG CCT TTA GAA TCT TAT AGA GAC GGC TCG 264
Val Ile Pro Lys Ile Ile Lys Pro Leu Glu Ser Tyr Arg Asp Gly Ser
35 40 45

CAA CAT AAA ATT GAA CGC CCC AAG GTT TGC CCT ATA TGT TCG CAT GAG 312
Gln His Lys Ile Glu Arg Pro Lys Val Cys Pro Ile Cys Ser His Glu
50 55 60

CTT TTG TGC GAA GAG ATT TTT ACT TAT TGT CAA AAC CTT AAT TGC CCG 360
Leu Leu Cys Glu Glu Ile Phe Thr Tyr Cys Gln Asn Leu Asn Cys Pro
65 70 75

GCA AGG TTG AAA GAA AGC TTG ATT CAT TTC GCT TCT AAA GAC GCT TTA 408
Ala Arg Leu Lys Glu Ser Leu Ile His Phe Ala Ser Lys Asp Ala Leu
80 85 90 95

AAC ATT CAA GGC TTG GGC GAT AAA GTC ATA GAG CAA CTT TTT GAA GAA 456
Asn Ile Gln Gly Leu Gly Asp Lys Val Ile Glu Gln Leu Phe Glu Glu
100 105 110

AAG CTC ATT TTT AAC GCT CTG GAT TTG TAT GCT TTA AAA TTA GAA GAT 504
Lys Leu Ile Phe Asn Ala Leu Asp Leu Tyr Ala Leu Lys Leu Glu Asp
115 120 125

TTA ATG CGG CTA GAC AAA TTT AAA ATT AAA AAA GCT CAA AAT CTA TTA 552 
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1540 1545 1550 

Ser Asp Lys Pro Met Asp Ile His Ala Pro Ser Leu Glu Tyr Tyr Ile
1555 1560 1565

Lys Tyr Ile Gln Gly Ser Ala Gly Leu Asp Ala Ile Lys Ser Ala Gly
1570 1575 1580

Asn Asn Ser Ile Leu Trp Leu Asn Glu Leu Phe Val Ala Lys Gly Gly
1585 1590 1595 1600

Asn Pro Leu Phe Ala Pro Tyr Tyr Leu Gln Asp Asn Pro Thr Glu His
1605 1610 1615

Ile Val Thr Leu Met Lys Asp Ile Thr Ser Ala Leu Gly Met Leu Ser
1620 1625 1630

Lys Pro Asn Leu Lys Asn Asn Ser Thr Asp Ala Leu Gln Leu Asn Thr
1635 1640 1645

Tyr Thr Gln Gln Met Ser Arg Leu Ala Lys Leu Ser Asn Phe Ala Ser
1650 1655 1660

Phe Asp Ser Thr Asp Phe Ser Glu Arg Leu Ser Ser Leu Lys Asn Gln
1665 1670 1675 1680

Arg Phe Ala Asp Ala Ile Pro Asn Ala Met Asp Val Ile Leu Lys Tyr
1685 1690 1695

Ser Gln Arg Asp Lys Leu Lys Asn Asn Leu Trp Ala Thr Gly Val Gly
1700 1705 1710

Gly Val Ser Phe Val Glu Asn Gly Thr Gly Thr Leu Tyr Gly Val Asn
1715 1720 1725

Val Gly Tyr Asp Arg Phe Ile Lys Gly Val Ile Val Gly Gly Tyr Ala
1730 1735 1740

Ala Tyr Gly Tyr Ser Gly Phe Tyr Glu Arg Ile Thr Asn Ser Lys Ser
1745 1750 1755 1760

Asp Asn Val Asp Val Gly Leu Tyr Ala Arg Ala Phe Ile Lys Lys Ser
1765 1770 1775

Glu Leu Thr Phe Ser Val Asn Glu Thr Trp Gly Ala Asn Lys Asn Gln
1780 1785 1790

Ile Ser Ser Asn Asp Thr Leu Leu Ser Met Ile Asn Gln Ser Tyr Lys
1795 1800 1805

Tyr Ser Thr Trp Thr Thr Asn Ala Lys Val Asn Tyr Gly Tyr Asp Phe
1810 1815 1820

Met Phe Lys Asn Lys Ser Ile Ile Leu Lys Pro Gln Ile Gly Leu Arg
1825 1830 1835 1840

Tyr Tyr Tyr Ile Gly Met Thr Gly Leu Glu Gly Val Met His Asn Ala
1845 1850 1855

Leu Tyr Asn Gln Phe Lys Ala Asn Ala Asp Pro Ser Lys Lys Ser Val
1860 1865 1870

Leu Thr Ile Glu Leu Ala Leu Glu Asn Arg His Tyr Phe Asn Thr Asn
1875 1880 1885

Ser Tyr Phe Tyr Ala Ile Gly Gly Phe Gly Arg Asp Leu Leu Val Asn
1890 1895 1900

Ser Met Gly Asp Lys Leu Val Arg Phe Ile Gly Asn Asn Thr Leu Ser
1905 1910 1915 1920

Tyr Arg Lys Gly Glu Leu Tyr Asn Thr Phe Ala Ser Ile Thr Thr Gly
1925 1930 1935

Gly Glu Val Arg Leu Phe Lys Ser Phe Tyr Ala Asn Ala Gly Val Gly
1940 1945 1950

Ala Arg Phe Gly Leu Asp Tyr Lys Met Ile Asn Ile Thr Gly Asn Ile
1955 1960 1965

Gly Met Arg Leu Ala Phe
1970 1 
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Leu Ile Asn Lys Leu Ile Ser Trp Leu Gly Pro Gln Asp Leu Ser Val
1105 1110 1115 1120

Leu Val Asn Ile Ala Leu Asn Ser Ile Thr Asn Pro Ser Lys Glu Leu
1125 1130 1135

Thr Ser Thr Ile Ser Ser Ile Gly Glu Lys Ala Leu Asn Asp Leu Leu
1140 1145 1150

Gly Asp Gly Val Val Asn Lys Ile Met Ser Asn Gln Val Leu Gly Gln
1155 1160 1165

Met Ile Asn Lys Ile Ile Ala Asp Lys Gly Phe Gly Gly Val Tyr Gln
1170 1175 1180

Gln Gly Leu Gly Ser Ile Leu Pro Gln Ser Leu Gln Asp Glu Leu Lys
1185 1190 1195 1200

Lys Leu Gly Met Gly Ser Leu Leu Gly Ser Arg Gly Leu His Asn Leu
1205 1210 1215

Trp Gln Arg Gly Asn Phe Asn Phe Val Ala Lys Asp Tyr Leu Phe Thr
1220 1225 1230

Asn Asn Ser Ser Phe Ser Asn Ala Thr Gly Gly Glu Leu Asn Phe Val
1235 1240 1245

Ala Gly Lys Ser Ile Ile Phe Asn Gly Lys Asn Thr Ile Asn Phe Thr
1250 1255 1260

Gln Tyr Gln Gly Lys Leu Ser Phe Ile Ser Lys Asp Phe Ser Asn Ile
1265 1270 1275 1280

Ser Leu Asp Thr Leu Asn Ala Thr Asn Gly Leu Thr Leu Asn Ala Pro
1285 1290 1295

Lys Asn Asp Ile Ser Val Gln Lys Gly Gln Ile Cys Val Asn Val Leu
1300 1305 1310

Asn Cys Met Gly Glu Lys Lys Ala His Ser Ser Ser Ala Thr Ala Pro
1315 1320 1325

Thr Asn Glu Thr Leu Glu Ala Asn Ala Asn Asn Phe Ala Phe Leu Gly
1330 1335 1340

Ala Ile Lys Ala Asn Gly Leu Val Asp Phe Ser Lys Val Leu Gln Asn
1345 1350 1355 1360

Thr Thr Ile Gly Thr Leu Asp Leu Gly Pro Asn Ala Thr Phe Lys Ala
1365 1370 1375

Asn His Leu Ile Val Asn Asn Ala Phe Asn Asn Asn Ser Asn Tyr Arg
1380 1385 1390

Ala Asp Ile Ser Gly Asn Leu Asn Val Val Lys Gly Ala Ala Leu Ser
1395 1400 1405

Thr Asn Glu Asn Gly Leu Asn Val Gly Gly Asp Phe Lys Ser Glu Gly
1410 1415 1420

Ser Leu Ile Phe Asn Leu Asn Asn Lys Thr Asn Gln Thr Ile Ile Asn
1425 1430 1435 1440

Val Ala Gly Asn Ser Thr Ile Met Ser Tyr Asn Asn Gln Ala Leu Ile
1445 1450 1455

His Phe Asn Thr Gln Leu Lys Gln Gly Ala Tyr Thr Leu Ile Asn Ala
1460 1465 1470

Lys Arg Met Leu Tyr Gly Tyr Asp Asn Gln Ile Ile Arg Gly Gly Ser
1475 1480 1485

Leu Ser Asp Tyr Leu Lys Leu Tyr Thr Leu Ile Asp Phe Asn Gly Lys
1490 1495 1500

Arg Met Gln Leu Asn Gly Asp Ser Leu Ser Tyr Asp Asn Gln Pro Val
1505 1510 1515 1520

Asn Ile Lys Asp Gly Gly Leu Val Val Ser Phe Lys Asp Asn Gln Gly
1525 1530 1535

Gln Met Val Tyr Ser Ser Ile Leu Tyr Asp Lys Val Gln Val Ser Val
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660 665 670 

Glu Asn Ser Tyr Asn Tyr Thr Ser Asp Lys Val Gly Thr Tyr Tyr Leu
675 680 685

Thr Ser Asn Ile Lys Gly Phe Asn Gln Asn Asn Lys Thr Pro Gly Thr
690 695 700

Tyr Asn Ala Gln Asn Gln Pro Leu Gln Ala Leu His Ile Tyr Asn Gln
705 710 715 720

Ala Ile Thr Lys Gln Asp Leu Asn Met Ile Ala Ser Leu Gly Lys Glu
725 730 735

Phe Leu Pro Lys Ile Ala Asn Leu Leu Ser Ser Gly Ala Leu Asp Asn
740 745 750

Leu Asn Ser Pro Asn Ser Phe Glu Thr Leu Phe Gly Ile Phe Glu Lys
755 760 765

Tyr Gly Ile Thr Leu Asn Gln Glu Asn Trp Lys Ser Leu Leu Lys Ile
770 775 780

Ile Asn Asn Phe Ser Asn Thr Thr Asn Tyr Asp Phe Ser Gln Gly Asn
785 790 795 800

Leu Val Val Gly Ala Ile Lys Glu Gly Gln Thr Asn Thr Lys Ser Val
805 810 815

Val Trp Phe Gly Gly Glu Gly Tyr Lys Glu Pro Cys Ala Val Gly Asp
820 825 830

Asn Thr Cys Gln Met Phe Arg Gln Thr Asn Leu Gly Gln Leu Leu His
835 840 845

Ser Ser Thr Pro Tyr Leu Gly Tyr Ile Asn Ala Asn Phe Arg Ala Lys
850 855 860

Asn Ile Tyr Ile Thr Gly Thr Ile Gly Ser Gly Asn Ala Trp Gly Ser
865 870 875 880

Gly Gly Ser Ala Asn Val Ser Phe Glu Ser Gly Thr Asn Leu Val Leu
885 890 895

Asn Gln Ala Lys Ile Asp Ala Gln Gly Thr Asp Lys Ile Phe Ser Tyr
900 905 910

Leu Gly Gln Gly Gly Ile Glu Lys Leu Phe Gly Glu Lys Gly Leu Gly
915 920 925

Asn Ala Leu Ser Asn Ile Ile Tyr Glu Glu Ser Leu Asn Asp Asn Ala
930 935 940

Ile Pro Lys Asp Leu Ala Asn Met Ile Pro Lys Asp Phe Gly Ser Lys
945 950 955 960

Thr Leu Ser Ser Leu Leu Ser Pro Thr Glu Val Asn Asn Leu Leu Gly
965 970 975

Val Ser Ala Phe Lys Asn Ala Ile Met Glu Ile Leu Asn Ser Lys Thr
980 985 990

Val Gly Asp Val Phe Gly Glu Asn Gly Leu Leu Asn Ala Leu Asp Pro
995 1000 1005

Thr Glu Arg Lys Lys Ile Asp Gln Met Leu Leu Glu Gln Ile Gln Ala
1010 1015 1020

His Ser Ser Gly Phe Glu Lys Phe Ile Val Lys Thr Leu Gly Ile Glu
1025 1030 1035 1040

Asn Val Glu Asn Phe Ile Asn Asn Trp Tyr Gly Lys Gln Ser Leu Ser
1045 1050 1055

Ser Phe Ala Asn Asn Phe Val Pro Gly Gly Leu Asn Gln Ala Leu Asp
1060 1065 1070

Lys Ile Gly Ser Ser Ser Asp Ala Lys Asp Leu Gln Asn Phe Leu Asp
1075 1080 1085

Lys Thr Thr Phe Gly Asp Ile Leu Asn Gln Met Ile Glu Gln Ala Pro
1090 1095 1100
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Ile Lys Asn Ser Asn Phe Arg Asp Phe Thr Trp Gly Gly Phe Asn Phe
225 230 235 240

Asn Ser Gly Arg Ile Thr Phe Glu Asn Thr Thr Phe Ser Gly Trp Thr
245 250 255

Asn Ile Asn Gly Ala Thr Glu Ser Gly Ser Ser Tyr Val Asn Met Val
260 265 270

Ala Asn Thr Asp Leu Ile Phe Ser Asn Ser Ile Leu Gly Gly Gly Ile
275 280 285

Arg Tyr Asp Leu Lys Ala Asn Asn Ile Ile Phe Asn Asn Ser Gln Met
290 295 300

Val Ile Asp Val Ser Lys Asn Val Asn Gln Ser Ser Leu Asn Gly Asn
305 310 315 320

Val Thr Phe Asn Asn Ser Arg Leu Ser Val Lys Pro Asn Ala Ala Ile
325 330 335

Asn Ile Gly Asp Ser Gln Thr Gln Thr Ala Leu Glu Asn Ala Ser Ser
340 345 350

Leu Ser Phe Tyr Asn Asn Ser Val Ala Asn Phe Asn Gly Thr Thr Ala
355 360 365

Phe Asn Gly Val Ser Tyr Leu Asn Leu Asn Pro Asn Ala Gln Val Ser
370 375 380

Phe Asn Gln Val Asn Phe Asn Asn Ala Asn Val Thr Phe Tyr Gly Ile
385 390 395 400

Pro Leu Phe Gly Lys Thr Pro Asp Phe Gly Asn Ser Ala Arg Leu Ile
405 410 415

Asn Phe Lys Gly Asn Thr Asn Phe Asn Gln Ala Thr Leu Asn Leu Arg
420 425 430

Ala Lys Asn Ile His Ile Asn Phe Gln Gly Val Ser Thr Phe Lys Gln
435 440 445

Asn Ser Thr Met Asn Leu Ala Glu Ser Ser Gln Ala Ser Phe Asn Ala
450 455 460

Leu Lys Val Glu Gly Glu Thr Asn Phe Asn Leu Asn Asn Ser Ser Leu
465 470 475 480

Leu Asn Phe Asn Gly Asn Ser Val Phe Asn Ala Pro Val Ser Phe Tyr
485 490 495

Ala Asn His Ser Gln Ile Ser Phe Thr Lys Leu Ala Thr Phe Asn Ser
500 505 510

Asp Ala Ser Phe Asp Leu Ser Asn Asn Ser Thr Leu Asn Phe Gln Ser
515 520 525

Val Leu Leu Asn Gly Ala Leu Asn Leu Leu Gly Asn Gly Ser Asn Asn
530 535 540

Leu Ala Ile Asn Ala Lys Gly Asn Phe Ser Phe Gly Ser Lys Gly Ile
545 550 555 560

Leu Asn Leu Ser Tyr Met Asn Leu Phe Gly Gly Asp Lys Lys Thr Ser
565 570 575

Val Tyr Asp Val Leu Gln Ala Gln Asn Ile Asp Gly Leu Met Gly Asn
580 585 590

Asn Gly Tyr Glu Lys Ile Arg Phe Tyr Gly Ile Gln Ile Asp Lys Ala
595 600 605

Asp Tyr Ser Phe Asp Asn Gly Val His Ser Trp Arg Phe Thr Asn Pro
610 615 620

Leu Asn Thr Thr Glu Thr Ile Thr Glu Thr Leu His Asn Asn Arg Leu
625 630 635 640

Lys Val Gln Ile Ser Gln Asn Gly Val Ser Asn Asn Lys Met Phe Asn
645 650 655

Leu Ala Pro Ser Leu Tyr Asp Tyr Gln Lys Asn Pro Tyr Asn Glu Thr
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GTG AGG TTG TTT AAA AGC TTT TAT GCG AAT GCT GGG GTG GGG GCT AGG 5912
Val Arg Leu Phe Lys Ser Phe Tyr Ala Asn Ala Gly Val Gly Ala Arg
1940 1945 1950

TTT GGA TTG GAG TAT AAA ATG ATC AAC ATT ACC GGA AAT ATT GGA ATG 5960
Phe Gly Leu Asp Tyr Lys Met Ile Asn Ile Thr Gly Asn Ile Gly Met
1955 1960 1965 1970

CGT TTA GCG TTT TAAAAGGTGG GGGCTTACCC CTTTTTTGAG CCATTAAATA AATCA 6017
Arg Leu Ala Phe 

1 

ATGGTGGG 6025 
(2) INFORMATION FOR SEQ ID NO: 382:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1974 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:

Arg Asp Asn Ala Phe Ser Lys Asn Leu Trp Asn Leu Ile His Tyr Gly
1 5 10 15

Gly Glu Gln Gly Thr Leu Leu Arg Ala Asp Asn Asn Thr Phe Phe Val
20 25 30

Gln Phe Thr Gln Ser Asn Gly Gln Lys Phe Val Phe Glu Glu Thr Phe
35 40 45

Asn Pro Gly Ser Ile Thr Tyr Lys Tyr Phe Thr Ile His Ser Ser Leu
50 55 60

Phe His Thr Asp Ala Asp Ser Lys Asp Ile Trp Ser Gln Val Arg Lys
65 70 75 80

Gln Phe Asp Phe Ile Pro Gly Lys Thr Pro Val Cys Val Gly Val Cys
85 90 95

Tyr Ile Ala Pro Tyr Lys Asn Gln Asp Leu Ile Gly Ser Ser Ala Phe
100 105 110

Ala Trp Ser Leu Asn Phe Gly Ala Thr Val Val Gly Thr Leu Leu Leu
115 120 125

Gly Ser Ala Gln Glu Lys Ala Asn Asn Asn Gly Gly Ser Ile Trp Phe
130 135 140

Gly Lys Asn Asn Leu Leu Tyr Leu His Gly Asn Phe Asn Ala Thr Asn
145 150 155 160

Ile Phe Leu Thr Asn Asn Phe Asn Val Gly Asn Pro Asn Ala Gly Gly
165 170 175

Gly Ala Thr Ile Asn Phe Asn Ala Asp Glu Thr Leu Asn Ala Asp Gly
180 185 190

Leu Asn Tyr Thr Asn Phe Gln Thr Val Ala Leu Gly Leu Gln Thr Ser
195 200 205

Ala Ser Gln His Ser Trp Ala Asn Phe Asn Ser Lys Leu Ser Met Glu
210 215 220
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AGC TTT GTG GAA AAT GGC ACA GGA ACG CTC TAT GGT GTC AAT GTG GGC 5240
Ser Phe Val Glu Asn Gly Thr Gly Thr Leu Tyr Gly Val Asn Val Gly
1715 1720 1725 1730

TAT GAC AGA TTC ATT AAG GGT GTG ATT GTT GGA GGG TAT GCG GCT TAT 5288
Tyr Asp Arg Phe Ile Lys Gly Val Ile Val Gly Gly Tyr Ala Ala Tyr
1735 1740 1745

GGG TAT AGC GGT TTT TAT GAA CGC ATC ACT AAT TCT AAA TCC GAT AAT 5336
Gly Tyr Ser Gly Phe Tyr Glu Arg Ile Thr Asn Ser Lys Ser Asp Asn
1750 1755 1760

GTG GAT GTG GGC TTG TAT GCG AGG GCT TTC ATT AAA AAG AGC GAG CTA 5384
Val Asp Val Gly Leu Tyr Ala Arg Ala Phe Ile Lys Lys Ser Glu Leu
1765 1770 1775

ACC TTT AGC GTC AAT GAA ACT TGG GGG GCT AAT AAA AAC CAA ATC AGC 5432
Thr Phe Ser Val Asn Glu Thr Trp Gly Ala Asn Lys Asn Gln Ile Ser
1780 1785 1790

TCC AAC GAC ACT CTG CTT TCT ATG ATC AAT CAG TCC TAT AT^ TAC AGC 5480
Ser Asn Asp Thr Leu Leu Ser Met Ile Asn Gln Ser Tyr Lys Tyr Ser
1795 1800 1805 1810

ACA TGG ACG ACG AAC GCA AAA GTT AAT TAC GGG TAT GAT TTC ATG TTT 5528
Thr Trp Thr Thr Asn Ala Lys Val Asn Tyr Gly Tyr Asp Phe Met Phe
1815 1820 1825

AAA AAC AAA AGC ATC ATT TTA AAA CCT CAA ATT GGT TTA AGG TAT TAC 5576
Lys Asn Lys Ser Ile Ile Leu Lys Pro Gln Ile Gly Leu Arg Tyr Tyr
1830 1835 1840

TAT ATC GGC ATG ACC GGT TTA GAA GGG GTG ATG CAT AAT GCG CTC TAT 5624
Tyr Ile Gly Met Thr Gly Leu Glu Gly Val Met His Asn Ala Leu Tyr
1845 1850 1855

AAC CAG TTT AAA GCG AAC GCC GAT CCG TCT AAA AAA TCC GTT TTA ACG 5672
Asn Gln Phe Lys Ala Asn Ala Asp Pro Ser Lys Lys Ser Val Leu Thr
1860 1865 1870

ATT GAA CTT GCT TTG GAG AAC CGC CAT TAT TTC AAC ACA AAC TCT TAT 5720
Ile Glu Leu Ala Leu Glu Asn Arg His Tyr Phe Asn Thr Asn Ser Tyr
1875 1880 1885 1890

TTT TAT GCG ATT GGC GGC TTT GGT AGA GAC TTA TTA GTT AAT TCT ATG 5768
Phe Tyr Ala Ile Gly Gly Phe Gly Arg Asp Leu Leu Val Asn Ser Met
1895 1900 1905

GGG GAT AAA TTG GTG CGT TTT ATT GGT AAC AAC ACT TTG AGC TAC AGG 5816
Gly Asp Lys Leu Val Arg Phe Ile Gly Asn Asn Thr Leu Ser Tyr Arg
1910 1915 1920

AAA GGC GAG CTT TAT AAC ACT TTT GCG AGC ATC ACT ACA GGC GGG GAA 5864
Lys Gly Glu Leu Tyr Asn Thr Phe Ala Ser Ile Thr Thr Gly Gly Glu
1925 1930 1935
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GAT TAG CTC AAG CTT TAG ACG GTC ATT GAT TTT AAC GGC AAA CGG ATG 4568 
Asp Tyr Leu Lys Leu Tyr Thr Leu Ile Asp Phe Asn Gly Lys Arg Met
1495 1500 1505

CAA TTA AAC GGC GAT TCA CTA AGG TAT GAG AAC CAA GGG GTC AAT ATT 4616
Gln Leu Asn Gly Asp Ser Leu Ser Tyr Asp Asn Gln Pro Val Asn Ile
1510 1515 1520

AAA GAT GGG GGT CTT GTG GTA AGG TTT AAA GAG AAT GAG GGG CAA ATG 4664
Lys Asp Gly Gly Leu Val Val Ser Phe Lys Asp Asn Gln Gly Gln Met
1525 1530 1535

GTG TAT TCA TCT ATC CTT TAT GAT AAA GTT CAA GTT AGG GTC TGT GAT 4712
Val Tyr Ser Ser Ile Leu Tyr Asp Lys Val Gln Val Ser Val Ser Asp
1540 1545 1550

AAG GGC ATG GAT ATT GAT GGG GGT AGT TTG GAG TAT TAG ATT AAA TAG 4760
Lys Pro Met Asp Ile His Ala Pro Ser Leu Glu Tyr Tyr Ile Lys Tyr
1555 1560 1565 1570

ATT CAA GGC AGT GGT GGT TTG GAT GGG ATG AAA TCT GCA GGC AAT AAT 4808
Ile Gln Gly Ser Ala Gly Leu Asp Ala Ile Lys Ser Ala Gly Asn Asn
1575 1580 1585

TGG ATT GTG TGG TTG AAT GAG CTT TTT GTG GGT AAA GGG GGT AAT GGG 4856
Ser Ile Leu Trp Leu Asn Glu Leu Phe Val Ala Lys Gly Gly Asn Pro
1590 1595 1600

TTG TTG GGT GGT TAT TAT TTG CAA GAG AAT CGG AGT GAA GAG ATT GTT 4904
Leu Phe Ala Pro Tyr Tyr Leu Gln Asp Asn Pro Thr Glu His Ile Val
1605 1610 1615

AGT TTA ATG AAA GAT ATT ACT AGG GGT TTA GGC ATG GTT TGT AAA GGG 4952
Thr Leu Met Lys Asp Ile Thr Ser Ala Leu Gly Met Leu Ser Lys Pro
1620 1625 1630

AAT CTT AAA AAC AAT TGG AGG GAT GGT TTA GAG CTC AAC ACT TAG AGG 5000
Asn Leu Lys Asn Asn Ser Thr Asp Ala Leu Gln Leu Asn Thr Tyr Thr
1635 1640 1645 1650

GAA CAA ATG AGG GGT TTA GGC AAG CTT TGT AAT TTG GGT TCC TTT GAT 5048
Gln Gln Met Ser Arg Leu Ala Lys Leu Ser Asn Phe Ala Ser Phe Asp
1655 1660 1665

TCA ACG GAT TTT AGG GAA GGC TTG AGG AGT CTT AAA AAC CAA AGA TTT 5096
Ser Thr Asp Phe Ser Glu Arg Leu Ser Ser Leu Lys Asn Gln Arg Phe
1670 1675 1680

GGT GAT GCA ATC GGT AAT GGG ATG GAT GTG ATT TTA AAA TAG TGT GAA 5144
Ala Asp Ala Ile Pro Asn Ala Met Asp Val Ile Leu Lys Tyr Ser Gln
1685 1690 1695

AGG GAT AAA CTA AAA AAC AAC CTT TGG GGG AGG GGG GTT GGG GGG GTG 5192
Arg Asp Lys Leu Lys Asn Asn Leu Trp Ala Thr Gly Val Gly Gly Val
1700 1705 1710
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CAG GGT AAG CTT TCG TTT ATT TCT AAA GAT TTT TCT AAC ATT TCA TTA 38 96 

Gln Gly Lys Leu Ser Phe Ile Ser Lys Asp Phe Ser Asn Ile Ser Leu
1270 1275 1280

GAT ACC TTA AAC GCT ACT AAC GGA TTA ACG CTT AAT GCT CCT AAA AAT 3944
Asp Thr Leu Asn Ala Thr Asn Gly Leu Thr Leu Asn Ala Pro Lys Asn
1285 1290 1295

GAC ATT AGC GTT CAA AAA GGT CAG ATT TGC GTG AAT GTT TTA AAT TGC 3992
Asp Ile Ser Val Gln Lys Gly Gln Ile Cys Val Asn Val Leu Asn Cys
1300 1305 1310

ATG GGC GAG AAA AAA GCT CAT TCT TCA AGC GCG ACA GCC CCA ACC AAT 4040
Met Gly Glu Lys Lys Ala His Ser Ser Ser Ala Thr Ala Pro Thr Asn
1315 1320 1325 1330

GAA ACA CTA GAA GCG AAT GCG AAT AAT TTC GCT TTT TTA GGT GCA ATT 4088
Glu Thr Leu Glu Ala Asn Ala Asn Asn Phe Ala Phe Leu Gly Ala Ile
1335 1340 1345

AAG GCT AAT GGA TTA GTG GAT TTT TCA AAA GTT TTA CAA AAT ACT ACG 4136
Lys Ala Asn Gly Leu Val Asp Phe Ser Lys Val Leu Gln Asn Thr Thr
1350 1355 1360

ATC GGG ACT TTA GAT TTA GGG CCA AAC GCT ACT TTT AAA GCG AAT CAT 4184
Ile Gly Thr Leu Asp Leu Gly Pro Asn Ala Thr Phe Lys Ala Asn His
1365 1370 1375

TTG ATC GTG AAT AAC GCT TTT AAC AAT AAC TCT AAT TAC AGG GCT GAT 4232
Leu Ile Val Asn Asn Ala Phe Asn Asn Asn Ser Asn Tyr Arg Ala Asp
1380 1385 1390

ATT AGC GGT AAT CTC AAT GTG GTT AAA GGA GCG GCT CTC AGC ACG AAT 4280
Ile Ser Gly Asn Leu Asn Val Val Lys Gly Ala Ala Leu Ser Thr Asn
1395 1400 1405 1410

GAA AAT GGT TTG AAT GTG GGG GGC GAT TTC AAG AGC GAA GGG TCA TTA 4328
Glu Asn Gly Leu Asn Val Gly Gly Asp Phe Lys Ser Glu Gly Ser Leu
1415 1420 1425

ATC TTT AAT CTT AAC AAT AAA ACC AAT CAA ACG ATT ATT AAT GTG GCT 4376
Ile Phe Asn Leu Asn Asn Lys Thr Asn Gln Thr Ile Ile Asn Val Ala
1430 1435 1440

GGC AAT TCT ACG ATC ATG TCT TAT AAC AAT CAA GCT TTA ATC CAT TTT 4424
Gly Asn Ser Thr Ile Met Ser Tyr Asn Asn Gln Ala Leu Ile His Phe
1445 1450 1455

AAT ACC CAA CTC AAG CAA GGC GCT TAC ACG CTT ATT AAT GCG AAA CGC 4472
Asn Thr Gln Leu Lys Gln Gly Ala Tyr Thr Leu Ile Asn Ala Lys Arg
1460 1465 1470

ATG CTT TAT GGT TAT GAC AAT CAA ATC ATT CGT GGA GGG AGC TTG AGC 4520
Met Leu Tyr Gly Tyr Asp Asn Gln Ile Ile Arg Gly Gly Ser Leu Ser
1475 1480 1485 1490
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GAG AAT TTC ATC AAT AAC TGG TAT GGC AAG CAA AGC TTG AGT TCT TTT 3224 
Glu Asn Phe Ile Asn Asn Trp Tyr Gly Lys Gln Ser Leu Ser Ser Phe
1045 1050 1055

GCC AAT AAT TTT GTG CCT GGA GGC TTG AAT CAA GCC CTT GAT AAA ATA 3272
Ala Asn Asn Phe Val Pro Gly Gly Leu Asn Gln Ala Leu Asp Lys Ile
1060 1065 1070

GGC TCT AGC TCT GAT GCC AAA GAC TTA GAG AAC TTC TTG GAT AAA ACG 3320
Gly Ser Ser Ser Asp Ala Lys Asp Leu Gln Asn Phe Leu Asp Lys Thr
1075 1080 1085 1090

ACT TTT GGG GAT ATT TTA AAT CAA ATG ATT GAA CAA GCC CCC TTA ATC 3368
Thr Phe Gly Asp Ile Leu Asn Gln Met Ile Glu Gln Ala Pro Leu Ile
1095 1100 1105

AAT AAA CTC ATT TCT TGG CTG GGT CCG CAG GAT TTG AGC GTT TTA GTG 3416
Asn Lys Leu Ile Ser Trp Leu Gly Pro Gln Asp Leu Ser Val Leu Val
1110 1115 1120

AAT ATC GCT TTA AAT AGC ATC ACT AAC CCT AGT AAA GAG CTG ACT AGC 3464
Asn Ile Ala Leu Asn Ser Ile Thr Asn Pro Ser Lys Glu Leu Thr Ser
1125 1130 1135

ACC ATT TCT AGC ATA GGT GAA AAA GCG TTA AAT GAC TTA TTA GGC GAT 3512
Thr Ile Ser Ser Ile Gly Glu Lys Ala Leu Asn Asp Leu Leu Gly Asp
1140 1145 1150

GGC GTA GTG AAT AAA ATC ATG AGC AAT CAA GTC TTA GGG CAA ATG ATC 3560
Gly Val Val Asn Lys Ile Met Ser Asn Gln Val Leu Gly Gln Met Ile
1155 1160 1165 1170

AAT AAA ATC ATT GCT GAT AAG GGC TTT GGA GGC GTT TAT CAG CAA GGT 3608
Asn Lys Ile Ile Ala Asp Lys Gly Phe Gly Gly Val Tyr Gln Gln Gly
1175 1180 1185

TTA GGC TCC ATA CTG CCT CAA TCT TTA CAA GAT GAA TTG AAG AAA TTG 3656
Leu Gly Ser Ile Leu Pro Gln Ser Leu Gln Asp Glu Leu Lys Lys Leu
1190 1195 1200

GGC ATG GGC TCT TTA CTA GGA TCT AGG GGG TTG CAC AAT CTT TGG CAA 3704
Gly Met Gly Ser Leu Leu Gly Ser Arg Gly Leu His Asn Leu Trp Gln
1205 1210 1215

AGA GGG AAT TTC AAT TTT GTG GCT AAA GAT TAT TTA TTC ACT AAT AAC 3752
Arg Gly Asn Phe Asn Phe Val Ala Lys Asp Tyr Leu Phe Thr Asn Asn
1220 1225 1230

AGC TCG TTT AGT AAC GCC ACA GGG GGG GAA TTG AAT TTT GTG GCG GGC 3800
Ser Ser Phe Ser Asn Ala Thr Gly Gly Glu Leu Asn Phe Val Ala Gly
1235 1240 1245 1250

AAG TCT ATT ATT TTT AAC GGG AAA AAT ACG ATC AAT TTC ACG CAG TAT 3848
Lys Ser Ile Ile Phe Asn Gly Lys Asn Thr Ile Asn Phe Thr Gln Tyr
1255 1260 1265
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TTT GGA GGC GAA GGC TAT AAA GAG CCA TGT GCG GTT GGG GAT AAC ACT 25 52 

Phe Gly Gly Glu Gly Tyr Lys Glu Pro Cys Ala Val Gly Asp Asn Thr
820 825 830

TGC CAG ATG TTC AGA CAG ACT AAT TTA GGG CAA TTG CTC CAT TCT AGT 2600
Cys Gln Met Phe Arg Gln Thr Asn Leu Gly Gln Leu Leu His Ser Ser
835 840 845 850

ACG CCT TAT TTA GGC TAC ATT AAC GCT AAT TTT AGG GCT AAA AAC ATT 2648
Thr Pro Tyr Leu Gly Tyr Ile Asn Ala Asn Phe Arg Ala Lys Asn Ile
855 860 865

TAC ATT ACC GGA ACC ATC GGC AGC GGG AAC GCT TGG GGG AGT GGA GGG 2696
Tyr Ile Thr Gly Thr Ile Gly Ser Gly Asn Ala Trp Gly Ser Gly Gly
870 875 880

AGT GCG AAT GTG TCT TTT GAA AGC GGC ACT AAT TTA GTG CTT AAT CAA 2744
Ser Ala Asn Val Ser Phe Glu Ser Gly Thr Asn Leu Val Leu Asn Gln
885 890 895

GCT AAG ATT GAC GCT CAA GGG ACC GAT AAA ATC TTT TCT TAC TTG GGG 2792
Ala Lys Ile Asp Ala Gln Gly Thr Asp Lys Ile Phe Ser Tyr Leu Gly
900 905 910

CAA GGG GGT ATT GAA AAG CTT TTT GGA GAA AAA GGT TTA GGG AAT GCG 2840
Gln Gly Gly Ile Glu Lys Leu Phe Gly Glu Lys Gly Leu Gly Asn Ala
915 920 925 930

CTT TCT AAT ATC ATT TAT GAA GAG AGC TTG AAT GAT AAC GCT ATC CCT 2888
Leu Ser Asn Ile Ile Tyr Glu Glu Ser Leu Asn Asp Asn Ala Ile Pro
935 940 945

AAA GAT TTA GCC AAC ATG ATC CCT AAA GAT TTT GGA TCT AAG ACT TTA 2936
Lys Asp Leu Ala Asn Met Ile Pro Lys Asp Phe Gly Ser Lys Thr Leu
950 955 960

AGC TCA TTG CTT AGC CCT ACT GAA GTG AAT AAC CTC TTA GGC GTG AGC 2984
Ser Ser Leu Leu Ser Pro Thr Glu Val Asn Asn Leu Leu Gly Val Ser
965 970 975

GCA TTC AAA AAC GCG ATC ATG GAA ATT TTA AAT TCT AAA ACG GTG GGC 3032
Ala Phe Lys Asn Ala Ile Met Glu Ile Leu Asn Ser Lys Thr Val Gly
980 985 990

GAT GTT TTT GGT GAA AAC GGG CTT TTA AAC GCG CTA GAT CCT ACG GAA 3080
Asp Val Phe Gly Glu Asn Gly Leu Leu Asn Ala Leu Asp Pro Thr Glu
995 1000 1005 1010

AGA AAA AAA ATT GAT CAA ATG CTT TTA GAG CAA ATC CAA GCC CAT TCT 3128
Arg Lys Lys Ile Asp Gln Met Leu Leu Glu Gln Ile Gln Ala His Ser
1015 1020 1025

TCA GGG TTT GAA AAA TTC ATC GTG AAA ACT TTA GGG ATT GAA AAT GTA 3176
Ser Gly Phe Glu Lys Phe Ile Val Lys Thr Leu Gly Ile Glu Asn Val
1030 1035 1040
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TAT GAG AAG ATC CGT TTT TAT GGC ATA CAG ATT GAC AAG GCT GAT TAG 1880 

Tyr Glu Lys Ile Arg Phe Tyr Gly Ile Gln Ile Asp Lys Ala Asp Tyr
595 600 605 610

TCG TTT GAT AAC GGC GTT CAT TCT TGG AGA TTC ACT AAC CCG CTC AAT 1928
Ser Phe Asp Asn Gly Val His Ser Trp Arg Phe Thr Asn Pro Leu Asn
615 620 625

ACG ACT GAA ACG ATT ACA GAA ACC TTG CAT AAC AAC CGC TTG AAA GTG 1976
Thr Thr Glu Thr Ile Thr Glu Thr Leu His Asn Asn Arg Leu Lys Val
630 635 640

CAG ATC TCT CAA AAC GGC GTT TCT AAT AAT AAG ATG TTC AAT CTC GCT 2024
Gln Ile Ser Gln Asn Gly Val Ser Asn Asn Lys Met Phe Asn Leu Ala
645 650 655

CCT AGC TTG TAT GAT TAC CAA AAA AAC CCT TAT AAT GAA ACC GAG AAT 2072
Pro Ser Leu Tyr Asp Tyr Gln Lys Asn Pro Tyr Asn Glu Thr Glu Asn
660 665 670

TCC TAT AAT TAC ACA AGC GAT AAG GTT GGC ACT TAT TAT TTA ACG AGC 2120
Ser Tyr Asn Tyr Thr Ser Asp Lys Val Gly Thr Tyr Tyr Leu Thr Ser
675 680 685 690

AAT ATC AAA GGC TTT AAT CAA AAC AAT AAA ACA CCC GGG ACT TAT AAC 2168
Asn Ile Lys Gly Phe Asn Gln Asn Asn Lys Thr Pro Gly Thr Tyr Asn
695 700 705

GCG CAA AAC CAA CCC TTA CAA GCC TTA CAC ATT TAC AAT CAG GCT ATC 2216
Ala Gln Asn Gln Pro Leu Gln Ala Leu His Ile Tyr Asn Gln Ala Ile
710 715 720

ACT AAG CAA GAT TTG AAC ATG ATC GCC AGT TTG GGT AAG GAG TTT TTG 2264
Thr Lys Gln Asp Leu Asn Met Ile Ala Ser Leu Gly Lys Glu Phe Leu
725 730 735

CCT AAA ATA GCC AAT CTT TTA TCT TCA GGG GCT TTG GAT AAT CTC AAT 2312
Pro Lys Ile Ala Asn Leu Leu Ser Ser Gly Ala Leu Asp Asn Leu Asn
740 745 750

AGC CCG AAT AGT TTT GAA ACT CTT TTT GGT ATC TTT GAA AAG TAT GGT 2360
Ser Pro Asn Ser Phe Glu Thr Leu Phe Gly Ile Phe Glu Lys Tyr Gly
755 760 765 770

ATC ACT TTA AAC CAA GAA AAT TGG AAG AGC TTA TTA AAG ATT ATC AAT 2408
Ile Thr Leu Asn Gln Glu Asn Trp Lys Ser Leu Leu Lys Ile Ile Asn
775 780 785

AAT TTT TCC AAC ACA ACT AAT TAT GAT TTC TCT CAA GGC AAT CTC GTT 2456
Asn Phe Ser Asn Thr Thr Asn Tyr Asp Phe Ser Gln Gly Asn Leu Val
790 795 800

GTA GGA GCG ATC AAA GAG GGG CAA ACG AAC ACT AAA AGC GTG GTG TGG 2504
Val Gly Ala Ile Lys Glu Gly Gln Thr Asn Thr Lys Ser Val Val Trp
805 810 815
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GGG GTG TCT TAT TTG AAT TTG AAC CCT AAC GCT CAA GTA AGC TTC AAT 1208 
Gly Val Ser Tyr Leu Asn Leu Asn Pro Asn Ala Gln Val Ser Phe Asn
375 380 385

CAA GTA AAT TTC AAT AAC GCT AAT GTA ACT TTT TAT GGC ATT CCT TTA 1256
Gln Val Asn Phe Asn Asn Ala Asn Val Thr Phe Tyr Gly Ile Pro Leu
390 395 400

TTT GGT AAA ACG CCT GAT TTT GGC AAC TCT GCA CGC CTT ATC AAT TTC 1304
Phe Gly Lys Thr Pro Asp Phe Gly Asn Ser Ala Arg Leu Ile Asn Phe
405 410 415

AAA GGG AAT ACG AAT TTT AAT CAA GCC ACG CTC AAT TTA AGG GCT AAA 1352
Lys Gly Asn Thr Asn Phe Asn Gln Ala Thr Leu Asn Leu Arg Ala Lys
420 425 430

AAT ATC CAT ATC AAT TTC CAA GGC GTT TCT ACT TTT AAA CAA AAC TCT 1400
Asn Ile His Ile Asn Phe Gln Gly Val Ser Thr Phe Lys Gln Asn Ser
435 440 445 450

ACG ATG AAT TTA GCT GAA AGT TCC CAA GCG AGC TTT AAC GCT CTT AAA 1448
Thr Met Asn Leu Ala Glu Ser Ser Gln Ala Ser Phe Asn Ala Leu Lys
455 460 465

GTG GAA GGG GAA ACG AAT TTC AAT CTC AAT AAC TCA AGC TTG TTG AAT 1496
Val Glu Gly Glu Thr Asn Phe Asn Leu Asn Asn Ser Ser Leu Leu Asn
470 475 480

TTC AAT GGC AAT AGC GTT TTC AAC GCT CCT GTG AGT TTT TAT GCT AAT 1544
Phe Asn Gly Asn Ser Val Phe Asn Ala Pro Val Ser Phe Tyr Ala Asn
485 490 495

CAT TCT CAA ATT TCT TTC ACT AAA TTA GCG ACT TTT AAT TCT GAC GCT 1592
His Ser Gln Ile Ser Phe Thr Lys Leu Ala Thr Phe Asn Ser Asp Ala
500 505 510

TCT TTT GAT TTA AGC AAC AAC AGC ACC CTG AAT TTT CAA AGC GTT CTT 1640
Ser Phe Asp Leu Ser Asn Asn Ser Thr Leu Asn Phe Gln Ser Val Leu
515 520 525 530

TTA AAT GGT GCT CTA AAC CTT TTA GGC AAT GGC AGT AAC AAT CTA GCG 1688
Leu Asn Gly Ala Leu Asn Leu Leu Gly Asn Gly Ser Asn Asn Leu Ala
535 540 545

ATC AAC GCT AAA GGG AAT TTT AGT TTT GGG TCT AAA GGG ATT TTG AAT 1736
Ile Asn Ala Lys Gly Asn Phe Ser Phe Gly Ser Lys Gly Ile Leu Asn
550 555 560

CTG TCT TAT ATG AAT CTA TTT GGG GGG GAT AAA AAA ACT TCC GTT TAT 1784
Leu Ser Tyr Met Asn Leu Phe Gly Gly Asp Lys Lys Thr Ser Val Tyr
565 570 575

GAT GTG TTG CAA GCC CAA AAT ATT GAT GGC TTA ATG GGG AAT AAC GGC 1832
Asp Val Leu Gln Ala Gln Asn Ile Asp Gly Leu Met Gly Asn Asn Gly
580 585 590
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AAT AAT TTG CTG TAT TTG CAT GGC AAT TTC AAC GCG ACT AAT ATC TTT 53 6 

Asn Asn Leu Leu Tyr Leu His Gly Asn Phe Asn Ala Thr Asn Ile Phe
150 155 160

TTA ACG AAT AAT TTT AAT GTC GGC AAC CCT AAC GCT GGC GGT GGG GCG 584
Leu Thr Asn Asn Phe Asn Val Gly Asn Pro Asn Ala Gly Gly Gly Ala
165 170 175

ACG ATT AAT TTT AAC GCT GAT GAA ACC TTG AAC GCT GAC GGG TTA AAT 632
Thr Ile Asn Phe Asn Ala Asp Glu Thr Leu Asn Ala Asp Gly Leu Asn
180 185 190

TAG ACG AAT TTC CAA ACC GTG GCT TTG GGC TTA CAA ACC AGT GCG AGC 680
Tyr Thr Asn Phe Gln Thr Val Ala Leu Gly Leu Gln Thr Ser Ala Ser
195 200 205 210

CAG CAT TCA TGG GCG AAT TTT AAT TCC AAG CTT TCT ATG GAG ATT AAA 728
Gln His Ser Trp Ala Asn Phe Asn Ser Lys Leu Ser Met Glu Ile Lys
215 220 225

AAT TCT AAC TTT AGG GAT TTC ACA TGG GGA GGC TTT AAT TTT AAT TCA 776
Asn Ser Asn Phe Arg Asp Phe Thr Trp Gly Gly Phe Asn Phe Asn Ser
230 235 240

GGG CGT ATC ACT TTT GAA AAC ACC ACT TTT AGC GGC TGG ACC AAT ATT 824
Gly Arg Ile Thr Phe Glu Asn Thr Thr Phe Ser Gly Trp Thr Asn Ile
245 250 255

AAC GGA GCG ACT GAG AGC GGC TCA TCG TAT GTG AAT ATG GTT GCG AAT 872
Asn Gly Ala Thr Glu Ser Gly Ser Ser Tyr Val Asn Met Val Ala Asn
260 265 270

ACG GAT TTG ATA TTT TCT AAT TCC ATT TTA GGA GGG GGC ATT CGC TAT 920
Thr Asp Leu Ile Phe Ser Asn Ser Ile Leu Gly Gly Gly Ile Arg Tyr
275 280 285 290

GAT TTG AAA GCT AAT AAC ATT ATT TTC AAT AAC TCT CAA ATG GTT ATT 968
Asp Leu Lys Ala Asn Asn Ile Ile Phe Asn Asn Ser Gln Met Val Ile
295 300 305

GAT GTG TCT AAG AAT GTG AAT CAG TCA TCA TTG AAT GGG AAT GTT ACT 1016
Asp Val Ser Lys Asn Val Asn Gln Ser Ser Leu Asn Gly Asn Val Thr
310 315 320

TTC AAT AAT TCC AGG CTT TCA GTC AAG CCC AAT GCG GCT ATT AAT ATT 1064
Phe Asn Asn Ser Arg Leu Ser Val Lys Pro Asn Ala Ala Ile Asn Ile
325 330 335

GGG GAT AGC CAA ACC CAA ACG GCT TTA GAA AAC GCT TCA AGC CTT TCT 1112
Gly Asp Ser Gln Thr Gln Thr Ala Leu Glu Asn Ala Ser Ser Leu Ser
340 345 350

TTT TAC AAC AAC AGC GTG GCG AAT TTT AAC GGC ACA ACC GCT TTT AAC 1160
Phe Tyr Asn Asn Ser Val Ala Asn Phe Asn Gly Thr Thr Ala Phe Asn
355 360 365 370
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(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...5972
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

TTACACGCTT TTAAGCGGAA ATAGTATCAA ATACAATAAC CAAGCTTTAG CGG GAC 56 

Arg Asp 
1 

AAT GCT TTT TCA AAA AAT TTA TGG AAT TTA ATC CAT TAT GGT GGC GAA 104
Asn Ala Phe Ser Lys Asn Leu Trp Asn Leu Ile His Tyr Gly Gly Glu
5 10 15

CAA GGG ACT CTA TTA AGA GCG GAT AAC AAC ACC TTT TTT GTG CAA TTC 152
Gln Gly Thr Leu Leu Arg Ala Asp Asn Asn Thr Phe Phe Val Gln Phe
20 25 30

ACC CAA AGC AAC GGC CAA AAA TTT GTT TTT GAA GAA ACT TTT AAT CCG 200
Thr Gln Ser Asn Gly Gln Lys Phe Val Phe Glu Glu Thr Phe Asn Pro
35 40 45 50

GGC TCT ATC ACC TAT AAA TAT TTC ACT ATC CAT TCT TCG CTT TTC CAC 248
Gly Ser Ile Thr Tyr Lys Tyr Phe Thr Ile His Ser Ser Leu Phe His
55 60 65

ACA GAC GCT GAT TCT AAG GAT ATT TGG AGT CAA GTG AGG AAG CAA TTT 296
Thr Asp Ala Asp Ser Lys Asp Ile Trp Ser Gln Val Arg Lys Gln Phe
70 75 80

GAT TTC ATT CCA GGA AAA ACC CCT GTG TGT GTT GGC GTG TGC TAT ATC 344
Asp Phe Ile Pro Gly Lys Thr Pro Val Cys Val Gly Val Cys Tyr Ile
85 90 95

GCG CCT TAT AAA AAT CAA GAC CTT ATT GGC TCT AGC GCT TTT GCG TGG 392
Ala Pro Tyr Lys Asn Gln Asp Leu Ile Gly Ser Ser Ala Phe Ala Trp
100 105 110

TCG CTG AAC TTT GGG GCC ACG GTG GTA GGG ACT TTG CTT TTA GGG AGC 440
Ser Leu Asn Phe Gly Ala Thr Val Val Gly Thr Leu Leu Leu Gly Ser
115 120 125 130

GCT CAA GAA AAA GCC AAT AAT AAT GGC GGA TCG ATC TGG TTT GGT AAG 488
Ala Gln Glu Lys Ala Asn Asn Asn Gly Gly Ser Ile Trp Phe Gly Lys
135 140 145
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370 375 380

Leu Ala Ile Gly Ile Ile Ile Asp Asp Ala Ile Val Val Ile Glu Asn
385 390 395 400

Ile His Lys Lys Leu Glu Met Gly Met Ser Lys Arg Lys Ala Ser Tyr
405 410 415

Glu Gly Val Arg Glu Ile Gly Phe Ala Leu Val Ala Ile Ser Ala Met
420 425 430

Leu Leu Ser Val Phe Val Pro Ile Gly Asn Met Lys Gly Ile Ile Gly
435 440 445

Arg Phe Phe Gln Ser Phe Gly Ile Thr Val Ala Leu Ala Ile Ala Leu
450 455 460

Ser Tyr Val Val Val Val Thr Ile Ile Pro Met Val Ser Ser Val Val
465 470 475 480

Val Asn Pro Arg His Ser Arg Phe Tyr Val Trp Ser Glu Pro Phe Phe
485 490 495

Lys Ala Leu Glu Ser Arg Tyr Thr Lys Leu Leu Gln Trp Val Leu Asn
500 505 510

His Lys Ile Ile Ile Ser Ile Ala Val Val Leu Val Phe Val Gly Ser
515 520 525

Leu Phe Val Ala Ser Lys Ile Gly Met Glu Phe Met Leu Lys Glu Asp
530 535 540

Arg Gly Arg Phe Leu Val Trp Leu Lys Ala Lys Pro Gly Val Ser Ile
545 550 555 560

Asp Tyr Met Thr Gln Lys Ser Lys Ile Phe Gln Lys Ala Ile Glu Lys
565 570 575

His Ala Glu Val Glu Phe Thr Thr Leu Gln Val Gly Tyr Gly Thr Thr
580 585 590

Gln Asn Pro Phe Lys Ala Lys Ile Phe Val Gln Leu Lys Pro Leu Lys
595 600 605

Glu Arg Lys Lys Glu His Gln Leu Gly Gln Phe Glu Leu Met Ser Val
610 615 620

Leu Arg Lys Glu Leu Arg Ser Leu Pro Glu Ala Lys Gly Leu Asp Thr
625 630 635 640

Ile Asn Leu Ser Glu Val Thr Leu Ile Gly Gly Gly Gly Asp Ser Ser
645 650 655

Pro Phe Gln Thr Phe Val Phe Ser His Ser Gln Glu Ala Val Asp Lys
660 665 670

Ser Val Glu Asn Leu Lys Lys Phe Leu Leu Glu Ser Pro Glu Leu Lys
675 680 685

Gly Lys Val Glu Ser Tyr His Thr Ser Thr Ser Glu Ser Gln Pro Gln
690 695 700

Leu Gln Leu Lys Ile Leu Arg Gln Asn Ala Asn Lys Tyr Gly Val Ser
705 710 715 720

Ala Gln Thr Ile Gly Ser Val Val Ser Ser Ala Phe Ser Gly Thr Ser
725 730 735

Gln Ala Ser Val Phe Lys Glu Asp Gly Lys Glu Tyr Asp Met Ile Leu
740 745 750

Glu Cys Leu Met Thr Ser Ala Phe Leu
755 760



(2) INFORMATION FOR SEQ ID NO: 381:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6025 base pairs

(B) TYPE: nucleic acid
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

Met Tyr Lys Thr Ala Ile Asn Arg Pro Ile Thr Thr Leu Met Phe Ala
1 5 10 15

Leu Ala Ile Val Phe Phe Gly Thr Met Gly Phe Lys Lys Leu Ser Val
20 25 30

Ala Leu Phe Pro Lys Ile Asp Leu Pro Thr Val Val Val Thr Thr Thr
35 40 45

Tyr Pro Gly Ala Ser Ala Glu Ile Ile Glu Ser Lys Val Thr Asp Lys
50 55 60

Ile Glu Glu Ala Val Met Gly Ile Asp Gly Ile Lys Lys Val Thr Ser
65 70 75 80

Thr Ser Ser Lys Asn Val Ser Ile Val Val Ile Glu Phe Glu Leu Glu
85 90 95

Lys Pro Asn Glu Glu Ala Leu Asn Asp Val Val Asn Lys Ile Ser Ser
100 105 110

Val Arg Phe Asp Asp Ser Asn Ile Lys Lys Pro Ser Ile Asn Lys Phe
115 120 125

Asp Thr Asp Ser Gln Ala Ile Ile Ser Leu Phe Val Ser Ser Ser Ser
130 135 140

Val Pro Ala Thr Thr Leu Asn Asp Tyr Ala Lys Asn Thr Ile Lys Pro
145 150 155 160

Met Leu Gln Lys Ile Asn Gly Val Gly Gly Val Gln Leu Asn Gly Phe
165 170 175

Arg Glu Arg Gln Ile Arg Ile Tyr Ala Asn Pro Thr Leu Met Asn Lys
180 185 190

Tyr Asn Leu Thr Tyr Ala Asp Leu Phe Ser Thr Leu Lys Ala Glu Asn
195 200 205

Val Glu Ile Asp Gly Gly Arg Ile Val Asn Ser Gln Arg Glu Phe Ser
210 215 220

Ile Leu Ile Asn Ala Asn Ser Tyr Ser Val Ala Asp Val Glu Lys Ile
225 230 235 240

Gln Val Gly Asn His Val Arg Leu Gly Asp Ile Ala Lys Ile Glu Ile
245 250 255

Gly Leu Glu Glu Asp Asn Thr Phe Ala Ser Phe Lys Asp Lys Pro Gly
260 265 270

Val Ile Leu Glu Ile Gln Lys Ile Ala Gly Ala Asn Glu Ile Glu Ile
275 280 285

Val Asp Arg Val Tyr Glu Ala Leu Lys Arg Ile Gln Ala Ile Ser Pro
290 295 300

Asn Tyr Glu Ile Arg Pro Phe Leu Asp Thr Thr Gly Tyr Ile Arg Thr
305 310 315 320

Ser Ile Glu Asp Val Lys Phe Asp Leu Val Leu Gly Ala Ile Leu Ala
325 330 335

Val Leu Val Val Phe Ala Phe Leu Arg Asn Gly Thr Ile Thr Leu Val
340 345 350

Ser Ala Ile Ser Ile Pro Ile Ser Ile Met Gly Thr Phe Ala Leu Ile
355 360 365

Gln Trp Met Gly Phe Ser Leu Asn Met Leu Thr Met Val Ala Leu Thr
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GAA GTG GAA TTC ACC ACC TTG CAA GTG GGT TAT GGC ACC ACA CAA AAC 1832 
Glu Val Glu Phe Thr Thr Leu Gln Val Gly Tyr Gly Thr Thr Gln Asn
580 585 590

CCT TTT AAG GCT AAG ATT TTT GTG CAA CTC AAG CCT TTA AAA GAG CGT 1880
Pro Phe Lys Ala Lys Ile Phe Val Gln Leu Lys Pro Leu Lys Glu Arg
595 600 605 610

AAA AAA GAG CAT CAA TTG GGG CAA TTT GAG TTG ATG AGC GTT TTA AGG 1928
Lys Lys Glu His Gln Leu Gly Gln Phe Glu Leu Met Ser Val Leu Arg
615 620 625

AAA GAG TTG AGA AGC TTG CCT GAA GCT AAA GGT TTA GAT ACT ATT AAT 1976
Lys Glu Leu Arg Ser Leu Pro Glu Ala Lys Gly Leu Asp Thr Ile Asn
630 635 640

CTT TCT GAA GTT ACT CTT ATA GGG GGC GGT GGG GAT AGT TCG CCC TTC 2024
Leu Ser Glu Val Thr Leu Ile Gly Gly Gly Gly Asp Ser Ser Pro Phe
645 650 655

CAA ACC TTT GTG TTT TCC CAT TCT CAA GAA GCG GTG GAT AAA AGC GTG 2072
Gln Thr Phe Val Phe Ser His Ser Gln Glu Ala Val Asp Lys Ser Val
660 665 670

GAG AAT TTG AAA AAA TTC TTA TTA GAA AGC CCT GAA TTA AAA GGC AAG 2120
Glu Asn Leu Lys Lys Phe Leu Leu Glu Ser Pro Glu Leu Lys Gly Lys
675 680 685 690

GTT GAA AGC TAT CAT ACA AGC ACG AGC GAA TCG CAA CCG CAA TTG CAA 2168
Val Glu Ser Tyr His Thr Ser Thr Ser Glu Ser Gln Pro Gln Leu Gln
695 700 705

CTC AAA ATC TTA AGA CAA AAC GCT AAC AAA TAC GGC GTG AGC GCT CAA 2216
Leu Lys Ile Leu Arg Gln Asn Ala Asn Lys Tyr Gly Val Ser Ala Gln
710 715 720

ACC ATT GGA TCA GTG GTG AGC TCT GCT TTT TCT GGG ACT TCT CAA GCG 2264
Thr Ile Gly Ser Val Val Ser Ser Ala Phe Ser Gly Thr Ser Gln Ala
725 730 735

AGC GTG TTC AAA GAA GAT GGC AAA GAA TAC GAC ATG ATC TTA GAG TGC 2312
Ser Val Phe Lys Glu Asp Gly Lys Glu Tyr Asp Met Ile Leu Glu Cys
740 745 750

CTG ATG ACA AGC GCG TTT CTG TAGAAGACAT CAAACGCTTG CAAGTGCGTA ACAA 2367
Leu Met Thr Ser Ala Phe Leu
755 760

ATACGATAAA TTGATGTTT 2386

(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 amino acids 

(B) TYPE: amino acid 


WO 98/43478



PCT/US98/06371 



Ile Ser Ile Pro Ile Ser Ile Met Gly Thr Phe Ala Leu Ile Gln Trp
355 360 365 370

ATG GGC TTT TCA TTA AAC ATG CTC ACC ATG GTG GCT TTA ACG TTG GCG 1208
Met Gly Phe Ser Leu Asn Met Leu Thr Met Val Ala Leu Thr Leu Ala
375 380 385

ATA GGG ATT ATC ATT GAT GAT GCG ATC GTG GTG ATT GAA AAC ATC CAT 1256
Ile Gly Ile Ile Ile Asp Asp Ala Ile Val Val Ile Glu Asn Ile His
390 395 400

AAA AAG CTA GAA ATG GGC ATG AGT AAA CGA AAA GCG AGC TAT GAG GGG 1304
Lys Lys Leu Glu Met Gly Met Ser Lys Arg Lys Ala Ser Tyr Glu Gly
405 410 415

GTG AGA GAA ATT GGC TTT GCT CTA GTG GCG ATT TCA GCG ATG CTG CTC 1352
Val Arg Glu Ile Gly Phe Ala Leu Val Ala Ile Ser Ala Met Leu Leu
420 425 430

TCT GTT TTT GTG CCT ATA GGG AAC ATG AAA GGC ATT ATT GGG CGT TTT 1400
Ser Val Phe Val Pro Ile Gly Asn Met Lys Gly Ile Ile Gly Arg Phe
435 440 445 450

TTT CAA AGT TTT GGG ATC ACG GTG GCT TTA GCG ATC GCT CTA TCG TAT 1448
Phe Gln Ser Phe Gly Ile Thr Val Ala Leu Ala Ile Ala Leu Ser Tyr
455 460 465

GTG GTG GTC GTT ACG ATT ATC CCC ATG GTA AGC TCA GTC GTG GTC AAT 1496
Val Val Val Val Thr Ile Ile Pro Met Val Ser Ser Val Val Val Asn
470 475 480

CCC AGG CAT TCT CGT TTT TAT GTG TGG AGT GAG CCT TTT TTT AAG GCT 1544
Pro Arg His Ser Arg Phe Tyr Val Trp Ser Glu Pro Phe Phe Lys Ala
485 490 495

TTA GAG TCT CGT TAT ACC AAG TTG CTC CAA TGG GTA TTA AAC CAC AAG 1592
Leu Glu Ser Arg Tyr Thr Lys Leu Leu Gln Trp Val Leu Asn His Lys
500 505 510

ATC ATT ATC TCT ATA GCG GTG GTT TTG GTG TTT GTG GGA TCG CTT TTT 1640
Ile Ile Ile Ser Ile Ala Val Val Leu Val Phe Val Gly Ser Leu Phe
515 520 525 530

GTG GCT TCT AAG ATT GGT ATG GAG TTC ATG CTG AAA GAA GAT AGG GGG 1688
Val Ala Ser Lys Ile Gly Met Glu Phe Met Leu Lys Glu Asp Arg Gly
535 540 545

AGG TTT TTG GTG TGG CTT AAG GCT AAA CCG GGC GTG AGC ATA GAT TAC 1736
Arg Phe Leu Val Trp Leu Lys Ala Lys Pro Gly Val Ser Ile Asp Tyr
550 555 560

ATG ACA CAA AAG AGT AAG ATC TTT CAA AAA GCG ATT GAA AAA CAT GCT 1784
Met Thr Gln Lys Ser Lys Ile Phe Gln Lys Ala Ile Glu Lys His Ala
565 570 575
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135 140 145 

GCT ACA ACC CTT AAT GAC TAG GCT AAA AAC ACC ATC AAA CCC ATG CTC 536
Ala Thr Thr Leu Asn Asp Tyr Ala Lys Asn Thr Ile Lys Pro Met Leu
150 155 160

CAA AAA ATC AAT GGG GTA GGG GGC GTG GAG CTC AAC GGC TTT AGG GAG 584
Gln Lys Ile Asn Gly Val Gly Gly Val Gln Leu Asn Gly Phe Arg Glu
165 170 175

CGC CAG ATT AGG ATT TAT GCA AAT CCC ACT TTG ATG AAT AAA TAG AAC 632
Arg Gln Ile Arg Ile Tyr Ala Asn Pro Thr Leu Met Asn Lys Tyr Asn
180 185 190

CTG ACT TAT GCG GAT CTT TTG AGG AGG CTT AAA GCG GAG AAT GTG GAA 680
Leu Thr Tyr Ala Asp Leu Phe Ser Thr Leu Lys Ala Glu Asn Val Glu
195 200 205 210

ATT GAT GGG GGG CGC ATT GTG AAT AGG CAA AGG GAA TTT TCT ATT TTA 728
Ile Asp Gly Gly Arg Ile Val Asn Ser Gln Arg Glu Phe Ser Ile Leu
215 220 225

ATG AAT GCG AAT AGT TAT AGG GTT GGG GAT GTG GAA AAG ATT CAA GTG 776
Ile Asn Ala Asn Ser Tyr Ser Val Ala Asp Val Glu Lys Ile Gln Val
230 235 240

GGT AAT CAT GTG CGT CTT GGC GAT ATT GCA AAA ATT GAA ATC GGT TTG 824
Gly Asn His Val Arg Leu Gly Asp Ile Ala Lys Ile Glu Ile Gly Leu
245 250 255

GAA GAA GAC AAC ACT TTT GGG AGG TTT AAA GAC AAA GGC GGT GTG ATT 872
Glu Glu Asp Asn Thr Phe Ala Ser Phe Lys Asp Lys Pro Gly Val Ile
260 265 270

TTA GAA ATG CAA AAG ATT GGC GGA GCG AAT GAA ATT GAA ATC GTA GAT 920
Leu Glu Ile Gln Lys Ile Ala Gly Ala Asn Glu Ile Glu Ile Val Asp
275 280 285 290

AGG GTG TAT GAA GCT TTA AAG CGC ATT CAA GGC ATT AGG GCT AAG TAT 968
Arg Val Tyr Glu Ala Leu Lys Arg Ile Gln Ala Ile Ser Pro Asn Tyr
295 300 305

GAA ATC AGA CCC TTT TTA GAC AGG AGG GGC TAT ATG CGC AGG TCT ATT 1016
Glu Ile Arg Pro Phe Leu Asp Thr Thr Gly Tyr Ile Arg Thr Ser Ile
310 315 320

GAA GAG GTG AAA TTT GAT GTA GTT TTA GGG GGG ATT TTA GCG GTT TTA 1064
Glu Asp Val Lys Phe Asp Leu Val Leu Gly Ala Ile Leu Ala Val Leu
325 330 335

GTG GTG TTT GCG TTG TTG CGT AAC GGC AGG ATC ACC CTG GTT TGA GCG 1112
Val Val Phe Ala Phe Leu Arg Asn Gly Thr Ile Thr Leu Val Ser Ala
340 345 350

ATC TCT ATG GCT ATT TCT ATC ATG GGG ACT TTT GCG CTC ATC CAA TGG 1160
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(A) LENGTH: 2386 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...2333
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379:

TGGGGCTTTT TGGCGATGGG TTTATCCAAA CGAAATAATA GGATATTTTG ATG TAT 56 

Met Tyr 
1 

AAA ACA GCG ATT AAT CGT CCT ATT ACG ACC TTA ATG TTT GCT TTG GCG 104
Lys Thr Ala Ile Asn Arg Pro Ile Thr Thr Leu Met Phe Ala Leu Ala
5 10 15

ATT GTC TTT TTT GGG ACT ATG GGG TTT AAA AAA TTG AGC GTG GCG CTT 152
Ile Val Phe Phe Gly Thr Met Gly Phe Lys Lys Leu Ser Val Ala Leu
20 25 30

TTC CCT AAA ATT GAT TTG CCT ACG GTG GTG GTT ACT ACG ACT TAT CCT 200
Phe Pro Lys Ile Asp Leu Pro Thr Val Val Val Thr Thr Thr Tyr Pro
35 40 45 50

GGG GCT AGC GCT GAA ATC ATA GAG AGT AAG GTA ACC GAT AAG ATT GAA 248
Gly Ala Ser Ala Glu Ile Ile Glu Ser Lys Val Thr Asp Lys Ile Glu
55 60 65

GAA GCG GTG ATG GGG ATT GAT GGG ATC AAA AAG GTT ACT TCC ACG AGT 296
Glu Ala Val Met Gly Ile Asp Gly Ile Lys Lys Val Thr Ser Thr Ser
70 75 80

TCT AAA AAT GTG AGT ATC GTC GTC ATT GAA TTT GAG TTA GAA AAA CCT 344
Ser Lys Asn Val Ser Ile Val Val Ile Glu Phe Glu Leu Glu Lys Pro
85 90 95

AAT GAA GAA GCC TTA AAC GAT GTG GTG AAT AAA ATT TCT TCG GTG CGT 392
Asn Glu Glu Ala Leu Asn Asp Val Val Asn Lys Ile Ser Ser Val Arg
100 105 110

TTT GAT GAC TCT AAC ATT AAA AAA CCC TCT ATC AAT AAA TTT GAT ACC 440
Phe Asp Asp Ser Asn Ile Lys Lys Pro Ser Ile Asn Lys Phe Asp Thr
115 120 125 130

GAC AGC CAA GCC ATT ATT TCA TTG TTT GTG AGC AGT TCA AGC GTG CCG 488
Asp Ser Gln Ala Ile Ile Ser Leu Phe Val Ser Ser Ser Ser Val Pro
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TTT GAT AAG GAC GCT TTG AGC GAA GGG TTG CAT AAA ATG TCC TTA GAC 536 
Phe Asp Lys Asp Ala Leu Ser Glu Gly Leu His Lys Met Ser Leu Asp 
150 155 160 

AAT CAA GCG GTG AGT ATC CTT GTG GCT AAA GTG GAA GAA ATC TTT AAA 584
Asn Gln Ala Val Ser Ile Leu Val Ala Lys Val Glu Glu Ile Phe Lys
165 170 175

GAT TCT GTC AAT TAG GGA GAT GTT AAA GTC CCT ATA GCC ATG TAGGCTTAG 635
Asp Ser Val Asn Tyr Gly Asp Val Lys Val Pro Ile Ala Met
180 185 190

AACAACAAGC GTTCCTCGCT ATCGTCTGTT CTTTTGGGGG TGGG 679

(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

Val Leu Glu Lys Ser Phe Leu Lys Ser Lys Gln Leu Phe Leu Cys Gly
1 5 10 15

Leu Gly Val Leu Met Leu Gln Ala Cys Thr Cys Pro Asn Thr Ser Gln
20 25 30

Arg Asn Ser Phe Leu Gln Asp Val Pro Tyr Trp Met Leu Gln Asn Arg
35 40 45

Ser Glu Tyr Ile Thr Gln Gly Val Asp Ser Ser His Ile Val Asp Gly
50 55 60

Lys Lys Thr Glu Glu Ile Glu Lys Ile Ala Thr Lys Arg Ala Thr Ile
65 70 75 80

Arg Val Ala Gln Asn Ile Val His Lys Leu Lys Glu Ala Tyr Leu Ser
85 90 95

Lys Thr Asn Arg Ile Lys Gln Lys Ile Thr Asn Glu Met Phe Ile Gln
100 105 110

Met Thr Gln Pro Ile Tyr Asp Ser Leu Met Asn Val Asp Arg Leu Gly
115 120 125

Ile Tyr Ile Asn Pro Asn Asn Glu Glu Val Phe Ala Leu Val Arg Ala
130 135 140

Arg Gly Phe Asp Lys Asp Ala Leu Ser Glu Gly Leu His Lys Met Ser
145 150 155 160

Leu Asp Asn Gln Ala Val Ser Ile Leu Val Ala Lys Val Glu Glu Ile
165 170 175

Phe Lys Asp Ser Val Asn Tyr Gly Asp Val Lys Val Pro Ile Ala Met
180 185 190

(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS: 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...626
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:

TCTTAAAGAT TTATGTTACA CTCTGTGAAA TCAAAATCAA AGGGGATAGC GTG TTA 56 

Val Leu 
1 

GAA AAA TCT TTT TTA AAA AGC AAG CAA TTA TTT TTA TGC GGA CTG GGT 104
Glu Lys Ser Phe Leu Lys Ser Lys Gln Leu Phe Leu Cys Gly Leu Gly
5 10 15

GTT TTG ATG CTG CAG GCT TGC ACT TGC CCA AAC ACT TCA CAA AGG AAT 152
Val Leu Met Leu Gln Ala Cys Thr Cys Pro Asn Thr Ser Gln Arg Asn
20 25 30

TCT TTC TTG CAA GAT GTG CCT TAT TGG ATG TTG CAA AAT CGC AGT GAG 200
Ser Phe Leu Gln Asp Val Pro Tyr Trp Met Leu Gln Asn Arg Ser Glu
35 40 45 50

TAT ATC ACG CAA GGG GTG GAT AGC TCG CAC ATT GTA GAT GGT AAG AAA 248
Tyr Ile Thr Gln Gly Val Asp Ser Ser His Ile Val Asp Gly Lys Lys
55 60 65

ACT GAA GAG ATA GAA AAA ATC GCT ACC AAA AGA GCG ACA ATA AGA GTG 296
Thr Glu Glu Ile Glu Lys Ile Ala Thr Lys Arg Ala Thr Ile Arg Val
70 75 80

GCA CAA AAT ATT GTG CAT AAA CTT AAA GAA GCT TAC CTT TCC AAA ACC 344
Ala Gln Asn Ile Val His Lys Leu Lys Glu Ala Tyr Leu Ser Lys Thr
85 90 95

AAT CGC ATC AAG CAA AAG ATC ACT AAT GAA ATG TTT ATC CAA ATG ACA 392
Asn Arg Ile Lys Gln Lys Ile Thr Asn Glu Met Phe Ile Gln Met Thr
100 105 110

CAG CCC ATT TAT GAC AGC TTG ATG AAT GTG GAT CGT TTA GGG ATT TAT 440
Gln Pro Ile Tyr Asp Ser Leu Met Asn Val Asp Arg Leu Gly Ile Tyr
115 120 125 130

ATC AAT CCT AAC AAT GAG GAA GTG TTT GCG TTA GTG CGC GCG CGT GGT 488
Ile Asn Pro Asn Asn Glu Glu Val Phe Ala Leu Val Arg Ala Arg Gly
135 140 145




WO 98/43478 PCT/US98/06371



Gln Gly Phe Ala Leu Ala Ile Leu Gly Leu His Thr Gln Phe Trp Ala
100 105 110

Leu Phe Val Phe Phe Ser Val Val Val Leu Leu Ala Val Leu Leu Phe
115 120 125

Phe Ala Pro Asn Ala Gln Pro Phe Lys Asp His Ser Leu Asn Ala Leu
130 135 140

Gln Lys Ile Ala Phe Tyr Val Phe Phe Met Val Val Gly Ser Asn Ala
145 150 155 160

Val Gln Ala Phe Ile Ser Thr Gly Pro Phe Pro Tyr Ile Gly Gln Ser
165 170 175

Asp Pro Val Arg Phe Ser Trp Asn Leu Lys Glu Ser Val Trp Ser Met
180 185 190

Glu Asn Trp Asp His Leu Lys Phe Pro Arg Ser Val Leu Gly Arg Arg
195 200 205

Asp Val Gly Glu Pro Leu Lys Leu Ser Ala Leu Pro Lys Asp Asn Asp
210 215 220

Tyr Glu Arg Ser Pro Leu Glu Ile Thr Lys Thr Leu Lys Ile Gly Lys
225 230 235 240

Lys Glu Glu Leu Phe Leu Lys Leu Asn Gly Ala Ile Thr Asp Leu Ser
245 250 255

Phe Asn Glu Asp Lys Ala Ile Leu Thr Thr Glu Asn Gln Gly Leu Tyr
260 265 270

Leu Val Ser Asn Asp Leu Lys Thr Ile His Ser His Met Val Leu Asp
275 280 285

Ser Tyr Tyr Ser Ala Thr Val Gly Ser Phe Val Gly Ala Asp Phe Asn
290 295 300

Glu Asp Glu Asn Ile Val Ile Met Gly Asn Asn Lys Thr Ser Val Glu
305 310 315 320

Ile Thr Pro Asn Lys Asn Ala Asn Met Leu Lys Asn Phe Pro Tyr Phe
325 330 335

Leu Glu Gly Val Asn Ser Phe Asp Glu Val Glu Arg Ser Arg Leu Lys
340 345 350

Thr Ser Arg Ala Lys Asn Tyr Tyr Val Ser Val Ala Arg Arg Gly Ala
355 360 365

Lys Phe Thr Tyr Leu Ile Ser Ala Pro Asn Lys Arg Tyr Lys Asp Leu
370 375 380

Ile Ile Ile Ser Met Arg Asn Ser Asp Lys Gln Val His Gly Glu Phe
385 390 395 400

Leu Leu Glu Leu Gly Asn Ala Lys Leu Lys Glu Lys Arg Gly Leu Gly
405 410 415

Glu Leu Val Ile Ser Thr Leu Ala Leu Lys Asp Asn Lys Leu Tyr Ala
420 425 430

Phe Ser Lys Glu Phe Asn Thr Leu Leu Val Ile Asp Pro Thr Lys Glu
435 440 445

Glu Ile Leu Glu Val Tyr Gly Leu Pro Lys Glu Ile Lys Asn Ile Ser
450 455 460

Ala Gly Gly Phe Arg Asn Asp Glu Leu Val Leu Val Ser Tyr Glu Asn
465 470 475 480

Asn Lys Asn Ile Leu Tyr Thr Leu Asn Phe
485 490



(2) INFORMATION FOR SEQ ID NO:377: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 679 base pairs

ATT TCC ATG CGT AAT AGC GAT AAA CAG GTG CAT GGG GAG TTT TTA CTG 1256
Ile Ser Met Arg Asn Ser Asp Lys Gln Val His Gly Glu Phe Leu Leu
390 395 400

GAA TTA GGC AAT GCC AAA CTT AAA GAA AAA AGG GGA TTG GGC GAG TTA 1304
Glu Leu Gly Asn Ala Lys Leu Lys Glu Lys Arg Gly Leu Gly Glu Leu
405 410 415

GTC ATT AGC ACT TTG GCT TTA AAG GAT AAT AAA CTT TAT GCG TTC AGT 1352
Val Ile Ser Thr Leu Ala Leu Lys Asp Asn Lys Leu Tyr Ala Phe Ser
420 425 430

AAG GAA TTT AAC ACG CTT TTA GTC ATA GAC CCT ACA AAA GAA GAG ATT 1400
Lys Glu Phe Asn Thr Leu Leu Val Ile Asp Pro Thr Lys Glu Glu Ile
435 440 445 450

CTT GAA GTT TAT GGC TTG CCT AAA GAG ATT AAA AAT ATC AGT GCT GGA 1448
Leu Glu Val Tyr Gly Leu Pro Lys Glu Ile Lys Asn Ile Ser Ala Gly
455 460 465

GGG TTT AGA AAC GAT GAG CTT GTC CTT GTG AGC TAT GAG AAT AAT AAA 1496
Gly Phe Arg Asn Asp Glu Leu Val Leu Val Ser Tyr Glu Asn Asn Lys
470 475 480

AAT ATT CTC TAC ACC CTT AAT TTT TAAACTCTTT TAAAGCTACT TTTTTCTAAT 1550
Asn Ile Leu Tyr Thr Leu Asn Phe
485 490

ATATTAACGC ATTAGAAGAT GGT 1573

(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

Met Asp Lys Glu Thr Arg Phe Tyr Asn Leu Phe Ser Leu Ala Ile Leu
1 5 10 15

Gly Ile Leu Ile Phe Pro Val Gly Leu Ala Asn Phe Tyr Phe Gly Tyr
20 25 30

Val Leu Lys Asp Ser Pro Cys Ile Phe Cys Trp Ala Gln Arg Ile Asn
35 40 45

Met Ile Leu Ile Gly Ala Val Ala Leu Leu Val Val Arg Phe Gly Phe
50 55 60

Lys Pro Lys Tyr Ile Ala Leu Leu Leu Leu Met Ala Ser Ser Gly Leu
65 70 75 80

Tyr Glu Ser Phe Tyr His Thr Gly Ser His Ala Leu Glu Asp Val Gly
85 90 95

GCG TTT ATT TCT ACC GGG CCT TTC CCT TAC ATA GGG CAA AGC GAT CCG 584
Ala Phe Ile Ser Thr Gly Pro Phe Pro Tyr Ile Gly Gln Ser Asp Pro
165 170 175

GTG CGT TTT TCG TGG AAT TTG AAA GAA TCG GTC TGG TCT ATG GAG AAT 632
Val Arg Phe Ser Trp Asn Leu Lys Glu Ser Val Trp Ser Met Glu Asn
180 185 190

TGG GAT CAT TTG AAA TTC CCA AGA AGC GTT TTG GGC AGA AGG GAT GTG 680
Trp Asp His Leu Lys Phe Pro Arg Ser Val Leu Gly Arg Arg Asp Val
195 200 205 210

GGC GAG CCT TTG AAA TTG AGC GCT TTG CCT AAA GAT AAC GAT TAT GAG 728
Gly Glu Pro Leu Lys Leu Ser Ala Leu Pro Lys Asp Asn Asp Tyr Glu
215 220 225

CGT TCG CCT TTA GAA ATT ACA AAA ACT CTA AAG ATT GGA AAA AAA GAA 776
Arg Ser Pro Leu Glu Ile Thr Lys Thr Leu Lys Ile Gly Lys Lys Glu
230 235 240

GAG CTT TTT TTA AAA TTG AAT GGA GCG ATC ACG GAT TTG AGT TTC AAT 824
Glu Leu Phe Leu Lys Leu Asn Gly Ala Ile Thr Asp Leu Ser Phe Asn
245 250 255

GAA GAC AAG GCG ATT CTT ACC ACA GAA AAC CAA GGG CTT TAT CTT GTA 872
Glu Asp Lys Ala Ile Leu Thr Thr Glu Asn Gln Gly Leu Tyr Leu Val
260 265 270

AGT AAC GAT TTG AAA ACC ATT CAT AGC CAT ATG GTG TTG GAT AGC TAT 920
Ser Asn Asp Leu Lys Thr Ile His Ser His Met Val Leu Asp Ser Tyr
275 280 285 290

TAT AGC GCG ACG GTG GGG TCG TTC GTG GGG GCG GAT TTC AAC GAA GAT 968
Tyr Ser Ala Thr Val Gly Ser Phe Val Gly Ala Asp Phe Asn Glu Asp
295 300 305

GAA AAC ATT GTG ATC ATG GGC AAT AAT AAA ACG AGC GTG GAA ATC ACT 1016
Glu Asn Ile Val Ile Met Gly Asn Asn Lys Thr Ser Val Glu Ile Thr
310 315 320

CCT AAC AAA AAC GCT AAC ATG CTT AAA AAC TTC CCT TAT TTT TTA GAA 1064
Pro Asn Lys Asn Ala Asn Met Leu Lys Asn Phe Pro Tyr Phe Leu Glu
325 330 335

GGG GTC AAC TCT TTT GAC GAA GTG GAA CGC AGC CGC TTG AAA ACT TCT 1112
Gly Val Asn Ser Phe Asp Glu Val Glu Arg Ser Arg Leu Lys Thr Ser
340 345 350

AGG GCG AAA AAC TAT TAT GTT AGC GTT GCA AGA AGA GGG GCT AAA TTC 1160
Arg Ala Lys Asn Tyr Tyr Val Ser Val Ala Arg Arg Gly Ala Lys Phe
355 360 365 370

ACT TAT TTG ATC AGC GCT CCT AAC AAG CGT TAT AAG GAT TTG ATT ATT 1208
Thr Tyr Leu Ile Ser Ala Pro Asn Lys Arg Tyr Lys Asp Leu Ile Ile
375 380 385

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1520
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375:

TAGCCCACAG CTCTAGCGTT TTGCCTAACA TGAATCTATA AGGGGGATGC ATG GAT 56
Met Asp
1

AAA GAA ACC CGA TTT TAC AAC CTT TTT TCT TTG GCA ATT TTA GGG ATT 104
Lys Glu Thr Arg Phe Tyr Asn Leu Phe Ser Leu Ala Ile Leu Gly Ile
5 10 15

TTG ATC TTT CCT GTG GGT TTG GCG AAT TTT TAT TTT GGC TAT GTT TTG 152
Leu Ile Phe Pro Val Gly Leu Ala Asn Phe Tyr Phe Gly Tyr Val Leu
20 25 30

AAA GAT TCG CCT TGT ATT TTT TGC TGG GCG CAA CGC ATC AAC ATG ATT 200
Lys Asp Ser Pro Cys Ile Phe Cys Trp Ala Gln Arg Ile Asn Met Ile
35 40 45 50

TTA ATA GGG GCT GTG GCG CTT TTG GTG GTG CGT TTT GGG TTT AAG CCT 248
Leu Ile Gly Ala Val Ala Leu Leu Val Val Arg Phe Gly Phe Lys Pro
55 60 65

AAA TAC ATT GCC TTG CTG TTG CTT ATG GCT AGT AGC GGG TTA TAT GAG 296
Lys Tyr Ile Ala Leu Leu Leu Leu Met Ala Ser Ser Gly Leu Tyr Glu
70 75 80

AGC TTT TAT CAT ACC GGT AGC CAT GCT TTA GAA GAT GTG GGG CAG GGA 344
Ser Phe Tyr His Thr Gly Ser His Ala Leu Glu Asp Val Gly Gln Gly
85 90 95

TTC GCG CTC GCT ATT TTG GGC TTG CAC ACG CAG TTT TGG GCG CTT TTT 392
Phe Ala Leu Ala Ile Leu Gly Leu His Thr Gln Phe Trp Ala Leu Phe
100 105 110

GTC TTT TTT AGC GTG GTG GTG CTT TTA GCG GTT TTG CTC TTT TTT GCC 440
Val Phe Phe Ser Val Val Val Leu Leu Ala Val Leu Leu Phe Phe Ala
115 120 125 130

CCT AAT GCC CAA CCT TTC AAA GAT CAT TCG TTA AAC GCG CTC CAA AAA 488
Pro Asn Ala Gln Pro Phe Lys Asp His Ser Leu Asn Ala Leu Gln Lys
135 140 145

ATC GCT TTT TAT GTT TTC TTT ATG GTG GTT GGT TCT AAC GCC GTG CAA 536
Ile Ala Phe Tyr Val Phe Phe Met Val Val Gly Ser Asn Ala Val Gln
150 155 160

Ala Leu Gln Asp Leu Val Lys Lys Glu His Ala His Met Leu Ser Ala
325 330 335

Phe Ile Glu Ser Ala Pro Arg Thr Val Lys Ser Leu Asp Val Gln Ile
340 345 350

Thr Gly Leu Pro Tyr Ile Asn Asp Asn Asn Leu Val Asn Ser Gly Val
355 360 365

Ile Leu Pro Ser Phe Pro Met Ala Ile Gly Asn Ile Thr Lys Thr Leu
370 375 380

Gly Tyr Lys Asn Asn Phe Tyr Tyr Gly Gly Ser Gly Ile Trp Asn Lys
385 390 395 400

Leu Thr Ser Phe Thr Lys Lys Gln Gly Phe His Ala Leu Tyr Phe Asn
405 410 415

Asn His Leu Leu Glu Phe Ala Gln Asn Lys Pro Tyr Pro Lys Pro Ile
420 425 430

Glu Ser Asn Trp Gly Val His Asp Asn Ile Leu Phe Asp Tyr Ile Leu
435 440 445

Glu Asn Thr Asn Pro His Glu Lys Thr Phe Ser Met Val Met Thr Leu
450 455 460

Ser Asn His Ala Ile Lys Asn Val Asn Leu Lys Ala Phe Gly Val Pro
465 470 475 480

Leu Glu Lys Ile Gln Gln Phe Val Glu Lys Thr Pro Lys Ser Glu Asn
485 490 495

Leu Pro Asp Ala Asn Ser Leu Gly His Ile Tyr Trp Tyr Asp Lys Val
500 505 510

Ile Val Ser Phe Ile Lys Lys Ala Ser Gln Lys Phe Pro Asn Ser Leu
515 520 525

Phe Ile Ile Thr Gly Asp His Phe Asp Arg Ser Tyr Glu Tyr Ala Lys
530 535 540

Asn Asp Leu Tyr Ile Ile Lys Ser Val Pro Leu Ile Leu Tyr Ala Pro
545 550 555 560

Thr Leu Lys Pro Lys Lys Ile Ser Gln Val Gly Ser His Leu Asp Ile
565 570 575

Ala Pro Thr Ile Ile Glu Leu Val Ala Pro Lys Gly Phe Gln Phe Val
580 585 590

Ser Phe Gly Lys Pro Leu Phe Ser Asn Asn Thr Thr Asn Pro Pro Ser
595 600 605

His Pro Asn Tyr Ala Leu Gly Tyr Glu Ala Ile Ala Thr Lys Asp Tyr
610 615 620

Phe Tyr Asn Pro Ser Leu Gly Leu Arg Tyr Leu Asn Glu Ser Pro Lys
625 630 635 640

Glu Pro Lys Asp Lys Gln Asn Asp Lys Ile Glu Ala Ser Lys Phe Tyr
645 650 655

Gln Gln Leu Glu Ser Leu Lys Ala Leu Ser Tyr Tyr Leu Leu Tyr His
660 665 670

Gly Ala Asn Leu Lys Asp
675

(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1573 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

ATTTTTGCTA CAATAATAGG ATTTTAATT 2162



(2) INFORMATION FOR SEQ ID NO: 374: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 678 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



Met Lys Ser Leu Ser Asn Ala Leu Phe Ser Leu Phe Leu Lys Gly Phe
1 5 10 15

Tyr Phe Thr Phe Phe Met Ser Leu Leu Phe Val Phe Asn Arg Ile Gly
20 25 30

Phe Ile Leu Tyr Thr Gly Tyr Tyr Lys His Ala Leu Lys Asn Pro Val
35 40 45

Phe Asp Glu Ile Ile Lys Thr Leu Phe Asn Gly Ala Arg Tyr Asp Asn
50 55 60

Arg Val Val Ser Ser Leu Ala Ile Leu Phe Ile Ile Ile Gly Leu Leu
65 70 75 80

Gly Leu Phe Ile Pro Lys His Gln Thr Lys Met Leu Asn Ile Val Ala
85 90 95

Tyr Phe Ser Ile Ala Ile Ile Leu Phe Leu Asn Ile Ala Asn Ile Val
100 105 110

Tyr Tyr Gly Ile Tyr Gly Asn Val Phe Asp Glu Asn Leu Leu Glu Phe
115 120 125

Leu His Glu Asp Thr Leu Thr Ile Leu Lys Met Ser Gly Glu Tyr Pro
130 135 140

Ile Phe Ser Ser Phe Ser Leu Phe Leu Ile Leu Ser Val Leu Thr Ser
145 150 155 160

Phe Ile Tyr Phe Lys Leu Gln Asn Asp Leu Phe Lys Pro Lys Asn Ala
165 170 175

Tyr Gln Ala Ala His Thr Lys Pro Leu Lys Thr Phe Ile Leu Phe Ala
180 185 190

Leu Phe Ser Leu Thr Gln Met Phe Tyr Ile Asn Ala Gln Leu Ser Phe
195 200 205

Val Gly Ala Ser Leu Asp Leu Ser Ile Glu Pro Ala Lys Asp Pro Phe
210 215 220

Leu Met Lys Ile Thr Pro Gly Ala Phe Arg Asn Leu Tyr Leu Leu Ala
225 230 235 240

Arg Asn Tyr Arg Gln Ser His Asn Leu Lys Phe Ser Asp Phe Ala Lys
245 250 255

Glu Thr Pro Leu Glu Val Ala Lys Asn Tyr Phe His Leu Lys Glu Asn
260 265 270

Pro Ser Asn Asn Leu Tyr Glu Leu Leu Thr Gln Thr Ser Arg Asn Asn
275 280 285

Ser Asn Gln Thr Ile Gln His Val Phe Tyr Ile Val Ser Glu Ser Leu
290 295 300

Ser Ser Trp His Phe Asp Pro Lys Phe Asp Ala Ile Gly Leu Thr Ser
305 310 315 320

TTA AGC AAC CAT GCG ATC AAA AAC GTG AAT CTC AAA GCC TTT GGC GTG 1502
Leu Ser Asn His Ala Ile Lys Asn Val Asn Leu Lys Ala Phe Gly Val
465 470 475

CCT TTA GAA AAA ATC CAA CAA TTT GTG GAA AAA ACC CCC AAA TCA GAA 1550
Pro Leu Glu Lys Ile Gln Gln Phe Val Glu Lys Thr Pro Lys Ser Glu
480 485 490 495

AAT TTA CCG GAC GCT AAT TCT TTA GGG CAT ATT TAC TGG TAT GAC AAA 1598
Asn Leu Pro Asp Ala Asn Ser Leu Gly His Ile Tyr Trp Tyr Asp Lys
500 505 510

GTA ATC GTC ACT TTC ATC AAA AAA GCC AGC CAA AAA TTC CCT AAC TCG 1646
Val Ile Val Ser Phe Ile Lys Lys Ala Ser Gln Lys Phe Pro Asn Ser
515 520 525

CTT TTT ATC ATC ACA GGG GAT CAT TTT GAC AGG AGC TAT GAA TAC GCT 1694
Leu Phe Ile Ile Thr Gly Asp His Phe Asp Arg Ser Tyr Glu Tyr Ala
530 535 540

AAA AAC GAT TTG TAT ATC ATT AAA TCC GTG CCG CTT ATT TTA TAT GCC 1742
Lys Asn Asp Leu Tyr Ile Ile Lys Ser Val Pro Leu Ile Leu Tyr Ala
545 550 555

CCT ACT TTA AAG CCT AAA AAA ATC ACT CAG GTC GGA TCG CAT TTA GAC 1790
Pro Thr Leu Lys Pro Lys Lys Ile Ser Gln Val Gly Ser His Leu Asp
560 565 570 575

ATC GCC CCT ACG ATT ATT GAA TTA GTC GCC CCT AAA GGC TTT CAA TTC 1838
Ile Ala Pro Thr Ile Ile Glu Leu Val Ala Pro Lys Gly Phe Gln Phe
580 585 590

GTG AGT TTT GGG AAG CCT TTA TTT TCT AAC AAT ACA ACA AAC CCC CCA 1886
Val Ser Phe Gly Lys Pro Leu Phe Ser Asn Asn Thr Thr Asn Pro Pro
595 600 605

AGC CAC CCC AAT TAC GCG CTA GGC TAT GAA GCG ATC GCT ACC AAA GAT 1934
Ser His Pro Asn Tyr Ala Leu Gly Tyr Glu Ala Ile Ala Thr Lys Asp
610 615 620

TAT TTT TAT AAC CCA AGT TTG GGG TTA AGG TAT TTG AAC GAA AGC CCT 1982
Tyr Phe Tyr Asn Pro Ser Leu Gly Leu Arg Tyr Leu Asn Glu Ser Pro
625 630 635

AAA GAG CCA AAG GAT AAA CAA AAC GAC AAA ATA GAA GCT TCT AAG TTT 2030
Lys Glu Pro Lys Asp Lys Gln Asn Asp Lys Ile Glu Ala Ser Lys Phe
640 645 650 655

TAT CAG CAA TTA GAA TCT TTG AAA GCC CTT AGT TAT TAC TTG CTC TAT 2078
Tyr Gln Gln Leu Glu Ser Leu Lys Ala Leu Ser Tyr Tyr Leu Leu Tyr
660 665 670

CAT GGG GCT AAT CTT AAA GAT TGACAAACTA GGTTTTTATT CCATTAAACG CATG 2133
His Gly Ala Asn Leu Lys Asp
675

GCA CGC AAT TAC AGA CAA AGC CAT AAC CTT AAA TTC AGC GAT TTT GCT 830
Ala Arg Asn Tyr Arg Gln Ser His Asn Leu Lys Phe Ser Asp Phe Ala
240 245 250 255

AAA GAA ACG CCT TTA GAA GTG GCG AAA AAT TAT TTC CAT CTT AAA GAG 878
Lys Glu Thr Pro Leu Glu Val Ala Lys Asn Tyr Phe His Leu Lys Glu
260 265 270

AAC CCT TCA AAC AAC CTC TAT GAG TTG CTA ACT CAG ACA AGC CGC AAC 926
Asn Pro Ser Asn Asn Leu Tyr Glu Leu Leu Thr Gln Thr Ser Arg Asn
275 280 285

AAT TCC AAT CAA ACC ATT CAA CAT GTT TTT TAT ATC GTT TCA GAG TCT 974
Asn Ser Asn Gln Thr Ile Gln His Val Phe Tyr Ile Val Ser Glu Ser
290 295 300

TTG AGT TCA TGG CAT TTT GAT CCA AAA TTT GAC GCT ATA GGG CTA ACG 1022
Leu Ser Ser Trp His Phe Asp Pro Lys Phe Asp Ala Ile Gly Leu Thr
305 310 315

AGC GCT TTA CAA GAC TTG GTT AAA AAA GAG CAT GCC CAC ATG CTT TCT 1070
Ser Ala Leu Gln Asp Leu Val Lys Lys Glu His Ala His Met Leu Ser
320 325 330 335

GCT TTT ATT GAA AGC GCC CCA CGG ACC GTT AAA AGC CTA GAT GTC CAA 1118
Ala Phe Ile Glu Ser Ala Pro Arg Thr Val Lys Ser Leu Asp Val Gln
340 345 350

ATC ACA GGC TTA CCC TAT ATC AAT GAT AAT AAC TTA GTC AAT TCA GGG 1166
Ile Thr Gly Leu Pro Tyr Ile Asn Asp Asn Asn Leu Val Asn Ser Gly
355 360 365

GTG ATC CTC CCT AGC TTT CCT ATG GCG ATT GGC AAT ATC ACA AAA ACT 1214
Val Ile Leu Pro Ser Phe Pro Met Ala Ile Gly Asn Ile Thr Lys Thr
370 375 380

CTG GGT TAT AAA AAC AAC TTT TAT TAT GGG GGT AGC GGG ATT TGG AAC 1262
Leu Gly Tyr Lys Asn Asn Phe Tyr Tyr Gly Gly Ser Gly Ile Trp Asn
385 390 395

AAA CTC ACT AGT TTC ACC AAA AAA CAA GGT TTT CAC GCC CTT TAT TTC 1310
Lys Leu Thr Ser Phe Thr Lys Lys Gln Gly Phe His Ala Leu Tyr Phe
400 405 410 415

AAT AAC CAT CTC TTA GAA TTT GCC CAA AAC AAG CCC TAC CCT AAA CCC 1358
Asn Asn His Leu Leu Glu Phe Ala Gln Asn Lys Pro Tyr Pro Lys Pro
420 425 430

ATA GAG AGC AAC TGG GGA GTG CAT GAT AAT ATT TTA TTT GAC TAT ATT 1406
Ile Glu Ser Asn Trp Gly Val His Asp Asn Ile Leu Phe Asp Tyr Ile
435 440 445

TTA GAA AAC ACC AAC CCC CAT GAA AAA ACT TTC AGC ATG GTC ATG ACT 1454
Leu Glu Asn Thr Asn Pro His Glu Lys Thr Phe Ser Met Val Met Thr
450 455 460

TTT TAT TTC ACC TTT TTT ATG AGC TTG TTG TTT GTG TTT AAT CGT ATC 158
Phe Tyr Phe Thr Phe Phe Met Ser Leu Leu Phe Val Phe Asn Arg Ile
20 25 30

GGC TTT ATC CTT TAT ACT GGC TAT TAT AAG CAT GCT TTA AAA AAC CCT 206
Gly Phe Ile Leu Tyr Thr Gly Tyr Tyr Lys His Ala Leu Lys Asn Pro
35 40 45

GTT TTT GAT GAA ATC ATC AAA ACC CTA TTC AAT GGA GCC AGA TAT GAT 254
Val Phe Asp Glu Ile Ile Lys Thr Leu Phe Asn Gly Ala Arg Tyr Asp
50 55 60

AAT CGT GTG GTC TCA AGC TTA GCG ATT CTT TTT ATC ATC ATC GGG TTA 302
Asn Arg Val Val Ser Ser Leu Ala Ile Leu Phe Ile Ile Ile Gly Leu
65 70 75

TTG GGG TTA TTT ATC CCT AAA CAC CAA ACC AAA ATG CTT AAT ATT GTG 350
Leu Gly Leu Phe Ile Pro Lys His Gln Thr Lys Met Leu Asn Ile Val
80 85 90 95

GCG TAT TTT TCT ATC GCT ATT ATC CTG TTT TTA AAC ATT GCA AAC ATT 398
Ala Tyr Phe Ser Ile Ala Ile Ile Leu Phe Leu Asn Ile Ala Asn Ile
100 105 110

GTT TAT TAT GGT ATT TAT GGG AAT GTG TTT GAT GAA AAT TTA TTG GAA 446
Val Tyr Tyr Gly Ile Tyr Gly Asn Val Phe Asp Glu Asn Leu Leu Glu
115 120 125

TTT TTG CAT GAA GAC ACG CTC ACG ATT TTA AAA ATG AGC GGG GAA TAG 494
Phe Leu His Glu Asp Thr Leu Thr Ile Leu Lys Met Ser Gly Glu Tyr
130 135 140

CCT ATT TTT TCT AGT TTT TCA CTC TTT TTA ATC CTT AGC GTT TTA ACC 542
Pro Ile Phe Ser Ser Phe Ser Leu Phe Leu Ile Leu Ser Val Leu Thr
145 150 155

TCT TTT ATC TAT TTC AAA CTC CAA AAC GAC CTT TTT AAA CCC AAA AAT 590
Ser Phe Ile Tyr Phe Lys Leu Gln Asn Asp Leu Phe Lys Pro Lys Asn
160 165 170 175

GCT TAT CAA GCC GCC CAC ACC AAA CCC CTT AAA ACT TTC ATT TTA TTT 638
Ala Tyr Gln Ala Ala His Thr Lys Pro Leu Lys Thr Phe Ile Leu Phe
180 185 190

GCG CTT TTT TCC CTC ACA CAA ATG TTT TAC ATT AAC GCG CAA TTG AGT 686
Ala Leu Phe Ser Leu Thr Gln Met Phe Tyr Ile Asn Ala Gln Leu Ser
195 200 205

TTT GTG GGC GCG TCT TTA GAT CTC AGC ATA GAG CCA GCC AAA GAT CCT 734
Phe Val Gly Ala Ser Leu Asp Leu Ser Ile Glu Pro Ala Lys Asp Pro
210 215 220

TTT TTA ATG AAA ATT ACC CCC GGA GCG TTT CGC AAC CTT TAT CTT TTA 782
Phe Leu Met Lys Ile Thr Pro Gly Ala Phe Arg Asn Leu Tyr Leu Leu
225 230 235

65 70 75 80

Glu Ala Lys Leu Leu Ser Leu Ile Lys Asp Tyr Asn Thr Asp Gln Asn
85 90 95

Ile Ser Gly Val Leu Val Gln Leu Pro Leu Pro Arg His Ile Asp Thr
100 105 110

Lys Met Ile Leu Glu Ala Ile Asp Pro Asn Lys Asp Val Asp Gly Phe
115 120 125

His Pro Leu Asn Ile Gly Lys Leu Cys Thr Gln Lys Glu Ser Phe Leu
130 135 140

Pro Ala Thr Pro Met Gly Val Met Arg Leu Leu Glu His Tyr His Ile
145 150 155 160

Glu Ile Lys Gly Lys Asp Val Ala Ile Ile Gly Ala Ser Asn Ile Ile
165 170 175

Gly Lys Pro Leu Ser Met Leu Met Leu Asn Ala Gly Ala Ser Val Ser
180 185 190

Val Cys His Ile Leu Thr Lys Asp Ile Ser Phe Tyr Thr Gln Asn Ala
195 200 205

Asp Ile Val Cys Val Gly Val Gly Lys Pro Asp Leu Ile Lys Ala Ser
210 215 220

Met Leu Lys Lys Gly Ala Val Val Val Asp Ile Gly Ile Asn His Leu
225 230 235 240

Asn Asp Gly Arg Ile Val Gly Asp Val Asp Phe Asn Asn Val Gln Lys
245 250 255

Val Ala Gly Phe Ile Thr Pro Val Pro Lys Gly Val Gly Pro Met Thr
260 265 270

Ile Val Ser Leu Leu Glu Asn Thr Leu Ile Ala Phe Glu Lys Gln Gln
275 280 285

Arg Lys Gly Phe
290



(2) INFORMATION FOR SEQ ID NO: 373: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 66... 2099 
(D) OTHER INFORMATION: 
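
The Coding Sequence LOCATION in a feature table can be cross-checked against the LENGTH stated for the corresponding protein record. The sketch below is illustrative only and is not part of the listing as filed. It assumes that each LOCATION is an inclusive, 1-based nucleotide range made up of whole sense codons, with the stop codon lying outside the range; the helper name `cds_residue_count` is hypothetical.

```python
def cds_residue_count(start: int, end: int) -> int:
    """Return the number of codons in an inclusive, 1-based coding range.

    Assumption (not stated in the listing): the range holds only sense
    codons, so the codon count equals the length of the encoded protein.
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {start}...{end} is not a whole number of codons")
    return length // 3


# LOCATION values copied from feature tables in this listing.
locations = {
    373: (66, 2099),
    371: (56, 931),
}

for seq_id, (start, end) in locations.items():
    print(f"SEQ ID NO: {seq_id}: {cds_residue_count(start, end)} residues")
```

Under these assumptions the range 66...2099 gives 678 residues and 56...931 gives 292 residues, which agree with the LENGTH fields of the protein records for SEQ ID NO: 374 and SEQ ID NO: 372.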



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:373: 



CCCTTGCTTC TTTTTGTCTT TTTTAAGACT TTATCTCTTG TTAAAAAAAG GTTGTATTAA 60

CGCTT ATG AAA TCC CTA TCT AAT GCC CTT TTT TCG CTC TTT TTA AAA GGT 110
Met Lys Ser Leu Ser Asn Ala Leu Phe Ser Leu Phe Leu Lys Gly
1 5 10 15

Lys Pro Leu Ser Met Leu Met Leu Asn Ala Gly Ala Ser Val Ser Val
180 185 190

TGC CAT ATT TTG ACT AAA GAC ATT ACT TTT TAC ACC CAA AAC GCT GAT 682
Cys His Ile Leu Thr Lys Asp Ile Ser Phe Tyr Thr Gln Asn Ala Asp
195 200 205

ATT GTC TGC GTG GGC GTG GGT AAA CCT GAT TTG ATT AAA GCG AGC ATG 730
Ile Val Cys Val Gly Val Gly Lys Pro Asp Leu Ile Lys Ala Ser Met
210 215 220 225

TTA AAA AAA GGG GCT GTA GTG GTG GAT ATT GGG ATC AAT CAT TTG AAC 778
Leu Lys Lys Gly Ala Val Val Val Asp Ile Gly Ile Asn His Leu Asn
230 235 240

GAT GGG CGT ATC GTG GGC GAT GTG GAT TTT AAC AAC GTG CAA AAA GTC 826
Asp Gly Arg Ile Val Gly Asp Val Asp Phe Asn Asn Val Gln Lys Val
245 250 255

GCC GGT TTT ATC ACC CCT GTG CCT AAA GGC GTG GGG CCT ATG ACG ATT 874
Ala Gly Phe Ile Thr Pro Val Pro Lys Gly Val Gly Pro Met Thr Ile
260 265 270

GTC TCG CTT TTA GAA AAC ACT CTA ATC GCT TTT GAA AAA CAA CAA AGG 922
Val Ser Leu Leu Glu Asn Thr Leu Ile Ala Phe Glu Lys Gln Gln Arg
275 280 285

AAG GGA TTT TAATGAAATT TTTACGCTCT GTTTATGCAT TTTGCTCCAG TTGGGTAGG 980
Lys Gly Phe
290

GACGATTGTT ATTGTGCTGT TGGT 1004

(2) INFORMATION FOR SEQ ID NO:372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372:

Met Gly Met Pro Asn Arg Gly Val Val Leu Leu Asp Gly Gln Ala Leu
1 5 10 15

Ala Asp Asn Ile Glu Lys Asp Leu Lys His Lys Ile Gln Ile Ile Thr
20 25 30

Ala Gln Thr His Lys Arg Pro Lys Leu Ala Val Ile Leu Val Gly Lys
35 40 45

Asp Pro Ala Ser Ile Thr Tyr Val Asn Met Lys Ile Lys Ala Cys Glu
50 55 60

Arg Val Gly Met Asp Phe Asp Leu Lys Thr Leu Gln Glu Asn Ile Thr

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:

CATTAAACGC ATGATTTTTG CTACAATAAT AGGATTTTAA TTATATAAAG GACAA ATG 58
Met
1

GGC ATG CCA AAT AGG GGC GTT GTT TTA TTA GAC GGG CAA GCG CTA GCT 106
Gly Met Pro Asn Arg Gly Val Val Leu Leu Asp Gly Gln Ala Leu Ala
5 10 15

GAT AAT ATA GAA AAA GAT TTG AAA CAT AAA ATC CAA ATA ATA ACC GCA 154
Asp Asn Ile Glu Lys Asp Leu Lys His Lys Ile Gln Ile Ile Thr Ala
20 25 30

CAA ACG CAT AAA CGC CCC AAA CTA GCC GTG ATT TTA GTG GGG AAA GAT 202
Gln Thr His Lys Arg Pro Lys Leu Ala Val Ile Leu Val Gly Lys Asp
35 40 45

CCC GCT AGT ATC ACT TAT GTC AAT ATG AAG ATC AAA GCA TGC GAA AGG 250
Pro Ala Ser Ile Thr Tyr Val Asn Met Lys Ile Lys Ala Cys Glu Arg
50 55 60 65

GTG GGC ATG GAT TTT GAC TTA AAA ACC CTC CAA GAA AAT ATT ACT GAA 298
Val Gly Met Asp Phe Asp Leu Lys Thr Leu Gln Glu Asn Ile Thr Glu
70 75 80

GCC AAA TTG CTA TCC TTG ATT AAA GAT TAC AAT ACC GAT CAA AAC ATT 346
Ala Lys Leu Leu Ser Leu Ile Lys Asp Tyr Asn Thr Asp Gln Asn Ile
85 90 95

TCA GGC GTT TTA GTC CAG CTC CCT TTG CCC AGA CAC ATT GAT ACT AAA 394
Ser Gly Val Leu Val Gln Leu Pro Leu Pro Arg His Ile Asp Thr Lys
100 105 110

ATG ATT TTA GAA GCC ATT GAC CCA AAC AAA GAT GTG GAT GGT TTC CAC 442
Met Ile Leu Glu Ala Ile Asp Pro Asn Lys Asp Val Asp Gly Phe His
115 120 125

CCC CTT AAT ATC GGT AAG CTC TGC ACT CAA AAA GAA TCG TTT CTG CCA 490
Pro Leu Asn Ile Gly Lys Leu Cys Thr Gln Lys Glu Ser Phe Leu Pro
130 135 140 145

GCC ACC CCT ATG GGC GTG ATG CGG CTT TTA GAG CAT TAC CAT ATT GAA 538
Ala Thr Pro Met Gly Val Met Arg Leu Leu Glu His Tyr His Ile Glu
150 155 160

ATC AAG GGT AAG GAT GTG GCG ATT ATT GGA GCG AGC AAT ATC ATT GGC 586
Ile Lys Gly Lys Asp Val Ala Ile Ile Gly Ala Ser Asn Ile Ile Gly
165 170 175

AAA CCT TTA AGC ATG CTC ATG CTA AAC GCT GGG GCT AGC GTG AGC GTG 634

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 213 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:

Lys Lys Gln Leu Lys Asn Cys Val Lys Leu Lys Leu Phe Leu Arg Lys
1 5 10 15

Asn Leu Asp Ile Leu Asp Leu Asn Lys Ala Gln Ala Val Gln Gln Asn
20 25 30

Glu Gln Glu Val Glu Asp Lys Glu Arg Glu Ser Lys Glu Pro Val Val
35 40 45

Leu Glu Asp Leu Ser Ala Leu Ala Trp Leu Glu Leu Glu Glu Phe Ser
50 55 60

Arg Leu Ser Gly Leu Pro Lys Glu Arg Ile Leu Glu Leu Val Asn Leu
65 70 75 80

Gly Lys Ile Lys Ser Lys Ile Ser Ser Asn Lys Leu Leu Ile Asp Ala
85 90 95

Ser Ser Gly Thr Asn Ala Leu Ile Lys Lys Val Glu Asn Ser Leu Ile
100 105 110

Ser Met Asp Met Asn Gly Arg Ser Leu Glu Pro Val Phe Val Glu Lys
115 120 125

Thr Ile Asn Thr Ile Leu Asn Leu His Asp Lys Val Ile Gly Ala Lys
130 135 140

Asp Glu Thr Ile Ser Ala Phe Lys Asn Glu Asn Met Phe Leu Lys Asp
145 150 155 160

Ala Leu Ile Ser Met Gln Glu Val Tyr Glu Glu Asp Lys Lys Thr Ile
165 170 175

Asp Leu Leu Arg Asp Glu Leu Asn Gln Ala Arg Glu Glu Ile Glu Phe
180 185 190

Met Lys Arg Lys Tyr Arg Leu Met Trp Gly Lys Val Ala Asp Met Ser
195 200 205

Ser Val Asn Lys Lys
210



(2) INFORMATION FOR SEQ ID NO: 371:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1004 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 56...931
(D) OTHER INFORMATION:

GAT ATT TTA GAT TTG AAC AAA GCG CAA GCG GTG CAA CAA AAT GAA CAA 152
Asp Ile Leu Asp Leu Asn Lys Ala Gln Ala Val Gln Gln Asn Glu Gln
20 25 30

GAG GTA GAG GAT AAA GAG CGA GAG TCT AAA GAG CCG GTG GTT TTA GAA 200
Glu Val Glu Asp Lys Glu Arg Glu Ser Lys Glu Pro Val Val Leu Glu
35 40 45 50

GAT TTG AGC GCT TTA GCG TGG CTT GAA TTA GAA GAG TTT AGC CGC GTT 248
Asp Leu Ser Ala Leu Ala Trp Leu Glu Leu Glu Glu Phe Ser Arg Leu
55 60 65

TCA GGG CTT CCT AAA GAA AGG ATT TTG GAA TTA GTG AAT CTT GGT AAA 296
Ser Gly Leu Pro Lys Glu Arg Ile Leu Glu Leu Val Asn Leu Gly Lys
70 75 80

ATC AAG AGC AAA ATA AGC AGC AAC AAG CTT TTA ATT GAT GCG AGC AGC 344
Ile Lys Ser Lys Ile Ser Ser Asn Lys Leu Leu Ile Asp Ala Ser Ser
85 90 95

GGG ACA AAC GCT TTA ATC AAA AAG GTA GAA AAT AGT TTG ATT TCT ATG 392
Gly Thr Asn Ala Leu Ile Lys Lys Val Glu Asn Ser Leu Ile Ser Met
100 105 110

GAT ATG AAC GGG CGT TCT TTA GAA CCT GTG TTT GTG GAA AAG ACC ATT 440
Asp Met Asn Gly Arg Ser Leu Glu Pro Val Phe Val Glu Lys Thr Ile
115 120 125 130

AAC ACG ATT TTA AAC TTG CAT GAT AAG GTC ATT GGC GCT AAA GAT GAA 488
Asn Thr Ile Leu Asn Leu His Asp Lys Val Ile Gly Ala Lys Asp Glu
135 140 145

ACG ATT TCA GCC TTT AAA AAT GAA AAC ATG TTT TTA AAA GAC GCT TTA 536
Thr Ile Ser Ala Phe Lys Asn Glu Asn Met Phe Leu Lys Asp Ala Leu
150 155 160

ATC TCT ATG CAA GAA GTC TAT GAA GAA GAT AAA AAA ACC ATT GAT CTT 584
Ile Ser Met Gln Glu Val Tyr Glu Glu Asp Lys Lys Thr Ile Asp Leu
165 170 175

TTG CGC GAT GAA CTC AAT CAA GCG AGA GAA GAA ATT GAA TTT ATG AAG 632
Leu Arg Asp Glu Leu Asn Gln Ala Arg Glu Glu Ile Glu Phe Met Lys
180 185 190

AGG AAA TAC CGC TTG ATG TGG GGG AAA GTC GCT GAC ATG AGC AGC GTG 680
Arg Lys Tyr Arg Leu Met Trp Gly Lys Val Ala Asp Met Ser Ser Val
195 200 205 210

AAT AAA AAG TAGTTTTAAA TTAACGCCCA TGCTGAGGGC TTATTAGCGG TAATTTTAG 738
Asn Lys Lys

GTGA 742

(2) INFORMATION FOR SEQ ID NO: 370:

Asp Asp Asp Lys Pro Lys Pro Leu Thr Ile Asp Lys Asn Asn Glu Lys
50 55 60

Gln Gln Ile Leu Asp Lys Asn Gln Gln Ile Leu Lys Arg Ala Leu Glu
65 70 75 80

Lys Ser Leu Lys Phe Phe Phe Ile Phe Gly Tyr Asn Tyr Ser Gln Ala
85 90 95

Thr Phe Ser Thr Ser Asn Gln Thr Leu Thr Phe Val Ala Asn Ser Ile
100 105 110

Gly Phe Asn Thr Ala Thr Gly Leu Glu His Phe Leu Arg Asn His Pro
115 120 125

Lys Val Gly Phe Arg Ile Phe Ser Val Tyr Asn Tyr Phe His Ser Val
130 135 140

Ser Leu Ser Gln Pro Gln Thr Leu Met Val Gln Asn Tyr Gly Gly Ala
145 150 155 160

Leu Asp Phe Ser Trp Ile Phe Val Asp Lys Asn Ile Tyr Arg Phe Arg
165 170 175

Ser Tyr Leu Gly Ile Ala Leu Glu Gln Gly Val Leu Leu Val Asp Thr
180 185 190

Ile Lys Pro Gly Ala Ile Thr Thr Ile Ile Pro Arg Thr Lys Lys Thr
195 200 205

Phe Phe Gln Ala Pro Leu Arg Phe Gly Phe Ile Val Asp Phe Ile Gly
210 215 220

Tyr Leu Ser Leu Gln Leu Gly Ile Glu Met Pro Leu Val Arg Asn Val
225 230 235 240

Phe Tyr Thr Tyr Asn Asn His Gln Glu Arg Phe Lys Pro Arg Phe Asn
245 250 255

Ala Asn Leu Ser Leu Ile Val Ser Phe
260 265



(2) INFORMATION FOR SEQ ID NO: 369:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 742 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...689
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:

GCACTTCACG CATTCCATAA ACGATATTGA TTATTTGCTA GACAGCTTGA AAA AAG 56
Lys Lys
1

GAG TTA AAA AAT TGC GTT AAG CTA AAA CTA TTT TTA AGG AAA AAT TTG 104
Gln Leu Lys Asn Cys Val Lys Leu Lys Leu Phe Leu Arg Lys Asn Leu
5 10 15

135 140 145

TCC CAG CCT CAA ACC TTA ATG GTG CAA AAT TAT GGG GGC GCG TTA GAT 536
Ser Gln Pro Gln Thr Leu Met Val Gln Asn Tyr Gly Gly Ala Leu Asp
150 155 160

TTT TCT TGG ATT TTT GTA GAT AAA AAT ATT TAT CGC TTT AGG AGT TAT 584
Phe Ser Trp Ile Phe Val Asp Lys Asn Ile Tyr Arg Phe Arg Ser Tyr
165 170 175

TTA GGG ATC GCT TTA GAA CAA GGG GTG TTG TTA GTG GAT ACG ATT AAA 632
Leu Gly Ile Ala Leu Glu Gln Gly Val Leu Leu Val Asp Thr Ile Lys
180 185 190

CCA GGT GCT ATC ACA ACG ATT ATC CCA AGA ACC AAA AAA ACC TTT TTT 680
Pro Gly Ala Ile Thr Thr Ile Ile Pro Arg Thr Lys Lys Thr Phe Phe
195 200 205 210

CAA GCC CCT TTG CGT TTT GGT TTT ATC GTG GAT TTT ATC GGC TAT TTG 728
Gln Ala Pro Leu Arg Phe Gly Phe Ile Val Asp Phe Ile Gly Tyr Leu
215 220 225

TCT TTG CAA TTA GGG ATT GAA ATG CCT TTA GTG AGG AAT GTT TTT TAC 776
Ser Leu Gln Leu Gly Ile Glu Met Pro Leu Val Arg Asn Val Phe Tyr
230 235 240

ACC TAC AAC AAC CAT CAA GAA AGA TTC AAA CCA CGA TTT AAC GCT AAT 824
Thr Tyr Asn Asn His Gln Glu Arg Phe Lys Pro Arg Phe Asn Ala Asn
245 250 255

CTT TCT TTA ATC GTT TCG TTT TAGCCCCCCT TTTCCCCTTT TAAATAAGCC CATG 879
Leu Ser Leu Ile Val Ser Phe
260 265

ATTTTCCTAG GGTATTTTA 898 

(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 265 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

Leu Gly Thr Ile Ala Thr Lys Ser Leu Arg Leu Tyr Leu Lys Thr Leu
1 5 10 15

Phe Ser Ile Tyr Leu Phe Leu Ser Leu Asn Pro Leu Phe Leu Glu Ala
20 25 30

Asn Glu Ile Thr Trp Ser Lys Phe Leu Glu Asn Phe Lys Asn Lys Asn
35 40 45
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 898 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...845
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:

GGGGGGTTAT AGTAAAAACA TGCAAGTAAT TTAAAGTTAA TTTAAGATAA TTA GGC 56
Leu Gly
1

ACA ATA GCC ACA AAA AGT TTA AGG CTG TAT TTG AAA ACT CTA TTT AGT 104
Thr Ile Ala Thr Lys Ser Leu Arg Leu Tyr Leu Lys Thr Leu Phe Ser
5 10 15

ATT TAT CTC TTT TTA TCG TTG AAC CCA CTC TTT TTA GAA GCT AAT GAA 152
Ile Tyr Leu Phe Leu Ser Leu Asn Pro Leu Phe Leu Glu Ala Asn Glu
20 25 30

ATC ACT TGG TCT AAA TTC TTG GAA AAT TTT AAA AAC AAG AAT GAT GAT 200
Ile Thr Trp Ser Lys Phe Leu Glu Asn Phe Lys Asn Lys Asn Asp Asp
35 40 45 50

GAC AAA CCT AAA CCC CTA ACT ATT GAT AAA AAC AAT GAA AAA CAG CAA 248
Asp Lys Pro Lys Pro Leu Thr Ile Asp Lys Asn Asn Glu Lys Gln Gln
55 60 65

ATC TTA GAC AAA AAC CAG CAA ATC TTA AAA AGG GCT TTG GAA AAA AGC 296
Ile Leu Asp Lys Asn Gln Gln Ile Leu Lys Arg Ala Leu Glu Lys Ser
70 75 80

CTT AAA TTC TTT TTC ATT TTT GGA TAC AAC TAT TCG CAA GCC ACT TTT 344
Leu Lys Phe Phe Phe Ile Phe Gly Tyr Asn Tyr Ser Gln Ala Thr Phe
85 90 95

TCA ACT TCT AAC CAA ACC TTG ACT TTT GTA GCC AAT AGC ATA GGG TTT 392
Ser Thr Ser Asn Gln Thr Leu Thr Phe Val Ala Asn Ser Ile Gly Phe
100 105 110

AAC ACC GCT ACC GGT TTA GAG CAT TTT TTA AGA AAC CAC CCT AAA GTC 440
Asn Thr Ala Thr Gly Leu Glu His Phe Leu Arg Asn His Pro Lys Val
115 120 125 130

GGT TTT AGA ATC TTT AGC GTC TAT AAC TAT TTC CAT TCT GTT TCC CTC 488
Gly Phe Arg Ile Phe Ser Val Tyr Asn Tyr Phe His Ser Val Ser Leu
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(A) LENGTH: 33 3 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:

Met Leu Val Thr Arg Phe Lys Lys Ala Leu Ile Ser Tyr Ser Leu Gly
1 5 10 15

Ala Leu Leu Val Ser Ser Leu Leu Gly Val Ala Ser Ala Ser Asn Gln
20 25 30

Glu Ile Gln Val Lys Asp Tyr Phe Gly Asp Gln Ala Ile Lys Leu Pro
35 40 45

Val Ser Lys Ile Ile Tyr Leu Gly Ser Phe Ala Glu Val Pro Ala Met
50 55 60

Phe His Thr Trp Asp Arg Val Val Gly Ile Ser Asp Tyr Ala Phe Lys
65 70 75 80

Ser Asp Ile Val Lys Ala Thr Leu Lys Asp Pro Lys Arg Ile Lys Ser
85 90 95

Met Ser Ser Asp His Val Ala Ala Leu Asn Val Glu Leu Leu Lys Lys
100 105 110

Leu Gly Pro Asp Leu Val Val Thr Phe Val Gly Asn Pro Lys Ala Val
115 120 125

Glu His Ala Lys Lys Phe Gly Ile Leu Phe Leu Ser Phe Gln Glu Lys
130 135 140

Thr Ile Ala Glu Val Met Glu Asp Ile Asp Ala Gln Ala Lys Ala Leu
145 150 155 160

Glu Ile Asp Ala Ser Lys Lys Leu Ala Lys Met Gln Glu Thr Leu Asp
165 170 175

Phe Ile Ala Glu Arg Leu Lys Gly Val Lys Lys Lys Lys Gly Val Glu
180 185 190

Leu Phe His Lys Ala Asn Lys Ile Ser Gly His Gln Ala Leu Asp Ser
195 200 205

Asp Ile Leu Glu Lys Gly Gly Ile Asp Asn Phe Gly Leu Lys Tyr Val
210 215 220

Lys Phe Gly Arg Ala Asp Ile Ser Val Glu Lys Ile Val Lys Glu Asn
225 230 235 240

Pro Glu Ile Ile Phe Ile Trp Trp Ile Ser Pro Leu Thr Pro Glu Asp
245 250 255

Val Leu Asn Asn Pro Lys Phe Ala Thr Ile Lys Ala Ile Lys Asn Lys
260 265 270

Gln Val Tyr Lys Leu Pro Thr Met Asp Ile Gly Gly Pro Arg Ala Pro
275 280 285

Leu Ile Ser Leu Phe Ile Ala Leu Lys Ala His Pro Glu Ala Phe Lys
290 295 300

Gly Val Asp Ile Asn Ala Ile Val Lys Asp Tyr Tyr Lys Val Val Phe
305 310 315 320

Asp Leu Asn Asp Ala Glu Val Glu Pro Phe Leu Trp His
325 330

(2) INFORMATION FOR SEQ ID NO: 367: 
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140 145 150 

ATT GAC GCT CAA GCT AAA GCC TTA GAA ATT GAT GCT TCT AAA AAA CTG 592
Ile Asp Ala Gln Ala Lys Ala Leu Glu Ile Asp Ala Ser Lys Lys Leu
155 160 165

GCC AAA ATG CAA GAA ACT TTG GAT TTT ATT GCT GAG CGT TTG AAA GGT 640
Ala Lys Met Gln Glu Thr Leu Asp Phe Ile Ala Glu Arg Leu Lys Gly
170 175 180

GTC AAA AAG AAA AAA GGG GTG GAG CTT TTC CAT AAG GCC AAT AAG ATC 688
Val Lys Lys Lys Lys Gly Val Glu Leu Phe His Lys Ala Asn Lys Ile
185 190 195 200

AGC GGC CAT CAA GCC CTT GAT TCA GAC ATT TTA GAA AAA GGA GGC ATA 736
Ser Gly His Gln Ala Leu Asp Ser Asp Ile Leu Glu Lys Gly Gly Ile
205 210 215

GAC AAT TTT GGC TTG AAA TAT GTC AAA TTT GGG CGT GCT GAC ATT AGC 784
Asp Asn Phe Gly Leu Lys Tyr Val Lys Phe Gly Arg Ala Asp Ile Ser
220 225 230

GTG GAA AAA ATC GTT AAA GAA AAC CCT GAG ATT ATC TTT ATT TGG TGG 832
Val Glu Lys Ile Val Lys Glu Asn Pro Glu Ile Ile Phe Ile Trp Trp
235 240 245

ATA AGC CCA CTC ACG CCT GAA GAT GTG TTA AAC AAC CCC AAA TTT GCT 880
Ile Ser Pro Leu Thr Pro Glu Asp Val Leu Asn Asn Pro Lys Phe Ala
250 255 260

ACC ATC AAA GCC ATT AAA AAC AAG CAG GTT TAT AAA CTC CCC ACA ATG 928
Thr Ile Lys Ala Ile Lys Asn Lys Gln Val Tyr Lys Leu Pro Thr Met
265 270 275 280

GAT ATT GGC GGG CCT AGA GCC CCA CTC ATA AGT CTT TTT ATC GCT CTA 976
Asp Ile Gly Gly Pro Arg Ala Pro Leu Ile Ser Leu Phe Ile Ala Leu
285 290 295

AAA GCC CAC CCT GAA GCC TTT AAG GGC GTG GAT ATT AAT GCG ATT GTT 1024
Lys Ala His Pro Glu Ala Phe Lys Gly Val Asp Ile Asn Ala Ile Val
300 305 310

AAA GAC TAC TAT AAA GTG GTT TTT GAT TTG AAT GAT GCA GAG GTT GAA 1072
Lys Asp Tyr Tyr Lys Val Val Phe Asp Leu Asn Asp Ala Glu Val Glu
315 320 325

CCC TTT TTA TGG CAT TAATTTTTAA AAAAGGGCTG ATATTTTTAG CCCTTTGTGT A 1128
Pro Phe Leu Trp His
330

TCGCGCTAGG ATTAG 1143

(2) INFORMATION FOR SEQ ID NO: 366: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 89...1087
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

GATTTTGTGA AAAATAGTTT CATTTTTACT GCTTGTATTT CCTTAATGGT GTTATAATCG 60 
CTCCATAAAT CATACAAAAA GGATCGTT ATG TTA GTT ACT CGC TTT AAA AAA 112 

Met Leu Val Thr Arg Phe Lys Lys 
1 5 

GCT TTA ATC TCT TAT TCT TTA GGC GCG CTT CTT GTT TCA TCG TTA TTG 160
Ala Leu Ile Ser Tyr Ser Leu Gly Ala Leu Leu Val Ser Ser Leu Leu
10 15 20

GGC GTG GCT AGT GCT TCC AAT CAA GAA ATC CAA GTC AAA GAT TAT TTT 208
Gly Val Ala Ser Ala Ser Asn Gln Glu Ile Gln Val Lys Asp Tyr Phe
25 30 35 40

GGG GAT CAA GCC ATC AAG CTT CCT GTT TCT AAA ATA ATC TAG TTG GGT 256
Gly Asp Gln Ala Ile Lys Leu Pro Val Ser Lys Ile Ile Tyr Leu Gly
45 50 55

AGC TTT GCA GAA GTG CCT GCT ATG TTC CAT ACT TGG GAT AGG GTC GTG 304
Ser Phe Ala Glu Val Pro Ala Met Phe His Thr Trp Asp Arg Val Val
60 65 70

GGA ATT TCG GAT TAC GCT TTT AAA TCT GAT ATT GTT AAA GCT ACT CTC 352
Gly Ile Ser Asp Tyr Ala Phe Lys Ser Asp Ile Val Lys Ala Thr Leu
75 80 85

AAA GAT CCT AAA CGC ATT AAA TCC ATG AGC AGT GAT CAT GTG GCG GCG 400
Lys Asp Pro Lys Arg Ile Lys Ser Met Ser Ser Asp His Val Ala Ala
90 95 100

TTG AAT GTG GAG CTT TTA AAA AAG CTT GGC CCC GAT CTT GTG GTA ACC 448
Leu Asn Val Glu Leu Leu Lys Lys Leu Gly Pro Asp Leu Val Val Thr
105 110 115 120

TTT GTG GGC AAC CCT AAA GCG GTA GAG CAT GCG AAA AAA TTT GGT ATA 496
Phe Val Gly Asn Pro Lys Ala Val Glu His Ala Lys Lys Phe Gly Ile
125 130 135

TTA TTT CTT TCT TTC CAA GAA AAA ACC ATT GCA GAA GTC ATG GAA GAT 544
Leu Phe Leu Ser Phe Gln Glu Lys Thr Ile Ala Glu Val Met Glu Asp
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20 25 30 

TTT TTG AAT GTT ATA GAG AAT TTC ACT ATC TCT TTG ATT TTG TTT ATA 200
Phe Leu Asn Val Ile Asp Asn Phe Thr Ile Ser Leu Ile Leu Phe Ile
35 40 45 50

TTT TTT AGC CCC ATA CCA AAG AAA TAT TTG ATA AAA AAT AAG AAA AAT 248
Phe Phe Ser Pro Ile Pro Lys Lys Tyr Leu Ile Lys Asn Lys Lys Asn
55 60 65

AGC GTA AAA GAA AAA GAA AAT AAA GAA AAA AGA AAG AGA AAA AGA AAG 296
Ser Val Lys Glu Lys Glu Asn Lys Glu Lys Arg Lys Arg Lys Arg Lys
70 75 80

GAT TTT GTT TTG GGT GTA TTG GAA AAT AGA CTC AAA AAT CAA TTG AAA 344
Asp Phe Val Leu Gly Val Leu Glu Asn Arg Leu Lys Asn Gln Leu Lys
85 90 95

AAC CCC TTT TAGATTAAAA ATAAAAACAA TAAGCGAAAC GACAAAAGCA AGCAGAAAA 402
Asn Pro Phe
100

G AAG 406

(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

Ile Lys Ile Ile Tyr Gln Ile Ile Lys Phe Phe Arg Ser Lys Asn Phe
1 5 10 15

Ile Leu Phe Phe Arg Ile Ala Thr Ile Lys Phe Phe Phe Tyr Leu Ile
20 25 30

Trp Glu Phe Leu Asn Val Ile Asp Asn Phe Thr Ile Ser Leu Ile Leu
35 40 45

Phe Ile Phe Phe Ser Pro Ile Pro Lys Lys Tyr Leu Ile Lys Asn Lys
50 55 60

Lys Asn Ser Val Lys Glu Lys Glu Asn Lys Glu Lys Arg Lys Arg Lys
65 70 75 80

Arg Lys Asp Phe Val Leu Gly Val Leu Glu Asn Arg Leu Lys Asn Gln
85 90 95

Leu Lys Asn Pro Phe
100

(2) INFORMATION FOR SEQ ID NO: 365:
(i) SEQUENCE CHARACTERISTICS: 
595 600 605

Arg Tyr Gly Asp Asn Lys Glu Pro Val Asn Ala Arg Ser Gln Gly Val
610 615 620

Glu Leu Glu Leu Tyr Tyr Thr Pro Ile Arg Gly Leu Asn Phe His Ala
625 630 635 640

Ala Tyr Thr Phe Ile Asp Ala Asn Ile Thr Ser His Thr Met Val Thr
645 650 655

Asn Pro Ala Asn Pro Lys Gly Pro Lys Lys Asp Ile Phe Gly Lys Lys
660 665 670

Leu Pro Phe Val Ser Pro His Gln Phe Ile Leu Asp Ala Ser Tyr Thr
675 680 685

Tyr Ala Lys Thr Thr Ile Gly Leu Ser Ser Phe Phe Tyr Ser Arg Thr
690 695 700

Tyr Ser Asp Val Leu Asn Thr Val Pro Phe Ile Gln Tyr Ala Pro Thr
705 710 715 720

Ile Lys Asn Gly Ala Ile Thr Thr Lys Thr Ala Gly Met Thr Pro Trp
725 730 735

Tyr Trp Val Trp Asn Leu Gln Ile Ser Thr Thr Phe Trp Glu Arg Lys
740 745 750

Lys Gln Ser Val Asn Ala Ser Leu Gln Ile Asn Asn Ile Phe Asn Met
755 760 765

Lys Tyr Trp Phe Ser Gly Ile Gly Thr Ser Leu Thr Gly Lys Lys Pro
770 775 780

Arg Leu Leu Gly Ala Ser Gln Arg Met
785 790



(2) INFORMATION FOR SEQ ID NO: 363:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 406 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...353
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363:

TCGTCTTTTG TCGGTTCAAA AAAATCTCTC GACTTAAAAA AACCTATTAA ATC AAG 56
Ile Lys
1

ATA ATC TAT CAA ATC ATC AAG TTT TTT CGT TCT AAG AAT TTT ATT TTG 104
Ile Ile Tyr Gln Ile Ile Lys Phe Phe Arg Ser Lys Asn Phe Ile Leu
5 10 15

TTT TTT AGA ATA GCA ACG ATA AAG TTC TTC TTT TAT CTC ATT TGG GAA 152
Phe Phe Arg Ile Ala Thr Ile Lys Phe Phe Phe Tyr Leu Ile Trp Glu
Asp Val Ile Lys Gly Gly Thr Ser Val Gln Tyr Gly Pro Asn Thr Phe
165 170 175

Gly Gly Val Val Asn Ile Ile Thr Lys Glu Ile Pro Lys Glu Trp Glu
180 185 190

Asn Gln Ala Ala Glu Arg Ile Thr Phe Trp Gly Arg Ser Ser Asn Gly
195 200 205

Asn Phe Val Asp Pro Lys Glu Lys Gly Lys Pro Leu Ala Gln Thr Leu
210 215 220

Gly Asn Gln Met Leu Phe Asn Thr Tyr Gly Arg Thr Ala Gly Met Leu
225 230 235 240

Gly Lys His Val Gly Ile Ser Ala Gln Gly Asn Trp Ile Asn Gly Gln
245 250 255

Gly Phe Arg Gln Asn Ser Pro Thr Lys Val Gln Asn Tyr Leu Leu Asp
260 265 270

Ala Val Tyr Lys Ile Asn Ala Thr Asn Thr Phe Lys Ala Tyr Tyr Gln
275 280 285

Tyr Tyr Gln Tyr Asn Ser Tyr His Pro Gly Thr Leu Ser Ala Gln Asp
290 295 300

Tyr Ala Tyr Asn Arg Phe Ile Asn Glu Arg Pro Asp Asn Gln Asp Gly
305 310 315 320

Gly Arg Ala Lys Arg Phe Gly Ile Val Tyr Gln Asn Tyr Phe Gly Asp
325 330 335

Pro Asp Arg Lys Val Gly Gly Asp Phe Lys Phe Thr Tyr Phe Thr His
340 345 350

Asp Met Ser Arg Asp Phe Gly Phe Ser Asn Gln Tyr Gln Ser Val Tyr
355 360 365

Met Ser Ser Gln Asn Lys Ile Leu Pro Phe Lys Gly Lys Gly Lys Ile
370 375 380

Ser Ala Thr Asn Pro Asn Cys Gly Leu Tyr Ser Tyr Ser Asp Thr Asn
385 390 395 400

Ser Pro Cys Trp Gln Phe Phe Asp Asn Ile Arg Arg Ser Val Val Asn
405 410 415

Ala Phe Glu Pro Lys Leu Asn Leu Ile Val Asn Thr Gly Lys Val Lys
420 425 430

Gln Thr Phe Asn Met Gly Met Arg Phe Leu Thr Glu Asp Leu Tyr Arg
435 440 445

Arg Ser Thr Thr Arg Lys Asn Pro Ser Met Pro Asn Asn Gly Ser Gly
450 455 460

Phe Asp Ala Gly Thr Ser Leu Asn Asn Phe Asn Asn Tyr Thr Ala Val
465 470 475 480

Tyr Ala Ser Asp Glu Ile Asn Phe Asn Asn Gly Met Leu Thr Ile Thr
485 490 495

Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Glu Lys Lys Asp Ala Pro
500 505 510

Pro Phe Lys Ala Gly Gln Thr Gly Lys Thr Ile Lys Asp Arg Tyr Asn
515 520 525

Gln Trp Asn Pro Ala Val Asn Val Gly Tyr Lys Pro Ile Lys Glu Leu
530 535 540

Leu Phe Tyr Phe Asn Tyr Gln Arg Ser Tyr Ile Pro Pro Gln Phe Ser
545 550 555 560

Asn Ile Gly Ser Phe Val Gly Thr Ser Thr Asp Tyr Phe Gln Ile Phe
565 570 575

Asn Val Met Glu Gly Gly Ser Arg Tyr Tyr Phe Asn Asn Gln Val Ser
580 585 590

Phe Asn Ala Asn Tyr Phe Val Ile Phe Ala Asn Asn Tyr Phe Thr Gly
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ACT ACC AAA ACA GCG GGC ATG ACG CCA TGG TAT TGG GTG TGG AAT TTG 2 6 83 

Thr Thr Lys Thr Ala Gly Met Thr Pro Trp Tyr Trp Val Trp Asn Leu 

730 735 740 

CAA ATT TCT ACC ACT TTT TGG GAA CGC AAA AAG CAA AGC GTT AAT GCG 2731
Gln Ile Ser Thr Thr Phe Trp Glu Arg Lys Lys Gln Ser Val Asn Ala
745 750 755

AGC TTG CAA ATC AAT AAC ATT TTT AAC ATG AAA TAT TGG TTT AGC GGG 2779
Ser Leu Gln Ile Asn Asn Ile Phe Asn Met Lys Tyr Trp Phe Ser Gly
760 765 770

ATA GGC ACT AGC CTA ACG GGA AAG AAG CCG CGC CTC CTA GGA GCA TCA 2827
Ile Gly Thr Ser Leu Thr Gly Lys Lys Pro Arg Leu Leu Gly Ala Ser
775 780 785 790

CAG CGT ATG TGAGCTATCA TTTTTAATTT TAGGGTTGTA ATGTTTTGAG AAGTTGGGC 2885
Gln Arg Met

GTAAA 2890

(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 793 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:

Met Arg Lys Val Ile Thr Met Asn Gly Tyr Leu Arg Val Lys Thr Ser
1 5 10 15

Tyr Phe Leu Ala Leu Asn Ala Leu Thr Phe Leu Ser Phe Asn Ser Leu
20 25 30

Val Gly Ala Lys Glu Gln His His Thr Leu Gln Lys Val Thr Thr Thr
35 40 45

Glu Gln Lys Phe Asn Pro Ser Ala Pro Leu Ser Trp Gln Ser Glu Glu
50 55 60

Met Arg Asn Ser Thr Ser Ser Arg Thr Val Ile Ser Asn Lys Glu Leu
65 70 75 80

Lys Lys Thr Gly Asn Leu Asn Ile Glu Asn Ala Leu Gln Asn Val Pro
85 90 95

Gly Ile Gln Ile Arg Asp Ala Thr Gly Thr Gly Val Leu Pro Lys Ile
100 105 110

Ser Val Arg Gly Phe Gly Gly Gly Gly Asn Gly His Ser Asn Thr Asn
115 120 125

Met Ile Leu Val Asn Gly Ile Pro Ile Tyr Gly Ala Pro Tyr Ser Asn
130 135 140

Ile Glu Leu Ala Ile Phe Pro Val Thr Phe Gln Ser Val Asp Arg Ile
145 150 155 160
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TTT TTA AAT TAG GAA AAA AAA GAG OCT OCT GCT TTT AAA GCA GGC CAA 2 011 

Phe Leu Asn Tyr Glu Lys Lys Asp Ala Pro Pro Phe Lys Ala Gly Gln
505 510 515

ACA GGA AAA ACC ATT AAA GAT CGT TAT AAC CAA TGG AAT CCA GCA GTG 2059
Thr Gly Lys Thr Ile Lys Asp Arg Tyr Asn Gln Trp Asn Pro Ala Val
520 525 530

AAT GTG GGC TAT AAA CCC ATT AAA GAA TTG TTG TTT TAT TTC AAT TAG 2107
Asn Val Gly Tyr Lys Pro Ile Lys Glu Leu Leu Phe Tyr Phe Asn Tyr
535 540 545 550

CAA AGA AGC TAG ATT CCG CGT CAA TTG AGG AAT ATC GGT AGT TTT GTA 2155
Gln Arg Ser Tyr Ile Pro Pro Gln Phe Ser Asn Ile Gly Ser Phe Val
555 560 565

GGC ACA AGC ACG GAT TAT TTT CAA ATC TTT AAT GTG ATG GAA GGC GGC 2203
Gly Thr Ser Thr Asp Tyr Phe Gln Ile Phe Asn Val Met Glu Gly Gly
570 575 580

TCA AGA TAT TAT TTT AAC AAC CAA GTG AGT TTT AAC GCG AAT TAT TTT 2251
Ser Arg Tyr Tyr Phe Asn Asn Gln Val Ser Phe Asn Ala Asn Tyr Phe
585 590 595

GTG ATT TTT GCG AAT AAC TAT TTT ACG GGG GGC TAT GGG GAT AAT AAA 2299
Val Ile Phe Ala Asn Asn Tyr Phe Thr Gly Arg Tyr Gly Asp Asn Lys
600 605 610

GAG CCG GTC AAT GCG AGA TGG CAA GGC GTG GAG GTA GAG TTG TAT TAC 2347
Glu Pro Val Asn Ala Arg Ser Gln Gly Val Glu Leu Glu Leu Tyr Tyr
615 620 625 630

AGG CCG ATT AGA GGG GTT AAT TTC GAT GCG GGT TAC ACT TTC ATA GAT 2395
Thr Pro Ile Arg Gly Leu Asn Phe His Ala Ala Tyr Thr Phe Ile Asp
635 640 645

GGG AAT ATC AGA AGC GAG AGG ATG GTT ACT AAC CCG GCT AAT CGT AAA 2443
Ala Asn Ile Thr Ser His Thr Met Val Thr Asn Pro Ala Asn Pro Lys
650 655 660

GGG CGT AAA AAA GAT ATT TTT GGC AAA AAG GTG CGT TTT GTA AGC CCG 2491
Gly Pro Lys Lys Asp Ile Phe Gly Lys Lys Leu Pro Phe Val Ser Pro
665 670 675

GAG CAA TTC ATT TTA GAC GCG AGC TAC ACT TAC GCT AAA AGC ACG ATT 2539
His Gln Phe Ile Leu Asp Ala Ser Tyr Thr Tyr Ala Lys Thr Thr Ile
680 685 690

GGG TTG AGT TGT TTC TTT TAT AGC CGA ACT TAT AGC GAT GTG TTA AAC 2587
Gly Leu Ser Ser Phe Phe Tyr Ser Arg Thr Tyr Ser Asp Val Leu Asn
695 700 705 710

ACC GTG GCT TTT ATT CAA TAG GCG CCC ACG ATG AAA AAT GGT GGT ATG 2635
Thr Val Pro Phe Ile Gln Tyr Ala Pro Thr Ile Lys Asn Gly Ala Ile
715 720 725
715 720 725 
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GCG ACC AAT ACT TTT AAA GCT TAT TAC CAA TAT TAT CAA TAG AAC TCT 13 3 9 

Ala Thr Asn Thr Phe Lys Ala Tyr Tyr Gln Tyr Tyr Gln Tyr Asn Ser
280 285 290

TAC CAT CCA GGC ACT TTG AGT GCA CAA GAT TAT GCT TAT AAC CGC TTC 1387
Tyr His Pro Gly Thr Leu Ser Ala Gln Asp Tyr Ala Tyr Asn Arg Phe
295 300 305 310

ATT AAT GAG CGC CCT GAC AAT CAA GAT GGA GGG CGA GCC AAG CGC TTT 1435
Ile Asn Glu Arg Pro Asp Asn Gln Asp Gly Gly Arg Ala Lys Arg Phe
315 320 325

GGG ATC GTG TAT CAA AAT TAT TTT GGC GAT CCG GAT AGG AAA GTG GGG 1483
Gly Ile Val Tyr Gln Asn Tyr Phe Gly Asp Pro Asp Arg Lys Val Gly
330 335 340

GGA GAT TTT AAA TTC ACT TAT TTC ACG CAT GAC ATG AGT AGG GAT TTT 1531
Gly Asp Phe Lys Phe Thr Tyr Phe Thr His Asp Met Ser Arg Asp Phe
345 350 355

GGG TTT TCC AAC CAA TAC CAA AGC GTG TAT ATG AGC AGT CAA AAC AAG 1579
Gly Phe Ser Asn Gln Tyr Gln Ser Val Tyr Met Ser Ser Gln Asn Lys
360 365 370

ATT TTA CCT TTT AAA GGC AAG GGA AAA ATT AGC GCG ACT AAC CCT AAT 1627
Ile Leu Pro Phe Lys Gly Lys Gly Lys Ile Ser Ala Thr Asn Pro Asn
375 380 385 390

TGC GGT TTG TAT TCT TAT AGC GAC ACG AAC AGC CCT TGT TGG CAA TTT 1675
Cys Gly Leu Tyr Ser Tyr Ser Asp Thr Asn Ser Pro Cys Trp Gln Phe
395 400 405

TTT GAC AAT ATC CGC CGA TCC GTG GTG AAT GCC TTT GAG CCA AAA CTC 1723
Phe Asp Asn Ile Arg Arg Ser Val Val Asn Ala Phe Glu Pro Lys Leu
410 415 420

AAT CTT ATC GTC AAT ACC GGT AAA GTC AAA CAA ACT TTT AAT ATG GGA 1771
Asn Leu Ile Val Asn Thr Gly Lys Val Lys Gln Thr Phe Asn Met Gly
425 430 435

ATG CGC TTT TTA ACT GAA GAT TTA TAC CGC CGA TCC ACC ACC AGG AAA 1819
Met Arg Phe Leu Thr Glu Asp Leu Tyr Arg Arg Ser Thr Thr Arg Lys
440 445 450

AAC CCT AGC ATG CCT AAT AAT GGG AGT GGT TTT GAT GCA GGA ACT TCA 1867
Asn Pro Ser Met Pro Asn Asn Gly Ser Gly Phe Asp Ala Gly Thr Ser
455 460 465 470

CTC AAT AAT TTC AAC AAT TAT ACC GCT GTG TAT GCC AGC GAT GAG ATC 1915
Leu Asn Asn Phe Asn Asn Tyr Thr Ala Val Tyr Ala Ser Asp Glu Ile
475 480 485

AAT TTC AAT AAC GGC ATG CTA ACG ATC ACG CCG GGC TTG AGA TAC ACT 1963
Asn Phe Asn Asn Gly Met Leu Thr Ile Thr Pro Gly Leu Arg Tyr Thr
490 495 500
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Ser Ala Pro Leu Ser Trp Gin Ser Glu Glu Met Arg Asn Ser Thr Ser 
55 60 65 70 

TCT CGC ACG GTG ATT TCC AAC AAG GAA CTC AAA AAA ACG GGG AAT TTG 715
Ser Arg Thr Val Ile Ser Asn Lys Glu Leu Lys Lys Thr Gly Asn Leu
75 80 85

AAT ATT GAA AAC GCC TTG CAA AAC GTG CCA GGG ATT CAA ATC AGA GAC 763
Asn Ile Glu Asn Ala Leu Gln Asn Val Pro Gly Ile Gln Ile Arg Asp
90 95 100

GCT ACA GGC ACA GGC GTG CTG CCT AAA ATT TCG GTG CGC GGT TTT GGT 811
Ala Thr Gly Thr Gly Val Leu Pro Lys Ile Ser Val Arg Gly Phe Gly
105 110 115

GGG GGC GGT AAC GGG CAT AGC AAT ACC AAC ATG ATT TTA GTC AAT GGT 859
Gly Gly Gly Asn Gly His Ser Asn Thr Asn Met Ile Leu Val Asn Gly
120 125 130

ATC CCC ATT TAT GGC GCG CCG TAT TCC AAT ATT GAA CTG GCG ATT TTC 907
Ile Pro Ile Tyr Gly Ala Pro Tyr Ser Asn Ile Glu Leu Ala Ile Phe
135 140 145 150

CCT GTA ACT TTC CAG TCA GTG GAT AGG ATT GAT GTG ATT AAA GGG GGC 955
Pro Val Thr Phe Gln Ser Val Asp Arg Ile Asp Val Ile Lys Gly Gly
155 160 165

ACG AGC GTG CAA TAC GGC CCT AAT ACT TTT GGA GGC GTG GTG AAT ATC 1003
Thr Ser Val Gln Tyr Gly Pro Asn Thr Phe Gly Gly Val Val Asn Ile
170 175 180

ATC ACT AAA GAA ATC CCT AAA GAG TGG GAA AAT CAA GCG GCT GAA AGG 1051
Ile Thr Lys Glu Ile Pro Lys Glu Trp Glu Asn Gln Ala Ala Glu Arg
185 190 195

ATC ACT TTT TGG GGG CGA TCC TCT AAT GGG AAT TTT GTA GAT CCC AAA 1099
Ile Thr Phe Trp Gly Arg Ser Ser Asn Gly Asn Phe Val Asp Pro Lys
200 205 210

GAA AAA GGC AAG CCT TTA GCC CAA ACT TTA GGA AAC CAA ATG CTG TTT 1147
Glu Lys Gly Lys Pro Leu Ala Gln Thr Leu Gly Asn Gln Met Leu Phe
215 220 225 230

AAC ACT TAC GGG CGA ACG GCT GGA ATG TTG GGT AAG CAT GTA GGA ATT 1195
Asn Thr Tyr Gly Arg Thr Ala Gly Met Leu Gly Lys His Val Gly Ile
235 240 245

AGC GCT CAA GGC AAT TGG ATT AAC GGG CAA GGT TTC AGG CAA AAC AGC 1243
Ser Ala Gln Gly Asn Trp Ile Asn Gly Gln Gly Phe Arg Gln Asn Ser
250 255 260

CCC ACA AAG GTG CAA AAC TAC TTG CTT GAT GCG GTT TAT AAG ATT AAT 1291
Pro Thr Lys Val Gln Asn Tyr Leu Leu Asp Ala Val Tyr Lys Ile Asn
265 270 275
265 270 275 
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1 








5 










10 
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Gly 
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Gly Val 


Gly 


Gly Val 


Gly 
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Gly 
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Val 
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Leu 


Gly 
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Trp 
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Leu 
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50 










55 








60 










Lys 


Thr 


Asn 


Lys 
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Ser 


Gly 


Phe 


Asn 


Leu 


Thr Ser 


Leu 


Gly 


Phe 


Glu 


65 
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75 








80 


Gly 


Lys 


Lys 
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Ser 
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Ser 


Asn 


Lys 


Lys 











85 90 



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2890 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 458...2836
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TCAAAATTTA AGTTAATTTT AATAATTATT TTTATAGTAT GCATCGGTTT GAATTAAATG 60 

AGAAAGGTTA TCACAATGAA TGGTTATTTG AGAGTAAAAA CCTCTTATTT TTTAGCGTTG 120

AACGCTTTGA CTTTTTTGTC TTTTAACTCT TTGGTGGGCG CGAAAGAACA GCATCACACT 180

TTGCAAAAAG TGACAACCAC TGAGCAAAAA TTCAATCCAA GCGCGCCGCT TTCATGGCAA 240

AGCGAAGAGA TGCGTAATTC CACAAGCTCT CGCACGGTGA TTTCCAACAA GGAACTCAAA 300

AAAACGGGGA ATTTGAATAT TGAAAACGCC TTGCAAAACG TGCCAGGGAT TCAAATCAGA 360

GACGCTACAG GCACAGGCGT GCTGCCTAAA ATTTCGGTGC TCAAAATTTA AGTTAATTTT 420

AATAATTATT TTTATAGTAT GCATCGGTTT GAATTAA ATG AGA AAG GTT ATC ACA 475
Met Arg Lys Val Ile Thr
1 5

ATG AAT GGT TAT TTG AGA GTA AAA ACC TCT TAT TTT TTA GCG TTG AAC 523
Met Asn Gly Tyr Leu Arg Val Lys Thr Ser Tyr Phe Leu Ala Leu Asn
10 15 20

GCT TTG ACT TTT TTG TCT TTT AAC TCT TTG GTG GGC GCG AAA GAA CAG 571
Ala Leu Thr Phe Leu Ser Phe Asn Ser Leu Val Gly Ala Lys Glu Gln
25 30 35

CAT CAC ACT TTG CAA AAA GTG ACA ACC ACT GAG CAA AAA TTC AAT CCA 619
His His Thr Leu Gln Lys Val Thr Thr Thr Glu Gln Lys Phe Asn Pro
40 45 50 

AGC GCG CCG CTT TCA TGG CAA AGC GAA GAG ATG CGT AAT TCC ACA AGC 667 
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...323
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:

CTTTTTTCTC TTCCACTTTT TCCACTTTTT TAAGAGCCTT GTGTTTATGA TTG GGC 56 

Leu Gly 
1 

TTT TTG GGT TCA GGT TTG GGT TTT GGT TTA GGC TCA GGC TTA GGC TTT 104 
Phe Leu Gly Ser Gly Leu Gly Phe Gly Leu Gly Ser Gly Leu Gly Phe 
5 10 15 

TCT ATA GGT TTT GGC GGG GTT GGC GGG GTT GGC GGA GTT GGG GGT GTG 152
Ser Ile Gly Phe Gly Gly Val Gly Gly Val Gly Gly Val Gly Gly Val
20 25 30

GGA GGC GTT GGA GGT TTT TGG GGG CCA GCC AGC GTG GGT TTA GGA GCG 200
Gly Gly Val Gly Gly Phe Trp Gly Pro Ala Ser Val Gly Leu Gly Ala
35 40 45 50

CCC TGG GTG TTT TTA CTG GGA TCT TGC GAA TGG CCT CTT TTT AAA ACC 248
Pro Trp Val Phe Leu Leu Gly Ser Cys Glu Trp Pro Leu Phe Lys Thr
55 60 65

AAT AAA TTT TCA GGA TTT AAT TTA ACA AGC TTG GGT TTT GAA GGA AAA 296
Asn Lys Phe Ser Gly Phe Asn Leu Thr Ser Leu Gly Phe Glu Gly Lys
70 75 80

AAA TCT TCT CTG TGT TCA AAT AAA AAA TAAATCAACC AGTGTAACAA TACAGAC 350
Lys Ser Ser Leu Cys Ser Asn Lys Lys
85 90

AGAATGAGAG AAAAGAAAAA ATTCCT 376

(2) INFORMATION FOR SEQ ID NO: 360:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:

Leu Gly Phe Leu Gly Ser Gly Leu Gly Phe Gly Leu Gly Ser Gly Leu 
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TTT AAA GAA ATT CTA GAG GAC TAG AAA TAT CTA GAA AAA GAA TTT AAA 3 99 

Phe Lys Glu Ile Leu Glu Asp Tyr Lys Tyr Leu Glu Lys Glu Phe Lys
95 100 105

GAG CTC TCT AAC ACC GCT GAA AAA GAA GGC GAT AAA GTT ACC GTA ACT 447
Glu Leu Ser Asn Thr Ala Glu Lys Glu Gly Asp Lys Val Thr Val Thr
110 115 120

TAT GCG GAT GAT CAA TTA GCC AAG TTG CAA AAA TCC ATT TGG ATG CTG 495
Tyr Ala Asp Asp Gln Leu Ala Lys Leu Gln Lys Ser Ile Trp Met Leu
125 130 135

CAA GCC CAT TTG GCT TAAGCGACCA AAAAGAAGCC AGCATGAGAG ATTACAGCGA 550
Gln Ala His Leu Ala
140

(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 144 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:

Met Lys Thr Phe Glu Ile Leu Lys His Leu Gln Ala Asp Ala Ile Val
1 5 10 15

Leu Phe Met Lys Val His Asn Phe His Trp Asn Val Lys Gly Thr Asp
20 25 30

Phe Phe Asn Val His Lys Ala Thr Glu Glu Ile Tyr Glu Glu Phe Ala
35 40 45

Asp Met Phe Asp Asp Leu Ala Glu Arg Ile Val Gln Leu Gly His His
50 55 60

Pro Leu Val Thr Leu Ser Glu Ala Ile Lys Leu Thr Arg Val Lys Glu
65 70 75 80

Glu Thr Lys Thr Ser Phe His Ser Lys Asp Ile Phe Lys Glu Ile Leu
85 90 95

Glu Asp Tyr Lys Tyr Leu Glu Lys Glu Phe Lys Glu Leu Ser Asn Thr
100 105 110

Ala Glu Lys Glu Gly Asp Lys Val Thr Val Thr Tyr Ala Asp Asp Gln
115 120 125

Leu Ala Lys Leu Gln Lys Ser Ile Trp Met Leu Gln Ala His Leu Ala
130 135 140

(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
Asn Thr Glu Ser Ala Val Leu Phe Asp Asn Pro Ser Phe Ala Lys Arg
355 360 365

Val Arg Leu Ser Leu Lys Asp His Ala Gln Gln Ser Trp His Leu Val
370 375 380

Val Tyr Arg His Arg Val Ile Trp Glu Ala Val Glu Glu Gly Ile Leu
385 390 395 400

Ile His Glu Lys Thr Ser Pro Asp Thr Ser Phe Phe Leu Arg Leu Ile
405 410 415

Lys Glu Trp Ser Lys Val Leu Pro Glu Arg Glu Leu
420 425


(2) INFORMATION FOR SEQ ID NO: 357:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 550 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 79...510
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:

AACTCTTTTG GTGTAGGATA GCGATCAAGG TTTTTATGAA AATAAAAGCC TAAAACAATT 60

TTAAAAAAAG GACTTTTG ATG AAA ACA TTT GAA ATT CTA AAA CAT TTG CAA 111
Met Lys Thr Phe Glu Ile Leu Lys His Leu Gln
1 5 10

GCG GAT GCG ATC GTG TTA TTT ATG AAA GTG CAT AAC TTC CAT TGG AAT 159
Ala Asp Ala Ile Val Leu Phe Met Lys Val His Asn Phe His Trp Asn
15 20 25

GTG AAA GGC ACC GAT TTT TTC AAT GTG CAT AAA GCC ACT GAA GAA ATT 207
Val Lys Gly Thr Asp Phe Phe Asn Val His Lys Ala Thr Glu Glu Ile
30 35 40

TAT GAA GAG TTT GCG GAC ATG TTT GAC GAT CTC GCT GAA AGG ATC GTT 255
Tyr Glu Glu Phe Ala Asp Met Phe Asp Asp Leu Ala Glu Arg Ile Val
45 50 55

CAA TTA GGG CAT CAC CCC TTA GTC ACT TTA TCC GAA GCG ATC AAA CTC 303
Gln Leu Gly His His Pro Leu Val Thr Leu Ser Glu Ala Ile Lys Leu
60 65 70 75

ACT CGT GTT AAA GAA GAA ACT AAA ACG AGC TTC CAC TCT AAA GAC ATC 351
Thr Arg Val Lys Glu Glu Thr Lys Thr Ser Phe His Ser Lys Asp Ile
80 85 90
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

Met Ser Gln Lys Ser Ile Asp Met Gln Thr Tyr Ile Tyr Lys Asn Asp
1 5 10 15

Leu Ser Ser Gln Val Ile Ala Lys Glu Leu Leu Asn Ala Ala Asn Arg
20 25 30

Gly Val Lys Val Arg Ile Leu Leu Asp Asp Asn Gly Leu Asp Ser Asp
35 40 45

Phe Ser Asp Ile Met Leu Leu Asn Phe His Lys Asn Ile Glu Val Lys
50 55 60

Ile Phe Asn Pro Tyr Tyr Ile Arg Asn Lys Gly Leu Arg Tyr Phe Glu
65 70 75 80

Met Leu Ala Asp Tyr Glu Arg Ile Lys Lys Arg Met His Asn Lys Leu
85 90 95

Phe Ile Val Asp Asn Phe Ala Val Ile Ile Gly Gly Arg Asn Ile Gly
100 105 110

Asp Asn Tyr Phe Asp Asn Asp Leu Asp Thr Asn Phe Leu Asp Leu Asp
115 120 125

Ala Leu Phe Phe Gly Gly Val Ala Ser Lys Ala Lys Glu Ser Phe Glu
130 135 140

Arg Tyr Trp Arg Phe His Arg Ser Ile Pro Val Ser Leu Leu Arg Thr
145 150 155 160

His Lys Arg Leu Lys Asn Asn Ala Lys Glu Ile Ala Lys Leu His Glu
165 170 175

Lys Ile Pro Ile Ser Ala Glu Asp Lys Asn Gln Phe Glu Lys Lys Val
180 185 190

Asn Asp Phe Ile Asp Arg Phe Gln Lys Tyr Gln Tyr Pro Ile Tyr Tyr
195 200 205

Gly Asn Ala Ile Phe Leu Ala Asp Ser Pro Lys Lys Ile Asp Thr Pro
210 215 220

Leu Tyr Ser Pro Ile Lys Ile Ala Phe Glu Lys Ala Leu Lys Asn Ala
225 230 235 240

Lys Asp Ser Val Phe Ile Ala Ser Ser Tyr Phe Ile Pro Gly Lys Lys
245 250 255

Met Met Lys Ile Phe Lys Asn Gln Ile Ser Lys Gly Ile Glu Leu Asn
260 265 270

Ile Leu Thr Asn Ser Leu Ser Ser Thr Asp Ala Ile Val Val Tyr Gly
275 280 285

Ala Trp Glu Arg Tyr Arg Asn Gln Leu Val Arg Met Gly Ala Asn Val
290 295 300

Tyr Glu Ile Arg Asn Asp Phe Phe Asn Arg Gln Ile Lys Gly Arg Phe
305 310 315 320

Ser Thr Lys His Ser Leu His Gly Lys Thr Ile Val Phe Asp Asp Asn
325 330 335

Leu Thr Leu Leu Gly Ser Phe Asn Ile Asp Pro Arg Ser Ala Tyr Ile
340 345 350
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Tyr Ser Pro lie Lys lie Ala Phe Glu Lys Ala Leu Lys Asn Ala Lys 
230 235 240 

GAC TCC GTT TTT ATC GCT TCA TCG TAT TTT ATT CCA GGC AAA AAG ATG 1064 
Asp Ser Val Phe lie Ala Ser Ser Tyr Phe lie Pro Gly Lys Lys Met 
245 250 255 

ATG AAA ATC TTT AAA AAT CAA ATT TCT AAG GGG ATT GAA TTG AAC ATC 1112 
Met Lys lie Phe Lys Asn Gin lie Ser Lys Gly lie Glu Leu Asn lie 
260 265 270 

CTT ACC AAT TCC CTT TCA TCT ACT GAT GCG ATA GTG GTC TAT GGG GCA 1160 
Leu Thr Asn Ser Leu Ser Ser Thr Asp Ala lie Val Val Tyr Gly Ala 
275 280 285 

TGG GAA AGG TAT CGC AAC CAA TTA GTG CGA ATG GGC GCG AAT GTC TAT 1208 
Trp Glu Arg Tyr Arg Asn Gln Leu Val Arg Met Gly Ala Asn Val Tyr 
290 295 300 305 

GAA ATA CGA AAC GAT TTT TTC AAC CGC CAG ATT AAA GGG CGC TTT AGC 1256 
Glu lie Arg Asn Asp Phe Phe Asn Arg Gin lie Lys Gly Arg Phe Ser 
310 315 320 

ACC AAA CAT TCC TTG CAT GGC AAG ACG ATT GTT TTT GAT GAC AAT TTA 1304 
Thr Lys His Ser Leu His Gly Lys Thr lie Val Phe Asp Asp Asn Leu 
325 330 335 

ACG CTT CTA GGG AGT TTC AAT ATT GAT CCG CGC TCT GCA TAC ATC AAC 1352 
Thr Leu Leu Gly Ser Phe Asn Ile Asp Pro Arg Ser Ala Tyr Ile Asn 
340 345 350 

ACT GAA AGC GCG GTT TTG TTT GAC AAC CCG TCT TTT GCT AAA AGG GTG 1400 
Thr Glu Ser Ala Val Leu Phe Asp Asn Pro Ser Phe Ala Lys Arg Val 
355 360 365 

CGT TTG TCG CTT AAA GAT CAT GCC CAA CAA TCA TGG CAT TTG GTG GTG 1448 
Arg Leu Ser Leu Lys Asp His Ala Gln Gln Ser Trp His Leu Val Val 
370 375 380 385 

TAT CGG CAT AGA GTG ATT TGG GAA GCG GTG GAA GAA GGC ATT TTA ATC 1496 
Tyr Arg His Arg Val Ile Trp Glu Ala Val Glu Glu Gly Ile Leu Ile 
390 395 400 

CAT GAA AAA ACT TCG CCT GAC ACT TCC TTC TTT TTG CGC TTG ATT AAA 1544 
His Glu Lys Thr Ser Pro Asp Thr Ser Phe Phe Leu Arg Leu lie Lys 
405 410 415 

GAA TGG TCT AAA GTC CTT CCT GAA AGA GAG CTT TAAAACTTTT AATGCGCTTT 1597 
Glu Trp Ser Lys Val Leu Pro Glu Arg Glu Leu 
420 425 

ATTTTGCGAA AAAGCGATGT TATTGGTAAC GGC 1630 

(2) INFORMATION FOR SEQ ID NO: 356: 
5 10 15 

TCT TCT CAA GTG ATT GCT AAA GAA CTT TTA AAT GCG GCC AAT CGT GGG 392 
Ser Ser Gln Val Ile Ala Lys Glu Leu Leu Asn Ala Ala Asn Arg Gly 
20 25 30 

GTA AAA GTG CGC ATC CTT TTA GAC GAT AAC GGA TTG GAT TCG GAT TTT 440 
Val Lys Val Arg Ile Leu Leu Asp Asp Asn Gly Leu Asp Ser Asp Phe 
35 40 45 

TCA GAT ATT ATG CTC TTA AAT TTC CAT AAA AAC ATT GAG GTG AAA ATT 488 
Ser Asp Ile Met Leu Leu Asn Phe His Lys Asn Ile Glu Val Lys Ile 
50 55 60 65 

TTT AAC CCC TAC TAT ATC CGC AAT AAA GGC TTG CGT TAT TTT GAA ATG 536 
Phe Asn Pro Tyr Tyr Ile Arg Asn Lys Gly Leu Arg Tyr Phe Glu Met 
70 75 80 

CTT GCG GAT TAT GAG CGC ATT AAA AAA CGC ATG CAC AAC AAG CTT TTC 584 
Leu Ala Asp Tyr Glu Arg Ile Lys Lys Arg Met His Asn Lys Leu Phe 
85 90 95 

ATC GTG GAT AAT TTC GCT GTC ATT ATA GGG GGG CGC AAT ATT GGG GAC 632 
He Val Asp Asn Phe Ala Val He He Gly Gly Arg Asn He Gly Asp 
100 105 110 

AAT TAT TTT GAT AAC GAT TTA GAC ACG AAT TTT TTA GAT TTA GAC GCT 680 
Asn Tyr Phe Asp Asn Asp Leu Asp Thr Asn Phe Leu Asp Leu Asp Ala 
115 120 125 

TTG TTT TTT GGG GGG GTT GCT TCA AAA GCC AAA GAA AGC TTT GAA CGC 728 
Leu Phe Phe Gly Gly Val Ala Ser Lys Ala Lys Glu Ser Phe Glu Arg 
130 135 140 145 

TAT TGG AGA TTC CAC CGC TCT ATC CCT GTT TCA TTA CTA AGA ACC CAT 776 
Tyr Trp Arg Phe His Arg Ser Ile Pro Val Ser Leu Leu Arg Thr His 
150 155 160 

AAA AGA CTC AAA AAC AAC GCT AAA GAA ATC GCT AAA CTC CAT GAA AAA 824 
Lys Arg Leu Lys Asn Asn Ala Lys Glu He Ala Lys Leu His Glu Lys 
165 170 175 

ATC CCT ATC AGC GCT GAA GAC AAA AAC CAG TTT GAA AAA AAA GTC AAT 872 
He Pro He Ser Ala Glu Asp Lys Asn Gin Phe Glu Lys Lys Val Asn 
180 185 190 

GAT TTT ATA GAT CGT TTC CAA AAA TAC CAA TAC CCC ATT TAT TAT GGG 920 
Asp Phe Ile Asp Arg Phe Gln Lys Tyr Gln Tyr Pro Ile Tyr Tyr Gly 
195 200 205 

AAT GCC ATT TTT TTA GCC GAT TCA CCC AAA AAA ATT GAC ACG CCC TTG 968 
Asn Ala Ile Phe Leu Ala Asp Ser Pro Lys Lys Ile Asp Thr Pro Leu 
210 215 220 225 

TAT TCG CCT ATC AAA ATC GCT TTT GAG AAA GCC CTT AAA AAC GCT AAG 1016 
Leu Ala Asp Glu Lys Tyr Tyr Arg Ala Leu Ser Cys Ile Arg Cys Gly 
305 310 315 320 

Thr Cys Leu Asn His Cys Pro Val Tyr Asp Lys He Gly Gly His Ala 

325 330 335 

Tyr Leu Ser Thr Tyr Pro Gly Pro He Gly Val Val Val Ser Pro Gin 

340 345 350 

Leu Phe Gly Leu Asn Asn Tyr Gly His He Pro Asn Leu Cys Ser Leu 

355 360 365 

Cys Gly Arg Cys Thr Glu Val Cys Pro Val Glu He Pro Leu Ala Glu 

370 375 380 

Leu He Arg Asp Leu Arg Ser Asp Lys Val Gly Glu Gly Arg Gly Val 
385 390 395 400 

He Lys Gly Ala Lys Ser Thr Gin His Ser Gly Met Glu Lys Phe Ser 

405 410 415 

Met Lys Met Phe Ala Lys Met Ala Ser Asp Gly Ala Lys Trp Arg Phe 

420 425 430 

Gin Leu Lys Met Ala Gin Phe Phe Ser Pro Leu Gly Lys Leu Leu Ala 

435 440 445 

Pro He Leu Pro Leu Val Lys Glu Trp Ala Ser Val Arg Thr Leu Pro 

450 455 460 

Asn Met Asp Thr Ser Leu His Ala Lys Val Gin His Leu Glu Gly Val 
465 470 475 480 

He Tyr Glu 

(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 294... 1577 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:355: 

GTATCGCCAC TTTGCGTAAC AGGATGTTGA AAGTGGGTAA AAGCCGCTAA TTTCTTTTTA 60 
GTGGGTCGTT TTTGAAAATC TTTTTAGTCT TTTTAAGCGT CTTTTTTTTT AATGGGTGTT 120 
TTGGGTTAGT CTATAAGACT CCCATTTCAA GCTCTCCTAT CTCTTATGAT CCCTACACTA 180 
CCCCCATTGG GAGCTTGTAT GCTGAAAAAT TAAAAGAAAA CCCTAACCAT AGCGCGGCCA 240 
TTCTTTTAGA AGATGGCTTT GACGCTCTGT TGCATAGAGT GGGACTTATT AGA ATG 296 
Met 
1 

AGC CAA AAA AGC ATT GAC ATG CAA ACT TAT ATC TAT AAA AAC GAC CTT 344 
Ser Gln Lys Ser Ile Asp Met Gln Thr Tyr Ile Tyr Lys Asn Asp Leu 
GAG TAAAGAGCTT ATTTTAAAGC GCATTAT^GA AGCCAGAGCC AAGCATGCCA TTCAGG 1555 
Glu 



(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:354: 

Met Ile Met Glu Lys Tyr His Ser Asp Gln Glu Tyr Glu Glu Ile Ile 
1 5 10 15 

Thr Asp Gin Leu Gly Asp Met Gin Leu Arg Glu Asn Leu Arg Ser Ala 

20 25 30 

Met Asp Thr Leu Arg Ala Asn Arg Lys Asn Leu Leu Lys Asn Arg Tyr 

35 40 45 

Ser Glu Trp Glu Asn Leu Arg Glu Leu Gly Lys Glu Val Lys Leu Lys 

50 55 60 

lie Leu Ser Arg Leu Asp Glu Tyr Leu Glu Leu Phe Glu Lys Asn Ala 
65 70 75 80 

Thr Gin Asn Gly Phe Lys lie His Tyr Ala Lys Asp Gly Asp Glu Ala 

85 90 95 

Asn Glu lie lie Tyr Asn Leu Ala Lys Glu Lys Asn lie Lys Arg lie 

100 105 110 

Leu Lys Gin Lys Ser Met Ala Ser Glu Glu lie Gly Leu Asn His Tyr 

115 120 125 

Leu Lys Glu Lys Gly lie Gin Ala Gin Glu Thr Asp Leu Gly Glu Leu 

130 135 140 

lie lie Gin Leu lie Asn Glu His Pro Val His He Val Val Pro Ala 
145 150 155 160 

He His Lys Asn Arg Lys Gin He Gly Lys He Phe Glu Glu Lys Leu 

165 170 175 

Asn Ala Ala Tyr Glu Glu Glu Pro Glu Lys Leu Asn Ala He Ala Arg 

180 185 190 

Lys His Met Arg Lys Glu Phe Glu Ser Phe Lys Met Gly He Ser Gly 

195 200 205 

Val Asn Phe Ala He Ala Asn Glu Gly Ala He Trp Leu Val Glu Asn 

210 215 220 

Glu Gly Asn Gly Arg Met Ser Thr Thr Ala Cys Asp Val His Val Ala 
225 230 235 240 

He Cys Gly He Glu Lys Leu Val Glu Ser Phe Asp Asp Ala Ala He 

245 250 255 

Leu Asn Asn Leu Leu Ala Pro Ser Ala Val Gly Val Pro He Thr Cys 

260 265 270 

Tyr Gin Asn He He Thr Gly Pro Arg Lys Glu Gly Asp Leu Asp Gly 

275 280 285 

Pro Lys Glu Ala His He He Leu Leu Asp Asn Asn Arg Ser Asn He 
290 295 300 
AAT TTG CTC GCC CCA AGC GCT GTG GGT GTG CCT ATC ACT TGC TAT CAA 872 
Asn Leu Leu Ala Pro Ser Ala Val Gly Val Pro Ile Thr Cys Tyr Gln 
260 265 270 

AAC ATT ATC ACA GGC CCT AGA AAA GAG GGC GAT TTA GAC GGC CCT AAA 920 
Asn Ile Ile Thr Gly Pro Arg Lys Glu Gly Asp Leu Asp Gly Pro Lys 
275 280 285 290 

GAA GCC CAC ATC ATT TTA TTA GAC AAC AAC CGC TCT AAT ATT TTG GCT 968 
Glu Ala His lie lie Leu Leu Asp Asn Asn Arg Ser Asn lie Leu Ala 
295 300 305 

GAT GAA AAG TAT TAT CGC GCT CTT TCA TGC ATC CGT TGC GGG ACT TGT 1016 
Asp Glu Lys Tyr Tyr Arg Ala Leu Ser Cys lie Arg Cys Gly Thr Cys 
310 315 320 

TTG AAC CAC TGC CCT GTG TAT GAT AAA ATC GGT GGG CAT GCC TAT CTT 1064 
Leu Asn His Cys Pro Val Tyr Asp Lys lie Gly Gly His Ala Tyr Leu 
325 330 335 

TCT ACT TAT CCT GGC CCT ATA GGC GTG GTG GTA TCC CCC CAA CTC TTT 1112 
Ser Thr Tyr Pro Gly Pro lie Gly Val Val Val Ser Pro Gin Leu Phe 
340 345 350 

GGC TTG AAT AAT TAC GGG CAT ATC CCT AAT TTG TGC AGT CTT TGC GGG 1160 
Gly Leu Asn Asn Tyr Gly His Ile Pro Asn Leu Cys Ser Leu Cys Gly 
355 360 365 370 

CGT TGC ACT GAA GTA TGC CCC GTA GAA ATC CCT TTA GCC GAA CTC ATT 1208 
Arg Cys Thr Glu Val Cys Pro Val Glu Ile Pro Leu Ala Glu Leu Ile 
375 380 385 

AGG GAT TTA CGA TCC GAT AAA GTG GGC GAG GGC AGG GGT GTA ATT AAG 1256 
Arg Asp Leu Arg Ser Asp Lys Val Gly Glu Gly Arg Gly Val Ile Lys 
390 395 400 

GGG GCT AAA AGC ACC CAA CAC AGC GGG ATG GAA AAA TTC TCT ATG AAA 1304 
Gly Ala Lys Ser Thr Gln His Ser Gly Met Glu Lys Phe Ser Met Lys 
405 410 415 

ATG TTT GCC AAA ATG GCA AGC GAT GGG GCT AAG TGG CGT TTC CAA TTG 1352 
Met Phe Ala Lys Met Ala Ser Asp Gly Ala Lys Trp Arg Phe Gln Leu 
420 425 430 

AAA ATG GCT CAA TTT TTC TCG CCT TTA GGC AAG CTT TTA GCT CCC ATA 1400 
Lys Met Ala Gln Phe Phe Ser Pro Leu Gly Lys Leu Leu Ala Pro Ile 
435 440 445 450 

CTG CCT TTA GTC AAA GAG TGG GCG AGC GTT AGG ACC TTA CCC AAT ATG 1448 
Leu Pro Leu Val Lys Glu Trp Ala Ser Val Arg Thr Leu Pro Asn Met 
455 460 465 

GAC ACG AGC TTG CAT GCA AAA GTC CAG CAC TTA GAA GGG GTG ATT TAT 1496 
Asp Thr Ser Leu His Ala Lys Val Gln His Leu Glu Gly Val Ile Tyr 
470 475 480 
Thr Leu Arg Ala Asn Arg Lys Asn Leu Leu Lys Asn Arg Tyr Ser Glu 
35 40 45 50 

TGG GAA AAT TTA AGG GAA TTA GGC AAA GAA GTC AAG CTT AAA ATC TTA 248 
Trp Glu Asn Leu Arg Glu Leu Gly Lys Glu Val Lys Leu Lys Ile Leu 
55 60 65 

TCC AGG CTT GAT GAA TAT TTG GAA TTG TTT GAA AAA AAC GCC ACT CAA 296 
Ser Arg Leu Asp Glu Tyr Leu Glu Leu Phe Glu Lys Asn Ala Thr Gin 
70 75 80 

AAC GGC TTT AAA ATC CAT TAC GCT AAA GAC GGC GAT GAA GCT AAT GAA 344 
Asn Gly Phe Lys Ile His Tyr Ala Lys Asp Gly Asp Glu Ala Asn Glu 
85 90 95 

ATC ATT TAC AAC CTC GCT AAA GAA AAG AAT ATC AAG CGC ATT TTA AAG 392 
Ile Ile Tyr Asn Leu Ala Lys Glu Lys Asn Ile Lys Arg Ile Leu Lys 
100 105 110 

CAA AAA TCC ATG GCG AGC GAA GAA ATT GGC TTG AAC CAT TAC TTG AAA 440 
Gln Lys Ser Met Ala Ser Glu Glu Ile Gly Leu Asn His Tyr Leu Lys 
115 120 125 130 

GAA AAG GGC ATT CAA GCA CAA GAA ACG GAT TTG GGC GAA TTG ATT ATC 488 
Glu Lys Gly Ile Gln Ala Gln Glu Thr Asp Leu Gly Glu Leu Ile Ile 
135 140 145 

CAA CTC ATC AAT GAA CAC CCT GTG CAT ATT GTC GTG CCA GCT ATC CAT 536 
Gln Leu Ile Asn Glu His Pro Val His Ile Val Val Pro Ala Ile His 
150 155 160 

AAA AAC CGC AAG CAA ATC GGT AAG ATT TTT GAA GAA AAA CTC AAC GCC 584 
Lys Asn Arg Lys Gin lie Gly Lys lie Phe Glu Glu Lys Leu Asn Ala 
165 170 175 

GCT TAT GAA GAA GAG CCT GAA AAG CTT AAT GCG ATC GCC AGA AAA CAC 632 
Ala Tyr Glu Glu Glu Pro Glu Lys Leu Asn Ala lie Ala Arg Lys His 
180 185 190 

ATG CGC AAA GAA TTT GAA AGC TTT AAA ATG GGG ATT AGT GGG GTT AAT 680 
Met Arg Lys Glu Phe Glu Ser Phe Lys Met Gly Ile Ser Gly Val Asn 
195 200 205 210 

TTT GCT ATC GCT AAC GAA GGA GCG ATC TGG TTA GTG GAA AAT GAA GGC 728 
Phe Ala Ile Ala Asn Glu Gly Ala Ile Trp Leu Val Glu Asn Glu Gly 
215 220 225 

AAT GGC AGA ATG AGC ACC ACT GCA TGC GAT GTG CAT GTC GCA ATT TGT 776 
Asn Gly Arg Met Ser Thr Thr Ala Cys Asp Val His Val Ala Ile Cys 
230 235 240 

GGG ATT GAA AAA TTA GTA GAA AGC TTT GAT GAT GCG GCG ATT TTA AAC 824 
Gly lie Glu Lys Leu Val Glu Ser Phe Asp Asp Ala Ala lie Leu Asn 
245 250 255 
115 120 125 

Ser Leu Pro Leu Phe Asn Trp Leu Tyr Lys Gly Ser Asp Phe Gly Ala 
130 135 140 

Leu His Glu Gln Phe Gly Asp Met Tyr Asp Gly Tyr Ile Lys Tyr Leu 
145 150 155 160 

Ile Ser Met Val Arg Ile Ser Gln Lys Glu Lys Ala Arg Lys Val Asp 
165 170 175 

Ala Ile Val Leu Lys Lys Met Glu Glu Gln Ala Glu Lys Asp Thr Lys 
180 185 190 

Ala Ala Phe Gln Lys Arg Ser Ser Gly Glu Leu Glu Ser His Thr Asp 
195 200 205 

Ser Pro Glu Phe Ile Ser Ser Ser Lys Arg Thr Gln Asn Ala Ser Asn 
210 215 220 

Ser Asp Leu Asn Ser Met Thr Asn Ala Asn Ala Leu Lys Glu Thr Ala 
225 230 235 240 

Ser Lys Glu Pro Glu Ala Ser Ser Lys Lys Glu Lys Lys Ser Lys Lys 
245 250 255 

Lys Arg Arg Leu Ser Lys Lys Glu Lys Gln Gln Gln Ala Leu Gln Gln 
260 265 270 

Glu Phe Glu Lys Gln Ile Ser Asp Ser Ser Lys Ser Glu Lys 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 353: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1499 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353: 

CTTTTTAGCC TCAAGACTTG GGCTTTAACA TTAAAGAATT ATTTTAAGGA ATG ATC 56 

Met Ile 
1 

ATG GAA AAA TAC CAT AGC GAC CAA GAA TAC GAA GAA ATC ATC ACC GAC 104 
Met Glu Lys Tyr His Ser Asp Gln Glu Tyr Glu Glu Ile Ile Thr Asp 
5 10 15 

CAA TTA GGC GAT ATG CAA TTA AGG GAA AAT TTG CGT TCT GCA ATG GAT 152 
Gin Leu Gly Asp Met Gin Leu Arg Glu Asn Leu Arg Ser Ala Met Asp 
20 25 30 

ACC TTA AGG GCT AAT CGT AAG AAT CTC CTT AAA AAT CGT TAC AGC GAA 200 
TTT CAA AAG AGG AGC ACT GGG GAG CTT GAA AGC CAT ACT GAT AGC CCT 680 
Phe Gln Lys Arg Ser Ser Gly Glu Leu Glu Ser His Thr Asp Ser Pro 
195 200 205 210 

GAA TTT ATA AGC TCT TCT AAG AGG ACA CAG AAC GCT TCT AAT TCG GAT 728 
Glu Phe Ile Ser Ser Ser Lys Arg Thr Gln Asn Ala Ser Asn Ser Asp 
215 220 225 

CTC AAT TCT ATG ACC AAT GCT AAC GCG CTC AAA GAA ACA GCT TCA AAA 776 
Leu Asn Ser Met Thr Asn Ala Asn Ala Leu Lys Glu Thr Ala Ser Lys 
230 235 240 

GAG CCA GAG GCT TCT TCA AAA AAA GAG AAA AAG TCT AAG AAA AAA CGT 824 
Glu Pro Glu Ala Ser Ser Lys Lys Glu Lys Lys Ser Lys Lys Lys Arg 
245 250 255 

CGC CTT TCA AAG AAA GAA AAA CAA CAA CAA GCC TTG CAA CAA GAG TTT 872 
Arg Leu Ser Lys Lys Glu Lys Gin Gin Gin Ala Leu Gin Gin Glu Phe 
260 265 270 

GAA AAA CAA ATT AGC GAC TCT AGT AAG TCT GAA AAA TAGTAATAAT AGTTAA 924 
Glu Lys Gin lie Ser Asp Ser Ser Lys Ser Glu Lys 
275 280 285 

GCTTACCTTT TTAGGGGGCT TTCAATAAAT CTCTTAA 961 
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

Val Lys Arg Ile Leu Phe Phe Leu Val Ala Thr Thr Phe Leu Leu Arg 
1 5 10 15 

Ala Glu Thr Asp Ser Ala Thr lie Asn Thr Thr Val Asp Pro Asn Val 

20 25 30 

Met Phe Ser Glu Ser Ser Thr Gly Asn Val Lys Lys Asp Arg Lys Arg 

35 40 45 

Val Leu Lys Ser Met Val Asn Leu Glu Lys Glu Arg Val Lys Asn Phe 

50 55 60 

Asn Arg Tyr Ser Glu Thr Lys Met Ser Lys Gly Asp Leu Ser Ala Phe 

65 70 75 80 

Gly Ala Phe Phe Lys Gly Ser Leu Glu Ser Cys Val Asp Gin Lys lie 

85 90 95 

Cys Tyr Tyr Glu His Lys Asp Gly Lys Val Ser Phe Val Val Asn Asp 

100 105 110 

Arg Glu Lys Phe Tyr Lys His Val Leu Lys Asp Leu Gly Thr Glu Leu 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

AGAACAGTAT CTATTTTTGT TGCGGTTGTA TATTTAATTA GGAGTTTGGT GTG AAA 56 

Val Lys 
1 

CGG ATT TTA TTT TTT TTA GTA GCT ACG ACT TTT TTG TTG AGA GCA GAA 104 
Arg lie Leu Phe Phe Leu Val Ala Thr Thr Phe Leu Leu Arg Ala Glu 
5 10 15 

ACG GAT TCT GCC ACT ATT AAC ACT ACA GTT GAT CCC AAT GTT ATG TTT 152 
Thr Asp Ser Ala Thr lie Asn Thr Thr Val Asp Pro Asn Val Met Phe 
20 25 30 

TCT GAA AGC TCC ACA GGG AAT GTG AAA AAA GAC CGC AAG AGG GTT TTA 200 
Ser Glu Ser Ser Thr Gly Asn Val Lys Lys Asp Arg Lys Arg Val Leu 
35 40 45 50 

AAG AGC ATG GTT AAT TTG GAA AAA GAG CGC GTG AAG AAT TTT AAC CGG 248 
Lys Ser Met Val Asn Leu Glu Lys Glu Arg Val Lys Asn Phe Asn Arg 
55 60 65 

TAT TCT GAA ACC AAG ATG AGT AAG GGC GAC TTA TCC GCT TTT GGA GCT 296 
Tyr Ser Glu Thr Lys Met Ser Lys Gly Asp Leu Ser Ala Phe Gly Ala 
70 75 80 

TTC TTT AAG GGG AGT TTG GAA AGT TGT GTG GAT CAA AAG ATT TGT TAT 344 
Phe Phe Lys Gly Ser Leu Glu Ser Cys Val Asp Gln Lys Ile Cys Tyr 
85 90 95 

TAT GAG CAT AAA GAT GGC AAG GTT TCT TTT GTG GTG AAT GAC AGG GAG 392 
Tyr Glu His Lys Asp Gly Lys Val Ser Phe Val Val Asn Asp Arg Glu 
100 105 110 

AAG TTT TAT AAA CAT GTG CTT AAA GAC TTA GGG ACA GAG CTT TCG CTC 440 
Lys Phe Tyr Lys His Val Leu Lys Asp Leu Gly Thr Glu Leu Ser Leu 
115 120 125 130 

CCT TTG TTT AAC TGG CTT TAC AAA GGC TCG GAT TTT GGG GCT TTG CAT 488 
Pro Leu Phe Asn Trp Leu Tyr Lys Gly Ser Asp Phe Gly Ala Leu His 
135 140 145 

GAG CAG TTT GGG GAT ATG TAT GAT GGG TAT ATC AAA TAC TTG ATC AGT 536 
Glu Gin Phe Gly Asp Met Tyr Asp Gly Tyr lie Lys Tyr Leu lie Ser 
150 155 160 

ATG GTT AGA ATA AGC CAA AAA GAA AAG GCT AGA AAA GTG GAT GCA ATC 584 
Met Val Arg Ile Ser Gln Lys Glu Lys Ala Arg Lys Val Asp Ala Ile 
165 170 175 

GTT CTT AAG AAA ATG GAA GAA CAA GCT GAG AAA GAC ACT AAG GCA GCG 632 
Val Leu Lys Lys Met Glu Glu Gin Ala Glu Lys Asp Thr Lys Ala Ala 
180 185 190 
Ala Gly Gly Ala Lys Arg Trp Ile Arg Leu Gly Phe Phe Ser Leu Ala 
100 105 110 

Pro Leu Glu Phe Leu Lys Ile Gly Phe Thr Phe Phe Leu Ala Trp Ser 
115 120 125 

Leu Ser Arg Thr Phe Val Ala Lys Glu Lys Ala Asn Val Lys Glu Glu 
130 135 140 

Leu Ile Thr Phe Val Pro Tyr Ser Val Val Phe Val Ala Leu Ala Ile 
145 150 155 160 

Gly Val Gly Val Leu Gln Asn Asp Leu Gly Gln Ile Val Leu Leu Gly 
165 170 175 

Ala Val Leu Ala Val Leu Leu Val Phe Ser Gly Gly Ser Val His Leu 
180 185 190 

Phe Gly Leu Ile Ile Ser Gly Ala Phe Ala Ile Ser Val Leu Ala Ile 
195 200 205 

Val Thr Ser Glu His Arg Ile Leu Arg Leu Lys Leu Trp Trp Ser Asn 
210 215 220 

Leu Gln Asn Ser Leu Phe Thr Leu Leu Pro Asp Arg Leu Ala Asn Ala 
225 230 235 240 

Leu Arg Ile Ser Asp Leu Pro Glu Ser Tyr Gln Val Phe His Ala Gly 
245 250 255 

Asn Ala Met His Asn Gly Gly Leu Phe Gly Gln Gly Leu Gly Leu Gly 
260 265 270 

Gln Ile Lys Leu Gly Phe Leu Ser Glu Val His Thr Asp Met Val Leu 
275 280 285 

Ala Gly Ile Ala Glu Glu Trp Gly Phe Leu Gly Leu Cys Val Cys Phe 
290 295 300 

Ile Leu Phe Ser Val Leu Ile Val Leu Ile Phe Arg Ile Ala Asn Arg 
305 310 315 320 

Leu Lys Glu Pro Lys Tyr Ser Leu Phe Cys Val Gly Val Val Leu Leu 
325 330 335 

Ile Ser Phe Ser Leu Val Ile Asn Ala Phe Gly Val Gly Gly Ile Leu 
340 345 350 

Pro Val Lys Gly Leu Ala Val Pro Phe Leu Ser Tyr Gly Gly Ser Ser 
355 360 365 

Leu Leu Ala Asn Cys Ile Ala Ile Gly Leu Val Leu Ser Leu Ala Arg 
370 375 380 

Tyr Thr Lys Gly 
385 


(2) INFORMATION FOR SEQ ID NO: 351: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 908 
(D) OTHER INFORMATION: 
GCT GGG ATC GCC GAA GAA TGG GGG TTT TTG GGG CTA TGC GTT TGT TTT 972 
Ala Gly Ile Ala Glu Glu Trp Gly Phe Leu Gly Leu Cys Val Cys Phe 
290 295 300 

ATT TTG TTT TCT GTT TTG ATT GTT TTG ATT TTT AGG ATC GCT AAC CGC 1020 
Ile Leu Phe Ser Val Leu Ile Val Leu Ile Phe Arg Ile Ala Asn Arg 
305 310 315 320 

TTG AAA GAG CCA AAA TAT TCG CTA TTT TGC GTG GGC GTG GTG CTG CTT 1068 
Leu Lys Glu Pro Lys Tyr Ser Leu Phe Cys Val Gly Val Val Leu Leu 
325 330 335 

ATT AGT TTT TCT TTG GTG ATC AAC GCC TTT GGG GTG GGC GGG ATT CTT 1116 
lie Ser Phe Ser Leu Val lie Asn Ala Phe Gly Val Gly Gly lie Leu 
340 345 350 

CCG GTT AAA GGT CTA GCG GTG CCG TTT TTG AGC TAT GGA GGG AGT TCG 1164 
Pro Val Lys Gly Leu Ala Val Pro Phe Leu Ser Tyr Gly Gly Ser Ser 
355 360 365 

CTT CTA GCG AAT TGT ATC GCT ATA GGG CTT GTT CTA AGC CTA GCG CGA 1212 
Leu Leu Ala Asn Cys lie Ala lie Gly Leu Val Leu Ser Leu Ala Arg 
370 375 380 

TAC ACG AAA GGC TAAAAACATC AACCCCTTTT TAAAAATTAA TGCCATAAAA AGGGC 1269 
Tyr Thr Lys Gly 
385 

TCAACCTC 1277 

(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

Met Thr Thr Asp Arg Asn Leu Phe Phe Cys Ala Ser Leu Leu Ile Phe 
1 5 10 15 

Leu Gly Val Leu Met Ser Tyr Ser Leu Ser Thr Tyr Thr Thr Val Val 

20 25 30 

Leu Tyr His Tyr Gly Glu Phe His Phe Phe lie Arg Gin Leu Val Ser 

35 40 45 

Ala lie lie Gly lie Val lie Met Trp Gly Leu Ser Arg Val Asp Pro 

50 55 60 

Ser Lys Trp Phe Ser Arg Leu Gly Phe Phe Leu Leu Phe Val Pro Pro 
65 70 75 80 

Leu Leu lie lie Gly Met Phe Phe Leu Pro Glu Ser Leu Ser Ser Ser 
85 90 95 
AGC AAG TGG TTT AGC CGT TTG GGG TTT TTT CTT CTT TTT GTC CCA CCA 300 
Ser Lys Trp Phe Ser Arg Leu Gly Phe Phe Leu Leu Phe Val Pro Pro 
65 70 75 80 

TTA CTC ATT ATT GGC ATG TTT TTT TTG CCA GAA AGC CTT TCT AGC AGT 348 
Leu Leu lie lie Gly Met Phe Phe Leu Pro Glu Ser Leu Ser Ser Ser 
85 90 95 

GCT GGG GGG GCG AAG CGA TGG ATT CGT TTG GGG TTT TTT TCT CTA GCG 396 
Ala Gly Gly Ala Lys Arg Trp Ile Arg Leu Gly Phe Phe Ser Leu Ala 
100 105 110 

CCT TTG GAG TTT TTG AAG ATT GGT TTC ACC TTT TTT CTT GCG TGG AGT 444 
Pro Leu Glu Phe Leu Lys lie Gly Phe Thr Phe Phe Leu Ala Trp Ser 
115 120 125 

TTG TCT CGC ACT TTT GTG GCA AAA GAA AAG GCT AAT GTT AAA GAA GAA 492 
Leu Ser Arg Thr Phe Val Ala Lys Glu Lys Ala Asn Val Lys Glu Glu 
130 135 140 

CTC ATC ACT TTT GTG CCT TAT TCA GTG GTG TTT GTA GCC TTA GCG ATT 540 
Leu Ile Thr Phe Val Pro Tyr Ser Val Val Phe Val Ala Leu Ala Ile 
145 150 155 160 

GGG GTG GGG GTT TTG CAA AAC GAT TTG GGG CAG ATT GTT CTT TTG GGG 588 
Gly Val Gly Val Leu Gin Asn Asp Leu Gly Gin He Val Leu Leu Gly 
165 170 175 

GCG GTT TTA GCG GTG TTG TTG GTT TTT TCT GGG GGG AGC GTG CAT TTG 636 
Ala Val Leu Ala Val Leu Leu Val Phe Ser Gly Gly Ser Val His Leu 
180 185 190 

TTT GGC TTG ATT ATT TCA GGG GCG TTT GCG ATC AGC GTT TTA GCG ATT 684 
Phe Gly Leu He He Ser Gly Ala Phe Ala He Ser Val Leu Ala He 
195 200 205 

GTT ACA AGC GAG CAT AGG ATT TTG CGC CTG AAA TTG TGG TGG TCT AAT 732 
Val Thr Ser Glu His Arg He Leu Arg Leu Lys Leu Trp Trp Ser Asn 
210 215 220 

TTG CAA AAT TCG CTT TTC ACG CTC TTG CCG GAT AGA TTA GCG AAC GCT 780 
Leu Gin Asn Ser Leu Phe Thr Leu Leu Pro Asp Arg Leu Ala Asn Ala 
225 230 235 240 

CTT AGA ATA AGC GAC TTG CCC GAA TCC TAT CAG GTC TTT CAT GCA GGC 828 
Leu Arg Ile Ser Asp Leu Pro Glu Ser Tyr Gln Val Phe His Ala Gly 
245 250 255 

AAT GCC ATG CAT AAT GGG GGG TTG TTT GGG CAA GGG CTT GGG CTT GGG 876 
Asn Ala Met His Asn Gly Gly Leu Phe Gly Gin Gly Leu Gly Leu Gly 
260 265 270 

CAA ATC AAG CTT GGG TTT TTG AGC GAA GTG CAT ACG GAC ATG GTC TTA 924 
Gin He Lys Leu Gly Phe Leu Ser Glu Val His Thr Asp Met Val Leu 
275 280 285 
35 40 45 

Glu Phe Lys Asp Leu Met Asn Arg Leu Val Val Gln Arg Phe Gln Ser 
50 55 60 

Arg Leu Ala Ser Glu Lys Asp Ala Asp Ser Ile Ile Ile Ile Glu Ile 
65 70 75 80 

Thr Asn Val Thr Asp Thr Ser Ile Thr Gln Asn Lys Glu Gly Phe Thr 
85 90 95 

Thr Phe Tyr Arg Ala Thr Val Ser Val Asn Tyr Thr Tyr Asp Asn Lys 
100 105 110 

Arg Gly Thr Gln Lys Thr Phe Gln Asp Ser Gly Tyr Tyr Asn Tyr Ala 
115 120 125 

Val Asn Leu Gln Asp Pro Leu Asn Thr Tyr Gln Asn Arg Tyr Tyr Ala 
130 135 140 

Ile Asn Gln Ala Val Glu Gln Thr Leu Thr Lys Phe Val Ala Gln Ile 
145 150 155 160 

Ala Tyr Glu Gly Lys Phe Asn Asn Glu Lys 
165 170 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 61... 1224 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

AATACCCATA AAATACCTTA AGAGAACGCC TATTCAAAAA CCAAAAATAA GGAAATCCTA 60 

ATG ACT ACA GAC AGA AAT TTG TTT TTT TGC GCT TCG CTA TTG ATT TTT 108 
Met Thr Thr Asp Arg Asn Leu Phe Phe Cys Ala Ser Leu Leu Ile Phe 
1 5 10 15 

TTG GGG GTA TTG ATG AGC TAT TCG CTC TCA ACT TAC ACC ACA GTG GTG 156 
Leu Gly Val Leu Met Ser Tyr Ser Leu Ser Thr Tyr Thr Thr Val Val 
20 25 30 

CTG TAT CAT TAT GGG GAG TTC CAT TTT TTC ATA CGC CAG CTT GTG AGC 204 
Leu Tyr His Tyr Gly Glu Phe His Phe Phe Ile Arg Gln Leu Val Ser 
35 40 45 

GCG ATC ATA GGG ATT GTT ATC ATG TGG GGG TTG TCT AGG GTT GAT CCT 252 
Ala He He Gly He Val He Met Trp Gly Leu Ser Arg Val Asp Pro 
50 55 60 
Val Tyr Val Lys Leu Ile Val Asn Leu Pro Asn Pro Glu Asn Ser Val 
35 40 45 

GAG TTT AAG GAT TTG ATG AAT CGT TTA GTC GTG CAA CGC TTC CAA AGC 253 
Glu Phe Lys Asp Leu Met Asn Arg Leu Val Val Gln Arg Phe Gln Ser 
50 55 60 

CGC TTA GCG AGT GAA AAG GAT GCG GAT TCT ATC ATT ATT ATA GAA ATC 301 
Arg Leu Ala Ser Glu Lys Asp Ala Asp Ser Ile Ile Ile Ile Glu Ile 
65 70 75 80 

ACG AAT GTA ACC GAT ACG AGT ATC ACG CAA AAT AAA GAA GGC TTC ACG 349 
Thr Asn Val Thr Asp Thr Ser Ile Thr Gln Asn Lys Glu Gly Phe Thr 
85 90 95 

ACT TTC TAT CGC GCA ACC GTG TCT GTG AAT TAC ACC TAC GAT AAT AAA 397 
Thr Phe Tyr Arg Ala Thr Val Ser Val Asn Tyr Thr Tyr Asp Asn Lys 
100 105 110 

AGA GGC ACA CAA AAG ACT TTT CAA GAT AGC GGG TAT TAC AAT TAC GCT 445 
Arg Gly Thr Gln Lys Thr Phe Gln Asp Ser Gly Tyr Tyr Asn Tyr Ala 
115 120 125 

GTG AAT TTG CAA GAC CCC CTT AAT ACC TAC CAG AAC CGC TAT TAT GCT 493 
Val Asn Leu Gln Asp Pro Leu Asn Thr Tyr Gln Asn Arg Tyr Tyr Ala 
130 135 140 

ATC AAT CAG GCT GTG GAA CAG ACT TTG ACT AAA TTT GTG GCT CAA ATC 541 
Ile Asn Gln Ala Val Glu Gln Thr Leu Thr Lys Phe Val Ala Gln Ile 
145 150 155 160 

GCT TAT GAG GGG AAA TTC AAT AAT GAA AAA TAGCCCTTTG AATGGATTGA ATG 594 
Ala Tyr Glu Gly Lys Phe Asn Asn Glu Lys 
165 170 

GACTAAAGGC GTTTTTAGAA ACAA 618 

(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

Met Arg Ala Leu Leu Phe Phe Ile Leu Leu Leu Trp Phe Lys Gly Cys 
1 5 10 15 

Gly Tyr Lys Pro Ile Ala Ala Tyr Ala Gln Asn Ala Leu Gly Asp Ser 
20 25 30 

Val Tyr Val Lys Leu Ile Val Asn Leu Pro Asn Pro Glu Asn Ser Val 
Val Val Thr Phe Thr Lys Lys Glu Pro Leu Ser Val Ser Tyr Gly Ile 
65 70 75 80 

Asn Met Glu Glu His Asp Lys Glu Gly Arg Val Ile Thr Cys Glu Phe 
85 90 95 

Glu Ser Phe Tyr Leu Val Asn Val Tyr Thr Pro Asn Ser Gln Gln Ala 
100 105 110 

Leu Ser Arg Leu Ser Tyr Arg Met Ser Trp Glu Val Glu Phe Lys Lys 
115 120 125 

Phe Leu Lys Ala Leu Glu Leu Lys Lys Pro Val Ile Val Cys Gly Asp 
130 135 140 

Leu Asn Val Ala His Asn Glu Ile Asp Leu Glu Asn Pro Lys Thr Asn 
145 150 155 160 

Arg Lys Asn Ala Gly Phe Ser Asp Glu Glu Arg Glu Lys Phe Ser Glu 
165 170 175 

Leu Leu Asn Ala Gly Phe Ile Asp Thr Phe Arg Tyr Phe Tyr Pro Asn 
180 185 190 

Lys Glu Lys Ala Tyr Thr Trp Trp Ser Tyr Met Gln Gln Ala Arg Asp 
195 200 205 

Lys Asn Ile Gly Trp Arg Ile Asp Tyr Phe Leu Cys Ser Asn Pro Leu 
210 215 220 

Lys Thr Arg Leu Lys Asp Ala Leu Ile Tyr Lys Asp Ile Leu Gly Ser 
225 230 235 240 

Asp His Cys Pro Val Gly Leu Glu Leu Val 
245 250 



(2) INFORMATION FOR SEQ ID NO: 347: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 62... 571 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

AGAATGCGAG CGTTAAAAAA GAAATTTATG TGCCTAATAA GCTTGTTAAT TTTGTTATCG 60 
C ATG AGG GCT TTA CTT TTT TTT ATT TTG TTA CTT TGG TTC AAG GGT TGT 109 
Met Arg Ala Leu Leu Phe Phe Ile Leu Leu Leu Trp Phe Lys Gly Cys 
1 5 10 15 

GGG TAT AAG CCT ATT GCA GCT TAC GCT CAA AAC GCT TTA GGC GAT AGC 157 
Gly Tyr Lys Pro Ile Ala Ala Tyr Ala Gln Asn Ala Leu Gly Asp Ser 
20 25 30 

GTA TAC GTG AAA CTC ATT GTG AAT TTG CCT AAC CCT GAA AAC TCT GTA 205 
TTA GAG TTG AAA AAA CCG GTC ATT GTG TGT GGG GAT TTG AAT GTG GCT 489 
Leu Glu Leu Lys Lys Pro Val Ile Val Cys Gly Asp Leu Asn Val Ala 
135 140 145 

CAC AAT GAA ATT GAT TTA GAA AAC CCC AAA ACC AAC CGA AAA AAT GCC 537 
His Asn Glu lie Asp Leu Glu Asn Pro Lys Thr Asn Arg Lys Asn Ala 
150 155 160 

GGC TTT AGC GAT GAA GAG AGA GAA AAA TTC AGC GAG CTT TTG AAC GCC 585 
Gly Phe Ser Asp Glu Glu Arg Glu Lys Phe Ser Glu Leu Leu Asn Ala 
165 170 175 180 

GGT TTT ATT GAG ACT TTC CGT TAT TTT TAC CCT AAC AAA GAA AAG GCT 633 
Gly Phe Ile Asp Thr Phe Arg Tyr Phe Tyr Pro Asn Lys Glu Lys Ala 
185 190 195 

TAC ACC TGG TGG AGT TAC ATG CAA CAA GCA AGG GAT AAA AAC ATT GGT 681 
Tyr Thr Trp Trp Ser Tyr Met Gln Gln Ala Arg Asp Lys Asn Ile Gly 
200 205 210 

TGG CGC ATT GAT TAT TTT TTA TGC TCT AAC CCT TTA AAA ACG CGC TTA 729 
Trp Arg lie Asp Tyr Phe Leu Cys Ser Asn Pro Leu Lys Thr Arg Leu 
215 220 225 

AAA GAC GCT TTA ATC TAT AAA GAT ATT TTA GGG AGC GAT CAT TGC CCG 777 
Lys Asp Ala Leu lie Tyr Lys Asp lie Leu Gly Ser Asp His Cys Pro 
230 235 240 

GTA GGG TTG GAA TTA GTT TAAAGGTAGA AAGTGTGCGA AATAAAGACA GAAAAAAG 833 
Val Gly Leu Glu Leu Val 
245 250 

CCTTACAA 841 
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

Met Lys Leu Ile Ser Trp Asn Val Asn Gly Leu Arg Ala Cys Met Thr 
1 5 10 15 

Lys Gly Phe Met Asp Phe Phe Asn Ser Val Asp Ala Asp Val Phe Cys 

20 25 30 

lie Gin Glu Ser Lys Met Gin Gin Glu Gin Asn Thr Phe Glu Phe Lys 

35 40 45 

Gly Tyr Phe Asp Phe Trp Asn Cys Ala lie Lys Lys Gly Tyr Ser Gly 
50 55 60 
(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 46... 795 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:345: 

ATGACTAAAA AGATATAATT CATTCAAAAT TAAACAAGGA TTACA ATG AAA CTG ATT 57 

Met Lys Leu lie 
1 

TCA TGG AAT GTG AAC GGG TTA AGG GCT TGC ATG ACT AAG GGC TTT ATG 105 
Ser Trp Asn Val Asn Gly Leu Arg Ala Cys Met Thr Lys Gly Phe Met 
5 10 15 20 

GAT TTT TTC AAT AGC GTT GAT GCG GAT GTT TTT TGC ATT CAA GAA TCT 153 
Asp Phe Phe Asn Ser Val Asp Ala Asp Val Phe Cys lie Gin Glu Ser 
25 30 35 

AAA ATG CAG CAA GAA CAA AAC ACC TTT GAA TTT AAA GGG TAT TTT GAT 201 
Lys Met Gin Gin Glu Gin Asn Thr Phe Glu Phe Lys Gly Tyr Phe Asp 
40 45 50 

TTT TGG AAT TGC GCG ATT AAA AAG GGC TAT TCT GGG GTG GTA ACT TTC 249 
Phe Trp Asn Cys Ala Ile Lys Lys Gly Tyr Ser Gly Val Val Thr Phe 
55 60 65 

ACT AAA AAA GAG CCT TTA AGC GTG AGC TAT GGT ATT AAT ATG GAA GAG 297 
Thr Lys Lys Glu Pro Leu Ser Val Ser Tyr Gly Ile Asn Met Glu Glu 
70 75 80 

CAT GAC AAA GAA GGG CGC GTA ATA ACT TGC GAA TTT GAG TCG TTT TAT 345 
His Asp Lys Glu Gly Arg Val Ile Thr Cys Glu Phe Glu Ser Phe Tyr 
85 90 95 100 

TTG GTG AAT GTT TAT ACC CCT AAT TCC CAA CAA GCC CTA TCC AGG CTT 393 
Leu Val Asn Val Tyr Thr Pro Asn Ser Gln Gln Ala Leu Ser Arg Leu 
105 110 115 

AGT TAT CGC ATG AGT TGG GAA GTG GAG TTT AAG AAA TTT TTA AAA GCT 441 
Ser Tyr Arg Met Ser Trp Glu Val Glu Phe Lys Lys Phe Leu Lys Ala 
120 125 130

ATCGCTCAAA TTTCAACGAC CATGCTTGTT AAAAAAAACT AAAGGAATGT T ATG CAA 57
Met Gln
1 

GAT GAA TTA TTT GAA ACC GAA AAA ATC CCC CCA AAA AAC ACT AAA AAT 105
Asp Glu Leu Phe Glu Thr Glu Lys Ile Pro Pro Lys Asn Thr Lys Asn
5 10 15 

ACT AAA AAC GCC CCT AAA AAA ACT TTT GAA GAG CAT GTT CAT TCC CTA 153 
Thr Lys Asn Ala Pro Lys Lys Ser Phe Glu Glu His Val His Ser Leu 
20 25 30 

GAG CGA GCC ATA GAT CGC TTG AAT GAT CCC AAT TTA TCC TTA AAA GAC 201
Glu Arg Ala Ile Asp Arg Leu Asn Asp Pro Asn Leu Ser Leu Lys Asp
35 40 45 50

GGG ATG GAT TTG TAT AAA ACG GCC ATG CAA GAG TTG TTT TTG GCT CAA 249
Gly Met Asp Leu Tyr Lys Thr Ala Met Gln Glu Leu Phe Leu Ala Gln
55 60 65

AAG CTT TTA GAA AAC GCT TAT TTA GAG CAT GAA AAA CTC CAA ACG CCA 297
Lys Leu Leu Glu Asn Ala Tyr Leu Glu His Glu Lys Leu Gln Thr Pro
70 75 80

GAC CAA AAG GCT TAAAGCATGC GAGTGTTTGC TTTGCAATTA GAATCTTTTA AAGAA 354
Asp Gln Lys Ala
85

AATCTC 360

(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

Met Gln Asp Glu Leu Phe Glu Thr Glu Lys Ile Pro Pro Lys Asn Thr
1 5 10 15

Lys Asn Thr Lys Asn Ala Pro Lys Lys Ser Phe Glu Glu His Val His
20 25 30

Ser Leu Glu Arg Ala Ile Asp Arg Leu Asn Asp Pro Asn Leu Ser Leu
35 40 45

Lys Asp Gly Met Asp Leu Tyr Lys Thr Ala Met Gln Glu Leu Phe Leu
50 55 60

Ala Gln Lys Leu Leu Glu Asn Ala Tyr Leu Glu His Glu Lys Leu Gln
65 70 75 80

Thr Pro Asp Gln Lys Ala
85

35 40 45

ATT GCT TTA GGC GAT AGG GTT AAG CTA GAG CTT ACG CCC TAT AGC TTA 255 
Ile Ala Leu Gly Asp Arg Val Lys Leu Glu Leu Thr Pro Tyr Ser Leu
50 55 60 

GAC AAA GGT CGG ATA ACT TTT AGA TAT AAA TGAATTTAAG GGTTATTTCA ATG 308
Asp Lys Gly Arg Ile Thr Phe Arg Tyr Lys
65 70 

AAAATATGTT AATATAA 325 

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Met Ala Arg Asp Asp Val Ile Glu Val Asp Gly Lys Val Ile Glu Ala
1 5 10 15

Leu Pro Asn Ala Thr Phe Lys Val Glu Leu Asp Asn Lys His Val Val
20 25 30

Leu Cys Arg Ile Ser Gly Lys Met Arg Met His Tyr Ile Arg Ile Ala
35 40 45

Leu Gly Asp Arg Val Lys Leu Glu Leu Thr Pro Tyr Ser Leu Asp Lys
50 55 60

Gly Arg Ile Thr Phe Arg Tyr Lys
65 70 
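
A protein record can be cross-checked against its stated LENGTH by counting the residue tokens while skipping the position numbers. The sketch below does this for SEQ ID NO: 342, whose stated length is 72 amino acids. The helper name `count_residues` is hypothetical, and the residues are transcribed here with the standard three-letter codes (Ile for isoleucine); this is an illustrative check, not part of the listing itself.

```python
# Standard three-letter codes for the 20 common amino acids.
STANDARD = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
}


def count_residues(listing):
    """Count three-letter residue codes, ignoring position numbers."""
    residues = [tok for tok in listing.split() if not tok.isdigit()]
    unknown = [tok for tok in residues if tok not in STANDARD]
    if unknown:
        raise ValueError(f"unrecognised residue codes: {unknown}")
    return len(residues)


# Residue rows and position numbers transcribed from SEQ ID NO: 342.
SEQ_342 = """
Met Ala Arg Asp Asp Val Ile Glu Val Asp Gly Lys Val Ile Glu Ala
1 5 10 15
Leu Pro Asn Ala Thr Phe Lys Val Glu Leu Asp Asn Lys His Val Val
20 25 30
Leu Cys Arg Ile Ser Gly Lys Met Arg Met His Tyr Ile Arg Ile Ala
35 40 45
Leu Gly Asp Arg Val Lys Leu Glu Leu Thr Pro Tyr Ser Leu Asp Lys
50 55 60
Gly Arg Ile Thr Phe Arg Tyr Lys
65 70
"""

print(count_residues(SEQ_342))
```

A count that differs from the stated LENGTH, or a `ValueError` for an unrecognised code, would point to a transcription problem in the record.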

(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52... 309 
(D) OTHER INFORMATION: 
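
The coding range in a feature table can be checked against the length of the corresponding protein record. The sketch below assumes the range is 1-based and inclusive and that it ends before the termination codon; that reading is an interpretation, not something the listing states. The function name `cds_codon_count` is illustrative only. The figures are those printed in the listing for SEQ ID NO: 343 (range 52 to 309, protein of 86 amino acids) and SEQ ID NO: 341 (range 70 to 285, protein of 72 amino acids).

```python
def cds_codon_count(start, end):
    """Return the number of whole codons in a 1-based, inclusive range."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


# (CDS start, CDS end, stated protein length), copied from the listing.
RECORDS = {
    "SEQ ID NO: 343 -> 344": (52, 309, 86),
    "SEQ ID NO: 341 -> 342": (70, 285, 72),
}

for name, (start, end, protein_length) in RECORDS.items():
    codons = cds_codon_count(start, end)
    status = "consistent" if codons == protein_length else "MISMATCH"
    print(f"{name}: {codons} codons vs {protein_length} residues ({status})")
```

Under these assumptions both records agree: (309 - 52 + 1) / 3 = 86 and (285 - 70 + 1) / 3 = 72.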



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 



WO 98/43478 PCT/US98/06371

50 55 60

Thr Tyr Asp Ile Leu Phe Leu Glu Pro Ile Leu Ala Pro Leu Asn Ser
65 70 75 80

Lys Cys Glu Ile Ala Leu Val Ser Glu Ser Met Asp Ile Val Met Asn
85 90 95

Glu Ser Leu Ile Asn Ser Phe Asp Ile Thr Leu Asp Tyr Val Tyr Gly
100 105 110

Glu Ala Lys Arg Ala Tyr Glu Glu Asp Ile Leu Ser His Ile Tyr Gln
115 120 125

Tyr Arg Lys Gly Asn Ala Ile Lys Ser Leu Lys Asp Lys Asn Ile Phe
130 135 140

Ile Val Asp Arg Gly Ile Glu Thr Gly Phe Arg Ala Gly Leu Gly Val
145 150 155 160

Gln Thr Cys Leu Lys Lys Glu Cys Gln Asp Ile Tyr Ile Leu Thr Pro
165 170 175

Ile Val Ala Gln Asn Val Ala Gln Gly Leu Glu Ser Leu Cys Asp Gly
180 185 190

Val Ile Ser Val Tyr Arg Pro Glu Cys Phe Val Ser Val Glu His His
195 200 205

Tyr Lys Glu Leu Lys Arg Leu Ser Asn Glu Glu Val Glu Lys Tyr Leu
210 215 220

Gly Ala Asn Asn Met Pro Asn Leu Lys Lys Glu His
225 230 235



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70... 285 
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:341: 

TAACACAAGC CACCATGAGC ATACTATCGC CATAGTTGGC AATAAAGCAG TGATTCTTAC 60 
GGAGCGTTA ATG GCA AGA GAT GAT GTT ATA GAA GTG GAT GGG AAA GTG ATT 111 
Met Ala Arg Asp Asp Val Ile Glu Val Asp Gly Lys Val Ile
1 5 10

GAG GCG TTG CCT AAC GCT ACT TTT AAG GTG GAG TTA GAC AAT AAG CAT 159 
Glu Ala Leu Pro Asn Ala Thr Phe Lys Val Glu Leu Asp Asn Lys His 
15 20 25 30 

GTG GTG TTG TGC CGT ATT TCT GGA AAG ATG CGC ATG CAC TAT ATT AGG 207
Val Val Leu Cys Arg Ile Ser Gly Lys Met Arg Met His Tyr Ile Arg

GCC AAG CGA GCT TAT GAA GAA GAC ATT TTG TCT CAC ATC TAT CAG TAT 444
Ala Lys Arg Ala Tyr Glu Glu Asp Ile Leu Ser His Ile Tyr Gln Tyr
115 120 125 

CGC AAA GGC AAT GCG ATC AAA AGC TTA AAA GAT AAA AAT ATT TTT ATC 492
Arg Lys Gly Asn Ala Ile Lys Ser Leu Lys Asp Lys Asn Ile Phe Ile
130 135 140 145

GTA GAT AGG GGG ATT GAA ACC GGG TTT AGA GCA GGG TTA GGC GTG CAA 540
Val Asp Arg Gly Ile Glu Thr Gly Phe Arg Ala Gly Leu Gly Val Gln
150 155 160

ACT TGC TTG AAA AAA GAA TGC CAA GAC ATT TAT ATT TTA ACC CCC ATT 588
Thr Cys Leu Lys Lys Glu Cys Gln Asp Ile Tyr Ile Leu Thr Pro Ile
165 170 175

GTC GCG CAA AAT GTC GCT CAA GGC TTA GAA AGT TTG TGC GAT GGG GTG 636
Val Ala Gln Asn Val Ala Gln Gly Leu Glu Ser Leu Cys Asp Gly Val
180 185 190

ATT AGT GTG TAT CGC CCT GAA TGT TTT GTC TCT GTG GAG CAT CAT TAT 684
Ile Ser Val Tyr Arg Pro Glu Cys Phe Val Ser Val Glu His His Tyr
195 200 205

AAA GAA CTC AAG CGA TTA AGC AAT GAA GAA GTT GAA AAA TAC TTG GGC 732
Lys Glu Leu Lys Arg Leu Ser Asn Glu Glu Val Glu Lys Tyr Leu Gly
210 215 220 225

GCT AAC AAC ATG CCT AAT TTA AAA AAG GAA CAT TAAATATGGA TTTTATCACC 785
Ala Asn Asn Met Pro Asn Leu Lys Lys Glu His 
230 235 

ATCAATTCTA GTAACAAAAC CGAA 809 



(2) INFORMATION FOR SEQ ID NO: 340:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 236 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:340: 



Met His Leu Asn Thr Asp Phe Ser His Ile Thr Asp Ile Glu Gly Met
1 5 10 15

Arg Phe Ile Asn Glu Glu Asp Ala Leu Asn Lys Leu Ile Asn Glu Ile
20 25 30

His Thr Arg His Ile Asp Leu Lys Asp Ser Ile Met Leu Ala Leu Ser
35 40 45

Phe Asn Ala Leu Tyr Leu Ala His Ala Leu Ala Gln Lys Phe Gly Ala

Tyr Lys Gln Glu Glu Asn Ser
130 135 

(2) INFORMATION FOR SEQ ID NO: 339:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 809 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 58... 765
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

ACCGATCACT AAAACCAATG TAACTTACCG CTCTTTACAG CGTAAGTGAG AAAAGGA ATG 60 

Met 
1 

CAT TTG AAT ACG GAT TTT AGC CAT ATC ACC GAT ATA GAG GGC ATG CGT 108
His Leu Asn Thr Asp Phe Ser His Ile Thr Asp Ile Glu Gly Met Arg
5 10 15

TTT ATC AAT GAA GAA GAC GCT TTG AAC AAA TTG ATT AAT GAA ATC CAC 156
Phe Ile Asn Glu Glu Asp Ala Leu Asn Lys Leu Ile Asn Glu Ile His
20 25 30

ACG CGC CAC ATT GAT TTA AAA GAT TCC ATC ATG CTC GCT TTG AGT TTT 204
Thr Arg His Ile Asp Leu Lys Asp Ser Ile Met Leu Ala Leu Ser Phe
35 40 45

AAC GCT CTG TAT TTA GCT CAC GCT TTA GCG CAA AAA TTT GGA GCG ACT 252
Asn Ala Leu Tyr Leu Ala His Ala Leu Ala Gln Lys Phe Gly Ala Thr
50 55 60 65

TAT GAT ATA CTT TTT TTA GAA CCT ATC CTA GCC CCT TTA AAC TCA AAA 300
Tyr Asp Ile Leu Phe Leu Glu Pro Ile Leu Ala Pro Leu Asn Ser Lys
70 75 80

TGC GAG ATC GCT TTA GTG AGT GAG AGC ATG GAT ATA GTG ATG AAT GAA 348
Cys Glu Ile Ala Leu Val Ser Glu Ser Met Asp Ile Val Met Asn Glu
85 90 95

AGT TTG ATC AAT TCC TTT GAC ATC ACT TTA GAC TAT GTT TAT GGG GAA 396
Ser Leu Ile Asn Ser Phe Asp Ile Thr Leu Asp Tyr Val Tyr Gly Glu
100 105 110

TCT ATC ATG TTC CCT ACC ATC TTT TCT TTG GCT ACG CTC AAT TTA GGG 426
Ser Ile Met Phe Pro Thr Ile Phe Ser Leu Ala Thr Leu Asn Leu Gly
55 60 65

CAT CTC ACT TCT AAA GCT TCT GGG GTG ATT AGC ATG GCG ATT GTG GGA 474
His Leu Thr Ser Lys Ala Ser Gly Val Ile Ser Met Ala Ile Val Gly
70 75 80 85

GGG GCG TTA ATC CCC CCC ATT CAA GGT GCG GTT ACA GAC ATG CTA ACA 522
Gly Ala Leu Ile Pro Pro Ile Gln Gly Ala Val Thr Asp Met Leu Thr
90 95 100

GCA ACC GAA TCA AAT TTG CTC TAC GCT TAT GGT GTG CCG TTG TTG TGC 570
Ala Thr Glu Ser Asn Leu Leu Tyr Ala Tyr Gly Val Pro Leu Leu Cys
105 110 115

TAT TTT TAT ATT CTC TTC TTT GCG CTT AAA GGG TAT AAG CAA GAA GAA 618
Tyr Phe Tyr Ile Leu Phe Phe Ala Leu Lys Gly Tyr Lys Gln Glu Glu
120 125 130

AAC TCC TAAAAAAAGG GGGGGTTTCT TTCTTCTTTC CTTTCTTTTA TCTTGTTTTA AA 676
Asn Ser
135

AATCAGTAA 685

(2) INFORMATION FOR SEQ ID NO:338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:338: 

Met Val Gly Arg Phe Leu Gly Ser Val Leu Met Asn Lys Ile Ala Pro
1 5 10 15

Asn Lys Tyr Leu Ala Phe Asn Ala Leu Ser Ser Ile Val Leu Ile Ala
20 25 30

Leu Ala Ile Ile Ile Gly Gly Lys Ile Ala Leu Phe Ala Leu Thr Phe
35 40 45

Val Gly Phe Phe Asn Ser Ile Met Phe Pro Thr Ile Phe Ser Leu Ala
50 55 60

Thr Leu Asn Leu Gly His Leu Thr Ser Lys Ala Ser Gly Val Ile Ser
65 70 75 80

Met Ala Ile Val Gly Gly Ala Leu Ile Pro Pro Ile Gln Gly Ala Val
85 90 95

Thr Asp Met Leu Thr Ala Thr Glu Ser Asn Leu Leu Tyr Ala Tyr Gly
100 105 110

Val Pro Leu Leu Cys Tyr Phe Tyr Ile Leu Phe Phe Ala Leu Lys Gly
115 120 125

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

Val His Asp Ile Thr Lys Leu Cys Tyr Thr Lys Pro Leu Gly Cys Val
1 5 10 15

Val Leu Phe Ser Lys Asp Thr Asp Leu Val Pro Val Leu Glu Ser Ala
20 25 30

Trp Glu Lys Gly Phe Glu Val Phe Ile Ala Asn Ile Gln Glu Cys Pro
35 40 45

Asn Ser Val Pro Ser Asp Leu Lys Lys Ser Cys Asn Val Arg Glu Arg
50 55 60

Ser Val Ala Glu Ile Val Asp Asn Leu Pro Lys Asn Gln His Thr Pro
65 70 75 80

Lys Lys Lys Asn Phe Ser Thr Asn Glu Pro Phe Asn Asn Pro Phe Lys
85 90 95

Asp Gln Leu Phe Lys Lys Asn
100

(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 220... 624 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

TCTTTTGAAA TTGCCTGATG TGGAAAAAGA AATGCCCAAA GAGACGACTC AAAAAAGCTT 60 
GTTTTCGCAC AAACACTTTG TTTTTGGGGC TTGGGGATCT TTTTTTATGT GGGGGGAGAA 120

NTGGCGATTG GCTCATTCTT GGTGCTAAGC TTTGAAAAGC TTTTGAATTT AGACTCTCAA 180 
TCAAGCGCGC ATTACTTGGT GTATTATTGG GGAGGCGCG ATG GTG GGC CGT TTC 234 

Met Val Gly Arg Phe 
1 5 

TTA GGC AGT GTG TTG ATG AAT AAA ATT GCC CCT AAT AAA TAC TTG GCT 282
Leu Gly Ser Val Leu Met Asn Lys Ile Ala Pro Asn Lys Tyr Leu Ala
10 15 20

TTC AAC GCC TTA AGC TCT ATT GTT CTC ATC GCT TTA GCC ATT ATC ATT 330
Phe Asn Ala Leu Ser Ser Ile Val Leu Ile Ala Leu Ala Ile Ile Ile
25 30 35

GGA GGC AAG ATC GCT TTA TTC GCT CTG ACT TTT GTG GGC TTT TTC AAC 378
Gly Gly Lys Ile Ala Leu Phe Ala Leu Thr Phe Val Gly Phe Phe Asn
40 45 50

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 120... 428 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

GTCGGTCGGG TAATGTTCAA ATTCACAAAT GAGTCTGAAG ACAAAGAAGT CTTGATCTAG 60 
AAGCCGAAAT TCTCATACCG CACTTAGAAT TGCGTCAAAA ACAAATTGAT GCGCTGTTG 119 



GTG CAC GAT ATT ACC AAG CTA TGT TAC ACC AAA CCA CTA GGG TGT GTT 167
Val His Asp Ile Thr Lys Leu Cys Tyr Thr Lys Pro Leu Gly Cys Val
1 5 10 15

GTG CTG TTC AGC AAG GAT ACT GAT CTT GTG CCT GTG TTA GAA TCC GCT 215 
Val Leu Phe Ser Lys Asp Thr Asp Leu Val Pro Val Leu Glu Ser Ala 
20 25 30 

TGG GAG AAA GGC TTT GAA GTC TTC ATT GCT AAC ATT CAA GAA TGC CCC 263 
Trp Glu Lys Gly Phe Glu Val Phe Ile Ala Asn Ile Gln Glu Cys Pro
35 40 45

AAT TCT GTC CCT TCA GAC TTG AAG AAG TCT TGC AAT GTG AGG GAA CGC 311
Asn Ser Val Pro Ser Asp Leu Lys Lys Ser Cys Asn Val Arg Glu Arg
50 55 60

AGT GTC GCT GAA ATT GTA GAT AAC TTG CCC AAA AAT CAG CAC ACT CCC 359
Ser Val Ala Glu Ile Val Asp Asn Leu Pro Lys Asn Gln His Thr Pro
65 70 75 80

AAG AAA AAG AAC TTT TCC ACC AAC GAG CCT TTT AAC AAC CCA TTT AAA 407
Lys Lys Lys Asn Phe Ser Thr Asn Glu Pro Phe Asn Asn Pro Phe Lys
85 90 95

GAC CAA CTC TTT AAG AAG AAC TAACACGATC CCCACACCAA GGGGACAAAA AAGCA 463
Asp Gln Leu Phe Lys Lys Asn
100

CCCATTTTAA AAGG 477
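
Each codon row in the listing is paired with its three-letter translation, so a row can be re-derived mechanically. The sketch below is a minimal illustration that assumes the standard genetic code applies; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical. It translates the second coding row of SEQ ID NO: 335, the row ending at position 215.

```python
from itertools import product

# Standard genetic code, first/second/third base each in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# One-letter to three-letter amino acid codes; "End" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "End",
}


def translate(codon_row):
    """Translate space-separated codons into three-letter residue codes."""
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codon_row.split())


ROW = "GTG CTG TTC AGC AAG GAT ACT GAT CTT GTG CCT GTG TTA GAA TCC GCT"
print(translate(ROW))
```

A row whose computed translation disagrees with the printed residues, or that yields `End` inside a coding range, is a candidate transcription error in either the codons or the residues.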

(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal

435 440 445

Val Asn Asn Asn Pro Asn Asp Pro Thr Leu Cys Gln Arg Gly Lys Ala
450 455 460

Asn Gly Asn Ile Tyr Gly Gly Tyr Val Gln Ala Asn Tyr Ser Pro His
465 470 475 480

Lys Ile Ile Thr Phe Gly Ala Gly Val Arg Trp Asp Ala Tyr Thr Leu
485 490 495

Tyr Asp Lys Asp Trp Asn His Arg Tyr Thr Gln Gly Phe Ser Pro Ser
500 505 510

Ala Ala Leu Val Leu Ser Pro Ile Glu Pro Leu Ser Leu Lys Ile Thr
515 520 525

Tyr Ser Gln Val Thr Arg Gly Val Met Pro Gly Asp Gly Val Tyr Met
530 535 540

Arg Gln Asn Asp Leu Arg Tyr Ala Lys Asn Ile Lys Pro Glu Val Gly
545 550 555 560

Ser Asn Ala Glu Phe Asn Ile Asp Tyr Ser Ser Gln Tyr Phe Ser Gly
565 570 575

Arg Ala Ala Ala Phe Tyr Gln Ala Leu Asp Asn Phe Ile Ser Gln Tyr
580 585 590

Ala Gln Asn Leu Ile Val Thr Asn Leu Ser Gln Ala Ile Arg Ile Tyr
595 600 605

Gly Tyr Glu Val Gly Gly Thr Phe Arg Tyr Lys Gly Val Ser Leu Asn
610 615 620

Val Gly Val Ser Arg Thr Trp Pro Thr Thr Arg Gly Tyr Leu Met Ala
625 630 635 640

Asp Ser Tyr Glu Leu Ala Ala Ser Thr Gly Asn Val Phe Ile Ile Lys
645 650 655

Leu Asp Tyr Thr Ile Pro Lys Thr Gly Ile Asn Leu Ala Trp Leu Ser
660 665 670

Arg Phe Val Thr Gly Leu Asp Tyr Cys Gly Phe Asp Ile Tyr Leu Pro
675 680 685

Asp Tyr Gly Thr Ala Glu Lys Pro Lys Thr Pro Thr Asp Leu Ala Lys
690 695 700

Cys Gly Ser Gln Leu Gly Leu Val His Met His Lys Pro Gly Tyr Gly
705 710 715 720

Val Ser Asn Phe Tyr Ile Asn Trp Ser Pro Lys Thr Lys Ser Arg Trp
725 730 735

Lys Gly Leu Leu Leu Ser Ala Val Phe Asn Asn Val Phe Asn Lys Phe
740 745 750

Tyr Val Asp Gln Thr Ser Pro Tyr Val Met Ser Pro Asp Met Pro Gly
755 760 765

Thr Asp Ala Val Lys Arg Ala Ile Ala Glu Pro Gly Phe Asn Ala Arg
770 775 780

Phe Glu Val Ala Tyr Lys Trp
785 790

(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

Met Phe Leu Arg Val Tyr Pro Lys Leu Arg Tyr Ala Leu Cys Phe Pro
1 5 10 15

Leu Leu Ala Glu Thr Cys Tyr Ser Glu Glu Arg Thr Leu Asn Lys Val
20 25 30

Thr Thr Gln Ala Lys Arg Ile Phe Thr Tyr Asn Asn Glu Phe Lys Val
35 40 45

Thr Ser Lys Glu Leu Asp Gln Arg Gln Ser Asn Glu Val Lys Asp Leu
50 55 60

Phe Arg Thr Asn Pro Asp Val Asn Val Gly Gly Gly Ser Val Met Gly
65 70 75 80

Gln Lys Ile Tyr Val Arg Gly Val Glu Asp Arg Leu Leu Arg Val Thr
85 90 95

Val Asp Gly Ala Ala Gln Asn Gly Asn Ile Tyr His His Gln Gly Asn
100 105 110

Thr Val Ile Asp Pro Gly Met Leu Lys Ser Val Glu Val Thr Lys Gly
115 120 125

Ala Ala Asn Ala Ser Ala Gly Pro Gly Ala Ile Ala Gly Val Ile Lys
130 135 140

Met Glu Thr Lys Gly Ala Ala Asp Phe Ile Pro Arg Gly Lys Asn Tyr
145 150 155 160

Ala Ala Ser Gly Ala Val Ser Phe Tyr Thr Asn Phe Gly Asp Arg Glu
165 170 175

Thr Phe Arg Ser Ala Tyr Gln Asn Ala His Phe Asp Ile Ile Ala Tyr
180 185 190

Tyr Thr His Gln Asn Ile Phe Tyr Tyr Arg Ser Gly Ala Thr Ala Met
195 200 205

Lys Asn Leu Phe Asn Pro Thr Gln Ala Asp Lys Glu Pro Gly Thr Pro
210 215 220

Ser Glu Gln Asn Asn Ala Leu Ile Lys Met Asn Gly Tyr Leu Ser Asp
225 230 235 240

Arg Asp Thr Leu Thr Phe Ser Trp Asn Met Thr Arg Asp Asn Ala Thr
245 250 255

Arg Pro Leu Arg Ser Asn Ala Ile Gly Leu Ala Tyr Pro Cys Glu Ala
260 265 270

Pro Phe Ser Pro Asp Ser Ser Gln Gly Cys Pro Asn Val Leu Asp Ser
275 280 285

Phe Thr Arg Tyr Met Tyr His Ser Ile Asn Ser Ala Asn Asn Leu Ser
290 295 300

Leu Gln Tyr Lys Arg Glu Ala Gly Asn Ser Phe Gly Asp Pro Arg Leu
305 310 315 320

Asp Phe Thr Leu Tyr Thr Ser Ile Arg Asn Ala Gln Phe Asp Pro Leu
325 330 335

Phe Asp Pro Asn Gly Val Tyr Ala Lys Phe Pro Thr Ser Leu Ala Ser
340 345 350

Ala Trp Glu Lys Glu Asn Tyr Pro Cys Val Glu Gly Ala Tyr Cys Thr
355 360 365

Pro Ser Phe Ser Asp Val Asp Lys Pro Ser Ser Gln Pro Arg Asn Leu
370 375 380

Phe Leu Asn Asn Thr Gly Leu Asn Leu Lys Val Ala His Val Ile Asp
385 390 395 400

Glu Ala Thr Asp Ser Leu Phe Glu Tyr Gly Phe Asn Tyr Gln Asn Leu
405 410 415

Ser Val Phe Asp Ala Arg Ile Pro Lys Ser Glu Leu Tyr Arg Pro Asn
420 425 430

Gln Val Tyr Thr Asp Asp Lys Gly Gln Lys Gln Ile Ala Cys Ser Leu

AGC TAT GAG CTT GCC GCA AGC ACC GGT AAT GTT TTT ATC ATC AAA TTG 2028
Ser Tyr Glu Leu Ala Ala Ser Thr Gly Asn Val Phe Ile Ile Lys Leu
645 650 655 

GAT TAC ACC ATC CCC AAA ACA GGG ATC AAT CTT GCA TGG CTT AGC CGC 2076
Asp Tyr Thr Ile Pro Lys Thr Gly Ile Asn Leu Ala Trp Leu Ser Arg
660 665 670

TTT GTT ACC GGT TTA GAT TAT TGC GGG TTT GAT ATT TAC TTG CCT GAT 2124
Phe Val Thr Gly Leu Asp Tyr Cys Gly Phe Asp Ile Tyr Leu Pro Asp
675 680 685 

TAT GGG ACG GCT GAG AAA CCC AAA ACC CCT ACC GAT TTA GCC AAA TGC 2172 
Tyr Gly Thr Ala Glu Lys Pro Lys Thr Pro Thr Asp Leu Ala Lys Cys 
690 695 700 705 

GGA TCT CAA TTA GGG TTA GTG CAT ATG CAT AAA CCG GGC TAT GGC GTG 2220
Gly Ser Gln Leu Gly Leu Val His Met His Lys Pro Gly Tyr Gly Val
710 715 720

AGT AAT TTT TAT ATC AAT TGG AGT CCT AAA ACC AAA AGC CGC TGG AAG 2268
Ser Asn Phe Tyr Ile Asn Trp Ser Pro Lys Thr Lys Ser Arg Trp Lys
725 730 735

GGT TTG TTG CTT TCA GCC GTG TTT AAT AAT GTT TTC AAC AAA TTC TAT 2316
Gly Leu Leu Leu Ser Ala Val Phe Asn Asn Val Phe Asn Lys Phe Tyr
740 745 750

GTG GAT CAA ACA AGC CCT TAT GTC ATG AGC CCG GAT ATG CCA GGC ACT 2364
Val Asp Gln Thr Ser Pro Tyr Val Met Ser Pro Asp Met Pro Gly Thr
755 760 765

GAC GCT GTT AAA AGA GCG ATC GCT GAG CCT GGG TTT AAC GCG CGT TTT 2412
Asp Ala Val Lys Arg Ala Ile Ala Glu Pro Gly Phe Asn Ala Arg Phe
770 775 780 785

GAA GTG GCT TAC AAA TGG TAGTTAATGG AGCTTTAAGC GTTGCGCATG CGTGATAG 2468
Glu Val Ala Tyr Lys Trp
790

CAACGGCTAT CGC 2481

(2) INFORMATION FOR SEQ ID NO:334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334:

GTT TTT GAC GCT CGC ATC CCT AAA TCA GAA TTA TAC AGG CCT AAT CAA 1356
Val Phe Asp Ala Arg Ile Pro Lys Ser Glu Leu Tyr Arg Pro Asn Gln
420 425 430 

GTT TAT ACT GAT GAT AAA GGA CAA AAA CAA ATC GCT TGC TCT CTT GTG 1404
Val Tyr Thr Asp Asp Lys Gly Gln Lys Gln Ile Ala Cys Ser Leu Val
435 440 445

AAT AAT AAC CCC AAT GAC CCC ACT CTG TGC CAA AGA GGG AAA GCG AAC 1452
Asn Asn Asn Pro Asn Asp Pro Thr Leu Cys Gln Arg Gly Lys Ala Asn
450 455 460 465

GGG AAT ATT TAT GGA GGC TAC GTG CAA GCG AAT TAC TCG CCT CAT AAA 1500
Gly Asn Ile Tyr Gly Gly Tyr Val Gln Ala Asn Tyr Ser Pro His Lys
470 475 480

ATC ATC ACT TTT GGA GCC GGG GTA AGG TGG GAC GCT TAC ACG CTT TAT 1548
Ile Ile Thr Phe Gly Ala Gly Val Arg Trp Asp Ala Tyr Thr Leu Tyr
485 490 495

GAT AAA GAC TGG AAC CAC CGC TAC ACT CAA GGC TTT AGC CCT AGC GCG 1596
Asp Lys Asp Trp Asn His Arg Tyr Thr Gln Gly Phe Ser Pro Ser Ala
500 505 510

GCT CTT GTG CTA AGC CCC ATT GAG CCT TTA TCT TTA AAA ATC ACT TAT 1644
Ala Leu Val Leu Ser Pro Ile Glu Pro Leu Ser Leu Lys Ile Thr Tyr
515 520 525

TCT CAA GTT ACA AGA GGG GTG ATG CCA GGA GAT GGC GTG TAC ATG CGT 1692
Ser Gln Val Thr Arg Gly Val Met Pro Gly Asp Gly Val Tyr Met Arg
530 535 540 545

CAA AAC GAT TTA CGA TAC GCC AAA AAC ATC AAG CCT GAA GTG GGC TCT 1740
Gln Asn Asp Leu Arg Tyr Ala Lys Asn Ile Lys Pro Glu Val Gly Ser
550 555 560

AAC GCT GAA TTT AAT ATT GAT TAT TCA AGC CAG TAT TTT AGC GGG AGG 1788
Asn Ala Glu Phe Asn Ile Asp Tyr Ser Ser Gln Tyr Phe Ser Gly Arg
565 570 575

GCT GCG GCG TTT TAT CAG GCT TTG GAT AAT TTC ATC TCA CAA TAC GCA 1836
Ala Ala Ala Phe Tyr Gln Ala Leu Asp Asn Phe Ile Ser Gln Tyr Ala
580 585 590

CAA AAT TTG ATT GTA ACC AAT TTG AGT CAA GCG ATT CGT ATT TAT GGC 1884
Gln Asn Leu Ile Val Thr Asn Leu Ser Gln Ala Ile Arg Ile Tyr Gly
595 600 605 

TAT GAA GTG GGT GGG ACT TTC AGA TAC AAG GGC GTG AGT TTG AAT GTA 1932 
Tyr Glu Val Gly Gly Thr Phe Arg Tyr Lys Gly Val Ser Leu Asn Val 
610 615 620 625 

GGG GTC TCG CGC ACC TGG CCC ACC ACT AGG GGG TAT TTA ATG GCG GAT 1980 
Gly Val Ser Arg Thr Trp Pro Thr Thr Arg Gly Tyr Leu Met Ala Asp 
630 635 640

Thr His Gln Asn Ile Phe Tyr Tyr Arg Ser Gly Ala Thr Ala Met Lys
195 200 205

AAC CTT TTC AAT CCC ACA CAA GCC GAT AAA GAG CCA GGA ACT CCT AGC 732
Asn Leu Phe Asn Pro Thr Gln Ala Asp Lys Glu Pro Gly Thr Pro Ser
210 215 220 225

GAG CAA AAC AAC GCT TTG ATT AAA ATG AAT GGT TAT TTG AGC GAC AGA 780
Glu Gln Asn Asn Ala Leu Ile Lys Met Asn Gly Tyr Leu Ser Asp Arg
230 235 240

GAC ACG CTC ACT TTC AGC TGG AAC ATG ACA CGA GAT AAC GCT ACA CGC 828
Asp Thr Leu Thr Phe Ser Trp Asn Met Thr Arg Asp Asn Ala Thr Arg
245 250 255

CCT TTA AGG AGT AAC GCT ATA GGG TTA GCC TAT CCT TGT GAA GCC CCC 876
Pro Leu Arg Ser Asn Ala Ile Gly Leu Ala Tyr Pro Cys Glu Ala Pro
260 265 270

TTT AGT CCT GAT AGT TCT CAA GGG TGT CCT AAT GTG TTA GAT AGT TTC 924
Phe Ser Pro Asp Ser Ser Gln Gly Cys Pro Asn Val Leu Asp Ser Phe
275 280 285

ACA AGA TAC ATG TAT CAC TCT ATT AAT AGT GCC AAC AAT CTT TCC TTA 972
Thr Arg Tyr Met Tyr His Ser Ile Asn Ser Ala Asn Asn Leu Ser Leu
290 295 300 305

CAA TAC AAA AGG GAA GCG GGA AAT TCT TTT GGC GAC CCA CGA TTA GAT 1020
Gln Tyr Lys Arg Glu Ala Gly Asn Ser Phe Gly Asp Pro Arg Leu Asp
310 315 320

TTT ACC CTT TAT ACA AGC ATC AGG AAC GCT CAG TTT GAT CCC CTA TTT 1068
Phe Thr Leu Tyr Thr Ser Ile Arg Asn Ala Gln Phe Asp Pro Leu Phe
325 330 335

GAT CCT AAT GGC GTT TAT GCT AAA TTC CCC ACT TCT TTA GCG AGC GCA 1116
Asp Pro Asn Gly Val Tyr Ala Lys Phe Pro Thr Ser Leu Ala Ser Ala
340 345 350

TGG GAA AAA GAA AAT TAC CCA TGC GTT GAA GGC GCT TAT TGC ACC CCA 1164
Trp Glu Lys Glu Asn Tyr Pro Cys Val Glu Gly Ala Tyr Cys Thr Pro
355 360 365

AGC TTT TCA GAT GTG GAT AAA CCA AGC TCA CAG CCT AGG AAT TTG TTT 1212
Ser Phe Ser Asp Val Asp Lys Pro Ser Ser Gln Pro Arg Asn Leu Phe
370 375 380 385

TTA AAC AAC ACC GGC TTA AAC CTT AAA GTC GCG CAT GTG ATT GAT GAA 1260
Leu Asn Asn Thr Gly Leu Asn Leu Lys Val Ala His Val Ile Asp Glu
390 395 400

GCC ACA GAC AGC CTT TTT GAA TAC GGA TTC AAC TAC CAA AAT TTG AGC 1308
Ala Thr Asp Ser Leu Phe Glu Tyr Gly Phe Asn Tyr Gln Asn Leu Ser
405 410 415

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

TTTTTTTTTT TTTTTGATTT TTATTTTTTA AATTTTTAGA TTAAGGAGAG TTGTTGG ATG 60 

Met 
1 

TTT TTA AGA GTA TAC CCA AAG CTT AGA TAC GCT TTA TGT TTC CCC CTA 108
Phe Leu Arg Val Tyr Pro Lys Leu Arg Tyr Ala Leu Cys Phe Pro Leu
5 10 15 

CTC GCT GAG ACT TGC TAT AGC GAA GAG CGG ACT TTA AAT AAG GTT ACC 156
Leu Ala Glu Thr Cys Tyr Ser Glu Glu Arg Thr Leu Asn Lys Val Thr
20 25 30

ACC CAA GCT AAA AGG ATT TTC ACT TAC AAC AAT GAG TTT AAA GTA ACT 204
Thr Gln Ala Lys Arg Ile Phe Thr Tyr Asn Asn Glu Phe Lys Val Thr
35 40 45

TCT AAA GAA CTA GAT CAA CGC CAA AGC AAT GAA GTC AAG GAC TTG TTT 252
Ser Lys Glu Leu Asp Gln Arg Gln Ser Asn Glu Val Lys Asp Leu Phe
50 55 60 65

AGG ACT AAC CCT GAT GTG AAT GTG GGC GGA GGG AGC GTG ATG GGG CAG 300
Arg Thr Asn Pro Asp Val Asn Val Gly Gly Gly Ser Val Met Gly Gln
70 75 80

AAA ATC TAT GTG AGA GGC GTT GAA GAC AGG CTT TTA AGG GTT ACA GTG 348
Lys Ile Tyr Val Arg Gly Val Glu Asp Arg Leu Leu Arg Val Thr Val
85 90 95

GAT GGG GCT GCA CAA AAT GGC AAT ATC TAC CAC CAC CAA GGC AAC ACC 396
Asp Gly Ala Ala Gln Asn Gly Asn Ile Tyr His His Gln Gly Asn Thr
100 105 110

GTG ATT GAC CCT GGC ATG CTC AAA AGC GTG GAA GTT ACC AAA GGC GCG 444
Val Ile Asp Pro Gly Met Leu Lys Ser Val Glu Val Thr Lys Gly Ala
115 120 125

GCG AAT GCG AGC GCG GGG CCA GGA GCG ATT GCG GGA GTG ATT AAA ATG 492
Ala Asn Ala Ser Ala Gly Pro Gly Ala Ile Ala Gly Val Ile Lys Met
130 135 140 145

GAG ACT AAA GGA GCG GCT GAT TTT ATC CCT AGG GGG AAA AAT TAT GCT 540
Glu Thr Lys Gly Ala Ala Asp Phe Ile Pro Arg Gly Lys Asn Tyr Ala
150 155 160

GCC AGT GGG GCG GTG AGT TTT TAT ACC AAT TTT GGC GAT CGA GAG ACT 588
Ala Ser Gly Ala Val Ser Phe Tyr Thr Asn Phe Gly Asp Arg Glu Thr
165 170 175

TTC AGA TCG GCT TAT CAA AAC GCG CAT TTT GAT ATT ATC GCT TAC TAC 636
Phe Arg Ser Ala Tyr Gln Asn Ala His Phe Asp Ile Ile Ala Tyr Tyr
180 185 190

ACG CAC CAA AAC ATC TTC TAT TAT AGA AGC GGC GCT ACA GCG ATG AAA 684

Glu Met Gly Trp Leu Ala Phe Leu Ala Leu Ala His Arg Met Asn Leu
30 35 40 

GGC ATT AAA AAA AGC GTG AGA AAA AAA GCT ATG AAA AAC CCT AAC CCT 256
Gly Ile Lys Lys Ser Val Arg Lys Lys Ala Met Lys Asn Pro Asn Pro
45 50 55

GCT CTA AAA GTC AAA TAC TGG AAA AGA TTG ATA TTG AAA TAGCCATATA GT 307
Ala Leu Lys Val Lys Tyr Trp Lys Arg Leu Ile Leu Lys
60 65 70

AAAGAATAGA GCATAAAATC CCCTAAAATC GCCA 341

(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:332: 

Met Val Ala Lys Thr Asn Thr Ile Pro Pro Ile Val Gly Val Ser Phe
1 5 10 15

Leu Phe Trp Trp Leu Gly Thr Lys Leu Glu Met Gly Trp Leu Ala Phe
20 25 30

Leu Ala Leu Ala His Arg Met Asn Leu Gly Ile Lys Lys Ser Val Arg
35 40 45

Lys Lys Ala Met Lys Asn Pro Asn Pro Ala Leu Lys Val Lys Tyr Trp
50 55 60

Lys Arg Leu Ile Leu Lys
65 70

(2) INFORMATION FOR SEQ ID NO: 333:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 58... 2430 
(D) OTHER INFORMATION:

Met Arg Phe Lys Gly Val Val Ala Phe Ile Ser Leu Ala Val Ala Leu
1 5 10 15

Gly Val Leu Ala Tyr Leu Phe Leu Ser Val Lys Lys Glu Met Pro Ala
20 25 30

Thr Ser His Ala Ile Ser Gln Thr His Ala Ile Ser Gln Thr Asn Glu
35 40 45

Gly Leu Ser Gln Thr Asp Ala Lys Ser His Asp Ile Asp Leu Glu Glu
50 55 60

Asn Ser Pro Thr Glu Thr Ser His Asn Glu Lys Ala Ser His Asn Glu
65 70 75 80

Glu Asp His Asn Asn Ala Leu Ser Gln Asn Leu Asp Ala Gln Glu Ser
85 90 95

Ile Asn Tyr Pro Val Val Glu His Tyr Ser Glu Ile Pro Phe Glu Glu
100 105 110

Lys Lys Arg Glu Tyr Ser Lys Leu Ile Ile Lys Asp Leu Lys Asp Tyr
115 120 125

Gln Trp Trp Cys Leu Lys Glu Ile Leu Lys Lys Glu Gln Ile Asp Tyr
130 135 140

Ala Tyr Asp Asn Thr Lys Asn Gln Pro Asn Leu Ile Ile Tyr Leu Asp
145 150 155 160

Glu Asn Lys Lys Glu Arg Leu Leu Ala Asp Leu Asp Tyr Tyr Lys Ile
165 170 175

Arg Tyr His Ala Val Phe
180 

(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 86... 295 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

ACCCACAAAA CTAAAACCCA CTAACACAAT TAACCCTAAC AACACATAAA GATTGCCCAA 60 
AGACGCGCAC AACACGCTCG CAACA ATG GTT GCA AAA ACA AAC ACA ATC CCC 112
Met Val Ala Lys Thr Asn Thr Ile Pro
1 5

CCC ATC GTA GGG GTA TCT TTT TTA TTC TGG TGG CTT GGC ACG AAG CTA 160
Pro Ile Val Gly Val Ser Phe Leu Phe Trp Trp Leu Gly Thr Lys Leu
10 15 20 25

GAA ATG GGC TGG TTA GCC TTT TTA GCC TTG GCC CAT AGA ATG AAT TTA 208

GCT ACT TCT CAT GCG ATC TCT CAA ACA CAT GCG ATC TCT CAA ACC AAT 206
Ala Thr Ser His Ala Ile Ser Gln Thr His Ala Ile Ser Gln Thr Asn
35 40 45 

GAA GGC CTC TCT CAA ACA GAT GCA AAA AGC CAT GAC ATC GAT CTA GAA 254 
Glu Gly Leu Ser Gln Thr Asp Ala Lys Ser His Asp Ile Asp Leu Glu
50 55 60

GAA AAT AGC CCC ACT GAA ACC TCT CAT AAT GAA AAA GCC TCC CAT AAC 302
Glu Asn Ser Pro Thr Glu Thr Ser His Asn Glu Lys Ala Ser His Asn
65 70 75

GAA GAA GAT CAC AAT AAC GCC CTT TCT CAA AAT CTT GAT GCG CAA GAA 350
Glu Glu Asp His Asn Asn Ala Leu Ser Gln Asn Leu Asp Ala Gln Glu
80 85 90 95

TCT ATC AAT TAC CCC GTT GTG GAA CAT TAT TCT GAA ATC CCT TTT GAA 398
Ser Ile Asn Tyr Pro Val Val Glu His Tyr Ser Glu Ile Pro Phe Glu
100 105 110

GAA AAA AAA AGG GAA TAT TCA AAG CTT ATC ATT AAG GAT TTA AAG GAC 446
Glu Lys Lys Arg Glu Tyr Ser Lys Leu Ile Ile Lys Asp Leu Lys Asp
115 120 125

TAT CAA TGG TGG TGC TTA AAA GAA ATC CTC AAA AAA GAA CAG ATT GAT 494
Tyr Gln Trp Trp Cys Leu Lys Glu Ile Leu Lys Lys Glu Gln Ile Asp
130 135 140

TAC GCT TAC GAT AAC ACC AAA AAC CAA CCT AAC CTC ATC ATC TAT TTA 542
Tyr Ala Tyr Asp Asn Thr Lys Asn Gln Pro Asn Leu Ile Ile Tyr Leu
145 150 155

GAT GAA AAT AAA AAA GAA CGC TTG CTG GCT GAT TTA GAC TAT TAT AAA 590
Asp Glu Asn Lys Lys Glu Arg Leu Leu Ala Asp Leu Asp Tyr Tyr Lys
160 165 170 175

ATA CGC TAT CAT GCT GTT TTT TAAATTCAAA GGATAAAAAT GTATCAAGTA GCCA 645
Ile Arg Tyr His Ala Val Phe
180

TTTGCGACCC CATCCATGCT AAAGGC 671

(2) INFORMATION FOR SEQ ID NO: 330:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:

Pro Leu Met Trp Ser Tyr Gln
50 55

TACAACTAAA ACG 269

(2) INFORMATION FOR SEQ ID NO: 328:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328:

Met Leu Ser Lys Leu Pro Phe Thr Gly Val Leu Ala Leu Val Leu Lys
1 5 10 15

Ala Val His Val Ser Leu Ala Glu Asp Lys Ser Lys Phe Thr Ala Cys
20 25 30

Lys Asn Pro Ala Ser Lys Thr Asp Thr Lys Thr Ile Phe Phe Ile His
35 40 45

Tyr Pro Leu Met Trp Ser Tyr Gln
50 55

(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 671 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 66... 611 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

AATGACTTTA GGGGATATTC TTAAAGAAAA ACTCTAAAGA GTGATTTTAA AAGCATGAGA 60
ATGGC ATG AGA TTT AAG GGT GTT GTT GCT TTT ATT TCC CTA GCT GTC GCT 110
Met Arg Phe Lys Gly Val Val Ala Phe Ile Ser Leu Ala Val Ala
1 5 10 15

CTT GGC GTT TTA GCC TAT TTG TTT TTA AGC GTT AAA AAA GAA ATG CCC 158
Leu Gly Val Leu Ala Tyr Leu Phe Leu Ser Val Lys Lys Glu Met Pro
20 25 30

1 5 10 15

Ala Phe Gly Leu Asp Tyr Gly Ile Asp Lys Thr Leu Glu Leu Lys Lys
20 25 30

Asp Glu Val Phe Lys Ala Ile Ile Lys Asp Thr Ser Asn Glu Gln Thr
35 40 45

Lys Glu Ile Thr Leu Tyr Trp Thr Leu Tyr Ala Asn Lys Gly Leu Val
50 55 60

Ile Asn Met Arg Phe Asn His Phe Pro Tyr Gln Phe Ile Leu Tyr Thr
65 70 75 80

Asp His Ala Arg Asn Thr Tyr Asn Leu Lys Val Phe Glu Glu Lys Phe
85 90 95

Ser Ser Asn Ser Thr Leu Ser Leu Val Phe Lys Asp Phe Lys Glu Asp
100 105 110

Lys Ala Ala Leu Arg Leu Leu Ala Leu Met Pro Leu Val Phe Ser Pro
115 120 125

Lys Glu Pro
130

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 269 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 55...222
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

GCTCCCTCTA AAAGGGTTTT TAAACTATCT TGAGATTTAC CCAATTTATA GGTG ATG 57 

Met 
1 

CTT TCA AAA CTC CCA TTT ACT GGT GTT TTA GCC TTA GTT TTA AAG GCT 105 
Leu Ser Lys Leu Pro Phe Thr Gly Val Leu Ala Leu Val Leu Lys Ala 
5 10 15 

GTC CAT GTT AGC TTA GCC GAA GAT AAA TCC AAA TTC ACC GCT TGC AAA 153 
Val His Val Ser Leu Ala Glu Asp Lys Ser Lys Phe Thr Ala Cys Lys 
20 25 30 

AAC CCT GCT AGT AAA ACC GAT ACC AAA ACC ATT TTT TTC ATT CAT TAT 201
Asn Pro Ala Ser Lys Thr Asp Thr Lys Thr Ile Phe Phe Ile His Tyr
35 40 45

CCT TTA ATG TGG TCT TAT CAA TAACGCTTAT TATTTTAGTG TAAATAAGCA CGCT 256

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:

GTTGCAACAA AAAATAGAGA GCAGGAAAAC AGACATTGTG ATCCAATCCA TGGCGAATAT 60 
TCTCAGCGGG A ATG AAT GAG CTT ATC CGC TAT GGC TTG ATA TTT CTC TTT 110
Met Asn Glu Leu Ile Arg Tyr Gly Leu Ile Phe Leu Phe
1 5 10

TTT TTA AAG GCG TTT GGG CTT GAT TAT GGG ATA GAT AAA ACG CTA GAA 158
Phe Leu Lys Ala Phe Gly Leu Asp Tyr Gly Ile Asp Lys Thr Leu Glu
15 20 25

TTA AAA AAA GAT GAA GTG TTT AAA GCG ATC ATC AAA GAC ACT TCA AAT 206
Leu Lys Lys Asp Glu Val Phe Lys Ala Ile Ile Lys Asp Thr Ser Asn
30 35 40 45

GAA CAA ACC AAA GAA ATC ACG CTC TAT TGG ACG CTA TAT GCA AAT AAA 254
Glu Gln Thr Lys Glu Ile Thr Leu Tyr Trp Thr Leu Tyr Ala Asn Lys
50 55 60

GGT TTA GTC ATC AAC ATG CGT TTT AAC CAT TTC CCT TAC CAG TTT ATT 302
Gly Leu Val Ile Asn Met Arg Phe Asn His Phe Pro Tyr Gln Phe Ile
65 70 75

TTA TAC ACC GAT CAT GCG AGA AAC ACC TAT AAT CTC AAA GTT TTT GAA 350
Leu Tyr Thr Asp His Ala Arg Asn Thr Tyr Asn Leu Lys Val Phe Glu
80 85 90

GAA AAA TTT TCT TCT AAC AGC ACT CTG TCG CTT GTG TTT AAA GAT TTT 398
Glu Lys Phe Ser Ser Asn Ser Thr Leu Ser Leu Val Phe Lys Asp Phe
95 100 105

AAA GAA GAT AAA GCC GCT TTA AGG CTT TTA GCC CTT ATG CCC CTT GTT 446
Lys Glu Asp Lys Ala Ala Leu Arg Leu Leu Ala Leu Met Pro Leu Val
110 115 120 125

TTT TCT CCT AAA GAG CCT TAAGGAATTT GCATGCAAGA AAAACAACTT AAAACCAT 502
Phe Ser Pro Lys Glu Pro
130

TCAAAATAAG ATCGCTTCC 521
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

Met Asn Glu Leu Ile Arg Tyr Gly Leu Ile Phe Leu Phe Phe Leu Lys

95 100 105

ATC TCT TTA GAG CAT AAA GAA GAG CGT AAG GAT TCT CCT AAG CCT TAATC 450
Ile Ser Leu Glu His Lys Glu Glu Arg Lys Asp Ser Pro Lys Pro
110 115 120

AAAGCGCTTG ATTTATGCTA AAATGGAGCG TTGCATTTTT GTTTTG 496

(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:324: 



Met Arg Leu Phe Ile Ala Leu Val Leu Phe Trp Trp Trp Leu Ser Leu
1 5 10 15

Asn Ala Lys Glu Ala Asp Phe Ile Ser Asp Leu Glu Tyr Gly Met Ala
20 25 30

Leu Tyr Lys Asn Pro Arg Gly Val Ala Cys Ala Lys Cys His Gly Ile
35 40 45

Lys Gly Glu Gln Gln Glu Ile Thr Phe Tyr Tyr Glu Lys Gly Glu Lys
50 55 60

Lys Ile Leu Tyr Ala Pro Lys Ile Asn His Leu Asp Phe Lys Thr Phe
65 70 75 80

Lys Asp Ala Leu Ser Leu Gly Lys Gly Met Met Pro Lys Tyr Asn Leu
85 90 95

Asn Leu Glu Glu Ile Gln Ala Ile Tyr Leu Tyr Ile Ile Ser Leu Glu
100 105 110

His Lys Glu Glu Arg Lys Asp Ser Pro Lys Pro
115 120



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 72... 464 
(D) OTHER INFORMATION: 

660 665 670

Tyr Lys Ala Arg Asn Met Arg Leu Glu Asp Glu Glu Val Arg Ala Arg
675 680 685

Gly Glu His Leu Tyr Ser Phe Asn Val His Glu Leu Leu Asp Gln His
690 695 700

Asn Ala Asn Leu Lys Gly Glu His His Glu
705 710



(2) INFORMATION FOR SEQ ID NO: 323: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 77... 445 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:323: 

ATTTAGTTCA AGAGCTTTTA GAAGAATTTT TGCAAAGCGG GGCTAAAGAG ATTTTAGAAA 60
AGGCGCAGTT GTTTTA ATG CGT TTG TTT ATC GCG CTA GTT TTG TTT TGG TGG 112
Met Arg Leu Phe Ile Ala Leu Val Leu Phe Trp Trp
1 5 10

TGG TTA AGC TTG AAC GCT AAA GAA GCG GAT TTT ATC TCT GAT TTA GAA 160
Trp Leu Ser Leu Asn Ala Lys Glu Ala Asp Phe Ile Ser Asp Leu Glu
15 20 25

TAC GGG ATG GCT CTT TAT AAA AAC CCT AGG GGT GTT GCG TGC GCG AAA 208
Tyr Gly Met Ala Leu Tyr Lys Asn Pro Arg Gly Val Ala Cys Ala Lys
30 35 40

TGC CAT GGC ATT AAA GGC GAA CAA CAA GAA ATC ACC TTT TAT TAT GAA 256
Cys His Gly Ile Lys Gly Glu Gln Gln Glu Ile Thr Phe Tyr Tyr Glu
45 50 55 60

AAA GGC GAG AAA AAA ATC CTC TAC GCC CCT AAA ATC AAC CAT TTG GAT 304
Lys Gly Glu Lys Lys Ile Leu Tyr Ala Pro Lys Ile Asn His Leu Asp
65 70 75

TTT AAA ACC TTT AAA GAC GCC TTG AGT TTA GGC AAA GGC ATG ATG CCT 352
Phe Lys Thr Phe Lys Asp Ala Leu Ser Leu Gly Lys Gly Met Met Pro
80 85 90

AAA TAC AAT CTC AAT TTA GAA GAA ATC CAA GCG ATT TAT CTT TAT ATC 400
Lys Tyr Asn Leu Asn Leu Glu Glu Ile Gln Ala Ile Tyr Leu Tyr Ile

His Thr Thr Asn Ala Val Ile Cys Asp Gly Ala Gly Ala Ala Ser Ala
225 230 235 240

Leu Glu Thr Gly Val Ala Lys Leu Gly Asn Met Glu Ala Val Gln Phe
245 250 255

His Pro Thr Ala Leu Val Pro Ser Gly Ile Leu Met Thr Glu Gly Cys
260 265 270

Arg Gly Asp Gly Gly Val Leu Arg Asp Lys Phe Gly Arg Arg Phe Met
275 280 285

Pro Ala Tyr Glu Pro Glu Lys Lys Glu Leu Ala Ser Arg Asp Val Val
290 295 300

Ser Arg Arg Ile Leu Glu His Ile Gln Lys Gly Tyr Gly Ala Lys Ser
305 310 315 320

Pro Tyr Gly Asp His Val Trp Leu Asp Ile Ala Ile Leu Gly Arg Asn
325 330 335

His Val Glu Lys Asn Leu Arg Asp Val Arg Asp Ile Ala Met Thr Phe
340 345 350

Ala Gly Ile Asp Pro Ala Asp Ser Lys Glu Gln Thr Lys Asp Asn Met
355 360 365

Gln Gly Val Pro Ala Asn Glu Pro Glu Tyr Gly Gln Ala Met Ala Lys
370 375 380

Gln Lys Gly Trp Ile Pro Ile Lys Pro Met Gln His Tyr Ser Met Gly
385 390 395 400

Gly Val Arg Thr Asn Pro Lys Gly Glu Thr His Leu Lys Gly Leu Phe
405 410 415

Cys Ala Gly Glu Ala Ala Cys Trp Asp Leu His Gly Phe Asn Arg Leu
420 425 430

Gly Gly Asn Ser Val Ser Glu Ala Val Val Ala Gly Met Ile Ile Gly
435 440 445

Asp Tyr Phe Ala Ser His Cys Leu Glu Ala Gln Ile Glu Ile Asn Thr
450 455 460

Gln Lys Val Glu Ala Phe Ile Lys Glu Ser Gln Asp Tyr Met His Phe
465 470 475 480

Leu Leu His Asn Glu Gly Lys Glu Asp Val Tyr Glu Ile Arg Glu Arg
485 490 495

Met Lys Glu Val Met Asp Glu Lys Val Gly Val Phe Arg Glu Gly Lys
500 505 510

Arg Leu Glu Glu Ala Leu Lys Glu Leu Gln Glu Leu Tyr Ala Arg Ser
515 520 525

Lys Asn Ile Cys Val Lys Asn Lys Val Leu His Asn Asn Pro Glu Leu
530 535 540

Glu Asp Ala Tyr Arg Thr Lys Lys Met Leu Lys Leu Ala Leu Cys Ile
545 550 555 560

Thr Gln Gly Ala Leu Leu Arg Thr Glu Ser Arg Gly Ala His Thr Arg
565 570 575

Ile Asp Tyr Pro Lys Arg Asp Asp Glu Lys Trp Leu Asn Arg Thr Leu
580 585 590

Ala Ser Trp Pro Ser Ala Glu Gln Asp Met Pro Thr Ile Glu Tyr Glu
595 600 605

Glu Leu Asp Val Met Lys Met Glu Ile Ser Pro Asp Phe Arg Gly Tyr
610 615 620

Gly Lys Lys Gly Asn Phe Ile Pro His Pro Lys Lys Glu Glu Arg Asp
625 630 635 640

Ala Glu Ile Leu Lys Thr Ile Leu Glu Leu Glu Lys Leu Gly Lys Asp
645 650 655

Arg Ile Glu Val Gln His Ala Leu Met Pro Phe Glu Leu Gln Glu Lys

GCT AGG AAT ATG CGT TTA GAA GAT GAG GAA GTC AGG GCT AGG GGG GAA 2121
Ala Arg Asn Met Arg Leu Glu Asp Glu Glu Val Arg Ala Arg Gly Glu
675 680 685 690

CAT TTG TAT TCT TTC AAT GTC CAT GAG TTA TTG GAC CAA CAC AAC GCT 2169
His Leu Tyr Ser Phe Asn Val His Glu Leu Leu Asp Gln His Asn Ala
695 700 705 

AAC CTA AAA GGA GAA CAC CAT GAG TGATAATGAA CGAACGATTG TAGTTAGAGT 2223 
Asn Leu Lys Gly Glu His His Glu 
710 

GCTAAAATTT GACCCTCA 2241 
(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 

Met Lys Ile Thr Tyr Cys Asp Ala Leu Ile Ile Gly Gly Gly Leu Ala
1 5 10 15

Gly Leu Arg Ala Ser Ile Ala Cys Lys Gln Lys Gly Leu Asn Thr Ile
20 25 30

Val Leu Ser Leu Val Pro Val Arg Arg Ser His Ser Ala Ala Ala Gln
35 40 45

Gly Gly Met Gln Ala Ser Leu Ala Asn Ala Lys Lys Ser Glu Gly Asp
50 55 60

Asn Glu Asp Leu His Phe Leu Asp Thr Val Lys Gly Ser Asp Trp Gly
65 70 75 80

Cys Asp Gln Gln Val Ala Arg Met Phe Val Thr Thr Ala Pro Lys Ala
85 90 95

Ile Arg Glu Leu Ala Ser Trp Gly Val Pro Trp Thr Arg Ile Lys Lys
100 105 110

Gly Asp Arg Pro Ala Val Val Asn Gly Glu His Val Thr Ile Thr Glu
115 120 125

Arg Asp Asp Arg His Gly Tyr Ile Leu Ser Arg Asp Phe Gly Gly Thr
130 135 140

Lys Lys Trp Arg Thr Cys Phe Thr Ala Asp Ala Thr Gly His Thr Met
145 150 155 160

Leu Tyr Ala Val Ala Asn Glu Ala Leu His His Lys Val Asp Ile Gln
165 170 175

Asp Arg Lys Asp Met Leu Ala Phe Ile His His Asp Asn Lys Cys Tyr
180 185 190

Gly Ala Val Val Arg Asp Leu Ile Thr Gly Glu Ile Ser Ala Tyr Val
195 200 205

Ser Lys Gly Thr Leu Leu Ala Thr Gly Gly Tyr Gly Arg Val Tyr Lys
210 215 220






WO 98/43478

PCT/US98/06371



TTT GCC TCG CAT TGT TTA GAA GCG CAA ATT GAA ATC AAC ACG CAA AAA 1449
Phe Ala Ser His Cys Leu Glu Ala Gln Ile Glu Ile Asn Thr Gln Lys
455 460 465

GTT GAA GCT TTC ATT AAA GAA AGC CAA GAC TAT ATG CAT TTT TTA TTG 1497
Val Glu Ala Phe Ile Lys Glu Ser Gln Asp Tyr Met His Phe Leu Leu
470 475 480

CAT AAT GAA GGC AAA GAA GAT GTG TAT GAA ATT AGA GAG CGC ATG AAA 1545
His Asn Glu Gly Lys Glu Asp Val Tyr Glu Ile Arg Glu Arg Met Lys
485 490 495

GAA GTC ATG GAT GAA AAA GTG GGC GTT TTT AGA GAA GGC AAA AGG CTA 1593
Glu Val Met Asp Glu Lys Val Gly Val Phe Arg Glu Gly Lys Arg Leu
500 505 510

GAA GAA GCC CTT AAA GAA TTG CAA GAG CTT TAT GCA CGC TCC AAA AAC 1641
Glu Glu Ala Leu Lys Glu Leu Gln Glu Leu Tyr Ala Arg Ser Lys Asn
515 520 525 530

ATT TGC GTG AAA AAC AAG GTT TTA CAC AAT AAC CCT GAA TTA GAA GAC 1689
Ile Cys Val Lys Asn Lys Val Leu His Asn Asn Pro Glu Leu Glu Asp
535 540 545

GCT TAC CGC ACC AAA AAA ATG CTC AAA CTC GCG CTT TGT ATC ACT CAA 1737
Ala Tyr Arg Thr Lys Lys Met Leu Lys Leu Ala Leu Cys Ile Thr Gln
550 555 560

GGA GCG TTA CTG CGC ACT GAA AGC AGA GGG GCT CAC ACA AGG ATT GAC 1785
Gly Ala Leu Leu Arg Thr Glu Ser Arg Gly Ala His Thr Arg Ile Asp
565 570 575

TAC CCT AAA AGA GAC GAT GAA AAA TGG CTT AAT CGG ACT CTA GCG AGC 1833
Tyr Pro Lys Arg Asp Asp Glu Lys Trp Leu Asn Arg Thr Leu Ala Ser
580 585 590

TGG CCT AGC GCT GAG CAA GAC ATG CCC ACG ATT GAA TAC GAA GAA TTA 1881
Trp Pro Ser Ala Glu Gln Asp Met Pro Thr Ile Glu Tyr Glu Glu Leu
595 600 605 610

GAT GTG ATG AAA ATG GAA ATC AGC CCT GAT TTT AGG GGC TAT GGC AAA 1929
Asp Val Met Lys Met Glu Ile Ser Pro Asp Phe Arg Gly Tyr Gly Lys
615 620 625

AAG GGT AAT TTC ATC CCC CAC CCC AAA AAA GAA GAG CGC GAC GCT GAG 1977
Lys Gly Asn Phe Ile Pro His Pro Lys Lys Glu Glu Arg Asp Ala Glu
630 635 640

ATT TTG AAA ACG ATT TTA GAA CTA GAA AAG CTT GGA AAA GAC AGA ATA 2025
Ile Leu Lys Thr Ile Leu Glu Leu Glu Lys Leu Gly Lys Asp Arg Ile
645 650 655

GAA GTC CAA CAT GCG CTC ATG CCT TTT GAA TTG CAA GAA AAA TAC AAG 2073
Glu Val Gln His Ala Leu Met Pro Phe Glu Leu Gln Glu Lys Tyr Lys
660 665 670

Thr Asn Ala Val Ile Cys Asp Gly Ala Gly Ala Ala Ser Ala Leu Glu
230 235 240

ACC GGC GTG GCT AAA TTG GGC AAC ATG GAA GCG GTG CAA TTC CAC CCT 825
Thr Gly Val Ala Lys Leu Gly Asn Met Glu Ala Val Gln Phe His Pro
245 250 255

ACC GCT TTA GTG CCA AGC GGG ATT TTA ATG ACC GAA GGT TGC AGG GGC 873
Thr Ala Leu Val Pro Ser Gly Ile Leu Met Thr Glu Gly Cys Arg Gly
260 265 270

GAT GGC GGT GTT TTA AGA GAC AAG TTT GGC AGA CGC TTC ATG CCC GCT 921
Asp Gly Gly Val Leu Arg Asp Lys Phe Gly Arg Arg Phe Met Pro Ala
275 280 285 290

TAT GAG CCG GAG AAA AAA GAG CTT GCA AGC AGA GAT GTG GTC TCA AGG 969
Tyr Glu Pro Glu Lys Lys Glu Leu Ala Ser Arg Asp Val Val Ser Arg
295 300 305

CGG ATT TTA GAG CAT ATC CAA AAA GGC TAT GGA GCC AAA TCG CCT TAT 1017
Arg Ile Leu Glu His Ile Gln Lys Gly Tyr Gly Ala Lys Ser Pro Tyr
310 315 320

GGG GAT CAT GTG TGG CTG GAT ATT GCT ATT TTA GGG CGT AAC CAT GTG 1065
Gly Asp His Val Trp Leu Asp Ile Ala Ile Leu Gly Arg Asn His Val
325 330 335

GAA AAA AAC TTA AGG GAT GTG CGC GAT ATA GCC ATG ACT TTT GCG GGC 1113
Glu Lys Asn Leu Arg Asp Val Arg Asp Ile Ala Met Thr Phe Ala Gly
340 345 350

ATT GAT CCG GCT GAT AGC AAG GAA CAA ACC AAA GAC AAC ATG CAA GGA 1161
Ile Asp Pro Ala Asp Ser Lys Glu Gln Thr Lys Asp Asn Met Gln Gly
355 360 365 370

GTG CCC GCA AAT GAG CCT GAA TAC GGG CAA GCG ATG GCC AAG CAA AAA 1209
Val Pro Ala Asn Glu Pro Glu Tyr Gly Gln Ala Met Ala Lys Gln Lys
375 380 385

GGC TGG ATC CCC ATA AAA CCC ATG CAA CAC TAT TCT ATG GGT GGG GTT 1257
Gly Trp Ile Pro Ile Lys Pro Met Gln His Tyr Ser Met Gly Gly Val
390 395 400

AGG ACA AAC CCT AAA GGC GAA ACC CAT TTA AAA GGC TTG TTT TGC GCG 1305
Arg Thr Asn Pro Lys Gly Glu Thr His Leu Lys Gly Leu Phe Cys Ala
405 410 415

GGT GAA GCG GCA TGC TGG GAT TTG CAT GGG TTT AAC CGC TTG GGG GGT 1353
Gly Glu Ala Ala Cys Trp Asp Leu His Gly Phe Asn Arg Leu Gly Gly
420 425 430

AAT TCT GTG AGT GAA GCG GTG GTC GCT GGC ATG ATC ATT GGG GAT TAT 1401
Asn Ser Val Ser Glu Ala Val Val Ala Gly Met Ile Ile Gly Asp Tyr
435 440 445 450

5 10 15

AGG GCT AGT ATC GCA TGC AAA CAA AAG GGT TTA AAC ACC ATC GTT TTA 153
Arg Ala Ser Ile Ala Cys Lys Gln Lys Gly Leu Asn Thr Ile Val Leu
20 25 30

AGC CTA GTG CCT GTC AGG CGT TCG CAC TCT GCA GCC GCT CAA GGG GGC 201
Ser Leu Val Pro Val Arg Arg Ser His Ser Ala Ala Ala Gln Gly Gly
35 40 45 50

ATG CAA GCG AGC CTT GCG AAC GCT AAA AAA AGC GAG GGC GAT AAT GAA 249
Met Gln Ala Ser Leu Ala Asn Ala Lys Lys Ser Glu Gly Asp Asn Glu
55 60 65

GAT TTA CAC TTT TTA GAC ACG GTT AAG GGG AGC GAT TGG GGG TGC GAT 297
Asp Leu His Phe Leu Asp Thr Val Lys Gly Ser Asp Trp Gly Cys Asp
70 75 80

CAG CAA GTG GCT AGG ATG TTT GTA ACC ACT GCT CCT AAA GCC ATT AGG 345
Gln Gln Val Ala Arg Met Phe Val Thr Thr Ala Pro Lys Ala Ile Arg
85 90 95

GAA TTG GCC AGT TGG GGG GTG CCT TGG ACT AGG ATT AAA AAG GGC GAT 393
Glu Leu Ala Ser Trp Gly Val Pro Trp Thr Arg Ile Lys Lys Gly Asp
100 105 110

AGG CCT GCG GTC GTC AAT GGT GAG CAT GTA ACT ATC ACT GAA AGA GAC 441
Arg Pro Ala Val Val Asn Gly Glu His Val Thr Ile Thr Glu Arg Asp
115 120 125 130

GAC AGG CAT GGT TAT ATC TTA AGC CGT GAT TTT GGC GGC ACT AAA AAA 489
Asp Arg His Gly Tyr Ile Leu Ser Arg Asp Phe Gly Gly Thr Lys Lys
135 140 145

TGG CGC ACA TGC TTT ACG GCT GAT GCC ACA GGG CAT ACC ATG CTT TAT 537
Trp Arg Thr Cys Phe Thr Ala Asp Ala Thr Gly His Thr Met Leu Tyr
150 155 160

GCG GTC GCT AAT GAA GCC TTA CAC CAC AAA GTG GAT ATT CAA GAC AGA 585
Ala Val Ala Asn Glu Ala Leu His His Lys Val Asp Ile Gln Asp Arg
165 170 175

AAG GAC ATG CTC GCT TTC ATT CAT CAT GAT AAT AAA TGC TAT GGG GCG 633
Lys Asp Met Leu Ala Phe Ile His His Asp Asn Lys Cys Tyr Gly Ala
180 185 190

GTG GTA AGG GAT TTG ATC ACA GGC GAA ATT TCA GCG TAT GTT TCT AAA 681
Val Val Arg Asp Leu Ile Thr Gly Glu Ile Ser Ala Tyr Val Ser Lys
195 200 205 210

GGC ACG CTT TTA GCT ACC GGA GGT TAT GGG CGC GTG TAT AAA CAC ACC 729
Gly Thr Leu Leu Ala Thr Gly Gly Tyr Gly Arg Val Tyr Lys His Thr
215 220 225

ACT AAC GCT GTG ATT TGC GAT GGA GCC GGG GCT GCA AGC GCC TTA GAA 777

Gly Ala Ala Lys Asp Phe Ala Ser Asp Lys Ser Pro Lys His Phe Gln
165 170 175

Ile Val Arg Glu Lys Asp Pro Arg Gln Ala Phe Ser Leu Tyr Glu His
180 185 190

Tyr Leu Asn Glu Ser Asp Met Gln Ile Ile Tyr Glu Ser Asn Tyr Asn
195 200 205

Ala Gly Phe Leu Val Val Asn Leu Lys Leu Trp Arg Ala Asp His Leu
210 215 220

Glu Glu Arg Leu Leu Asn Leu Thr His Gln Lys Gly Gln Cys Val Phe
225 230 235 240

Tyr Pro Glu Gln Asp Leu Leu Thr Leu Ala Cys Tyr Gln Lys Val Leu
245 250 255

Ile Leu Pro Tyr Ile Tyr Asn Thr His Pro Phe Met Ala Asn Gln Lys
260 265 270

Arg Phe Ile Pro Asp Lys Lys Glu Ile Val Met Leu His Phe Tyr Phe
275 280 285

Val Gly Lys Pro Trp Val Leu Pro Thr Phe Ser Tyr Ser Lys Glu Trp
290 295 300

His Glu Thr Leu Leu Lys Thr Pro Phe Tyr Ala Glu Tyr Ser Val Lys
305 310 315 320

Phe Leu Lys Gln Met Thr Glu Cys Leu Ser Leu Lys Asp Lys Gln Lys
325 330 335

Thr Phe Glu Phe Leu Ala Pro Leu Leu Asn Lys Lys Thr Leu Leu Glu
340 345 350

Tyr Val Phe Phe Arg Leu Asn Arg Ile Phe Lys Arg Leu Lys Glu Lys
355 360 365

Phe Phe Asn Ser
370



(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2241 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52... 2193 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

CAAGAAGCCA TAGAAGCTGA TGGGAAATTC CACAAAGAAT AAGGGTAGAA A ATG AAA 57
Met Lys
1

ATA ACA TAT TGT GAT GCG CTA ATT ATT GGA GGC GGA CTA GCT GGG TTA 105
Ile Thr Tyr Cys Asp Ala Leu Ile Ile Gly Gly Gly Leu Ala Gly Leu

TTT TCA TAT TCT AAA GAA TGG CAT GAG ACT CTT TTA AAA ACC CCT TTT 1024
Phe Ser Tyr Ser Lys Glu Trp His Glu Thr Leu Leu Lys Thr Pro Phe
300 305 310

TAT GCT GAA TAT TCC GTG AAA TTC CTT AAA CAA ATG ACA GAA TGT TTA 1072
Tyr Ala Glu Tyr Ser Val Lys Phe Leu Lys Gln Met Thr Glu Cys Leu
315 320 325

AGC CTT AAA GAC AAA CAA AAA ACC TTT GAA TTT CTT GCC CCC CTA CTC 1120
Ser Leu Lys Asp Lys Gln Lys Thr Phe Glu Phe Leu Ala Pro Leu Leu
330 335 340 345

AAT AAA AAA ACC CTT TTA GAA TAC GTC TTT TTT AGA TTG AAT AGG ATT 1168
Asn Lys Lys Thr Leu Leu Glu Tyr Val Phe Phe Arg Leu Asn Arg Ile
350 355 360

TTC AAA CGC TTA AAA GAA AAA TTT TTT AAC TCT TAGCGTTCTC GTTTGGGCAA 1221
Phe Lys Arg Leu Lys Glu Lys Phe Phe Asn Ser
365 370

CACGCTATAG GCGAATTTGA CATAAATCGC 1251

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320:

Met Ser Ile Ile Ile Pro Ile Val Ile Ala Phe Asp Asn His Tyr Ala
1 5 10 15

Met Pro Ala Gly Val Ser Leu Tyr Ser Met Leu Ala Cys Ala Lys Thr
20 25 30

Glu His Pro Gln Ser Gln Asn Asp Ser Glu Lys Leu Phe Tyr Lys Ile
35 40 45

His Cys Leu Val Asp Asn Leu Ser Leu Glu Asn Gln Ser Lys Leu Lys
50 55 60

Glu Thr Leu Ala Pro Phe Ser Ala Phe Ser Ser Leu Glu Phe Leu Asp
65 70 75 80

Ile Ser Thr Pro Asn Leu His Ala Thr Pro Ile Glu Pro Ser Ala Ile
85 90 95

Asp Lys Ile Asn Glu Ala Phe Leu Gln Leu Asn Ile Tyr Ala Lys Thr
100 105 110

Arg Phe Ser Lys Met Val Met Cys Arg Leu Phe Leu Ala Ser Leu Phe
115 120 125

Pro Gln Tyr Asp Lys Ile Ile Met Phe Asp Ala Asp Thr Leu Phe Leu
130 135 140

Asn Asp Val Ser Glu Ser Phe Phe Ile Pro Leu Asp Gly Tyr Tyr Phe
145 150 155 160

TCG AGC CTA GAA TTT TTA GAC ATT TCA ACC CCC AAT CTT CAC GCC ACT 352
Ser Ser Leu Glu Phe Leu Asp Ile Ser Thr Pro Asn Leu His Ala Thr
75 80 85

CCA ATA GAA CCC TCT GCG ATT GAT AAA ATC AAT GAA GCT TTT TTG CAA 400
Pro Ile Glu Pro Ser Ala Ile Asp Lys Ile Asn Glu Ala Phe Leu Gln
90 95 100 105

CTC AAT ATT TAC GCT AAG ACT CGC TTT TCT AAA ATG GTC ATG TGC CGC 448
Leu Asn Ile Tyr Ala Lys Thr Arg Phe Ser Lys Met Val Met Cys Arg
110 115 120

TTG TTT TTG GCT TCT TTA TTC CCA CAA TAC GAC AAA ATC ATC ATG TTT 496
Leu Phe Leu Ala Ser Leu Phe Pro Gln Tyr Asp Lys Ile Ile Met Phe
125 130 135

GAT GCA GAC ACT TTG TTT TTA AAC GAT GTG AGC GAG AGC TTT TTC ATC 544
Asp Ala Asp Thr Leu Phe Leu Asn Asp Val Ser Glu Ser Phe Phe Ile
140 145 150

CCA CTA GAT GGC TAT TAT TTT GGA GCG GCT AAA GAT TTT GCT TCC GAT 592
Pro Leu Asp Gly Tyr Tyr Phe Gly Ala Ala Lys Asp Phe Ala Ser Asp
155 160 165

AAA AGC CCT AAA CAT TTT CAA ATA GTG CGA GAA AAA GAC CCT CGT CAA 640
Lys Ser Pro Lys His Phe Gln Ile Val Arg Glu Lys Asp Pro Arg Gln
170 175 180 185

GCC TTT TCC CTT TAT GAG CAT TAC CTT AAT GAA AGC GAT ATG CAA ATC 688
Ala Phe Ser Leu Tyr Glu His Tyr Leu Asn Glu Ser Asp Met Gln Ile
190 195 200

ATC TAT GAA AGC AAT TAT AAC GCC GGG TTT TTA GTC GTG AAT TTA AAG 736
Ile Tyr Glu Ser Asn Tyr Asn Ala Gly Phe Leu Val Val Asn Leu Lys
205 210 215

CTG TGG CGT GCT GAT CAT TTA GAA GAG CGC TTA CTC AAT TTA ACC CAT 784
Leu Trp Arg Ala Asp His Leu Glu Glu Arg Leu Leu Asn Leu Thr His
220 225 230

CAA AAA GGC CAG TGC GTG TTT TAC CCT GAA CAG GAC CTT TTA ACG CTC 832
Gln Lys Gly Gln Cys Val Phe Tyr Pro Glu Gln Asp Leu Leu Thr Leu
235 240 245

GCA TGC TAT CAA AAA GTT TTA ATC TTG CCT TAT ATT TAT AAC ACC CAC 880
Ala Cys Tyr Gln Lys Val Leu Ile Leu Pro Tyr Ile Tyr Asn Thr His
250 255 260 265

CCT TTC ATG GCC AAT CAA AAA CGC TTC ATC CCT GAC AAA AAA GAA ATC 928
Pro Phe Met Ala Asn Gln Lys Arg Phe Ile Pro Asp Lys Lys Glu Ile
270 275 280

GTC ATG CTG CAT TTT TAT TTT GTA GGA AAA CCT TGG GTT TTA CCT ACT 976
Val Met Leu His Phe Tyr Phe Val Gly Lys Pro Trp Val Leu Pro Thr
285 290 295

Leu Glu Leu Phe Pro Ile Asp Leu Pro Tyr Ala Ser Ala Lys Glu Ile
50 55 60

Ala Ile Ala Lys Met Gln His Leu Pro Lys Leu Val Arg Asp Ala Leu
65 70 75 80

Lys Cys Met Gly Phe Asp Arg Val Ser Gln Glu Ile Val Phe Glu Tyr
85 90 95

Glu Pro Lys Leu Leu Lys Pro Ser Arg Leu Thr Tyr Phe Phe Gly Tyr
100 105 110

Phe Gln Asp Pro Arg Tyr Phe Asp Ala Ile Ser Pro Leu Ile Lys Gln
115 120 125

Thr Phe Thr Leu Pro Pro Pro Pro Pro Lys Ile Ile Arg Ile Ile Ile
130 135 140

Lys Lys Arg Lys Asn Ile Ser Ala Ser Phe Leu
145 150 155

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 86...1201
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

GGTTATTATA GTATAATATT GTCAAAAAAT AAATTCAACT TTGGATTAAA TTGATTAAAA 60 

ACCTATTTTA GGGAAACCGC TTAAA ATG AGT ATT ATT ATT CCT ATT GTC ATC 112 

Met Ser Ile Ile Ile Pro Ile Val Ile
1 5

GCT TTT GAT AAT CAC TAT GCC ATG CCG GCT GGC GTG AGC TTG TAT TCC 160
Ala Phe Asp Asn His Tyr Ala Met Pro Ala Gly Val Ser Leu Tyr Ser
10 15 20 25

ATG CTA GCT TGC GCT AAA ACA GAA CAC CCC CAA TCA CAA AAT GAT AGT 208
Met Leu Ala Cys Ala Lys Thr Glu His Pro Gln Ser Gln Asn Asp Ser
30 35 40

GAA AAA CTT TTT TAT AAG ATC CAC TGC CTG GTG GAT AAC TTA AGC CTT 256
Glu Lys Leu Phe Tyr Lys Ile His Cys Leu Val Asp Asn Leu Ser Leu
45 50 55

GAA AAC CAG AGC AAA CTA AAA GAG ACT CTA GCC CCC TTT AGC GCT TTT 304
Glu Asn Gln Ser Lys Leu Lys Glu Thr Leu Ala Pro Phe Ser Ala Phe
60 65 70

25 30 35

ATC ACT TCT TTT GAT TGG AGC GAT AGG AAA ATG CAA TTA GAA CTT TTC 201
Ile Thr Ser Phe Asp Trp Ser Asp Arg Lys Met Gln Leu Glu Leu Phe
40 45 50

CCT ATT GAT TTG CCC TAT GCG AGC GCG AAA GAA ATC GCT ATA GCT AAA 249
Pro Ile Asp Leu Pro Tyr Ala Ser Ala Lys Glu Ile Ala Ile Ala Lys
55 60 65

ATG CAA CAC CTC CCC AAG CTA GTA AGA GAC GCG CTC AAA TGC ATG GGA 297
Met Gln His Leu Pro Lys Leu Val Arg Asp Ala Leu Lys Cys Met Gly
70 75 80

TTT GAT AGG GTG AGT CAA GAA ATC GTT TTT GAA TAC GAG CCT AAA TTG 345
Phe Asp Arg Val Ser Gln Glu Ile Val Phe Glu Tyr Glu Pro Lys Leu
85 90 95 100

CTA AAG CCA AGC CGC TTG ACT TAT TTT TTT GGC TAT TTC CAA GAT CCA 393
Leu Lys Pro Ser Arg Leu Thr Tyr Phe Phe Gly Tyr Phe Gln Asp Pro
105 110 115

CGA TAC TTT GAT GCT ATA TCC CCT TTA ATC AAG CAA ACC TTC ACT CTA 441
Arg Tyr Phe Asp Ala Ile Ser Pro Leu Ile Lys Gln Thr Phe Thr Leu
120 125 130

CCC CCC CCC CCC CCG AAA ATA ATA AGA ATA ATA ATA AAA AAG AGG AAG 489
Pro Pro Pro Pro Pro Lys Ile Ile Arg Ile Ile Ile Lys Lys Arg Lys
135 140 145

AAT ATC AGT GCA AGC TTT CTT TGATTTTAGC CGCTAAAAAC AGCGTGTTTG TGCA 544
Asn Ile Ser Ala Ser Phe Leu
150 155

TATAAGAAGA GGGGATT 561

(2) INFORMATION FOR SEQ ID NO:318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:318: 

Met Ala Phe Lys Val Val Gln Ile Cys Gly Gly Leu Gly Asn Gln Met
1 5 10 15

Phe Gln Tyr Ala Phe Ala Lys Ser Leu Gln Lys His Ser Asn Thr Pro
20 25 30

Val Leu Leu Asp Ile Thr Ser Phe Asp Trp Ser Asp Arg Lys Met Gln
35 40 45

Ile Leu Leu Asp Glu Asp Lys Lys Tyr Leu Pro Lys Glu Thr Arg Glu
195 200 205

Tyr Ile Arg Ser Ile Leu Ser Leu Ala Leu Lys Phe Asn Ser Leu Asp
210 215 220

Asn Leu Lys Asp Lys Glu Tyr Leu Leu Asn Arg Gly Ala Arg Val Ser
225 230 235 240

Leu Val Gly Val Pro Phe Lys Arg Arg Ala Ser Leu Val Gln Val Ala
245 250 255

Lys Asn Leu Asn Leu Ser Leu Glu Thr Leu Lys Ser Tyr Asn His Gln
260 265 270

Phe Arg Tyr Asn Ile Leu Pro Ser Lys Asp Pro Thr Tyr Thr Ile Tyr
275 280 285

Ile Pro Tyr Glu Lys Leu Ala Leu Phe Lys Gln Arg Gln Ile Lys Gln
290 295 300

Asn Lys Asn Ile Gln Ala Ser Ser Lys Ser Pro Phe Ile Thr His Val
305 310 315 320

Val Leu Pro Lys Glu Thr Leu Ser Ser Ile Ala Lys Arg Tyr Gln Val
325 330 335

Ser Ile Ser Asn Ile Gln Leu Ala Asn Asp Leu Lys Asp Ser Asn Ile
340 345 350

Phe Ile His Gln Arg Leu Ile Ile Pro Thr Asn Lys Lys Leu Leu Ala
355 360 365

Thr Arg Glu Phe
370

(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 46...510
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317:

AAATGAGCTA AAATGAGCGT TTCATTTGAC AAATAAAGGG ATTGA ATG GCT TTT AAG 57
Met Ala Phe Lys
1

GTG GTG CAA ATT TGC GGA GGG CTT GGG AAT CAA ATG TTT CAA TAC GCT 105
Val Val Gln Ile Cys Gly Gly Leu Gly Asn Gln Met Phe Gln Tyr Ala
5 10 15 20

TTC GCT AAA ACT TTG CAA AAA CAC TCT AAT ACG CCT GTG CTG TTA GAT 153
Phe Ala Lys Ser Leu Gln Lys His Ser Asn Thr Pro Val Leu Leu Asp

CCT AAA GAA ACC CTA TCT TCT ATC GCT AAA CGC TAT CAA GTC AGT ATT 1065
Pro Lys Glu Thr Leu Ser Ser Ile Ala Lys Arg Tyr Gln Val Ser Ile
325 330 335

TCC AAT ATC CAA TTA GCC AAT GAT CTC AAA GAT TCT AAT ATT TTT ATC 1113
Ser Asn Ile Gln Leu Ala Asn Asp Leu Lys Asp Ser Asn Ile Phe Ile
340 345 350

CAC GAG CGT TTA ATC ATC CCC ACC AAC AAA AAA TTA CTC GCT ACA AGG 1161
His Gln Arg Leu Ile Ile Pro Thr Asn Lys Lys Leu Leu Ala Thr Arg
355 360 365 370

GAA TTT TAATGGGTTT GGCGTTGGAA AAAGTTTGTT TTTTAGGCGT TATTTTTTTG AT 1219
Glu Phe

TA 1221

(2) INFORMATION FOR SEQ ID NO: 316:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:316: 

Met Ser Lys Arg Met Lys Cys Phe Ser Gln Lys Trp Leu Val Phe Phe
1 5 10 15

Val Thr Leu Leu Leu Ala Ser Leu Gly His Ala Lys Met Ala Phe Glu
20 25 30

Ser Asp Ile Asp Thr Lys Ala Leu Glu Ala Phe Gly Val Asn Ala Gly
35 40 45

Phe Leu Ser Gln Met Pro Asn Ala Leu Lys Lys Met Asn Lys Glu Glu
50 55 60

Glu Trp Lys Arg Leu Val Lys Arg Phe Asp Val Asn Tyr Gln Phe Ile
65 70 75 80

Pro Ile Ile Lys Asn Met Leu Ile Glu Ala Ser Val Pro Gln Glu Phe
85 90 95

Leu Phe Leu Ala Met Ala Glu Ser Lys Phe Ser Ser Arg Ala Tyr Ser
100 105 110

Arg Lys Lys Ala Val Gly Ile Trp Gln Phe Met Pro Ser Thr Ala Lys
115 120 125

Glu Leu Gly Leu Lys Val Asn His Tyr Ile Asp Glu Arg Arg Asp Pro
130 135 140

Ile Lys Ser Thr Gln Ala Ala Ile Thr Tyr Leu Lys Arg Leu Tyr Lys
145 150 155 160

Gln Thr Gly Glu Trp Tyr Leu Val Ala Met Ala Tyr Asn Tyr Gly Leu
165 170 175

Arg Lys Val Gln Asn Ala Ile Lys Ala Ala Gly Thr Ser Asp Ile Lys
180 185 190
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Leu Ala Met Ala Glu Ser Lys Phe Ser Ser Arg Ala Tyr Ser Arg Lys 
100 105 110 

AAA GCG GTA GGG ATT TGG CAA TTC ATG CCA AGC ACG GCT AAA GAA TTA 441 
Lys Ala Val Gly Ile Trp Gln Phe Met Pro Ser Thr Ala Lys Glu Leu 
115 120 125 130 

GGG CTT AAG GTC AAT CAT TAC ATT GAT GAA AGA AGA GAT CCC ATT AAA 489 
Gly Leu Lys Val Asn His Tyr Ile Asp Glu Arg Arg Asp Pro Ile Lys 
135 140 145 

AGC ACT CAA GCG GCG ATC ACT TAT TTG AAA CGG CTC TAC AAG CAA ACC 537 
Ser Thr Gln Ala Ala Ile Thr Tyr Leu Lys Arg Leu Tyr Lys Gln Thr 
150 155 160 

GGA GAG TGG TAT TTG GTC GCT ATG GCG TAT AAT TAC GGC TTA CGC AAG 585 
Gly Glu Trp Tyr Leu Val Ala Met Ala Tyr Asn Tyr Gly Leu Arg Lys 
165 170 175 

GTT CAA AAC GCT ATT AAA GCC GCC GGC ACT TCG GAC ATT AAA ATT TTG 633 
Val Gln Asn Ala Ile Lys Ala Ala Gly Thr Ser Asp Ile Lys Ile Leu 
180 185 190 

TTG GAT GAA GAT AAG AAA TAC CTC CCT AAA GAA ACA CGA GAG TAT ATC 681 
Leu Asp Glu Asp Lys Lys Tyr Leu Pro Lys Glu Thr Arg Glu Tyr Ile 
195 200 205 210 

CGC TCC ATT CTA AGC CTA GCG TTA AAA TTC AAC AGC CTA GAC AAC CTC 729 
Arg Ser Ile Leu Ser Leu Ala Leu Lys Phe Asn Ser Leu Asp Asn Leu 
215 220 225 

AAA GAT AAA GAA TAT CTG CTC AAT CGT GGG GCG AGG GTG AGT TTA GTG 777 
Lys Asp Lys Glu Tyr Leu Leu Asn Arg Gly Ala Arg Val Ser Leu Val 
230 235 240 

GGC GTC CCG TTT AAA AGG CGT GCT TCT TTA GTC CAA GTA GCC AAA AAT 825 
Gly Val Pro Phe Lys Arg Arg Ala Ser Leu Val Gln Val Ala Lys Asn 
245 250 255 

TTG AAT TTG AGT TTG GAA ACC TTA AAA TCC TAC AAC CAC CAA TTC CGT 873 
Leu Asn Leu Ser Leu Glu Thr Leu Lys Ser Tyr Asn His Gln Phe Arg 
260 265 270 

TAT AAC ATT CTG CCT TCT AAA GAC CCC ACT TAT ACC ATT TAT ATC CCT 921 
Tyr Asn Ile Leu Pro Ser Lys Asp Pro Thr Tyr Thr Ile Tyr Ile Pro 
275 280 285 290 

TAT GAA AAA CTC GCT CTT TTC AAA CAA CGC CAG ATC AAA CAA AAT AAA 969 
Tyr Glu Lys Leu Ala Leu Phe Lys Gln Arg Gln Ile Lys Gln Asn Lys 
295 300 305 

AAC ATT CAA GCC AGT TCA AAA AGC CCT TTT ATC ACC CAT GTG GTC TTA 1017 
Asn Ile Gln Ala Ser Ser Lys Ser Pro Phe Ile Thr His Val Val Leu 
310 315 320 
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130 135 140 

Lys Ile Glu Gly Thr Lys Ile Thr Phe Gly Glu Ala Gly Ala Glu Tyr 
145 150 155 160 

Lys Lys Met Met Ala Lys Ala 
165 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52...1167 
(D) OTHER INFORMATION: 
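
As a consistency check on a record like this one, the coding-sequence LOCATION can be compared with the stated length of the encoded polypeptide: positions 52 through 1167 span 1116 bases, or 372 codons, which agrees with the 372 amino acids given for SEQ ID NO: 316. The sketch below is only an illustration of that arithmetic. The helper name `cds_codon_count` is hypothetical and not part of the listing, and the parsing assumes the location is written as a start and an end separated by dots, with stray spaces or commas being extraction damage.

```python
import re


def cds_codon_count(location: str) -> int:
    """Return the number of whole codons spanned by a 'start...end' LOCATION value.

    Hypothetical helper for checking a listing; not taken from the source.
    """
    # Keep only digits and dots; this also closes OCR gaps such as "4 6... 510".
    cleaned = re.sub(r"[^0-9.]", "", location)
    bounds = [part for part in cleaned.split(".") if part]
    if len(bounds) != 2:
        raise ValueError(f"cannot read start/end from {location!r}")
    start, end = (int(b) for b in bounds)
    span = end - start + 1  # both endpoints are inclusive
    if span <= 0 or span % 3 != 0:
        raise ValueError(f"span of {span} bases is not a whole number of codons")
    return span // 3


if __name__ == "__main__":
    print(cds_codon_count("52...1167"))  # prints 372
```

The same check applied to the other coding-sequence locations in this part of the listing (46...510, 68...568, 66...1502, 65...640) gives 155, 167, 479 and 192 codons respectively.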



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

TTATAATCAA AGCTATTTTA AAAGCTGAAT AGCTATAGTT ATTAGGATGC G ATG TCA 57 

Met Ser 
1 

AAA AGA ATG AAG TGT TTT AGT CAA AAA TGG TTG GTT TTT TTT GTT ACC 105 
Lys Arg Met Lys Cys Phe Ser Gln Lys Trp Leu Val Phe Phe Val Thr 
5 10 15 

CTT TTA TTG GCT TCT TTA GGC CAT GCG AAA ATG GCT TTT GAA TCC GAT 153 
Leu Leu Leu Ala Ser Leu Gly His Ala Lys Met Ala Phe Glu Ser Asp 
20 25 30 

ATT GAC ACC AAA GCG CTA GAG GCT TTT GGG GTT AAT GCG GGC TTT TTA 201 
Ile Asp Thr Lys Ala Leu Glu Ala Phe Gly Val Asn Ala Gly Phe Leu 
35 40 45 50 

TCC CAA ATG CCC AAC GCT TTA AAA AAA ATG AAT AAA GAA GAA GAA TGG 249 
Ser Gln Met Pro Asn Ala Leu Lys Lys Met Asn Lys Glu Glu Glu Trp 
55 60 65 

AAG AGA CTT GTC AAA AGA TTT GAT GTG AAT TAC CAG TTC ATC CCC ATC 297 
Lys Arg Leu Val Lys Arg Phe Asp Val Asn Tyr Gln Phe Ile Pro Ile 
70 75 80 

ATT AAA AAC ATG CTC ATA GAA GCG AGC GTG CCG CAA GAA TTT TTA TTT 345 
Ile Lys Asn Met Leu Ile Glu Ala Ser Val Pro Gln Glu Phe Leu Phe 
85 90 95 

TTA GCC ATG GCC GAG TCT AAA TTT TCA TCA AGG GCT TAT AGC AGG AAA 393 
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Asn Glu Lys Arg Asp Phe Lys Val Gly Glu Ser Val Phe Thr Thr Ala 
80 85 90 

ATT CAA ATT TGC ACG CAT TTA GGG TGT ATC CCC ACT TAT CAA GAT GAA 397 
Ile Gln Ile Cys Thr His Leu Gly Cys Ile Pro Thr Tyr Gln Asp Glu 
95 100 105 110 

GAA AAA GGC TTT TTA TGC CCA TGC CAT GGG GGG CGT TTC ACT TCT GAT 445 
Glu Lys Gly Phe Leu Cys Pro Cys His Gly Gly Arg Phe Thr Ser Asp 
115 120 125 

GGC GTG AAT ATT GCC GGC ACT CCC CCT CCA CGC CCT TTT GAT ATC CCG 493 
Gly Val Asn Ile Ala Gly Thr Pro Pro Pro Arg Pro Phe Asp Ile Pro 
130 135 140 

CCT TTT AAA ATT GAA GGC ACT AAG ATC ACT TTT GGT GAA GCC GGG GCT 541 
Pro Phe Lys Ile Glu Gly Thr Lys Ile Thr Phe Gly Glu Ala Gly Ala 
145 150 155 

GAA TAC AAG AAA ATG ATG GCT AAA GCG TAAGGAGAGT TTAATGGCAG AGATAAA 595 
Glu Tyr Lys Lys Met Met Ala Lys Ala 
160 165 

AAAAGCGAAA AATTTAGGCG AATGG 620 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 



Met Ala Asp Ile Gln Arg Arg Asp Phe Leu Gly Met Ser Leu Ala Ser 
1 5 10 15 

Val Thr Ala Ile Gly Ala Ile Ala Ser Leu Val Ala Met Lys Lys Thr 
20 25 30 

Trp Asp Pro Leu Pro Ser Val Val Ser Ala Gly Phe Thr Thr Ile Asp 
35 40 45 

Val Ala Asn Met Gln Glu Gly Gln Phe Ser Thr Val Glu Trp Arg Gly 
50 55 60 

Lys Pro Val Tyr Ile Leu Lys Arg Ser Lys Lys Glu Gly Phe Asn Glu 
65 70 75 80 

Lys Arg Asp Phe Lys Val Gly Glu Ser Val Phe Thr Thr Ala Ile Gln 
85 90 95 

Ile Cys Thr His Leu Gly Cys Ile Pro Thr Tyr Gln Asp Glu Glu Lys 
100 105 110 

Gly Phe Leu Cys Pro Cys His Gly Gly Arg Phe Thr Ser Asp Gly Val 
115 120 125 

Asn Ile Ala Gly Thr Pro Pro Pro Arg Pro Phe Asp Ile Pro Pro Phe 
Leu Ala Tyr Ser Ser Thr Arg Lys Ser Glu Ile Thr His Asn Glu Leu 
385 390 395 400 

Asn Leu Asn Glu Lys Phe Met Glu Phe Val Glu Val Tyr Glu Gly His 
405 410 415 

Tyr Leu Asn Asp Ile Ile Lys Glu Ser Ser Glu Tyr Lys Glu Trp Val 
420 425 430 

Lys Asn His Val Arg Phe Lys Glu Gly Val Cys Met Ala Leu Glu Ile 
435 440 445 

Glu Glu Gln Pro Arg Ala Lys Ser Thr Pro Leu Ser Ile Glu Asn Ser 
450 455 460 

Arg Val Val Cys Val Lys Lys Gly Asn Tyr Leu Phe Asn Glu Val 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 68...568 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

TGTAAAATAG GGATTTGCTA GGCCTTTAGT CGTTAAAAGG TAATTATCAT TAAGGAGTTT 60 
TTTAATC ATG GCA GAT ATT CAA AGG CGT GAT TTT TTA GGA ATG AGC CTT 109 
Met Ala Asp Ile Gln Arg Arg Asp Phe Leu Gly Met Ser Leu 
1 5 10 

GCT AGT GTT ACA GCT ATA GGG GCT ATA GCG AGT CTG GTA GCG ATG AAA 157 
Ala Ser Val Thr Ala Ile Gly Ala Ile Ala Ser Leu Val Ala Met Lys 
15 20 25 30 

AAG ACT TGG GAT CCG CTT CCA AGC GTT GTT TCA GCC GGT TTT ACG ACC 205 
Lys Thr Trp Asp Pro Leu Pro Ser Val Val Ser Ala Gly Phe Thr Thr 
35 40 45 

ATA GAT GTG GCG AAT ATG CAA GAA GGG CAG TTT TCC ACC GTG GAA TGG 253 
Ile Asp Val Ala Asn Met Gln Glu Gly Gln Phe Ser Thr Val Glu Trp 
50 55 60 

CGT GGG AAA CCG GTC TAT ATC CTC AAG CGT TCT AAA AAA GAG GGC TTT 301 
Arg Gly Lys Pro Val Tyr Ile Leu Lys Arg Ser Lys Lys Glu Gly Phe 
65 70 75 

AAT GAA AAG CGC GAT TTT AAA GTT GGC GAG AGC GTT TTT ACC ACA GCC 349 
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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 



Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn Pro 
1 5 10 15 

Leu Gln Ala Leu Val Ile Glu Leu Leu Glu Glu Ile Lys Thr Ser Pro 
20 25 30 

His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser Lys Lys Pro Arg 
35 40 45 

Gln Val Leu Gly Val Tyr Asn Ile Ser Pro His Lys Lys Leu Thr Leu 
50 55 60 

Thr Ile Thr His Ile Ser Thr Ala Ile Val Tyr Gln Pro Leu Asp Glu 
65 70 75 80 

Lys Leu Ser Leu Glu Thr Thr Leu Asn Pro Asn Arg Pro Thr Ile Pro 
85 90 95 

Arg Asn Thr Gln Ile Val Phe Ser Ser Lys Glu Leu Lys Glu Ser His 
100 105 110 

Pro His Gln Met Pro Ser Leu Asn Ala Pro Met Gln Lys Pro Gln Asn 
115 120 125 

Lys Pro His Ser Ser Gln Gln Pro Ser Gln Asn Phe Ser Tyr Pro Glu 
130 135 140 

Pro Lys Leu Gly Ser Lys Asn Ser Lys Asn Ser Leu Leu Gln Pro Leu 
145 150 155 160 

Ala Ile Pro Ser Lys Ile Ser Pro Thr Asn Glu Thr Gln Thr Pro Thr 
165 170 175 

Asn Asp Thr Lys Pro Pro Leu Lys His Ser Ser Glu Asp Gln Glu Ser 
180 185 190 

Asn Leu Phe Ile Thr Pro Pro Thr Glu Lys Thr Leu Pro Asn Asn Thr 
195 200 205 

Ser Asn Ala Asp Ile Ser Glu Asn Asn Glu Ser Asn Glu Asn Lys Asp 
210 215 220 

Asn Val Glu Lys Gln Ala Ile Arg Asp Ala Asn Ile Lys Glu Phe Ala 
225 230 235 240 

Cys Gly Lys Trp Val Tyr Asp Asp Glu Asn Leu Gln Ala Tyr Arg Pro 
245 250 255 

Ser Ile Leu Lys Arg Val Asp Glu Asp Lys Gln Thr Ala Thr Asp Ile 
260 265 270 

Thr Pro Cys Asp Tyr Ser Thr Ala Glu Asn Lys Ser Gly Lys Ile Ile 
275 280 285 

Thr Pro Tyr Thr Lys Ile Ser Val His Lys Thr Glu Pro Leu Glu Glu 
290 295 300 

Pro Gln Thr Phe Glu Ala Lys Asn Asn Phe Ala Ile Leu Gln Ala Arg 
305 310 315 320 

Ser Ser Thr Glu Lys Cys Lys Arg Ala Arg Ala Arg Lys Asp Gly Thr 
325 330 335 

Thr Arg Gln Cys Tyr Leu Ile Glu Glu Pro Leu Lys Gln Ala Trp Glu 
340 345 350 

Ser Glu Tyr Glu Ile Thr Thr Gln Leu Val Lys Ala Ile Tyr Glu Arg 
355 360 365 

Pro Lys Gln Asp Asp Gln Val Glu Pro Thr Phe Tyr Glu Thr Ser Glu 
370 375 380 
Ile Thr Pro Tyr Thr Lys Ile Ser Val His Lys Thr Glu Pro Leu Glu 
290 295 300 

GAG CCA CAA ACT TTT GAA GCT AAA AAT AAT TTC GCC ATT CTT CAA GCC 1022 
Glu Pro Gln Thr Phe Glu Ala Lys Asn Asn Phe Ala Ile Leu Gln Ala 
305 310 315 

AGA AGC TCT ACA GAA AAA TGC AAA AGG GCT AGA GCA AGA AAA GAC GGC 1070 
Arg Ser Ser Thr Glu Lys Cys Lys Arg Ala Arg Ala Arg Lys Asp Gly 
320 325 330 335 

ACG ACT AGG CAA TGC TAT CTA ATA GAA GAG CCT TTA AAA CAA GCA TGG 1118 
Thr Thr Arg Gln Cys Tyr Leu Ile Glu Glu Pro Leu Lys Gln Ala Trp 
340 345 350 

GAG AGT GAG TAT GAA ATC ACC ACG CAA TTA GTG AAA GCC ATT TAT GAG 1166 
Glu Ser Glu Tyr Glu Ile Thr Thr Gln Leu Val Lys Ala Ile Tyr Glu 
355 360 365 

CGC CCC AAA CAA GAC GAT CAA GTA GAG CCG ACT TTT TAT GAA ACC AGC 1214 
Arg Pro Lys Gln Asp Asp Gln Val Glu Pro Thr Phe Tyr Glu Thr Ser 
370 375 380 

GAA TTG GCT TAT TCT TCC ACA CGA AAA AGC GAA ATA ACG CAC AAT GAA 1262 
Glu Leu Ala Tyr Ser Ser Thr Arg Lys Ser Glu Ile Thr His Asn Glu 
385 390 395 

TTG AAT TTG AAT GAA AAA TTC ATG GAA TTT GTG GAA GTG TAT GAG GGG 1310 
Leu Asn Leu Asn Glu Lys Phe Met Glu Phe Val Glu Val Tyr Glu Gly 
400 405 410 415 

CAT TAT TTA AAC GAT ATA ATT AAA GAG AGC AGT GAA TAT AAA GAA TGG 1358 
His Tyr Leu Asn Asp Ile Ile Lys Glu Ser Ser Glu Tyr Lys Glu Trp 
420 425 430 

GTT AAA AAC CAT GTG CGC TTT AAA GAA GGG GTG TGC ATG GCT TTA GAA 1406 
Val Lys Asn His Val Arg Phe Lys Glu Gly Val Cys Met Ala Leu Glu 
435 440 445 

ATA GAA GAA CAG CCA CGA GCT AAA AGC ACG CCT TTG AGT ATT GAA AAC 1454 
Ile Glu Glu Gln Pro Arg Ala Lys Ser Thr Pro Leu Ser Ile Glu Asn 
450 455 460 

TCT CGT GTG GTA TGT GTC AAA AAG GGG AAT TAT TTA TTC AAC GAA GTT T 1503 
Ser Arg Val Val Cys Val Lys Lys Gly Asn Tyr Leu Phe Asn Glu Val 
465 470 475 

AAGATGGTGG CTTGAGGCGG AATCGAACCA CCGACACGAA GATTTTC 1550 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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65 70 75 

GAA AAA CTT TCT TTA GAA ACA ACC TTA AAC CCT AAC CGC CCT ACT ATC 350 
Glu Lys Leu Ser Leu Glu Thr Thr Leu Asn Pro Asn Arg Pro Thr Ile 
80 85 90 95 

CCT AGA AAC ACC CAG ATT GTT TTT TCT TCA AAA GAA TTG AAA GAG TCG 398 
Pro Arg Asn Thr Gln Ile Val Phe Ser Ser Lys Glu Leu Lys Glu Ser 
100 105 110 

CAC CCG CAC CAA ATG CCT TCT TTA AAC GCG CCC ATG CAA AAA CCA CAA 446 
His Pro His Gln Met Pro Ser Leu Asn Ala Pro Met Gln Lys Pro Gln 
115 120 125 

AAC AAA CCC CAT TCA TCG CAA CAA CCT TCT CAA AAC TTT TCT TAC CCA 494 
Asn Lys Pro His Ser Ser Gln Gln Pro Ser Gln Asn Phe Ser Tyr Pro 
130 135 140 

GAG CCC AAA CTA GGC TCT AAA AAC TCT AAA AAC AGC CTT TTA CAG CCT 542 
Glu Pro Lys Leu Gly Ser Lys Asn Ser Lys Asn Ser Leu Leu Gln Pro 
145 150 155 

TTA GCA ATT CCT AGC AAA ATA AGT CCC ACT AAC GAA ACT CAA ACG CCA 590 
Leu Ala Ile Pro Ser Lys Ile Ser Pro Thr Asn Glu Thr Gln Thr Pro 
160 165 170 175 

ACA AAC GAC ACT AAA CCC CCT TTA AAG CAT TCT TCA GAA GAT CAA GAA 638 
Thr Asn Asp Thr Lys Pro Pro Leu Lys His Ser Ser Glu Asp Gln Glu 
180 185 190 

AGC AAC CTC TTT ATA ACG CCA CCC ACT GAA AAA ACG CTC CCT AAC AAC 686 
Ser Asn Leu Phe Ile Thr Pro Pro Thr Glu Lys Thr Leu Pro Asn Asn 
195 200 205 

ACC TCT AAC GCT GAT ATT AGT GAA AAC AAT GAA AGC AAT GAG AAT AAA 734 
Thr Ser Asn Ala Asp Ile Ser Glu Asn Asn Glu Ser Asn Glu Asn Lys 
210 215 220 

GAT AAT GTG GAA AAA CAA GCC ATT AGA GAT GCT AAT ATT AAA GAA TTT 782 
Asp Asn Val Glu Lys Gln Ala Ile Arg Asp Ala Asn Ile Lys Glu Phe 
225 230 235 

GCA TGC GGG AAG TGG GTC TAT GAC GAT GAA AAT TTA CAA GCC TAC CGC 830 
Ala Cys Gly Lys Trp Val Tyr Asp Asp Glu Asn Leu Gln Ala Tyr Arg 
240 245 250 255 

CCA AGC ATT TTA AAA CGC GTT GAT GAA GAC AAA CAA ACT GCA ACA GAT 878 
Pro Ser Ile Leu Lys Arg Val Asp Glu Asp Lys Gln Thr Ala Thr Asp 
260 265 270 

ATT ACC CCT TGC GAT TAC AGC ACC GCT GAA AAT AAA AGC GGT AAA ATC 926 
Ile Thr Pro Cys Asp Tyr Ser Thr Ala Glu Asn Lys Ser Gly Lys Ile 
275 280 285 

ATT ACC CCC TAT ACT AAA ATC TCC GTT CAT AAA ACA GAG CCT TTA GAA 974 
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65 70 75 80 

Asp Thr Ala Pro Pro Leu Asp Thr Ala Ala Gln Lys Gln Glu Thr Lys 
85 90 95 

Gln Glu Gln Glu Lys Glu Asn Glu Pro Lys Gln Asp Ser Val Pro Pro 
100 105 110 

Val Gln Asn Asn Gln Lys Thr Pro Thr Thr Pro Leu Met Gly Lys Lys 
115 120 125 

Pro Leu Glu Tyr Lys Val Ala Val Ser Gly Val Asn Val Arg Ala Phe 
130 135 140 

Pro Ser Thr Lys Gly Lys Ile Leu Gly Leu Leu Leu Lys Asn Lys Ser 
145 150 155 160 

Val Lys Val Leu Glu Ile Gln Asn Asp Trp Ala Glu Ile Glu Phe Ser 
165 170 175 

His Glu Thr Lys Gly Tyr Val Phe Leu Lys Leu Leu Lys Lys Ala Glu 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 311: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 66...1502 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 



TTATGACTTT TACTAAACCT TTTTTTAAGC TATAATCCAA AAATCTAAAA TAAAAAGGAA 60 
TAAGC ATG AAA AAA TCC CTT TGT CTG TCT TTC TTT CTG ACT TTC TCT AAC 110 
Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn 
1 5 10 15 

CCT CTT CAA GCC CTT GTG ATC GAG CTT TTA GAA GAA ATC AAA ACT TCG 158 
Pro Leu Gln Ala Leu Val Ile Glu Leu Leu Glu Glu Ile Lys Thr Ser 
20 25 30 

CCG CAT AAA GGC ACT TTT AAG GCT AAA GTC CTT GAT TCT AAA AAA CCA 206 
Pro His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser Lys Lys Pro 
35 40 45 

AGA CAA GTT TTA GGC GTT TAT AAT ATC TCC CCA CAC AAA AAA CTC ACG 254 
Arg Gln Val Leu Gly Val Tyr Asn Ile Ser Pro His Lys Lys Leu Thr 
50 55 60 

CTC ACT ATC ACC CAC ATA TCC ACT GCA ATC GTC TAT CAA CCC CTT GAT 302 
Leu Thr Ile Thr His Ile Ser Thr Ala Ile Val Tyr Gln Pro Leu Asp 
AAA GAC ACA GCG CCG CCT TTA GAG ACA GCC GCG CAA AAA CAA GAA ACT 349 
Lys Asp Thr Ala Pro Pro Leu Asp Thr Ala Ala Gln Lys Gln Glu Thr 
80 85 90 95 

AAA CAA GAG CAA GAA AAA GAA AAC GAG CCT AAA CAA GAT AGC GTC CCG 397 
Lys Gln Glu Gln Glu Lys Glu Asn Glu Pro Lys Gln Asp Ser Val Pro 
100 105 110 

CCC GTT CAA AAC AAT CAA AAA ACC CCT ACA ACC CCC TTA ATG GGA AAA 445 
Pro Val Gln Asn Asn Gln Lys Thr Pro Thr Thr Pro Leu Met Gly Lys 
115 120 125 

AAA CCT TTA GAG TAT AAA GTC GCA GTC AGT GGC GTG AAT GTG CGC GCT 493 
Lys Pro Leu Glu Tyr Lys Val Ala Val Ser Gly Val Asn Val Arg Ala 
130 135 140 

TTT CCC AGC ACA AAA GGT AAA ATC TTG GGA TTG CTT TTA AAA AAT AAA 541 
Phe Pro Ser Thr Lys Gly Lys Ile Leu Gly Leu Leu Leu Lys Asn Lys 
145 150 155 

AGC GTG AAA GTT TTA GAA ATC CAA AAC GAT TGG GCT GAA ATT GAA TTT 589 
Ser Val Lys Val Leu Glu Ile Gln Asn Asp Trp Ala Glu Ile Glu Phe 
160 165 170 175 

TCT CAC GAA ACA AAG GGC TAT GTG TTT TTA AAA CTT TTA AAA AAG GCT 637 
Ser His Glu Thr Lys Gly Tyr Val Phe Leu Lys Leu Leu Lys Lys Ala 
180 185 190 

GAA TGAAAGAATA ATGAAATTAA AATCTTTTGG GGTTTTTGGA AATCCCATTA 690 
Glu 



(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

Met Lys Thr Glu Met Lys Ser Ser Leu Lys Leu Phe Met Arg Pro Leu 
1 5 10 15 

Leu Val Val Leu Ala Phe Met Leu Leu Tyr Ala Leu Val His Ala Ala 
20 25 30 

Leu Gly Phe Tyr Val Lys Lys Asp Ser Ala Pro Ile Ser Pro Asn Val 
35 40 45 

Glu Lys Thr Glu Thr Glu Arg Gln Asn Gly Val Leu Ser Pro Lys Gln 
50 55 60 

Glu Glu Ala Asn Ala Thr Thr Thr Ala Thr Glu Glu Ser Pro Thr Lys 
Val Ser Lys Lys Leu Phe Lys Glu Glu Asn Ala Arg Val Ile Ala Leu 
1700 1705 1710 

Lys Gly Glu Pro Ala Ile Ala Glu Pro Val Leu Leu Gly Ile Thr Arg 
1715 1720 1725 

Ala Ala Ile Gly Ser Asp Ser Ile Ile Ser Ala Ala Ser Phe Gln Glu 
1730 1735 1740 

Thr Thr Lys Val Leu Thr Glu Ala Ser Ile Ala Met Lys Lys Asp Phe 
1745 1750 1755 1760 

Leu Glu Asp Leu Lys Glu Asn Val Val Leu Gly Arg Met Ile Pro Val 
1765 1770 1775 

Gly Thr Gly Met Tyr Lys Asn Lys Lys Ile Val Leu Arg Ala Leu Glu 
1780 1785 1790 

Asp Asn Ser Lys Phe 
1795 

(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 65...640 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

TAAGCGATTT GCTCTGTGTG GTGATTGACC CTAGGATTGA TTTTGAAAAG CGTTGAGGGT 60 
AGGA ATG AAA ACT GAG ATG AAA TCT TCT TTA AAA CTT TTT ATG CGG CCT 109 
Met Lys Thr Glu Met Lys Ser Ser Leu Lys Leu Phe Met Arg Pro 
1 5 10 15 

TTG TTG GTG GTT TTA GCG TTC ATG TTG TTG TAT GCT TTA GTG CAT GCT 157 
Leu Leu Val Val Leu Ala Phe Met Leu Leu Tyr Ala Leu Val His Ala 
20 25 30 

GCG CTT GGT TTT TAT GTA AAA AAA GAC AGC GCT CCA ATA AGC CCA AAT 205 
Ala Leu Gly Phe Tyr Val Lys Lys Asp Ser Ala Pro Ile Ser Pro Asn 
35 40 45 

GTA GAA AAA ACC GAG ACA GAG CGT CAA AAC GGC GTG CTT TCG CCC AAA 253 
Val Glu Lys Thr Glu Thr Glu Arg Gln Asn Gly Val Leu Ser Pro Lys 
50 55 60 

CAA GAA GAA GCC AAC GCA ACC ACA ACT GCC ACA GAA GAA AGC CCC ACC 301 
Gln Glu Glu Ala Asn Ala Thr Thr Thr Ala Thr Glu Glu Ser Pro Thr 
65 70 75 
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1250 1255 1260 

Pro Phe Asp Gly Glu Leu Arg Ile Glu Thr Val Tyr Glu Glu Val Val 
1265 1270 1275 1280 

Val Ser Val Lys Asn Gly Asp Gln Glu Ala Lys Phe Val Leu Arg Arg 
1285 1290 1295 

Ser Asp Ile Val Lys Pro Ser Glu Leu Ala Gly Val Gly Gly Lys Ile 
1300 1305 1310 

Glu Gly Lys Val Tyr Leu Pro Tyr Ala Ser Gly His Lys Val His Lys 
1315 1320 1325 

Gly Gly Ser Ile Ala Asp Ile Ile Gln Glu Gly Trp Asn Val Pro Asn 
1330 1335 1340 

Arg Ile Pro Tyr Ala Ser Glu Leu Leu Val Lys Asp Asn Asp Pro Ile 
1345 1350 1355 1360 

Ala Gln Asp Val Tyr Ala Lys Glu Lys Gly Val Ile Lys Tyr Tyr Val 
1365 1370 1375 

Leu Glu Ala Asn His Leu Glu Arg Thr His Gly Ile Lys Lys Gly Asp 
1380 1385 1390 

Met Val Ser Glu Lys Gly Leu Phe Ala Val Ile Ala Asp Asp Asn Gly 
1395 1400 1405 

Arg Glu Ala Ala Arg His Tyr Ile Ala Arg Gly Ser Glu Ile Leu Ile 
1410 1415 1420 

Asp Asp Asn Ser Glu Val Ser Thr Asn Ser Val Ile Ser Lys Pro Thr 
1425 1430 1435 1440 

Thr Asn Thr Phe Lys Thr Ile Ala Thr Trp Asp Pro Tyr Asn Thr Pro 
1445 1450 1455 

Ile Ile Ala Asp Phe Lys Gly Lys Val Gly Phe Val Asp Val Ile Ala 
1460 1465 1470 

Gly Val Thr Val Ala Glu Lys Glu Asp Glu Asn Thr Gly Ile Thr Ser 
1475 1480 1485 

Leu Val Val Asn Asp Tyr Ile Pro Ser Gly Tyr Lys Pro Ser Leu Phe 
1490 1495 1500 

Leu Glu Gly Ala Asn Gly Glu Glu Met Arg Tyr Phe Leu Glu Pro Lys 
1505 1510 1515 1520 

Thr Ser Ile Ala Ile Ser Asp Gly Ser Ser Val Glu Gln Ala Glu Val 
1525 1530 1535 

Leu Ala Lys Ile Pro Lys Ala Thr Val Lys Ser Arg Asp Ile Thr Gly 
1540 1545 1550 

Gly Leu Pro Arg Val Ser Glu Leu Phe Glu Ala Arg Lys Pro Lys Pro 
1555 1560 1565 

Lys Asp Val Ala Ile Leu Ser Glu Val Asp Gly Ile Val Ser Phe Gly 
1570 1575 1580 

Lys Pro Ile Arg Asn Lys Glu His Ile Ile Val Thr Ser Lys Asp Gly 
1585 1590 1595 1600 

Arg Ser Met Asp Tyr Phe Val Asp Lys Gly Lys Gln Ile Leu Val His 
1605 1610 1615 

Ala Asp Glu Phe Val His Ala Gly Glu Ala Met Thr Asp Gly Val Ile 
1620 1625 1630 

Ser Ser His Asp Ile Leu Arg Ile Ser Gly Glu Lys Glu Leu Tyr Lys 
1635 1640 1645 

Tyr Ile Val Ser Glu Val Gln Gln Val Tyr Arg Arg Gln Gly Val Ser 
1650 1655 1660 

Ile Ala Asp Lys His Ile Glu Ile Ile Val Ser Gln Met Leu Arg Gln 
1665 1670 1675 1680 

Val Arg Ile Leu Asp Ser Gly Asp Ser Lys Phe Ile Glu Gly Asp Leu 
1685 1690 1695 
Lys Gly Glu His Lys Leu Phe Ser Ser Val Asn Glu Ile Ile Thr Ala 
820 825 830 

Ile Asp Thr Lys Glu Leu Asp Ile His Ala Lys Ile Arg Val Leu Asp 
835 840 845 

Gln Gly Asn Ile Ile Ala Thr Ser Ala Gly Arg Met Ile Ile Lys Ser 
850 855 860 

Ile Leu Pro Asp Phe Ile Pro Thr Asp Leu Trp Asn Arg Pro Met Lys 
865 870 875 880 

Lys Lys Asp Ile Gly Val Leu Val Asp Tyr Val His Lys Val Gly Gly 
885 890 895 

Ile Gly Ile Thr Ala Thr Phe Leu Asp Asn Leu Lys Thr Leu Gly Phe 
900 905 910 

Arg Tyr Ala Thr Lys Ala Gly Ile Ser Ile Ser Met Glu Asp Ile Ile 
915 920 925 

Thr Pro Lys Asp Lys Gln Lys Met Val Glu Lys Ala Lys Val Glu Val 
930 935 940 

Lys Lys Ile Gln Gln Gln Tyr Asp Gln Gly Leu Leu Thr Asp Gln Glu 
945 950 955 960 

Arg Tyr Asn Lys Ile Ile Asp Thr Trp Thr Glu Val Asn Asp Lys Met 
965 970 975 

Ser Lys Glu Met Met Thr Ala Ile Ala Gln Asp Lys Glu Gly Phe Asn 
980 985 990 

Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln 
995 1000 1005 

Ile Arg Gln Leu Ser Ala Met Arg Gly Leu Met Thr Lys Pro Asp Gly 
1010 1015 1020 

Ser Ile Ile Glu Thr Pro Ile Ile Ser Asn Phe Lys Glu Gly Leu Asn 
1025 1030 1035 1040 

Val Leu Glu Tyr Phe Asn Ser Thr His Gly Ala Arg Lys Gly Leu Ala 
1045 1050 1055 

Asp Thr Ala Leu Lys Thr Ala Asn Ala Gly Tyr Leu Thr Arg Lys Leu 
1060 1065 1070 

Ile Asp Val Ser Gln Asn Val Lys Val Val Ser Asp Asp Cys Gly Thr 
1075 1080 1085 

His Glu Gly Ile Glu Ile Thr Asp Ile Ala Val Gly Ser Glu Leu Ile 
1090 1095 1100 

Glu Pro Leu Glu Glu Arg Ile Phe Gly Arg Val Leu Leu Glu Asp Val 
1105 1110 1115 1120 

Ile Asp Pro Ile Thr Asn Glu Ile Leu Leu Tyr Ala Asp Thr Leu Ile 
1125 1130 1135 

Asp Glu Glu Gly Ala Lys Lys Val Val Glu Ala Gly Ile Lys Ser Ile 
1140 1145 1150 

Thr Ile Arg Thr Pro Val Thr Cys Lys Ala Pro Lys Gly Val Cys Ala 
1155 1160 1165 

Lys Cys Tyr Gly Leu Asn Leu Gly Glu Gly Lys Met Ser Tyr Pro Gly 
1170 1175 1180 

Glu Ala Val Gly Val Val Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr 
1185 1190 1195 1200 

Gln Leu Thr Leu Arg Thr Phe His Val Gly Gly Thr Ala Ser Arg Ser 
1205 1210 1215 

Gln Asp Glu Arg Glu Ile Val Ala Ser Lys Glu Gly Phe Val Arg Phe 
1220 1225 1230 

Tyr Asn Leu Arg Thr Tyr Thr Asn Lys Glu Gly Lys Asn Ile Ile Ala 
1235 1240 1245 

Asn Arg Arg Asn Ala Ser Ile Leu Val Val Glu Pro Lys Ile Lys Ala 
370 375 380 

Arg Arg Phe Arg Met Gly His Ile Glu Leu Ala Thr Pro Val Ala His 
385 390 395 400 

Ile Trp Tyr Val Asn Ser Leu Pro Ser Arg Ile Gly Thr Leu Leu Gly 
405 410 415 

Val Lys Met Lys Asp Leu Glu Arg Val Leu Tyr Tyr Glu Ala Tyr Ile 
420 425 430 

Val Lys Glu Pro Gly Glu Ala Ala Tyr Asp Asn Glu Gly Thr Lys Leu 
435 440 445 

Val Met Lys Tyr Asp Ile Leu Asn Glu Glu Gln Tyr Gln Asn Ile Ser 
450 455 460 

Arg Arg Tyr Glu Asp Arg Gly Phe Val Ala Gln Met Gly Gly Glu Ala 
465 470 475 480 

Ile Lys Asp Leu Leu Glu Glu Ile Asp Leu Ile Thr Leu Leu Gln Ser 
485 490 495 

Leu Lys Glu Glu Val Lys Asp Thr Asn Ser Asp Ala Lys Lys Lys Lys 
500 505 510 

Leu Ile Lys Arg Leu Lys Val Val Glu Ser Phe Leu Asn Ser Gly Asn 
515 520 525 

Arg Pro Glu Trp Met Met Leu Thr Val Leu Pro Val Leu Pro Pro Asp 
530 535 540 

Leu Arg Pro Leu Val Ala Leu Asp Gly Gly Lys Phe Ala Val Ser Asp 
545 550 555 560 

Val Asn Glu Leu Tyr Arg Arg Val Ile Asn Arg Asn Gln Arg Leu Lys 
565 570 575 

Arg Leu Met Glu Leu Gly Ala Pro Glu Ile Ile Val Arg Asn Glu Lys 
580 585 590 

Arg Met Leu Gln Glu Ala Val Asp Val Leu Phe Asp Asn Gly Arg Ser 
595 600 605 

Thr Asn Ala Val Lys Gly Ala Asn Lys Arg Pro Leu Lys Ser Leu Ser 
610 615 620 

Glu Ile Ile Lys Gly Lys Gln Gly Arg Phe Arg Gln Asn Leu Leu Gly 
625 630 635 640 

Lys Arg Val Asp Phe Ser Gly Arg Ser Val Ile Val Val Gly Pro Asn 
645 650 655 

Leu Lys Met Asp Glu Cys Gly Leu Pro Lys Asn Met Ala Leu Glu Leu 
660 665 670 

Phe Lys Pro His Leu Leu Ser Lys Leu Glu Glu Arg Gly Tyr Ala Thr 
675 680 685 

Thr Leu Lys Gln Ala Lys Arg Met Ile Glu Gln Lys Ser Asn Glu Val 
690 695 700 

Trp Glu Cys Leu Gln Glu Ile Thr Glu Gly Tyr Pro Val Leu Leu Asn 
705 710 715 720 

Arg Ala Pro Thr Leu His Lys Gln Ser Ile Gln Ala Phe His Pro Lys 
725 730 735 

Leu Ile Asp Gly Lys Ala Ile Gln Leu His Pro Leu Val Cys Ser Ala 
740 745 750 

Phe Asn Ala Asp Phe Asp Gly Asp Gln Met Ala Val His Val Pro Leu 
755 760 765 

Ser Gln Glu Ala Ile Ala Glu Cys Lys Val Leu Met Leu Ser Ser Met 
770 775 780 

Asn Ile Leu Leu Pro Ala Ser Gly Lys Ala Val Ala Ile Pro Ser Gln 
785 790 795 800 

Asp Met Val Leu Gly Leu Tyr Tyr Leu Ser Leu Glu Lys Ser Gly Val 
805 810 815 




WO 98/43478 



PCT/US98/06371 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

Met Ala Gly Arg His Gly Asn Lys Gly Ile Val Ser Asn Ile Val Pro 
1 5 10 15 

Val Ala Asp Met Pro Tyr Thr Ala Asp Gly Glu Pro Val Asp Ile Val 
20 25 30 

Leu Asn Pro Leu Gly Val Pro Ser Arg Met Asn Ile Gly Gln Ile Leu 
35 40 45 

Glu Met His Leu Gly Leu Val Gly Lys Glu Phe Gly Lys Gln Ile Ala 
50 55 60 

Arg Met Leu Glu Asp Lys Thr Lys Asp Phe Ala Lys Glu Leu Arg Ala 
65 70 75 80 

Lys Met Leu Glu Xaa Ala Asn Ala Ile Asn Glu Lys Asp Pro Leu Thr 
85 90 95 

Ile His Ala Leu Glu Asn Cys Ser Asp Glu Glu Leu Leu Glu Tyr Ala 
100 105 110 

Lys Asp Trp Ser Lys Gly Val Lys Met Ala Ile Pro Val Phe Glu Gly 
115 120 125 

Ile Ser Gln Glu Lys Phe Tyr Lys Leu Phe Glu Leu Ala Lys Ile Ala 
130 135 140 

Met Asp Gly Lys Met Asp Leu Tyr Asp Gly Arg Thr Gly Glu Lys Met 
145 150 155 160 

Arg Glu Arg Val Asn Val Gly Tyr Met Tyr Met Ile Lys Leu His His 
165 170 175 

Leu Val Asp Glu Lys Val His Ala Arg Ser Thr Gly Pro Tyr Ser Leu 
180 185 190 

Val Thr His Gln Pro Val Gly Gly Lys Ala Leu Phe Gly Gly Gln Arg 
195 200 205 

Phe Gly Glu Met Glu Val Trp Ala Leu Glu Ala Tyr Gly Ala Ala His 
210 215 220 

Thr Leu Lys Glu Met Leu Thr Ile Lys Ser Asp Asp Ile Arg Gly Arg 
225 230 235 240 

Glu Asn Ala Tyr Arg Ala Ile Ala Lys Gly Glu Gln Val Gly Glu Ser 
245 250 255 

Glu Ile Pro Glu Thr Phe Tyr Val Leu Thr Lys Glu Leu Gln Ser Leu 
260 265 270 

Ala Leu Asp Ile Asn Ile Phe Gly Asp Asp Val Asp Glu Asp Gly Ala 
275 280 285 

Pro Lys Pro Ile Val Ile Lys Glu Asp Asp Arg Pro Lys Asp Phe Ser 
290 295 300 

Ser Phe Gln Leu Thr Leu Ala Ser Pro Glu Lys Ile His Ser Trp Ser 
305 310 315 320 

Tyr Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr Leu Lys 
325 330 335 

Pro Glu Arg Asp Gly Leu Phe Cys Met Lys Ile Phe Gly Pro Thr Lys 
340 345 350 

Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Lys Pro Arg Phe Lys Asp 
355 360 365 

Ile Gly Thr Cys Glu Lys Cys Gly Val Ala Ile Thr His Ser Lys Val 
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GGC AAG CAA ATT TTA GTG CAT GCC GAT GAA TTT GTG CAT GCG GGA GAA 5261 
Gly Lys Gln Ile Leu Val His Ala Asp Glu Phe Val His Ala Gly Glu
1610 1615 1620 1625

GCG ATG ACG GAC GGA GTA ATT TCA AGC CAT GAT ATT TTA AGG ATC AGT 5309
Ala Met Thr Asp Gly Val Ile Ser Ser His Asp Ile Leu Arg Ile Ser
1630 1635 1640

GGC GAA AAA GAG CTT TAT AAA TAC ATT GTG AGC GAA GTC CAG CAA GTG 5357
Gly Glu Lys Glu Leu Tyr Lys Tyr Ile Val Ser Glu Val Gln Gln Val
1645 1650 1655

TAT CGC AGG CAG GGG GTG AGC ATT GCG GAC AAG CAC ATT GAA ATC ATT 5405
Tyr Arg Arg Gln Gly Val Ser Ile Ala Asp Lys His Ile Glu Ile Ile
1660 1665 1670

GTT TCT CAA ATG CTA AGA CAG GTG CGT ATT TTA GAC AGC GGG GAT AGC 5453
Val Ser Gln Met Leu Arg Gln Val Arg Ile Leu Asp Ser Gly Asp Ser
1675 1680 1685

AAG TTT ATT GAA GGG GAT TTA GTC AGT AAA AAA CTT TTC AAA GAA GAA 5501
Lys Phe Ile Glu Gly Asp Leu Val Ser Lys Lys Leu Phe Lys Glu Glu
1690 1695 1700 1705

AAC GCT CGT GTG ATC GCT TTA AAA GGC GAG CCA GCG ATT GCT GAA CCG 5549
Asn Ala Arg Val Ile Ala Leu Lys Gly Glu Pro Ala Ile Ala Glu Pro
1710 1715 1720

GTG CTT TTA GGG ATC ACT AGA GCG GCT ATT GGG AGC GAT AGC ATC ATC 5597
Val Leu Leu Gly Ile Thr Arg Ala Ala Ile Gly Ser Asp Ser Ile Ile
1725 1730 1735

TCA GCG GCC TCT TTC CAA GAA ACG ACT AAA GTT TTA ACA GAA GCC AGT 5645
Ser Ala Ala Ser Phe Gln Glu Thr Thr Lys Val Leu Thr Glu Ala Ser
1740 1745 1750

ATC GCT ATG AAA AAA GAC TTT TTA GAG GAT TTG AAA GAG AAT GTG GTG 5693
Ile Ala Met Lys Lys Asp Phe Leu Glu Asp Leu Lys Glu Asn Val Val
1755 1760 1765

TTG GGG AGG ATG ATC CCT GTG GGA ACA GGC ATG TAT AAG AAT AAA AAA 5741
Leu Gly Arg Met Ile Pro Val Gly Thr Gly Met Tyr Lys Asn Lys Lys
1770 1775 1780 1785

ATC GTG TTA AGA GCG CTT GAG GAT AAC TCT AAA TTT TGATATGAAA AATCGG 5793
Ile Val Leu Arg Ala Leu Glu Asp Asn Ser Lys Phe
1790 1795

TTAAGATTTT TAAAAGAAAA ATTAGGGTAA AATGGGGGA 5832

(2) INFORMATION FOR SEQ ID NO: 308:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1797 amino acids

(B) TYPE: amino acid 
His Gly Ile Lys Lys Gly Asp Met Val Ser Glu Lys Gly Leu Phe Ala
1390 1395 1400 

GTG ATA GCT GAT GAT AAT GGT AGG GAA GCC GCT CGC CAT TAT ATC GCT 4637
Val Ile Ala Asp Asp Asn Gly Arg Glu Ala Ala Arg His Tyr Ile Ala
1405 1410 1415

AGG GGT TCT GAG ATC TTG ATT GAT GAT AAT AGT GAA GTG AGC ACT AAT 4685
Arg Gly Ser Glu Ile Leu Ile Asp Asp Asn Ser Glu Val Ser Thr Asn
1420 1425 1430

AGC GTG ATT TCT AAA CCC ACG ACT AAC ACT TTC AAA ACG ATT GCC ACA 4733
Ser Val Ile Ser Lys Pro Thr Thr Asn Thr Phe Lys Thr Ile Ala Thr
1435 1440 1445

TGG GAT CCT TAC AAC ACC CCT ATC ATT GCG GAC TTT AAA GGT AAG GTG 4781
Trp Asp Pro Tyr Asn Thr Pro Ile Ile Ala Asp Phe Lys Gly Lys Val
1450 1455 1460 1465

GGT TTT GTG GAT GTT ATC GCA GGG GTT ACG GTC GCT GAA AAA GAA GAC 4829
Gly Phe Val Asp Val Ile Ala Gly Val Thr Val Ala Glu Lys Glu Asp
1470 1475 1480

GAA AAT ACC GGT ATC ACA AGC TTA GTG GTG AAT GAT TAC ATT CCA AGC 4877
Glu Asn Thr Gly Ile Thr Ser Leu Val Val Asn Asp Tyr Ile Pro Ser
1485 1490 1495

GGA TAC AAA CCA AGC TTG TTT TTA GAG GGG GCT AAT GGC GAA GAG ATG 4925
Gly Tyr Lys Pro Ser Leu Phe Leu Glu Gly Ala Asn Gly Glu Glu Met
1500 1505 1510

CGT TAT TTC CTA GAG CCA AAA ACC TCT ATC GCC ATT AGC GAT GGC TCT 4973
Arg Tyr Phe Leu Glu Pro Lys Thr Ser Ile Ala Ile Ser Asp Gly Ser
1515 1520 1525

AGC GTG GAG CAA GCT GAA GTG TTA GCG AAA ATC CCT AAA GCG ACC GTT 5021
Ser Val Glu Gln Ala Glu Val Leu Ala Lys Ile Pro Lys Ala Thr Val
1530 1535 1540 1545

AAA TCT AGG GAT ATT ACC GGG GGT CTC CCA AGG GTT TCG GAA CTC TTT 5069
Lys Ser Arg Asp Ile Thr Gly Gly Leu Pro Arg Val Ser Glu Leu Phe
1550 1555 1560

GAA GCG AGA AAA CCC AAG CCT AAA GAT GTG GCG ATC CTT TCT GAA GTT 5117
Glu Ala Arg Lys Pro Lys Pro Lys Asp Val Ala Ile Leu Ser Glu Val
1565 1570 1575

GAT GGG ATT GTG AGT TTT GGC AAA CCC ATT CGC AAT AAA GAA CAC ATC 5165
Asp Gly Ile Val Ser Phe Gly Lys Pro Ile Arg Asn Lys Glu His Ile
1580 1585 1590

ATC GTA ACT TCT AAA GAT GGC CGT TCC ATG GAT TAT TTT GTG GAT AAA 5213
Ile Val Thr Ser Lys Asp Gly Arg Ser Met Asp Tyr Phe Val Asp Lys
1595 1600 1605
1165 1170 1175

GGC AAG ATG AGT TAT CCG GGT GAA GCG GTG GGC GTG GTA GCC GCG CAA 3965
Gly Lys Met Ser Tyr Pro Gly Glu Ala Val Gly Val Val Ala Ala Gln
1180 1185 1190

TCT ATT GGG GAG CCT GGA ACG CAG CTC ACT TTA AGG ACT TTC CAT GTG 4013
Ser Ile Gly Glu Pro Gly Thr Gln Leu Thr Leu Arg Thr Phe His Val
1195 1200 1205

GGC GGG ACA GCG AGC AGG AGT CAG GAT GAG CGC GAA ATC GTA GCG AGC 4061
Gly Gly Thr Ala Ser Arg Ser Gln Asp Glu Arg Glu Ile Val Ala Ser
1210 1215 1220 1225

AAA GAA GGT TTT GTG CGT TTT TAC AAC CTT AGG ACT TAC ACG AAT AAA 4109
Lys Glu Gly Phe Val Arg Phe Tyr Asn Leu Arg Thr Tyr Thr Asn Lys
1230 1235 1240

GAG GGT AAA AAC ATT ATC GCT AAC CGC CGT AAC GCT TCT ATT TTA GTG 4157
Glu Gly Lys Asn Ile Ile Ala Asn Arg Arg Asn Ala Ser Ile Leu Val
1245 1250 1255

GTA GAG CCT AAG ATT AAA GCG CCT TTT GAT GGG GAA TTA CGC ATT GAA 4205
Val Glu Pro Lys Ile Lys Ala Pro Phe Asp Gly Glu Leu Arg Ile Glu
1260 1265 1270

ACG GTT TAT GAA GAA GTC GTT GTG AGC GTG AAA AAT GGC GAT CAA GAA 4253
Thr Val Tyr Glu Glu Val Val Val Ser Val Lys Asn Gly Asp Gln Glu
1275 1280 1285

GCT AAA TTT GTT TTA AGG AGA AGC GAT ATT GTC AAG CCA AGC GAA TTA 4301
Ala Lys Phe Val Leu Arg Arg Ser Asp Ile Val Lys Pro Ser Glu Leu
1290 1295 1300 1305

GCC GGC GTT GGC GGT AAG ATT GAG GGG AAA GTG TAT TTG CCT TAT GCT 4349
Ala Gly Val Gly Gly Lys Ile Glu Gly Lys Val Tyr Leu Pro Tyr Ala
1310 1315 1320

AGT GGG CAT AAG GTG CAT AAG GGG GGA AGT ATC GCT GAT ATT ATC CAA 4397
Ser Gly His Lys Val His Lys Gly Gly Ser Ile Ala Asp Ile Ile Gln
1325 1330 1335

GAG GGC TGG AAT GTG CCT AAT CGC ATC CCT TAT GCG AGC GAA TTG CTA 4445
Glu Gly Trp Asn Val Pro Asn Arg Ile Pro Tyr Ala Ser Glu Leu Leu
1340 1345 1350

GTC AAG GAT AAT GAC CCT ATT GCG CAA GAT GTG TAT GCC AAA GAA AAA 4493
Val Lys Asp Asn Asp Pro Ile Ala Gln Asp Val Tyr Ala Lys Glu Lys
1355 1360 1365

GGC GTA ATC AAA TAC TAT GTT TTA GAG GCT AAC CAT TTA GAG CGC ACC 4541
Gly Val Ile Lys Tyr Tyr Val Leu Glu Ala Asn His Leu Glu Arg Thr
1370 1375 1380 1385

CAT GGG ATC AAA AAG GGC GAT ATG GTG AGT GAA AAA GGC TTG TTT GCG 4589
GGG CTG CTC ACT GAC CAA GAG CGT TAC AAT AAG ATC ATT GAC ACT TGG 3293
Gly Leu Leu Thr Asp Gln Glu Arg Tyr Asn Lys Ile Ile Asp Thr Trp
955 960 965

ACT GAA GTC AAT GAC AAA ATG AGT AAA GAA ATG ATG ACC GCT ATC GCG 3341
Thr Glu Val Asn Asp Lys Met Ser Lys Glu Met Met Thr Ala Ile Ala
970 975 980 985

CAA GAT AAA GAG GGC TTT AAC TCT ATT TAT ATG ATG GCA GAT AGC GGC 3389
Gln Asp Lys Glu Gly Phe Asn Ser Ile Tyr Met Met Ala Asp Ser Gly
990 995 1000

GCA AGG GGT AGC GCG GCG CAA ATC CGT CAG CTT TCA GCG ATG AGG GGT 3437
Ala Arg Gly Ser Ala Ala Gln Ile Arg Gln Leu Ser Ala Met Arg Gly
1005 1010 1015

CTT ATG ACA AAG CCG GAC GGC AGT ATC ATT GAA ACG CCC ATT ATT TCT 3485
Leu Met Thr Lys Pro Asp Gly Ser Ile Ile Glu Thr Pro Ile Ile Ser
1020 1025 1030

AAC TTT AAA GAG GGG TTG AAT GTC TTA GAA TAC TTC AAT TCC ACG CAT 3533
Asn Phe Lys Glu Gly Leu Asn Val Leu Glu Tyr Phe Asn Ser Thr His
1035 1040 1045

GGC GCT AGA AAG GGC TTA GCG GAT ACA GCG CTA AAA ACA GCG AAT GCG 3581
Gly Ala Arg Lys Gly Leu Ala Asp Thr Ala Leu Lys Thr Ala Asn Ala
1050 1055 1060 1065

GGG TAT TTG ACC AGA AAG CTC ATT GAT GTT TCG CAA AAT GTC AAG GTG 3629
Gly Tyr Leu Thr Arg Lys Leu Ile Asp Val Ser Gln Asn Val Lys Val
1070 1075 1080

GTG TCT GAT GAT TGC GGC ACG CAT GAA GGG ATT GAA ATC ACG GAT ATT 3677
Val Ser Asp Asp Cys Gly Thr His Glu Gly Ile Glu Ile Thr Asp Ile
1085 1090 1095

GCG GTG GGG AGT GAG CTG ATT GAA CCT TTA GAA GAG CGT ATT TTT GGG 3725
Ala Val Gly Ser Glu Leu Ile Glu Pro Leu Glu Glu Arg Ile Phe Gly
1100 1105 1110

CGC GTT TTA TTA GAA GAT GTG ATC GAT CCC ATT ACG AAT GAA ATC TTG 3773
Arg Val Leu Leu Glu Asp Val Ile Asp Pro Ile Thr Asn Glu Ile Leu
1115 1120 1125

CTT TAT GCG GAC ACT TTG ATT GAT GAA GAG GGT GCT AAA AAG GTG GTT 3821
Leu Tyr Ala Asp Thr Leu Ile Asp Glu Glu Gly Ala Lys Lys Val Val
1130 1135 1140 1145

GAA GCC GGG ATT AAA TCC ATT ACG ATC CGC ACC CCA GTA ACT TGT AAA 3869
Glu Ala Gly Ile Lys Ser Ile Thr Ile Arg Thr Pro Val Thr Cys Lys
1150 1155 1160

GCG CCA AAG GGC GTG TGC GCG AAA TGC TAT GGC TTG AAT TTG GGC GAA 3917
Ala Pro Lys Gly Val Cys Ala Lys Cys Tyr Gly Leu Asn Leu Gly Glu
ATT CAA GCG TTC CAT CCA AAG CTG ATT GAC GGC AAA GCG ATC CAA TTG 2621
Ile Gln Ala Phe His Pro Lys Leu Ile Asp Gly Lys Ala Ile Gln Leu
730 735 740 745

CAC CCG TTA GTG TGT TCA GCG TTC AAC GCC GAT TTT GAC GGG GAC CAA 2669
His Pro Leu Val Cys Ser Ala Phe Asn Ala Asp Phe Asp Gly Asp Gln
750 755 760

ATG GCG GTG CAT GTG CCT TTA AGC CAG GAA GCG ATC GCT GAA TGC AAG 2717
Met Ala Val His Val Pro Leu Ser Gln Glu Ala Ile Ala Glu Cys Lys
765 770 775

GTG CTG ATG CTA AGC TCT ATG AAT ATC CTT TTG CCT GCT AGC GGT AAG 2765
Val Leu Met Leu Ser Ser Met Asn Ile Leu Leu Pro Ala Ser Gly Lys
780 785 790

GCC GTA GCC ATT CCT AGC CAA GAT ATG GTT TTA GGG CTT TAT TAT CTT 2813
Ala Val Ala Ile Pro Ser Gln Asp Met Val Leu Gly Leu Tyr Tyr Leu
795 800 805

TCT TTA GAA AAG AGC GGG GTC AAG GGC GAG CAT AAG CTT TTT TCT AGC 2861
Ser Leu Glu Lys Ser Gly Val Lys Gly Glu His Lys Leu Phe Ser Ser
810 815 820 825

GTG AAT GAA ATC ATC ACC GCC ATT GAC ACG AAA GAA TTA GAC ATC CAC 2909
Val Asn Glu Ile Ile Thr Ala Ile Asp Thr Lys Glu Leu Asp Ile His
830 835 840

GCA AAG ATT AGG GTT TTA GAT CAA GGG AAT ATT ATC GCT ACG AGT GCA 2957
Ala Lys Ile Arg Val Leu Asp Gln Gly Asn Ile Ile Ala Thr Ser Ala
845 850 855

GGG CGC ATG ATC ATT AAG TCC ATT TTG CCT GAT TTT ATC CCT ACG GAT 3005
Gly Arg Met Ile Ile Lys Ser Ile Leu Pro Asp Phe Ile Pro Thr Asp
860 865 870

TTG TGG AAC AGA CCC ATG AAG AAA AAA GAT ATT GGC GTG CTT GTG GAT 3053
Leu Trp Asn Arg Pro Met Lys Lys Lys Asp Ile Gly Val Leu Val Asp
875 880 885

TAT GTG CAT AAA GTT GGC GGT ATC GGT ATT ACT GCA ACC TTT TTG GAT 3101
Tyr Val His Lys Val Gly Gly Ile Gly Ile Thr Ala Thr Phe Leu Asp
890 895 900 905

AAT TTA AAA ACG CTT GGC TTT AGG TAT GCG ACT AAG GCT GGT ATT TCT 3149
Asn Leu Lys Thr Leu Gly Phe Arg Tyr Ala Thr Lys Ala Gly Ile Ser
910 915 920

ATC TCT ATG GAG GAT ATT ATC ACG CCA AAA GAC AAG CAA AAA ATG GTG 3197
Ile Ser Met Glu Asp Ile Ile Thr Pro Lys Asp Lys Gln Lys Met Val
925 930 935

GAA AAA GCC AAA GTA GAG GTT AAA AAA ATC CAG CAA CAA TAC GAT CAA 3245
Glu Lys Ala Lys Val Glu Val Lys Lys Ile Gln Gln Gln Tyr Asp Gln
940 945 950
TCT GAT GCG AAA AAG AAA AAA CTC ATT AAG CGT TTG AAA GTG GTA GAA 1949
Ser Asp Ala Lys Lys Lys Lys Leu Ile Lys Arg Leu Lys Val Val Glu
510 515 520

AGC TTT TTA AAT TCT GGT AAT AGG CCT GAA TGG ATG ATG CTC ACG GTT 1997
Ser Phe Leu Asn Ser Gly Asn Arg Pro Glu Trp Met Met Leu Thr Val
525 530 535

TTA CCG GTA TTG CCA CCG GAT TTA AGG CCT TTA GTC GCG CTA GAT GGC 2045
Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Ala Leu Asp Gly
540 545 550

GGG AAG TTT GCA GTC AGC GAT GTG AAT GAA TTG TAT CGT CGT GTC ATC 2093
Gly Lys Phe Ala Val Ser Asp Val Asn Glu Leu Tyr Arg Arg Val Ile
555 560 565

AAT CGT AAC CAA CGC TTG AAA CGC TTA ATG GAG CTT GGA GCG CCA GAA 2141
Asn Arg Asn Gln Arg Leu Lys Arg Leu Met Glu Leu Gly Ala Pro Glu
570 575 580 585

ATC ATT GTG CGC AAT GAA AAA AGG ATG TTG CAA GAA GCC GTG GAT GTG 2189
Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp Val
590 595 600

CTT TTT GAT AAC GGC CGC AGC ACT AAT GCG GTT AAA GGG GCT AAC AAA 2237
Leu Phe Asp Asn Gly Arg Ser Thr Asn Ala Val Lys Gly Ala Asn Lys
605 610 615

CGC CCT TTA AAA TCG CTC AGT GAA ATC ATT AAA GGC AAG CAG GGG CGT 2285
Arg Pro Leu Lys Ser Leu Ser Glu Ile Ile Lys Gly Lys Gln Gly Arg
620 625 630

TTC AGG CAA AAC CTT TTA GGT AAG CGC GTG GAT TTT TCA GGC AGA AGC 2333
Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Phe Ser Gly Arg Ser
635 640 645

GTG ATT GTG GTT GGG CCT AAT CTC AAA ATG GAT GAA TGC GGG TTG CCT 2381
Val Ile Val Val Gly Pro Asn Leu Lys Met Asp Glu Cys Gly Leu Pro
650 655 660 665

AAA AAC ATG GCG TTA GAA CTC TTC AAA CCG CAT TTG TTA TCC AAG CTT 2429
Lys Asn Met Ala Leu Glu Leu Phe Lys Pro His Leu Leu Ser Lys Leu
670 675 680

GAA GAG AGA GGC TAT GCC ACC ACG CTC AAA CAG GCT AAA CGC ATG ATT 2477
Glu Glu Arg Gly Tyr Ala Thr Thr Leu Lys Gln Ala Lys Arg Met Ile
685 690 695

GAG CAA AAA AGC AAT GAA GTA TGG GAG TGC TTG CAA GAA ATC ACA GAG 2525
Glu Gln Lys Ser Asn Glu Val Trp Glu Cys Leu Gln Glu Ile Thr Glu
700 705 710

GGG TAT CCG GTG CTA CTC AAC CGC GCT CCT ACC TTG CAC AAG CAA TCC 2573
Gly Tyr Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Lys Gln Ser
715 720 725
Asp Val Asp Glu Asp Gly Ala Pro Lys Pro Ile Val Ile Lys Glu Asp
285 290 295 

GAC AGG CCT AAA GAC TTT AGC TCT TTC CAG CTC ACA CTA GCT AGC CCT 1325
Asp Arg Pro Lys Asp Phe Ser Ser Phe Gln Leu Thr Leu Ala Ser Pro
300 305 310

GAA AAA ATC CAT TCT TGG AGT TAT GGG GAA GTT AAA AAG CCA GAA ACG 1373
Glu Lys Ile His Ser Trp Ser Tyr Gly Glu Val Lys Lys Pro Glu Thr
315 320 325

ATC AAT TAT CGC ACC CTA AAA CCT GAA CGA GAC GGC TTG TTT TGC ATG 1421
Ile Asn Tyr Arg Thr Leu Lys Pro Glu Arg Asp Gly Leu Phe Cys Met
330 335 340 345

AAA ATC TTT GGC CCC ACT AAA GAT TAT GAA TGC TTG TGC GGC AAA TAC 1469
Lys Ile Phe Gly Pro Thr Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr
350 355 360

AAA AAG CCT CGC TTC AAA GAC ATT GGC ACA TGC GAA AAA TGC GGC GTG 1517
Lys Lys Pro Arg Phe Lys Asp Ile Gly Thr Cys Glu Lys Cys Gly Val
365 370 375

GCG ATC ACG CAC TCC AAA GTC AGG CGT TTT AGA ATG GGG CAT ATT GAA 1565
Ala Ile Thr His Ser Lys Val Arg Arg Phe Arg Met Gly His Ile Glu
380 385 390

TTG GCC ACT CCT GTA GCG CAT ATC TGG TAT GTT AAT TCC TTG CCT AGC 1613
Leu Ala Thr Pro Val Ala His Ile Trp Tyr Val Asn Ser Leu Pro Ser
395 400 405

CGT ATC GGC ACG CTT TTA GGC GTT AAG ATG AAA GAC TTA GAG CGC GTG 1661
Arg Ile Gly Thr Leu Leu Gly Val Lys Met Lys Asp Leu Glu Arg Val
410 415 420 425

TTG TAT TAT GAA GCT TAT ATC GTT AAA GAA CCA GGC GAA GCC GCT TAT 1709
Leu Tyr Tyr Glu Ala Tyr Ile Val Lys Glu Pro Gly Glu Ala Ala Tyr
430 435 440

GAC AAT GAA GGC ACT AAG CTT GTG ATG AAA TAC GAT ATT TTG AAT GAA 1757
Asp Asn Glu Gly Thr Lys Leu Val Met Lys Tyr Asp Ile Leu Asn Glu
445 450 455

GAG CAG TAT CAA AAT ATC TCA CGA AGA TAC GAA GAC AGG GGC TTT GTA 1805
Glu Gln Tyr Gln Asn Ile Ser Arg Arg Tyr Glu Asp Arg Gly Phe Val
460 465 470

GCG CAA ATG GGC GGT GAA GCG ATC AAG GAT TTG TTA GAA GAA ATT GAT 1853
Ala Gln Met Gly Gly Glu Ala Ile Lys Asp Leu Leu Glu Glu Ile Asp
475 480 485

TTG ATC ACC TTA TTG CAG AGT TTG AAA GAA GAA GTG AAA GAC ACC AAT 1901
Leu Ile Thr Leu Leu Gln Ser Leu Lys Glu Glu Val Lys Asp Thr Asn
490 495 500 505



-512- 



BNSDOCID: <WO 9S43478A1J_> 



wo 98/43478 



PCTAJS98/06371 



60 65 70 

TTT GCC AAA GAA TTG CGT GCT AAA ATG CTA GAA AWC GCT AAC GCT ATT 653
Phe Ala Lys Glu Leu Arg Ala Lys Met Leu Glu Xaa Ala Asn Ala Ile
75 80 85

AAT GAA AAA GAC CCC TTG ACA ATC CAT GCG CTT GAG AAT TGT TCT GAT 701
Asn Glu Lys Asp Pro Leu Thr Ile His Ala Leu Glu Asn Cys Ser Asp
90 95 100 105

GAA GAG CTT TTG GAA TAC GCA AAA GAT TGG AGC AAG GGC GTT AAG ATG 749
Glu Glu Leu Leu Glu Tyr Ala Lys Asp Trp Ser Lys Gly Val Lys Met
110 115 120

GCT ATC CCT GTG TTT GAA GGC ATC TCG CAA GAA AAA TTT TAT AAG CTA 797
Ala Ile Pro Val Phe Glu Gly Ile Ser Gln Glu Lys Phe Tyr Lys Leu
125 130 135

TTT GAA TTA GCT AAG ATC GCT ATG GAT GGC AAA ATG GAT CTG TAT GAC 845
Phe Glu Leu Ala Lys Ile Ala Met Asp Gly Lys Met Asp Leu Tyr Asp
140 145 150

GGA CGC ACA GGC GAG AAA ATG AGG GAG CGC GTG AAT GTG GGC TAC ATG 893
Gly Arg Thr Gly Glu Lys Met Arg Glu Arg Val Asn Val Gly Tyr Met
155 160 165

TAT ATG ATC AAA CTC CAC CAT TTA GTG GAT GAA AAA GTC CAT GCC AGA 941
Tyr Met Ile Lys Leu His His Leu Val Asp Glu Lys Val His Ala Arg
170 175 180 185

AGC ACA GGC CCT TAT AGC TTA GTA ACG CAC CAG CCC GTG GGG GGT AAA 989
Ser Thr Gly Pro Tyr Ser Leu Val Thr His Gln Pro Val Gly Gly Lys
190 195 200

GCG CTC TTT GGG GGT CAA AGG TTT GGG GAA ATG GAA GTG TGG GCC TTG 1037
Ala Leu Phe Gly Gly Gln Arg Phe Gly Glu Met Glu Val Trp Ala Leu
205 210 215

GAA GCT TAT GGC GCA GCG CAC ACT CTA AAA GAA ATG CTC ACC ATT AAA 1085
Glu Ala Tyr Gly Ala Ala His Thr Leu Lys Glu Met Leu Thr Ile Lys
220 225 230

TCC GAT GAT ATT AGA GGC AGA GAG AAC GCT TAT AGG GCT ATC GCT AAA 1133
Ser Asp Asp Ile Arg Gly Arg Glu Asn Ala Tyr Arg Ala Ile Ala Lys
235 240 245

GGT GAG CAA GTG GGC GAG AGT GAA ATC CCT GAG ACT TTC TAT GTT TTG 1181
Gly Glu Gln Val Gly Glu Ser Glu Ile Pro Glu Thr Phe Tyr Val Leu
250 255 260 265

ACT AAA GAA TTG CAA TCG CTC GCT TTG GAT ATT AAT ATT TTT GGG GAC 1229
Thr Lys Glu Leu Gln Ser Leu Ala Leu Asp Ile Asn Ile Phe Gly Asp
270 275 280

GAT GTG GAT GAG GAT GGA GCA CCT AAA CCC ATT GTC ATT AAA GAA GAT 1277
Glu Ala Leu Leu Lys Glu Ala Ala Glu Lys Arg Arg Glu Ile Ile Ala
65 70 75 80 

Glu Ala Ile Gln Lys Ala Thr Glu Ser Tyr Asp Ala Val Ile Lys Gln
85 90 95

Lys Glu Asn Glu Leu Asn Gln Glu Phe Glu Ala Phe Ala Lys Gln Leu
100 105 110

Gln Asn Glu Lys Gln Ala Leu Lys Glu Gln Leu Gln Ala Gln Met Pro
115 120 125

Val Phe Glu Asp Glu Leu Asn Lys Arg Val Ala Met Gly Leu Gly Ser
130 135 140



(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5832 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 387...5777
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:307:

AAGCTTGATA TGGAGCATTT TGATCGCTTG ACCATGCTCA ATAGAGAAGA ATTGTTGCGC 60 

GTTACTCGCT CCTTTCTCAA GCGATTTTAG AAGAGCCTTT CAGCCATAAC GGCAAGGATT 120

ATAAAGAAGG CGATCAAATC CCTAAAGAAG AAATCGCTTC AATCAACCGC TTCACTTTGG 180

CTAGTTTGGT CAAAAAGTAT TCTAAAGAAG TGCAAAACCA CTATGAAATC ACTAAAAACA 240

ATTTCTTAGA GCAAAAGAAA GTTTTGGGCG AAGAGCATGA AGAAAAGCTT TCTATTTTAG 300

AAAAAGATGA TATTTTGCCT AATGGCGTGA TCAAAAAAGT CAAGCTCTAT ATCGCTACAA 360

AACGAAAGCT TAAAGTGGGC GATAAA ATG GCA GGA AGG CAT GGG AAT AAA GGG 413
Met Ala Gly Arg His Gly Asn Lys Gly
1 5

ATT GTG TCT AAT ATC GTG CCG GTT GCG GAT ATG CCT TAT ACC GCT GAT 461
Ile Val Ser Asn Ile Val Pro Val Ala Asp Met Pro Tyr Thr Ala Asp
10 15 20 25

GGC GAG CCT GTA GAT ATT GTT TTA AAC CCT TTA GGC GTG CCA AGC CGC 509
Gly Glu Pro Val Asp Ile Val Leu Asn Pro Leu Gly Val Pro Ser Arg
30 35 40

ATG AAT ATC GGG CAG ATT TTA GAA ATG CAT TTA GGC TTA GTG GGG AAA 557
Met Asn Ile Gly Gln Ile Leu Glu Met His Leu Gly Leu Val Gly Lys
45 50 55

GAA TTT GGG AAG CAA ATC GCT CGC ATG CTA GAG GAT AAA ACC AAA GAT 605
Glu Phe Gly Lys Gln Ile Ala Arg Met Leu Glu Asp Lys Thr Lys Asp
TTA TTG TTA TGG GCG ATG AAT GTT TGG GTG TAT AGG CCT TTG TTG GCT 153
Leu Leu Leu Trp Ala Met Asn Val Trp Val Tyr Arg Pro Leu Leu Ala 
20 25 30 

TTT ATG GAT AAC AGA CAG GCA GAG ATA AAG GAT AGC TTG GCT AAA ATC 201
Phe Met Asp Asn Arg Gln Ala Glu Ile Lys Asp Ser Leu Ala Lys Ile
35 40 45 50

AAA ACG GAT AAT GCC CAA AGT GTG GAG ATT GGC CAT CAA ATT GAG GCT 249
Lys Thr Asp Asn Ala Gln Ser Val Glu Ile Gly His Gln Ile Glu Ala
55 60 65

CTT CTT AAA GAA GCG GCT GAA AAA CGC AGA GAA ATA ATA GCA GAA GCG 297
Leu Leu Lys Glu Ala Ala Glu Lys Arg Arg Glu Ile Ile Ala Glu Ala
70 75 80

ATT CAA AAA GCC ACA GAG TCC TAT GAC GCT GTG ATC AAG CAA AAA GAG 345
Ile Gln Lys Ala Thr Glu Ser Tyr Asp Ala Val Ile Lys Gln Lys Glu
85 90 95

AAC GAA CTC AAT CAA GAG TTT GAA GCG TTT GCG AAG CAA TTA CAA AAT 393
Asn Glu Leu Asn Gln Glu Phe Glu Ala Phe Ala Lys Gln Leu Gln Asn
100 105 110

GAA AAG CAA GCG CTA AAA GAG CAG TTG CAA GCG CAA ATG CCG GTA TTT 441
Glu Lys Gln Ala Leu Lys Glu Gln Leu Gln Ala Gln Met Pro Val Phe
115 120 125 130

GAA GAC GAG TTA AAC AAG CGT GTG GCT ATG GGT TTA GGG AGT TGATGAATG 492 
Glu Asp Glu Leu Asn Lys Arg Val Ala Met Gly Leu Gly Ser 
135 140 

TTTGTAGTTA AAATGGTGTT AGGGTTTTTG ATCCTTTTA 531 

(2) INFORMATION FOR SEQ ID NO: 306:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 144 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:306:

Met Asn Ile Ser Val Asn Pro Tyr Leu Met Ala Val Val Phe Val Val
1 5 10 15

Phe Val Leu Leu Leu Trp Ala Met Asn Val Trp Val Tyr Arg Pro Leu
20 25 30

Leu Ala Phe Met Asp Asn Arg Gln Ala Glu Ile Lys Asp Ser Leu Ala
35 40 45

Lys Ile Lys Thr Asp Asn Ala Gln Ser Val Glu Ile Gly His Gln Ile
50 55 60
Lys Gln Asn Lys Glu Met Ser Glu Leu Leu Thr Lys Leu Ser Gln Asp

130 135 140 

Leu Val Ser Gln Ile Ala Leu Ile Gln Lys Ala Leu Lys Glu Gln Glu
145 150 155 160

Glu Lys Ala Glu Lys Pro Leu Lys Ser Asn Ala Pro Ala Asn Lys Thr
165 170 175

Pro Ser Leu Lys Ala Glu Ser Pro Lys Asn Gln Glu Gly Lys Thr Gln
180 185 190

Glu Lys Ala Lys Ile Glu Phe Asp Lys Asp Leu Ser Lys Gln Lys Glu
195 200 205

Ile Phe Gln Glu Ala Leu Ser Phe Phe Lys Asn Lys Ser Tyr Ala Glu
210 215 220

Ala Lys Glu Arg Leu Leu Trp Leu Glu Ala Asn Ser Tyr Arg Leu Tyr
225 230 235 240

Tyr Val Arg Tyr Val Leu Gly Glu Val Ala Tyr Gly Glu Lys Arg Tyr
245 250 255

Arg Glu Ala Ile Lys Tyr Tyr Lys Glu Ser Ala Leu Leu Asn Lys Lys
260 265 270

Ala Ser Tyr Met Pro Val Leu Leu Trp His Thr Ala Trp Ser Phe Lys
275 280 285

Lys Ile Lys Asp Asp Gln Asn Tyr Tyr Lys Phe Leu Asn Thr Leu Gln
290 295 300

His Leu Tyr Pro Ser Ser Glu Gln Ala Lys Met Ala Gln Lys Ile Leu
305 310 315 320

Glu Asn Lys Glu Lys His His His Ala Lys Pro 
325 330 

(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 52...483
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:

AACAAATGCG AGTTTCAAAT ATTTTGTAGG ATTTTAGGAA AGAAATAGGT T ATG AAT 57
Met Asn
1

ATA TCG GTT AAC CCC TAT TTA ATG GCG GTC GTT TTT GTG GTG TTT GTG 105
Ile Ser Val Asn Pro Tyr Leu Met Ala Val Val Phe Val Val Phe Val
5 10 15
TAT TAT GTG CGT TAT GTT CTT GGA GAA GTG GCT TAT GGG GAA AAG AGA 830

Tyr Tyr Val Arg Tyr Val Leu Gly Glu Val Ala Tyr Gly Glu Lys Arg 
240 245 250 255 

TAC AGA GAA GCG ATC AAG TAT TAC AAA GAG AGC GCT CTT TTA AAC AAA 878
Tyr Arg Glu Ala Ile Lys Tyr Tyr Lys Glu Ser Ala Leu Leu Asn Lys
260 265 270

AAA GCG TCT TAC ATG CCT GTG CTT TTG TGG CAT ACG GCA TGG TCG TTT 926
Lys Ala Ser Tyr Met Pro Val Leu Leu Trp His Thr Ala Trp Ser Phe
275 280 285

AAA AAA ATC AAA GAC GAT CAA AAC TAT TAT AAA TTT TTA AAC ACT TTG 974
Lys Lys Ile Lys Asp Asp Gln Asn Tyr Tyr Lys Phe Leu Asn Thr Leu
290 295 300

CAA CAC TTG TAT CCT TCA AGC GAA CAA GCT AAA ATG GCG CAA AAA ATC 1022
Gln His Leu Tyr Pro Ser Ser Glu Gln Ala Lys Met Ala Gln Lys Ile
305 310 315

TTA GAA AAC AAG GAG AAA CAC CAC CAT GCA AAA CCA TGATTTAGAG TCAATC 1074 
Leu Glu Asn Lys Glu Lys His His His Ala Lys Pro 
320 325 330 

AAACAAGCCG CTTTGATTGA ATATGAAGTG AGAGAACA 1112 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

Met Lys Arg Leu Phe Phe Ile Pro Phe Ile Ala Pro Phe Phe Leu Asn
1 5 10 15

Gly Glu Pro Ser Ala Phe Asp Leu Gln Ser Gly Ala Thr Lys Lys Glu
20 25 30

Leu Lys Gln Leu Gln Ile Asn Ser Lys Asn Phe Ser Asn Ile Leu Thr
35 40 45

Lys Ile His Ser Gln Val Glu Ala Asn Thr Gln Ala Gln Glu Gly Leu
50 55 60

Arg Ser Val Tyr Glu Gly Gln Ala Asn Lys Ile Lys Asp Leu Asn Asn
65 70 75 80

Ala Ile Leu Ser Gln Glu Glu Ser Leu Arg Ala Leu Lys Ala Ser Gln
85 90 95

Glu Val Gln Ala Asn Thr Leu Lys Gln Gln Ser Gln Thr Leu Glu Asp
100 105 110

Leu Arg Asn Glu Ile His Ala Asn Gln Gln Ala Ile Gln Gln Leu Asp
115 120 125
Asn Gly Glu Pro Ser Ala Phe Asp Leu Gln Ser Gly Ala Thr Lys Lys
20 25 30 

GAA CTC AAG CAG TTG CAA ATC AAT AGT AAG AAT TTT TCT AAT ATT TTG 206
Glu Leu Lys Gln Leu Gln Ile Asn Ser Lys Asn Phe Ser Asn Ile Leu
35 40 45

ACC AAA ATC CAT TCG CAA GTA GAG GCT AAC ACT CAA GCT CAA GAG GGT 254
Thr Lys Ile His Ser Gln Val Glu Ala Asn Thr Gln Ala Gln Glu Gly
50 55 60

TTG AGA AGC GTT TAT GAG GGG CAG GCT AAT AAG ATT AAA GAT CTC AAT 302
Leu Arg Ser Val Tyr Glu Gly Gln Ala Asn Lys Ile Lys Asp Leu Asn
65 70 75

AAC GCT ATC CTT TCC CAA GAA GAA TCC TTA CGA GCC TTA AAA GCT TCG 350
Asn Ala Ile Leu Ser Gln Glu Glu Ser Leu Arg Ala Leu Lys Ala Ser
80 85 90 95

CAA GAA GTG CAG GCT AAC ACG CTT AAG CAG CAA TCG CAA ACT TTA GAG 398
Gln Glu Val Gln Ala Asn Thr Leu Lys Gln Gln Ser Gln Thr Leu Glu
100 105 110

GAT TTG AGG AAT GAG ATT CAC GCT AAC CAG CAA GCT ATC CAG CAG TTA 446
Asp Leu Arg Asn Glu Ile His Ala Asn Gln Gln Ala Ile Gln Gln Leu
115 120 125

GAC AAG CAA AAT AAA GAG ATG AGT GAA TTA TTG ACC AAG TTA AGC CAG 494
Asp Lys Gln Asn Lys Glu Met Ser Glu Leu Leu Thr Lys Leu Ser Gln
130 135 140

GAT TTG GTT TCA CAA ATC GCC TTA ATC CAA AAA GCT CTC AAA GAA CAA 542
Asp Leu Val Ser Gln Ile Ala Leu Ile Gln Lys Ala Leu Lys Glu Gln
145 150 155

GAG GAA AAA GCT GAA AAG CCG CTC AAA TCA AAC GCT CCG GCT AAT AAA 590
Glu Glu Lys Ala Glu Lys Pro Leu Lys Ser Asn Ala Pro Ala Asn Lys
160 165 170 175

ACC CCC TCT TTG AAA GCC GAA TCC CCA AAA AAT CAA GAG GGA AAA ACT 638
Thr Pro Ser Leu Lys Ala Glu Ser Pro Lys Asn Gln Glu Gly Lys Thr
180 185 190

CAA GAA AAG GCG AAA ATT GAG TTT GAT AAA GAC TTG TCT AAG CAA AAA 686
Gln Glu Lys Ala Lys Ile Glu Phe Asp Lys Asp Leu Ser Lys Gln Lys
195 200 205

GAG ATC TTT CAA GAA GCT CTG TCT TTT TTT AAA AAT AAA TCC TAT GCA 734
Glu Ile Phe Gln Glu Ala Leu Ser Phe Phe Lys Asn Lys Ser Tyr Ala
210 215 220

GAA GCC AAA GAG CGT TTG TTG TGG TTA GAA GCC AAT AGT TAC AGA CTT 782
Glu Ala Lys Glu Arg Leu Leu Trp Leu Glu Ala Asn Ser Tyr Arg Leu
225 230 235
Ile Ser Lys Ile His
65 

GAATATCAA 319 
(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 68 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:

Met Asn Ser Ser Asn Leu Lys Asn Trp Leu Phe Pro Thr Ile Cys Phe
1 5 10 15

Phe Leu Phe Cys Tyr Ile Leu Ile Phe Leu Met Phe Phe Met Phe Lys
20 25 30

Ser Leu Gln Ser Gln Ser Phe Gly Ser Val Ala Glu Thr Gly Lys Lys
35 40 45

Pro Ile Thr Thr Thr Lys Lys Phe Gly Lys Glu Leu Gln Lys Gln Ile
50 55 60

Ser Lys Ile His
65

(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1112 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 66...1058
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:303:

AAAAACTAGA GAGTGTTATA AAGAAAACAG AAGAGTGGAT GTCAAATTAA TGAAGTAATT 60
CTAGG ATG AAA AGG CTT TTT TTT ATC CCT TTT ATC GCT CCC TTT TTT CTC 110
Met Lys Arg Leu Phe Phe Ile Pro Phe Ile Ala Pro Phe Phe Leu
1 5 10 15

AAT GGG GAG CCT TCA GCG TTT GAT TTG CAA AGT GGG GCT ACC AAA AAA 158
Thr Glu Lys Gly Tyr Thr Asn Ala Tyr Ile Asn Leu Gly Ile Met Tyr

130 135 140 

Met Glu Gly Arg Gly Val Pro Ser Asn Tyr Val Lys Ala Thr Glu Cys 
145 150 155 160 

Phe Arg Lys Ala Met His Lys Gly Asn Val Glu Ala Tyr Ile Leu Leu
165 170 175

Gly Asp Ile Tyr Tyr Ser Gly Asn Asp Gln Leu Gly Ile Glu Pro Asp
180 185 190

Lys Asp Lys Ala Ile Val Tyr Tyr Lys Met Ala Ala Asp Met Ser Ser
195 200 205

Ser Arg Ala Tyr Glu Gly Leu Ala Glu Ser Tyr Gln Tyr Gly Leu Gly
210 215 220

Val Glu Lys Asp Lys Lys Lys Ala Glu Glu Tyr Met Gln Lys Ala Cys
225 230 235 240

Asp Phe Asp Ile Asp Lys Asn Cys Lys Lys Lys Asn Thr Ser Ser Arg
245 250 255

(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 66...269
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

TGGGGGTTGT GGTTGCTCAT GTTCGCATGG GTAGTAAGGT ATAGGAGTAT TTAAAAGGCA 60
AGGTC ATG AAT AGT TCT AAT CTC AAA AAT TGG CTA TTC CCT ACC ATT TGC 110
Met Asn Ser Ser Asn Leu Lys Asn Trp Leu Phe Pro Thr Ile Cys
1 5 10 15

TTT TTT TTA TTT TGT TAT ATT TTA ATT TTT TTA ATG TTC TTT ATG TTT 158
Phe Phe Leu Phe Cys Tyr Ile Leu Ile Phe Leu Met Phe Phe Met Phe
20 25 30

AAA AGT TTG CAA TCG CAA TCG TTT GGC TCT GTG GCA GAA ACC GGA AAA 206
Lys Ser Leu Gln Ser Gln Ser Phe Gly Ser Val Ala Glu Thr Gly Lys
35 40 45

AAA CCC ATC ACC ACC ACC AAG AAA TTT GGT AAG GAA TTG CAA AAA CAG 254
Lys Pro Ile Thr Thr Thr Lys Lys Phe Gly Lys Glu Leu Gln Lys Gln
50 55 60

ATT TCA AAA ATC CAT TAACTTTTTT TCTTTTTTGC CGATACTTGC TGTAATGGAA T 310 
TTT AGA AAA GCG ATG CAT AAG GGT AAT GTA GAA GCT TAT ATC CTT TTA 586
Phe Arg Lys Ala Met His Lys Gly Asn Val Glu Ala Tyr Ile Leu Leu
165 170 175

GGG GAT ATT TAT TAT AGT GGG AAT GAT CAA TTG GGT ATT GAG CCA GAC 634
Gly Asp Ile Tyr Tyr Ser Gly Asn Asp Gln Leu Gly Ile Glu Pro Asp
180 185 190

AAA GAT AAG GCG ATT GTC TAT TAT AAA ATG GCG GCT GAT ATG AGC TCT 682
Lys Asp Lys Ala Ile Val Tyr Tyr Lys Met Ala Ala Asp Met Ser Ser
195 200 205

TCT AGA GCT TAT GAA GGG TTA GCA GAG TCT TAT CAG TAT GGG TTA GGC 730
Ser Arg Ala Tyr Glu Gly Leu Ala Glu Ser Tyr Gln Tyr Gly Leu Gly
210 215 220

GTG GAA AAA GAT AAG AAA AAG GCT GAA GAA TAC ATG CAA AAA GCA TGC 778
Val Glu Lys Asp Lys Lys Lys Ala Glu Glu Tyr Met Gln Lys Ala Cys
225 230 235 240

GAT TTT GAC ATT GAT AAA AAT TGT AAG AAA AAG AAC ACT TCA AGC CGA 826
Asp Phe Asp Ile Asp Lys Asn Cys Lys Lys Lys Asn Thr Ser Ser Arg
245 250 255

TAACTCTCAA ACTTGGGCTT GATTAGGATT TTTGTTTTAT TTTAAGTAGC ATG 879

(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 256 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:

Met Gly Tyr Ala Ser Lys Leu Ala Leu Lys Ile Cys Leu Ala Ser Leu
1 5 10 15

Cys Leu Phe Ser Ala Leu Gly Ala Glu His Leu Glu Gln Lys Arg Asn
20 25 30

Tyr Ile Tyr Xaa Gly Glu Glu Ala Tyr Asn Asn Lys Glu Tyr Glu Arg
35 40 45

Ala Ala Ser Phe Tyr Lys Ser Ala Ile Lys Asn Gly Glu Pro Leu Ala
50 55 60

Tyr Val Leu Leu Gly Ile Met Tyr Glu Asn Gly Arg Gly Val Pro Lys
65 70 75 80

Asp Glu Lys Lys Ala Ala Glu Tyr Phe Gln Lys Ala Val Asp Asn Asp
85 90 95

Ile Pro Arg Gly Tyr Asn Asn Leu Gly Val Met Tyr Lys Glu Gly Arg
100 105 110

Gly Val Pro Lys Asp Glu Lys Lys Ala Val Glu Tyr Phe Arg Ile Ala
115 120 125
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 59...826
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 
TTTTAAGATT GGTAGCCATT GGCATTATGT TTGATCTTAT TAAAGCAGAG GAGTAACA 58 



ATG GGA TAC GCA AGC AAA TTA GCC TTG AAG ATT TGT TTG GCA AGT TTA 106 
Met Gly Tyr Ala Ser Lys Leu Ala Leu Lys Ile Cys Leu Ala Ser Leu
1 5 10 15

TGT TTA TTT AGC GCT CTT GGT GCA GAA CAC CTT GAA CAA AAA AGG AAC 154
Cys Leu Phe Ser Ala Leu Gly Ala Glu His Leu Glu Gln Lys Arg Asn
20 25 30

TAT ATT TAT AAM GGG GAG GAA GCC TAT AAT AAT AAG GAA TAT GAG CGG 202
Tyr Ile Tyr Xaa Gly Glu Glu Ala Tyr Asn Asn Lys Glu Tyr Glu Arg
35 40 45

GCG GCT TCT TTT TAT AAG AGC GCG ATT AAA AAT GGC GAG CCG CTT GCT 250
Ala Ala Ser Phe Tyr Lys Ser Ala Ile Lys Asn Gly Glu Pro Leu Ala
50 55 60

TAT GTT CTT TTA GGG ATC ATG TAT GAA AAT GGT AGG GGT GTG CCT AAA 298
Tyr Val Leu Leu Gly Ile Met Tyr Glu Asn Gly Arg Gly Val Pro Lys
65 70 75 80

GAT GAA AAG AAA GCG GCT GAA TAT TTT CAA AAA GCG GTT GAT AAC GAT 346
Asp Glu Lys Lys Ala Ala Glu Tyr Phe Gln Lys Ala Val Asp Asn Asp
85 90 95

ATA CCT AGA GGG TAT AAC AAT TTA GGC GTG ATG TAT AAA GAG GGT AGA 394
Ile Pro Arg Gly Tyr Asn Asn Leu Gly Val Met Tyr Lys Glu Gly Arg
100 105 110

GGT GTG CCT AAA GAT GAA AAG AAA GCC GTG GAG TAT TTT AGA ATA GCT 442
Gly Val Pro Lys Asp Glu Lys Lys Ala Val Glu Tyr Phe Arg Ile Ala
115 120 125

ACC GAG AAG GGC TAT ACT AAC GCC TAT ATA AAC TTA GGC ATC ATG TAT 490
Thr Glu Lys Gly Tyr Thr Asn Ala Tyr Ile Asn Leu Gly Ile Met Tyr
130 135 140

ATG GAG GGT AGG GGA GTT CCA AGC AAC TAT GTG AAA GCG ACA GAG TGC 538
Met Glu Gly Arg Gly Val Pro Ser Asn Tyr Val Lys Ala Thr Glu Cys
145 150 155 160
TAC AGC AAT GGG TGT CAT TTG CTA GGG AAT TTA TAT TAC AGC GGG CAA 457

Tyr Ser Asn Gly Cys His Leu Leu Gly Asn Leu Tyr Tyr Ser Gly Gln
100 105 110

GGG GTG TCC CAA AAC ACC AAT AAA GCC CTA CAA TAC TAC TCT AAA GCG 505
Gly Val Ser Gln Asn Thr Asn Lys Ala Leu Gln Tyr Tyr Ser Lys Ala
115 120 125

TGC GAT TTG AAA TAC GCT GAA GGG TGC GCG ACT TAGGGGGGAT TTATCATGAT 558
Cys Asp Leu Lys Tyr Ala Glu Gly Cys Ala Thr
130 135

GGTAAAGTGG TAAGTAGGGA TTTTAAAAAA GCGGTGGAAT AT 600

(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 138 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: 



Met Leu Glu Asn Val Lys Lys Ser Phe Phe Arg Val Leu Cys Leu Gly
1 5 10 15

Ala Leu Cys Leu Gly Gly Leu Met Ala Glu Gln Asp Pro Lys Glu Leu
20 25 30

Val Gly Leu Gly Ala Lys Ser Tyr Lys Glu Lys Asp Phe Thr Gln Ala
35 40 45

Lys Lys Tyr Phe Glu Lys Ala Cys Asp Leu Lys Glu Asn Ser Gly Cys
50 55 60

Phe Asn Leu Gly Val Leu Tyr Tyr Gln Gly Gln Gly Val Glu Lys Asn
65 70 75 80

Leu Lys Lys Ala Ala Ser Phe Tyr Ala Lys Ala Cys Asp Leu Asn Tyr
85 90 95

Ser Asn Gly Cys His Leu Leu Gly Asn Leu Tyr Tyr Ser Gly Gln Gly
100 105 110

Val Ser Gln Asn Thr Asn Lys Ala Leu Gln Tyr Tyr Ser Lys Ala Cys
115 120 125

Asp Leu Lys Tyr Ala Glu Gly Cys Ala Thr
130 135



(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 879 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
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lie 


Ile Glu Lys Asp Gly Ile Lys Trp Ala Val Gln Val Lys Arg Tyr
115 120 125

Ser His Lys Val Ser Pro Lys Ala Ile Gln Glu Val Val Ser Ser Lys
130 135 140

Ala Tyr Tyr Ala Cys Glu Lys Ala Cys Val Ile Thr Asn Ser Tyr Phe
145 150 155 160

Thr Gln Ala Ala Gln Lys Leu Ala Gln Ala Asn Glu Val Leu Leu Ile
165 170 175

Asp Arg Asp Glu Trp Val Arg Phe Leu Asn Glu Lys Arg
180 185



(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 600 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 125...538
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:297: 

CAAAGAGTGG GATAAAATCA CAGAAATTTG TAAGAGAGCG CTAGCTTTAA GATAACAAAA 60 
AGATCATGGC ATTTTGTATT TGCTTAATAA CACTATAATA AAATTTTTAA TAAGGAGATA 120

CATC ATG TTA GAA AAT GTC AAA AAG TCC TTT TTT AGG GTT TTG TGC TTG 169
Met Leu Glu Asn Val Lys Lys Ser Phe Phe Arg Val Leu Cys Leu
1 5 10 15

GGT GCG TTG TGT TTA GGG GGG CTA ATG GCA GAG CAA GAC CCT AAA GAG 217
Gly Ala Leu Cys Leu Gly Gly Leu Met Ala Glu Gln Asp Pro Lys Glu
20 25 30

CTT GTG GGT TTG GGG GCA AAG AGC TAC AAA GAG AAA GAT TTC ACT CAA 265
Leu Val Gly Leu Gly Ala Lys Ser Tyr Lys Glu Lys Asp Phe Thr Gln
35 40 45

GCG AAG AAA TAT TTT GAG AAA GCG TGC GAT TTG AAA GAA AAT AGC GGG 313
Ala Lys Lys Tyr Phe Glu Lys Ala Cys Asp Leu Lys Glu Asn Ser Gly
50 55 60

TGT TTT AAT TTA GGG GTG CTT TAT TAT CAA GGG CAA GGG GTG GAA AAG 361
Cys Phe Asn Leu Gly Val Leu Tyr Tyr Gln Gly Gln Gly Val Glu Lys
65 70 75

AAC TTG AAA AAA GCC GCC TCC TTT TAC GCT AAA GCT TGC GAT TTG AAT 409
Asn Leu Lys Lys Ala Ala Ser Phe Tyr Ala Lys Ala Cys Asp Leu Asn






WO 98/43478



PCT/US98/06371 



85 90 95 

GTG AGC ATC ACG CAA AAA AGC GGC GAT TAT GGA GCG GAT TTG ATT ATA 392
Val Ser Ile Thr Gln Lys Ser Gly Asp Tyr Gly Ala Asp Leu Ile Ile
100 105 110

GAA AAA GAC GGC ATC AAG TGG GCG GTT CAA GTC AAA CGC TAC TCG CAT 440
Glu Lys Asp Gly Ile Lys Trp Ala Val Gln Val Lys Arg Tyr Ser His
115 120 125 130

AAA GTT TCG CCC AAA GCC ATT CAA GAG GTG GTC TCT TCT AAA GCT TAT 488
Lys Val Ser Pro Lys Ala Ile Gln Glu Val Val Ser Ser Lys Ala Tyr
135 140 145

TAC GCT TGC GAA AAA GCT TGC GTG ATC ACC AAC AGC TAT TTC ACG CAA 536
Tyr Ala Cys Glu Lys Ala Cys Val Ile Thr Asn Ser Tyr Phe Thr Gln
150 155 160

GCC GCT CAA AAA CTG GCT CAA GCT AAC GAA GTG CTC TTG ATT GAC AGA 584
Ala Ala Gln Lys Leu Ala Gln Ala Asn Glu Val Leu Leu Ile Asp Arg
165 170 175

GAC GAA TGG GTC AGG TTT TTG AAC GAA AAG AGA TGAACCGATC CCATCAGATC 637
Asp Glu Trp Val Arg Phe Leu Asn Glu Lys Arg
180 185

GTTTGTTCTC AAGTTCTTTT AAAATTTTGT CGT 670

(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:

Met Lys Lys Ile Ala Phe Phe Ile Phe Val Ile Leu Phe Ser Val Gly
1 5 10 15

Ile Tyr Leu Ile Trp His Val Leu Leu Glu Lys Ala Leu Glu Leu Lys
20 25 30

Leu Ala Thr Ser Ala Asn Asp Leu Leu Leu Lys Leu Leu Ala Ile Leu
35 40 45

Gly Val Phe Ser Met Leu Val Leu Phe Gln Gly Ile Ile Ser Ser Tyr
50 55 60

Lys Lys Arg Gln Leu Lys Arg Ile Leu Gln Lys Ile Asp Ala Met Asn
65 70 75 80

Gly Phe Glu Phe Glu Glu Tyr Ser Lys Ile Phe Phe Thr Ser Lys Gly
85 90 95

Phe Glu Val Ser Ile Thr Gln Lys Ser Gly Asp Tyr Gly Ala Asp Leu
100 105 110
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Leu Phe Ala Ser lie Phe lie Ala Cys 

Thr Asp Leu Asp Ile Cys Leu
50 55 60

Phe Asp Lys Lys Gly Thr Glu Glu Ser Leu Leu Arg Phe Asn His Glu
65 70 75 80

Phe Lys Asn Arg Glu Lys Glu Val Ala Ser Ile Leu Glu Tyr Ile Arg
85 90 95

Ser Tyr Asp Phe Asp Asp Gly Lys
100

(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 670 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...617
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

GTTTAGAAAA GCGTTTGAAC GCTATAAAAA ATGCAGAGTG GCTTTAAGGC ATG AAA 56
Met Lys
1

AAG ATT GCA TTT TTT ATT TTT GTC ATT TTG TTT TCG GTA GGG ATT TAT 104
Lys Ile Ala Phe Phe Ile Phe Val Ile Leu Phe Ser Val Gly Ile Tyr
5 10 15

TTA ATT TGG CAT GTT TTA TTG GAA AAA GCC CTA GAA TTG AAA TTA GCA 152
Leu Ile Trp His Val Leu Leu Glu Lys Ala Leu Glu Leu Lys Leu Ala
20 25 30

ACC TCA GCT AAT GAT TTG CTT TTA AAA TTG TTG GCA ATT CTT GGC GTT 200
Thr Ser Ala Asn Asp Leu Leu Leu Lys Leu Leu Ala Ile Leu Gly Val
35 40 45 50

TTT TCA ATG TTA GTG CTT TTT CAA GGC ATT ATT TCT TCG TAT AAG AAG 248
Phe Ser Met Leu Val Leu Phe Gln Gly Ile Ile Ser Ser Tyr Lys Lys
55 60 65

CGC CAA CTC AAA CGC ATT TTA CAA AAA ATA GAC GCC ATG AAC GGC TTT 296
Arg Gln Leu Lys Arg Ile Leu Gln Lys Ile Asp Ala Met Asn Gly Phe
70 75 80

GAA TTT GAA GAA TAT TCC AAA ATC TTT TTC ACT TCA AAG GGT TTT GAA 344
Glu Phe Glu Glu Tyr Ser Lys Ile Phe Phe Thr Ser Lys Gly Phe Glu
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

AAAGAAGGTT TGAAAAGCTT TTTTAAAAGG CTTTTGAAGT ATTTGGGGTA GGCTTGA ATG 60 

Met 
1 

AAA GTG CAA AAT TTT ATC CAT TTT TCT GTT GTG GTA GGG TTT TTT TTG 108
Lys Val Gln Asn Phe Ile His Phe Ser Val Val Val Gly Phe Phe Leu
5 10 15

GGG TTA GTG TTT TCG GTG TTG AAA TTC AAT GAG CCA GAG AGC ATT TTA 156
Gly Leu Val Phe Ser Val Leu Lys Phe Asn Glu Pro Glu Ser Ile Leu
20 25 30

TTA TGG ACG GTG TTA TCC ACG CTT GGG GGG TAC TTG ATT GCG TTG TTG 204
Leu Trp Thr Val Leu Ser Thr Leu Gly Gly Tyr Leu Ile Ala Leu Leu
35 40 45

TTT GCG TCT ATT TTT ATC GCT TGC ACG GAT TTG GAT ATT TGT CTT TTT 252
Phe Ala Ser Ile Phe Ile Ala Cys Thr Asp Leu Asp Ile Cys Leu Phe
50 55 60 65

GAC AAA AAA GGC ACT GAA GAG AGT TTG CTT CGT TTC AAC CAT GAG TTT 300
Asp Lys Lys Gly Thr Glu Glu Ser Leu Leu Arg Phe Asn His Glu Phe
70 75 80

AAA AAC AGA GAA AAA GAA GTG GCT AGT ATT TTA GAA TAC ATT AGA AGT 348
Lys Asn Arg Glu Lys Glu Val Ala Ser Ile Leu Glu Tyr Ile Arg Ser
85 90 95

TAT GAT TTT GAT GAT GGA AAA TAGAATGCCC AAAGGAATTC AAAAAACTGA AACA 403
Tyr Asp Phe Asp Asp Gly Lys
100

AGCGAAAAAA ATATAGAA 421 
(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 

Met Lys Val Gln Asn Phe Ile His Phe Ser Val Val Val Gly Phe Phe
1 5 10 15

Leu Gly Leu Val Phe Ser Val Leu Lys Phe Asn Glu Pro Glu Ser Ile
20 25 30

Leu Leu Trp Thr Val Leu Ser Thr Leu Gly Gly Tyr Leu Ile Ala Leu
35 40 45
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AGGGACTCGC TCGTGGCGTT ACTGCCCACG CCGGCTAACA CTTTC ATG CGC GAA TTT 117 

Met Arg Glu Phe 
1 

GAG GGC GTT TTA GTG TTT TTG CAA GTT TCT ATG GCG ATT TCA ATG CAA 165 
Glu Gly Val Leu Val Phe Leu Gln Val Ser Met Ala Ile Ser Met Gln
5 10 15 20 

CGC ATG TGC TCT TTG TGG GTG AGC GTG GCG GAT TCT CCT GTC GTG CCA 213 
Arg Met Cys Ser Leu Trp Val Ser Val Ala Asp Ser Pro Val Val Pro 
25 30 35 

ACA GGC ACG CAT GCG TCC ATG CCC TGAAAAATTT GGCGCTTGAT CAAGGTTTCA 267 
Thr Gly Thr His Ala Ser Met Pro 
40 

TAAGCGGCCT CATCAACGCT CAA 290

(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

Met Arg Glu Phe Glu Gly Val Leu Val Phe Leu Gln Val Ser Met Ala
1 5 10 15

Ile Ser Met Gln Arg Met Cys Ser Leu Trp Val Ser Val Ala Asp Ser
20 25 30

Pro Val Val Pro Thr Gly Thr His Ala Ser Met Pro 
35 40 

(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 58... 369 
(D) OTHER INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Met Pro Ala Leu Ser Arg Ser Tyr Gly Val Val Leu Ile Phe Ile Thr
1 5 10 15

Gln Ser Asn Ala Leu Ile Glu Lys Tyr Tyr Gly Arg Glu Asp Ala Arg
20 25 30

Ile Val Asn Ser Thr Val Ala Tyr Lys Ile Ile Phe Lys Met Asp Asp
35 40 45

Leu Glu Tyr Ala Lys Gln Val Ser Glu Glu Val Gly Lys Met Thr Arg
50 55 60

Lys Thr Arg Ser His Ser Thr Glu Lys Gly Gln Leu Ile Thr Gly Gly
65 70 75 80

Thr Ser Ser Ile Gly Lys Glu Ala Trp Asp Leu Leu Ser Ala Gln Asp
85 90 95

Ile Met Asn Ile Asp Lys Asp Glu Val Ile Val Leu Val Ser Gly His
100 105 110

Lys Ala Lys Pro Leu Lys Leu Lys Ala Asn Tyr Tyr Phe Lys Asn Lys
115 120 125

Glu Leu Leu Ser Arg Ile Asn Trp Glu Val Lys Pro Asn Glu Glu Val
130 135 140

Phe 
145 

(2) INFORMATION FOR SEQ ID NO: 291:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 290 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 106...237
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:
TAAGGGCTTA CGCATAAAAT CGCATCCGCG CCGATTTTTT GAGCGAACTT TGCTAAAGAA 60 
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 145... 579 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

GCGTTCAATA GAATGCTTTA GTTAGGAAGC TCCTTGCTTT AGCAAGGNGT GGTTTCACTG 60
AAAGAGAGTA AGAAATTTGA AGAAAGGGTT TATCTTTTTT TAGATGAATT TGTGCGTTTT 120

GGTAAATTGC CTTTTTTATT AGAA ATG CCA GCA TTA AGT AGG AGC TAT GGT 171 

Met Pro Ala Leu Ser Arg Ser Tyr Gly 
1 5 

GTG GTT TTA ATT TTT ATC ACG CAA TCC AAC GCT CTT ATT GAA AAA TAT 219
Val Val Leu Ile Phe Ile Thr Gln Ser Asn Ala Leu Ile Glu Lys Tyr
10 15 20 25

TAG GGC AGA GAA GAT GCA AGA ATT GTT AAT AGC ACC GTG GCT TAC AAA 267
Tyr Gly Arg Glu Asp Ala Arg Ile Val Asn Ser Thr Val Ala Tyr Lys
30 35 40

ATA ATT TTC AAA ATG GAT GAT TTA GAA TAC GCT AAA CAG GTG AGC GAA 315
Ile Ile Phe Lys Met Asp Asp Leu Glu Tyr Ala Lys Gln Val Ser Glu
45 50 55

GAA GTC GGT AAG ATG ACT AGA AAA ACA CGA AGC CAC TCT ACA GAA AAA 363
Glu Val Gly Lys Met Thr Arg Lys Thr Arg Ser His Ser Thr Glu Lys
60 65 70

GGA CAA CTC ATT ACC GGA GGG ACT TCT AGT ATA GGT AAA GAG GCG TGG 411
Gly Gln Leu Ile Thr Gly Gly Thr Ser Ser Ile Gly Lys Glu Ala Trp
75 80 85

GAC TTA TTG AGC GCG CAA GAT ATT ATG AAT ATT GAT AAA GAT GAA GTG 459
Asp Leu Leu Ser Ala Gln Asp Ile Met Asn Ile Asp Lys Asp Glu Val
90 95 100 105

ATC GTT TTA GTA AGC GGT CAT AAG GCT AAA CCC TTA AAA TTA AAA GCG 507
Ile Val Leu Val Ser Gly His Lys Ala Lys Pro Leu Lys Leu Lys Ala
110 115 120

AAT TAT TAT TTC AAA AAC AAA GAA TTA CTC TCT CGT ATT AAC TGG GAA 555
Asn Tyr Tyr Phe Lys Asn Lys Glu Leu Leu Ser Arg Ile Asn Trp Glu
125 130 135

GTC AAG CCC AAT GAA GAA GTG TTT TGATGGATTA AAAAAGTTTG CATGAGTATT 609
Val Lys Pro Asn Glu Glu Val Phe
140 145

TTTTAATTGC TTTTTTAAAA AT 631 
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 112... 252 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:287: 

ATGCCTGCCA TTTCATAGCC TAAATTTTCT TTAGAGCCGA ATTGATAAGC GGCTTTTAAG 60

ACTTCTTTTT GCTTAGCGTT AAAATCTTTA ATATTGTCGC AATTGGTCAT C ATG ACT 117 

Met Thr 
1 

TTA GTA ACG GGC GAT TTG GGC TTG TTT TTA ACC CCT TTA GCG GGC TTA 165 
Leu Val Thr Gly Asp Leu Gly Leu Phe Leu Thr Pro Leu Ala Gly Leu 
5 10 15 

GGC TCT GTT TTA GTG GGG CTT TCT GTT GCG GCT AAA CTT AAA GAT GCA 213 
Gly Ser Val Leu Val Gly Leu Ser Val Ala Ala Lys Leu Lys Asp Ala 
20 25 30 

CTT AAG GCT GTG CCT AGC CAT AAG GCT TTA AAG ATG GTG TGAGTGAGTG GG 264
Leu Lys Ala Val Pro Ser His Lys Ala Leu Lys Met Val
35 40 45

GTTAAATGTT TCAAAACGCC TACCTTTTGT ATTAAGAAAT AAACTA 310

(2) INFORMATION FOR SEQ ID NO: 288:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288: 

Met Thr Leu Val Thr Gly Asp Leu Gly Leu Phe Leu Thr Pro Leu Ala 

1 5 10 15

Gly Leu Gly Ser Val Leu Val Gly Leu Ser Val Ala Ala Lys Leu Lys 

20 25 30 

Asp Ala Leu Lys Ala Val Pro Ser His Lys Ala Leu Lys Met Val 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 289:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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AAG GTC GCT CAA AAA GAA TTT GGT AGC GTT TGT GCG TTA AGO TAGTTTTAT 64 2 
Lys Val Ala Gln Lys Glu Phe Gly Ser Val Cys Ala Leu Arg
175 180

AGCATTCTAG CGAGCATGTT TAAGGCATGC TCTACGCTTT TATT 686

(2) INFORMATION FOR SEQ ID NO: 286:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:



Met Val Leu Ile Ala Leu Leu Gly Val Phe Ser Ser Val Ser Leu Ser
1 5 10 15

Ala Lys Ser Leu Leu Arg Asp Asp Gly Ile Leu Val Ser Asp Leu Lys
20 25 30

Gly Met Lys Ser Glu Leu Ser Asp Ala Pro Ala Trp Val Phe Glu Asp
35 40 45

Ala Lys Ala Pro Tyr Glu Glu Met Gly Val Ala Tyr Ile Pro Val Asn
50 55 60

Asn Lys Tyr Leu Gly Ile Glu Gln Ala Thr Leu Asn Ala Lys Leu Ser
65 70 75 80

Leu Ile Val Val Phe His Glu Ile Met Met Lys Tyr Lys Lys Arg Phe
85 90 95

Met Glu Gln Phe His Glu Ser Glu Gln Thr Thr Thr Asn Ile Ser Tyr
100 105 110

Ala Ile Tyr Asn Tyr Leu Ala Thr Lys Ile Gln Val Ser Asn Thr Tyr
115 120 125

Thr Asn Leu Lys Ser Glu Val Ala Val Val Lys Ile Lys Leu Val Gly
130 135 140

Cys Gln Ile Glu Gln Ile Lys Arg Tyr Leu Lys Ala Ser Val Glu Asn
145 150 155 160

Leu Asn Asp Asn Glu Ile Ala Tyr Ile Ala Lys Val Ala Gln Lys Glu
165 170 175

Phe Gly Ser Val Cys Ala Leu Arg
180

(2) INFORMATION FOR SEQ ID NO: 287:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
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(ix) FRATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 82... 633 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:285: 

TGTGATTAAA CAAAATCAAA AACTTTTTAA CTATAATCAA ACCTAAATTA AAGTTCAAGG 60 
AGTGGCATTT TGTTTAAAAG A ATG GTT TTA ATC GCT CTT TTA GGG GTG TTT 111 

Met Val Leu Ile Ala Leu Leu Gly Val Phe
1 5 10

TCA AGC GTT TCA TTA AGC GCT AAG AGT CTT TTA AGA GAT GAT GGG ATT 159
Ser Ser Val Ser Leu Ser Ala Lys Ser Leu Leu Arg Asp Asp Gly Ile
15 20 25

TTA GTC TCT GAT TTA AAG GGC ATG AAA TCA GAA CTA TCT GAT GCT CCT 207
Leu Val Ser Asp Leu Lys Gly Met Lys Ser Glu Leu Ser Asp Ala Pro
30 35 40

GCT TGG GTT TTT GAA GAC GCT AAA GCC CCC TAC GAA GAA ATG GGC GTG 255
Ala Trp Val Phe Glu Asp Ala Lys Ala Pro Tyr Glu Glu Met Gly Val
45 50 55

GCG TAT ATC CCT GTT AAT AAT AAA TAT TTA GGG ATT GAG CAA GCG ACC 303
Ala Tyr Ile Pro Val Asn Asn Lys Tyr Leu Gly Ile Glu Gln Ala Thr
60 65 70

TTA AAC GCT AAA TTG AGT CTG ATC GTG GTT TTT CAT GAA ATC ATG ATG 351
Leu Asn Ala Lys Leu Ser Leu Ile Val Val Phe His Glu Ile Met Met
75 80 85 90

AAG TAT AAA AAA CGC TTC ATG GAG CAA TTC CAT GAG TCC GAG CAG ACG 399
Lys Tyr Lys Lys Arg Phe Met Glu Gln Phe His Glu Ser Glu Gln Thr
95 100 105

ACT ACG AAT ATC AGT TAC GCT ATC TAT AAT TAT CTA GCG ACT AAG ATC 447
Thr Thr Asn Ile Ser Tyr Ala Ile Tyr Asn Tyr Leu Ala Thr Lys Ile
110 115 120

CAG GTA TCC AAC ACC TAC ACG AAT TTA AAA TCG GAG GTG GCG GTG GTG 495
Gln Val Ser Asn Thr Tyr Thr Asn Leu Lys Ser Glu Val Ala Val Val
125 130 135

AAA ATC AAG CTA GTG GGT TGT CAG ATT GAG CAA ATC AAA AGG TAT TTA 543
Lys Ile Lys Leu Val Gly Cys Gln Ile Glu Gln Ile Lys Arg Tyr Leu
140 145 150

AAA GCG AGC GTT GAA AAC CTT AAC GAT AAT GAA ATC GCT TAC ATC GCT 591
Lys Ala Ser Val Glu Asn Leu Asn Asp Asn Glu Ile Ala Tyr Ile Ala
155 160 165 170 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:



Met Lys Arg Arg Asp Phe Ile Lys Thr Thr Thr Leu Gly Ala Thr Gly
1 5 10 15

Ala Val Leu Gly Ala Gln Ile Leu Gln Ala Glu Glu Ser Lys Gly Ser
20 25 30

Val Ala Lys Tyr Lys Ile Glu Ala Gln Tyr Ser Ile Asp Phe Asp Ser
35 40 45

Ala Glu His Thr Ser Leu Phe Ile Pro Met Pro Ser Val Val Ala Ser
50 55 60

Asn Val His Leu Gln Gly Asn His Ala Ser Tyr Lys Ser Met Leu Asn
65 70 75 80

Phe Gly Val Pro Tyr Leu Gln Val Asp Phe Leu Lys Ser Thr Gln Lys
85 90 95

Lys Gln Val His Leu Ser Tyr Glu Ile Ala Ser Tyr Gln Leu Asn Glu
100 105 110

Arg Leu Phe Glu Thr Ser Asp Phe Val Ala Met Gly Arg Tyr Glu Arg
115 120 125

Asp Asp Ala Ser Val Ala Asn Ile Ala Asn Gln Leu Lys Gly Thr Thr
130 135 140

Pro Lys Glu Ser Val Arg Asn Phe Tyr Ala Phe Ile Lys His Glu Met
145 150 155 160

Pro Lys Arg Gln Lys Ala Leu Glu Gly Lys Glu Asn Leu Pro Lys Arg
165 170 175

Glu Ser Leu Pro Trp Phe Ala Thr Ile Ser Lys Glu Ser Met Phe Val
180 185 190

Ser Leu Cys His Ala Cys Gly Ile Lys Ser Ala Glu Val Gln Gly Leu
195 200 205

Lys Leu Gly Gln Asn Ser Val Val Lys Asn Ala Pro Arg Val Glu Val
210 215 220

Tyr Leu Lys Asp Ser Phe Leu Ala Phe Asp Phe Gln Asn Asn His Lys
225 230 235 240

Glu Val Phe Ile Pro Leu Asn Arg His Lys Asp Met Gln Leu Asp Ser
245 250 255

Ala Leu Leu Ala Thr Phe Gly Asp Ala Phe Ala Leu Val Asp Gly Arg
260 265 270

Asp Leu Gly Asn Tyr Glu Ser Lys Leu Phe Glu Lys Arg Val Ser Tyr
275 280 285

Thr Ile Val
290

(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 686 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
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TTG TTT GAA ACG AGC GAT TTT GTA GCA ATG GGG CGT TAT GAA AGA GAC 44 3 

Leu Phe Glu Thr Ser Asp Phe Val Ala Met Gly Arg Tyr Glu Arg Asp 
115 120 125 



GAT GCG AGC GTG GCT AAC ATT GCC AAC CAG CTT AAG GGA ACA ACC CCT 491
Asp Ala Ser Val Ala Asn Ile Ala Asn Gln Leu Lys Gly Thr Thr Pro
130 135 140 145

AAA GAA AGC GTT CGC AAT TTT TAT GCG TTC ATC AAG CAT GAG ATG CCT 539
Lys Glu Ser Val Arg Asn Phe Tyr Ala Phe Ile Lys His Glu Met Pro
150 155 160

AAG AGA CAG AAG GCT TTA GAG GGT AAA GAA AAT TTA CCT AAG CGT GAG 587
Lys Arg Gln Lys Ala Leu Glu Gly Lys Glu Asn Leu Pro Lys Arg Glu
165 170 175

AGT TTG CCC TGG TTT GCA ACC ATT TCA AAA GAG AGC ATG TTT GTG TCC 635
Ser Leu Pro Trp Phe Ala Thr Ile Ser Lys Glu Ser Met Phe Val Ser
180 185 190

TTA TGC CAT GCG TGC GGG ATT AAA AGC GCT GAA GTG CAA GGC TTG AAA 683
Leu Cys His Ala Cys Gly Ile Lys Ser Ala Glu Val Gln Gly Leu Lys
195 200 205

CTG GGT CAA AAC AGC GTG GTG AAA AAC GCT CCT AGA GTG GAA GTG TAT 731
Leu Gly Gln Asn Ser Val Val Lys Asn Ala Pro Arg Val Glu Val Tyr
210 215 220 225

TTG AAA GAT TCA TTT CTA GCG TTT GAT TTT CAA AAT AAT CAC AAG GAA 779
Leu Lys Asp Ser Phe Leu Ala Phe Asp Phe Gln Asn Asn His Lys Glu
230 235 240

GTT TTT ATC CCG TTG AAT CGT CAT AAA GAC ATG CAG TTA GAT TCT GCC 827
Val Phe Ile Pro Leu Asn Arg His Lys Asp Met Gln Leu Asp Ser Ala
245 250 255

TTA TTG GCG ACT TTT GGC GAT GCT TTT GCC CTT GTG GAT GGT AGG GAT 875
Leu Leu Ala Thr Phe Gly Asp Ala Phe Ala Leu Val Asp Gly Arg Asp
260 265 270

TTA GGC AAT TAC GAG AGC AAA CTT TTT GAA AAA AGA GTG TCC TAT ACG 923
Leu Gly Asn Tyr Glu Ser Lys Leu Phe Glu Lys Arg Val Ser Tyr Thr
275 280 285

ATT GTC TAAAGGCATG AAATCTAGGA ATATTCCTTG ATAGCGGGCT TTCCTTTTTA GG 981
Ile Val
290



(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 291 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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Ala lie Leu His 



(2) INFORMATION FOR SEQ ID NO: 283:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 57... 929 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 

GTCCTATTTT TTCATTCATT CAACGAATTT AAAAATTACA ATAAAGAGTT ATAGTT ATG 59
Met
1

AAA CGA AGG GAT TTT ATT AAA ACG ACT ACT TTA GGC GCT ACA GGT GCT 107
Lys Arg Arg Asp Phe Ile Lys Thr Thr Thr Leu Gly Ala Thr Gly Ala
5 10 15

GTT TTA GGA GCA CAG ATT TTG CAG GCA GAA GAA AGT AAA GGG AGT GTT 155
Val Leu Gly Ala Gln Ile Leu Gln Ala Glu Glu Ser Lys Gly Ser Val
20 25 30

GCA AAA TAT AAA ATA GAA GCT CAA TAG AGC ATT GAT TTT GAT TCT GCA 203
Ala Lys Tyr Lys Ile Glu Ala Gln Tyr Ser Ile Asp Phe Asp Ser Ala
35 40 45

GAA CAC ACT TCA CTT TTC ATT CCC ATG CCG AGT GTT GTA GCG AGC AAT 251
Glu His Thr Ser Leu Phe Ile Pro Met Pro Ser Val Val Ala Ser Asn
50 55 60 65

GTG CAT TTA CAA GGC AAT CAT GCT AGC TAT AAA AGC ATG CTC AAT TTT 299
Val His Leu Gln Gly Asn His Ala Ser Tyr Lys Ser Met Leu Asn Phe
70 75 80

GGA GTG CCT TAT TTG CAA GTG GAT TTT TTA AAA AGC ACT CAA AAA AAG 347
Gly Val Pro Tyr Leu Gln Val Asp Phe Leu Lys Ser Thr Gln Lys Lys
85 90 95

CAA GTC CAT TTG TCT TAT GAG ATC GCT AGC TAT CAA TTG AAT GAG CGT 395
Gln Val His Leu Ser Tyr Glu Ile Ala Ser Tyr Gln Leu Asn Glu Arg
100 105 110
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210 215 220 

AAC CCA TTG ACT TTA AAT CGC GCT TCA GCC GAA GAG ATT CAA GAT CAT 781
Asn Pro Leu Thr Leu Asn Arg Ala Ser Ala Glu Glu Ile Gln Asp His
225 230 235

GAA TGC GCG ATT TTG CAC TAAAGCGAGT TAGATTCTTA AGCTTGAGCG ATAACCTT 837
Glu Cys Ala Ile Leu His
240

TAAAAAGCGT TAT 850

(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

Met Lys Lys Ala Gly Phe Leu Phe Leu Ala Val Met Ala Ile Val Val
1 5 10 15

Met Ser Leu Asn Ala Lys Asp Pro Asn Val Leu Arg Lys Ile Val Phe
20 25 30

Glu Lys Cys Leu Pro Asn Tyr Glu Lys Asn Gln Asn Pro Ser Pro Cys
35 40 45

Ile Glu Val Lys Pro Asp Ala Gly Tyr Val Val Leu Lys Asp Ile Asn
50 55 60

Gly Pro Leu Gln Tyr Leu Leu Met Pro Thr Thr His Ile Ser Gly Ile
65 70 75 80

Glu Ser Pro Leu Leu Leu Asp Pro Ser Thr Pro Asn Phe Phe Tyr Leu
85 90 95

Ser Trp Gln Ala Arg Asp Phe Met Ser Lys Lys Tyr Gly Gln Pro Ile
100 105 110

Pro Asp Tyr Ala Ile Ser Leu Thr Ile Asn Ser Ser Lys Gly Arg Ser
115 120 125

Gln Asn His Phe His Ile His Ile Ser Cys Ile Ser Leu Glu Ala Arg
130 135 140

Lys Gln Leu Asp Asn Asn Leu Lys Lys Ile Asn Ser Arg Trp Ser Pro
145 150 155 160

Leu Pro Gly Gly Leu Asn Gly His Lys Tyr Leu Ala Arg Arg Val Thr
165 170 175

Glu Ser Glu Leu Val Gln Lys Ser Pro Phe Val Met Leu Asn Lys Glu
180 185 190

Val Pro Asn Ala Tyr Lys Arg Met Gly Asp Tyr Gly Leu Ala Val Val
195 200 205

Gln Gln Ser Asp Asn Ser Phe Val Leu Leu Ala Thr Gln Phe Asn Pro
210 215 220

Leu Thr Leu Asn Arg Ala Ser Ala Glu Glu Ile Gln Asp His Glu Cys
225 230 235 240
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ACCACTATTG TAGAAAAATA ACAAGAGGGT TTGCAAAAAC TCTCATTAAA AACAAGGAGC 60 
AAAAAAG ATG AAA AAG GCG GGC TTT CTT TTT TTA GCG GTA ATG GCT ATC 109
Met Lys Lys Ala Gly Phe Leu Phe Leu Ala Val Met Ala Ile
1 5 10

GTT GTT ATG AGT TTA AAC GCT AAA GAT CCG AAT GTG TTG CGT AAG ATT 157
Val Val Met Ser Leu Asn Ala Lys Asp Pro Asn Val Leu Arg Lys Ile
15 20 25 30

GTT TTT GAG AAA TGT GTG CCT AAT TAT GAG AAA AAT GAG AAT CCT TCG 205
Val Phe Glu Lys Cys Leu Pro Asn Tyr Glu Lys Asn Gln Asn Pro Ser
35 40 45

CCA TGC ATA GAA GTC AAA CCC GAT GCC GGC TAT GTG GTT TTA AAA GAT 253
Pro Cys Ile Glu Val Lys Pro Asp Ala Gly Tyr Val Val Leu Lys Asp
50 55 60

ATT AAC GGC CCG TTG CAA TAT TTG TTG ATG CCA ACA ACT CAC ATT AGC 301
Ile Asn Gly Pro Leu Gln Tyr Leu Leu Met Pro Thr Thr His Ile Ser
65 70 75

GGT ATT GAA AGC CCT TTG TTA CTT GAT CCT TCT ACG CCT AAC TTT TTT 349
Gly Ile Glu Ser Pro Leu Leu Leu Asp Pro Ser Thr Pro Asn Phe Phe
80 85 90

TAT TTA TCC TGG CAA GCG CGT GAT TTT ATG AGT AAA AAA TAC GGC CAA 397
Tyr Leu Ser Trp Gln Ala Arg Asp Phe Met Ser Lys Lys Tyr Gly Gln
95 100 105 110

CCC ATT CCT GAT TAT GCG ATT TCT TTG ACG ATT AAC TCT AGC AAA GGG 445
Pro Ile Pro Asp Tyr Ala Ile Ser Leu Thr Ile Asn Ser Ser Lys Gly
115 120 125

CGA TCG CAA AAC CAT TTT CAT ATC CAT ATC TCT TGC ATT AGT CTT GAA 493
Arg Ser Gln Asn His Phe His Ile His Ile Ser Cys Ile Ser Leu Glu
130 135 140

GCA CGC AAA CAG CTG GAT AAT AAC CTA AAA AAA ATC AAC AGC CGT TGG 541
Ala Arg Lys Gln Leu Asp Asn Asn Leu Lys Lys Ile Asn Ser Arg Trp
145 150 155

TCG CCA TTA CCG GGC GGT TTG AAT GGG CAT AAA TAC TTG GCG CGT CGG 589
Ser Pro Leu Pro Gly Gly Leu Asn Gly His Lys Tyr Leu Ala Arg Arg
160 165 170

GTA ACA GAG AGC GAG TTA GTG CAA AAA AGC CCG TTT GTC ATG CTT AAT 637
Val Thr Glu Ser Glu Leu Val Gln Lys Ser Pro Phe Val Met Leu Asn
175 180 185 190

AAA GAA GTG CCT AAT GCG TAC AAA CGC ATG GGG GAC TAT GGC TTA GCG 685
Lys Glu Val Pro Asn Ala Tyr Lys Arg Met Gly Asp Tyr Gly Leu Ala
195 200 205

GTG GTG CAA CAA AGC GAT AAC TCC TTT GTC TTA TTA GCG ACA CAA TTT 733
Val Val Gln Gln Ser Asp Asn Ser Phe Val Leu Leu Ala Thr Gln Phe
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Thr Asp Asn Asn 

Lys Glu Tyr Glu Gly Tyr Glu Gly Leu Ile Lys Arg
485 490 495

Leu Leu Thr Glu Phe Pro Leu Lys Glu Pro Ile Val Ser Glu Ser Gln
500 505 510

Lys Lys Asp Phe Ile Lys Leu Phe Gly Lys Ile Leu Lys Leu Glu Asn
515 520 525

Ile Leu Asn Ser Phe Glu Asn Phe Lys Lys Asp Asp Tyr Ile Asn Pro
530 535 540

Arg Asp Phe Gln Asp Tyr Gln Ser Lys Tyr Leu Asp Phe Tyr Asp Ala
545 550 555 560

Met Arg Ser Glu Lys Gly Lys Asp Lys Glu Glu Ile Asn Asp Asp Leu
565 570 575

Ile Phe Glu Ile Glu Leu Ile Lys Gln Val Glu Val Asn Ile Asp Tyr
580 585 590

Ile Leu Asn Leu Ile Glu Glu Phe Ala Lys Glu His Gly Val Glu Ile
595 600 605

Gln Gly Val Lys Thr Lys Ile Glu Pro Ile Ile Asn Ser Ser Ile Glu
610 615 620

Leu Arg Asn Lys Lys Asp Leu Ile Met Asp Phe Ile Asp Lys Tyr Asn
625 630 635 640

Lys Asp Gln Glu Val His Ala His Phe Gln Asp Tyr Ile His Gln Lys
645 650 655

Arg Glu Glu Glu Phe Gln Asn Ile Ile Glu Glu Asn Arg Leu Asn Glu
660 665 670

Glu Lys Ala Tyr Ser Phe Met Gln His Ala Phe Lys Gly Gly Glu Ile
675 680 685

Ser Phe Ser Gly Thr Glu Phe Pro Lys Ile Ile Glu Glu Lys Pro Ser
690 695 700

Met Phe Gly Lys Asn Ser Arg Tyr Gln Glu Val Lys Glu Lys Val Ala
705 710 715 720

Ala Ser Leu Ser Arg Phe Phe His Arg Phe Cys Asp Leu Thr Ser Ala
725 730 735

Ile Phe Lys Lys Asn Glu Val Lys Lys Asp Glu Val Asn Glu Lys
740 745 750



(2) INFORMATION FOR SEQ ID NO: 281: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 850 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION : 68 . . . 799 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:
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35 

40 45

Thr Ala Gln Asn Ser Lys Thr Lys Asn Gln Ser Lys Gly Tyr Ile Trp
50 55 60

His Thr Thr Gly Ser Gly Lys Thr Leu Thr Ser Phe Lys Ser Ala Thr
65 70 75 80

Leu Ala Lys Glu Leu Glu Ser Val Ser Lys Val Leu Phe Val Val Asp
85 90 95

Arg Lys Asp Leu Asp Tyr Gln Thr Met Lys Glu Tyr Asp Lys Phe Gln
100 105 110

Lys Asp Cys Ala Asn Ser Asn Thr Ser Thr Lys Ile Leu Lys Glu Gln
115 120 125

Leu Glu Asp Ser Asn Ala Lys Ile Ile Ile Thr Thr Ile Gln Lys Leu
130 135 140

Asp Lys Phe Val Lys Ser His Lys Gly His Ala Ile Phe Asn Glu Glu
145 150 155 160

Val Val Met Ile Phe Asp Glu Cys His Arg Ser Gln Leu Gly Ser Met
165 170 175

His Gln Ala Ile Thr Lys Ala Phe Lys Lys Tyr His Leu Phe Gly Phe
180 185 190

Thr Gly Thr Pro Ile Phe Ala Ala Asn Cys Asp Lys Asn Asn Pro Leu
195 200 205

Gly Thr Thr Glu Gln Lys Phe Gly Lys Cys Leu His Gln Tyr Thr Ile
210 215 220

Ile Asp Ala Ile Arg Asp Lys Asn Val Leu Pro Phe Arg Val Glu Tyr
225 230 235 240

His Asn Thr Ile Lys Ala Lys Glu Asp Ile Lys Asp Asn Lys Val Arg
245 250 255

Ala Val Asp Glu Lys Asn Ala Leu Leu Asp Thr Arg Arg Ile Lys Glu
260 265 270

Ile Thr Lys Cys Ile Leu Glu Arg Phe Asn Gln Ala Thr Lys Asn Lys
275 280 285

Lys Phe Asn Ser Ile Leu Ala Cys Ser Ser Ile Glu Ala Leu Lys Lys
290 295 300

Tyr Tyr Gln Ala Phe Lys Glu Glu Lys His Asp Leu Lys Ile Ala Ala
305 310 315 320

Ile Phe Ser Tyr Ser Ala Asn Glu Glu Ile Asp Thr Leu Glu Asp Glu
325 330 335

Asn Asn Glu Ser Ala Cys Arg Leu Asp Lys Ser Ser Arg Asp Phe Leu
340 345 350

Glu Gly Ala Ile Ala Asp Tyr Asn Gly Met Phe Gly Val Ser Phe Asp
355 360 365

Thr Ser Asp Gln Lys Phe Gln Ser Tyr Tyr Lys Asp Leu Ser Gln Lys
370 375 380

Met Lys Glu Arg Lys Ile Asp Leu Leu Met Val Val Asn Met Phe Leu
385 390 395 400

Thr Gly Phe Asp Ala Thr Arg Leu Asn Thr Leu Trp Val Asp Lys Asn
405 410 415

Leu Lys Tyr His Gly Leu Ile Gln Ala Phe Ser Arg Ala Asn Arg Ile
420 425 430

Leu Asp Ser Val Lys Thr His Gly Asn Ile Val Cys Phe Arg Asp Leu
435 440 445

Glu Gln Asp Leu Asn Asp Ala Leu Met Leu Phe Gly Asn Lys Asp Ala
450 455 460

Gln Ser Ile Ala Leu Leu Arg Lys Tyr Glu Asp Tyr Leu Lys Gly Tyr
465 470 475 480
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ATA GAG TTA AGG AAT AAA AAA GAT TTG ATC ATG GAT TTC ATT GAC AAA 2642 
Ile Glu Leu Arg Asn Lys Lys Asp Leu Ile Met Asp Phe Ile Asp Lys
625 630 635

TAC AAC AAA GAC CAA GAA GTC CAT GCG CAT TTT CAA GAT TAT ATC CAC 2690
Tyr Asn Lys Asp Gln Glu Val His Ala His Phe Gln Asp Tyr Ile His
640 645 650

CAA AAA AGA GAA GAG GAA TTC CAA AAT ATC ATA GAA GAA AAC CGC TTG 2738
Gln Lys Arg Glu Glu Glu Phe Gln Asn Ile Ile Glu Glu Asn Arg Leu
655 660 665 670

AAT GAA GAA AAA GCC TAT TCG TTC ATG CAG CAT GCC TTT AAA GGG GGC 2786
Asn Glu Glu Lys Ala Tyr Ser Phe Met Gln His Ala Phe Lys Gly Gly
675 680 685

GAA ATC AGT TTT AGT GGG ACG GAA TTC CCT AAA ATC ATT GAA GAA AAA 2834
Glu Ile Ser Phe Ser Gly Thr Glu Phe Pro Lys Ile Ile Glu Glu Lys
690 695 700

CCC TCC ATG TTT GGT AAA AAT TCG CGC TAT CAA GAG GTG AAA GAA AAA 2882
Pro Ser Met Phe Gly Lys Asn Ser Arg Tyr Gln Glu Val Lys Glu Lys
705 710 715

GTC GCT GCA AGC CTT TCT CGT TTT TTC CAC CGC TTT TGT GAT CTC ACT 2930
Val Ala Ala Ser Leu Ser Arg Phe Phe His Arg Phe Cys Asp Leu Thr
720 725 730

AGC GCT ATA TTT AAG AAA AAT GAG GTT AAA AAA GAT GAG GTT AAT GAA 2978
Ser Ala Ile Phe Lys Lys Asn Glu Val Lys Lys Asp Glu Val Asn Glu
735 740 745 750

AAA TAGTTCATGA ACGCTTTTGC ATTAAGGCTC AAAAAAAGCG CCGTTTAATG GATT 3035
Lys



(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:

Met Asp Phe Ala Lys Ala Phe Phe Ala Lys Arg Ser Leu Leu Asn Val
1 5 10 15

Leu Thr Cys Tyr Cys Val Phe Thr Ser Glu Glu Val Leu Leu Val Met
20 25 30

Arg Pro Tyr Gln Ile Val Ala Ala Glu Arg Ile Leu Glu Lys Ile Lys
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TTT TTG ACC GGG TTT GAC GCT ACA AGG CTC AAC ACC CTT TGG GTG GAT 1970 
Phe Leu Thr Gly Phe Asp Ala Thr Arg Leu Asn Thr Leu Trp Val Asp 
400 405 410 

AAA AAT CTC AAA TAC CAT GGG CTA ATT CAA GCT TTT TCA CGC GCA AAC 2018
Lys Asn Leu Lys Tyr His Gly Leu Ile Gln Ala Phe Ser Arg Ala Asn
415 420 425 430

CGC ATT TTA GAT AGC GTT AAA ACG CAT GGG AAT ATC GTG TGT TTT AGG 2066
Arg Ile Leu Asp Ser Val Lys Thr His Gly Asn Ile Val Cys Phe Arg
435 440 445

GAT TTA GAA CAG GAT TTG AAT GAC GCT CTC ATG CTT TTT GGC AAC AAG 2114
Asp Leu Glu Gln Asp Leu Asn Asp Ala Leu Met Leu Phe Gly Asn Lys
450 455 460

GAC GCT CAA TCT ATT GCG CTG TTA AGA AAA TAT GAA GAT TAT TTG AAA 2162
Asp Ala Gln Ser Ile Ala Leu Leu Arg Lys Tyr Glu Asp Tyr Leu Lys
465 470 475

GGC TAC ACG GAT AAC AAC AAA GAA TAC GAG GGC TAT GAG GGT TTG ATT 2210
Gly Tyr Thr Asp Asn Asn Lys Glu Tyr Glu Gly Tyr Glu Gly Leu Ile
480 485 490

AAA AGG CTT TTA ACC GAA TTC CCA TTA AAA GAG CCA ATC GTT TCA GAA 2258
Lys Arg Leu Leu Thr Glu Phe Pro Leu Lys Glu Pro Ile Val Ser Glu
495 500 505 510

AGC CAG AAA AAG GAT TTT ATT AAG CTT TTT GGC AAG ATT TTG AAA TTA 2306
Ser Gln Lys Lys Asp Phe Ile Lys Leu Phe Gly Lys Ile Leu Lys Leu
515 520 525

GAA AAT ATT TTA AAC AGC TTT GAA AAT TTC AAA AAA GAC GAT TAC ATC 2354
Glu Asn Ile Leu Asn Ser Phe Glu Asn Phe Lys Lys Asp Asp Tyr Ile
530 535 540

AAT CCC AGG GAT TTT CAA GAC TAT CAA AGC AAA TAC CTT GAT TTT TAC 2402
Asn Pro Arg Asp Phe Gln Asp Tyr Gln Ser Lys Tyr Leu Asp Phe Tyr
545 550 555

GAT GCA ATG AGA TCA GAA AAA GGG AAG GAT AAA GAA GAG ATT AAT GAT 2450
Asp Ala Met Arg Ser Glu Lys Gly Lys Asp Lys Glu Glu Ile Asn Asp
560 565 570

GAT TTG ATT TTT GAA ATT GAA CTC ATC AAA CAA GTG GAA GTC AAT ATT 2498
Asp Leu Ile Phe Glu Ile Glu Leu Ile Lys Gln Val Glu Val Asn Ile
575 580 585 590

GAC TAT ATT TTG AAT TTG ATT GAA GAG TTC GCT AAA GAG CAT GGG GTG 2546
Asp Tyr Ile Leu Asn Leu Ile Glu Glu Phe Ala Lys Glu His Gly Val
595 600 605

GAA ATC CAA GGC GTT AAA ACC AAA ATA GAG CCA ATC ATC AAC TCC AGC 2594
Glu Ile Gln Gly Val Lys Thr Lys Ile Glu Pro Ile Ile Asn Ser Ser
610 615 620
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TCT ATG CAT CAA GCC ATC ACT AAA GCG TTT AAA AAA TAC CAC CTT TTT 12 98 

Ser Met His Gln Ala Ile Thr Lys Ala Phe Lys Lys Tyr His Leu Phe
175 180 185 190

GGC TTT ACT GGC ACG CCC ATT TTT GCA GCT AAT TGC GAT AAA AAC AAC 1346
Gly Phe Thr Gly Thr Pro Ile Phe Ala Ala Asn Cys Asp Lys Asn Asn
195 200 205

CCT TTA GGC ACG ACA GAG CAA AAG TTT GGG AAA TGC CTC CAC CAA TAC 1394
Pro Leu Gly Thr Thr Glu Gln Lys Phe Gly Lys Cys Leu His Gln Tyr
210 215 220

ACC ATT ATT GAT GCG ATC AGG GAT AAA AAC GTT TTG CCC TTT AGA GTG 1442
Thr Ile Ile Asp Ala Ile Arg Asp Lys Asn Val Leu Pro Phe Arg Val
225 230 235

GAA TAC CAC AAC ACC ATT AAA GCT AAA GAG GAC ATT AAG GAT AAT AAG 1490
Glu Tyr His Asn Thr Ile Lys Ala Lys Glu Asp Ile Lys Asp Asn Lys
240 245 250

GTT AGA GCG GTT GAT GAA AAA AAC GCC CTT TTG GAT ACT AGG AGG ATC 1538
Val Arg Ala Val Asp Glu Lys Asn Ala Leu Leu Asp Thr Arg Arg Ile
255 260 265 270

AAA GAA ATC ACT AAA TGC ATT TTA GAG CGT TTC AAT CAA GCC ACT AAA 1586
Lys Glu Ile Thr Lys Cys Ile Leu Glu Arg Phe Asn Gln Ala Thr Lys
275 280 285

AAT AAA AAA TTC AAT TCC ATT CTG GCA TGC TCT AGC ATA GAA GCG CTG 1634
Asn Lys Lys Phe Asn Ser Ile Leu Ala Cys Ser Ser Ile Glu Ala Leu
290 295 300

AAA AAA TAC TAC CAA GCC TTT AAA GAA GAA AAA CAC GAT CTT AAA ATC 1682
Lys Lys Tyr Tyr Gln Ala Phe Lys Glu Glu Lys His Asp Leu Lys Ile
305 310 315

GCT GCC ATT TTT AGC TAT AGC GCT AAT GAG GAA ATT GAC ACG CTA GAA 1730
Ala Ala Ile Phe Ser Tyr Ser Ala Asn Glu Glu Ile Asp Thr Leu Glu
320 325 330

GAT GAA AAC AAT GAA AGC GCT TGC CGG CTA GAC AAA AGC TCA AGG GAT 1778
Asp Glu Asn Asn Glu Ser Ala Cys Arg Leu Asp Lys Ser Ser Arg Asp
335 340 345 350

TTT TTA GAG GGC GCG ATT GCG GAT TAT AAT GGG ATG TTT GGC GTT TCT 1826
Phe Leu Glu Gly Ala Ile Ala Asp Tyr Asn Gly Met Phe Gly Val Ser
355 360 365

TTT GAC ACT TCG GAT CAA AAA TTC CAA AGT TAT TAC AAG GAT CTT TCT 1874
Phe Asp Thr Ser Asp Gln Lys Phe Gln Ser Tyr Tyr Lys Asp Leu Ser
370 375 380

CAA AAA ATG AAA GAG CGT AAA ATC GAT CTT TTA ATG GTG GTG AAC ATG 1922
Gln Lys Met Lys Glu Arg Lys Ile Asp Leu Leu Met Val Val Asn Met
385 390 395
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GAAACGATCG CAGAAAGCAA TGAAAGCACG GTAGTAGCGG AATTTCATAG CAGTAATGAA 60 

AAAAAAGCGC TTATGAGAGC GAAGCAGAGC TAGAAAGGGC GTTTATTAAG CTTTTAGAAA 120

AACAAGGCTA TGAATTTAAA AAAATCCACA AAGAAGAAGA ATTAAAAGAC AATTTAAAAG 180

AGCAGTTAGA AAAGCTTAAT GATCATTCTT TCACGCCTAA AGAATGGGAC ACTCTTTATT 240

CTCAATTCAT CGCTAATAAA AACGATGACT ATAAGGCTAA AACGAAAAAG ATCCAAGAAG 300

ATCCGATTTT TAATCTCACG CTAGAGAACG GGAAAACCAA AAACATTAAA ATCATTGATA 360

AGAAAAATAT CCATAGAAAC GCCTTGCAAG TGATCCACCA ATACAGCAAT AAAGGGGGGA 420

AGTATCAAAA CCGCTATGAT GTGAGTATCC TTGTGAATGG CTTGCCTTTA GTGCATGTGG 480

AATTGAAAAA AAGAGGCGTG GCGATCAGGG AGGCGTTCAA CCAGATCAAG CGCTATAAAA 540

GGGATAGTTT TAGCGCTGAA GACGGGCTTT TTGATTTTGT GCAGATTTTT GTCATCAGTA 600

ACGGCACGAG CTCTAAATAC TATTCAAACA CCACAAGAAT AGCCCAGCTG GAAAAAAACC 660

ATAAAGCCGA TACTTTTGAA TTCACGAATT ATTGGGCGGA TAGCAAGAAT CACAATATTG 720

AGGATTTA ATG GAT TTT GCT AAG GCG TTT TTT GCA AAG CGC AGC CTT TTG 770
Met Asp Phe Ala Lys Ala Phe Phe Ala Lys Arg Ser Leu Leu
1 5 10

AAC GTT TTA ACG TGC TAT TGC GTT TTC ACA AGC GAA GAG GTT TTA TTG 818
Asn Val Leu Thr Cys Tyr Cys Val Phe Thr Ser Glu Glu Val Leu Leu
15 20 25 30

GTG ATG CGG CCT TAT CAA ATC GTG GCG GCC GAA AGG ATT TTG GAA AAG 866
Val Met Arg Pro Tyr Gln Ile Val Ala Ala Glu Arg Ile Leu Glu Lys
35 40 45

ATC AAA ACC GCG CAA AAT AGT AAA ACG AAA AAT CAA AGC AAA GGC TAT 914
Ile Lys Thr Ala Gln Asn Ser Lys Thr Lys Asn Gln Ser Lys Gly Tyr
50 55 60

ATC TGG CAC ACG ACA GGG AGC GGT AAA ACC CTA ACG AGC TTT AAA AGC 962
Ile Trp His Thr Thr Gly Ser Gly Lys Thr Leu Thr Ser Phe Lys Ser
65 70 75

GCA ACG TTG GCT AAA GAA TTA GAG AGC GTT TCA AAA GTC TTG TTC GTG 1010
Ala Thr Leu Ala Lys Glu Leu Glu Ser Val Ser Lys Val Leu Phe Val
80 85 90

GTG GAC AGG AAG GAT TTG GAC TAT CAA ACC ATG AAA GAA TAC GAT AAA 1058
Val Asp Arg Lys Asp Leu Asp Tyr Gln Thr Met Lys Glu Tyr Asp Lys
95 100 105 110

TTC CAA AAA GAT TGC GCT AAT TCC AAC ACA AGC ACT AAG ATT TTA AAA 1106
Phe Gln Lys Asp Cys Ala Asn Ser Asn Thr Ser Thr Lys Ile Leu Lys
115 120 125

GAA CAG CTT GAA GAT TCT AAC GCT AAA ATC ATT ATC ACC ACG ATC CAA 1154
Glu Gln Leu Glu Asp Ser Asn Ala Lys Ile Ile Ile Thr Thr Ile Gln
130 135 140

AAA TTA GAC AAA TTC GTT AAA TCC CAT AAA GGG CAT GCG ATT TTT AAT 1202
Lys Leu Asp Lys Phe Val Lys Ser His Lys Gly His Ala Ile Phe Asn
145 150 155

GAA GAA GTT GTG ATG ATT TTT GAT GAA TGC CAC AGG AGT CAG TTA GGC 1250
Glu Glu Val Val Met Ile Phe Asp Glu Cys His Arg Ser Gln Leu Gly
160 165 170
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:

Met Arg Ala Phe Leu Lys Ile Leu Met Val Leu Ile Phe Met Ser Val
1 5 10 15

Ala Tyr Ala Lys Asn Pro Ser Thr Leu Ser Lys Glu Glu Glu Val Leu
20 25 30

Gln His Leu Gln Ser Phe Ser Ala His Phe Lys Gln Val Leu Lys Asn
35 40 45

Glu Lys Pro Leu Val Tyr Tyr Gly Val Leu Lys Ala Lys Ala Pro Asn
50 55 60

Trp Ala Leu Trp Val Tyr Glu Lys Pro Leu Lys Lys Glu Ile Tyr Met
65 70 75 80

Asn Asp Lys Glu Val Val Ile Tyr Glu Pro Asn Leu Phe Gln Ala Thr
85 90 95

Ile Thr Pro Leu Lys Asp Lys Thr Asp Phe Phe Thr Ile Leu Lys Arg
100 105 110

Leu Lys Lys Gln Asp Asp Gly Ser Phe Lys Thr Thr Ile Asn Lys Thr
115 120 125

Thr Tyr Arg Leu Val Phe Lys Asp Gly Lys Pro Phe Ser Leu Glu Phe
130 135 140

Lys Asp Gly Met Asn Asn Leu Val Thr Ile Thr Phe Ser Gln Ala Glu
145 150 155 160

Ile Asn Pro Thr Ile Ala Asn Glu Ile Phe Val Phe Lys Pro Lys Asp
165 170 175

Glu Asn Ile Asp Ile Val Arg Gln
180

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3035 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 729...2981
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: 
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TCATGTAAAA TAAAGGGTTT TATTAAAGAT GAGAGATTGT TTTAAGGTTT GAATA ATG 58 

Met 
1 

AGA GCT TTT TTA AAG ATT TTA ATG GTT TTG ATT TTT ATG AGC GTT GCT 106
Arg Ala Phe Leu Lys Ile Leu Met Val Leu Ile Phe Met Ser Val Ala
5 10 15

TAT GCT AAA AAT CCT TCA ACG CTT TCT AAA GAA GAA GAG GTT TTG CAG 154
Tyr Ala Lys Asn Pro Ser Thr Leu Ser Lys Glu Glu Glu Val Leu Gln
20 25 30

CAT TTG CAA AGT TTT AGC GCG CAT TTC AAG CAG GTT TTA AAA AAT GAA 202
His Leu Gln Ser Phe Ser Ala His Phe Lys Gln Val Leu Lys Asn Glu
35 40 45

AAA CCT TTA GTT TAT TAC GGG GTT TTA AAG GCT AAA GCC CCT AAT TGG 250
Lys Pro Leu Val Tyr Tyr Gly Val Leu Lys Ala Lys Ala Pro Asn Trp
50 55 60 65

GCT TTA TGG GTT TAT GAA AAG CCT TTA AAA AAA GAA ATT TAC ATG AAC 298
Ala Leu Trp Val Tyr Glu Lys Pro Leu Lys Lys Glu Ile Tyr Met Asn
70 75 80

GAT AAA GAA GTG GTA ATT TAT GAG CCT AAT TTG TTT CAA GCG ACC ATC 346
Asp Lys Glu Val Val Ile Tyr Glu Pro Asn Leu Phe Gln Ala Thr Ile
85 90 95

ACG CCC TTA AAA GAC AAG ACG GAT TTT TTC ACC ATT CTC AAG CGT TTA 394
Thr Pro Leu Lys Asp Lys Thr Asp Phe Phe Thr Ile Leu Lys Arg Leu
100 105 110

AAA AAG CAA GAT GAC GGA TCT TTT AAA ACG ACT ATC AAC AAA ACC ACT 442
Lys Lys Gln Asp Asp Gly Ser Phe Lys Thr Thr Ile Asn Lys Thr Thr
115 120 125

TAT CGT TTG GTT TTT AAA GAC GGC AAG CCT TTT TCA TTG GAA TTT AAA 490
Tyr Arg Leu Val Phe Lys Asp Gly Lys Pro Phe Ser Leu Glu Phe Lys
130 135 140 145

GAT GGA ATG AAC AAT CTT GTA ACG ATC ACT TTT TCT CAA GCA GAA ATC 538
Asp Gly Met Asn Asn Leu Val Thr Ile Thr Phe Ser Gln Ala Glu Ile
150 155 160

AAC CCC ACC ATT GCT AAT GAA ATC TTT GTT TTT AAG CCT AAA GAT GAA 586
Asn Pro Thr Ile Ala Asn Glu Ile Phe Val Phe Lys Pro Lys Asp Glu
165 170 175

AAC ATT GAT ATT GTG CGC CAA TGATTTTTAA TGATTCATTG CATCTTGTTA GCAA 641
Asn Ile Asp Ile Val Arg Gln
180

AAGTTAGCTA AAATAGAC 659 
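
The three-letter translation printed under each codon row can be checked mechanically. The following is a minimal sketch, not part of the original listing, assuming the standard genetic code applies; the names CODON_TABLE, THREE_LETTER and translate are illustrative only. It translates the first seven codons of the coding sequence of SEQ ID NO: 277 as printed above.

```python
# Illustrative sketch (not from the original listing): translate codons
# with the standard genetic code and report three-letter residue names.
BASES = "TCAG"
# One-letter residues in the conventional TCAG codon order; '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}


def translate(dna):
    """Return three-letter residue names for an in-frame DNA string."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]]
            for i in range(0, len(dna), 3)]


# First codons of the SEQ ID NO: 277 coding sequence, as printed in the listing.
print(" ".join(translate("ATG AGA GCT TTT TTA AAG ATT")))
# prints: Met Arg Ala Phe Leu Lys Ile
```

Because TAA, TAG and TGA translate to Stop, a stop codon printed inside a coding region above a Tyr label would suggest a transcription slip in the nucleotide row (for example TAC read as TAG) rather than a genuine stop.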
(2) INFORMATION FOR SEQ ID NO: 278:
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(2) INFORMATION FOR SEQ ID NO: 276: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276:



Met Gly Met Leu Ala Thr Val Ile Asn Ala Val Ala Met Gln Glu Ala
1 5 10 15

Leu Glu His Ile Gly Leu Asp Thr Arg Val Gln Ser Ala Ile Glu Ile
20 25 30

Lys Glu Ile Cys Glu Ser Tyr Ile Tyr Arg Lys Ala Ile Arg His Leu
35 40 45

Glu Lys Gly Arg Val Val Ile Phe Gly Ala Gly Thr Gly Asn Pro Phe
50 55 60

Phe Thr Thr Asp Thr Ala Ala Thr Leu Arg Ala Ile Glu Ile Gly Ser
65 70 75 80

Asp Leu Ile Ile Lys Ala Thr Lys Val Asp Gly Ile Tyr Asp Lys Asp
85 90 95

Pro Asn Lys Phe Lys Asp Ala Lys Lys Leu Asp Thr Leu Ser Tyr Asn
100 105 110

Asp Ala Leu Ile Gly Asp Ile Glu Val Met Asp Asp Thr Ala Ile Ser
115 120 125

Leu Ala Lys Asp Asn Lys Leu Pro Ile Val Val Cys Asn Met Phe Lys
130 135 140

Lys Gly Asn Leu Leu Gln Val Ile Lys His Gln Gln Gly Val Phe Ser
145 150 155 160

Met Val Lys



(2) INFORMATION FOR SEQ ID NO: 277:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence

(B) LOCATION: 56...607
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 
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6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

GCAAAGATAA AAAACAAACG GGTTTTGGTG AAATTTTCTG GGGAAGCGTT AGCTGGNGGA 60 
CAACCAGTTT GGGATTGACA TTCATGTGTT AGATCACATC GCTAAAGAGA TCAAAAGTTT 120
AGTGGAAAAC GATATTGAAG TGGGTATTGT GATTGGTGGA GGCAATATTA TTAGGGGGGT 180
TAGCGCGGCT CAAGGGGGGA TTATTAGGCG CACCAGTGGG GATTAT ATG GGC ATG 235
Met Gly Met
1

TTA GCC ACC GTG ATT AAT GCG GTA GCG ATG CAA GAA GCT TTA GAG CAT 283
Leu Ala Thr Val Ile Asn Ala Val Ala Met Gln Glu Ala Leu Glu His
5 10 15

ATC GGC TTA GAC ACA AGG GTG CAG AGC GCG ATT GAA ATC AAA GAG ATT 331
Ile Gly Leu Asp Thr Arg Val Gln Ser Ala Ile Glu Ile Lys Glu Ile
20 25 30 35

TGT GAA AGT TAC ATT TAC AGA AAA GCG ATC AGG CAT TTA GAA AAG GGT 379
Cys Glu Ser Tyr Ile Tyr Arg Lys Ala Ile Arg His Leu Glu Lys Gly
40 45 50

AGG GTG GTG ATT TTT GGC GCA GGC ACG GGA AAC CCG TTT TTC ACT ACG 427
Arg Val Val Ile Phe Gly Ala Gly Thr Gly Asn Pro Phe Phe Thr Thr
55 60 65

GAT ACG GCT GCC ACT TTA AGA GCG ATT GAA ATT GGA TCG GAT TTA ATC 475
Asp Thr Ala Ala Thr Leu Arg Ala Ile Glu Ile Gly Ser Asp Leu Ile
70 75 80

ATT AAA GCG ACT AAA GTG GAT GGC ATT TAC GAC AAA GAT CCT AAC AAG 523
Ile Lys Ala Thr Lys Val Asp Gly Ile Tyr Asp Lys Asp Pro Asn Lys
85 90 95

TTT AAA GAC GCT AAA AAA TTA GAC ACT TTA AGC TAT AAC GAT GCC TTG 571
Phe Lys Asp Ala Lys Lys Leu Asp Thr Leu Ser Tyr Asn Asp Ala Leu
100 105 110 115

ATA GGG GAT ATT GAA GTG ATG GAC GAT ACC GCT ATT TCT TTA GCT AAA 619
Ile Gly Asp Ile Glu Val Met Asp Asp Thr Ala Ile Ser Leu Ala Lys
120 125 130

GAC AAT AAG CTC CCC ATT GTG GTG TGT AAC ATG TTC AAA AAA GGG AAT 667
Asp Asn Lys Leu Pro Ile Val Val Cys Asn Met Phe Lys Lys Gly Asn
135 140 145

TTA TTG CAA GTG ATC AAG CAC CAA CAA GGC GTA TTT TCT ATG GTA AAA T 716
Leu Leu Gln Val Ile Lys His Gln Gln Gly Val Phe Ser Met Val Lys
150 155 160

AAGCCCTTTA ACATTGGATA GAACTCAAAA TAAAAGGATC AGTTTGAAAA AAGAG 771
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Pro Leu Leu Glu Ser Lys Asn Asn Ala lie Gly Leu Leu His Leu Ala 

130 135 140 

Asp Pro Lys Gly Tyr Gly Arg Val Val Leu Glu Asn His Gln Val Lys
145 150 155 160

Lys Ile Val Glu Glu Lys Asp Ala Asn Asp Glu Glu Lys Glu Ile Lys
165 170 175

Ser Val Asn Ala Gly Val Tyr Gly Phe Glu Arg Asp Phe Leu Glu Lys
180 185 190

Tyr Leu Pro Lys Leu His Asp Gln Asn Ala Gln Lys Glu Tyr Tyr Leu
195 200 205

Thr Asp Leu Ile Ala Leu Gly Ile Asn Glu Asn Glu Thr Ile Asp Ala
210 215 220

Ile Phe Leu Lys Glu Glu Cys Phe Leu Gly Val Asn Ser Gln Thr Glu
225 230 235 240

Arg Ala Lys Ala Glu Glu Ile Met Leu Glu Arg Leu Arg Lys Asn Ala
245 250 255

Met Asp Leu Gly Val Val Met Gln Leu Pro Asn Ser Ile Tyr Leu Glu
260 265 270

Lys Gly Val Ser Phe Lys Gly Glu Cys Val Leu Glu Gln Gly Val Arg
275 280 285

Leu Ile Gly Asn Cys Leu Ile Glu Asn Ala His Ile Lys Ala Tyr Ser
290 295 300

Val Ile Glu Glu Ser Gln Ile Val Asn Ser Ser Val Gly Pro Phe Ala
305 310 315 320

His Ala Arg Pro Lys Ser Val Ile Cys Asn Ser His Val Gly Asn Phe
325 330 335

Val Glu Thr Lys Asn Ala Lys Leu Gln Gly Thr Lys Ala Gly His Leu
340 345 350

Ser Tyr Leu Gly Asp Cys Glu Ile Gly Lys Asn Thr Asn Val Gly Ala
355 360 365

Gly Val Ile Thr Cys Asn Tyr Asp Gly Lys Lys Lys His Gln Thr Ile
370 375 380

Ile Gly Glu Asn Val Phe Ile Gly Ser Asp Ser Gln Leu Val Ala Pro
385 390 395 400

Ile Asn Ile Gly Ser Asn Val Leu Ile Gly Ser Gly Thr Thr Ile Thr
405 410 415

Lys Asp Ile Pro Ser Gly Ser Leu Ser Leu Ser Arg Ala Pro Gln Thr
420 425 430

Asn Ile Glu Asn Gly Tyr Phe Lys Phe Phe Lys Lys Pro
435 440 445

(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 227... 715 
(D) OTHER INFORMATION: 
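
The LOCATION field of each coding-sequence feature can be cross-checked against the stated length of the encoded protein. A minimal sketch follows; it is not part of the original listing, the function name is hypothetical, and it assumes each location is an inclusive, 1-based range that excludes the stop codon, which is how the entries in this listing appear to be written.

```python
def expected_protein_length(start, end):
    """Number of residues encoded by an inclusive, 1-based CDS range."""
    span = end - start + 1
    if span <= 0 or span % 3:
        raise ValueError("range is not a whole number of codons")
    return span // 3


# (CDS start, CDS end, stated protein length) as given in this listing.
records = {
    "SEQ ID NO: 273/274": (56, 1390, 445),
    "SEQ ID NO: 275/276": (227, 715, 163),
    "SEQ ID NO: 277/278": (56, 607, 184),
    "SEQ ID NO: 279/280": (729, 2981, 751),
}

for name, (start, end, stated) in records.items():
    computed = expected_protein_length(start, end)
    status = "matches" if computed == stated else "differs"
    print(name, computed, status)
```

For the four nucleotide/protein pairs shown here, the computed and stated lengths agree.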






WO 98/43478



PCT/US98/06371 



TAT TTA GGG GAT TGT GAG ATA GGG AAA AAC ACA AAT GTA GGG GCT GGC 1162
Tyr Leu Gly Asp Cys Glu Ile Gly Lys Asn Thr Asn Val Gly Ala Gly
355 360 365

GTG ATC ACT TGC AAT TAC GAT GGT AAA AAG AAA CAC CAA ACA ATC ATC 1210
Val Ile Thr Cys Asn Tyr Asp Gly Lys Lys Lys His Gln Thr Ile Ile
370 375 380 385

GGT GAA AAT GTC TTT ATA GGG AGC GAT AGC CAG CTA GTC GCC CCC ATA 1258
Gly Glu Asn Val Phe Ile Gly Ser Asp Ser Gln Leu Val Ala Pro Ile
390 395 400

AAT ATC GGC TCT AAT GTC TTA ATC GGC AGC GGC ACC ACT ATC ACT AAA 1306
Asn Ile Gly Ser Asn Val Leu Ile Gly Ser Gly Thr Thr Ile Thr Lys
405 410 415

GAC ATT CCT AGC GGT TCG TTG AGC CTT TCA CGC GCC CCT CAA ACC AAC 1354
Asp Ile Pro Ser Gly Ser Leu Ser Leu Ser Arg Ala Pro Gln Thr Asn
420 425 430

ATT GAA AAC GGG TAT TTT AAG TTT TTT AAG AAA CCT TAATTTGTTT GAATAA 1406
Ile Glu Asn Gly Tyr Phe Lys Phe Phe Lys Lys Pro
435 440 445

TGAAAAATCC TAAAATATTA ATCATTTACT TTAA 1440

(2) INFORMATION FOR SEQ ID NO:274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274: 

Met Leu Lys Phe Pro Lys Met Ser Leu Arg Ile Leu Met Leu Ser Val
1 5 10 15

Ile Ile Leu Ala Ala Gly Lys Gly Thr Arg Met Arg Ser Ser Leu Pro
20 25 30

Lys Thr Leu His Thr Ile Cys Gly Glu Pro Met Leu Phe Tyr Ile Leu
35 40 45

Glu Thr Ala Phe Ser Ile Ser Asp Asp Val His Leu Ile Leu His His
50 55 60

Gln Gln Glu Arg Ile Lys Glu Ala Val Leu Glu Arg Phe Lys Gly Val
65 70 75 80

Ile Phe His Thr Gln Ile Val Glu Lys Tyr Ser Gly Thr Gly Gly Ala
85 90 95

Ile Met Gln Lys Asp Lys Thr Pro Ile Ser Thr Lys His Glu Arg Val
100 105 110

Leu Ile Leu Asn Ala Asp Met Pro Leu Ile Thr Lys Asp Ala Leu Ala
115 120 125
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TTA TTA GAA AGC AAG AAT AAC OCT ATA GGC TTA CTC CAT TTA GCT GAC 4 90 

Leu Leu Glu Ser Lys Asn Asn Ala Ile Gly Leu Leu His Leu Ala Asp
130 135 140 145

CCT AAA GGT TAT GGG CGC GTT GTT TTA GAA AAC CAT CAG GTT AAA AAG 538
Pro Lys Gly Tyr Gly Arg Val Val Leu Glu Asn His Gln Val Lys Lys
150 155 160

ATT GTA GAA GAA AAG GAC GCT AAT GAT GAA GAA AAA GAA ATT AAA AGC 586
Ile Val Glu Glu Lys Asp Ala Asn Asp Glu Glu Lys Glu Ile Lys Ser
165 170 175

GTG AAT GCT GGC GTG TAT GGG TTT GAA AGG GAT TTT TTA GAA AAA TAC 634
Val Asn Ala Gly Val Tyr Gly Phe Glu Arg Asp Phe Leu Glu Lys Tyr
180 185 190

TTA CCC AAG CTC CAT GAC CAA AAC GCC CAA AAA GAA TAC TAC CTC ACG 682
Leu Pro Lys Leu His Asp Gln Asn Ala Gln Lys Glu Tyr Tyr Leu Thr
195 200 205

GAT TTA ATC GCT CTA GGG ATC AAT GAA AAC GAA ACA ATT GAC GCT ATT 730
Asp Leu Ile Ala Leu Gly Ile Asn Glu Asn Glu Thr Ile Asp Ala Ile
210 215 220 225

TTC TTA AAA GAA GAG TGT TTT TTA GGG GTG AAT AGC CAA ACA GAA AGG 778
Phe Leu Lys Glu Glu Cys Phe Leu Gly Val Asn Ser Gln Thr Glu Arg
230 235 240

GCG AAA GCT GAA GAA ATC ATG CTA GAA AGA CTG CGC AAA AAC GCC ATG 826
Ala Lys Ala Glu Glu Ile Met Leu Glu Arg Leu Arg Lys Asn Ala Met
245 250 255

GAC TTG GGG GTA GTG ATG CAA TTG CCT AAT AGC ATT TAT TTA GAA AAA 874
Asp Leu Gly Val Val Met Gln Leu Pro Asn Ser Ile Tyr Leu Glu Lys
260 265 270

GGC GTG AGT TTT AAG GGG GAG TGC GTT TTA GAG CAA GGG GTG CGT TTG 922
Gly Val Ser Phe Lys Gly Glu Cys Val Leu Glu Gln Gly Val Arg Leu
275 280 285

ATT GGG AAT TGT TTG ATA GAA AAC GCG CAT ATT AAG GCT TAT AGC GTG 970
Ile Gly Asn Cys Leu Ile Glu Asn Ala His Ile Lys Ala Tyr Ser Val
290 295 300 305

ATA GAA GAG AGC CAG ATT GTT AAT AGC AGT GTG GGG CCG TTT GCC CAT 1018
Ile Glu Glu Ser Gln Ile Val Asn Ser Ser Val Gly Pro Phe Ala His
310 315 320

GCG CGC CCT AAA AGC GTG ATT TGT AAT AGC CAT GTG GGG AAT TTT GTA 1066
Ala Arg Pro Lys Ser Val Ile Cys Asn Ser His Val Gly Asn Phe Val
325 330 335

GAG ACT AAA AAC GCT AAA CTT CAA GGC ACT AAA GCA GGG CAT TTG AGC 1114
Glu Thr Lys Asn Ala Lys Leu Gln Gly Thr Lys Ala Gly His Leu Ser
340 345 350
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(2) INFORMATION FOR SEQ ID NO:273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 56...1390
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

GCAAAATTCT AGCCTTAAAT CTTTGATGAA ACGAAGTCAA ATTATAAGAT AAGGC ATG 58 

Met 
1 

TTA AAA TTC CCT AAA ATG AGT TTA AGG ATT TTA ATG CTT TCT GTC ATC 106 
Leu Lys Phe Pro Lys Met Ser Leu Arg Ile Leu Met Leu Ser Val Ile
5 10 15

ATA CTG GCC GCT GGT AAA GGC ACT CGC ATG CGT TCT AGC CTG CCT AAA 154
Ile Leu Ala Ala Gly Lys Gly Thr Arg Met Arg Ser Ser Leu Pro Lys
20 25 30

ACT TTA CAC ACC ATT TGT GGG GAG CCT ATG TTG TTT TAC ATT TTA GAA 202
Thr Leu His Thr Ile Cys Gly Glu Pro Met Leu Phe Tyr Ile Leu Glu
35 40 45

ACG GCT TTT TCA ATC AGC GAT GAT GTG CAT CTT ATC TTA CAC CAC CAA 250
Thr Ala Phe Ser Ile Ser Asp Asp Val His Leu Ile Leu His His Gln
50 55 60 65

CAA GAA CGC ATT AAA GAA GCG GTG TTG GAG CGT TTT AAG GGC GTC ATT 298
Gln Glu Arg Ile Lys Glu Ala Val Leu Glu Arg Phe Lys Gly Val Ile
70 75 80

TTT CAC ACT CAA ATT GTG GAA AAA TAT TCA GGG ACA GGT GGG GCT ATC 346
Phe His Thr Gln Ile Val Glu Lys Tyr Ser Gly Thr Gly Gly Ala Ile
85 90 95

ATG CAA AAA GAT AAA ACG CCT ATT TCT ACG AAA CAT GAG CGG GTT TTG 394
Met Gln Lys Asp Lys Thr Pro Ile Ser Thr Lys His Glu Arg Val Leu
100 105 110

ATT TTG AAT GCG GAC ATG CCT TTA ATC ACT AAA GAC GCT CTC GCC CCC 442
Ile Leu Asn Ala Asp Met Pro Leu Ile Thr Lys Asp Ala Leu Ala Pro
115 120 125
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Glu Leu Tyr Gly 
Gln Asp Phe Glu Lys Arg Tyr Leu Glu Tyr Glu Lys
195 200 205

Asp Pro Lys Ile Ile Lys Glu Tyr Ile Asn Ala Lys Asp Leu Trp Lys
210 215 220

Lys Ile Leu Met Asn Tyr Phe Glu Ala Gly Leu Pro Phe Leu Ala Phe
225 230 235 240

Lys Asp Asn Ala Asn Arg Cys Asn Pro Asn Ala His Ala Gly Ile Ile
245 250 255

Arg Ser Ser Asn Leu Cys Thr Glu Ile Phe Gln Asn Thr Ala Pro Asn
260 265 270

His Tyr Tyr Met Gln Ile Glu Tyr Thr Asp Gly Thr Ile Glu Phe Phe
275 280 285

Glu Glu Lys Glu Leu Val Thr Thr Asp Ser Asn Ile Thr Lys Cys Ala
290 295 300

Asn Lys Leu Thr Ser Thr Asp Ile Leu Lys Gly Lys Pro Ile Tyr Ile
305 310 315 320

Ala Thr Lys Val Ala Lys Asp Gly Gln Thr Ala Val Cys Asn Leu Ala
325 330 335

Ser Ile Asn Leu Ser Lys Ile Asn Thr Glu Glu Asp Ile Lys Arg Val
340 345 350

Val Pro Ile Met Val Arg Leu Leu Asp Asn Val Ile Asp Leu Asn Phe
355 360 365

Tyr Pro Asn Arg Lys Val Lys Ala Thr Asn Leu Gln Asn Arg Ala Ile
370 375 380

Gly Leu Gly Val Met Gly Glu Ala Gln Met Leu Ala Glu His Gln Ile
385 390 395 400

Ala Trp Gly Ser Lys Glu His Leu Glu Lys Ile Asp Ala Leu Met Glu
405 410 415

Gln Ile Ser Tyr His Ala Ile Asp Thr Ser Ala Asn Leu Ala Lys Glu
420 425 430

Lys Gly Val Tyr Lys Asp Phe Glu Asn Ser Glu Trp Ser Lys Gly Ile
435 440 445

Phe Pro Ile Asp Lys Ala Asn Asn Glu Ala Leu Lys Leu Thr Glu Lys
450 455 460

Gly Leu Phe Asn His Ala Cys Asp Trp Gln Gly Leu Arg Glu Lys Val
465 470 475 480

Lys Ala Asn Gly Met Arg Asn Gly Tyr Leu Met Ala Ile Ala Pro Thr
485 490 495

Ser Ser Ile Ser Ile Leu Val Gly Thr Thr Gln Thr Ile Glu Pro Ile
500 505 510

Tyr Lys Lys Lys Trp Phe Glu Glu Asn Leu Ser Gly Leu Ile Pro Val
515 520 525

Val Val Pro Asn Leu Asn Val Glu Thr Trp Asn Phe Tyr Thr Ser Ala
530 535 540

Tyr Asp Ile Asp Ala Lys Asp Leu Ile Lys Ala Ala Ala Val Arg Gln
545 550 555 560

Lys Trp Ile Asp Gln Gly Gln Ser Leu Asn Val Phe Leu Arg Ile Glu
565 570 575

Asn Ala Ser Gly Lys Thr Leu His Asp Ile Tyr Thr Leu Ala Trp Lys
580 585 590

Leu Gly Leu Lys Ser Thr Tyr Tyr Leu Arg Ser Glu Ser Pro Ser Ile
595 600 605

Asp Glu Lys Ser Val Leu Asp Arg Ser Val Glu Cys Phe Asn Cys Gln
610 615 620
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ATA GAA AAC GCC AGC GGT AAA ACC TTG CAT GAG ATC TAG ACG CTC GCT 2317 
Ile Glu Asn Ala Ser Gly Lys Thr Leu His Asp Ile Tyr Thr Leu Ala
575 580 585 590

TGG AAA TTA GGA CTC AAA TCC ACT TAT TAT TTG CGC AGC GAA AGC CCT 2365
Trp Lys Leu Gly Leu Lys Ser Thr Tyr Tyr Leu Arg Ser Glu Ser Pro
595 600 605

AGC ATA GAT GAA AAA AGC GTG TTG GAT CGA TCG GTG GAG TGT TTT AAT 2413
Ser Ile Asp Glu Lys Ser Val Leu Asp Arg Ser Val Glu Cys Phe Asn
610 615 620

TGC CAA TAATATAAGC TTAAATAAGC TAATCTTTGC TAAAATGAGA TTTAAAATTA TT 2471
Cys Gln

TA 2473
(2) INFORMATION FOR SEQ ID NO: 272: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:272: 



Met Phe Met Ser Ile Ala Met Phe Leu Ala Gln Asn Glu Gln Glu Pro
1 5 10 15

Asn Lys Ile Ala Leu Glu Phe Tyr Glu Val Leu Ser Lys Phe Glu Ala
20 25 30

Met Cys Ala Thr Pro Thr Leu Ala Asn Ala Arg Thr Thr Lys His Gln
35 40 45

Leu Ser Ser Cys Tyr Ile Gly Ser Thr Pro Asp Asn Ile Glu Gly Ile
50 55 60

Phe Asp Ser Tyr Lys Glu Met Ala Leu Leu Ser Lys Tyr Gly Gly Gly
65 70 75 80

Ile Gly Trp Asp Phe Ser Leu Val Arg Ser Ile Gly Ser Tyr Ile Asp
85 90 95

Gly His Lys Asn Ala Ser Ala Gly Thr Ile Pro Phe Leu Lys Ile Ala
100 105 110

Asn Asp Val Ala Ile Ala Val Asp Gln Leu Gly Thr Arg Lys Gly Ala
115 120 125

Ile Ala Val Tyr Leu Glu Ile Trp His Ile Asp Val Met Glu Phe Ile
130 135 140

Asp Leu Arg Lys Asn Ser Gly Asp Glu Arg Arg Arg Ala His Asp Leu
145 150 155 160

Phe Pro Ala Leu Trp Val Cys Asp Leu Phe Leu Lys Arg Val Leu Glu
165 170 175

Asp Ala Met Trp Thr Leu Phe Asp Pro Tyr Glu Cys Lys Asp Leu Thr
180 185 190
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Arg Val Val Pro lie Met Val Arg Leu Leu Asp Asn Val lie Asp Leu 
355 360 365 

AAT TTC TAC CCT AAC CGC AAA GTC AAA GCC ACT AAT TTA CAA AAT AGG 1693
Asn Phe Tyr Pro Asn Arg Lys Val Lys Ala Thr Asn Leu Gln Asn Arg
370 375 380

GCC ATA GGG TTA GGG GTT ATG GGT GAA GCG CAA ATG CTC GCA GAA CAC 1741
Ala Ile Gly Leu Gly Val Met Gly Glu Ala Gln Met Leu Ala Glu His
385 390 395

CAA ATC GCT TGG GGG TCT AAA GAG CAT TTA GAA AAA ATT GAC GCT TTA 1789
Gln Ile Ala Trp Gly Ser Lys Glu His Leu Glu Lys Ile Asp Ala Leu
400 405 410

ATG GAG CAA ATC AGC TAC CAT GCG ATT GAC ACG AGC GCG AAT TTA GCG 1837
Met Glu Gln Ile Ser Tyr His Ala Ile Asp Thr Ser Ala Asn Leu Ala
415 420 425 430

AAA GAA AAA GGG GTT TAT AAG GAT TTT GAA AAT TCA GAA TGG AGT AAG 1885
Lys Glu Lys Gly Val Tyr Lys Asp Phe Glu Asn Ser Glu Trp Ser Lys
435 440 445

GGG ATT TTC CCC ATT GAT AAA GCC AAT AAT GAA GCC TTA AAG CTC ACC 1933
Gly Ile Phe Pro Ile Asp Lys Ala Asn Asn Glu Ala Leu Lys Leu Thr
450 455 460

GAA AAA GGG CTT TTT AAT CAC GCT TGC GAT TGG CAA GGT TTG AGG GAA 1981
Glu Lys Gly Leu Phe Asn His Ala Cys Asp Trp Gln Gly Leu Arg Glu
465 470 475

AAA GTC AAA GCC AAT GGC ATG CGT AAT GGC TAT TTA ATG GCG ATC GCT 2029
Lys Val Lys Ala Asn Gly Met Arg Asn Gly Tyr Leu Met Ala Ile Ala
480 485 490

CCC ACA AGC TCC ATT TCT ATT TTA GTA GGC ACA ACC CAA ACG ATT GAA 2077
Pro Thr Ser Ser Ile Ser Ile Leu Val Gly Thr Thr Gln Thr Ile Glu
495 500 505 510

CCC ATT TAT AAG AAA AAA TGG TTT GAA GAA AAT TTG AGC GGG CTT ATT 2125
Pro Ile Tyr Lys Lys Lys Trp Phe Glu Glu Asn Leu Ser Gly Leu Ile
515 520 525

CCT GTT GTG GTG CCT AAT TTG AAT GTA GAA ACC TGG AAT TTT TAC ACA 2173
Pro Val Val Val Pro Asn Leu Asn Val Glu Thr Trp Asn Phe Tyr Thr
530 535 540

TCA GCC TAT GAT ATT GAC GCT AAA GAT TTG ATT AAA GCA GCG GCC GTG 2221
Ser Ala Tyr Asp Ile Asp Ala Lys Asp Leu Ile Lys Ala Ala Ala Val
545 550 555

CGC CAA AAG TGG ATT GAT CAA GGC CAA AGC CTT AAT GTG TTT TTA CGC 2269
Arg Gln Lys Trp Ile Asp Gln Gly Gln Ser Leu Asn Val Phe Leu Arg
560 565 570
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130 135 140 

TTC ATT GAT TTA AGG AAA AAT AGC GGC GAT GAA AGG CGA AGA GCG CAT 1021
Phe Ile Asp Leu Arg Lys Asn Ser Gly Asp Glu Arg Arg Arg Ala His
145 150 155

GAT TTA TTC CCG GCT CTT TGG GTG TGC GAT TTG TTT TTG AAA AGG GTT 1069
Asp Leu Phe Pro Ala Leu Trp Val Cys Asp Leu Phe Leu Lys Arg Val
160 165 170

TTA GAA GAT GCG ATG TGG ACT TTA TTT GAC CCT TAT GAG TGT AAG GAT 1117
Leu Glu Asp Ala Met Trp Thr Leu Phe Asp Pro Tyr Glu Cys Lys Asp
175 180 185 190

TTG ACT GAG CTT TAT GGG CAG GAT TTT GAA AAA CGC TAT TTA GAG TAT 1165
Leu Thr Glu Leu Tyr Gly Gln Asp Phe Glu Lys Arg Tyr Leu Glu Tyr
195 200 205

GAA AAA GAT CCC AAG ATC ATT AAG GAA TAC ATT AAC GCT AAA GAT TTA 1213
Glu Lys Asp Pro Lys Ile Ile Lys Glu Tyr Ile Asn Ala Lys Asp Leu
210 215 220

TGG AAA AAA ATC TTA ATG AAT TAT TTT GAA GCC GGT TTG CCT TTC TTA 1261
Trp Lys Lys Ile Leu Met Asn Tyr Phe Glu Ala Gly Leu Pro Phe Leu
225 230 235

GCC TTT AAA GAT AAC GCC AAT CGG TGC AAC CCA AAC GCT CAT GCA GGA 1309
Ala Phe Lys Asp Asn Ala Asn Arg Cys Asn Pro Asn Ala His Ala Gly
240 245 250

ATC ATT CGA TCC AGC AAT CTA TGC ACG GAG ATT TTC CAA AAT ACC GCG 1357
Ile Ile Arg Ser Ser Asn Leu Cys Thr Glu Ile Phe Gln Asn Thr Ala
255 260 265 270

CCT AAC CAC TAC TAC ATG CAA ATA GAA TAC ACC GAC GGC ACC ATA GAG 1405
Pro Asn His Tyr Tyr Met Gln Ile Glu Tyr Thr Asp Gly Thr Ile Glu
275 280 285

TTT TTT GAA GAA AAA GAG TTG GTA ACG ACA GAT AGT AAT ATC ACT AAA 1453
Phe Phe Glu Glu Lys Glu Leu Val Thr Thr Asp Ser Asn Ile Thr Lys
290 295 300

TGC GCT AAC AAG CTC ACT AGC ACC GAT ATT CTA AAG GGC AAG CCA ATC 1501
Cys Ala Asn Lys Leu Thr Ser Thr Asp Ile Leu Lys Gly Lys Pro Ile
305 310 315

TAT ATC GCT ACT AAA GTC GCT AAA GAC GGG CAA ACG GCG GTG TGC AAT 1549
Tyr Ile Ala Thr Lys Val Ala Lys Asp Gly Gln Thr Ala Val Cys Asn
320 325 330

CTG GCG AGC ATC AAT TTA AGC AAA ATC AAC ACT GAA GAA GAC ATT AAA 1597
Leu Ala Ser Ile Asn Leu Ser Lys Ile Asn Thr Glu Glu Asp Ile Lys
335 340 345 350

AGG GTT GTG CCG ATC ATG GTC AGG CTT TTA GAC AAT GTG ATT GAT TTG 1645 
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 548... 2419 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271:

GATACAATTC CAAATTTAAA AAACAAACGA TTTAATTCAA ATTTAAGGAA AAATTTTGAT 60 

TACAGTGGTT AAACGAAACG GGCGCATTGA GCCTTTGGAC ATTACCAAAA TCCAAAAATA 120 

CACTAAGGAC GCTACGGACA ATTTAGAGGG CGTGAGCCAA AGTGAGCTGG AAGTGGATGC 180 

GAGGTTGCAA TTCAGGGACA AGATCACTAC TGAAGAAATC CAACAAACTT TGATTAAAAC 240

CGCTGTGGAT AAGATAGATA TTGACACGCC TAATTGGAGT TTTGTCGCCT CAAGGCTTTT 300

TTTGTATGAT TTATACCATA AAGTAAGTGG TTTTACAGGG TATAGGCATT TGAAAGAGTA 360

TTTTGAAAAC GCTGAAGAAA AGGGCCGCAT CCTTAAGGGC TTTAAGGAAA AATTTGATCT 420

AGAGTTTTTA AATAGCCAGA TCAAGCCTGA AAGGGATTTC CAATTCAATT ATTTAGGGAT 480

TAAAACCTTG TATGATCGCT ATTTGTTAAA AGACGCTAAC AACAACCCTA TTGAATTGCC 540

CCAACAC ATG TTT ATG AGC ATT GCG ATG TTT TTA GCA CAA AAC GAA CAA 58 9 



Met Phe Met Ser lie Ala Met Phe Leu Ala Gin Asn Glu Gin 
15 10 



GAA CCC AAT AAA ATC GCC TTA GAA TTT TAT GAA GTT TTG AGC AAG TTT 
Glu Pro Asn Lys lie Ala Leu Glu Phe Tyr Glu Val Leu Ser Lys Phe 
15 20 25 30 



637 



GAA GCG ATG TGC GCG ACC CCC ACT CTA GCG AAC GCC CGC ACC ACC AAA 
Glu Ala Met Cys Ala Thr Pro Thr Leu Ala Asn Ala Arg Thr Thr Lys 
35 40 45 



685 



CAC CAG CTC AGC TCA TGC TAT ATT GGC AGC ACG CCG GAT AAT ATT GAG 
His Gin Leu Ser Ser Cys Tyr lie Gly Ser Thr Pro Asp Asn lie Glu 
50 55 60 



733 



GGG ATT TTT GAC AGC TAT AAG GAA ATG GCG CTG TTG TCC AAA TAC GGC 
Gly lie Phe Asp Ser Tyr Lys Glu Met Ala Leu Leu Ser Lys Tyr Gly 
65 70 75 



781 



GGA GGG ATT GGC TGG GAT TTT TCT TTG GTG CGC TCT ATT GGG AGT TAT 
Gly Gly lie Gly Trp Asp Phe Ser Leu Val Arg Ser lie Gly Ser Tyr 
80 85 90 



829 



ATT GAT GGG CAT AAA AAT GCG AGC GCT GGC ACG ATC CCT TTT TTA AAA 
lie Asp Gly His Lys Asn Ala Ser Ala Gly Thr lie Pro Phe Leu Lys 
95 100 105 110 



877 



ATC GCT AAC GAT GTG GCG ATT GCG GTG GAT CAA TTA GGC ACA CGA AAG 
lie Ala Asn Asp Val Ala lie Ala Val Asp Gin Leu Gly Thr Arg Lys 
115 120 125 



925 



GGC GCG ATT GCG GTG TAT TTG GAA ATT TGG CAC ATT GAT GTG ATG GAG 
Gly Ala lie Ala Val Tyr Leu Glu lie Trp His lie Asp Val Met Glu 



973 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270: 

Met Lys Phe Leu Arg Ser Val Tyr Ala Phe Cys Ser Ser Trp Val Gly
1 5 10 15

Thr Ile Val Ile Val Leu Leu Val Ile Phe Phe Ile Ala Gln Ala Phe
20 25 30

Ile Ile Pro Ser Arg Ser Met Val Gly Thr Leu Tyr Glu Gly Asp Met
35 40 45

Leu Phe Val Lys Lys Phe Ser Tyr Gly Ile Pro Ile Pro Lys Ile Pro
50 55 60

Trp Ile Glu Leu Pro Val Met Pro Asp Phe Lys Asn Asn Gly His Leu
65 70 75 80

Ile Glu Gly Asp Arg Pro Lys Arg Gly Glu Val Val Val Phe Ile Pro
85 90 95

Pro His Glu Lys Lys Ser Tyr Tyr Val Lys Arg Asn Phe Ala Ile Gly
100 105 110

Gly Asp Glu Val Leu Phe Thr Asn Glu Gly Phe Tyr Leu His Pro Phe
115 120 125

Glu Ser Asp Thr Asp Lys Asn Tyr Ile Ala Lys His Tyr Pro Asn Ala
130 135 140

Met Thr Lys Glu Phe Met Gly Lys Ile Phe Val Leu Asn Pro Tyr Lys
145 150 155 160

Asn Glu His Pro Gly Ile His Tyr Gln Lys Asp Asn Glu Thr Phe His
165 170 175

Leu Met Glu Gln Leu Ala Thr Gln Gly Ala Glu Ala Asn Ile Ser Met
180 185 190

Gln Leu Ile Gln Met Glu Gly Glu Lys Val Phe Tyr Lys Lys Ile Asn
195 200 205

Asp Asp Glu Phe Phe Met Ile Gly Asp Asn Arg Asp Asn Ser Ser Asp
210 215 220

Ser Arg Phe Trp Gly Ser Val Ala Tyr Lys Asn Ile Val Gly Ser Pro
225 230 235 240

Trp Phe Val Tyr Phe Ser Leu Ser Leu Lys Asn Ser Leu Glu Met Asp
245 250 255

Ala Glu Asn Asn Pro Lys Lys Arg Tyr Leu Val Arg Trp Glu Arg Met
260 265 270

Phe Lys Ser Val Gly Gly Leu Glu Lys Ile Ile Lys Lys Glu Asn Ala
275 280 285

Thr His
290

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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TTT GCC ATT GGA GGC GAT GAG GTG TTG TTC ACT AAT GAG GGT TTT TAT 44 5 

Phe Ala Ile Gly Gly Asp Glu Val Leu Phe Thr Asn Glu Gly Phe Tyr
110 115 120

TTG CAC CCT TTT GAG AGC GAC ACG GAC AAA AAT TAC ATC GCT AAA CAT 493
Leu His Pro Phe Glu Ser Asp Thr Asp Lys Asn Tyr Ile Ala Lys His
125 130 135 140

TAC CCT AAC GCC ATG ACA AAA GAA TTT ATG GGT AAA ATT TTT GTT TTA 541
Tyr Pro Asn Ala Met Thr Lys Glu Phe Met Gly Lys Ile Phe Val Leu
145 150 155

AAC CCT TAT AAA AAT GAG CAT CCG GGT ATC CAT TAC CAA AAA GAC AAT 589
Asn Pro Tyr Lys Asn Glu His Pro Gly Ile His Tyr Gln Lys Asp Asn
160 165 170

GAA ACC TTC CAC TTA ATG GAG CAA TTA GCC ACT CAA GGC GCA GAA GCT 637
Glu Thr Phe His Leu Met Glu Gln Leu Ala Thr Gln Gly Ala Glu Ala
175 180 185

AAT ATC AGC ATG CAA CTC ATT CAA ATG GAG GGC GAA AAG GTG TTT TAT 685
Asn Ile Ser Met Gln Leu Ile Gln Met Glu Gly Glu Lys Val Phe Tyr
190 195 200

AAG AAA ATC AAT GAC GAT GAA TTT TTC ATG ATC GGC GAC AAC AGA GAC 733
Lys Lys Ile Asn Asp Asp Glu Phe Phe Met Ile Gly Asp Asn Arg Asp
205 210 215 220

AAT TCT AGC GAC TCG CGC TTT TGG GGG AGT GTG GCT TAT AAA AAC ATC 781
Asn Ser Ser Asp Ser Arg Phe Trp Gly Ser Val Ala Tyr Lys Asn Ile
225 230 235

GTG GGT TCG CCA TGG TTT GTT TAT TTC AGT TTG AGT TTA AAA AAT AGC 829
Val Gly Ser Pro Trp Phe Val Tyr Phe Ser Leu Ser Leu Lys Asn Ser
240 245 250

CTA GAA ATG GAT GCA GAA AAT AAC CCT AAA AAA CGC TAT CTG GTG CGT 877
Leu Glu Met Asp Ala Glu Asn Asn Pro Lys Lys Arg Tyr Leu Val Arg
255 260 265

TGG GAA CGC ATG TTT AAA AGC GTT GGA GGC TTA GAA AAA ATC ATT AAA 925
Trp Glu Arg Met Phe Lys Ser Val Gly Gly Leu Glu Lys Ile Ile Lys
270 275 280

AAA GAA AAC GCA ACG CAT TAAGGTTTTT TGTGCAATTT TTTGATTTCT CTTTAGAA 981
Lys Glu Asn Ala Thr His
285 290

AGTTTTATTA CCAC 995

(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 amino acids 

(B) TYPE: amino acid 
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* 



420 425 430 

Gin Glu Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys Ala Leu Glu 

435 440 445 

Ala Gly Lys Lys 
450 

(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 74 ... 943 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

GGCCTATGAC GATTGTCTCG CTTTTAGAAA ACACTCTAAT CGCTTTTGAA AAACAACAAA 60 
GGAAGGGATT TTA ATG AAA TTT TTA CGC TCT GTT TAT GCA TTT TGC TCC 109 
Met Lys Phe Leu Arg Ser Val Tyr Ala Phe Cys Ser 
1 5 10

AGT TGG GTA GGG ACG ATT GTT ATT GTG CTG TTG GTT ATC TTT TTT ATC 157
Ser Trp Val Gly Thr Ile Val Ile Val Leu Leu Val Ile Phe Phe Ile
15 20 25

GCG CAA GCC TTT ATC ATT CCC TCT CGC TCT ATG GTT GGC ACG CTC TAT 205
Ala Gln Ala Phe Ile Ile Pro Ser Arg Ser Met Val Gly Thr Leu Tyr
30 35 40

GAG GGC GAC ATG CTC TTT GTC AAA AAG TTT TCT TAC GGC ATA CCC ATT 253
Glu Gly Asp Met Leu Phe Val Lys Lys Phe Ser Tyr Gly Ile Pro Ile
45 50 55 60

CCT AAA ATC CCA TGG ATT GAG CTT CCT GTT ATG CCT GAT TTT AAA AAT 301
Pro Lys Ile Pro Trp Ile Glu Leu Pro Val Met Pro Asp Phe Lys Asn
65 70 75

AAC GGA CAT TTG ATA GAG GGG GAT CGC CCT AAG CGT GGC GAA GTG GTG 349
Asn Gly His Leu Ile Glu Gly Asp Arg Pro Lys Arg Gly Glu Val Val
80 85 90

GTG TTT ATC CCT CCC CAT GAA AAA AAG TCT TAC TAT GTT AAA AGG AAT 397
Val Phe Ile Pro Pro His Glu Lys Lys Ser Tyr Tyr Val Lys Arg Asn
95 100 105
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 68: 



Met Leu Arg Leu Leu Ile Gly Leu Leu Leu Met Ser Phe Ile Ser Leu
1 5 10 15

Gln Ser Ala Ser Trp Gln Glu Pro Leu Arg Val Ser Ile Glu Phe Val
20 25 30

Asp Leu Pro Lys Lys Ile Ile Arg Phe Pro Ala His Asp Leu Gln Val
35 40 45

Gly Glu Phe Gly Phe Val Val Thr Lys Leu Ser Asp Tyr Glu Ile Val
50 55 60

Asn Ser Glu Val Val Ile Ile Ala Val Glu Asn Gly Val Ala Thr Ala
65 70 75 80

Lys Phe Arg Ala Phe Glu Ser Met Lys Gln Arg His Leu Pro Thr Pro
85 90 95

Arg Met Val Ala Arg Lys Gly Asp Leu Val Tyr Phe Arg Gln Phe Asn
100 105 110

Asn Gln Ala Phe Leu Ile Ala Pro Asn Asp Glu Leu Tyr Glu Gln Ile
115 120 125

Arg Ala Thr Asn Thr Asp Ile Asn Phe Ile Ser Ser Asp Leu Leu Val
130 135 140

Thr Phe Leu Asn Gly Phe Asp Pro Lys Ile Ala Asn Leu Arg Lys Ala
145 150 155 160

Cys Asn Val Tyr Ser Val Gly Val Ile Tyr Ile Val Thr Thr Asn Thr
165 170 175

Leu Asn Ile Leu Ser Cys Glu Ser Phe Glu Ile Leu Glu Lys Arg Glu
180 185 190

Leu Asp Thr Ser Gly Val Thr Lys Thr Ser Thr Pro Phe Phe Ser Arg
195 200 205

Val Glu Gly Ile Asp Ala Gly Thr Leu Gly Lys Leu Phe Ser Gly Ser
210 215 220

Gln Ser Lys Asn Tyr Phe Ala Tyr Tyr Asp Ala Leu Val Lys Lys Glu
225 230 235 240

Lys Arg Lys Glu Val Arg Ile Lys Lys Arg Glu Glu Lys Ile Asp Ser
245 250 255

Arg Glu Ile Lys Arg Glu Ile Lys Gln Glu Ala Ile Lys Glu Pro Lys
260 265 270

Lys Ala Asn Gln Gly Thr Gln Asn Ala Pro Thr Leu Glu Glu Lys Asn
275 280 285

Tyr Gln Lys Ala Glu Arg Lys Leu Asp Ala Lys Glu Glu Arg Arg Tyr
290 295 300

Leu Arg Asp Glu Arg Lys Lys Ala Lys Ala Thr Lys Lys Ala Met Glu
305 310 315 320

Phe Glu Glu Arg Glu Lys Glu His Asp Glu Arg Asp Glu Gln Glu Thr
325 330 335

Glu Gly Arg Arg Lys Ala Leu Glu Met Asp Lys Gly Asp Lys Lys Glu
340 345 350

Glu Arg Val Lys Pro Lys Glu Asn Glu Arg Glu Ile Lys Gln Glu Ala
355 360 365

Ile Lys Glu Pro Ser Asp Gly Asn Asn Ala Thr Gln Gln Gly Glu Lys
370 375 380

Gln Asn Ala Pro Lys Glu Asn Asn Ala Gln Lys Glu Glu Asn Lys Pro
385 390 395 400

Asn Ser Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys Lys Lys Ala
405 410 415

Lys Ala Glu Gln Arg Ala Arg Glu Phe Glu Gln Arg Ala Arg Glu His
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AAA GCA GAG CGC AAA CTT GAT GCT AAA GAA GAA AGG CGT TAT TTG AGA 968 
Lys Ala Glu Arg Lys Leu Asp Ala Lys Glu Glu Arg Arg Tyr Leu Arg
295 300 305

GAT GAA AGG AAA AAA GCC AAA GCC ACC AAA AAG GCT ATG GAA TTT GAA 1016
Asp Glu Arg Lys Lys Ala Lys Ala Thr Lys Lys Ala Met Glu Phe Glu
310 315 320

GAA AGA GAA AAA GAG CAT GAT GAA AGG GAC GAA CAA GAG ACT GAA GGA 1064
Glu Arg Glu Lys Glu His Asp Glu Arg Asp Glu Gln Glu Thr Glu Gly
325 330 335

AGA AGA AAA GCT TTA GAA ATG GAT AAA GGC GAT AAA AAA GAA GAA AGA 1112
Arg Arg Lys Ala Leu Glu Met Asp Lys Gly Asp Lys Lys Glu Glu Arg
340 345 350

GTC AAA CCC AAA GAA AAT GAG CGA GAA ATC AAG CAA GAA GCC ATT AAA 1160
Val Lys Pro Lys Glu Asn Glu Arg Glu Ile Lys Gln Glu Ala Ile Lys
355 360 365 370

GAG CCA AGT GAT GGA AAT AAC GCC ACC CAA CAA GGC GAA AAA CAA AAC 1208
Glu Pro Ser Asp Gly Asn Asn Ala Thr Gln Gln Gly Glu Lys Gln Asn
375 380 385

GCT CCT AAA GAG AAC AAC GCT CAA AAA GAA GAG AAT AAA CCA AAT TCT 1256
Ala Pro Lys Glu Asn Asn Ala Gln Lys Glu Glu Asn Lys Pro Asn Ser
390 395 400

AAA GAA GAA AAA CGC CGC TTG AAA GAA GAA AAG AAA AAA GCC AAA GCC 1304
Lys Glu Glu Lys Arg Arg Leu Lys Glu Glu Lys Lys Lys Ala Lys Ala
405 410 415

GAA CAA AGA GCG AGA GAA TTT GAA CAA AGA GCG AGA GAG CAT CAA GAA 1352
Glu Gln Arg Ala Arg Glu Phe Glu Gln Arg Ala Arg Glu His Gln Glu
420 425 430

AGA GAT GAA AAA GAG CTT GAA GAG CGA AGA AAG GCG CTA GAA GCG GGT 1400
Arg Asp Glu Lys Glu Leu Glu Glu Arg Arg Lys Ala Leu Glu Ala Gly
435 440 445 450

AAA AAA TAACATGTTA GACCAACAAC ACATCCAATA CTTTAAAAAC CTAGTAGGGG GA 1458
Lys Lys

G 1459

(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
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GAA GTG GTC ATT ATT GCC GTT GAA AAT GGC GTC GCA ACG GCT AAA TTC 296 
Glu Val Val Ile Ile Ala Val Glu Asn Gly Val Ala Thr Ala Lys Phe
70 75 80

AGA GCG TTT GAG TCT ATG AAA CAA AGG CAT TTA CCC ACT CCA AGA ATG 344
Arg Ala Phe Glu Ser Met Lys Gln Arg His Leu Pro Thr Pro Arg Met
85 90 95

GTC GCT AGA AAG GGT GAT TTA GTC TAT TTT AGG CAA TTC AAC AAC CAA 392
Val Ala Arg Lys Gly Asp Leu Val Tyr Phe Arg Gln Phe Asn Asn Gln
100 105 110

GCG TTT TTA ATC GCT CCT AAT GAT GAA CTC TAT GAG CAA ATC AGA GCG 440
Ala Phe Leu Ile Ala Pro Asn Asp Glu Leu Tyr Glu Gln Ile Arg Ala
115 120 125 130

ACT AAC ACC GAT ATT AAT TTT ATT AGT TCT GAT TTG TTG GTT ACT TTT 488
Thr Asn Thr Asp Ile Asn Phe Ile Ser Ser Asp Leu Leu Val Thr Phe
135 140 145

TTG AAT GGG TTT GAC CCA AAA ATC GCT AAT TTA AGG AAA GCG TGC AAC 536
Leu Asn Gly Phe Asp Pro Lys Ile Ala Asn Leu Arg Lys Ala Cys Asn
150 155 160

GTT TAT AGC GTG GGG GTG ATT TAT ATT GTA ACC ACC AAC ACG CTC AAT 584
Val Tyr Ser Val Gly Val Ile Tyr Ile Val Thr Thr Asn Thr Leu Asn
165 170 175

ATT TTA AGT TGT GAG AGT TTT GAA ATT TTA GAA AAA AGA GAG CTG GAT 632
Ile Leu Ser Cys Glu Ser Phe Glu Ile Leu Glu Lys Arg Glu Leu Asp
180 185 190

ACA AGC GGC GTT ACT AAA ACT TCC ACG CCG TTT TTT TCT AGG GTT GAG 680
Thr Ser Gly Val Thr Lys Thr Ser Thr Pro Phe Phe Ser Arg Val Glu
195 200 205 210

GGT ATT GAT GCA GGC ACG CTA GGG AAA CTT TTT TCA GGC AGT CAG TCT 728
Gly Ile Asp Ala Gly Thr Leu Gly Lys Leu Phe Ser Gly Ser Gln Ser
215 220 225

AAA AAT TAC TTC GCT TAC TAT GAC GCT TTA GTG AAG AAA GAA AAA CGC 776
Lys Asn Tyr Phe Ala Tyr Tyr Asp Ala Leu Val Lys Lys Glu Lys Arg
230 235 240

AAA GAA GTG AGG ATT AAA AAG AGG GAA GAA AAG ATT GAT TCT AGA GAA 824
Lys Glu Val Arg Ile Lys Lys Arg Glu Glu Lys Ile Asp Ser Arg Glu
245 250 255

ATT AAA CGA GAA ATC AAG CAA GAG GCC ATT AAA GAG CCT AAA AAA GCC 872
Ile Lys Arg Glu Ile Lys Gln Glu Ala Ile Lys Glu Pro Lys Lys Ala
260 265 270

AAT CAA GGC ACA CAA AAC GCT CCT ACT TTA GAA GAG AAA AAC TAC CAA 920
Asn Gln Gly Thr Gln Asn Ala Pro Thr Leu Glu Glu Lys Asn Tyr Gln
275 280 285 290
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Leu 


Glu Asn Leu Lys Leu Thr Pro Ser Leu Lys Ser Lys Ile Val Tyr
145 150 155 160

Gly Ala Ser Ile Ser Gln Ala His Gln Phe Val Ala Thr Lys Asn Ala
165 170 175

Gln Ile Gly Phe Gly Ala Leu Ser Leu Met Asp Lys Lys Asp Lys Asn
180 185 190

Leu Ser Tyr Phe Ile Ile Asp Lys Ala Leu Tyr Asn Pro Ile Glu Gln
195 200 205

Ala Leu Ile Ile Thr Lys Asn Gly Ala Asn Asn Pro Leu Ala Lys Val
210 215 220

Phe Lys Asp Phe Leu Phe Ser Pro Lys Ala Arg Ala Ile Phe Lys Glu
225 230 235 240

Tyr Gly Tyr Ile Val Asp
245



(2) INFORMATION FOR SEQ ID NO:267:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1459 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1406 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

AGCCATTTTA TGGCATTTAA AAAAGTTTAA AAATGTTTAA AAGGAATTTT ATG TTA 56 

Met Leu 
1 

AGG CTT TTG ATA GGA CTT CTT CTA ATG AGT TTT ATA AGC TTG CAA TCA 104 
Arg Leu Leu Ile Gly Leu Leu Leu Met Ser Phe Ile Ser Leu Gln Ser
5 10 15

GCC TCT TGG CAA GAA CCC TTA AGA GTG AGT ATA GAA TTT GTG GAT TTG 152
Ala Ser Trp Gln Glu Pro Leu Arg Val Ser Ile Glu Phe Val Asp Leu
20 25 30

CCT AAA AAA ATC ATT CGT TTT CCG GCT CAT GAT TTG CAA GTG GGG GAG 200
Pro Lys Lys Ile Ile Arg Phe Pro Ala His Asp Leu Gln Val Gly Glu
35 40 45 50

TTT GGT TTT GTC GTT ACT AAA CTT TCA GAT TAT GAA ATC GTT AAT TCT 248
Phe Gly Phe Val Val Thr Lys Leu Ser Asp Tyr Glu Ile Val Asn Ser
55 60 65
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170 175 



GGC TTT GGA GCG TTA TCC TTG ATG GAT AAA AAA GAT AAA AAC CTC TCT 632 
Gly Phe Gly Ala Leu Ser Leu Met Asp Lys Lys Asp Lys Asn Leu Ser 
180 185 190 



TAT TTC ATC ATT GAT AAA GCC CTT TAT AAC CCT ATT GAA CAA GCC TTG 680
Tyr Phe Ile Ile Asp Lys Ala Leu Tyr Asn Pro Ile Glu Gln Ala Leu
195 200 205 210

ATT ATC ACT AAA AAT GGG GCT AAC AAC CCT TTA GCC AAA GTC TTT AAA 728
Ile Ile Thr Lys Asn Gly Ala Asn Asn Pro Leu Ala Lys Val Phe Lys
215 220 225

GAT TTT TTA TTC AGC CCT AAA GCC AGA GCT ATT TTT AAA GAA TAC GGC 776
Asp Phe Leu Phe Ser Pro Lys Ala Arg Ala Ile Phe Lys Glu Tyr Gly
230 235 240

TAT ATT GTG GAT TAAAACGCAT AAAAAAGGCG AGCAATGGAT CATGAGTTTT TGATT 833
Tyr Ile Val Asp
245 



ACCATGCG 841 
(2) INFORMATION FOR SEQ ID NO:266: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:266: 



Met Lys Asn Thr Phe Lys Ala Phe Ala Phe Leu Ile Val Phe Phe Ser
1 5 10 15

Ser Ala Leu Leu Ala Gln Asp Leu Lys Ile Ala Ala Ala Ala Asn Leu
20 25 30

Thr Arg Ala Leu Lys Ala Leu Val Lys Glu Phe Gln Lys Glu His Pro
35 40 45

Lys Asp Thr Val Asn Ile Ser Phe Asn Ser Ser Gly Lys Leu Tyr Ala
50 55 60

Gln Ile Ile Gln Asn Ala Pro Phe Asp Leu Phe Ile Ser Ala Asp Met
65 70 75 80

Ile Arg Pro Lys Lys Leu Tyr Asp Lys Lys Ile Thr Pro Phe Lys Glu
85 90 95

Glu Val Tyr Ala Lys Gly Val Leu Val Leu Trp Ser Glu Asp Leu Lys
100 105 110

Met Asp Ser Leu Glu Ile Leu Lys Asn Pro Lys Ile Lys Arg Ile Ala
115 120 125

Met Ala Asn Pro Lys Leu Ala Pro Tyr Gly Lys Ala Ser Met Glu Val
130 135 140






WO 98/43478



PCT/US98/06371 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 788 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265: 

TACAAATAGG TATAATCACC AATTCCAATC ATTTAATCAA AGGGAGTTCT ATG AAA 56 

Met Lys 
1 

AAT ACT TTC AAA GCG TTT GCC TTT TTA ATT GTA TTT TTT TCA AGC GCT 104 
Asn Thr Phe Lys Ala Phe Ala Phe Leu Ile Val Phe Phe Ser Ser Ala
5 10 15

TTA TTA GCG CAG GAT TTA AAA ATC GCT GCT GCT GCT AAT CTT ACA CGC 152
Leu Leu Ala Gln Asp Leu Lys Ile Ala Ala Ala Ala Asn Leu Thr Arg
20 25 30

GCT TTA AAA GCC CTT GTT AAA GAA TTT CAA AAA GAA CAC CCC AAA GAC 200
Ala Leu Lys Ala Leu Val Lys Glu Phe Gln Lys Glu His Pro Lys Asp
35 40 45 50

ACT GTT AAT ATT AGC TTT AAT TCT TCA GGC AAA CTC TAC GCT CAA ATC 248
Thr Val Asn Ile Ser Phe Asn Ser Ser Gly Lys Leu Tyr Ala Gln Ile
55 60 65

ATT CAA AAC GCC CCT TTT GAT TTA TTC ATT TCA GCA GAT ATG ATT AGA 296
Ile Gln Asn Ala Pro Phe Asp Leu Phe Ile Ser Ala Asp Met Ile Arg
70 75 80

CCT AAA AAG CTT TAT GAT AAA AAA ATA ACC CCT TTT AAA GAA GAA GTC 344
Pro Lys Lys Leu Tyr Asp Lys Lys Ile Thr Pro Phe Lys Glu Glu Val
85 90 95

TAT GCT AAA GGC GTG TTG GTT TTA TGG AGT GAA GAT CTA AAA ATG GAT 392
Tyr Ala Lys Gly Val Leu Val Leu Trp Ser Glu Asp Leu Lys Met Asp
100 105 110

TCT TTA GAA ATT CTT AAA AAT CCT AAA ATC AAG CGT ATC GCT ATG GCT 440
Ser Leu Glu Ile Leu Lys Asn Pro Lys Ile Lys Arg Ile Ala Met Ala
115 120 125 130

AAT CCT AAA CTA GCC CCT TAT GGA AAA GCC AGC ATG GAA GTC TTA GAG 488
Asn Pro Lys Leu Ala Pro Tyr Gly Lys Ala Ser Met Glu Val Leu Glu
135 140 145

AAT TTA AAA CTC ACT CCC AGT CTT AAA TCT AAA ATC GTT TAT GGC GCT 536
Asn Leu Lys Leu Thr Pro Ser Leu Lys Ser Lys Ile Val Tyr Gly Ala
150 155 160

TCT ATT TCT CAA GCC CAT CAA TTT GTC GCT ACT AAA AAC GCT CAA ATA 584
Ser Ile Ser Gln Ala His Gln Phe Val Ala Thr Lys Asn Ala Gln Ile
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 



Met Lys Lys Gly Ser Leu Ala Ile Val Leu Gly Ser Leu Leu Ala Ser
1 5 10 15

Gly Ala Phe Tyr Thr Ala Leu Ala Asp Gly Met Pro Ala Lys Gln Gln
20 25 30

His Asn Asn Thr Gly Glu Ser Val Glu Leu His Phe His Tyr Pro Ile
35 40 45

Lys Gly Lys Gln Glu Pro Lys Asn Ser His Leu Val Val Leu Ile Glu
50 55 60

Pro Lys Ile Glu Ile Asn Lys Val Ile Pro Glu Ser Tyr Gln Lys Glu
65 70 75 80

Phe Glu Lys Ser Leu Phe Leu Gln Leu Ser Ser Phe Leu Glu Arg Lys
85 90 95

Gly Tyr Ser Val Ser Gln Phe Lys Asp Ala Ser Glu Ile Pro Gln Asp
100 105 110

Ile Lys Glu Lys Ala Leu Leu Val Leu Arg Met Asp Gly Asn Val Ala
115 120 125

Ile Leu Glu Asp Ile Val Glu Glu Ser Asp Ala Leu Ser Glu Glu Lys
130 135 140

Val Ile Asp Met Ser Ser Gly Tyr Leu Asn Leu Asn Phe Val Glu Pro
145 150 155 160

Lys Ser Glu Asp Ile Ile His Ser Phe Gly Ile Asp Val Ser Lys Ile
165 170 175

Lys Ala Val Ile Glu Arg Val Glu Leu Arg Arg Thr Asn Ser Gly Gly
180 185 190

Phe Val Pro Lys Thr Phe Val His Arg Ile Lys Glu Thr Asp His Asp
195 200 205

Gln Ala Ile Arg Lys Ile Met Asn Gln Ala Tyr His Lys Val Met Val
210 215 220

His Ile Thr Lys Glu Leu Ser Lys Lys His Met Glu His Tyr Glu Lys
225 230 235 240

Val Ser Ser Glu Met Lys Lys Arg Lys
245



(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
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AAG CAA GAG CCT AAA AAC AGC CAT TTA GTC GTT TTG ATC GAA CCT AAA 24 8 

Lys Gln Glu Pro Lys Asn Ser His Leu Val Val Leu Ile Glu Pro Lys
55 60 65

ATA GAG ATC AAT AAA GTT ATC CCT GAA AGT TAT CAA AAA GAG TTT GAG 296
Ile Glu Ile Asn Lys Val Ile Pro Glu Ser Tyr Gln Lys Glu Phe Glu
70 75 80

AAG TCT TTG TTT CTC CAG TTG AGT AGT TTT TTA GAG AGA AAA GGC TAT 344
Lys Ser Leu Phe Leu Gln Leu Ser Ser Phe Leu Glu Arg Lys Gly Tyr
85 90 95

AGC GTT TCG CAA TTT AAA GAT GCT AGC GAA ATC CCT CAA GAC ATC AAA 392
Ser Val Ser Gln Phe Lys Asp Ala Ser Glu Ile Pro Gln Asp Ile Lys
100 105 110

GAA AAA GCG TTG CTC GTT TTA CGC ATG GAT GGG AAT GTG GCT ATC TTG 440
Glu Lys Ala Leu Leu Val Leu Arg Met Asp Gly Asn Val Ala Ile Leu
115 120 125 130

GAA GAT ATT GTA GAA GAG AGC GAT GCG CTT AGC GAA GAA AAA GTG ATA 488
Glu Asp Ile Val Glu Glu Ser Asp Ala Leu Ser Glu Glu Lys Val Ile
135 140 145

GAC ATG TCT TCA GGG TAT TTG AAC TTG AAT TTT GTT GAG CCA AAA AGT 536
Asp Met Ser Ser Gly Tyr Leu Asn Leu Asn Phe Val Glu Pro Lys Ser
150 155 160

GAA GAT ATT ATC CAT AGT TTT GGT ATT GAT GTT TCA AAG ATT AAG GCT 584
Glu Asp Ile Ile His Ser Phe Gly Ile Asp Val Ser Lys Ile Lys Ala
165 170 175

GTG ATT GAA AGA GTG GAA TTG CGG CGC ACC AAT TCT GGA GGT TTT GTC 632
Val Ile Glu Arg Val Glu Leu Arg Arg Thr Asn Ser Gly Gly Phe Val
180 185 190

CCC AAA ACT TTT GTG CAT AGG ATT AAG GAA ACC GAT CAT GAT CAA GCC 680
Pro Lys Thr Phe Val His Arg Ile Lys Glu Thr Asp His Asp Gln Ala
195 200 205 210

ATT AGA AAA ATC ATG AAT CAA GCC TAT CAC AAA GTG ATG GTG CAT ATT 728
Ile Arg Lys Ile Met Asn Gln Ala Tyr His Lys Val Met Val His Ile
215 220 225

ACC AAA GAG TTA AGC AAA AAA CAC ATG GAA CAT TAT GAA AAA GTT TCT 776
Thr Lys Glu Leu Ser Lys Lys His Met Glu His Tyr Glu Lys Val Ser
230 235 240

AGT GAA ATG AAA AAA CGA AAG TAGTTTTTAA GAAACGAAAA GCTTAAAAAT CATT 831
Ser Glu Met Lys Lys Arg Lys
245

GAGAGCTATT TTTAAAAAA 850

(2) INFORMATION FOR SEQ ID NO: 264:
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Gly Asn Ser Thr Pro Pro Arg Leu Thr Tyr Asn Gly Leu Asp Glu Arg 

100 105 110 

Lys Lys Lys Glu Ala Glu Tyr Leu Asp Asp Lys Asn Asn Tyr Asn Phe 

115 120 125 

Thr Lys Ser Ser Asn Asn Thr Asn Phe Lys Gly Gly Gly Ser Gln Lys
130 135 140

Lys Ser Glu Asp Leu Glu Ile Val Leu Ser Ala Arg Ile Ile Lys Val
145 150 155 160

Leu Glu Asn Gly Asn Tyr Phe Ile Tyr Gly Asn Lys Glu Val Leu Val
165 170 175

Asp Gly Glu Lys Gln Ile Leu Lys Val Ser Gly Val Ile Arg Pro Tyr
180 185 190

Asp Ile Glu Arg Asn Asn Thr Ile Gln Ser Lys Phe Leu Ala Asp Ala
195 200 205

Lys Ile Glu Tyr Thr Asn Leu Gly His Leu Ser Asp Ser Asn Lys Lys
210 215 220

Lys Phe Ala Ala Asp Ala Met Glu Thr Gln Met Pro Tyr
225 230 235

(2) INFORMATION FOR SEQ ID NO:263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 850 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 797
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263: 
TTGGGTAAGA TTAGGAATTG ATTTTAAAGA AAAAGAAAGA AAGGAATTTA ATG AAA 56 



Met Lys 
1 



AAA GGT AGT TTG GCA ATC GTT TTA GGA TCG CTA TTA GCG AGT GGG GCG 104
Lys Gly Ser Leu Ala Ile Val Leu Gly Ser Leu Leu Ala Ser Gly Ala
5 10 15

TTT TAT ACG GCT CTA GCT GAT GGA ATG CCT GCA AAA CAG CAG CAC AAT 152
Phe Tyr Thr Ala Leu Ala Asp Gly Met Pro Ala Lys Gln Gln His Asn
20 25 30

AAT ACG GGC GAG TCA GTG GAG TTG CAT TTC CAC TAT CCT ATT AAA GGC 200
Asn Thr Gly Glu Ser Val Glu Leu His Phe His Tyr Pro Ile Lys Gly
35 40 45 50
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4 



TCC AGC AAT AAC ACG AAT TTT AAA GGC GGT GGC TCG CAA AAA AAG AGC 488 
Ser Ser Asn Asn Thr Asn Phe Lys Gly Gly Gly Ser Gln Lys Lys Ser
135 140 145

GAA GAT TTA GAG ATT GTG TTG AGC GCT CGA ATC ATT AAG GTG CTA GAA 536
Glu Asp Leu Glu Ile Val Leu Ser Ala Arg Ile Ile Lys Val Leu Glu
150 155 160

AAC GGG AAT TAT TTC ATC TAT GGG AAT AAG GAA GTG CTA GTG GAT GGG 584
Asn Gly Asn Tyr Phe Ile Tyr Gly Asn Lys Glu Val Leu Val Asp Gly
165 170 175

GAA AAG CAA ATC CTT AAG GTG AGT GGG GTG ATC CGC CCT TAT GAT ATT 632
Glu Lys Gln Ile Leu Lys Val Ser Gly Val Ile Arg Pro Tyr Asp Ile
180 185 190

GAA AGG AAT AAC ACC ATC CAA TCC AAG TTT TTA GCC GAC GCT AAG ATT 680
Glu Arg Asn Asn Thr Ile Gln Ser Lys Phe Leu Ala Asp Ala Lys Ile
195 200 205 210

GAA TAC ACG AAT TTA GGG CAT TTG AGC GAT TCC AAT AAG AAG AAA TTC 728
Glu Tyr Thr Asn Leu Gly His Leu Ser Asp Ser Asn Lys Lys Lys Phe
215 220 225

GCT GCT GAT GCG ATG GAA ACC CAA ATG CCT TAT TAAAAAGAGC AAAGCCTAGC 781
Ala Ala Asp Ala Met Glu Thr Gln Met Pro Tyr
230 235

ATGAGAGCGA TCGCTATTGT TTTAGCCAGA AGT 814

(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

Met Lys Lys Ala Leu Tyr Leu Gly Ala Val Ala Phe Ser Val Ala Phe
1 5 10 15

Ser Met Ala Ser Ala Asn Glu Pro Lys Ile Asp Phe Asn Pro Pro Asn
20 25 30

Tyr Val Glu Glu Thr Pro Ser Lys Glu Phe Ile Pro Glu Leu Asn Lys
35 40 45

Leu Gly Ser Leu Phe Gly Gln Gly Glu Arg Pro Leu Phe Ala Asp Arg
50 55 60

Arg Ala Met Lys Pro Asn Asp Leu Ile Thr Ile Ile Val Ser Glu Lys
65 70 75 80

Ala Ser Ala Asn Tyr Ser Ser Ser Lys Asp Tyr Lys Ser Ala Ser Gly
85 90 95
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(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 761 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

TATAAGAGAG TATAATTCAA GGCTTAAAAT AACTCAAGTA AGGCTAGTGG ATG AAA 56 

Met Lys 
1 

AAA GCG CTT TAT TTA GGG GCT GTT GCG TTT AGC GTT GCA TTC AGC ATG 104 
Lys Ala Leu Tyr Leu Gly Ala Val Ala Phe Ser Val Ala Phe Ser Met 
5 10 15 

GCA TCA GCC AAT GAG CCA AAA ATT GAT TTT AAC CCT CCC AAT TAT GTA 152 
Ala Ser Ala Asn Glu Pro Lys Ile Asp Phe Asn Pro Pro Asn Tyr Val 
20 25 30 

GAA GAA ACC CCC TCT AAA GAA TTT ATC CCT GAA TTG AAC AAG TTA GGG 200 
Glu Glu Thr Pro Ser Lys Glu Phe Ile Pro Glu Leu Asn Lys Leu Gly 
35 40 45 50 

AGT TTG TTT GGG CAG GGT GAG CGC CCC TTG TTT GCG GAC AGG AGG GCG 248 
Ser Leu Phe Gly Gln Gly Glu Arg Pro Leu Phe Ala Asp Arg Arg Ala 
55 60 65 

ATG AAG CCT AAC GAT TTG ATC ACA ATC ATT GTT TCT GAA AAA GCG AGC 296 
Met Lys Pro Asn Asp Leu Ile Thr Ile Ile Val Ser Glu Lys Ala Ser 
70 75 80 

GCG AAT TAT TCC AGC TCT AAA GAT TAT AAA AGC GCT TCA GGG GGT AAT 344 
Ala Asn Tyr Ser Ser Ser Lys Asp Tyr Lys Ser Ala Ser Gly Gly Asn 
85 90 95 

TCC ACG CCC CCA AGA CTC ACT TAT AAC GGG CTA GAT GAA AGA AAG AAA 392 
Ser Thr Pro Pro Arg Leu Thr Tyr Asn Gly Leu Asp Glu Arg Lys Lys 
100 105 110 

AAA GAA GCG GAG TAT TTA GAC GAT AAG AAT AAT TAC AAT TTC ACC AAA 440 
Lys Glu Ala Glu Tyr Leu Asp Asp Lys Asn Asn Tyr Asn Phe Thr Lys 
115 120 125 130 



WO 98/43478 PCT/US98/06371 



GAT GAT AAG ATC GTG TTT TTA GGC TCA GCG AAT TGG AGC AAA AAC GCT 488 
Asp Asp Lys Ile Val Phe Leu Gly Ser Ala Asn Trp Ser Lys Asn Ala 
135 140 145 

TTT GAA AAC AAT TAT GAA GTG CTT TTA AAA ACC GAT GAC ACA GAA ACG 536 
Phe Glu Asn Asn Tyr Glu Val Leu Leu Lys Thr Asp Asp Thr Glu Thr 
150 155 160 

ATC CTC AAA GCC AAG AGC TAT TAC CAA AAG ATG TTA GGG AGT TGC GTT 584 
Ile Leu Lys Ala Lys Ser Tyr Tyr Gln Lys Met Leu Gly Ser Cys Val 
165 170 175 

GGG TTT TAAAAGCCCT TTAGAAGTGG TAATTATACC CCACATAAAA GGCAAAGACC CT 642 
Gly Phe 
180 



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 



Met Leu Asn Lys Phe Lys Lys Ile Val Gly Val Ser Val Leu Val Gly 
1 5 10 15 

Cys Leu Gly Val Leu Gln Ala Lys Asn Ser Leu Phe Val Leu Pro Tyr 
20 25 30 

Glu Gln Lys Asp Ala Leu Asn Ser Leu Val Ser Gly Ile Ser Asn Ala 
35 40 45 

Arg Glu Ser Val Lys Ile Ala Ile Tyr Ser Phe Thr His Arg Asp Ile 
50 55 60 

Ala Arg Ala Ile Lys Ser Val Ala Ser Arg Gly Ile Lys Val Gln Ile 
65 70 75 80 

Ile Tyr Asp Tyr Glu Ser Asn His His Asn Lys Gln Ser Thr Ile Gly 
85 90 95 

Tyr Leu Asp Lys Tyr Pro Asn Thr Lys Val Cys Leu Leu Lys Gly Leu 
100 105 110 

Lys Ala Lys Asn Gly Asn Tyr Tyr Gly Ile Met His Gln Lys Val Ala 
115 120 125 

Ile Ile Asp Asp Lys Ile Val Phe Leu Gly Ser Ala Asn Trp Ser Lys 
130 135 140 

Asn Ala Phe Glu Asn Asn Tyr Glu Val Leu Leu Lys Thr Asp Asp Thr 
145 150 155 160 

Glu Thr Ile Leu Lys Ala Lys Ser Tyr Tyr Gln Lys Met Leu Gly Ser 
165 170 175 

Cys Val Gly Phe 
180 
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(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 590 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

TTGGTTGGTT GTTTTTATCA TAGAGTGTAA TTTAAAATAA GGATCATTTG ATG TTA 56 

Met Leu 
1 

AAC AAG TTT AAA AAA ATC GTT GGC GTT AGT GTG TTA GTG GGC TGT TTA 104 
Asn Lys Phe Lys Lys Ile Val Gly Val Ser Val Leu Val Gly Cys Leu 
5 10 15 

GGG GTT TTG CAA GCT AAA AAC AGC TTA TTT GTC TTA CCT TAT GAG CAA 152 
Gly Val Leu Gln Ala Lys Asn Ser Leu Phe Val Leu Pro Tyr Glu Gln 
20 25 30 

AAA GAC GCT CTC AAT TCT TTA GTT TCT GGC ATT AGT AAC GCC AGA GAG 200 
Lys Asp Ala Leu Asn Ser Leu Val Ser Gly Ile Ser Asn Ala Arg Glu 
35 40 45 50 

AGC GTG AAA ATC GCT ATC TAT AGT TTC ACG CAC AGA GAT ATT GCA AGA 248 
Ser Val Lys Ile Ala Ile Tyr Ser Phe Thr His Arg Asp Ile Ala Arg 
55 60 65 

GCG ATT AAA AGC GTA GCG AGT AGG GGG ATT AAG GTG CAA ATC ATT TAT 296 
Ala Ile Lys Ser Val Ala Ser Arg Gly Ile Lys Val Gln Ile Ile Tyr 
70 75 80 

GAT TAT GAA AGC AAT CAT CAT AAC AAG CAA TCC ACT ATT GGC TAT CTG 344 
Asp Tyr Glu Ser Asn His His Asn Lys Gln Ser Thr Ile Gly Tyr Leu 
85 90 95 

GAC AAA TAC CCT AAC ACG AAA GTG TGC TTA TTG AAA GGG CTT AAG GCT 392 
Asp Lys Tyr Pro Asn Thr Lys Val Cys Leu Leu Lys Gly Leu Lys Ala 
100 105 110 

AAA AAC GGG AAT TAT TAC GGC ATC ATG CAC CAA AAA GTA GCG ATC ATT 440 
Lys Asn Gly Asn Tyr Tyr Gly Ile Met His Gln Lys Val Ala Ile Ile 
115 120 125 130 
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Val Asn 
1 

TTT TTG AAA AAG CCA AAG TAT TAT AAA TTC ATA GAG GGG GCG AAT TAT 104 
Phe Leu Lys Lys Pro Lys Tyr Tyr Lys Phe Ile Glu Gly Ala Asn Tyr 
5 10 15 

TTG AGC TTG GGG CTT TCT ATG GTG GTA GCG ATC CTT ATG GGC GTG GCT 152 
Leu Ser Leu Gly Leu Ser Met Val Val Ala Ile Leu Met Gly Val Ala 
20 25 30 

ATA GGC TAT GGG CTT AAA AAA CTC ACT CAT ATT TCG TGG CTT TTT TGG 200 
Ile Gly Tyr Gly Leu Lys Lys Leu Thr His Ile Ser Trp Leu Phe Trp 
35 40 45 50 

CTT GGG GTT ATT TGG GGC GTC TTA GCG AGC TTT CTC AAT GTC TAT AAA 248 
Leu Gly Val Ile Trp Gly Val Leu Ala Ser Phe Leu Asn Val Tyr Lys 
55 60 65 

GCT TAT AAA AAC ATG CAA AAA GAC TAT GAA GAA CTA GCC AAA GAC CCT 296 
Ala Tyr Lys Asn Met Gln Lys Asp Tyr Glu Glu Leu Ala Lys Asp Pro 
70 75 80 

AAA TAC ACA CAA AAT AAA ACA AAA TAAATCCAAT CAAATCCCAT GTGCCAAATC 350 
Lys Tyr Thr Gln Asn Lys Thr Lys 
85 90 

CAATGCTTGC TTATTTTACT TTC 373 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 

Val Asn Phe Leu Lys Lys Pro Lys Tyr Tyr Lys Phe Ile Glu Gly Ala 
1 5 10 15 

Asn Tyr Leu Ser Leu Gly Leu Ser Met Val Val Ala Ile Leu Met Gly 
20 25 30 

Val Ala Ile Gly Tyr Gly Leu Lys Lys Leu Thr His Ile Ser Trp Leu 
35 40 45 

Phe Trp Leu Gly Val Ile Trp Gly Val Leu Ala Ser Phe Leu Asn Val 
50 55 60 

Tyr Lys Ala Tyr Lys Asn Met Gln Lys Asp Tyr Glu Glu Leu Ala Lys 
65 70 75 80 

Asp Pro Lys Tyr Thr Gln Asn Lys Thr Lys 
85 90 
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Arg Glu Pro Asp Lys Cys Thr Lys Ser Phe Lys Lys Ser Ala Leu Met 
65 70 75 80 

Ser Tyr Asp Leu Ala Leu Gly Tyr Leu Val Ser Lys Asn Lys Gln Tyr 
85 90 95 

Gly Leu Lys Ala Ile Glu Ile Leu Asn Ala Trp Ala Lys Glu Leu Gln 
100 105 110 

Ser Val Asp Thr Tyr Gln Ser Glu Asp Asn Ile Asn Phe Tyr Met Pro 
115 120 125 

Tyr Met Asn Met Ala Tyr Trp Phe Val Lys Lys Ala Phe Pro Ser Pro 
130 135 140 

Glu Tyr Glu Asp Phe Ile Lys Arg Met Arg Gln Tyr Ser Gln Ser Ala 
145 150 155 160 

Leu Asn Thr Asn His Gly Ala Trp Gly Ile Leu Phe Asp Val Ser Ser 
165 170 175 

Ala Leu Ala Leu Asp Asp Asn Ala Leu Leu His Asn Ser Ala Asn Arg 
180 185 190 

Trp Gln Glu Trp Val Phe Lys Ala Ile Asp Glu Asn Gly Val Ile Xaa 
195 200 205 

Ser Ala Ile Thr Arg Ser Asp Thr Ser Asp Tyr His Gly Gly Pro Thr 
210 215 220 

Lys Gly Ile Lys Gly Ile Ala Tyr Thr Asn Phe Ala Leu Leu Ala Leu 
225 230 235 240 

Thr Ile Ser Gly Glu Leu Leu Phe Glu Asn Gly Tyr Asp Leu Trp Gly 
245 250 255 

Ser Gly Ala Gly Lys Arg Leu Ser Val Ala Tyr Asn Lys Val Ala Thr 
260 265 270 

Trp Ile Leu Asn Pro Glu Thr Phe Pro Tyr Phe Gln Pro Asn Leu Ile 
275 280 285 

Gly Val His Asn Asn Ala Tyr Phe Ile Ile Leu Ala Lys His Tyr Ser 
290 295 300 

Ser Pro Ser Ala Asn Glu Leu Leu Lys Gln Gly Asp Leu His Glu Asp 
305 310 315 320 

Gly Phe Arg Leu Lys Leu Arg Ser Pro 
325 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 320 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 
TTAACGATCG CAAAAGCCGA TGAAAGTTTT GATGAAATCA TAAAAGGTGT GTG AAT 56 
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ATC ACT AGG AGC GAT ACG AGC GAT TAT CAT GGC GGC CCT ACA AAG GGC 728 
Ile Thr Arg Ser Asp Thr Ser Asp Tyr His Gly Gly Pro Thr Lys Gly 
215 220 225 

ATT AAG GGG ATA GCT TAT ACC AAT TTC GCG CTT CTT GCG CTA ACC ATA 776 
Ile Lys Gly Ile Ala Tyr Thr Asn Phe Ala Leu Leu Ala Leu Thr Ile 
230 235 240 

TCA GGC GAA TTG CTT TTT GAG AAC GGG TAT GAT TTG TGG GGT AGT GGA 824 
Ser Gly Glu Leu Leu Phe Glu Asn Gly Tyr Asp Leu Trp Gly Ser Gly 
245 250 255 

GCT GGG AAA AGG CTC TCT GTG GCG TAT AAC AAA GTT GCA ACA TGG ATT 872 
Ala Gly Lys Arg Leu Ser Val Ala Tyr Asn Lys Val Ala Thr Trp Ile 
260 265 270 

TTA AAC CCT GAA ACT TTC CCT TAT TTC CAG CCT AAC CTT ATC GGG GTG 920 
Leu Asn Pro Glu Thr Phe Pro Tyr Phe Gln Pro Asn Leu Ile Gly Val 
275 280 285 290 

CAT AAC AAC GCC TAT TTC ATT ATT TTA GCC AAG CAT TAT TCT AGC CCT 968 
His Asn Asn Ala Tyr Phe Ile Ile Leu Ala Lys His Tyr Ser Ser Pro 
295 300 305 

AGT GCA AAT GAG CTT TTA AAG CAA GGC GAT TTA CAC GAA GAT GGT TTC 1016 
Ser Ala Asn Glu Leu Leu Lys Gln Gly Asp Leu His Glu Asp Gly Phe 
310 315 320 

AGG CTG AAA CTC CGA TCG CCA TGAATTTTTC TGTATCCAAG GTTAGCCTTA AGGA 1071 
Arg Leu Lys Leu Arg Ser Pro 
325 

TGGCCATGCG CTTTAACCT 1090 

(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 

Met Lys Arg Phe Val Leu Phe Leu Leu Phe Met Cys Val Cys Val Gln 
1 5 10 15 

Ala Tyr Ala Glu Gln Asp Tyr Phe Phe Arg Asp Phe Lys Ser Arg Asp 
20 25 30 

Leu Pro Gln Lys Leu His Leu Asp Lys Lys Leu Ser Gln Thr Ile Gln 
35 40 45 

Pro Cys Met Gln Leu Asn Ala Ser Lys His Tyr Thr Ser Thr Gly Val 
50 55 60 
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TTACCCTAAA ACGCTATTTT TAAAATAATC CATTAAAATA AAGGCGAGGA ATG AAA 56 

Met Lys 
1 

AGA TTT GTT TTG TTT TTA TTG TTC ATG TGC GTT TGC GTT CAA GCT TAC 104 
Arg Phe Val Leu Phe Leu Leu Phe Met Cys Val Cys Val Gln Ala Tyr 
5 10 15 

GCC GAG CAA GAT TAC TTT TTT AGG GAT TTT AAA TCT AGA GAT TTG CCC 152 
Ala Glu Gln Asp Tyr Phe Phe Arg Asp Phe Lys Ser Arg Asp Leu Pro 
20 25 30 

CAA AAA CTC CAT CTT GAT AAA AAG CTC TCC CAA ACA ATA CAG CCA TGC 200 
Gln Lys Leu His Leu Asp Lys Lys Leu Ser Gln Thr Ile Gln Pro Cys 
35 40 45 50 

ATG CAA CTT AAC GCA TCA AAA CAC TAC ACT TCT ACC GGG GTT AGA GAG 248 
Met Gln Leu Asn Ala Ser Lys His Tyr Thr Ser Thr Gly Val Arg Glu 
55 60 65 

CCT GAT AAA TGC ACA AAG AGT TTT AAA AAA TCC GCT CTC ATG TCC TAT 296 
Pro Asp Lys Cys Thr Lys Ser Phe Lys Lys Ser Ala Leu Met Ser Tyr 
70 75 80 

GAC TTA GCG CTA GGT TAT TTG GTG AGT AAG AAT AAG CAA TAC GGC TTA 344 
Asp Leu Ala Leu Gly Tyr Leu Val Ser Lys Asn Lys Gln Tyr Gly Leu 
85 90 95 

AAG GCT ATA GAA ATT TTA AAC GCT TGG GCT AAA GAG CTT CAA AGC GTG 392 
Lys Ala Ile Glu Ile Leu Asn Ala Trp Ala Lys Glu Leu Gln Ser Val 
100 105 110 

GAT ACT TAT CAG AGC GAG GAT AAT ATC AAT TTT TAC ATG CCT TAT ATG 440 
Asp Thr Tyr Gln Ser Glu Asp Asn Ile Asn Phe Tyr Met Pro Tyr Met 
115 120 125 130 

AAC ATG GCT TAT TGG TTT GTC AAA AAG GCG TTT CCT AGC CCA GAA TAT 488 
Asn Met Ala Tyr Trp Phe Val Lys Lys Ala Phe Pro Ser Pro Glu Tyr 
135 140 145 

GAA GAT TTC ATT AAG CGG ATG CGC CAG TAT TCT CAA TCA GCT CTT AAC 536 
Glu Asp Phe Ile Lys Arg Met Arg Gln Tyr Ser Gln Ser Ala Leu Asn 
150 155 160 

ACT AAC CAT GGG GCG TGG GGC ATT CTT TTT GAT GTG AGT TCT GCG CTA 584 
Thr Asn His Gly Ala Trp Gly Ile Leu Phe Asp Val Ser Ser Ala Leu 
165 170 175 

GCG TTA GAC GAT AAT GCC CTT TTG CAC AAT AGC GCT AAT CGG TGG CAG 632 
Ala Leu Asp Asp Asn Ala Leu Leu His Asn Ser Ala Asn Arg Trp Gln 
180 185 190 

GAG TGG GTG TTT AAA GCC ATA GAT GAG AAT GGG GTT ATT GNT AGC GCG 680 
Glu Trp Val Phe Lys Ala Ile Asp Glu Asn Gly Val Ile Xaa Ser Ala 
195 200 205 210 
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His Asp Glu Asp Ser Leu Glu Lys Leu Ile Thr Ile Tyr Phe Leu Gln 
165 170 175 

Asn Arg Lys Lys Glu Gly Leu Asp Leu Leu Gln Ser His Ile Asp Arg 
180 185 190 

Tyr Gly Cys Ser Glu Gln Leu Cys Gln Lys Ala Leu Asn Thr Phe Thr 
195 200 205 

Gln Phe Asn Glu Leu Asp Leu Ala Lys Thr Thr Phe Ala Arg Leu Tyr 
210 215 220 

Glu Lys Asn Pro Ile Val Gln Asn Ala Gln Phe Tyr Ile Gly Val Leu 
225 230 235 240 

Ile Leu Leu Lys Glu Phe Asp Lys Ala Gln Lys Ile Ala Glu Leu Phe 
245 250 255 

Pro Phe Asp Arg Arg Leu Leu Leu Asp Leu Tyr Thr Ala Gln Lys Lys 
260 265 270 

Phe Asp Gln Ala Ser Lys Gln Ala Ser Leu Ile Tyr Gln Glu Lys Lys 
275 280 285 

Asp Pro Lys Phe Leu Gly Leu Glu Ala Ile Tyr His Tyr Glu Ser Leu 
290 295 300 

Ser Ala Asn Lys Lys Lys Leu Thr Lys Glu Glu Met Leu Pro Ile Ile 
305 310 315 320 

Gln Lys Leu Glu Gln Ala Thr Lys Glu Arg Gln Ala Trp Leu Ala Lys 
325 330 335 

Thr Lys Asp Lys Glu Asp Ala Gln Asp Ala Phe Phe Tyr Asn Phe Leu 
340 345 350 

Gly Tyr Ser Leu Ile Asp Tyr Asp Met Asp Ile Lys Arg Gly Met Asp 
355 360 365 

Phe Val Arg Lys Ala Leu Ala Leu Asp Ser Gly Ser Val Leu Tyr Leu 
370 375 380 

Asp Ser Leu Ala Trp Gly Tyr Tyr Lys Leu Gly Asn Cys Leu Glu Ala 
385 390 395 400 

Lys Lys Ile Phe Ser Ser Ile Ala Lys Glu Ser Ile Gln Ala Glu Pro 
405 410 415 

Glu Leu Lys Glu His Asn Lys Ile Ile Gln Glu Cys Lys Lys 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1090 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...1037 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 
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TCC TTA ATA GAT TAT GAC ATG GAT ATT AAA AGG GGC ATG GAT TTT GTG 1160 
Ser Leu Ile Asp Tyr Asp Met Asp Ile Lys Arg Gly Met Asp Phe Val 
355 360 365 370 

AGG AAA GCC TTA GCG TTG GAT TCT GGA TCA GTG CTT TAT TTG GAT TCT 1208 
Arg Lys Ala Leu Ala Leu Asp Ser Gly Ser Val Leu Tyr Leu Asp Ser 
375 380 385 

TTA GCA TGG GGT TAT TAC AAA TTA GGG AAT TGT TTG GAA GCT AAA AAA 1256 
Leu Ala Trp Gly Tyr Tyr Lys Leu Gly Asn Cys Leu Glu Ala Lys Lys 
390 395 400 

ATC TTT TCT AGC ATC GCT AAA GAG TCT ATC CAA GCC GAA CCT GAA TTG 1304 
Ile Phe Ser Ser Ile Ala Lys Glu Ser Ile Gln Ala Glu Pro Glu Leu 
405 410 415 

AAA GAA CAC AAT AAA ATC ATT CAA GAA TGC AAG AAA TAGGGATTTT AGAAAA 1356 
Lys Glu His Asn Lys Ile Ile Gln Glu Cys Lys Lys 
420 425 430 

TTTACAAAAA AGCTTAGCCT TAAAAGAGGG CATGCTT 1393 

(2) INFORMATION FOR SEQ ID NO:254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:254: 

Met Asn Ile Gln Ile Lys Lys Arg Phe Leu Ala Asn Leu Leu Leu Phe 
1 5 10 15 

Ser Leu Phe Cys Leu Lys Ala Glu Thr Leu Ser Glu Asp His Gln Ile 
20 25 30 

Leu Leu Ser Ser Asp Ala Phe His Arg Gly Asp Phe Ala Ala Ala Gln 
35 40 45 

Lys Gly Tyr Met Asn Leu Tyr Lys Gln Thr Asn Lys Val Val Tyr Ala 
50 55 60 

Lys Glu Ala Ala Ile Ser Ala Ala Ser Leu Gly Asp Ile Lys Thr Ala 
65 70 75 80 

Met His Leu Ala Met Leu Tyr Gln Lys Ile Thr Asn Asn Arg Asn Asp 
85 90 95 

Val Ser Ile Asn Lys Ile Leu Val Asp Gly Tyr Ala Gln Met Gly Gln 
100 105 110 

Ile Asp Lys Ala Ile Glu Leu Leu His Lys Ile Arg Lys Glu Glu Lys 
115 120 125 

Thr Ile Ala Thr Asp Asn Val Leu Gly Thr Leu Tyr Leu Thr Gln Lys 
130 135 140 

Arg Leu Asp Lys Ala Phe Pro Leu Leu Asn Lys Phe Tyr Asn Gln Val 
145 150 155 160 
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GCC ACA GAC AAT GTG TTA GGG ACT TTG TAT TTG ACT CAA AAG CGT TTG 488 
Ala Thr Asp Asn Val Leu Gly Thr Leu Tyr Leu Thr Gln Lys Arg Leu 
135 140 145 

GAT AAG GCT TTC CCA TTG TTG AAT AAG TTT TAT AAC CAA GTG CAT GAT 536 
Asp Lys Ala Phe Pro Leu Leu Asn Lys Phe Tyr Asn Gln Val His Asp 
150 155 160 

GAA GAC AGC CTA GAA AAA CTC ATT ACG ATC TAT TTT TTG CAA AAT CGT 584 
Glu Asp Ser Leu Glu Lys Leu Ile Thr Ile Tyr Phe Leu Gln Asn Arg 
165 170 175 

AAA AAA GAG GGC TTG GAT TTG TTG CAA TCT CAT ATA GAC AGG TAT GGT 632 
Lys Lys Glu Gly Leu Asp Leu Leu Gln Ser His Ile Asp Arg Tyr Gly 
180 185 190 

TGC TCA GAG CAA TTG TGC CAA AAA GCG CTC AAC ACT TTC ACG CAA TTT 680 
Cys Ser Glu Gln Leu Cys Gln Lys Ala Leu Asn Thr Phe Thr Gln Phe 
195 200 205 210 

AAC GAG CTT GAT TTG GCT AAA ACG ACT TTT GCT CGT TTG TAT GAA AAA 728 
Asn Glu Leu Asp Leu Ala Lys Thr Thr Phe Ala Arg Leu Tyr Glu Lys 
215 220 225 

AAC CCT ATT GTT CAA AAT GCT CAG TTT TAC ATA GGG GTA TTA ATC TTG 776 
Asn Pro Ile Val Gln Asn Ala Gln Phe Tyr Ile Gly Val Leu Ile Leu 
230 235 240 

TTA AAA GAG TTT GAT AAG GCC CAG AAA ATC GCA GAA TTA TTC CCT TTT 824 
Leu Lys Glu Phe Asp Lys Ala Gln Lys Ile Ala Glu Leu Phe Pro Phe 
245 250 255 

GAC AGG CGT TTG TTG TTA GAC TTA TAC ACC GCA CAA AAA AAA TTC GAT 872 
Asp Arg Arg Leu Leu Leu Asp Leu Tyr Thr Ala Gln Lys Lys Phe Asp 
260 265 270 

CAA GCT TCC AAA CAA GCT TCT TTG ATC TAT CAA GAA AAA AAA GAC CCT 920 
Gln Ala Ser Lys Gln Ala Ser Leu Ile Tyr Gln Glu Lys Lys Asp Pro 
275 280 285 290 

AAA TTC TTA GGA TTA GAG GCC ATT TAT CAT TAT GAA AGC TTG AGT GCG 968 
Lys Phe Leu Gly Leu Glu Ala Ile Tyr His Tyr Glu Ser Leu Ser Ala 
295 300 305 

AAT AAG AAA AAG CTC ACC AAA GAA GAG ATG TTG CCT ATC ATT CAA AAA 1016 
Asn Lys Lys Lys Leu Thr Lys Glu Glu Met Leu Pro Ile Ile Gln Lys 
310 315 320 

TTA GAG CAA GCC ACC AAA GAG CGC CAA GCA TGG CTC GCT AAA ACC AAA 1064 
Leu Glu Gln Ala Thr Lys Glu Arg Gln Ala Trp Leu Ala Lys Thr Lys 
325 330 335 

GAT AAA GAA GAC GCG CAA GAC GCT TTC TTT TAT AAT TTT TTA GGG TAT 1112 
Asp Lys Glu Asp Ala Gln Asp Ala Phe Phe Tyr Asn Phe Leu Gly Tyr 
340 345 350 
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(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1340 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

CTAAAGTGCG CTAAAATTCA CTTCAGTGAT ACAAAAAAGG AAATAAAATA ATG AAT 56 

Met Asn 
1 

ATT CAA ATA AAG AAA AGG TTT TTA GCA AAT TTG TTG CTT TTT AGC CTG 104 
Ile Gln Ile Lys Lys Arg Phe Leu Ala Asn Leu Leu Leu Phe Ser Leu 
5 10 15 

TTT TGC CTT AAG GCT GAA ACC CTT TCA GAA GAT CAT CAA ATC CTG TTG 152 
Phe Cys Leu Lys Ala Glu Thr Leu Ser Glu Asp His Gln Ile Leu Leu 
20 25 30 

AGT TCA GAC GCT TTC CAT AGA GGG GAT TTT GCT GCC GCT CAA AAA GGC 200 
Ser Ser Asp Ala Phe His Arg Gly Asp Phe Ala Ala Ala Gln Lys Gly 
35 40 45 50 

TAT ATG AAT CTC TAT AAG CAA ACC AAT AAG GTG GTG TAT GCT AAA GAA 248 
Tyr Met Asn Leu Tyr Lys Gln Thr Asn Lys Val Val Tyr Ala Lys Glu 
55 60 65 

GCG GCC ATT TCA GCG GCG AGC TTA GGG GAT ATT AAA ACC GCT ATG CAT 296 
Ala Ala Ile Ser Ala Ala Ser Leu Gly Asp Ile Lys Thr Ala Met His 
70 75 80 

TTA GCC ATG CTC TAT CAA AAA ATC ACC AAT AAT CGT AAT GAT GTT TCT 344 
Leu Ala Met Leu Tyr Gln Lys Ile Thr Asn Asn Arg Asn Asp Val Ser 
85 90 95 

ATC AAT AAG ATT TTA GTG GAT GGC TAT GCG CAA ATG GGG CAG ATT GAT 392 
Ile Asn Lys Ile Leu Val Asp Gly Tyr Ala Gln Met Gly Gln Ile Asp 
100 105 110 

AAG GCG ATT GAA TTG CTG CAC AAA ATC CGT AAA GAA GAA AAG ACC ATA 440 
Lys Ala Ile Glu Leu Leu His Lys Ile Arg Lys Glu Glu Lys Thr Ile 
115 120 125 130 
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CAA AAA CCC CCA AAA GAA AAT AAA GAA ATC CCA AAA ACC ACA GCA CCC 632 
Gln Lys Pro Pro Lys Glu Asn Lys Glu Ile Pro Lys Thr Thr Ala Pro 
180 185 190 

ACG CCC CCT ATA GCA CCC ACG CCT TTA AAA GAG CCG ATG ATA AAA GAT 680 
Thr Pro Pro Ile Ala Pro Thr Pro Leu Lys Glu Pro Met Ile Lys Asp 
195 200 205 210 

CCT AAC ACC AAA GAG CCT GCA GGG GTA TGATGTTCAT TGTAGCGGTT TTGATGC 734 
Pro Asn Thr Lys Glu Pro Ala Gly Val 
215 

TGGCGTTTTT AATCTTTGTC CATGAA 760 

(2) INFORMATION FOR SEQ ID NO:252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 

Met Arg Lys Ile Leu Leu Leu Gly Leu Ile Leu Gln Ala Leu Phe Ser 
1 5 10 15 

Glu Glu Ala Ala Gln Glu Leu Leu Gln Cys Ser Ala Ile Phe Glu Ser 
20 25 30 

Lys Lys Ala Glu Leu Lys Asp Asp Leu Arg Arg Leu Ser Glu Lys Glu 
35 40 45 

Gln Ser Leu Arg Ile Leu Gln Thr Glu Asn Ala Arg Leu Leu Asp Glu 
50 55 60 

Lys Thr Asp Leu Leu Asn Gln Lys Glu Lys Glu Val Glu Glu Lys Leu 
65 70 75 80 

Lys Asn Leu Ala Ala Lys Glu Glu Ala Phe Lys Thr Leu Gln Thr Glu 
85 90 95 

Glu Lys Lys Arg Leu Lys Asn Leu Ile Glu Glu Asn Glu Gly Ile Leu 
100 105 110 

Arg Glu Ile Lys Gln Ala Lys Asp Ser Lys Ile Gly Glu Thr Tyr Ser 
115 120 125 

Lys Met Lys Asp Ser Lys Ser Ala Leu Ile Leu Glu Asn Leu Pro Thr 
130 135 140 

Gln Asn Ala Leu Glu Ile Leu Met Ala Leu Lys Pro Gln Glu Leu Gly 
145 150 155 160 

Lys Ile Leu Ala Lys Met Asp Pro Lys Lys Ala Ala Ala Leu Thr Glu 
165 170 175 

Leu Trp Gln Lys Pro Pro Lys Glu Asn Lys Glu Ile Pro Lys Thr Thr 
180 185 190 

Ala Pro Thr Pro Pro Ile Ala Pro Thr Pro Leu Lys Glu Pro Met Ile 
195 200 205 

Lys Asp Pro Asn Thr Lys Glu Pro Ala Gly Val 
210 215 
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(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

TCTTTTAGAC GAAAACGCCA TGATTTTACA CTGGCAAAAA GAGGGCTTGC ATG CGT 56 

Met Arg 
1 

AAA ATC TTG TTA TTG GGT CTG ATT TTA CAA GCG CTC TTC AGC GAA GAA 104 
Lys Ile Leu Leu Leu Gly Leu Ile Leu Gln Ala Leu Phe Ser Glu Glu 
5 10 15 

GCC GCG CAA GAA TTG TTG CAA TGC TCT GCG ATT TTT GAA TCT AAA AAA 152 
Ala Ala Gln Glu Leu Leu Gln Cys Ser Ala Ile Phe Glu Ser Lys Lys 
20 25 30 

GCC GAA TTG AAA GAC GAT TTG CGC CGA TTG AGT GAA AAA GAG CAG TCT 200 
Ala Glu Leu Lys Asp Asp Leu Arg Arg Leu Ser Glu Lys Glu Gln Ser 
35 40 45 50 

TTA AGG ATC TTG CAA ACC GAA AAC GCC CGC CTT TTA GAT GAA AAA ACC 248 
Leu Arg Ile Leu Gln Thr Glu Asn Ala Arg Leu Leu Asp Glu Lys Thr 
55 60 65 

GAT CTG TTG AAC CAA AAA GAA AAA GAA GTG GAA GAA AAA CTG AAA AAT 296 
Asp Leu Leu Asn Gln Lys Glu Lys Glu Val Glu Glu Lys Leu Lys Asn 
70 75 80 

TTA GCC GCT AAA GAA GAA GCC TTT AAA ACC TTA CAA ACG GAA GAA AAA 344 
Leu Ala Ala Lys Glu Glu Ala Phe Lys Thr Leu Gln Thr Glu Glu Lys 
85 90 95 

AAA CGC CTT AAA AAT TTG ATA GAA GAA AAC GAA GGC ATT TTA AGA GAA 392 
Lys Arg Leu Lys Asn Leu Ile Glu Glu Asn Glu Gly Ile Leu Arg Glu 
100 105 110 

ATC AAG CAG GCT AAA GAC AGC AAG ATT GGC GAG ACT TAT TCT AAA ATG 440 
Ile Lys Gln Ala Lys Asp Ser Lys Ile Gly Glu Thr Tyr Ser Lys Met 
115 120 125 130 

AAA GAT TCT AAA TCG GCT CTG ATT TTA GAA AAT TTA CCC ACT CAA AAC 488 
Lys Asp Ser Lys Ser Ala Leu Ile Leu Glu Asn Leu Pro Thr Gln Asn 
135 140 145 

GCA TTA GAA ATT TTA ATG GCG CTA AAA CCC CAA GAA CTC GGT AAA ATT 536 
Ala Leu Glu Ile Leu Met Ala Leu Lys Pro Gln Glu Leu Gly Lys Ile 
150 155 160 

TTA GCC AAA ATG GAT CCT AAA AAA GCG GCG GCT TTG ACA GAG TTG TGG 584 
Leu Ala Lys Met Asp Pro Lys Lys Ala Ala Ala Leu Thr Glu Leu Trp 
165 170 175 






(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:250: 

Met Ile Leu Arg Ala Ser Val Leu Ser Ala Leu Leu Leu Val Gly Leu 
1 5 10 15 

Gly Ala Ala Pro Lys His Ser Val Ser Ala Asn Asp Lys Arg Met Gln 
20 25 30 

Asp Asn Leu Val Ser Val Ile Glu Lys Gln Thr Asn Lys Lys Val Arg 
35 40 45 

Ile Leu Glu Ile Lys Pro Leu Lys Ser Ser Gln Asp Leu Lys Met Val 
50 55 60 

Val Ile Glu Asp Pro Asp Thr Lys Tyr Asn Ile Pro Leu Val Val Ser 
65 70 75 80 

Lys Asp Gly Asn Leu Ile Ile Gly Leu Ser Asn Ile Phe Phe Ser Asn 
85 90 95 

Lys Ser Asp Asp Val Gln Leu Val Ala Glu Thr Asn Gln Lys Val Gln 
100 105 110 

Ala Leu Asn Ala Thr Gln Gln Asn Ser Ala Lys Leu Asn Ala Ile Phe 
115 120 125 

Asn Glu Ile Pro Ala Asp Tyr Ala Ile Glu Leu Pro Ser Thr Asn Ala 
130 135 140 

Ala Asn Lys Asp Lys Ile Leu Tyr Ile Val Ser Asp Pro Met Cys Pro 
145 150 155 160 

His Cys Gln Lys Glu Leu Thr Lys Leu Arg Asp His Leu Lys Glu Asn 
165 170 175 

Thr Val Arg Met Val Val Val Gly Trp Leu Gly Val Asn Ser Ala Lys 
180 185 190 

Lys Ala Ala Leu Ile Gln Glu Glu Met Ala Lys Ala Arg Ala Arg Gly 
195 200 205 

Ala Ser Val Glu Asp Lys Ile Ser Ile Leu Glu Lys Ile Tyr Ser Thr 
210 215 220 

Gln Tyr Asp Ile Asn Ala Gln Lys Glu Pro Glu Asp Leu Arg Thr Lys 
225 230 235 240 

Val Glu Asn Thr Thr Lys Lys Ile Phe Glu Ser Gly Val Ile Lys Gly 
245 250 255 

Val Pro Phe Leu Tyr His Tyr Lys Ala 
260 265 

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 707 
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Gly Asn Leu Ile Ile Gly Leu Ser Asn Ile Phe Phe Ser Asn Lys Ser 
85 90 95 

GAT GAT GTG CAA TTA GTT GCA GAA ACC AAT CAA AAA GTT CAA GCT CTT 392 
Asp Asp Val Gln Leu Val Ala Glu Thr Asn Gln Lys Val Gln Ala Leu 
100 105 110 

AAC GCC ACC CAA CAA AAT AGC GCG AAA TTG AAC GCT ATT TTT AAT GAA 440 
Asn Ala Thr Gln Gln Asn Ser Ala Lys Leu Asn Ala Ile Phe Asn Glu 
115 120 125 130 

ATA CCG GCT GAT TAT GCG ATA GAG TTG CCC TCT ACT AAC GCT GCA AAT 488 
Ile Pro Ala Asp Tyr Ala Ile Glu Leu Pro Ser Thr Asn Ala Ala Asn 
135 140 145 

AAG GAT AAA ATC CTT TAT ATT GTC TCT GAT CCC ATG TGC CCA CAT TGC 536 
Lys Asp Lys Ile Leu Tyr Ile Val Ser Asp Pro Met Cys Pro His Cys 
150 155 160 

CAA AAA GAG CTC ACT AAA CTT AGG GAT CAT TTA AAA GAA AAC ACC GTG 584 
Gln Lys Glu Leu Thr Lys Leu Arg Asp His Leu Lys Glu Asn Thr Val 
165 170 175 

AGA ATG GTC GTG GTG GGG TGG CTT GGG GTC AAT TCA GCT AAA AAA GCG 632 
Arg Met Val Val Val Gly Trp Leu Gly Val Asn Ser Ala Lys Lys Ala 
180 185 190 

GCT TTA ATC CAA GAA GAA ATG GCG AAA GCT AGG GCT AGG GGA GCG AGC 680 
Ala Leu Ile Gln Glu Glu Met Ala Lys Ala Arg Ala Arg Gly Ala Ser 
195 200 205 210 

GTG GAA GAT AAG ATC TCT ATT CTT GAA AAG ATT TAT TCC ACC CAA TAC 728 
Val Glu Asp Lys Ile Ser Ile Leu Glu Lys Ile Tyr Ser Thr Gln Tyr 
215 220 225 

GAT ATT AAC GCT CAA AAA GAG CCT GAA GAT TTA CGC ACT AAA GTG GAA 776 
Asp Ile Asn Ala Gln Lys Glu Pro Glu Asp Leu Arg Thr Lys Val Glu 
230 235 240 

AAT ACC ACT AAA AAG ATT TTT GAA TCT GGC GTG ATT AAG GGT GTG CCT 824 
Asn Thr Thr Lys Lys Ile Phe Glu Ser Gly Val Ile Lys Gly Val Pro 
245 250 255 

TTC TTA TAC CAT TAT AAG GCA TGATATAAGG TTGCTCTCAT GAAAAAACCC TATA 879 
Phe Leu Tyr His Tyr Lys Ala 
260 265 

GGAAGATTTC TGATTATGC 898 

(2) INFORMATION FOR SEQ ID NO:250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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290 295 300 

Phe Asn Asp Gly Pro Lys Glu Leu Gly Gly Leu Arg Tyr Cys Ile Asn 
305 310 315 320 

Ser Ala Ala Leu Arg Phe Ile Pro Leu Lys Asp Met Glu Lys Glu Gly 
325 330 335 

Tyr Gly Glu Phe Ile Pro Tyr Ile Lys Lys Gly Glu Leu Lys Lys Tyr 
340 345 350 

Ile Asn Asp Lys Lys Ser His 
355 

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 845 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

TTATTTCTTA TTATTGTAAG GATTTAGGCT ATTGAACTTT AGGAGTTTTA ATG ATA 56 

Met Ile 
1 

TTA AGA GCG AGT GTG TTG AGC GCG TTA CTT CTT GTA GGC TTA GGG GCA 104 
Leu Arg Ala Ser Val Leu Ser Ala Leu Leu Leu Val Gly Leu Gly Ala 
5 10 15 

GCC CCT AAA CAT TCA GTT TCA GCT AAT GAC AAA CGG ATG CAG GAT AAT 152 
Ala Pro Lys His Ser Val Ser Ala Asn Asp Lys Arg Met Gln Asp Asn 
20 25 30 

TTA GTG AGC GTG ATT GAA AAA CAG ACC AAT AAA AAG GTG CGT ATT TTA 200 
Leu Val Ser Val Ile Glu Lys Gln Thr Asn Lys Lys Val Arg Ile Leu 
35 40 45 50 

GAA ATC AAA CCT TTA AAA TCT AGC CAG GAT TTA AAA ATG GTC GTT ATT 248 
Glu Ile Lys Pro Leu Lys Ser Ser Gln Asp Leu Lys Met Val Val Ile 
55 60 65 

GAA GAT CCG GAC ACT AAA TAC AAT ATC CCG CTT GTG GTG AGT AAG GAT 2 96 

Glu Asp Pro Asp Thr Lys Tyr Asn lie Pro Leu Val Val Ser Lys Asp 
70 75 80 

GGT AAT TTA ATC ATA GGG CTT AGC AAC ATA TTC TTT AGC AAT AAA AGC 344 

WO 98/43478 PCT/US98/06371

Asp Lys Lys Ser His
355

GGGGTTTAAG CG 1180 
(2) INFORMATION FOR SEQ ID NO:248:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 359 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:248:

Met Lys Val Leu Ser Tyr Leu Lys Asn Phe Tyr Leu Phe Leu Ala Ile
1 5 10 15

Gly Ala Ile Met Gln Ala Ser Glu Asn Met Gly Ser Gln His Gln Lys
20 25 30

Thr Asp Glu Arg Val Ile Tyr Leu Ala Gly Gly Cys Phe Trp Gly Leu
35 40 45

Glu Ala Tyr Met Glu Arg Ile Tyr Gly Val Ile Asp Ala Ser Ser Gly
50 55 60

Tyr Ala Asn Gly Lys Thr Ser Ser Thr Asn Tyr Glu Lys Leu His Glu
65 70 75 80

Ser Asp His Ala Glu Ser Val Lys Val Ile Tyr Asp Pro Lys Lys Ile
85 90 95

Ser Leu Asp Lys Leu Leu Arg Tyr Tyr Phe Lys Val Val Asp Pro Val
100 105 110

Ser Val Asn Lys Gln Gly Asn Asp Val Gly Arg Gln Tyr Arg Thr Gly
115 120 125

Ile Tyr Tyr Val Asn Ser Ala Asp Lys Glu Val Ile Asp His Ala Leu
130 135 140

Lys Ala Leu Gln Lys Glu Val Lys Gly Lys Ile Ala Ile Glu Val Glu
145 150 155 160

Pro Leu Lys Asn Tyr Val Arg Ala Glu Glu Tyr His Gln Asp Tyr Leu
165 170 175

Lys Lys His Pro Ser Gly Tyr Cys His Ile Asp Leu Lys Lys Ala Asp
180 185 190

Glu Val Ile Val Asp Asp Asp Lys Tyr Thr Lys Pro Ser Asp Glu Val
195 200 205

Leu Lys Lys Lys Leu Thr Lys Leu Gln Tyr Glu Val Thr Gln Asn Lys
210 215 220

His Thr Glu Lys Pro Phe Glu Asn Glu Tyr Tyr Asn Lys Glu Glu Glu
225 230 235 240

Gly Ile Tyr Val Asp Ile Thr Thr Gly Glu Pro Leu Phe Ser Ser Ala
245 250 255

Asp Lys Tyr Asp Ser Gly Cys Gly Trp Pro Ser Phe Ser Lys Pro Ile
260 265 270

Asn Lys Asp Val Val Lys Tyr Glu Asp Asp Glu Ser Leu Asn Arg Lys
275 280 285

Arg Ile Glu Val Leu Ser Arg Ile Gly Lys Ala His Leu Gly His Val
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135 140 145 

TTA CAG AAA GAA GTG AAA GGT AAA ATC GCT ATT GAA GTA GAG CCT TTA 536
Leu Gln Lys Glu Val Lys Gly Lys Ile Ala Ile Glu Val Glu Pro Leu
150 155 160

AAA AAT TAT GTG AGG GCT GAA GAG TAT CAT CAG GAT TAT TTG AAG AAA 584
Lys Asn Tyr Val Arg Ala Glu Glu Tyr His Gln Asp Tyr Leu Lys Lys
165 170 175

CAC CCT ACT GGT TAT TGC CAT ATT GAT TTG AAA AAG GCG GAT GAA GTG 632
His Pro Ser Gly Tyr Cys His Ile Asp Leu Lys Lys Ala Asp Glu Val
180 185 190

ATT GTG GAT GAC GAT AAA TAC ACC AAA CCT AGC GAT GAA GTT TTA AAG 680
Ile Val Asp Asp Asp Lys Tyr Thr Lys Pro Ser Asp Glu Val Leu Lys
195 200 205 210

AAA AAA CTC ACC AAA CTC CAG TAT GAG GTT ACG CAA AAC AAA CAC ACT 728
Lys Lys Leu Thr Lys Leu Gln Tyr Glu Val Thr Gln Asn Lys His Thr
215 220 225

GAG AAA CCC TTT GAA AAC GAG TAT TAC AAC AAA GAA GAA GAG GGC ATT 776
Glu Lys Pro Phe Glu Asn Glu Tyr Tyr Asn Lys Glu Glu Glu Gly Ile
230 235 240

TAT GTG GAT ATT ACC ACA GGC GAG CCG TTA TTT TCT TCA GCG GAT AAA 824
Tyr Val Asp Ile Thr Thr Gly Glu Pro Leu Phe Ser Ser Ala Asp Lys
245 250 255

TAC GAC TCC GGT TGC GGG TGG CCA AGC TTT TCT AAG CCT ATC AAT AAA 872
Tyr Asp Ser Gly Cys Gly Trp Pro Ser Phe Ser Lys Pro Ile Asn Lys
260 265 270

GAT GTG GTG AAA TAC GAA GAC GAT GAG AGC CTT AAT AGG AAA CGC ATT 920
Asp Val Val Lys Tyr Glu Asp Asp Glu Ser Leu Asn Arg Lys Arg Ile
275 280 285 290

GAA GTG TTG AGC CGT ATT GGT AAG GCG CAT TTA GGG CAT GTG TTT AAC 968
Glu Val Leu Ser Arg Ile Gly Lys Ala His Leu Gly His Val Phe Asn
295 300 305

GAT GGG CCT AAA GAA TTA GGG GGC TTA AGG TAT TGC ATC AAC AGC GCG 1016
Asp Gly Pro Lys Glu Leu Gly Gly Leu Arg Tyr Cys Ile Asn Ser Ala
310 315 320

GCT TTA AGG TTT ATC CCC TTA AAA GAC ATG GAA AAA GAG GGT TAT GGC 1064
Ala Leu Arg Phe Ile Pro Leu Lys Asp Met Glu Lys Glu Gly Tyr Gly
325 330 335

GAG TTT ATC CCT TAT ATC AAA AAG GGT GAA TTG AAA AAA TAC ATC AAT 1112
Glu Phe Ile Pro Tyr Ile Lys Lys Gly Glu Leu Lys Lys Tyr Ile Asn
340 345 350

GAT AAA AAG TCG CAT TAAGGGGTAA TGACTAAGCC CCCTAAGGGG GGTTAAAATG A 1168
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(A) LENGTH: 118 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1127
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:247:

TTTTTAGTTA TAATGGCGGA CACCATTAAA ATTAAAACAA AGGTTATTCA ATG AAG 56 

Met Lys 
1 

GTA TTA TCT TAT TTG AAA AAT TTT TAT CTT TTT TTA GCG ATA GGA GCA 104
Val Leu Ser Tyr Leu Lys Asn Phe Tyr Leu Phe Leu Ala Ile Gly Ala
5 10 15

ATT ATG CAA GCG AGT GAA AAC ATG GGA TCT CAA CAC CAA AAA ACC GAT 152
Ile Met Gln Ala Ser Glu Asn Met Gly Ser Gln His Gln Lys Thr Asp
20 25 30

GAA AGA GTG ATT TAC TTG GCT GGG GGG TGT TTT TGG GGG CTA GAG GCG 200
Glu Arg Val Ile Tyr Leu Ala Gly Gly Cys Phe Trp Gly Leu Glu Ala
35 40 45 50

TAT ATG GAG AGG ATT TAT GGC GTC ATA GAC GCA AGC TCT GGT TAC GCT 248
Tyr Met Glu Arg Ile Tyr Gly Val Ile Asp Ala Ser Ser Gly Tyr Ala
55 60 65

AAC GGC AAG ACT TCA AGC ACG AAT TAT GAG AAA TTG CAT GAA AGT GAT 296
Asn Gly Lys Thr Ser Ser Thr Asn Tyr Glu Lys Leu His Glu Ser Asp
70 75 80

CAT GCT GAA AGC GTG AAA GTC ATT TAT GAT CCT AAA AAA ATC AGT TTG 344
His Ala Glu Ser Val Lys Val Ile Tyr Asp Pro Lys Lys Ile Ser Leu
85 90 95

GAC AAA TTG TTG CGT TAC TAT TTT AAG GTG GTT GAT CCG GTG AGC GTG 392
Asp Lys Leu Leu Arg Tyr Tyr Phe Lys Val Val Asp Pro Val Ser Val
100 105 110

AAC AAG CAG GGT AAT GAT GTG GGC AGG CAG TAT CGC ACG GGG ATT TAT 440
Asn Lys Gln Gly Asn Asp Val Gly Arg Gln Tyr Arg Thr Gly Ile Tyr
115 120 125 130

TAT GTC AAT AGC GCG GAT AAA GAA GTG ATA GAT CAT GCC TTA AAA GCG 488
Tyr Val Asn Ser Ala Asp Lys Glu Val Ile Asp His Ala Leu Lys Ala
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AAG GTT GTT TTT TTA TTG TTA GTT ATA CTA GGG GGT TTA GAA GCG CAA 104 
Lys Val Val Phe Leu Leu Leu Val Ile Leu Gly Gly Leu Glu Ala Gln
5 10 15

AGT ACT TAT TGC AGT GAT CAT TGC GAA GGC ACG CCA GAT AGC CGT ATC 152
Ser Thr Tyr Cys Ser Asp His Cys Glu Gly Thr Pro Asp Ser Arg Ile
20 25 30

CCT CCT ATG GGG TTT CAT TTC AGT TTT GTG CAT TCA GTG AAA TAT TAC 200
Pro Pro Met Gly Phe His Phe Ser Phe Val His Ser Val Lys Tyr Tyr
35 40 45 50

TTG CAA GAT CCG CAA GAG CGC GAT CAC AAG CTT GAA AAA TGC CAT CAA 248
Leu Gln Asp Pro Gln Glu Arg Asp His Lys Leu Glu Lys Cys His Gln
55 60 65

GCC TTT GAT TCG ACT CTT AAG GTT AAT TTT ATT ACG AAT CTT TTA AAA 296
Ala Phe Asp Ser Thr Leu Lys Val Asn Phe Ile Thr Asn Leu Leu Lys
70 75 80

AGG ATT GCA AGC ATG CGC AAA TGG CTT TAGAGCAAGC CCAAAAAGGG ACTCCAT 350
Arg Ile Ala Ser Met Arg Lys Trp Leu
85 90

AAAAGGGGTT TCTTTAGGGA TTTTAT 376

(2) INFORMATION FOR SEQ ID NO:246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:246:

Met Lys Lys Val Val Phe Leu Leu Leu Val Ile Leu Gly Gly Leu Glu
1 5 10 15

Ala Gln Ser Thr Tyr Cys Ser Asp His Cys Glu Gly Thr Pro Asp Ser
20 25 30

Arg Ile Pro Pro Met Gly Phe His Phe Ser Phe Val His Ser Val Lys
35 40 45

Tyr Tyr Leu Gln Asp Pro Gln Glu Arg Asp His Lys Leu Glu Lys Cys
50 55 60

His Gln Ala Phe Asp Ser Thr Leu Lys Val Asn Phe Ile Thr Asn Leu
65 70 75 80

Leu Lys Arg Ile Ala Ser Met Arg Lys Trp Leu
85 90

(2) INFORMATION FOR SEQ ID NO:247:

(i) SEQUENCE CHARACTERISTICS: 
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Asn Pro Asn Phe 
Asp Phe Asp Lys Leu Lys Glu Lys Glu Lys Glu Ala
65 70 75 80

Leu Ile Asp Gln Ala Ile Arg Thr Ala Leu Val Glu Asn Glu Ala Lys
85 90 95

Thr Glu Lys Leu Asp Ser Thr Pro Glu Phe Lys Ala Met Met Glu Ala
100 105 110

Val Lys Lys Gln Ala Leu Val Glu Phe Trp Ala Lys Lys Gln Ala Glu
115 120 125

Glu Val Lys Lys Val Gln Ile Pro Glu Lys Glu Met Gln Asp Phe Tyr
130 135 140

Asn Ala Asn Lys Asp Gln Leu Phe Val Lys Gln Glu Ala His Ala Arg
145 150 155 160

His Ile Leu Val Lys Thr Glu Asp Glu Ala Lys Arg Ile Ile Ser Glu
165 170 175

Ile Asp Lys Gln Pro Lys Ala Lys Lys Glu Ala Lys Phe Ile Glu Leu
180 185 190

Ala Asn Arg Asp Thr Ile Asp Pro Asn Ser Lys Asn Ala Gln Asn Gly
195 200 205

Gly Asp Leu Gly Lys Phe Gln Lys Asn Gln Met Ala Pro Asp Phe Ser
210 215 220

Lys Ala Ala Phe Ala Leu Thr Pro Gly Asp Tyr Thr Lys Thr Pro Val
225 230 235 240

Lys Thr Glu Phe Gly Tyr His Ile Ile Tyr Leu Ile Ser Lys Asp Ser
245 250 255

Pro Val Thr Tyr Thr Tyr Glu Gln Ala Lys Pro Thr Ile Lys Gly Met
260 265 270

Leu Gln Glu Lys Leu Phe Gln Glu Arg Met Asn Gln Arg Ile Glu Glu
275 280 285

Leu Arg Lys His Ala Lys Ile Val Ile Asn Lys
290 295



(2) INFORMATION FOR SEQ ID NO:245:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 376 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...323
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:245:

CAGCGTTGGT GTATTTTGGA GGAAGTTAGG AAAATATTGA AGGAGTATTG ATG AAA 56 

Met Lys 
1 
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AAA CAG CCA AAG OCT AAA AAA GAA GCT AAA TTC ATT GAG TTA GCC AAT 632 
Lys Gln Pro Lys Ala Lys Lys Glu Ala Lys Phe Ile Glu Leu Ala Asn
180 185 190

CGG GAT ACG ATT GAT CCT AAC AGC AAG AAC GCG CAA AAT GGC GGT GAT 680
Arg Asp Thr Ile Asp Pro Asn Ser Lys Asn Ala Gln Asn Gly Gly Asp
195 200 205 210

TTG GGG AAA TTC CAA AAG AAC CAA ATG GCT CCG GAT TTT TCT AAA GCC 728
Leu Gly Lys Phe Gln Lys Asn Gln Met Ala Pro Asp Phe Ser Lys Ala
215 220 225

GCT TTC GCT TTA ACT CCT GGG GAT TAC ACT AAA ACC CCT GTT AAA ACA 776
Ala Phe Ala Leu Thr Pro Gly Asp Tyr Thr Lys Thr Pro Val Lys Thr
230 235 240

GAG TTT GGT TAT CAT ATT ATC TAT TTG ATT TCT AAA GAT AGC CCT GTA 824
Glu Phe Gly Tyr His Ile Ile Tyr Leu Ile Ser Lys Asp Ser Pro Val
245 250 255

ACT TAT ACT TAT GAA CAG GCT AAA CCT ACC ATT AAG GGG ATG TTA CAA 872
Thr Tyr Thr Tyr Glu Gln Ala Lys Pro Thr Ile Lys Gly Met Leu Gln
260 265 270

GAA AAG CTT TTC CAA GAA CGC ATG AAT CAA CGC ATT GAG GAA CTA AGA 920
Glu Lys Leu Phe Gln Glu Arg Met Asn Gln Arg Ile Glu Glu Leu Arg
275 280 285 290

AAG CAC GCT AAA ATT GTT ATC AAC AAG TAATTGATGA GGTGTTATCA TGTTAGT 974
Lys His Ala Lys Ile Val Ile Asn Lys
295

TAAAGGCAAT GAAATTTTAT TGAAAG 1000



(2) INFORMATION FOR SEQ ID NO:244:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 299 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:244:



Met Lys Lys Asn Ile Leu Asn Leu Ala Leu Val Gly Ala Leu Ser Thr
1 5 10 15

Ser Phe Leu Met Ala Lys Pro Ala His Asn Ala Asn Asn Ala Thr His
20 25 30

Asn Thr Lys Lys Thr Thr Asp Ser Ser Ala Gly Val Leu Ala Thr Val
35 40 45

Asp Gly Arg Pro Ile Thr Lys Ser Asp Phe Asp Met Ile Lys Gln Arg
50 55 60
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(B) LOCATION: 51... 94 7 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

CTATAATGTG AATTTT^TGA TGAAAATTAG TTTAGAGTGG AGAACACACA ATG AAA 56 

Met Lys 
1 

AAA AAT ATC TTA AAT TTA GCG TTA GTG GGT GCG TTG AGC ACG TCG TTT 104
Lys Asn Ile Leu Asn Leu Ala Leu Val Gly Ala Leu Ser Thr Ser Phe
5 10 15

TTG ATG GCT AAG CCG GCT CAT AAC GCA AAT AAC GCT ACG CAT AAC ACG 152
Leu Met Ala Lys Pro Ala His Asn Ala Asn Asn Ala Thr His Asn Thr
20 25 30

AAA AAA ACG ACT GAT TCT TCA GCA GGC GTG TTA GCG ACA GTG GAT GGC 200
Lys Lys Thr Thr Asp Ser Ser Ala Gly Val Leu Ala Thr Val Asp Gly
35 40 45 50

AGA CCT ATC ACT AAA AGC GAT TTT GAC ATG ATT AAG CAA CGA AAT CCT 248
Arg Pro Ile Thr Lys Ser Asp Phe Asp Met Ile Lys Gln Arg Asn Pro
55 60 65

AAT TTT GAT TTT GAC AAG CTT AAA GAG AAA GAA AAA GAA GCC TTG ATT 296
Asn Phe Asp Phe Asp Lys Leu Lys Glu Lys Glu Lys Glu Ala Leu Ile
70 75 80

GAT CAA GCT ATT CGC ACC GCC CTT GTA GAA AAT GAA GCT AAA ACC GAG 344
Asp Gln Ala Ile Arg Thr Ala Leu Val Glu Asn Glu Ala Lys Thr Glu
85 90 95

AAA TTG GAC AGC ACT CCA GAA TTT AAA GCG ATG ATG GAA GCG GTT AAA 392
Lys Leu Asp Ser Thr Pro Glu Phe Lys Ala Met Met Glu Ala Val Lys
100 105 110

AAA CAG GCT TTA GTG GAA TTT TGG GCT AAA AAA CAG GCT GAA GAA GTG 440
Lys Gln Ala Leu Val Glu Phe Trp Ala Lys Lys Gln Ala Glu Glu Val
115 120 125 130

AAA AAA GTC CAA ATC CCA GAA AAA GAA ATG CAA GAT TTT TAC AAC GCT 488
Lys Lys Val Gln Ile Pro Glu Lys Glu Met Gln Asp Phe Tyr Asn Ala
135 140 145

AAC AAA GAT CAG CTT TTT GTC AAG CAA GAA GCC CAT GCT AGG CAT ATT 536
Asn Lys Asp Gln Leu Phe Val Lys Gln Glu Ala His Ala Arg His Ile
150 155 160

TTA GTG AAA ACC GAA GAT GAG GCT AAA CGG ATT ATT TCT GAG ATT GAC 584
Leu Val Lys Thr Glu Asp Glu Ala Lys Arg Ile Ile Ser Glu Ile Asp
165 170 175
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242: 

Met Ile Lys Ser Trp Thr Lys Lys Trp Phe Leu Ile Leu Phe Leu Met
1 5 10 15

Ala Ser Cys Ser Ser Tyr Leu Val Ala Thr Thr Gly Glu Lys Tyr Phe
20 25 30

Lys Met Ala Thr Gln Ala Phe Lys Arg Gly Asp Tyr His Lys Ala Val
35 40 45

Ala Phe Tyr Lys Arg Ser Cys Asn Leu Arg Val Gly Val Gly Cys Thr
50 55 60

Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val Asp Gln Asn Ile
65 70 75 80

Thr Lys Ala Val Phe Tyr Tyr Arg Arg Gly Cys Asn Leu Arg Asn His
85 90 95

Leu Ala Cys Ala Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val
100 105 110

Gln Lys Asn Leu Pro Lys Ala Ile Tyr Tyr Tyr Arg Arg Gly Cys His
115 120 125

Leu Lys Gly Gly Val Ser Cys Gly Ser Leu Gly Phe Met Tyr Phe Asn
130 135 140

Gly Thr Gly Val Lys Gln Asn Tyr Ala Lys Ala Leu Phe Leu Ser Lys
145 150 155 160

Tyr Ala Cys Ser Leu Asn Tyr Gly Ile Ser Cys Asn Phe Val Gly Tyr
165 170 175

Met Tyr Arg Asn Ala Lys Gly Val Gln Lys Asp Leu Lys Lys Ala Leu
180 185 190

Ala Asn Phe Lys Arg Gly Cys His Leu Lys Asp Gly Ala Ser Cys Val
195 200 205

Ser Leu Gly Tyr Met Tyr Glu Val Gly Met Asp Val Lys Gln Asn Gly
210 215 220

Glu Gln Ala Leu Asn Leu Tyr Lys Lys Gly Cys Tyr Leu Lys Arg Gly
225 230 235 240

Ser Gly Cys His Asn Val Ala Val Met Tyr Tyr Thr Gly Lys Gly Val
245 250 255

Pro Lys Asp Leu Asp Lys Ala Ile Ser Tyr Tyr Lys Lys Gly Cys Thr
260 265 270

Leu Gly Phe Ser Gly Ser Cys Lys Val Leu Glu Glu Val Ile Gly Lys
275 280 285

Lys Ser Asp Asp Leu Gln Asp Asp Ala Gln Asn Asp Thr Gln Asp Asp
290 295 300

Met Gln
305

(2) INFORMATION FOR SEQ ID NO:243:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1000 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
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Gly Gly Val Ser Cys Gly Ser Leu Gly Phe Met Tyr Phe Asn Gly Thr 
135 140 145 

GGC GTT AAG CAA AAT TAT GCC AAA GCC CTT TTT CTT TCT AAA TAC GCT 536
Gly Val Lys Gln Asn Tyr Ala Lys Ala Leu Phe Leu Ser Lys Tyr Ala
150 155 160

TGC AGT TTG AAT TAC GGC ATT AGT TGT AAC TTT GTA GGG TAT ATG TAT 584
Cys Ser Leu Asn Tyr Gly Ile Ser Cys Asn Phe Val Gly Tyr Met Tyr
165 170 175

AGG AAC GCC AAA GGC GTA CAG AAG GAT TTG AAA AAA GCC CTT GCG AAT 632
Arg Asn Ala Lys Gly Val Gln Lys Asp Leu Lys Lys Ala Leu Ala Asn
180 185 190

TTT AAA AGA GGG TGC CAT TTG AAA GAC GGA GCG AGT TGT GTG AGC TTG 680
Phe Lys Arg Gly Cys His Leu Lys Asp Gly Ala Ser Cys Val Ser Leu
195 200 205 210

GGA TAC ATG TAT GAA GTC GGT ATG GAT GTC AAA CAA AAT GGA GAG CAA 728
Gly Tyr Met Tyr Glu Val Gly Met Asp Val Lys Gln Asn Gly Glu Gln
215 220 225

GCC TTG AAT CTT TAT AAA AAG GGT TGT TAT TTA AAA AGG GGG AGC GGT 776
Ala Leu Asn Leu Tyr Lys Lys Gly Cys Tyr Leu Lys Arg Gly Ser Gly
230 235 240

TGT CAT AAT GTG GCG GTG ATG TAT TAC ACC GGT AAG GGC GTT CCA AAG 824
Cys His Asn Val Ala Val Met Tyr Tyr Thr Gly Lys Gly Val Pro Lys
245 250 255

GAT TTA GAT AAA GCC ATT TCG TAT TAT AAG AAA GGT TGC ACT CTA GGC 872
Asp Leu Asp Lys Ala Ile Ser Tyr Tyr Lys Lys Gly Cys Thr Leu Gly
260 265 270

TTT AGT GGT AGC TGT AAA GTG TTA GAA GAA GTG ATT GGC AAG AAG TCT 920
Phe Ser Gly Ser Cys Lys Val Leu Glu Glu Val Ile Gly Lys Lys Ser
275 280 285 290

GAT GAT TTG CAA GAT GAC GCG CAA AAC GAC ACG CAA GAT GAT ATG CAA T 969
Asp Asp Leu Gln Asp Asp Ala Gln Asn Asp Thr Gln Asp Asp Met Gln
295 300 305

AAGTTAAAGC TTATGGACTA ATGATTAAAA CTCATCTTAT AGAAATCTTT CT 1021

(2) INFORMATION FOR SEQ ID NO:242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 306 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...968
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:241:

TTACTGATTT TTCTTTGTGT GAGCTTTGGC TTAGTTTTGT AAGGAATGAG ATG ATA 56
Met Ile
1

AAG AGT TGG ACT AAA AAG TGG TTT TTG ATT TTA TTT TTA ATG GCA AGT 104
Lys Ser Trp Thr Lys Lys Trp Phe Leu Ile Leu Phe Leu Met Ala Ser
5 10 15

TGT TCC AGT TAT TTG GTG GCT ACA ACC GGT GAG AAA TAT TTT AAA ATG 152
Cys Ser Ser Tyr Leu Val Ala Thr Thr Gly Glu Lys Tyr Phe Lys Met
20 25 30

GCT ACT CAA GCC TTT AAG AGA GGG GAC TAC CAT AAA GCG GTG GCT TTT 200
Ala Thr Gln Ala Phe Lys Arg Gly Asp Tyr His Lys Ala Val Ala Phe
35 40 45 50

TAT AAG AGG AGC TGT AAT TTA AGG GTG GGG GTT GGT TGC ACG AGT TTA 248
Tyr Lys Arg Ser Cys Asn Leu Arg Val Gly Val Gly Cys Thr Ser Leu
55 60 65

GGC TCT ATG TAT GAA GAT GGC GAT GGC GTG GAT CAG AAT ATT ACA AAA 296
Gly Ser Met Tyr Glu Asp Gly Asp Gly Val Asp Gln Asn Ile Thr Lys
70 75 80

GCC GTT TTT TAT TAC AGA AGA GGG TGT AAT TTA AGG AAT CAT CTC GCT 344
Ala Val Phe Tyr Tyr Arg Arg Gly Cys Asn Leu Arg Asn His Leu Ala
85 90 95

TGC GCG AGT CTA GGC TCT ATG TAT GAA GAT GGC GAT GGT GTG CAA AAA 392
Cys Ala Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val Gln Lys
100 105 110

AAC CTT CCA AAG GCT ATC TAT TAT TAC AGG AGA GGG TGC CAC TTA AAG 440
Asn Leu Pro Lys Ala Ile Tyr Tyr Tyr Arg Arg Gly Cys His Leu Lys
115 120 125 130

GGT GGG GTG AGC TGT GGG AGT TTA GGT TTT ATG TAT TTT AAT GGC ACG 488
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 9: 

CTTGGGTAAT TCCAGGATCA ACGCCAATCA AAACGCTAAA TAAGGGGTGA ATG ATG 56
Met Met
1

GAT TTA GAA AGT TTG AGA GGT TTT GCG TAT GCG TTT TTT ACC ATT CTT 104
Asp Leu Glu Ser Leu Arg Gly Phe Ala Tyr Ala Phe Phe Thr Ile Leu
5 10 15

TTT ACG CTC TTT TTG TAT GCC TAT ATT TTT AGC ATG TAT AGA AAG CAA 152
Phe Thr Leu Phe Leu Tyr Ala Tyr Ile Phe Ser Met Tyr Arg Lys Gln
20 25 30

AAA AAA GGC ATT ATG GAT TAT GAG CGA TAC GGA TAC TTA GCG TTA AAT 200
Lys Lys Gly Ile Met Asp Tyr Glu Arg Tyr Gly Tyr Leu Ala Leu Asn
35 40 45 50

GAT GCT TTA GAA GAC GAG TTG ATT GAA CCA CGC CAT AAA AAA GTT CAT 248
Asp Ala Leu Glu Asp Glu Leu Ile Glu Pro Arg His Lys Lys Val His
55 60 65

GAT AAT GGC ATA AAG GAA AGT TGAAATGGAT TTTTTAAACG ACCATATAAA TGTT 303
Asp Asn Gly Ile Lys Glu Ser
70

TTTGGCTTGA TTGCAGCGC 322
(2) INFORMATION FOR SEQ ID NO:240:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 73 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240:

Met Met Asp Leu Glu Ser Leu Arg Gly Phe Ala Tyr Ala Phe Phe Thr
1 5 10 15

Ile Leu Phe Thr Leu Phe Leu Tyr Ala Tyr Ile Phe Ser Met Tyr Arg
20 25 30

Lys Gln Lys Lys Gly Ile Met Asp Tyr Glu Arg Tyr Gly Tyr Leu Ala
35 40 45

Leu Asn Asp Ala Leu Glu Asp Glu Leu Ile Glu Pro Arg His Lys Lys
50 55 60

Val His Asp Asn Gly Ile Lys Glu Ser
65 70

(2) INFORMATION FOR SEQ ID NO:241:
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238: 

Met Phe Ser Phe Leu Glu Lys Asn Pro Phe Phe Phe Thr Leu Ala Phe
1 5 10 15

Ile Phe Val Phe Ala Ile Ala Gly Leu Val Glu Ile Leu Pro Asn Phe
20 25 30

Phe Lys Ser Ala Arg Pro Ile Glu Gly Leu Arg Pro Tyr Thr Val Leu
35 40 45

Glu Thr Ala Gly Arg Gln Ile Tyr Ile Gln Glu Gly Cys Tyr His Cys
50 55 60

His Ser Gln Leu Ile Arg Pro Phe Gln Ala Glu Val Asp Arg Tyr Gly
65 70 75 80

Ala Tyr Ser Leu Ser Gly Glu Tyr Ala Tyr Asp Arg Pro Phe Leu Trp
85 90 95

Gly Ser Lys Arg Ile Gly Pro Asp Leu His Arg Val Gly Asp Tyr Arg
100 105 110

Thr Thr Asp Trp His Glu Lys His Met Phe Asp Pro Lys Ser Val Val
115 120 125

Pro His Ser Ile Met Pro Ala Tyr Lys His Leu Phe Thr Lys Lys Ser
130 135 140

Asp Phe Asp Thr Ala Tyr Ala Glu Ala Leu Thr Gln Lys Lys Val Phe
145 150 155 160

Gly Val Pro Tyr Asp Thr Glu Asn Gly Val Lys Leu Gly Ser Val Glu
165 170 175

Glu Ala Lys Lys Ala Tyr Leu Glu Glu Ala Lys Lys Ile Thr Ala Asp
180 185 190

Met Lys Asp Lys Arg Val Leu Glu Ala Ile Glu Arg Gly Glu Val Leu
195 200 205

Glu Ile Val Ala Leu Ile Ala Tyr Leu Asn Ser Leu Gly Asn Ser Arg
210 215 220

Ile Asn Ala Asn Gln Asn Ala Lys
225 230

(2) INFORMATION FOR SEQ ID NO:239:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 322 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...269
(D) OTHER INFORMATION:
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GCG GGG AGG CAA ATT TAT ATC CAA GAA GGT TGC TAT CAT TGC CAT TCC 24 8 

Ala Gly Arg Gln Ile Tyr Ile Gln Glu Gly Cys Tyr His Cys His Ser
55 60 65

CAG CTT ATT CGC CCT TTC CAA GCT GAG GTG GAT CGA TAT GGC GCG TAT 296
Gln Leu Ile Arg Pro Phe Gln Ala Glu Val Asp Arg Tyr Gly Ala Tyr
70 75 80

AGT TTG AGT GGG GAA TAC GCG TAT GAC AGG CCA TTT TTG TGG GGT TCT 344
Ser Leu Ser Gly Glu Tyr Ala Tyr Asp Arg Pro Phe Leu Trp Gly Ser
85 90 95

AAA AGG ATT GGC CCT GAT TTG CAC AGG GTA GGG GAT TAT CGC ACA ACC 392
Lys Arg Ile Gly Pro Asp Leu His Arg Val Gly Asp Tyr Arg Thr Thr
100 105 110

GAT TGG CAT GAA AAG CAC ATG TTT GAT CCT AAA AGC GTT GTG CCG CAC 440
Asp Trp His Glu Lys His Met Phe Asp Pro Lys Ser Val Val Pro His
115 120 125 130

AGC ATC ATG CCC GCC TAT AAG CAT TTA TTT ACA AAA AAG AGC GAT TTT 488
Ser Ile Met Pro Ala Tyr Lys His Leu Phe Thr Lys Lys Ser Asp Phe
135 140 145

GAC ACC GCT TAT GCA GAA GCT TTG ACG CAA AAA AAG GTT TTT GGC GTG 536
Asp Thr Ala Tyr Ala Glu Ala Leu Thr Gln Lys Lys Val Phe Gly Val
150 155 160

CCT TAT GAC ACA GAA AAC GGC GTG AAA TTA GGG AGC GTA GAA GAA GCG 584
Pro Tyr Asp Thr Glu Asn Gly Val Lys Leu Gly Ser Val Glu Glu Ala
165 170 175

AAA AAA GCC TAT TTA GAA GAA GCT AAA AAA ATC ACA GCC GAT ATG AAA 632
Lys Lys Ala Tyr Leu Glu Glu Ala Lys Lys Ile Thr Ala Asp Met Lys
180 185 190

GAC AAG AGG GTG CTA GAA GCG ATT GAG AGA GGT GAA GTG TTA GAA ATT 680
Asp Lys Arg Val Leu Glu Ala Ile Glu Arg Gly Glu Val Leu Glu Ile
195 200 205 210

GTG GCT TTG ATC GCT TAT TTG AAT AGC TTG GGT AAT TCC AGG ATC AAC 728
Val Ala Leu Ile Ala Tyr Leu Asn Ser Leu Gly Asn Ser Arg Ile Asn
215 220 225

GCC AAT CAA AAC GCT AAA TAAGGGGTGA ATGATGGATT TAGAAAGTTT GAGAGGTT 784
Ala Asn Gln Asn Ala Lys
230

TTGCGTATGC GTTTT 799

(2) INFORMATION FOR SEQ ID NO:238:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 232 amino acids

(B) TYPE: amino acid
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:236:

Met Lys Ala Ile Phe Ser Leu Phe Phe Leu Leu Ile Val Leu Lys Ala
1 5 10 15

Asn Pro Ile Asn Pro Leu Leu Glu Pro Leu Tyr Phe Pro Ser Tyr Ala
20 25 30

Gln Phe Leu Asn Leu Ala Pro His Phe Val Ile Lys Lys Lys Arg Ala
35 40 45

Tyr Arg Pro Phe Gln Trp Gly Asn Thr Ile Ile Ile Lys Arg His Asp
50 55 60

Leu Glu Glu Arg Gln Ser Asn Gln Pro Ser Asp Ile Phe Arg Gln Asn
65 70 75 80

Ala Glu Ile Asn Val Ser Ser Gln Thr Phe Leu Lys Gly Met Ser Asn
85 90 95

Ala Ser Ser Arg Thr Val Leu Asp Ser Ala Ala Gln
100 105

(2) INFORMATION FOR SEQ ID NO:237:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 799 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...746
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237:

AGCGTGAGCC CAATTACGCC ACGCCTATGT CTAGATAGGG GAGGTTGGAA ATG TTT 56 

Met Phe 
1 

AGT TTT TTA GAA AAA AAC CCG TTC TTT TTC ACT CTT GCG TTT ATT TTT 104
Ser Phe Leu Glu Lys Asn Pro Phe Phe Phe Thr Leu Ala Phe Ile Phe
5 10 15

GTG TTT GCG ATC GCG GGC TTG GTG GAG ATT TTG CCC AAC TTC TTC AAA 152
Val Phe Ala Ile Ala Gly Leu Val Glu Ile Leu Pro Asn Phe Phe Lys
20 25 30

TCC GCT CGC CCG ATT GAA GGC TTA CGG CCT TAT ACG GTT TTA GAG ACA 200
Ser Ala Arg Pro Ile Glu Gly Leu Arg Pro Tyr Thr Val Leu Glu Thr
35 40 45 50
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...374
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:235:

GTTAGAGATT TCTCCCAATT CTCAAGTGGG AGCGAGCGTG AAAATCCGCT ATG AAA 56
Met Lys
1

GCA ATC TTT AGC CTC TTT TTC CTT CTT ATT GTT TTA AAA GCA AAC CCC 104
Ala Ile Phe Ser Leu Phe Phe Leu Leu Ile Val Leu Lys Ala Asn Pro
5 10 15

ATA AAC CCT TTA TTA GAG CCG TTA TAT TTC CCC AGT TAC GCG CAA TTT 152
Ile Asn Pro Leu Leu Glu Pro Leu Tyr Phe Pro Ser Tyr Ala Gln Phe
20 25 30

TTA AAC TTA GCA CCT CAC TTT GTC ATT AAA AAA AAG CGC GCT TAT AGA 200
Leu Asn Leu Ala Pro His Phe Val Ile Lys Lys Lys Arg Ala Tyr Arg
35 40 45 50

CCC TTT CAA TGG GGG AAT ACC ATT ATC ATC AAA CGC CAT GAT TTA GAA 248
Pro Phe Gln Trp Gly Asn Thr Ile Ile Ile Lys Arg His Asp Leu Glu
55 60 65

GAA CGC CAA AGC AAC CAG CCA AGC GAT ATT TTC CGC CAA AAC GCT GAA 296
Glu Arg Gln Ser Asn Gln Pro Ser Asp Ile Phe Arg Gln Asn Ala Glu
70 75 80

ATC AAT GTG TCT TCT CAA ACT TTT TTA AAA GGA ATG AGC AAC GCT TCT 344
Ile Asn Val Ser Ser Gln Thr Phe Leu Lys Gly Met Ser Asn Ala Ser
85 90 95

TCA CGA ACA GTG CTT GAT TCA GCC GCT CAG TAAAATGCTA AAACTTTTTT TAA 397
Ser Arg Thr Val Leu Asp Ser Ala Ala Gln
100 105

TCACATTTTT CTTGGTATTT TCTTAATCC 426
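
The coding range given under FEATURE for an entry can be cross-checked against the length stated for the protein it encodes. The following is a minimal Python sketch of that check and is not part of the original listing. The helper name `cds_codon_count` is hypothetical, and the sketch assumes that a LOCATION range is 1-based, inclusive, and excludes the stop codon, which is how the ranges in this listing appear to be written.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of complete codons in a 1-based, inclusive range.

    Hypothetical helper for illustration only. It assumes the range
    excludes the stop codon, as the LOCATION fields here appear to do.
    """
    span = end - start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError(f"range {start}...{end} is not a whole number of codons")
    return span // 3


# SEQ ID NO:235 lists LOCATION 51...374, and SEQ ID NO:236 is stated
# to be 108 amino acids long.
codons_235 = cds_codon_count(51, 374)
print(codons_235)  # 108
```

Under the same assumption, the range 51...845 listed for SEQ ID NO:249 gives 265 codons, which agrees with the 265 amino acids stated for SEQ ID NO:250.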

(2) INFORMATION FOR SEQ ID NO:236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 108 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
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(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 



Met Lys Lys Ser Ile Leu Leu Gly Val Cys Leu Ala Phe Ser Cys Ala
1 5 10 15

His Ala Leu Asn Asp Leu Glu Leu Ile Lys Lys Ala Arg Glu Ser Gln
20 25 30

Leu Glu Pro Met Pro Met Gly Lys Ala Leu Lys Glu Tyr Gln Ile Lys
35 40 45

Lys Thr Arg Asp Val Gly Ile Gly Thr Lys Asn Ser Glu Ile Met Thr
50 55 60

Ser Ala Gln Val Glu Leu Gly Lys Met Leu Tyr Phe Asp Pro Arg Ile
65 70 75 80

Ser Thr Ser Tyr Leu Val Ser Cys Asn Thr Cys His Asn Leu Gly Leu
85 90 95

Gly Gly Val Asp Leu Val Pro Ser Ala Ile Gly Ser Gln Trp Lys Lys
100 105 110

Asn Pro His Leu Leu Ser Ser Pro Thr Val Tyr Asn Ser Val Phe Asn
115 120 125

Asp Val Gln Phe Trp Asp Gly Arg Val Thr His Leu Asn Glu Gln Ala
130 135 140

Gln Gly Pro Ile Gln Ser Ser Phe Glu Met Gly Ala Asp Pro Lys Val
145 150 155 160

Val Val Glu Lys Ile Asn Ser Met Pro Gly Tyr Val Lys Leu Phe Arg
165 170 175

Lys Ala Tyr Gly Ser Lys Val Lys Ile Asp Phe Lys Leu Ile Ala Asp
180 185 190

Ser Ile Ala Met Phe Glu Ala Thr Leu Ile Thr Pro Ser Arg Tyr Asp
195 200 205

Asp Phe Leu Arg Gly Asn Pro Lys Ala Leu Ser Lys Ala Glu Lys Glu
210 215 220

Gly Leu Asn Leu Phe Ile Ser Lys Gly Cys Val Ala Cys His Asn Gly
225 230 235 240

Ile Asn Leu Gly Gly Thr Met Gln Pro Phe Gly Val Val Lys Pro Tyr
245 250 255

Lys Phe Ala Asn Val Gly Asp Phe Lys Gly Asp Lys Asn Gly Leu Val
260 265 270

Lys Val Pro Thr Leu Arg Asn Ile Thr Glu Thr Met Pro Tyr Phe His
275 280 285

Asn Gly Gln Phe Trp Asp Val Lys Asp Ala Ile Lys Glu Met Gly Ser
290 295 300

Ile Gln Leu Gly Ile Glu Ile Ser Asp Glu Glu Ala Lys Lys Ile Glu
305 310 315 320

Thr Phe Phe Gly Ala Leu Arg Gly Lys Lys Pro Lys Ile Ile Tyr Pro
325 330 335

Glu Leu Pro Ile Met Thr Asp Lys Thr Pro Lys Pro Ser Phe
340 345 350

(2) INFORMATION FOR SEQ ID NO:235:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 426 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
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TAT GGC TCT AAA GTC AAA ATT GAT TTT AAA TTG ATC GCT GAT AGT ATC 63 2 

Tyr Gly Ser Lys Val Lys Ile Asp Phe Lys Leu Ile Ala Asp Ser Ile
180 185 190

GCT ATG TTT GAA GCC ACG CTT ATT ACC CCA AGC CGT TAC GAC GAT TTT 680
Ala Met Phe Glu Ala Thr Leu Ile Thr Pro Ser Arg Tyr Asp Asp Phe
195 200 205 210

TTA AGA GGC AAT CCT AAA GCG CTC AGC AAA GCC GAA AAA GAG GGG CTG 728
Leu Arg Gly Asn Pro Lys Ala Leu Ser Lys Ala Glu Lys Glu Gly Leu
215 220 225

AAT TTA TTC ATT TCT AAA GGC TGT GTG GCT TGC CAT AAC GGC ATT AAT 776
Asn Leu Phe Ile Ser Lys Gly Cys Val Ala Cys His Asn Gly Ile Asn
230 235 240

CTT GGG GGA ACG ATG CAG CCT TTT GGG GTG GTC AAA CCT TAT AAA TTC 824
Leu Gly Gly Thr Met Gln Pro Phe Gly Val Val Lys Pro Tyr Lys Phe
245 250 255

GCT AAT GTG GGC GAT TTC AAA GGC GAT AAA AAC GGG CTT GTG AAA GTG 872
Ala Asn Val Gly Asp Phe Lys Gly Asp Lys Asn Gly Leu Val Lys Val
260 265 270

CCT ACT TTA AGG AAT ATC ACC GAA ACG ATG CCC TAT TTC CAT AAC GGG 920
Pro Thr Leu Arg Asn Ile Thr Glu Thr Met Pro Tyr Phe His Asn Gly
275 280 285 290

CAA TTC TGG GAT GTT AAG GAT GCG ATT AAA GAA ATG GGC TCT ATC CAG 968
Gln Phe Trp Asp Val Lys Asp Ala Ile Lys Glu Met Gly Ser Ile Gln
295 300 305

TTA GGC ATT GAA ATC AGC GAT GAA GAA GCG AAA AAA ATT GAA ACT TTC 1016
Leu Gly Ile Glu Ile Ser Asp Glu Glu Ala Lys Lys Ile Glu Thr Phe
310 315 320

TTT GGA GCC TTA AGG GGT AAA AAA CCT AAA ATA ATC TAC CCA GAA CTC 1064
Phe Gly Ala Leu Arg Gly Lys Lys Pro Lys Ile Ile Tyr Pro Glu Leu
325 330 335

CCC ATA ATG ACA GAC AAA ACC CCT AAA CCC TCT TTT TGATTTAAAA AAGTCC 1116
Pro Ile Met Thr Asp Lys Thr Pro Lys Pro Ser Phe
340 345 350

TTTTAGGGGT CTTTGGCGCT AAATCTAAAA AATACTC 1153

(2) INFORMATION FOR SEQ ID NO: 234:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 350 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1100 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

TTATAATAAG AAAGTTTTTA TTATTTTTAA TGCTATTTTA GGAGTTCATC ATG AAA 56 

Met Lys 
1 

AAA TCC ATT TTA TTG GGC GTT TGC TTG GCT TTT TCT TGC GCT CAT GCC 104
Lys Ser Ile Leu Leu Gly Val Cys Leu Ala Phe Ser Cys Ala His Ala
5 10 15

CTA AAC GAT TTA GAA TTG ATC AAA AAA GCG AGG GAA AGC CAG CTA GAA 152
Leu Asn Asp Leu Glu Leu Ile Lys Lys Ala Arg Glu Ser Gln Leu Glu
20 25 30

CCC ATG CCT ATG GGC AAA GCG CTC AAA GAA TAC CAG ATT AAA AAG ACC 200
Pro Met Pro Met Gly Lys Ala Leu Lys Glu Tyr Gln Ile Lys Lys Thr
35 40 45 50

AGA GAT GTG GGT ATT GGC ACC AAA AAC AGC GAA ATC ATG ACC TCC GCT 248
Arg Asp Val Gly Ile Gly Thr Lys Asn Ser Glu Ile Met Thr Ser Ala
55 60 65

CAA GTG GAA TTA GGC AAA ATG CTC TAT TTT GAC CCT AGG ATT TCC ACT 296
Gln Val Glu Leu Gly Lys Met Leu Tyr Phe Asp Pro Arg Ile Ser Thr
70 75 80

TCC TAC CTC GTG TCT TGC AAC ACA TGC CAT AAT CTG GGC TTA GGC GGG 344
Ser Tyr Leu Val Ser Cys Asn Thr Cys His Asn Leu Gly Leu Gly Gly
85 90 95

GTG GAT TTA GTC CCA AGC GCC ATA GGC TCT CAA TGG AAG AAA AAC CCC 392
Val Asp Leu Val Pro Ser Ala Ile Gly Ser Gln Trp Lys Lys Asn Pro
100 105 110

CAC CTT TTA AGC TCC CCA ACG GTG TAT AAC TCT GTG TTT AAC GAT GTG 440
His Leu Leu Ser Ser Pro Thr Val Tyr Asn Ser Val Phe Asn Asp Val
115 120 125 130

CAG TTT TGG GAT GGC AGG GTT ACG CAT TTA AAC GAA CAG GCG CAA GGG 488
Gln Phe Trp Asp Gly Arg Val Thr His Leu Asn Glu Gln Ala Gln Gly
135 140 145

CCC ATC CAG TCT TCT TTT GAA ATG GGG GCT GAT CCC AAA GTG GTG GTA 536
Pro Ile Gln Ser Ser Phe Glu Met Gly Ala Asp Pro Lys Val Val Val
150 155 160

GAA AAA ATC AAT TCC ATG CCA GGC TAT GTC AAG CTC TTT AGA AAA GCC 584
Glu Lys Ile Asn Ser Met Pro Gly Tyr Val Lys Leu Phe Arg Lys Ala
165 170 175
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CCA AGC CTG ATC CAA TCT CAT AAA GAT TCT AAT CGC AAT TTT AAA ACC 344 
Pro Ser Leu Ile Gln Ser His Lys Asp Ser Asn Arg Asn Phe Lys Thr
85 90 95

ATC ACT TTT TGG CTC GTT CCC ACA AAA AGC CAC GCA ACT TAC TAC ATC 392
Ile Thr Phe Trp Leu Val Pro Thr Lys Ser His Ala Thr Tyr Tyr Ile
100 105 110

ATT AAG GTT TAATCACAAT GGATAAAAAC AACAATAATC TCCGCTTGAT TTTAGCGAT 450
Ile Lys Val
115

CGCT 454

(2) INFORMATION FOR SEQ ID NO: 232:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:232: 

Met Arg Asn Asn Lys Thr Pro Phe Leu Ser Ala Ile Phe Thr Ala Ser
1 5 10 15

Ile Arg Gly Tyr Gln Arg Phe Phe Ser Ala Phe Thr Pro Ser Ser Cys
20 25 30

Arg Phe Tyr Pro Thr Cys Ser Asn Tyr Ala Leu Trp Leu Leu Cys Phe
35 40 45

Glu Ser Pro Leu Ser Ala Met Gly Lys Ile Ala Ile Arg Ile Leu Ser
50 55 60

Cys Asn Pro Phe Cys Ser Gly Gly Ile Ala Tyr Pro Thr Thr Arg Leu
65 70 75 80

Lys Arg Pro Ser Leu Ile Gln Ser His Lys Asp Ser Asn Arg Asn Phe
85 90 95

Lys Thr Ile Thr Phe Trp Leu Val Pro Thr Lys Ser His Ala Thr Tyr
100 105 110

Tyr Ile Ile Lys Val
115

(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 
Asp Ser Gln Pro Phe Lys Ala Ile Gly Val Lys Glu Ser Val Leu Phe
145 150 155 160

Leu Glu Lys Arg Leu Thr Leu Lys Glu Leu Glu Glu Ala Ile Thr Ser
165 170 175

Asn Thr Met Lys Leu Ala Lys Arg Gln Asn Thr Phe Asn Lys Thr Gln
180 185 190

Phe Asn Asn Leu Tyr Val Gly Ser Ala Glu Glu Val Arg His Ala Ile
195 200 205

Leu Lys His Ser Lys Ser Gly Ile Lys Gly
210 215



(2) INFORMATION FOR SEQ ID NO: 231: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 401 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

CTATATGAAC AAAGCCTTAA AAGACTTGAA AAAAGGAATA ACTCATACCT ATG CGA 56

Met Arg
1

AAC AAT AAA ACG CCT TTT TTG AGC GCG ATT TTT ACG GCA TCA ATT AGO 104
Asn Asn Lys Thr Pro Phe Leu Ser Ala Ile Phe Thr Ala Ser Ile Arg
5 10 15

GGT TAC CAA CGC TTT TTT TCG GCT TTC ACC CCT TCA AGC TGC CGG TTT 152
Gly Tyr Gln Arg Phe Phe Ser Ala Phe Thr Pro Ser Ser Cys Arg Phe
20 25 30

TAC CCC ACT TGT TCC AAC TAC GCT CTG TGG TTG CTC TGT TTT GAA AGC 200
Tyr Pro Thr Cys Ser Asn Tyr Ala Leu Trp Leu Leu Cys Phe Glu Ser
35 40 45 50

CCT TTG AGC GCT ATG GGT AAG ATC GCT ATA AGG ATA CTC TCA TGC AAC 248
Pro Leu Ser Ala Met Gly Lys Ile Ala Ile Arg Ile Leu Ser Cys Asn
55 60 65

CCT TTT TGC TCT GGG GGC ATT GCT TAC CCT ACT ACT CGC TTG AAA CGC 296
Pro Phe Cys Ser Gly Gly Ile Ala Tyr Pro Thr Thr Arg Leu Lys Arg
70 75 80
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135 140 145 

CAG CCT TTT AAA GCC ATA GGC GTT AAA GAG AGC GTT CTT TTT TTA GAA 536
Gln Pro Phe Lys Ala Ile Gly Val Lys Glu Ser Val Leu Phe Leu Glu
150 155 160

AAA CGA CTC ACT TTA AAG GAG CTA GAA GAA GCG ATT ACC TCT AAC ACC 584
Lys Arg Leu Thr Leu Lys Glu Leu Glu Glu Ala Ile Thr Ser Asn Thr
165 170 175

ATG AAA TTA GCC AAG CGC CAA AAC ACT TTC AAT AAA ACC CAA TTC AAT 632
Met Lys Leu Ala Lys Arg Gln Asn Thr Phe Asn Lys Thr Gln Phe Asn
180 185 190

AAC CTT TAT GTG GGG AGC GCT GAA GAA GTT AGG CAT GCG ATT TTA AAA 680
Asn Leu Tyr Val Gly Ser Ala Glu Glu Val Arg His Ala Ile Leu Lys
195 200 205 210

CAC TCA AAA AGC GGC ATT AAA GGA TAATCTAATG GATACACAAA ACTTACCCGA 734
His Ser Lys Ser Gly Ile Lys Gly
215

TCAAATTATC CCTATTTTTA TGA 757
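
The residue line printed under each codon line in these listings can be reproduced by translating the codons one at a time. The following is a minimal sketch and not part of the original filing: it assumes the standard genetic code, and the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical helpers introduced only for illustration. The example input is the start of the coding sequence of SEQ ID NO: 229 as printed in this listing.

```python
# Illustrative sketch (not part of the filing): translate codons with the
# standard genetic code and report three-letter residue names.
from itertools import product

BASES = "TCAG"
# One-letter residues for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}

def translate(dna: str) -> list[str]:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        residues.append(THREE_LETTER[aa])
    return residues

if __name__ == "__main__":
    # First 18 codons of the coding sequence of SEQ ID NO: 229.
    start = ("ATG AGA GTG TCT TCT AAA GAG ATT TTA CTC ATT GTG "
             "GGG GGG AGC AGT TTT TAC")
    print(" ".join(translate(start)))
```

A check of this kind is also one way to spot a misread codon, since an in-frame `TAG`, `TAA` or `TGA` inside a coding region stops the translation early.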

(2) INFORMATION FOR SEQ ID NO: 230:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 218 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

Met Arg Val Ser Ser Lys Glu Ile Leu Leu Ile Val Gly Gly Ser Ser
1 5 10 15

Phe Tyr Leu Lys Ser Ile Leu Glu Gly Leu Ser Arg Met Pro Lys Leu
20 25 30

Ser Gly Glu Glu Val Val Lys Ile Glu Arg Glu Ile Ala Thr Leu Ser
35 40 45

Asn Pro Tyr Ile Phe Leu Lys Ser Ile Asp Pro Asn Met Ala Phe Lys
50 55 60

Ile His Pro Asn Asp Thr Tyr Arg Thr His Lys Ala Leu Glu Ile Phe
65 70 75 80

Tyr Ala Thr Cys Thr Pro Pro Ser Glu Tyr Phe Lys Ala Asn Pro Lys
85 90 95

Lys Pro Phe Glu His Ala Ile Ser Leu Phe Ala Leu Ser Ile Glu Lys
100 105 110

Ser Ala Leu His Asn Asn Ile Lys Arg Arg Thr Lys Asn Met Leu His
115 120 125

Ser Gly Leu Val Glu Glu Ile Lys Ala Leu Tyr Thr Gln Tyr Pro Lys
130 135 140
(A) LENGTH: 757 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 704 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

TTGATGAAAA AAATAACGCT CCTCTTTTTA AAACTCTTTT AGAGGATGCC ATG AGA 56 

Met Arg 
1 

GTG TCT TCT AAA GAG ATT TTA CTC ATT GTG GGG GGG AGC AGT TTT TAC 104
Val Ser Ser Lys Glu Ile Leu Leu Ile Val Gly Gly Ser Ser Phe Tyr
5 10 15

CTC AAA TCC ATT TTA GAA GGT TTG AGC CGC ATG CCA AAA CTG AGC GGT 152
Leu Lys Ser Ile Leu Glu Gly Leu Ser Arg Met Pro Lys Leu Ser Gly
20 25 30

GAG GAG GTT GTA AAA ATA GAG CGA GAA ATT GCC ACT CTT TCT AAC CCT 200
Glu Glu Val Val Lys Ile Glu Arg Glu Ile Ala Thr Leu Ser Asn Pro
35 40 45 50

TAT ATA TTT TTA AAA TCC ATT GAC CCT AAC ATG GCT TTT AAA ATC CAT 248
Tyr Ile Phe Leu Lys Ser Ile Asp Pro Asn Met Ala Phe Lys Ile His
55 60 65

CCA AAC GAC ACT TAC CGC ACC CAT AAG GCT TTA GAA ATC TTT TAT GCC 296
Pro Asn Asp Thr Tyr Arg Thr His Lys Ala Leu Glu Ile Phe Tyr Ala
70 75 80

ACC TGC ACG CCC CCA AGC GAG TAT TTT AAG GCC AAC CCT AAA AAA CCC 344
Thr Cys Thr Pro Pro Ser Glu Tyr Phe Lys Ala Asn Pro Lys Lys Pro
85 90 95

TTT GAG CAT GCT ATC TCC TTA TTC GCT CTG TCT ATT GAA AAA AGC GCG 392
Phe Glu His Ala Ile Ser Leu Phe Ala Leu Ser Ile Glu Lys Ser Ala
100 105 110

CTC CAT AAC AAT ATC AAA CGG CGC ACC AAA AAC ATG CTC CAT TCA GGG 440
Leu His Asn Asn Ile Lys Arg Arg Thr Lys Asn Met Leu His Ser Gly
115 120 125 130

CTT GTT GAA GAA ATC AAA GCC CTC TAT ACT CAA TAC CCT AAA GAT TCG 488
Leu Val Glu Glu Ile Lys Ala Leu Tyr Thr Gln Tyr Pro Lys Asp Ser
GGCGATTCAA GCCTTAAAAG CCGGGTCAAA ATC 958

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:

Met Lys Ile Ser Pro Ser Pro Arg Lys Leu Ser Lys Val Ser Thr Ser
1 5 10 15

Val Ser Phe Leu Ile Ser Phe Ala Leu Tyr Ala Ile Gly Phe Gly Tyr
20 25 30

Phe Leu Leu Arg Glu Asp Ala Pro Glu Pro Leu Ala Gln Ala Gly Thr
35 40 45

Thr Lys Val Thr Met Ser Leu Ala Ser Ile Asn Thr Asn Ser Asn Thr
50 55 60

Lys Thr Asn Ala Glu Ser Ala Lys Pro Lys Glu Glu Pro Lys Glu Lys
65 70 75 80

Pro Lys Lys Glu Glu Pro Lys Lys Glu Glu Pro Lys Lys Glu Val Thr
85 90 95

Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro
100 105 110

Lys Pro Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro
115 120 125

Lys Val Glu Glu Val Lys Lys Glu Glu Pro Lys Glu Glu Pro Lys Lys
130 135 140

Glu Glu Ala Lys Glu Glu Ala Lys Glu Lys Ser Ala Pro Lys Gln Val
145 150 155 160

Thr Thr Lys Asp Ile Val Lys Glu Lys Asp Lys Gln Glu Glu Ser Asn
165 170 175

Lys Thr Ser Glu Gly Ala Thr Ser Glu Ala Gln Ala Tyr Asn Pro Gly
180 185 190

Val Ser Asn Glu Phe Leu Met Lys Ile Gln Thr Ala Ile Ser Ser Lys
195 200 205

Asn Arg Tyr Pro Lys Met Ala Gln Ile Arg Gly Ile Glu Gly Glu Val
210 215 220

Leu Val Ser Phe Thr Ile Asn Ala Asp Gly Ser Val Thr Asp Ile Lys
225 230 235 240

Val Val Lys Ser Asn Thr Thr Asp Ile Leu Asn His Ala Ala Leu Glu
245 250 255

Ala Ile Lys Ser Ala Ala His Leu Phe Pro Lys Pro Glu Glu Thr Val
260 265 270

His Leu Lys Ile Pro Ile Ala Tyr Ser Leu Lys Glu Asp
275 280 285

(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 
AAT GCT GAG TCG GCT AAA CCC AAA GAA GAG CCT AAA GAA AAA CCC AAG 296
Asn Ala Glu Ser Ala Lys Pro Lys Glu Glu Pro Lys Glu Lys Pro Lys
70 75 80

AAA GAA GAG CCA AAA AAA GAA GAA CCC AAA AAG GAG GTT ACA AAG CCT 344
Lys Glu Glu Pro Lys Lys Glu Glu Pro Lys Lys Glu Val Thr Lys Pro
85 90 95

AAA CCT AAG CCT AAA CCC AAG CCA AAG CCA AAA CCA AAA CCT AAG CCT 392
Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro Lys Pro
100 105 110

GAA CCC AAA CCT GAA CCA AAA CCC GAG CCT AAG CCT GAG CCT AAA GTT 440
Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro Lys Pro Glu Pro Lys Val
115 120 125 130

GAA GAG GTT AAA AAA GAA GAG CCT AAA GAA GAG CCC AAA AAA GAA GAA 488
Glu Glu Val Lys Lys Glu Glu Pro Lys Glu Glu Pro Lys Lys Glu Glu
135 140 145

GCT AAA GAG GAA GCT AAA GAA AAA AGC GCT CCT AAA CAA GTA ACA ACT 536
Ala Lys Glu Glu Ala Lys Glu Lys Ser Ala Pro Lys Gln Val Thr Thr
150 155 160

AAG GAT ATA GTC AAA GAA AAA GAC AAG CAA GAA GAA TCC AAC AAA ACC 584
Lys Asp Ile Val Lys Glu Lys Asp Lys Gln Glu Glu Ser Asn Lys Thr
165 170 175

TCT GAG GGG GCC ACT TCT GAA GCT CAA GCT TAT AAC CCA GGG GTG AGC 632
Ser Glu Gly Ala Thr Ser Glu Ala Gln Ala Tyr Asn Pro Gly Val Ser
180 185 190

AAC GAA TTT TTA ATG AAG ATC CAA ACC GCT ATT TCT TCT AAA AAC CGC 680
Asn Glu Phe Leu Met Lys Ile Gln Thr Ala Ile Ser Ser Lys Asn Arg
195 200 205 210

TAC CCT AAA ATG GCG CAG ATT AGG GGT ATT GAG GGC GAA GTG TTG GTG 728
Tyr Pro Lys Met Ala Gln Ile Arg Gly Ile Glu Gly Glu Val Leu Val
215 220 225

AGC TTT ACG ATC AAT GCT GAT GGG AGC GTT ACG GAC ATT AAA GTG GTC 776
Ser Phe Thr Ile Asn Ala Asp Gly Ser Val Thr Asp Ile Lys Val Val
230 235 240

AAA AGC AAC ACC ACA GAT ATT TTA AAC CAT GCG GCT TTA GAA GCC ATT 824
Lys Ser Asn Thr Thr Asp Ile Leu Asn His Ala Ala Leu Glu Ala Ile
245 250 255

AAA AGC GCG GCA CAT CTA TTC CCT AAA CCA GAA GAA ACC GTG CAT CTA 872
Lys Ser Ala Ala His Leu Phe Pro Lys Pro Glu Glu Thr Val His Leu
260 265 270

AAA ATC CCT ATC GCT TAT AGC TTG AAA GAA GAC TGATTAGTCT TTCTTTTAGG 925
Lys Ile Pro Ile Ala Tyr Ser Leu Lys Glu Asp
275 280 285
Ile Ala Gln Gly Lys Ile Lys Val Ser Leu Pro Asn Ala Lys Asn Ala
35 40 45

Glu Lys Ser Gln Pro Asn Asp Gln Lys Val Val Val Ile Ser Val Asp
50 55 60

Glu His Asp Asn Ile Phe Val Asp Asp Lys Pro Met Asn Leu Glu Ala
65 70 75 80

Leu Ser Ala Val Val Lys Gln Thr Asp Pro Lys Thr Leu Ile Asp Leu
85 90 95

Lys Ser Asp Lys Ser Ser Arg Phe Glu Thr Phe Ile Ser Ile Met Asp
100 105 110

Ile Leu Lys Glu His Asn His Glu Asn Phe Ser Ile Ser Thr Gln Ala
115 120 125

Gln



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...905
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:227: 

TCGTTTTGAA ACTTTTATCA GCATTATGGA TATTTTAAAA GAGCATAATC ATG AAA 56

Met Lys
1

ATT TCT CCA TCT CCA CGC AAG CTC AGT AAA GTT TCA ACG AGT GTT AGC 104
Ile Ser Pro Ser Pro Arg Lys Leu Ser Lys Val Ser Thr Ser Val Ser
5 10 15

TTT TTA ATC TCT TTT GCC CTA TAC GCT ATA GGG TTT GGC TAT TTT TTA 152
Phe Leu Ile Ser Phe Ala Leu Tyr Ala Ile Gly Phe Gly Tyr Phe Leu
20 25 30

CTG CGC GAA GAC GCC CCA GAG CCT TTA GCG CAA GCC GGG ACC ACT AAG 200
Leu Arg Glu Asp Ala Pro Glu Pro Leu Ala Gln Ala Gly Thr Thr Lys
35 40 45 50

GTT ACC ATG AGT TTA GCC AGC ATC AAC ACT AAT TCC AAT ACA AAG ACT 248
Val Thr Met Ser Leu Ala Ser Ile Asn Thr Asn Ser Asn Thr Lys Thr
55 60 65
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TTGTTGAGAA AATCCGATGT TTTGAGCGAA AAATTCAGGA TCATGAAAAA ATG AAA 56 

Met Lys 
1 

AGC ATC AGA AGA GGC GAT GGG CTG AAT GTT GTC CCT TTC ATT GAT ATT 104
Ser Ile Arg Arg Gly Asp Gly Leu Asn Val Val Pro Phe Ile Asp Ile
5 10 15

ATG CTC GTT TTG CTA GCG ATT GTG TTG AGC ATT TCT ACT TTT ATT GCA 152
Met Leu Val Leu Leu Ala Ile Val Leu Ser Ile Ser Thr Phe Ile Ala
20 25 30

CAA GGT AAG ATT AAG GTC AGT CTC CCT AAC GCT AAA AAT GCG GAA AAA 200
Gln Gly Lys Ile Lys Val Ser Leu Pro Asn Ala Lys Asn Ala Glu Lys
35 40 45 50

TCC CAG CCA AAC GAT CAA AAA GTG GTG GTC ATC TCT GTA GAT GAG CAT 248
Ser Gln Pro Asn Asp Gln Lys Val Val Val Ile Ser Val Asp Glu His
55 60 65

GAC AAT ATT TTC GTA GAT GAC AAA CCG ATG AAT TTA GAA GCT TTG AGC 296
Asp Asn Ile Phe Val Asp Asp Lys Pro Met Asn Leu Glu Ala Leu Ser
70 75 80

GCT GTA GTC AAA CAA ACA GAC CCT AAA ACC CTT ATA GAC TTA AAA AGC 344
Ala Val Val Lys Gln Thr Asp Pro Lys Thr Leu Ile Asp Leu Lys Ser
85 90 95

GAC AAA AGC TCT CGT TTT GAA ACT TTT ATC AGC ATT ATG GAT ATT TTA 392
Asp Lys Ser Ser Arg Phe Glu Thr Phe Ile Ser Ile Met Asp Ile Leu
100 105 110

AAA GAG CAT AAT CAT GAA AAT TTC TCC ATC TCC ACG CAA GCT CAG TAAAG 442
Lys Glu His Asn His Glu Asn Phe Ser Ile Ser Thr Gln Ala Gln
115 120 125

TTTCAACGAG TGTTAGCTTT TTAATCTCTT TTGCCCTATA CGCTATAG 490

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: 

Met Lys Ser Ile Arg Arg Gly Asp Gly Leu Asn Val Val Pro Phe Ile
1 5 10 15

Asp Ile Met Leu Val Leu Leu Ala Ile Val Leu Ser Ile Ser Thr Phe
20 25 30
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100 105 110 

Gly Met Glu Gln Glu Arg Leu Ile Glu Glu Ser Gln Thr Leu Phe Leu
115 120 125

Gln Glu His Cys Pro Tyr Leu Ser Gly Val Lys Asn Leu Glu Glu Ala
130 135 140

Ser Asn Ala Leu Glu Val Gln Glu Gln Asn Asn Ala Leu Phe Leu Leu
145 150 155 160

Lys Glu Pro Lys Leu Ala Arg Leu Leu Ser Arg Leu Asp Leu Met Ser
165 170 175

Ala Leu Asn Ala Leu Cys Asp Gln Val Leu Glu Asn Gln Ala His Asn
180 185 190

Gln Gln Ser His Asn Lys Ile Leu Glu Tyr Asn Ala Leu Lys Asn His
195 200 205

Asp Phe Gln Ala Tyr Lys Ala Met Arg Leu Lys Lys Phe Lys Asn Lys
210 215 220

Leu Gln Ser Gln Ile Gln Ala Gln Glu Asp Ala Leu Lys Thr Phe Leu
225 230 235 240

Pro Leu Glu Lys Arg Leu Glu Thr Leu Lys Thr His Phe Leu Cys Asp
245 250 255

Lys Glu Asn Leu Lys Ser Cys Ala Lys Glu Leu His Gln Arg Tyr Gln
260 265 270

Asn Ala Leu Ile Glu Arg Asp Lys Glu Leu Lys Asn Ala Lys Asn Asn
275 280 285

Lys Glu Lys His Ala Leu Ile Leu Ala Asn Tyr Glu His Thr Leu Lys
290 295 300

Thr Leu Asn Ile Glu Phe Leu Ser Glu Leu Asn Lys Gln Met Ala Phe
305 310 315 320

Leu Asn Glu Thr Met Ala Leu Asn Ala Arg Val Leu Ala Leu Leu Ala
325 330 335

Lys Gln His Ala Lys Thr Pro Lys Pro Phe Asn Leu Ser Gly Gly Leu
340 345 350

Ser Gly Asp Leu Ser Gly Gly Lys Ala Leu Ile Lys Asn Ile Arg Leu
355 360 365

Asp Pro His Gly Phe Pro Ser Phe Lys Asn Phe Lys Gln Glu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 437 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
Leu Ile Glu Arg Asp Lys Glu Leu Lys Asn Ala Lys Asn Asn Lys Glu
275 280 285 290 

AAG CAT GCT CTA ATC TTA GCC AAT TAC GAG CAT ACT TTA AAA ACC TTG 968
Lys His Ala Leu Ile Leu Ala Asn Tyr Glu His Thr Leu Lys Thr Leu
295 300 305

AAT ATA GAA TTT TTA AGC GAA TTA AAT AAG CAA ATG GCG TTT TTG AAT 1016
Asn Ile Glu Phe Leu Ser Glu Leu Asn Lys Gln Met Ala Phe Leu Asn
310 315 320

GAA ACC ATG GCG TTA AAC GCC CGA GTT TTA GCC CTT TTA GCC AAA CAG 1064
Glu Thr Met Ala Leu Asn Ala Arg Val Leu Ala Leu Leu Ala Lys Gln
325 330 335

CAT GCC AAA ACG CCA AAG CCT TTC AAT TTG AGC GGT GGT TTA AGC GGT 1112
His Ala Lys Thr Pro Lys Pro Phe Asn Leu Ser Gly Gly Leu Ser Gly
340 345 350

GAT TTG AGC GGT GGG AAA GCT CTT ATT AAA AAT ATC CGC TTA GAT CCG 1160
Asp Leu Ser Gly Gly Lys Ala Leu Ile Lys Asn Ile Arg Leu Asp Pro
355 360 365 370

CAT GGA TTC CCT AGC TTT AAA AAT TTT AAG CAA GAG TAGGACAATA TTTGAC 1212
His Gly Phe Pro Ser Phe Lys Asn Phe Lys Gln Glu
375 380

AAGCAAAAAC AATTATAGTA AAATAAGAGC ATAACTT 1249

(2) INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 382 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

Met Arg Phe Phe Cys Phe Phe Leu Phe Phe Leu Thr Phe Ser Asn Ala
1 5 10 15

Gln Ile Met Met Thr Phe Asp Ser Gln Thr Asn Ala Lys Leu Ser Arg
20 25 30

Ser Asn Glu Gln Leu Ser Asp Met Leu Tyr Lys Leu Asn Glu Ser Leu
35 40 45

Arg Ile Tyr Gln Ser Val Leu Ser Asn Asn Gln Asp Gln Leu Lys Glu
50 55 60

Ile Lys Lys Ala Asn Ser Thr Leu Asn Ser Gln Arg Arg Phe Phe Asn
65 70 75 80

Ala Ser Gln Ile Arg Leu Met Asp Thr Asp Ala Leu Leu Lys Gln Ser
85 90 95

Ala Leu Glu Leu Glu Lys Leu Gln Ala Leu Glu Lys His Ile Lys Lys
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55 60 65 

AAA GCT AAC AGC ACC CTA AAT AGC CAA AGG CGT TTT TTT AAC GCC AGC 296
Lys Ala Asn Ser Thr Leu Asn Ser Gln Arg Arg Phe Phe Asn Ala Ser
70 75 80

CAG ATC CGC CTT ATG GAC ACT GAT GCA CTA TTG AAA CAA AGC GCT TTG 344
Gln Ile Arg Leu Met Asp Thr Asp Ala Leu Leu Lys Gln Ser Ala Leu
85 90 95

GAA TTA GAA AAA TTA CAA GCT TTA GAA AAA CAC ATA AAA AAG GGC ATG 392
Glu Leu Glu Lys Leu Gln Ala Leu Glu Lys His Ile Lys Lys Gly Met
100 105 110

GAA CAA GAA CGC TTA ATA GAA GAA TCC CAA ACG CTT TTT TTA CAA GAG 440
Glu Gln Glu Arg Leu Ile Glu Glu Ser Gln Thr Leu Phe Leu Gln Glu
115 120 125 130

CAT TGC CCT TAT TTG AGC GGC GTT AAG AAT TTA GAA GAG GCT TCA AAC 488
His Cys Pro Tyr Leu Ser Gly Val Lys Asn Leu Glu Glu Ala Ser Asn
135 140 145

GCT TTA GAA GTC CAA GAG CAA AAC AAC GCC CTT TTC TTA CTC AAA GAG 536
Ala Leu Glu Val Gln Glu Gln Asn Asn Ala Leu Phe Leu Leu Lys Glu
150 155 160

CCT AAA CTC GCC CGT TTG CTC TCA CGA TTG GAT TTG ATG AGC GCT TTA 584
Pro Lys Leu Ala Arg Leu Leu Ser Arg Leu Asp Leu Met Ser Ala Leu
165 170 175

AAC GCC TTG TGC GAT CAG GTT TTA GAA AAC CAA GCC CAT AAC CAA CAA 632
Asn Ala Leu Cys Asp Gln Val Leu Glu Asn Gln Ala His Asn Gln Gln
180 185 190

TCC CAT AAC AAA ATT TTA GAA TAC AAC GCT CTT AAA AAC CAT GAT TTT 680
Ser His Asn Lys Ile Leu Glu Tyr Asn Ala Leu Lys Asn His Asp Phe
195 200 205 210

CAA GCC TAT AAA GCC ATG CGT TTG AAA AAA TTT AAA AAC AAG CTT CAA 728
Gln Ala Tyr Lys Ala Met Arg Leu Lys Lys Phe Lys Asn Lys Leu Gln
215 220 225

AGT CAA ATC CAA GCC CAA GAA GAC GCT CTA AAA ACC TTT TTA CCC TTA 776
Ser Gln Ile Gln Ala Gln Glu Asp Ala Leu Lys Thr Phe Leu Pro Leu
230 235 240

GAA AAA CGC TTG GAA ACT TTA AAA ACG CAT TTT TTA TGC GAT AAA GAA 824
Glu Lys Arg Leu Glu Thr Leu Lys Thr His Phe Leu Cys Asp Lys Glu
245 250 255

AAC CTA AAA TCA TGC GCT AAA GAA TTG CAC CAA CGC TAC CAA AAC GCC 872
Asn Leu Lys Ser Cys Ala Lys Glu Leu His Gln Arg Tyr Gln Asn Ala
260 265 270

CTT ATA GAG CGA GAT AAA GAA TTA AAA AAC GCT AAA AAT AAT AAA GAA 920
Glu Asn Ile Asn Pro Ile Ile Asn Lys Glu Asp Lys Met Leu Glu Ala
225 230 235 240 

Arg Phe Asn Val Pro Asn Val Lys Gln Ile Tyr Tyr Pro Asn Met Phe
245 250 255

Ala Gln Val Glu Ile Phe Gln Lys Pro Gln Lys Met Lys Ile Leu Pro
260 265 270

Lys Glu Ala Val Leu Ile Lys Gly Gly Lys Ala Ile Val Phe Lys Lys
275 280 285

Asp Asp Phe Gly Leu Ser Pro Leu Glu Ile Lys Ala Val Arg Leu Ser
290 295 300

Asp Gly Ser Tyr Glu Ile Leu Glu Gly Leu Lys Ala Gly Glu Glu Val
305 310 315 320

Ala Asn Asn Ala Leu Phe Val Leu Asp Ala Asp Ala Gln Asn Asn Gly
325 330 335

Asp Tyr



(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1196 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 

AAAACTTAGA TAAAATAACA CGATAAAACC ATAGTAATAA AGATAACCCC ATG AGA 56

Met Arg
1

TTT TTT TGC TTT TTC TTA TTT TTT CTA ACC TTT TCA AAC GCA CAG ATA 104
Phe Phe Cys Phe Phe Leu Phe Phe Leu Thr Phe Ser Asn Ala Gln Ile
5 10 15

ATG ATG ACT TTT GAT TCT CAA ACT AAC GCC AAA CTC TCG CGC TCT AAC 152
Met Met Thr Phe Asp Ser Gln Thr Asn Ala Lys Leu Ser Arg Ser Asn
20 25 30

GAA CAG CTT TCA GAC ATG CTC TAT AAA CTC AAT GAA AGT TTA AGA ATC 200
Glu Gln Leu Ser Asp Met Leu Tyr Lys Leu Asn Glu Ser Leu Arg Ile
35 40 45 50

TAT CAA AGC GTG CTT TCC AAT AAC CAA GAT CAA CTC AAA GAA ATC AAA 248
Tyr Gln Ser Val Leu Ser Asn Asn Gln Asp Gln Leu Lys Glu Ile Lys
TTT GGC TTA AGC CCG TTA GAA ATT AAA GCC GTC CGC TTG AGC GAT GGG 968
Phe Gly Leu Ser Pro Leu Glu Ile Lys Ala Val Arg Leu Ser Asp Gly
295 300 305

AGT TAT GAG ATT TTA GAG GGT TTA AAG GCG GGC GAA GAA GTC GCT AAT 1016
Ser Tyr Glu Ile Leu Glu Gly Leu Lys Ala Gly Glu Glu Val Ala Asn
310 315 320

AAC GCT TTA TTC GTG CTA GAC GCT GAC GCT CAA AAC AAT GGG GAT TAT T 1065
Asn Ala Leu Phe Val Leu Asp Ala Asp Ala Gln Asn Asn Gly Asp Tyr
325 330 335

GAATGATAGA AAAGATCATT GATTTAAGCG TTAAAAACAA ACTCCTTACC AC 1117

(2) INFORMATION FOR SEQ ID NO:222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

Met Lys Arg Ile Leu Trp Leu Ala Leu Ile Leu Phe Phe Ser Pro Leu
1 5 10 15

Phe Ala Asn Ala Gln Lys Thr Gln Glu Ile Lys Lys Thr Lys Glu Ala
20 25 30

Lys Ser Gln Thr Arg Phe Asn Ile Ser Thr Thr Lys Val Ile Glu Lys
35 40 45

Glu Phe Ser Gln Ser Arg Arg Tyr Tyr Ala Leu Leu Glu Pro Asn Glu
50 55 60

Ala Leu Ile Phe Ser Gln Thr Leu Arg Phe Asp Gly Tyr Val Glu Lys
65 70 75 80

Leu Tyr Ala Asn Lys Thr Tyr Thr Pro Ile Lys Lys Gly Asp Arg Leu
85 90 95

Leu Ser Val Tyr Ser Pro Glu Leu Val Ser Ala Gln Ser Glu Leu Leu
100 105 110

Ser Ser Leu Lys Phe Asn Gln Gln Val Gly Ala Ile Lys Glu Lys Leu
115 120 125

Lys Leu Leu Gly Leu Glu Asn Ser Ser Ile Glu Lys Ile Ile Ser Ser
130 135 140

His Lys Val Gln Asn Glu Met Thr Ile Tyr Ser His Phe Asn Gly Ile
145 150 155 160

Ile Phe Lys Lys Ser Pro Asp Leu Asn Glu Gly Ser Phe Ile Lys Lys
165 170 175

Gly Gln Glu Leu Phe Gln Ile Ile Asp Leu Ser Gln Leu Trp Ala Leu
180 185 190

Val Lys Val Asn Gln Glu Asp Leu Glu Phe Leu Lys Asn Thr His Lys
195 200 205

Ala Ile Leu Phe Val Glu Gly Ile Lys Gly Glu Gln Glu Ile Thr Leu
210 215 220
ATT TTT TCT CAA ACC CTG CGT TTT GAT GGC TAT GTG GAA AAG CTT TAT 296
Ile Phe Ser Gln Thr Leu Arg Phe Asp Gly Tyr Val Glu Lys Leu Tyr
70 75 80

GCG AAT AAA ACC TAT ACC CCC ATT AAA AAG GGC GAC AGG TTA TTG AGC 344
Ala Asn Lys Thr Tyr Thr Pro Ile Lys Lys Gly Asp Arg Leu Leu Ser
85 90 95

GTG TAT TCC CCT GAA TTA GTG AGC GCT CAA AGC GAA TTG CTA TCA TCA 392
Val Tyr Ser Pro Glu Leu Val Ser Ala Gln Ser Glu Leu Leu Ser Ser
100 105 110

TTG AAA TTC AAC CAA CAA GTG GGA GCG ATT AAA GAA AAA TTA AAA CTA 440
Leu Lys Phe Asn Gln Gln Val Gly Ala Ile Lys Glu Lys Leu Lys Leu
115 120 125 130

TTA GGG TTA GAA AAC TCT AGC ATT GAA AAA ATC ATT AGC AGC CAT AAA 488
Leu Gly Leu Glu Asn Ser Ser Ile Glu Lys Ile Ile Ser Ser His Lys
135 140 145

GTC CAA AAT GAA ATG ACT ATT TAC TCT CAC TTC AAC GGC ATT ATT TTT 536
Val Gln Asn Glu Met Thr Ile Tyr Ser His Phe Asn Gly Ile Ile Phe
150 155 160

AAA AAA AGC CCG GAT CTC AAT GAG GGG AGC TTC ATT AAA AAA GGG CAA 584
Lys Lys Ser Pro Asp Leu Asn Glu Gly Ser Phe Ile Lys Lys Gly Gln
165 170 175

GAG TTG TTT CAA ATC ATA GAT TTA AGC CAA TTG TGG GCG CTG GTT AAA 632
Glu Leu Phe Gln Ile Ile Asp Leu Ser Gln Leu Trp Ala Leu Val Lys
180 185 190

GTC AAT CAA GAG GAT TTA GAA TTT TTA AAA AAC ACG CAT AAA GCG ATC 680
Val Asn Gln Glu Asp Leu Glu Phe Leu Lys Asn Thr His Lys Ala Ile
195 200 205 210

TTG TTT GTA GAA GGG ATT AAA GGC GAG CAA GAA ATC ACG CTT GAA AAT 728
Leu Phe Val Glu Gly Ile Lys Gly Glu Gln Glu Ile Thr Leu Glu Asn
215 220 225

ATC AAC CCC ATC ATC AAC AAA GAA GAT AAA ATG CTA GAA GCG CGC TTC 776
Ile Asn Pro Ile Ile Asn Lys Glu Asp Lys Met Leu Glu Ala Arg Phe
230 235 240

AAT GTG CCT AAT GTT AAA CAG ATT TAT TAC CCT AAC ATG TTC GCT CAA 824
Asn Val Pro Asn Val Lys Gln Ile Tyr Tyr Pro Asn Met Phe Ala Gln
245 250 255

GTA GAA ATC TTT CAA AAA CCA CAA AAA ATG AAG ATT TTG CCT AAA GAA 872
Val Glu Ile Phe Gln Lys Pro Gln Lys Met Lys Ile Leu Pro Lys Glu
260 265 270

GCG GTT TTG ATT AAA GGG GGG AAA GCT ATC GTG TTT AAA AAA GAC GAT 920
Ala Val Leu Ile Lys Gly Gly Lys Ala Ile Val Phe Lys Lys Asp Asp
275 280 285 290
Gly Leu Asn Ala Trp Glu Gln Thr Leu Lys Ala Asn Asp Leu Glu Val
20 25 30 

Lys Ile Lys Ser Val Gly Asn Pro Ile Lys Gly Asp Asn Thr Phe Ile
35 40 45

Leu Ser Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala Ile Val Arg
50 55 60

Val Gln Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu
65 70 75 80

Met Ala Gln Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn
85 90 95

Leu Ser Met Asn Gly Thr Trp Gln Val Arg Val Asp Ile Lys Ser Lys
100 105 110

Glu Gly Gln Val Tyr Arg Ala Lys Thr Ser Leu Asp Leu
115 120 125

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1064 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

AGCGCTTTAA ATAGCACTTA TTTGTCTTTA CAAAACCTTA AAGGATTAGA ATG AAA 56 

Met Lys 
1 

CGG ATT TTA TGG TTA GCC TTG ATT TTA TTT TTT AGC CCC TTA TTC GCT 104
Arg Ile Leu Trp Leu Ala Leu Ile Leu Phe Phe Ser Pro Leu Phe Ala
5 10 15

AAC GCT CAA AAA ACT CAA GAA ATT AAA AAA ACT AAA GAA GCT AAA AGC 152
Asn Ala Gln Lys Thr Gln Glu Ile Lys Lys Thr Lys Glu Ala Lys Ser
20 25 30

CAA ACC CGT TTT AAT ATT TCC ACC ACT AAG GTC ATA GAA AAA GAA TTT 200
Gln Thr Arg Phe Asn Ile Ser Thr Thr Lys Val Ile Glu Lys Glu Phe
35 40 45 50

TCT CAA AGC CGG CGC TAT TAC GCG CTT TTA GAG CCC AAT GAA GCG CTG 248
Ser Gln Ser Arg Arg Tyr Tyr Ala Leu Leu Glu Pro Asn Glu Ala Leu
55 60 65
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

TCCGTTCGCA ACAAGAATTT TCTTGTTATC TTAATGTAAA GGTCAAAACG ATG AAA 56 

Met Lys 
1 

AAG TTA GCC GCT TTA TTT TTA GTA AGC GTG TTG GGG GTT ATG GGT TTA 104
Lys Leu Ala Ala Leu Phe Leu Val Ser Val Leu Gly Val Met Gly Leu
5 10 15

AAC GCA TGG GAG CAA ACC CTA AAA GCT AAT GAC TTG GAA GTG AAA ATC 152
Asn Ala Trp Glu Gln Thr Leu Lys Ala Asn Asp Leu Glu Val Lys Ile
20 25 30

AAA TCC GTG GGT AAC CCC ATT AAA GGC GAT AAC ACT TTC ATT CTC AGC 200
Lys Ser Val Gly Asn Pro Ile Lys Gly Asp Asn Thr Phe Ile Leu Ser
35 40 45 50

CCC ACT TTA AAA GGT AAG GCT TTA GAA AAA GCT ATC GTT AGG GTG CAG 248
Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala Ile Val Arg Val Gln
55 60 65

TTT ATG ATG CCT GAA ATG CCC GGC ATG CCA GCG ATG AAA GAA ATG GCG 296
Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu Met Ala
70 75 80

CAA GTG AGT GAA AAA AAC GGC CTT TAT GAA GCT AAA ACC AAT CTT TCT 344
Gln Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn Leu Ser
85 90 95

ATG AAC GGG ACA TGG CAG GTT AGG GTG GAT ATT AAA TCT AAA GAG GGT 392
Met Asn Gly Thr Trp Gln Val Arg Val Asp Ile Lys Ser Lys Glu Gly
100 105 110

CAG GTT TAT CGC GCT AAA ACA AGC CTG GAT TTA TAAGAGCATG CTATCTTTTA 445
Gln Val Tyr Arg Ala Lys Thr Ser Leu Asp Leu
115 120 125

TAAGCGCGTT TGATAAAAGG GGCGTTTCAA TAC 478
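
The coding-sequence coordinates given under (ix) FEATURE can be cross-checked against the stated length of the paired protein entry, since a coding range of whole codons should satisfy (end - start + 1) / 3 = number of residues. The sketch below is illustrative only and not part of the original filing. The names `ENTRIES`, `codon_count` and `check_entries` are hypothetical, and the values are the LOCATION and LENGTH fields of the nucleotide and protein entries as read from this section of the listing.

```python
# Illustrative check (not part of the filing): each coding range should span
# a whole number of codons equal to the length of the paired protein entry.
# Keys are nucleotide SEQ ID NOs; values are
# (CDS start, CDS end, total nucleotide length, paired protein length).
ENTRIES = {
    219: (51, 425, 478, 125),
    221: (51, 1064, 1117, 338),
    223: (51, 1196, 1249, 382),
    225: (51, 437, 490, 129),
    227: (51, 905, 958, 285),
    229: (51, 704, 757, 218),
    231: (51, 401, 454, 117),
    233: (51, 1100, 1153, 350),
}

def codon_count(start: int, end: int) -> int:
    """Number of codons in a 1-based, inclusive coding range."""
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return span // 3

def check_entries(entries=ENTRIES) -> dict:
    """Map each SEQ ID NO to True when its CDS and protein lengths agree."""
    return {
        seq_id: codon_count(start, end) == protein_len and end <= total
        for seq_id, (start, end, total, protein_len) in entries.items()
    }

if __name__ == "__main__":
    for seq_id, ok in check_entries().items():
        print(f"SEQ ID NO:{seq_id}: {'consistent' if ok else 'MISMATCH'}")
```

On these figures the coding range in each entry excludes the stop codon, which is why the codon count equals the protein length exactly.
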
(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 

Met Lys Lys Leu Ala Ala Leu Phe Leu Val Ser Val Leu Gly Val Met 
1 5 10 15
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(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

Met Lys Lys Ala Leu Ile Ser Thr Leu Phe Gly Val Ser Leu Ala Phe
1 5 10 15

Ala Lys Pro Tyr Thr Ile Asp Lys Ala Asn Ser Ser Val Trp Phe Glu
20 25 30

Val Lys His Phe Thr Phe Asn Glu Thr Arg Gly Ala Phe Asp Asn Phe
35 40 45

Asp Gly Lys Ile Asp Leu Glu Pro Asn Thr Lys Met Leu Ser Val Phe
50 55 60

Glu Gly Asn Ile Asp Val Lys Ser Val Asn Thr Arg Asp Arg Lys Arg
65 70 75 80

Asp Asn His Leu Lys Thr Ala Asp Phe Phe Asp Val Val Lys Tyr Pro
85 90 95

Lys Gly Ser Phe Lys Met Thr Lys Tyr Glu Asp Gly Lys Ile Tyr Gly
100 105 110

Asp Leu Thr Leu Arg Gly Val Thr Lys Pro Val Val Leu Glu Ala Lys
115 120 125

Ile Gln Ala Pro Leu Gln Asn Pro Met Asn Lys Lys Glu Phe Met Val
130 135 140

Leu Gln Ala Glu Gly Lys Ile Asn Arg Lys Asp Phe Gly Ile Gly Lys
145 150 155 160

Thr Phe Ser Asp Ala Val Val Gly Asp Glu Val Lys Ile Glu Leu Lys
165 170 175

Leu Glu Ala Tyr Ala Gln
180

(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 478 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...425
(D) OTHER INFORMATION: 






WO 98/43478



PCT/US98/06371 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 

CGAATTGCTG TATAGTTAGC GTTTTTAATT CAAAATGAAG TGAGGAAACA ATG AAA 56 

Met Lys 
1 

AAA GCG TTA ATA TCC ACC CTT TTT GGT GTT AGT TTG GCG TTT GCA AAA 104 
Lys Ala Leu Ile Ser Thr Leu Phe Gly Val Ser Leu Ala Phe Ala Lys
5 10 15

CCT TAT ACG ATT GAT AAG GCA AAC TCT AGC GTG TGG TTT GAG GTC AAA 152
Pro Tyr Thr Ile Asp Lys Ala Asn Ser Ser Val Trp Phe Glu Val Lys
20 25 30

CAC TTC ACG TTC AAT GAA ACA AGA GGC GCG TTT GAT AAT TTT GAT GGC 200
His Phe Thr Phe Asn Glu Thr Arg Gly Ala Phe Asp Asn Phe Asp Gly
35 40 45 50

AAA ATT GAT CTA GAG CCC AAC ACT AAA ATG CTC AGC GTT TTT GAA GGC 248
Lys Ile Asp Leu Glu Pro Asn Thr Lys Met Leu Ser Val Phe Glu Gly
55 60 65

AAT ATT GAT GTG AAA AGC GTC AAT ACT AGG GAT AGA AAA AGA GAT AAC 296
Asn Ile Asp Val Lys Ser Val Asn Thr Arg Asp Arg Lys Arg Asp Asn
70 75 80

CAC TTG AAA ACA GCG GAC TTT TTT GAT GTG GTA AAA TAC CCC AAA GGG 344
His Leu Lys Thr Ala Asp Phe Phe Asp Val Val Lys Tyr Pro Lys Gly
85 90 95

AGC TTT AAA ATG ACC AAA TAC GAA GAT GGT AAA ATC TAT GGG GAT TTG 392
Ser Phe Lys Met Thr Lys Tyr Glu Asp Gly Lys Ile Tyr Gly Asp Leu
100 105 110

ACT CTT CGT GGC GTA ACC AAG CCT GTC GTA TTG GAA GCC AAA ATC CAA 440
Thr Leu Arg Gly Val Thr Lys Pro Val Val Leu Glu Ala Lys Ile Gln
115 120 125 130

GCC CCC TTA CAA AAC CCC ATG AAT AAA AAA GAA TTC ATG GTG TTA CAA 488
Ala Pro Leu Gln Asn Pro Met Asn Lys Lys Glu Phe Met Val Leu Gln
135 140 145

GCT GAA GGC AAA ATC AAC CGC AAG GAT TTT GGT ATC GGT AAA ACC TTT 536
Ala Glu Gly Lys Ile Asn Arg Lys Asp Phe Gly Ile Gly Lys Thr Phe
150 155 160

AGC GAT GCT GTC GTT GGA GAT GAG GTA AAG ATT GAG CTC AAA CTA GAA 584
Ser Asp Ala Val Val Gly Asp Glu Val Lys Ile Glu Leu Lys Leu Glu
165 170 175

GCT TAC GCC CAA TAATCGTTTT GCAAGAGATA GATATCTTCT TCTCTTGCGT TTTTC 641
Ala Tyr Ala Gln
180

TAACAGCA 649
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Val Lys Leu Ile His Lys Ala Leu Glu Leu Gly Ile Asn Phe Phe Asp
35 40 45 50

ACT GCA GAG GCT TAT GGG GAA GAT AAT GAA AAG CTT TTA GGC GAA CGA 248
Thr Ala Glu Ala Tyr Gly Glu Asp Asn Glu Lys Leu Leu Gly Glu Arg
55 60 65

TCA AGC CTT TTA AAG ACA AGG TTG TGG TAGCGAGCAA GTTTGGGATT TACTACG 302
Ser Ser Leu Leu Lys Thr Arg Leu Trp
70 75

CAGATCCTAA TGACAAATAC GCAACC 328

(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 
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Val His Asp Lys Lys
20 25 30

Gln Met Val Lys Leu Ile His Lys Ala Leu Glu Leu Gly Ile Asn Phe
35 40 45

Phe Asp Thr Ala Glu Ala Tyr Gly Glu Asp Asn Glu Lys Leu Leu Gly
50 55 60

Glu Arg Ser Ser Leu Leu Lys Thr Arg Leu Trp
65 70 75

(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 649 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 596 
(D) OTHER INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:214: 

Met Met Lys Lys Thr Leu Phe Ile Ser Leu Ala Leu Ala Leu Ser Leu
1 5 10 15

Asn Ala Gly Asn Ile Gln Ile Gln Ser Met Pro Lys Val Lys Glu Arg
20 25 30

Val Ser Val Pro Ser Lys Asp Asp Thr Asp Leu Phe Leu Pro Arg Phe
35 40 45

Tyr 
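
The LOCATION field of a nucleotide entry can be cross-checked against the LENGTH field of the protein entry listed after it. The sketch below is illustrative only. The function name `cds_codon_count` is hypothetical, the pairing of each nucleotide entry with the protein entry that follows it is an assumption based on the order of the listing, and the stop codon is assumed to lie outside the stated location.

```python
def cds_codon_count(location: str) -> int:
    """Return the number of codons spanned by a 'start...end' location.

    Positions are treated as 1-based and inclusive, which is an assumption
    about the convention used in this listing.
    """
    start_text, end_text = location.split("...")
    start, end = int(start_text), int(end_text)
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"{location!r} does not span whole codons")
    return length // 3


# (nucleotide entry, location, protein entry, stated protein length),
# copied from the entries in this listing.
ENTRIES = [
    ("SEQ ID NO:213", "51...197", "SEQ ID NO:214", 49),
    ("SEQ ID NO:215", "51...275", "SEQ ID NO:216", 75),
    ("SEQ ID NO:217", "51...596", "SEQ ID NO:218", 182),
]

for dna_id, location, protein_id, stated_length in ENTRIES:
    codons = cds_codon_count(location)
    status = "consistent" if codons == stated_length else "MISMATCH"
    print(f"{dna_id} {location} -> {codons} codons; "
          f"{protein_id} states {stated_length}: {status}")
```

A mismatch reported by such a check would point to a transcription error in either the LOCATION or the LENGTH field rather than to a property of the sequence itself.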



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 275 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 

TTAAGATTGC GGTTATACTG AAAAAAACAA TATGAAATCA AGGAGCTTGT ATG CAA 56 

Met Gln
1

CAG CGT CAT TTA GGC CCT TTA AAA GTG GGT GCA TTA GCT CTA GGG TGC 104
Gln Arg His Leu Gly Pro Leu Lys Val Gly Ala Leu Ala Leu Gly Cys
5 10 15

ATG GGC ATG ACT TAT GGG TAT GGG GAA GTC CAT GAT AAA AAG CAG ATG 152
Met Gly Met Thr Tyr Gly Tyr Gly Glu Val His Asp Lys Lys Gln Met
20 25 30

GTT AAA CTT ATC CAT AAG GCT TTG GAA TTG GGT ATT AAC TTT TTT GAC 200
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Ser Gln Ala Phe Ser His Asn Met Gln Leu Gln Asp Glu Ser Val Phe
325 330 335

Leu Phe Ile Ala Gly Gly Asn Pro Asn Val Lys Ala Glu Ala Leu Gln
340 345 350

Lys Glu Ile Val Ala Leu Leu Glu Lys Leu Lys Lys Gly Glu Ile Thr
355 360 365

Gln Ala Glu Leu Asp Lys Leu Lys Ile Asn Gln Lys Ala Asp Phe Ile
370 375 380

Ser Asn Leu Glu Ser Ser Ser Asp Val Ala Gly Leu Phe Ala Asp Tyr
385 390 395 400

Leu Val Gln Asn Asp Ile Gln Gly Leu Thr Asp Tyr Gln Arg Gln Phe
405 410 415

Leu Asp Leu Lys Val Ser Asp Leu Val Arg Val Ala Asn Glu Tyr Phe
420 425 430

Lys Asp Thr Gln Ser Thr Thr Val Phe Leu Lys Pro
435 440




(2) INFORMATION FOR SEQ ID NO: 213:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 250 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 197 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

ATATAATCTT TTCTTAATTT TGAAGTTTAG CAAATTTTAA GGAAGTAACC ATG ATG 56 

Met Met 
1 

AAA AAA ACC CTT TTT ATC TCT TTG GCT TTA GCG TTA AGC TTG AAT GCG 104 
Lys Lys Thr Leu Phe Ile Ser Leu Ala Leu Ala Leu Ser Leu Asn Ala
5 10 15

GGC AAT ATC CAA ATC CAG AGC ATG CCC AAA GTT AAA GAG CGA GTG AGT 152
Gly Asn Ile Gln Ile Gln Ser Met Pro Lys Val Lys Glu Arg Val Ser
20 25 30

GTC CCC TCT AAA GAC GAT ACG GAT CTA TTC TTA CCA CGA TTC TAT TAAGG 202
Val Pro Ser Lys Asp Asp Thr Asp Leu Phe Leu Pro Arg Phe Tyr
35 40 45

ACTCTATTAA GGCGGTGGTG AATATCTCCA CTGAAAAGAA GATTAAAA 250
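
The amino acid line printed under each codon line can be re-derived from the codons themselves, which is a convenient way to spot transcription errors in either line. The following is a minimal sketch, assuming the standard genetic code (the listing does not state which translation table was used); the names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative and not part of the listing.

```python
# Standard genetic code with bases ordered T, C, A, G (an assumption; the
# listing does not say which translation table applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":
            break
        residues.append(THREE_LETTER[amino])
    return residues


# First 18 codons of the coding sequence of SEQ ID NO:213 (positions 51-104).
first_codons = ("ATG ATG AAA AAA ACC CTT TTT ATC TCT TTG GCT TTA "
                "GCG TTA AGC TTG AAT GCG")
print(" ".join(translate(first_codons)))
```

Comparing the printed residues with the translation line in the listing shows whether a codon or a residue name was misread.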
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TTTCATTCAT CTAGCGCGTT GATTACGCCT 1435 
(2) INFORMATION FOR SEQ ID NO:212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

Met Lys His Phe Ser Val Lys Arg Leu Leu Gly Leu Ser Ser Val Leu
1 5 10 15

Leu Val Thr Leu Gly Ala Ser Met His Ala Gln Ser Tyr Leu Pro Lys
20 25 30

His Glu Ser Val Thr Leu Lys Asn Gly Leu Gln Val Val Ser Val Pro
35 40 45

Leu Glu Asn Lys Thr Gly Val Ile Glu Val Asp Val Leu Tyr Lys Val
50 55 60

Gly Ser Arg Asn Glu Thr Met Gly Lys Ser Gly Ile Ala His Met Leu
65 70 75 80

Glu His Leu Asn Phe Lys Ser Thr Lys Asn Leu Lys Ala Gly Glu Phe
85 90 95

Asp Lys Ile Val Lys Arg Phe Gly Gly Val Ser Asn Ala Ser Thr Ser
100 105 110

Phe Asp Ile Thr Arg Tyr Phe Ile Lys Thr Ser Gln Ala Asn Leu Asp
115 120 125

Lys Ser Leu Glu Leu Phe Ala Glu Thr Met Gly Ser Leu Asn Leu Lys
130 135 140

Glu Asp Glu Phe Leu Pro Glu Arg Gln Val Val Ala Glu Glu Arg Arg
145 150 155 160

Trp Arg Thr Asp Asn Ser Pro Ile Gly Met Leu Tyr Phe Arg Phe Phe
165 170 175

Asn Thr Ala Tyr Val Tyr His Pro Tyr His Trp Thr Pro Ile Gly Phe
180 185 190

Met Asp Asp Ile Gln Asn Trp Thr Leu Lys Asp Ile Lys Lys Phe His
195 200 205

Ser Leu Tyr Tyr Gln Pro Lys Asn Ala Ile Val Leu Val Val Gly Asp
210 215 220

Val Asn Ser Gln Lys Val Phe Glu Leu Ser Lys Lys His Phe Glu Ser
225 230 235 240

Leu Lys Asn Leu Asp Glu Lys Ala Ile Pro Thr Pro Tyr Met Lys Glu
245 250 255

Pro Lys Gln Asp Gly Ala Arg Thr Ala Val Val His Lys Asp Gly Val
260 265 270

His Leu Glu Trp Val Ala Leu Gly Tyr Lys Val Pro Ala Phe Lys His
275 280 285

Lys Asp Gln Val Ala Leu Asp Ala Leu Ser Arg Leu Leu Gly Glu Gly
290 295 300

Lys Ser Ser Trp Leu Gln Ser Glu Leu Val Asp Lys Lys Arg Leu Ala
305 310 315 320
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TCC CAA AAG GTT TTT GAA TTG AGT AAA AAG CAT TTT GAA TCC TTA AAA 776 
Ser Gln Lys Val Phe Glu Leu Ser Lys Lys His Phe Glu Ser Leu Lys
230 235 240

AAC CTT GAT GAA AAA GCT ATC CCC ACC CCT TAC ATG AAA GAG CCT AAG 824
Asn Leu Asp Glu Lys Ala Ile Pro Thr Pro Tyr Met Lys Glu Pro Lys
245 250 255

CAA GAT GGA GCC AGA ACG GCA GTC GTG CAT AAA GAT GGG GTC CAT TTA 872
Gln Asp Gly Ala Arg Thr Ala Val Val His Lys Asp Gly Val His Leu
260 265 270

GAA TGG GTG GCC CTT GGG TAT AAA GTG CCT GCT TTC AAG CAT AAA GAT 920
Glu Trp Val Ala Leu Gly Tyr Lys Val Pro Ala Phe Lys His Lys Asp
275 280 285 290

CAA GTC GCC TTA GAC GCA CTA AGT AGG CTT TTA GGC GAA GGC AAA AGC 968
Gln Val Ala Leu Asp Ala Leu Ser Arg Leu Leu Gly Glu Gly Lys Ser
295 300 305

TCG TGG TTG CAA AGC GAA TTA GTG GAT AAA AAA CGC TTG GCT TCT CAA 1016
Ser Trp Leu Gln Ser Glu Leu Val Asp Lys Lys Arg Leu Ala Ser Gln
310 315 320

GCT TTC TCG CAC AAC ATG CAA TTA CAA GAT GAA AGC GTG TTT TTA TTC 1064
Ala Phe Ser His Asn Met Gln Leu Gln Asp Glu Ser Val Phe Leu Phe
325 330 335

ATT GCG GGG GGT AAT CCT AAT GTC AAA GCC GAA GCC TTA CAA AAA GAA 1112
Ile Ala Gly Gly Asn Pro Asn Val Lys Ala Glu Ala Leu Gln Lys Glu
340 345 350

ATC GTA GCG CTT TTA GAA AAG CTG AAA AAA GGC GAA ATC ACT CAA GCG 1160
Ile Val Ala Leu Leu Glu Lys Leu Lys Lys Gly Glu Ile Thr Gln Ala
355 360 365 370

GAA TTA GAC AAG CTC AAA ATC AAT CAA AAA GCT GAC TTT ATT TCT AAT 1208
Glu Leu Asp Lys Leu Lys Ile Asn Gln Lys Ala Asp Phe Ile Ser Asn
375 380 385

TTA GAA AGT TCT AGC GAT GTT GCG GGG CTT TTT GCG GAC TAT TTA GTG 1256
Leu Glu Ser Ser Ser Asp Val Ala Gly Leu Phe Ala Asp Tyr Leu Val
390 395 400

CAA AAC GAT ATT CAA GGC TTG ACG GAT TAC CAG CGA CAA TTT TTG GAT 1304
Gln Asn Asp Ile Gln Gly Leu Thr Asp Tyr Gln Arg Gln Phe Leu Asp
405 410 415

TTA AAA GTG AGC GAT TTG GTG CGT GTG GCC AAT GAA TAT TTT AAA GAC 1352
Leu Lys Val Ser Asp Leu Val Arg Val Ala Asn Glu Tyr Phe Lys Asp
420 425 430

ACC CAA TCA ACC ACC GTG TTT TTG AAA CCT TAAAAGAGCC TTATAACATG CAA 1405
Thr Gln Ser Thr Thr Val Phe Leu Lys Pro
435 440 
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CAT TTT TCT GTT AAA AGA CTT TTA GGG CTT ACT TCT GTC TTG TTA GTC 104 
His Phe Ser Val Lys Arg Leu Leu Gly Leu Ser Ser Val Leu Leu Val 
5 10 15 

ACT TTA GGA GCG AGC ATG CAC GCA CAA TCT TAC TTA CCC AAA CAT GAG 152 
Thr Leu Gly Ala Ser Met His Ala Gln Ser Tyr Leu Pro Lys His Glu
20 25 30

AGC GTT ACC TTA AAA AAC GGG TTG CAA GTC GTG AGC GTC CCC CTA GAA 200
Ser Val Thr Leu Lys Asn Gly Leu Gln Val Val Ser Val Pro Leu Glu
35 40 45 50

AAT AAA ACC GGG GTT ATA GAA GTG GAT GTG CTT TAT AAA GTC GGC TCT 248
Asn Lys Thr Gly Val Ile Glu Val Asp Val Leu Tyr Lys Val Gly Ser
55 60 65

AGA AAC GAA ACC ATG GGA AAG AGC GGG ATC GCT CAC ATG TTA GAG CAT 296
Arg Asn Glu Thr Met Gly Lys Ser Gly Ile Ala His Met Leu Glu His
70 75 80

TTG AAT TTT AAA AGC ACC AAA AAC CTT AAA GCC GGC GAA TTT GAT AAA 344
Leu Asn Phe Lys Ser Thr Lys Asn Leu Lys Ala Gly Glu Phe Asp Lys
85 90 95

ATC GTT AAG CGT TTT GGG GGC GTG AGT AAC GCT TCT ACG AGT TTT GAT 392
Ile Val Lys Arg Phe Gly Gly Val Ser Asn Ala Ser Thr Ser Phe Asp
100 105 110

ATT ACG CGC TAC TTC ATT AAA ACC AGT CAG GCT AAC TTG GAT AAG TCT 440
Ile Thr Arg Tyr Phe Ile Lys Thr Ser Gln Ala Asn Leu Asp Lys Ser
115 120 125 130

TTA GAA TTG TTC GCT GAA ACC ATG GGT TCA TTG AAT TTA AAA GAA GAT 488
Leu Glu Leu Phe Ala Glu Thr Met Gly Ser Leu Asn Leu Lys Glu Asp
135 140 145

GAG TTT TTG CCT GAG CGT CAA GTG GTC GCT GAA GAA AGG CGA TGG CGC 536
Glu Phe Leu Pro Glu Arg Gln Val Val Ala Glu Glu Arg Arg Trp Arg
150 155 160

ACT GAT AAT TCC CCT ATC GGC ATG CTT TAT TTC CGC TTT TTT AAC ACC 584
Thr Asp Asn Ser Pro Ile Gly Met Leu Tyr Phe Arg Phe Phe Asn Thr
165 170 175

GCT TAT GTC TAT CAC CCC TAC CAT TGG ACG CCC ATT GGT TTT ATG GAT 632
Ala Tyr Val Tyr His Pro Tyr His Trp Thr Pro Ile Gly Phe Met Asp
180 185 190

GAT ATT CAA AAT TGG ACT TTA AAA GAC ATT AAA AAA TTC CAT TCG CTC 680
Asp Ile Gln Asn Trp Thr Leu Lys Asp Ile Lys Lys Phe His Ser Leu
195 200 205 210

TAT TAT CAG CCT AAA AAC GCT ATC GTT TTG GTG GTA GGC GAT GTC AAT 728
Tyr Tyr Gln Pro Lys Asn Ala Ile Val Leu Val Val Gly Asp Val Asn
215 220 225 
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Tyr Leu Asp Lys Glu Gly Lys Leu Gln Asp Phe Lys Ser Val Gln Glu
260 265 270

Gln Val Lys His Asp Leu Ser Met Gln Lys Ala Asn Glu Lys Ala Leu
275 280 285

Arg Ser Tyr Ile Ala Leu Lys Lys Ala Asn Ala Gln Asn Tyr Thr Thr
290 295 300

Gln Asp Phe Glu Glu Asn Asn Ser Pro Tyr Thr Ala Glu Ile Thr Gln
305 310 315 320

Lys Leu Thr Ala Leu Lys Pro Leu Glu Ile Leu Lys Pro Glu Pro Phe
325 330 335

Lys Asp Gly Phe Ile Val Val Gln Leu Ile Ser Gln Ile Lys Asp Glu
340 345 350

Leu Gln Asn Phe Asn Glu Ala Lys Ser Ala Leu Lys Thr Arg Leu Thr
355 360 365

Gln Glu Lys Thr Leu Met Ala Leu Gln Thr Leu Ala Lys Glu Lys Leu
370 375 380

Lys Asp Phe Lys Gly Lys Ser Val Gly Tyr Val Ser Pro Asn Phe Gly
385 390 395 400

Gly Thr Ile Ser Glu Leu Asn Gln Glu Glu Ser Ala Lys Phe Ile Asn
405 410 415

Ala Leu Phe Asn Arg Gln Glu Lys Lys Gly Phe Ile Ala Ile Asn Asn
420 425 430

Lys Val Val Leu Tyr Gln Ile Thr Glu Gln Asn Phe Asn His Ser Phe
435 440 445

Ser Ala Glu Glu Ser Gln Tyr Met Gln Arg Leu Val Asn Asn Thr Lys
450 455 460

Thr Asp Phe Phe Asp Lys Ala Leu Ile Glu Glu Leu Lys Lys Arg Tyr
465 470 475 480

Lys Ile Val Lys Tyr Ile Gln
485

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1435 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1382 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

CAAAAAGATG GATTTTTGAG CGTCAAAGAG GCTATAGGAG CGGATTTAAG ATG AAA 56 

Met Lys 
1 
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TTT TTT GAT AAA GCG TTG ATA GAA GAA TTG AAA AAA CGC TAT AAG ATA 1496
Phe Phe Asp Lys Ala Leu Ile Glu Glu Leu Lys Lys Arg Tyr Lys Ile
470 475 480

GTC AAA TAC ATT CAA TAAATGCAAG GGGAAATCAT GGAACATAAA GAAATCGTTA T 1552
Val Lys Tyr Ile Gln
485 

AGGGGTTGAT CT 1564 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 487 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

Met Ile Glu Trp Met Gln Asn His Arg Lys Tyr Leu Val Val Thr Ile
1 5 10 15

Trp Ile Ser Thr Ile Ala Phe Ile Ala Ala Gly Met Ile Gly Trp Gly
20 25 30

Gln Tyr Ser Phe Ser Leu Asp Ser Asp Ser Ala Ala Lys Val Gly Gln
35 40 45

Ile Lys Ile Ser Gln Glu Glu Leu Ala Gln Glu Tyr Arg Arg Leu Lys
50 55 60

Asp Ala Tyr Ala Glu Ser Ile Pro Asp Phe Lys Glu Leu Thr Lys Asp
65 70 75 80

Gln Ile Lys Ala Met His Leu Glu Lys Ser Ala Leu Asp Ser Leu Ile
85 90 95

Asn Gln Ala Leu Leu Arg Asn Leu Ala Leu Asp Leu Gly Leu Gly Ala
100 105 110

Thr Lys Gln Glu Val Ala Lys Glu Ile Arg Lys Thr Ser Val Phe Gln
115 120 125

Lys Asp Gly Val Phe Asp Glu Glu Leu Tyr Lys Asn Ile Leu Lys Gln
130 135 140

Ser His Tyr Arg Pro Lys His Phe Glu Glu Ser Val Glu Arg Leu Leu
145 150 155 160

Ile Leu Gln Lys Ile Ser Thr Leu Phe Pro Lys Thr Thr Thr Pro Leu
165 170 175

Glu Gln Ser Ser Leu Ser Leu Trp Ala Lys Leu Gln Asp Lys Leu Asp
180 185 190

Ile Leu Ile Leu Asn Pro Ser Asp Val Lys Ile Ser Leu Asn Glu Glu
195 200 205

Glu Met Lys Lys Tyr Tyr Glu Ser His Lys Lys Asp Phe Lys Lys Pro
210 215 220

Thr Ser Phe Lys Thr Arg Ser Leu Tyr Phe Asp Ala Ser Leu Glu Lys
225 230 235 240

Pro Asp Leu Lys Glu Leu Glu Glu Tyr Tyr His Lys Asn Lys Val Ser
245 250 255
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TTG AAG GAG TTG GAG GAA TAC TAC CAT AAA AAC AAG GTG TCT TAT TTG 824
Leu Lys Glu Leu Glu Glu Tyr Tyr His Lys Asn Lys Val Ser Tyr Leu
245 250 255

GAC AAA GAG GGG AAA TTG CAG GAT TTT AAA AGC GTT CAA GAG CAA GTG 872
Asp Lys Glu Gly Lys Leu Gln Asp Phe Lys Ser Val Gln Glu Gln Val
260 265 270

AAG CAT GAT TTA AGC ATG CAA AAA GCG AAT GAA AAA GCC TTA AGG AGC 920
Lys His Asp Leu Ser Met Gln Lys Ala Asn Glu Lys Ala Leu Arg Ser
275 280 285 290

TAT ATC GCT CTA AAA AAA GCG AAC GCG CAA AAC TAC ACC ACA CAA GAT 968
Tyr Ile Ala Leu Lys Lys Ala Asn Ala Gln Asn Tyr Thr Thr Gln Asp
295 300 305

TTT GAA GAG AAC AAC TCC CCC TAT ACT GCT GAA ATC ACG CAA AAA CTC 1016
Phe Glu Glu Asn Asn Ser Pro Tyr Thr Ala Glu Ile Thr Gln Lys Leu
310 315 320

ACC GCT CTC AAA CCC CTT GAA ATC CTA AAG CCA GAG CCT TTT AAA GAT 1064
Thr Ala Leu Lys Pro Leu Glu Ile Leu Lys Pro Glu Pro Phe Lys Asp
325 330 335

GGT TTT ATT GTG GTG CAA CTC ATC TCT CAA ATT AAA GAC GAA TTG CAA 1112
Gly Phe Ile Val Val Gln Leu Ile Ser Gln Ile Lys Asp Glu Leu Gln
340 345 350

AAT TTT AAT GAA GCT AAA AGC GCT CTT AAA ACC CGC CTA ACT CAA GAA 1160
Asn Phe Asn Glu Ala Lys Ser Ala Leu Lys Thr Arg Leu Thr Gln Glu
355 360 365 370

AAA ACC CTT ATG GCG TTG CAA ACT TTA GCC AAA GAA AAG CTT AAG GAT 1208
Lys Thr Leu Met Ala Leu Gln Thr Leu Ala Lys Glu Lys Leu Lys Asp
375 380 385

TTT AAG GGC AAA AGC GTG GGC TAT GTA AGC CCT AAT TTT GGA GGC ACT 1256
Phe Lys Gly Lys Ser Val Gly Tyr Val Ser Pro Asn Phe Gly Gly Thr
390 395 400

ATT AGT GAG CTT AAC CAA GAA GAA AGT GCT AAG TTT ATC AAC GCT CTT 1304
Ile Ser Glu Leu Asn Gln Glu Glu Ser Ala Lys Phe Ile Asn Ala Leu
405 410 415

TTT AAC CGC CAG GAA AAA AAG GGG TTT ATC GCT ATT AAT AAT AAA GTG 1352
Phe Asn Arg Gln Glu Lys Lys Gly Phe Ile Ala Ile Asn Asn Lys Val
420 425 430

GTG CTC TAT CAA ATC ACA GAA CAA AAT TTC AAC CAC TCA TTT AGT GCA 1400
Val Leu Tyr Gln Ile Thr Glu Gln Asn Phe Asn His Ser Phe Ser Ala
435 440 445 450

GAA GAA AGC CAG TAT ATG CAG CGT TTA GTC AAT AAC ACT AAA ACG GAT 1448
Glu Glu Ser Gln Tyr Met Gln Arg Leu Val Asn Asn Thr Lys Thr Asp
455 460 465
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AGC ACG ATC GCT TTT ATT GCC GCT GGG ATG ATA GGC TGG GGG CAA TAC 152
Ser Thr Ile Ala Phe Ile Ala Ala Gly Met Ile Gly Trp Gly Gln Tyr
20 25 30

AGC TTT TCT TTA GAT AGC GAT AGC GCT GCC AAA GTG GGA CAG ATT AAG 200
Ser Phe Ser Leu Asp Ser Asp Ser Ala Ala Lys Val Gly Gln Ile Lys
35 40 45 50

ATT TCT CAA GAA GAA TTA GCC CAA GAA TAC CGC CGC CTT AAA GAC GCA 248
Ile Ser Gln Glu Glu Leu Ala Gln Glu Tyr Arg Arg Leu Lys Asp Ala
55 60 65

TAT GCT GAG TCT ATC CCT GAT TTT AAA GAA CTC ACC AAA GAT CAA ATC 296
Tyr Ala Glu Ser Ile Pro Asp Phe Lys Glu Leu Thr Lys Asp Gln Ile
70 75 80

AAA GCC ATG CAT TTA GAA AAA AGC GCT TTA GAT TCG CTC ATC AAT CAA 344
Lys Ala Met His Leu Glu Lys Ser Ala Leu Asp Ser Leu Ile Asn Gln
85 90 95

GCC TTA TTG AGA AAT CTC GCT TTA GAT TTA GGG CTT GGC GCT ACA AAG 392
Ala Leu Leu Arg Asn Leu Ala Leu Asp Leu Gly Leu Gly Ala Thr Lys
100 105 110

CAA GAA GTG GCG AAA GAG ATC AGA AAA ACG AGC GTT TTC CAA AAA GAT 440
Gln Glu Val Ala Lys Glu Ile Arg Lys Thr Ser Val Phe Gln Lys Asp
115 120 125 130

GGC GTT TTT GAT GAA GAA TTG TAT AAA AAT ATC TTA AAG CAA AGC CAT 488
Gly Val Phe Asp Glu Glu Leu Tyr Lys Asn Ile Leu Lys Gln Ser His
135 140 145

TAC CGC CCC AAA CAT TTT GAA GAA AGC GTT GAA AGG CTT TTA ATC CTT 536
Tyr Arg Pro Lys His Phe Glu Glu Ser Val Glu Arg Leu Leu Ile Leu
150 155 160

CAA AAA ATC AGC ACT CTA TTC CCC AAA ACC ACT ACC CCT TTG GAG CAA 584
Gln Lys Ile Ser Thr Leu Phe Pro Lys Thr Thr Thr Pro Leu Glu Gln
165 170 175

TCC AGC CTA TCG CTT TGG GCA AAA TTG CAA GAC A7^ TTA GAC ATT CTT 632
Ser Ser Leu Ser Leu Trp Ala Lys Leu Gln Asp Lys Leu Asp Ile Leu
180 185 190

ATC CTA AAC CCT AGT GAT GTT AAA ATC TCT CTT AAT GAA GAA GAG ATG 680
Ile Leu Asn Pro Ser Asp Val Lys Ile Ser Leu Asn Glu Glu Glu Met
195 200 205 210

AAA AAA TAT TAC GAG TCC CAT AAA AAG GAT TTT AAA AAG CCC ACG AGC 728
Lys Lys Tyr Tyr Glu Ser His Lys Lys Asp Phe Lys Lys Pro Thr Ser
215 220 225

TTT AAA ACA CGC TCT TTA TAT TTT GAC GCT AGT TTG GAA AAA CCT GAT 776
Phe Lys Thr Arg Ser Leu Tyr Phe Asp Ala Ser Leu Glu Lys Pro Asp
230 235 240






CTCTAGGC 361

(2) INFORMATION FOR SEQ ID NO: 208:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 86 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 

Met Ser Gly Cys Ala Ser Ser Ser Pro Thr Gly Thr Leu Ile Thr Met
1 5 10 15

Val Thr Met Pro Val Ser Gly Asn Asp Ala Gln Tyr Ser Lys Glu Gly
20 25 30

Arg Ala Ser Cys Trp Ser Val Phe Ser Leu Val Ala Ala Gly Asn Cys
35 40 45

Ser Val Glu Lys Ala Ala Lys Ser Gly Gly Val Thr Lys Ile Lys Met
50 55 60

Val Ser Arg Glu Thr Asn Asn Phe Leu Gly Ile Val Gly Lys Tyr Thr
65 70 75 80

Thr Ile Val Gln Gly Asp
85
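
The residue count of a protein entry can be checked against its stated LENGTH by parsing the three-letter rows. The sketch below does this for SEQ ID NO:208 and also produces the one-letter form. It is an illustration under assumptions: the helper name `to_one_letter` is hypothetical, and the rows are entered with the residue names written as Ile and Gln.

```python
# Three-letter to one-letter residue codes for the 20 standard amino acids.
ONE_LETTER = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(rows):
    """Join rows of three-letter residues into a one-letter string.

    Raises KeyError on any token that is not a standard residue name,
    which makes misread residue names easy to locate.
    """
    tokens = " ".join(rows).split()
    return "".join(ONE_LETTER[token] for token in tokens)


# Residue rows of SEQ ID NO:208, without the position-number lines.
SEQ_208_ROWS = [
    "Met Ser Gly Cys Ala Ser Ser Ser Pro Thr Gly Thr Leu Ile Thr Met",
    "Val Thr Met Pro Val Ser Gly Asn Asp Ala Gln Tyr Ser Lys Glu Gly",
    "Arg Ala Ser Cys Trp Ser Val Phe Ser Leu Val Ala Ala Gly Asn Cys",
    "Ser Val Glu Lys Ala Ala Lys Ser Gly Gly Val Thr Lys Ile Lys Met",
    "Val Ser Arg Glu Thr Asn Asn Phe Leu Gly Ile Val Gly Lys Tyr Thr",
    "Thr Ile Val Gln Gly Asp",
]

protein = to_one_letter(SEQ_208_ROWS)
stated_length = 86  # from "(A) LENGTH: 86 amino acids"
print(len(protein), len(protein) == stated_length)
```

Because unknown tokens raise an error instead of being skipped, the same helper doubles as a quick scan for residue names that were corrupted during transcription.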

(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1511 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

TTTAAACCTG AATTTCATAT TTTTGATTTT TTAAAGGGAT TAGAGTTCTT ATG ATT 56
Met Ile
1

GAA TGG ATG CAA AAT CAT AGA AAA TAT TTA GTG GTT ACA ATA TGG ATA 104
Glu Trp Met Gln Asn His Arg Lys Tyr Leu Val Val Thr Ile Trp Ile
5 10 15 
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Gln Leu Asp Lys Ile Ala Pro Thr Ile Lys Ser Gly Met Arg Ile Gln
355 360 365

Lys Gly Tyr Val Leu Gly Arg Ile Asp Gln Arg Leu Gly Phe Glu Val
370 375 380

Thr Met Arg Glu Lys His Ile Asn Pro Leu Glu Leu Ile Ala Arg Asn
385 390 395 400

(2) INFORMATION FOR SEQ ID NO:207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 308 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207: 

AGGATTTAAG ATTGCGTTTG AAGTTAGCTA GCGTATGCTT GGGCGTTTTG ATG AGT 56
Met Ser
1

GGT TGT GCG TCT TCT TCG CCA ACT GGC ACT CTT ATC ACT ATG GTA ACG 104
Gly Cys Ala Ser Ser Ser Pro Thr Gly Thr Leu Ile Thr Met Val Thr
5 10 15

ATG CCA GTT TCT GGG AAT GAT GCA CAA TAC TCC AAA GAA GGG CGT GCG 152
Met Pro Val Ser Gly Asn Asp Ala Gln Tyr Ser Lys Glu Gly Arg Ala
20 25 30

AGT TGT TGG AGT GTT TTT AGT CTT GTG GCT GCC GGT AAT TGT TCG GTA 200
Ser Cys Trp Ser Val Phe Ser Leu Val Ala Ala Gly Asn Cys Ser Val
35 40 45 50

GAA AAA GCG GCT AAA AGT GGC GGT GTT ACC AAG ATT AAA ATG GTG AGC 248
Glu Lys Ala Ala Lys Ser Gly Gly Val Thr Lys Ile Lys Met Val Ser
55 60 65

CGT GAG ACA AAC AAC TTT TTA GGT ATT GTT GGC AAA TAC ACC ACG ATC 296
Arg Glu Thr Asn Asn Phe Leu Gly Ile Val Gly Lys Tyr Thr Thr Ile
70 75 80

GTT CAA GGC GAC TAGTTTTAAT ATTTAGAGAG CGTAGTTGAA TCGTCTTTCG TTCCA 353
Val Gln Gly Asp
85 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 



Met Tyr Lys Leu Gly Val Phe Leu Leu Ala Thr Leu Leu Ser Ala Asn
1 5 10 15

Thr Gln Lys Val Ser Asp Ile Ala Lys Asp Ile Gln His Lys Glu Thr
20 25 30

Leu Leu Lys Lys Thr His Glu Glu Lys Asn Gln Leu Asn Ser Arg Leu
35 40 45

Ser Ser Leu Gly Glu Ala Ile Arg Ser Lys Glu Leu Gln Lys Ala Glu
50 55 60

Met Glu Arg Gln Met Ile Ala Leu Lys Lys Ser Leu Glu Lys Asn Arg
65 70 75 80

Asn Glu Ser Leu Ala Gln Glu Lys Val Leu Thr Asn Tyr Arg Lys Ser
85 90 95

Leu Asp His Leu Gln Lys Lys Arg Ser Phe Leu Gln Lys Arg Val Phe
100 105 110

Asp Thr Leu Leu Gln Asp Phe Leu Phe Ser Gln Ala Leu Lys Gly Gln
115 120 125

Asn Leu Ala Ser Ser Asn Asp Val Val Leu Gln Val Ala Phe Glu Asn
130 135 140

Leu His Gln Ser Thr Leu Ser Lys Met Ser Gln Leu Ser Gln Glu Glu
145 150 155 160

Lys Glu Leu Asn Thr Gln Ala Leu Lys Val Lys Asn Ser Ile Gln Lys
165 170 175

Ile Ser Ser Ile Ile Asp Glu Gln Lys Thr Arg Glu Val Thr Leu Lys
180 185 190

Ser Leu Lys Thr Glu Gln Asp Lys Leu Ile Leu Ser Met Gln Lys Asp
195 200 205

Tyr Ala Ile Tyr Asn Gln Arg Leu Thr Leu Leu Glu Lys Glu Arg Gln
210 215 220

Asn Leu Asn Ala Leu Leu Lys Arg Leu Asn Ile Ile Lys Gln Asn Arg
225 230 235 240

Glu Asn Glu Glu Lys Val Ser Leu Lys Lys Ser Ser Gln Ala Leu Glu
245 250 255

Val Lys Gln Val Ala Ser Ser Tyr Gln Asn Ile Asn Thr Thr Ser Tyr
260 265 270

Asn Gly Pro Lys Thr Ile Ala Pro Leu Asn Asp Tyr Glu Val Val Gln
275 280 285

Lys Phe Gly Pro Tyr Ile Asp Pro Val Tyr Asn Leu Lys Ile Phe Ser
290 295 300

Glu Ser Ile Thr Leu Val Ser Lys Thr Pro Asn Ala Leu Val Arg Asn
305 310 315 320

Val Leu Asp Gly Lys Ile Val Phe Ala Lys Glu Ile Asn Met Leu Lys
325 330 335

Lys Val Val Ile Ile Glu His Lys Asn Gly Ile Arg Thr Ile Tyr Ser
340 345 350
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Lys Thr Glu Gln Asp Lys Leu Ile Leu Ser Met Gln Lys Asp Tyr Ala
195 200 205 210

ATC TAC AAC CAA CGC CTA ACC CTT TTA GAA AAA GAG CGC CAG AAT TTA 728
Ile Tyr Asn Gln Arg Leu Thr Leu Leu Glu Lys Glu Arg Gln Asn Leu
215 220 225

AAC GCT CTT TTA AAA CGC TTG AAT ATC ATC AAA CAA AAC AGA GAA AAT 776
Asn Ala Leu Leu Lys Arg Leu Asn Ile Ile Lys Gln Asn Arg Glu Asn
230 235 240

GAA GAA AAA GTC AGT TTG AAA AAA TCT TCT CAA GCC TTA GAA GTC AAA 824
Glu Glu Lys Val Ser Leu Lys Lys Ser Ser Gln Ala Leu Glu Val Lys
245 250 255

CAA GTG GCT AGC TCT TAT CAA AAT ATC AAC ACC ACG AGC TAT AAC GGA 872
Gln Val Ala Ser Ser Tyr Gln Asn Ile Asn Thr Thr Ser Tyr Asn Gly
260 265 270

CCA AAA ACG ATC GCT CCC TTG AAC GAT TAT GAA GTG GTG CAA AAA TTT 920
Pro Lys Thr Ile Ala Pro Leu Asn Asp Tyr Glu Val Val Gln Lys Phe
275 280 285 290

GGC CCC TAT ATT GAC CCG GTT TAT AAT TTA AAA ATT TTT AGC GAG TCT 968
Gly Pro Tyr Ile Asp Pro Val Tyr Asn Leu Lys Ile Phe Ser Glu Ser
295 300 305

ATT ACG CTC GTG TCA AAA ACC CCA AAC GCT TTG GTG CGT AAT GTT TTA 1016
Ile Thr Leu Val Ser Lys Thr Pro Asn Ala Leu Val Arg Asn Val Leu
310 315 320

GAC GGG AAA ATC GTG TTC GCT AAA GAA ATC AAC ATG CTT AAA AAA GTC 1064
Asp Gly Lys Ile Val Phe Ala Lys Glu Ile Asn Met Leu Lys Lys Val
325 330 335

GTT ATC ATT GAG CAT AAA AAT GGG ATC CGC ACG ATT TAT TCT CAA TTG 1112
Val Ile Ile Glu His Lys Asn Gly Ile Arg Thr Ile Tyr Ser Gln Leu
340 345 350

GAT AAA ATC GCT CCC ACC ATT AAA AGC GGC ATG CGG ATC CAA AAA GGC 1160
Asp Lys Ile Ala Pro Thr Ile Lys Ser Gly Met Arg Ile Gln Lys Gly
355 360 365 370

TAT GTT TTA GGG CGC ATT GAT CAA CGC TTG GGC TTT GAA GTT ACC ATG 1208
Tyr Val Leu Gly Arg Ile Asp Gln Arg Leu Gly Phe Glu Val Thr Met
375 380 385

AGA GAA AAA CAC ATC AAC CCC TTA GAA CTC ATC GCA CGC AAT TAAACAAAT 1259
Arg Glu Lys His Ile Asn Pro Leu Glu Leu Ile Ala Arg Asn
390 395 400

CGTTTTTATT GCCGATATTG GCTAAAGAAT TTATGCAAAC AAAT 1303

(2) INFORMATION FOR SEQ ID NO: 206:
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

CCTTGTGGTC TCATTTGTTT CTGTTTTACT TGTAGCTTGG AGGACTAGGC ATG TAT 56
Met Tyr
1

AAA TTA GGG GTG TTT TTG TTA GCC ACC TTA CTA TCA GCT AAC ACG CAA 104
Lys Leu Gly Val Phe Leu Leu Ala Thr Leu Leu Ser Ala Asn Thr Gln
5 10 15

AAA GTG AGC GAT ATT GCT AAA GAT ATC CAA CAT AAA GAA ACC CTT TTG 152
Lys Val Ser Asp Ile Ala Lys Asp Ile Gln His Lys Glu Thr Leu Leu
20 25 30

AAA AAA ACC CAT GAA GAA AAA AAC CAA CTA AAC AGC CGT TTG ACT TCT 200
Lys Lys Thr His Glu Glu Lys Asn Gln Leu Asn Ser Arg Leu Ser Ser
35 40 45 50

TTA GGC GAA GCG ATC CGC TCT AAA GAG CTT CAA AAG GCT GAG ATG GAG 248
Leu Gly Glu Ala Ile Arg Ser Lys Glu Leu Gln Lys Ala Glu Met Glu
55 60 65

CGC CAA ATG ATC GCT TTA AAA AAG AGT CTT GAA AAA AAT CGT AAC GAA 296
Arg Gln Met Ile Ala Leu Lys Lys Ser Leu Glu Lys Asn Arg Asn Glu
70 75 80

AGT TTG GCG CAA GAA AAA GTC CTA ACC AAC TAC CGC AAG TCT TTA GAT 344
Ser Leu Ala Gln Glu Lys Val Leu Thr Asn Tyr Arg Lys Ser Leu Asp
85 90 95

CAT TTG CAA AAA AAG CGA TCA TTT TTA CAA AAG AGG GTG TTT GAT ACG 392
His Leu Gln Lys Lys Arg Ser Phe Leu Gln Lys Arg Val Phe Asp Thr
100 105 110

CTT TTA CAG GAT TTC CTT TTT TCA CAA GCC CTA AAG GGG CAG AAT TTA 440
Leu Leu Gln Asp Phe Leu Phe Ser Gln Ala Leu Lys Gly Gln Asn Leu
115 120 125 130

GCC TCT TCT AAT GAT GTT GTT TTG CAA GTG GCG TTT GAA AAC TTG CAC 488
Ala Ser Ser Asn Asp Val Val Leu Gln Val Ala Phe Glu Asn Leu His
135 140 145

CAA AGC ACT CTG TCT AAA ATG TCG CAA CTG AGC CAA GAA GAA AAG GAA 536
Gln Ser Thr Leu Ser Lys Met Ser Gln Leu Ser Gln Glu Glu Lys Glu
150 155 160

CTC AAT ACG CAA GCT TTA AAA GTC AAA AAC AGC ATT CAA AAA ATC TCA 584
Leu Asn Thr Gln Ala Leu Lys Val Lys Asn Ser Ile Gln Lys Ile Ser
165 170 175

TCC ATC ATA GAT GAG CAA AAA ACT CGT GAA GTA ACC TTA AAA TCC TTG 632
Ser Ile Ile Asp Glu Gln Lys Thr Arg Glu Val Thr Leu Lys Ser Leu
180 185 190 

AAA ACC GAA CAA GAT AAG CTC ATT TTG AGC ATG CAA AAA GAT TAT GCG 680

Arg Ser Glu Ile Tyr Ile Ala Glu Ala Gly Ala Arg Asn Lys Gly Asp
100 105 110

Ile Lys Glu Ile Thr Cys Leu Ile Glu Pro His Leu Val Val Val Ala
115 120 125

Glu Val Gly Glu Gln His Leu Glu Tyr Phe Lys Thr Leu Glu Asn Ile
130 135 140

Cys Glu Thr Lys Ala Glu Leu Leu Asp Ser Lys Arg Leu Glu Lys Ala
145 150 155 160

Phe Cys Tyr Ser Val Glu Lys Ile Lys Pro Tyr Ala Pro Lys Asp Ser
165 170 175

Pro Leu Ile Asp Tyr Ser Ser Leu Val Lys Asn Ile Gln Ser Thr Leu
180 185 190

Lys Gly Thr Ser Phe Glu Met Leu Ile Gly Ser Val Trp Glu Arg Phe
195 200 205

Glu Thr Lys Val Leu Gly Glu Phe Ser Ala Tyr Asn Ile Ala Ser Ala
210 215 220

Ile Leu Ile Ala Lys His Leu Gly Leu Glu Thr Glu Arg Ile Lys Arg
225 230 235 240

Leu Val Leu Glu Leu Asn Pro Ile Ala His Arg Leu Gln Leu Leu Glu
245 250 255

Val Asn Gln Lys Ile Ile Ile Asp Asp Ser Phe Asn Gly Asn Leu Lys
260 265 270

Gly Met Leu Glu Gly Ile Arg Leu Ala Ser Leu His Lys Gly Arg Lys
275 280 285

Val Ile Val Thr Pro Gly Leu Val Glu Ser Asn Thr Glu Ser Asn Glu
290 295 300

Ala Leu Ala Gln Lys Ile Asp Gly Val Phe Asp Val Ala Ile Ile Thr
305 310 315 320

Gly Glu Leu Asn Ser Lys Thr Ile Ala Ser Gln Leu Lys Thr Pro Gln
325 330 335

Lys Ile Leu Leu Lys Asp Lys Ala Gln Leu Glu Asn Ile Leu Gln Ala
340 345 350

Thr Thr Ile Gln Gly Asp Leu Ile Leu Phe Ala Asn Asp Ala Pro Asn
355 360 365

Tyr Ile
370



(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1303 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1250
(D) OTHER INFORMATION:

CAA AAA ATC ATC ATA GAC GAT AGC TTT AAT GGG AAT TTA AAG GGC ATG 872
Gln Lys Ile Ile Ile Asp Asp Ser Phe Asn Gly Asn Leu Lys Gly Met
260 265 270

TTA GAG GGC ATT CGT TTA GCG AGT TTG CAC AAA GGG CGT AAA GTC ATT 920
Leu Glu Gly Ile Arg Leu Ala Ser Leu His Lys Gly Arg Lys Val Ile
275 280 285 290

GTA ACA CCG GGC TTA GTG GAA AGC AAT ACA GAA AGT AAT GAG GCT TTA 968
Val Thr Pro Gly Leu Val Glu Ser Asn Thr Glu Ser Asn Glu Ala Leu
295 300 305

GCG CAA AAA ATA GAC GGG GTT TTT GAT GTC GCT ATC ATC ACA GGG GAG 1016
Ala Gln Lys Ile Asp Gly Val Phe Asp Val Ala Ile Ile Thr Gly Glu
310 315 320

TTG AAT TCC AAA ACG ATT GCT TCA CAA TTG AAA ACC CCC CAA AAA ATC 1064
Leu Asn Ser Lys Thr Ile Ala Ser Gln Leu Lys Thr Pro Gln Lys Ile
325 330 335

TTA CTC AAG GAT AAG GCG CAA TTG GAA AAT ATC TTA CAA GCC ACC ACG 1112
Leu Leu Lys Asp Lys Ala Gln Leu Glu Asn Ile Leu Gln Ala Thr Thr
340 345 350

ATT CAA GGC GAT TTG ATT TTA TTC GCT AAT GAC GCC CCT AAT TAC ATT T 1161
Ile Gln Gly Asp Leu Ile Leu Phe Ala Asn Asp Ala Pro Asn Tyr Ile
355 360 365 370

AGGAAATGAA CATGCAACAT TTATACGCTC CTTGGCGCGA AAGTTATTTG AA 1213 

(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 

Met Gly Phe Xaa Leu Ala Leu Gly Tyr Leu Cys Leu Phe Ile Phe Val
1 5 10 15

Leu Ser Ala Ser Leu Ile Ser Glu Lys Ala Leu Ser Lys Gln Tyr Leu
20 25 30

Gln Thr Ala Lys Asp Lys Ile Thr Ser Leu Lys Asn Leu Lys Val Ile
35 40 45

Ala Ile Thr Gly Ser Phe Gly Lys Thr Ser Thr Lys Asn Phe Leu Leu
50 55 60

Gln Ile Leu Gln Thr Thr Phe Asn Ala His Ala Ser Pro Lys Ser Val
65 70 75 80

Asn Thr Leu Leu Gly Leu Ala Asn Asp Ile Asn Gln Asn Leu Asp Asp
85 90 95



-385- 






WO 98/43478



PCT/US98/06371 



GCT AAA GAT AAA ATC ACC TCT TTA AAG AAT TTA AAA GTC ATC GCC ATT 200 
Ala Lys Asp Lys Ile Thr Ser Leu Lys Asn Leu Lys Val Ile Ala Ile
35 40 45 50

ACC GGA AGC TTT GGG AAA ACC AGC ACC AAA AAT TTC TTG CTT CAA ATC 248
Thr Gly Ser Phe Gly Lys Thr Ser Thr Lys Asn Phe Leu Leu Gln Ile
55 60 65

TTA CAA ACC ACA TTC AAC GCG CAT GCA AGC CCC AAA AGC GTC AAT ACC 296
Leu Gln Thr Thr Phe Asn Ala His Ala Ser Pro Lys Ser Val Asn Thr
70 75 80

CTT TTA GGG CTT GCG AAT GAT ATT AAT CAG AAT TTA GAC GAT AGG AGT 344
Leu Leu Gly Leu Ala Asn Asp Ile Asn Gln Asn Leu Asp Asp Arg Ser
85 90 95

GAA ATC TAT ATC GCT GAA GCC GGG GCA AGG AAT AAG GGC GAT ATT AAA 392
Glu Ile Tyr Ile Ala Glu Ala Gly Ala Arg Asn Lys Gly Asp Ile Lys
100 105 110

GAA ATC ACC TGT CTC ATT GAA CCG CAC CTT GTT GTG GTT GCA GAA GTG 440
Glu Ile Thr Cys Leu Ile Glu Pro His Leu Val Val Val Ala Glu Val
115 120 125 130

GGC GAA CAG CAT TTA GAA TAC TTT AAA ACT TTA GAA AAT ATT TGC GAG 488
Gly Glu Gln His Leu Glu Tyr Phe Lys Thr Leu Glu Asn Ile Cys Glu
135 140 145

ACT AAA GCG GAA TTA TTG GAT TCC AAA CGC TTA GAA AAA GCC TTT TGT 536
Thr Lys Ala Glu Leu Leu Asp Ser Lys Arg Leu Glu Lys Ala Phe Cys
150 155 160

TAC TCG GTG GAA AAG ATC AAG CCC TAT GCC CCT AAA GAT AGC CCT TTA 584
Tyr Ser Val Glu Lys Ile Lys Pro Tyr Ala Pro Lys Asp Ser Pro Leu
165 170 175

ATA GAC TAT TCT AGC CTG GTT AAA AAC ATC CAA TCC ACT TTA AAA GGC 632
Ile Asp Tyr Ser Ser Leu Val Lys Asn Ile Gln Ser Thr Leu Lys Gly
180 185 190

ACT TCT TTT GAA ATG CTT ATA GGT AGC GTT TGG GAA AGA TTT GAA ACA 680
Thr Ser Phe Glu Met Leu Ile Gly Ser Val Trp Glu Arg Phe Glu Thr
195 200 205 210

AAG GTT CTA GGG GAG TTT AGC GCT TAT AAT ATC GCT TCA GCC ATT TTA 728
Lys Val Leu Gly Glu Phe Ser Ala Tyr Asn Ile Ala Ser Ala Ile Leu
215 220 225

ATC GCT AAG CAT TTA GGC TTA GAG ACC GAA AGG ATC AAA CGG CTT GTT 776
Ile Ala Lys His Leu Gly Leu Glu Thr Glu Arg Ile Lys Arg Leu Val
230 235 240

TTA GAA CTC AAC CCT ATT GCT CAT CGT TTG CAA CTT TTG GAA GTG AAT 824
Leu Glu Leu Asn Pro Ile Ala His Arg Leu Gln Leu Leu Glu Val Asn
245 250 255

Asp Ser Val Arg Glu Ile Asp Glu Lys Ala Asn Arg Leu Lys Gly Ser
130 135 140

Glu Lys Ser Tyr Thr Phe His Gly Arg Asp Val Tyr Ala Tyr Thr Gly
145 150 155 160

Ala Arg Leu Ala Ser Gly Ala Ile Thr Phe Glu Gln Val Gly Pro Glu
165 170 175

Leu Pro Pro Lys Val Val Glu Ile Pro Tyr Gln Lys Ala Lys Ala Thr
180 185 190

Lys Gly Glu Val Lys Gly Asn Ile Pro Ile Leu Asp Ile Gln Tyr Gly
195 200 205

Asn Val Trp Ser Asn Ile Ser Asp Lys Leu Leu Asn Gln Ala Lys Ile
210 215 220

Lys Leu Asn Asp Thr Leu Cys Val Thr Ile Phe Lys Gly Ser Lys Lys
225 230 235 240

Gln Tyr Glu Gly Lys Met Pro Tyr Val Ala Ser Phe Gly Asp Val Pro
245 250 255

Glu Gly Gln Pro Leu Val Tyr Leu Asn Ser Leu Leu Asn Val Ser Val
260 265 270

Ala Leu Asn Arg Asp Asn Phe Ala Gln Lys Tyr Gln Ile Lys Ser Gly
275 280 285

Ala Asp Trp Asn Ile Asp Ile Lys Lys Cys Ala Lys
290 295 300

(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...1160
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:203: 

ATTATTTTTA ATCTTGCATG AAATCTTAAA TATAGAATTA GTCCCTTTGG ATG GGA 56
Met Gly
1

TTT TCN CTC GCG CTA GGC TAT TTG TGT TTG TTT ATA TTC GTT TTA AGC 104
Phe Xaa Leu Ala Leu Gly Tyr Leu Cys Leu Phe Ile Phe Val Leu Ser
5 10 15

GCT TCT TTA ATC TCT GAA AAA GCC TTA TCC AAG CAG TAT TTG CAA ACC 152
Ala Ser Leu Ile Ser Glu Lys Ala Leu Ser Lys Gln Tyr Leu Gln Thr
20 25 30

TGG AGC AAC ATC AGC GAT AAA TTA CTC AAT
Trp Ser Asn Ile Ser Asp Lys Leu Leu Asn
215 220 

AAT GAC ACG CTG TGT GTA ACC ATT TTT AAA
Asn Asp Thr Leu Cys Val Thr Ile Phe Lys
230 235 

GAA GGG AAA ATG CCG TAT GTC GCA AGC TTT 
Glu Gly Lys Met Pro Tyr Val Ala Ser Phe 
245 250 

CAG CCG TTA GTT TAT TTA AAC AGC TTG TTA 
Gln Pro Leu Val Tyr Leu Asn Ser Leu Leu
260 265

AAT AGG GAT AAT TTC GCG CAA AAA TAT CAA
Asn Arg Asp Asn Phe Ala Gln Lys Tyr Gln
275 280

TGG AAT ATT GAT ATA AAG AAG TGC GCT AAG
Trp Asn Ile Asp Ile Lys Lys Cys Ala Lys
295 300 

GGGGCGTGAA ACGCCCTAAC CGCTAAAGAT 1003 

(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:



Met Arg Lys Thr Ile Ser Ala Leu Phe Leu Ser Ala Cys Ile Gly Leu
1 5 10 15

Ser Ser Val Tyr Ala Asp Asn Ala Leu Ile Leu Gln Thr Asp Phe Ser
20 25 30

Leu Lys Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val
35 40 45

Asp Ser His Leu Lys Ile Phe Asp Leu Thr His Glu Ile Pro Pro Tyr
50 55 60

Asn Ile Trp Glu Gly Ala Tyr Arg Leu Tyr Gln Thr Ala Ser Tyr Trp
65 70 75 80

Pro Lys Gly Ser Val Phe Val Ser Val Val Asp Pro Gly Val Gly Thr
85 90 95

Lys Arg Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gln Tyr Phe Val
100 105 110

Ser Pro Asp Asn Gly Thr Leu Thr Leu Val Ala Gln Thr Leu Gly Ile
115 120 125

CAA GCA AAA ATC AAA CTC 728
Gln Ala Lys Ile Lys Leu
225

GGT TCT AAG AAA CAA TAC 776
Gly Ser Lys Lys Gln Tyr
240

GGC GAT GTG CCA GAA GGC 824
Gly Asp Val Pro Glu Gly
255

AAT GTT TCC GTG GCG CTG 872
Asn Val Ser Val Ala Leu
270

ATC AAA TCC GGT GCT GAC 920
Ile Lys Ser Gly Ala Asp
285 290

TAAAGCGCTG TTTAGAAAAT TAA 973






ATCATGAGAA AACGCTTCAC TCCACTTTTG TTATTCAGGA AATAATACAG ATG AGA 56 

Met Arg 
1 

AAA ACG ATT TCA GCG TTG TTT TTA TCA GCG TGC ATA GGG TTA TCG TCT 104 
Lys Thr Ile Ser Ala Leu Phe Leu Ser Ala Cys Ile Gly Leu Ser Ser
5 10 15

GTT TAT GCA GAT AAC GCT TTG ATT TTG CAA ACC GAT TTT AGT CTA AAA 152
Val Tyr Ala Asp Asn Ala Leu Ile Leu Gln Thr Asp Phe Ser Leu Lys
20 25 30

GAT GGG GCC GTC TCG GCG ATG AAA GGC GTC GCT TTC AGC GTT GAT TCC 200
Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val Asp Ser
35 40 45 50

CAT CTT AAA ATC TTT GAT TTA ACG CAC GAA ATC CCC CCG TAT AAC ATC 248
His Leu Lys Ile Phe Asp Leu Thr His Glu Ile Pro Pro Tyr Asn Ile
55 60 65

TGG GAA GGC GCT TAC CGC TTG TAT CAG ACC GCC AGT TAT TGG CCA AAA 296
Trp Glu Gly Ala Tyr Arg Leu Tyr Gln Thr Ala Ser Tyr Trp Pro Lys
70 75 80

GGT TCG GTA TTT GTG AGC GTA GTT GAT CCG GGC GTA GGC ACT AAG CGT 344
Gly Ser Val Phe Val Ser Val Val Asp Pro Gly Val Gly Thr Lys Arg
85 90 95

AAA TCG GTG GTA CTA AAA ACT AAA AAC GGC CAG TAT TTC GTC TCG CCG 392
Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gln Tyr Phe Val Ser Pro
100 105 110

GAT AAC GGC ACG CTG ACT TTG GTG GCA CAA ACT TTG GGG ATT GAT AGC 440
Asp Asn Gly Thr Leu Thr Leu Val Ala Gln Thr Leu Gly Ile Asp Ser
115 120 125 130

GTG CGT GAA ATT GAT GAA AAA GCT AAC CGC TTG AAA GGT TCT GAA AAA 488
Val Arg Glu Ile Asp Glu Lys Ala Asn Arg Leu Lys Gly Ser Glu Lys
135 140 145

TCC TAT ACT TTC CAT GGT CGT GAT GTG TAT GCT TAC ACC GGT GCA CGC 536
Ser Tyr Thr Phe His Gly Arg Asp Val Tyr Ala Tyr Thr Gly Ala Arg
150 155 160

TTG GCT TCT GGG GCG ATC ACA TTC GAG CAG GTC GGG CCA GAG CTT CCC 584
Leu Ala Ser Gly Ala Ile Thr Phe Glu Gln Val Gly Pro Glu Leu Pro
165 170 175

CCA AAA GTC GTT GAA ATT CCT TAC CAA AAA GCG AAA GCC ACA AAA GGG 632
Pro Lys Val Val Glu Ile Pro Tyr Gln Lys Ala Lys Ala Thr Lys Gly
180 185 190

GAA GTG AAA GGT AAT ATC CCG ATT TTG GAT ATT CAA TAT GGC AAT GTT 680
Glu Val Lys Gly Asn Ile Pro Ile Leu Asp Ile Gln Tyr Gly Asn Val
195 200 205 210 
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GATGATATTA ACATGCGTAA AAGCCTGGAG CTT 574 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

Met Arg Phe Phe Ser Phe Phe Tyr Phe Leu Phe Tyr Phe Leu Gly Val
1 5 10 15

Ser Leu Gln Ala Leu Ser Pro Leu Glu Asp Gln Glu Phe Leu Ile Ser
20 25 30

Tyr Arg Leu Lys Ile Val Asp Ser Arg Val Met Gly Glu Glu Tyr Ser
35 40 45

Val Ser Lys Pro Ile Val Ser Arg Ile Lys Thr Ala Pro Tyr Val Leu
50 55 60

Asp Tyr His Cys Ser Ile Ile Thr Arg Asn Leu Pro Asn Leu Lys Asn
65 70 75 80

Pro Leu Leu Pro Ile Lys Leu Glu Arg Phe Leu Leu Glu Ile Ala Leu
85 90 95

Lys Lys Glu Lys Glu Arg Val Ile Asp Cys Ile Leu Lys Ser Gln Val
100 105 110

Ala Ile Thr His Tyr Asp His Ser Tyr Lys Asn Gly Thr Thr Thr Thr
115 120 125

Ser Ile Leu Ala Leu Lys Ala Leu Ser Val Arg Ala Ser Leu Val Gly
130 135 140

Asp Ala Leu Phe Leu Asp Ile Phe Arg Lys Glu Glu Glu
145 150 155

(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 950 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 521 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

GAAGAGCCAG AAAATTTAGA AACCTCTTCG GCACAAAATT TGTTTGAGTG ATG CGT 56 

Met Arg 
1 

TTC TTT TCA TTC TTT TAT TTT TTA TTT TAT TTT TTA GGG GTT TCT TTG 104 
Phe Phe Ser Phe Phe Tyr Phe Leu Phe Tyr Phe Leu Gly Val Ser Leu 
5 10 15 

CAA GCT CTC AGC CCC CTA GAA GAT CAA GAA TTT TTA ATT TCG TAC CGC 152
Gln Ala Leu Ser Pro Leu Glu Asp Gln Glu Phe Leu Ile Ser Tyr Arg
20 25 30

TTG AAA ATC GTT GAT TCT AGA GTG ATG GGC GAA GAG TAT TCT GTC TCT 200
Leu Lys Ile Val Asp Ser Arg Val Met Gly Glu Glu Tyr Ser Val Ser
35 40 45 50

AAA CCT ATC GTT AGC CGC ATT AAA ACA GCC CCC TAT GTT TTA GAC TAT 248
Lys Pro Ile Val Ser Arg Ile Lys Thr Ala Pro Tyr Val Leu Asp Tyr
55 60 65

CAT TGC TCC ATC ATC ACT CGT AAC TTA CCC AAT TTG AAA AAC CCC TTG 296
His Cys Ser Ile Ile Thr Arg Asn Leu Pro Asn Leu Lys Asn Pro Leu
70 75 80

CTC CCA ATA AAG TTA GAA CGC TTC CTT TTA GAA ATC GCG TTA AAA AAA 344
Leu Pro Ile Lys Leu Glu Arg Phe Leu Leu Glu Ile Ala Leu Lys Lys
85 90 95

GAA AAA GAG CGG GTC ATA GAC TGC ATT TTA AAA AGC CAG GTC GCT ATC 392
Glu Lys Glu Arg Val Ile Asp Cys Ile Leu Lys Ser Gln Val Ala Ile
100 105 110

ACG CAT TAT GAT CAT AGC TAT AAA AAC GGC ACC ACT ACC ACA AGC ATT 440
Thr His Tyr Asp His Ser Tyr Lys Asn Gly Thr Thr Thr Thr Ser Ile
115 120 125 130

CTT GCC CTC AAA GCC TTA AGC GTT AGA GCG AGT TTA GTG GGA GAT GCG 488
Leu Ala Leu Lys Ala Leu Ser Val Arg Ala Ser Leu Val Gly Asp Ala
135 140 145

CTG TTT TTA GAT ATT TTT AGA AAG GAA GAA GAA TGAAAATCGC CATTGTAGAA 541
Leu Phe Leu Asp Ile Phe Arg Lys Glu Glu Glu
150 155

Ala Lys Leu Phe Ala Gln Val Leu Asn Glu Gly Thr Lys Glu Leu Gly
65 70 75 80

Ala Val Gly Phe Ala Gln Leu Leu Glu Gln Lys Ala Ile Ser Leu Asn
85 90 95

Val Asp Thr Ser Thr Glu Asp Leu Gln Ile Thr Leu Glu Phe Leu Lys
100 105 110

Glu Tyr Glu Asp Glu Ala Ile Thr Arg Leu Lys Glu Leu Leu Lys Ser
115 120 125

Pro Asn Phe Thr Gln Asn Ala Leu Glu Lys Val Lys Thr Gln Met Leu
130 135 140

Ala Ala Leu Leu Gln Lys Glu Ser Asp Phe Asp Tyr Leu Ala Lys Leu
145 150 155 160

Thr Leu Lys Gln Glu Leu Phe Ala Asn Thr Pro Leu Ala Asn Ala Ala
165 170 175

Leu Gly Thr Lys Glu Ser Ile Gln Lys Ile Lys Leu Asp Asp Leu Lys
180 185 190

Gln Gln Phe Ala Lys Val Phe Glu Leu Asn Lys Leu Val Val Val Leu
195 200 205

Gly Gly Asp Leu Lys Ile Asp Gln Thr Leu Lys Arg Leu Asn Asn Ala
210 215 220

Leu Asn Phe Leu Pro Gln Gly Lys Ala Tyr Glu Glu Pro Tyr Phe Glu
225 230 235 240

Thr Ser Asp Lys Lys Ser Glu Lys Val Leu Tyr Lys Asp Thr Glu Gln
245 250 255

Ala Phe Val Tyr Phe Gly Ala Pro Phe Lys Ile Lys Asp Leu Lys Gln
260 265 270

Asp Leu Ala Lys Ser Lys Val Met Met Phe Val Leu Gly Gly Gly Phe
275 280 285

Gly Ser Arg Leu Met Glu Lys Ile Arg Val Gln Glu Gly Leu Ala Tyr
290 295 300

Ser Val Tyr Ile Arg Ser Asn Phe Ser Lys Val Ala His Phe Ala Ser
305 310 315 320

Gly Tyr Leu Gln Thr Lys Leu Ser Thr Gln Thr Lys Ser Val Ala Leu
325 330 335

Val Lys Lys Ile Val Lys Glu Phe Ile Glu Lys Gly Met Thr Gln Gln
340 345 350

Glu Leu Asp Asp Ala Lys Lys Phe Leu Leu Gly Ser Glu Pro Leu Arg
355 360 365

Asn Glu Thr Ile Ser Ser Arg Leu Asn Thr Thr Tyr Asn Tyr Phe Tyr
370 375 380

Leu Gly Leu Pro Leu Asn Phe Asn Gln Thr Leu Leu Asn Gln Ile Gln
385 390 395 400

Lys Met Ser Leu Lys Glu Ile Asn Asp Phe Ile Lys Ala His Thr Glu
405 410 415

Ile Asn Asp Leu Thr Phe Ala Ile Val Ser Asn Lys Lys Lys Asp Lys
420 425 430



(2) INFORMATION FOR SEQ ID NO: 199: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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TAT ATC CGC TCC AAT TTT TCT AAA GTG GCG CAT TTT GCG AGC GGG TAT 1016 
Tyr Ile Arg Ser Asn Phe Ser Lys Val Ala His Phe Ala Ser Gly Tyr
310 315 320

TTG CAA ACC AAG CTC AGC ACT CAA ACT AAA AGC GTT GCC TTA GTT AAA 1064
Leu Gln Thr Lys Leu Ser Thr Gln Thr Lys Ser Val Ala Leu Val Lys
325 330 335

AAA ATC GTT AAG GAA TTT ATA GAA AAA GGC ATG ACG CAA CAA GAA TTA 1112
Lys Ile Val Lys Glu Phe Ile Glu Lys Gly Met Thr Gln Gln Glu Leu
340 345 350

GAC GAC GCT AAA AAG TTT TTA CTA GGC TCT GAG CCT TTA AGG AAT GAA 1160
Asp Asp Ala Lys Lys Phe Leu Leu Gly Ser Glu Pro Leu Arg Asn Glu
355 360 365 370

ACG ATC TCT AGC CGC TTG AAC ACC ACT TAC AAT TAT TTT TAT TTA GGT 1208
Thr Ile Ser Ser Arg Leu Asn Thr Thr Tyr Asn Tyr Phe Tyr Leu Gly
375 380 385

TTG CCT TTA AAT TTT AAC CAA ACG CTG CTC AAT CAA ATC CAA AAA ATG 1256
Leu Pro Leu Asn Phe Asn Gln Thr Leu Leu Asn Gln Ile Gln Lys Met
390 395 400

AGT TTG AAA GAA ATC AAT GAT TTC ATT AAA GCC CAC ACC GAA ATC AAC 1304
Ser Leu Lys Glu Ile Asn Asp Phe Ile Lys Ala His Thr Glu Ile Asn
405 410 415

GAC TTG ACT TTT GCT ATT GTG AGC AAT AAA AAG AAG GAC AAA TGATGCCAT 1355
Asp Leu Thr Phe Ala Ile Val Ser Asn Lys Lys Lys Asp Lys
420 425 430

TTGAAGCTGT AATCGGGCTA GAAGTCCATG TCCAACTCAA CACC 1399

(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Met Lys Lys Phe Leu Ile Thr Leu Leu Leu Gly Val Phe Met Gly Leu
1 5 10 15

Gln Ala Ser Ala Leu Thr His Gln Glu Ile Asn Gln Ala Lys Val Pro
20 25 30

Val Ile Tyr Glu Glu Asn His Leu Leu Pro Met Gly Phe Ile His Leu
35 40 45

Ala Phe Arg Gly Gly Gly Ser Leu Ser Asp Lys Asn Gln Leu Gly Leu
50 55 60
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GGG TTT GCG CAA CTT TTA GAG CAA AAA GCG ATC AGT TTG AAT GTG GAT 344 
Gly Phe Ala Gln Leu Leu Glu Gln Lys Ala Ile Ser Leu Asn Val Asp
85 90 95

ACC AGC ACA GAA GAT TTG CAA ATC ACT TTA GAA TTT TTA AAA GAA TAC 392
Thr Ser Thr Glu Asp Leu Gln Ile Thr Leu Glu Phe Leu Lys Glu Tyr
100 105 110

GAA GAT GAA GCC ATT ACG CGC TTA AAA GAG CTT TTA AAA TCC CCT AAT 440
Glu Asp Glu Ala Ile Thr Arg Leu Lys Glu Leu Leu Lys Ser Pro Asn
115 120 125 130

TTC ACG CAA AAC GCT TTA GAA AAA GTC AAA ACC CAA ATG TTA GCC GCA 488
Phe Thr Gln Asn Ala Leu Glu Lys Val Lys Thr Gln Met Leu Ala Ala
135 140 145

CTT TTA CAA AAA GAA AGC GAT TTT GAC TAT TTG GCT AAA TTG ACT TTA 536
Leu Leu Gln Lys Glu Ser Asp Phe Asp Tyr Leu Ala Lys Leu Thr Leu
150 155 160

AAG CAA GAG CTT TTT GCT AAC ACC CCT TTA GCT AAC GCA GCC TTA GGC 584
Lys Gln Glu Leu Phe Ala Asn Thr Pro Leu Ala Asn Ala Ala Leu Gly
165 170 175

ACT AAA GAG AGC ATT CAA AAA ATC AAG CTA GAC GAT TTG AAA CAG CAA 632
Thr Lys Glu Ser Ile Gln Lys Ile Lys Leu Asp Asp Leu Lys Gln Gln
180 185 190

TTT GCT AAG GTC TTT GAA CTC AAT AAG CTC GTG GTG GTG CTT GGG GGC 680
Phe Ala Lys Val Phe Glu Leu Asn Lys Leu Val Val Val Leu Gly Gly
195 200 205 210

GAT TTG AAA ATC GAT CAA ACC CTT AAG CGT TTG AAT AAC GCC CTT AAT 728
Asp Leu Lys Ile Asp Gln Thr Leu Lys Arg Leu Asn Asn Ala Leu Asn
215 220 225

TTC TTG CCA CAA GGT AAA GCG TAT GAA GAG CCT TAT TTT GAA ACG AGC 776
Phe Leu Pro Gln Gly Lys Ala Tyr Glu Glu Pro Tyr Phe Glu Thr Ser
230 235 240

GAT AAA AAA AGC GAA AAA GTC CTC TAT AAA GAC ACT GAG CAG GCT TTC 824
Asp Lys Lys Ser Glu Lys Val Leu Tyr Lys Asp Thr Glu Gln Ala Phe
245 250 255

GTG TAT TTT GGT GCG CCC TTT AAA ATC AAG GAT TTA AAA CAG GAT TTA 872
Val Tyr Phe Gly Ala Pro Phe Lys Ile Lys Asp Leu Lys Gln Asp Leu
260 265 270

GCG AAA TCT AAA GTC ATG ATG TTT GTG CTT GGT GGG GGG TTT GGC TCT 920
Ala Lys Ser Lys Val Met Met Phe Val Leu Gly Gly Gly Phe Gly Ser
275 280 285 290

CGT TTA ATG GAA AAA ATC AGG GTT CAA GAG GGA TTA GCT TAT AGC GTG 968
Arg Leu Met Glu Lys Ile Arg Val Gln Glu Gly Leu Ala Tyr Ser Val
295 300 305

290 295 300

Gly Glu Ala Lys His Ala Asp Met Ala Ile Ala Phe Gly Asn Arg Ser
305 310 315 320

Gly Leu Ile Ile Lys Glu Gly Lys Val Ile His Lys Leu Ala Glu Lys
325 330 335

Asp Leu Phe Glu Thr Phe Val Ile Glu Val Glu Asn Leu Ala Lys Glu
340 345 350

Arg Glu Lys Ser Leu Lys Asp
355

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1346 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

GCCTATGAAA TCTTAAAGCG TTATCCGGCT AAAGCAAAGG TATAAATAAC ATG AAA 56 

Met Lys 
1 

AAA TTT TTA ATC ACT TTA TTA TTA GGA GTT TTT ATG GGG TTA CAA GCG 104 
Lys Phe Leu Ile Thr Leu Leu Leu Gly Val Phe Met Gly Leu Gln Ala
5 10 15

AGC GCT TTG ACA CAC CAA GAA ATC AAT CAA GCT AAA GTC CCT GTG ATT 152
Ser Ala Leu Thr His Gln Glu Ile Asn Gln Ala Lys Val Pro Val Ile
20 25 30

TAT GAA GAA AAC CAT TTG TTG CCT ATG GGG TTT ATC CAT TTA GCC TTT 200
Tyr Glu Glu Asn His Leu Leu Pro Met Gly Phe Ile His Leu Ala Phe
35 40 45 50

AGG GGG GGT GGG AGC TTA AGC GAT AAA AAC CAG TTG GGT TTG GCG AAA 248
Arg Gly Gly Gly Ser Leu Ser Asp Lys Asn Gln Leu Gly Leu Ala Lys
55 60 65

TTA TTC GCG CAA GTT TTA AAC GAA GGC ACT AAA GAG CTT GGT GCG GTG 296
Leu Phe Ala Gln Val Leu Asn Glu Gly Thr Lys Glu Leu Gly Ala Val
70 75 80

Lys Ser Leu Lys Asp
355

CAGCAATCTA AC 1180



(2) INFORMATION FOR SEQ ID NO:196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Met Leu Glu Asn Arg Val Lys Thr Lys Gln Ile Phe Ile Gly Gly Val
1 5 10 15

Ala Ile Gly Gly Asp Ala Pro Ile Ser Thr Gln Ser Met Thr Phe Ser
20 25 30

Lys Thr Ala Asp Ile Glu Ser Thr Lys Asn Gln Ile Asp Arg Leu Lys
35 40 45

Leu Ala Gly Ala Asp Leu Val Arg Val Ala Val Ser Asn Glu Lys Asp
50 55 60

Ala Leu Ala Leu Lys Glu Leu Lys Lys Val Ser Pro Leu Pro Leu Ile
65 70 75 80

Ala Asp Ile His Phe His Tyr Lys Phe Ala Leu Ile Ala Ala Gln Ser
85 90 95

Val Asp Ala Ile Arg Ile Asn Pro Gly Asn Ile Gly Ser Lys Glu Lys
100 105 110

Ile Lys Ala Val Val Asp Ala Cys Lys Glu Lys Asn Ile Pro Ile Arg
115 120 125

Ile Gly Val Asn Ala Gly Ser Leu Glu Lys Gln Phe Asp Gln Lys Tyr
130 135 140

Gly Pro Thr Pro Lys Gly Met Val Glu Ser Ala Leu Tyr Asn Ala Lys
145 150 155 160

Leu Leu Glu Asp Leu Asp Phe Thr Asn Phe Lys Ile Ser Leu Lys Ala
165 170 175

Ser Asp Val Ile Arg Thr Ile Glu Ala Tyr Arg Met Leu Arg Pro Leu
180 185 190

Val Ile Tyr Pro Phe His Leu Gly Val Thr Glu Ala Gly Asn Leu Phe
195 200 205

Ser Ser Ser Ile Lys Ser Ala Met Ala Leu Gly Gly Leu Leu Met Glu
210 215 220

Gly Ile Gly Asp Thr Met Arg Val Ser Ile Thr Gly Glu Leu Glu Asn
225 230 235 240

Glu Ile Lys Val Ala Arg Ala Ile Leu Arg His Ser Gly Arg Leu Lys
245 250 255

Glu Gly Ile Asn Trp Ile Ser Cys Pro Thr Cys Gly Arg Ile Glu Ala
260 265 270

Asn Leu Val Asp Met Ala Ile Lys Val Glu Lys Arg Leu Ser His Ile
275 280 285

Lys Thr Pro Leu Asp Ile Ser Val Met Gly Cys Val Val Asn Ala Leu
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135 140 145 

ACC CCA AAA GGC ATG GTA GAA AGC GCT TTG TAT AAC GCC AAA CTT TTA 536
Thr Pro Lys Gly Met Val Glu Ser Ala Leu Tyr Asn Ala Lys Leu Leu
150 155 160

GAA GAT TTG GAT TTT ACC AAT TTT AAG ATT TCT TTA AAA GCG AGC GAT 584
Glu Asp Leu Asp Phe Thr Asn Phe Lys Ile Ser Leu Lys Ala Ser Asp
165 170 175

GTG ATT CGC ACC ATA GAA GCT TAC AGG ATG CTT CGC CCT CTT GTG ATC 632
Val Ile Arg Thr Ile Glu Ala Tyr Arg Met Leu Arg Pro Leu Val Ile
180 185 190

TAT CCT TTC CAT TTG GGG GTT ACG GAG GCG GGG AAT CTT TTT AGC TCC 680
Tyr Pro Phe His Leu Gly Val Thr Glu Ala Gly Asn Leu Phe Ser Ser
195 200 205 210

AGT ATC AAA TCC GCT ATG GCT TTA GGG GGG CTT TTA ATG GAG GGC ATT 728
Ser Ile Lys Ser Ala Met Ala Leu Gly Gly Leu Leu Met Glu Gly Ile
215 220 225

GGG GAT ACG ATG CGC GTA TCC ATC ACA GGG GAA TTA GAA AAT GAA ATC 776
Gly Asp Thr Met Arg Val Ser Ile Thr Gly Glu Leu Glu Asn Glu Ile
230 235 240

AAA GTG GCC AGA GCA ATT TTA CGC CAT AGC GGG CGG TTG AAA GAA GGG 824
Lys Val Ala Arg Ala Ile Leu Arg His Ser Gly Arg Leu Lys Glu Gly
245 250 255

ATT AAT TGG ATT TCT TGC CCC ACT TGC GGG CGC ATT GAA GCC AAT TTA 872
Ile Asn Trp Ile Ser Cys Pro Thr Cys Gly Arg Ile Glu Ala Asn Leu
260 265 270

GTG GAT ATG GCG ATC AAG GTA GAA AAA CGC TTA AGC CAC ATC AAA ACC 920
Val Asp Met Ala Ile Lys Val Glu Lys Arg Leu Ser His Ile Lys Thr
275 280 285 290

CCT TTA GAC ATT AGC GTG ATG GGT TGC GTG GTG AAT GCT TTG GGT GAA 968
Pro Leu Asp Ile Ser Val Met Gly Cys Val Val Asn Ala Leu Gly Glu
295 300 305

GCC AAG CAT GCA GAC ATG GCG ATC GCT TTT GGG AAT CGC AGC GGT TTG 1016
Ala Lys His Ala Asp Met Ala Ile Ala Phe Gly Asn Arg Ser Gly Leu
310 315 320

ATC ATT AAA GAG GGT AAA GTC ATT CAC AAA CTG GCT GAA AAG GAT TTA 1064
Ile Ile Lys Glu Gly Lys Val Ile His Lys Leu Ala Glu Lys Asp Leu
325 330 335

TTT GAA ACT TTT GTG ATA GAA GTG GAA AAT TTA GCT AAA GAA AGA GAA 1112
Phe Glu Thr Phe Val Ile Glu Val Glu Asn Leu Ala Lys Glu Arg Glu
340 345 350

AAA AGT TTA AAG GAT TAGGCATGAT CAATAAGTTT AAAAATTTTG TGAGCAACTA C 1168
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1180 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1127 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CCAAGAAAGA GTATAATAGC GCATAAGAAT TTAACTGATG AAGAGGTTTA ATG CTA 56
Met Leu
1

GAA AAT AGA GTT AAG ACC AAG CAA ATT TTT ATC GGT GGC GTG GCC ATA 104
Glu Asn Arg Val Lys Thr Lys Gln Ile Phe Ile Gly Gly Val Ala Ile
5 10 15

GGG GGT GAT GCT CCC ATA AGC ACG CAA AGC ATG ACC TTT AGC AAA ACC 152
Gly Gly Asp Ala Pro Ile Ser Thr Gln Ser Met Thr Phe Ser Lys Thr
20 25 30

GCT GAT ATT GAA AGC ACT AAA AAT CAA ATT GAC AGA CTC AAA CTC GCC 200
Ala Asp Ile Glu Ser Thr Lys Asn Gln Ile Asp Arg Leu Lys Leu Ala
35 40 45 50

GGG GCC GAT TTA GTG AGG GTG GCG GTG AGT AAT GAA AAG GAC GCT CTA 248
Gly Ala Asp Leu Val Arg Val Ala Val Ser Asn Glu Lys Asp Ala Leu
55 60 65

GCC TTA AAA GAA TTG AAA AAA GTG TCC CCT TTG CCT TTA ATC GCT GAT 296
Ala Leu Lys Glu Leu Lys Lys Val Ser Pro Leu Pro Leu Ile Ala Asp
70 75 80

ATT CAT TTC CAT TAT AAA TTC GCT CTC ATT GCC GCT CAA AGC GTG GAT 344
Ile His Phe His Tyr Lys Phe Ala Leu Ile Ala Ala Gln Ser Val Asp
85 90 95

GCG ATC AGG ATT AAC CCC GGA AAC ATC GGC TCT AAA GAG AAG ATC AAA 392
Ala Ile Arg Ile Asn Pro Gly Asn Ile Gly Ser Lys Glu Lys Ile Lys
100 105 110

GCG GTG GTT GAT GCT TGT AAA GAA AAA AAC ATT CCT ATA AGA ATT GGC 440
Ala Val Val Asp Ala Cys Lys Glu Lys Asn Ile Pro Ile Arg Ile Gly
115 120 125 130

GTG AAT GCT GGG AGT TTA GAA AAG CAG TTT GAT CAA AAA TAC GGA CCC 488
Val Asn Ala Gly Ser Leu Glu Lys Gln Phe Asp Gln Lys Tyr Gly Pro

TCT ATC ATT TCT TCT TGC GTG TTT TCT TGT GGG TTT TCT TCT TTT TTA 204
Ser Ile Ile Ser Ser Cys Val Phe Ser Cys Gly Phe Ser Ser Phe Leu
35 40 45

GGG TGG TTG GGG GTT TTT TGG TTT TCT GTT TTG TTG TCA TTT TCT ATT 252
Gly Trp Leu Gly Val Phe Trp Phe Ser Val Leu Leu Ser Phe Ser Ile
50 55 60

ATG GGT GCA AGT GTG GGC ATG ATA GGT TTG GGC GTG GTG GGC GTA AGA 300
Met Gly Ala Ser Val Gly Met Ile Gly Leu Gly Val Val Gly Val Arg
65 70 75

GTT TCT TTT GTA GGC GTG GGT TCT CTT TCT TTA GTT TCT TGT TTA ATT 348
Val Ser Phe Val Gly Val Gly Ser Leu Ser Leu Val Ser Cys Leu Ile
80 85 90

TCT TTT AAA GGG GGG TTA GTG GGG TTA GTC AAA TCA TCA AAT CGG TTT 396
Ser Phe Lys Gly Gly Leu Val Gly Leu Val Lys Ser Ser Asn Arg Phe
95 100 105 110

CTT TTA GGG TAAAT GGTGTAATGG GTAGGGGGGT GGGAGGAAAT TTGGACT 447
Leu Leu Gly



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:

Val Gly Val Thr Gly Leu Lys Phe Gly Ala Phe Cys Thr Ser Phe Ser
1 5 10 15

Ser Phe Phe Arg Phe Ser Phe Ile Ile Ser Ser Ile Leu Pro Ser Ile
20 25 30

Ile Ser Ser Cys Val Phe Ser Cys Gly Phe Ser Ser Phe Leu Gly Trp
35 40 45

Leu Gly Val Phe Trp Phe Ser Val Leu Leu Ser Phe Ser Ile Met Gly
50 55 60

Ala Ser Val Gly Met Ile Gly Leu Gly Val Val Gly Val Arg Val Ser
65 70 75 80

Phe Val Gly Val Gly Ser Leu Ser Leu Val Ser Cys Leu Ile Ser Phe
85 90 95

Lys Gly Gly Leu Val Gly Leu Val Lys Ser Ser Asn Arg Phe Leu Leu
100 105 110

Gly

(2) INFORMATION FOR SEQ ID NO: 195: 
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(2) INFORMATION FOR SEQ ID NO:192: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

Met Phe Leu Ile Leu Ser Asn Pro Phe Phe Asn Thr Ser Ala Ile Lys
1 5 10 15

Phe Leu Ile Leu Ser Ile Gly Lys Leu Asn Phe Phe Pro Asn Ile Ser
20 25 30

Leu Ser Pro Ile Ile Asn Ile Arg Lys Asn Lys Lys Phe Asn Ile Ile
35 40 45

Glu Asn Lys Ser Leu Asp Lys Pro Val Lys Arg Phe Val Pro Pro Pro
50 55 60

Asn Lys Glu Ala Lys Ile Phe Pro Met Ile Ser Pro Phe Ile Phe Gly
65 70 75 80

Cys Val Ser Ser Cys Leu Phe Gly Ser Leu Met Arg Val Leu Val Gly
85 90 95

Ser Ile Ser Gln Trp His Thr
100



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 67...405
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

TCCAATCCGT CTAATATCTC TTTATTTTCG CTCAATTCTT TAACCATAAC GGGTTTTTTA 60 
GCGCTT GTG GGG GTT ACT GGG CTA AAG TTT GGA GCG TTT TGC ACT TCT 108 
Val Gly Val Thr Gly Leu Lys Phe Gly Ala Phe Cys Thr Ser 
1 5 10

TTT TCT TCT TTT TTT AGA TTT TCC TTT ATC ATT TCT TCT ATC CTT CCT 156
Phe Ser Ser Phe Phe Arg Phe Ser Phe Ile Ile Ser Ser Ile Leu Pro
15 20 25 30 
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500 505 510 

Ile Thr Gln Asp Arg Ser His Leu Met Thr His Ile Ser Tyr Ser Phe
515 520 525 

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 359 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TTTTGTCTGA TTTGTTGCTA CCAAAACCAT TACCAACCAA AGCAGATCCC ATG TTT 56 

Met Phe 
1 

TTG ATA CTA TCG AAT CCA TTC TTC AAC ACT TCT GCC ATA AAA TTC TTG 104 
Leu Ile Leu Ser Asn Pro Phe Phe Asn Thr Ser Ala Ile Lys Phe Leu
5 10 15

ATA TTG TCC ATA GGC AAG TTG AAT TTT TTC CCT AAT ATT TCA TTA AGT 152
Ile Leu Ser Ile Gly Lys Leu Asn Phe Phe Pro Asn Ile Ser Leu Ser
20 25 30

CCC ATC ATT AAC ATC AGG AAG AAC AAA AAA TTT AAT ATC ATA GAA AAC 200
Pro Ile Ile Asn Ile Arg Lys Asn Lys Lys Phe Asn Ile Ile Glu Asn
35 40 45 50

AAA TCA CTG GAT AAA CCT GTA AAA AGA TTT GTT CCG CCA CCC AAC AAA 248
Lys Ser Leu Asp Lys Pro Val Lys Arg Phe Val Pro Pro Pro Asn Lys
55 60 65

GAA GCT AAA ATT TTT CCC ATG ATC AGT CCT TTT ATT TTT GGT TGT GTA 296
Glu Ala Lys Ile Phe Pro Met Ile Ser Pro Phe Ile Phe Gly Cys Val
70 75 80

AGT TCT TGC TTG TTC GGA TCT CTA ATG CGT GTT TTA GTA GGA AGC ATT 344
Ser Ser Cys Leu Phe Gly Ser Leu Met Arg Val Leu Val Gly Ser Ile
85 90 95

TCA CAA TGG CAT ACC TAAAGCTACT AAGAAAATTC TTGAATCTAT TGGTAAGATT A 400
Ser Gln Trp His Thr
100 

CTCATGAAAT CA 412 
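
As an illustrative aid (not part of the filed listing), the three-letter translation printed under each codon row can be re-derived from the nucleotide row. The sketch below is a minimal Python example that assumes the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER`, and `translate` are hypothetical helpers introduced only for this illustration. The input is the start of the coding sequence of SEQ ID NO: 191.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# listing's translations follow the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}

def translate(dna):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the coding sequence
            break
        residues.append(THREE_LETTER[aa])
    return residues

# First codons of the SEQ ID NO: 191 coding sequence, as printed in the listing.
print(" ".join(translate("ATG TTT TTG ATA CTA TCG AAT CCA")))
# prints: Met Phe Leu Ile Leu Ser Asn Pro
```

A check of this kind is one way to tell an extraction error in a residue name (for example "lie" for Ile or "Gin" for Gln) from a genuine difference between the nucleotide row and its translation.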
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Asn Leu Thr Gly Lys Leu Glu Gly Ser Val His Leu Gly Arg Gly Trp 
65 70 75 80 

Thr Val Asn Leu Gly Gly Val Leu Gly Gly Gln Ala Tyr Asp Gly Thr
85 90 95

Lys Tyr Asp Arg Trp Ala Lys Asp Phe Thr Pro Pro Ser Tyr Trp Asp
100 105 110

Lys Thr Ser Cys Gly Thr Asp Ser Met Ser Leu Cys Met Asn Ala Thr
115 120 125

Lys Met Trp Gln Gln Ser Gly Pro Gly Gly Val Ile Asn Pro Arg Gly
130 135 140

Ile Gly Trp Glu Tyr Met Gly Glu Trp Asn Gly Leu Phe Pro Asn Tyr
145 150 155 160

Tyr Pro Ala Asn Ala Tyr Leu Pro Gly Gly Ser Arg Arg Tyr Gln Val
165 170 175

Tyr Lys Ala Asn Leu Thr Tyr Asp Ser Asp Arg Val His Met Val Met
180 185 190

Gly Arg Phe Asp Ile Thr Glu Gln Glu Gln Met Asp Trp Ile Tyr Gln
195 200 205

Leu Phe Gln Gly Phe Tyr Gly Thr Phe Lys Leu Thr Lys Asn Met Lys
210 215 220

Phe Leu Leu Phe Ser Gly Trp Gly Arg Gly Ile Ala Asp Gly Gln Trp
225 230 235 240

Leu Phe Pro Ile Tyr Arg Glu Lys Pro Trp Gly Val His Lys Ala Gly
245 250 255

Ile Ile Tyr Arg Pro Thr Lys Asn Leu Met Ile His Pro Tyr Val Tyr
260 265 270

Leu Ile Pro Met Val Gly Thr Leu Pro Gly Ala Lys Ile Glu Tyr Asp
275 280 285

Thr Asn Pro Glu Phe Ser Gly Arg Gly Ile Arg Asn Arg Thr Thr Phe
290 295 300

Tyr Ala Leu Tyr Asp Tyr Arg Trp Asn Asn Ala Glu Tyr Gly Arg Tyr
305 310 315 320

Ala Pro Ala Arg Tyr Asn Thr Trp Asp Pro Phe Leu Asp Asn Gly Lys
325 330 335

Trp Arg Gly Leu Gln Gly Pro Gly Gly Ala Thr Leu Leu Leu Arg His
340 345 350

His Ile Asp Ile Asn Asn Tyr Phe Val Val Gly Gly Ala Tyr Leu Asn
355 360 365

Ile Gly Asn Pro Asn Met Asn Leu Gly Thr Trp Gly Asn Pro Val Ala
370 375 380

Val Asp Gly Ile Glu Gln Trp Val Gly Ser Ile Tyr Ser Leu Gly Phe
385 390 395 400

Ala Gly Ile Asp Asn Ile Thr Asp Ala Asp Ala Phe Thr Glu Tyr Val
405 410 415

Lys Gly Gly Gly Lys His Gly Lys Phe Ser Trp Ser Val Tyr Gln Arg
420 425 430

Phe Thr Thr Ala Pro Arg Ala Leu Glu Tyr Gly Ile Gly Met Tyr Leu
435 440 445

Asp Tyr Gln Phe Ser Lys His Val Lys Ala Gly Leu Lys Leu Val Trp
450 455 460

Leu Glu Phe Gln Ile Arg Ala Gly Tyr Asn Pro Gly Thr Gly Phe Leu
465 470 475 480

Gly Pro Asn Gly Gln Pro Leu Asn Leu Asn Thr Gly Leu Phe Glu Ser
485 490 495

Ser Ala Phe Ala Gln Gly Pro Gln Asn Met Gly Gly Ile Ala Lys Ser
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ATT GAC AAC ATT ACC GAT GCT GAC GCG TTC ACC GAG TAT GTT AAA GGT 1304

Ile Asp Asn Ile Thr Asp Ala Asp Ala Phe Thr Glu Tyr Val Lys Gly
405 410 415

GGA GGC AAG CAT GGT AAG TTT AGT TGG AGC GTT TAT CAG CGC TTC ACT 1352
Gly Gly Lys His Gly Lys Phe Ser Trp Ser Val Tyr Gln Arg Phe Thr
420 425 430

ACC GCT CCA AGG GCT TTG GAA TAT GGT ATC GGT ATG TAT CTA GAC TAT 1400
Thr Ala Pro Arg Ala Leu Glu Tyr Gly Ile Gly Met Tyr Leu Asp Tyr
435 440 445 450

CAG TTC AGC AAG CAT GTT AAA GCG GGT CTC AAA CTC GTA TGG TTA GAG 1448
Gln Phe Ser Lys His Val Lys Ala Gly Leu Lys Leu Val Trp Leu Glu
455 460 465

TTC CAA ATT CGT GCG GGT TAC AAC CCT GGA ACC GGT TTC CTT GGG CCA 1496
Phe Gln Ile Arg Ala Gly Tyr Asn Pro Gly Thr Gly Phe Leu Gly Pro
470 475 480

AAC GGT CAG CCG CTT AAC TTG AAT ACT GGT TTG TTT GAG TCT TCA GCG 1544
Asn Gly Gln Pro Leu Asn Leu Asn Thr Gly Leu Phe Glu Ser Ser Ala
485 490 495

TTC GCT CAA GGC CCT CAA AAC ATG GGC GGT ATC GCA AAA AGC ATC ACT 1592
Phe Ala Gln Gly Pro Gln Asn Met Gly Gly Ile Ala Lys Ser Ile Thr
500 505 510

CAA GAC AGA AGC CAT TTG ATG ACA CAC ATT AGT TAT AGT TTC TAAGAGAGT 1643
Gln Asp Arg Ser His Leu Met Thr His Ile Ser Tyr Ser Phe
515 520 525 

TCTCCCCCTA TCTCTTAGAT ATGCCTTTTT GTATTTTTAT TTTA 1687 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

Met Lys Leu Lys Lys Arg Lys Val Ala Ala Ala Leu Leu Lys Arg Phe
1 5 10 15

Thr Leu Pro Leu Leu Phe Thr Thr Gly Ser Leu Gly Ala Val Thr Tyr
20 25 30

Glu Val His Gly Asp Phe Ile Asn Phe Ala Lys Val Gly Phe Asn His
35 40 45

Ser Pro Ile Asn Pro Val Lys Gly Ile Tyr Pro Thr Glu Thr Phe Val
50 55 60
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GCA AAT TTG ACC TAT GAT AGC GAC AGG GTC CAT ATG GTA ATG GGG CGT 632 
Ala Asn Leu Thr Tyr Asp Ser Asp Arg Val His Met Val Met Gly Arg 
180 185 190 

TTT GAC ATT ACC GAG CAG GAG CAA ATG GAT TGG ATT TAC CAA TTG TTC 680
Phe Asp Ile Thr Glu Gln Glu Gln Met Asp Trp Ile Tyr Gln Leu Phe
195 200 205 210

CAA GGG TTT TAT GGG ACT TTC AAG CTC ACT AAG AAT ATG AAA TTC TTG 728
Gln Gly Phe Tyr Gly Thr Phe Lys Leu Thr Lys Asn Met Lys Phe Leu
215 220 225

CTC TTT AGT GGT TGG GGT CGT GGT ATC GCT GAT GGT CAG TGG TTG TTC 776
Leu Phe Ser Gly Trp Gly Arg Gly Ile Ala Asp Gly Gln Trp Leu Phe
230 235 240

CCT ATC TAT CGT GAA AAG CCT TGG GGG GTT CAT AAA GCG GGT ATT ATT 824
Pro Ile Tyr Arg Glu Lys Pro Trp Gly Val His Lys Ala Gly Ile Ile
245 250 255

TAT CGC CCT ACA AAG AAT TTG ATG ATC CAC CCT TAT GTG TAT CTT ATC 872
Tyr Arg Pro Thr Lys Asn Leu Met Ile His Pro Tyr Val Tyr Leu Ile
260 265 270

CCA ATG GTA GGC ACA TTG CCT GGT GCT AAA ATA GAA TAC GAT ACC AAT 920
Pro Met Val Gly Thr Leu Pro Gly Ala Lys Ile Glu Tyr Asp Thr Asn
275 280 285 290

CCT GAA TTT AGC GGT AGG GGC ATT AGG AAC AGA ACG ACT TTC TAT GCG 968
Pro Glu Phe Ser Gly Arg Gly Ile Arg Asn Arg Thr Thr Phe Tyr Ala
295 300 305

TTG TAT GAC TAT CGT TGG AAT AAC GCT GAA TAC GGT CGT TAC GCG CCC 1016
Leu Tyr Asp Tyr Arg Trp Asn Asn Ala Glu Tyr Gly Arg Tyr Ala Pro
310 315 320

GCT CGT TAT AAC ACT TGG GAT CCG TTC TTG GAT AAT GGT AAG TGG CGT 1064
Ala Arg Tyr Asn Thr Trp Asp Pro Phe Leu Asp Asn Gly Lys Trp Arg
325 330 335

GGC TTG CAA GGT CCT GGT GGT GCG ACG CTC CTT TTA CGC CAC CAT ATA 1112
Gly Leu Gln Gly Pro Gly Gly Ala Thr Leu Leu Leu Arg His His Ile
340 345 350

GAT ATT AAC AAC TAC TTT GTG GTT GGT GGT GCT TAT CTC AAC ATT GGT 1160
Asp Ile Asn Asn Tyr Phe Val Val Gly Gly Ala Tyr Leu Asn Ile Gly
355 360 365 370

AAC CCT AAC ATG AAC TTA GGT ACT TGG GGT AAC CCT GTG GCT GTT GAT 1208
Asn Pro Asn Met Asn Leu Gly Thr Trp Gly Asn Pro Val Ala Val Asp
375 380 385

GGT ATC GAA CAA TGG GTC GGT AGT ATC TAT AGC TTA GGG TTT GCG GGG 1256
Gly Ile Glu Gln Trp Val Gly Ser Ile Tyr Ser Leu Gly Phe Ala Gly
390 395 400 
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(B) LOCATION: 51... 1634 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

TTATTTTTAC AGAGTAATTT ATCTATTCTC AGGTAAAGTA AGGAAGAGGA ATG AAA 56
Met Lys
1

TTA AAG AAA CGA AAA GTT GCG GCT GCA TTG CTA AAG CGT TTT ACC TTG 104
Leu Lys Lys Arg Lys Val Ala Ala Ala Leu Leu Lys Arg Phe Thr Leu
5 10 15

CCA CTA TTG TTC ACT ACG GGT TCA TTA GGG GCG GTT ACT TAT GAA GTG 152
Pro Leu Leu Phe Thr Thr Gly Ser Leu Gly Ala Val Thr Tyr Glu Val
20 25 30

CAT GGA GAT TTT ATC AAT TTT GCT AAA GTG GGT TTT AAC CAT TCG CCC 200
His Gly Asp Phe Ile Asn Phe Ala Lys Val Gly Phe Asn His Ser Pro
35 40 45 50

ATT AAT CCT GTT AAA GGT ATC TAT CCC ACA GAA ACT TTT GTT AAC CTT 248
Ile Asn Pro Val Lys Gly Ile Tyr Pro Thr Glu Thr Phe Val Asn Leu
55 60 65

ACG GGT AAG CTA GAG GGG TCT GTG CAT TTA GGT AGG GGA TGG ACC GTG 296
Thr Gly Lys Leu Glu Gly Ser Val His Leu Gly Arg Gly Trp Thr Val
70 75 80

AAT TTA GGC GGT GTT TTG GGC GGA CAG GCT TAT GAT GGC ACT AAG TAT 344
Asn Leu Gly Gly Val Leu Gly Gly Gln Ala Tyr Asp Gly Thr Lys Tyr
85 90 95

GAT AGG TGG GCG AAG GAT TTT ACC CCC CCA AGC TAT TGG GAT AAA ACT 392
Asp Arg Trp Ala Lys Asp Phe Thr Pro Pro Ser Tyr Trp Asp Lys Thr
100 105 110

TCT TGC GGT ACT GAT TCT ATG AGT CTT TGT ATG AAT GCC ACT AAA ATG 440
Ser Cys Gly Thr Asp Ser Met Ser Leu Cys Met Asn Ala Thr Lys Met
115 120 125 130

TGG CAG CAA TCA GGG CCA GGT GGC GTC ATT AAC CCT AGA GGT ATT GGT 488
Trp Gln Gln Ser Gly Pro Gly Gly Val Ile Asn Pro Arg Gly Ile Gly
135 140 145

TGG GAA TAC ATG GGT GAG TGG AAC GGC TTG TTC CCT AAC TAC TAT CCG 536
Trp Glu Tyr Met Gly Glu Trp Asn Gly Leu Phe Pro Asn Tyr Tyr Pro
150 155 160

GCT AAC GCC TAC TTG CCT GGT GGC TCA AGG CGC TAT CAA GTC TAT AAA 584
Ala Asn Ala Tyr Leu Pro Gly Gly Ser Arg Arg Tyr Gln Val Tyr Lys
165 170 175 
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Glu Tyr Ser Ile Ser His Ala Val Glu His Asn Asn Pro Phe Leu Asn
35 40 45

Gln Glu Arg Ile Gln Ile Ile Ser Asn Ala Gln Asn Lys Ile Tyr Lys
50 55 60

Leu His Gln Val Lys Asn Glu Ile Thr Ser Met Pro Lys Thr Phe Ala
65 70 75 80

Tyr Ile Asn Asn Ala Leu Lys Asn Asn Ser Lys Leu Thr Pro Thr Glu
85 90 95

Met Gln Ala Glu Gln Tyr Tyr Leu Gln Ser Thr Phe Gln Asn Ile Glu
100 105 110

Lys Ile Val Met Leu Ser Gly Gly Val Ser Ser Asn Pro Gln Leu Val
115 120 125

Gln Ala Leu Glu Lys Met Gln Glu Pro Ile Thr Asn Pro Leu Glu Phe
130 135 140

Glu Glu Asn Leu Arg Asn Leu Glu Val Gln Phe Ala Gln Ser Gln Asn
145 150 155 160

Arg Met Leu Ser Ser Leu Ser Ser Gln Ile Ala Ala Ile Ser Asn Ser
165 170 175

Leu Asn Ala Leu Asp Pro Asn Ser Tyr Ser Lys Asn Ile Ser Ser Met
180 185 190

Tyr Gly Val Ser Leu Ser Val Gly Tyr Lys His Phe Phe Thr Lys Lys
195 200 205

Lys Asn Gln Gly Leu Arg Tyr Tyr Leu Phe Tyr Asp Tyr Gly Tyr Thr
210 215 220

Asn Phe Gly Phe Val Gly Asn Gly Phe Asp Gly Leu Gly Lys Met Asn
225 230 235 240

Asn His Leu Tyr Gly Leu Gly Ile Asp Tyr Leu Tyr Asn Phe Ile Asp
245 250 255

Asn Ala Lys Lys His Ser Ser Val Gly Phe Tyr Leu Gly Phe Ala Leu
260 265 270

Ala Gly Ser Ser Trp Val Gly Ser Gly Leu Ser Met Trp Val Ser Gln
275 280 285

Thr Asp Phe Ile Asn Asn Tyr Leu Thr Gly Tyr Gln Ala Lys Met His
290 295 300

Thr Ser Phe Phe Gln Ile Pro Leu Asn Phe Gly Val Arg Val Asn Val
305 310 315 320

Asn Arg His Asn Gly Phe Glu Met Gly Leu Lys Ile Pro Leu Ala Met
325 330 335

Asn Ser Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Thr Ser Leu Phe
340 345 350

Phe Lys Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr Ser Phe
355 360 365



(2) INFORMATION FOR SEQ ID NO: 189:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
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GGT TTT GTG GGC AAT GGC TTT GAT GGT TTA GGC AAA ATG AAT AAC CAT 776

Gly Phe Val Gly Asn Gly Phe Asp Gly Leu Gly Lys Met Asn Asn His 
230 235 240 

CTC TAT GGG CTT GGG ATA GAC TAT CTT TAT AAT TTC ATT GAT AAT GCA 824 
Leu Tyr Gly Leu Gly Ile Asp Tyr Leu Tyr Asn Phe Ile Asp Asn Ala
245 250 255

AAA AAA CAC TCT AGC GTA GGT TTT TAT CTG GGT TTT GCT TTA GCG GGG 872
Lys Lys His Ser Ser Val Gly Phe Tyr Leu Gly Phe Ala Leu Ala Gly
260 265 270

AGT TCG TGG GTA GGG AGT GGT TTG AGC ATG TGG GTG AGC CAA ACG GAT 920
Ser Ser Trp Val Gly Ser Gly Leu Ser Met Trp Val Ser Gln Thr Asp
275 280 285 290

TTT ATC AAC AAT TAC TTG ACG GGC TAT CAA GCT AAA ATG CAC ACG AGT 968
Phe Ile Asn Asn Tyr Leu Thr Gly Tyr Gln Ala Lys Met His Thr Ser
295 300 305

TTT TTC CAG ATC CCT TTG AAT TTT GGG GTT CGT GTG AAT GTC AAT AGG 1016
Phe Phe Gln Ile Pro Leu Asn Phe Gly Val Arg Val Asn Val Asn Arg
310 315 320

CAT AAT GGC TTT GAA ATG GGC TTG AAA ATC CCT TTA GCG ATG AAT TCC 1064
His Asn Gly Phe Glu Met Gly Leu Lys Ile Pro Leu Ala Met Asn Ser
325 330 335

TTT TAT GAA ACG CAT GGC AAA GGG CTA AAC ACT TCC CTC TTT TTC AAA 1112
Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Thr Ser Leu Phe Phe Lys
340 345 350

CGC CTT GTC ATG TTT AAC GTG AGT TAC GTT TAT AGT TTT TAGGGGGGTA AA 1163
Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr Ser Phe
355 360 365

TGCCTTCAAA CGCTCTTTTG ATTGAAGAAA TCACTCATTT A 1204

(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 367 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:

Met Gln Lys Ala Leu Leu His Ser Ser Phe Phe Leu Pro Leu Phe Leu
1 5 10 15

Ser Phe Cys Ile Ala Glu Glu Asn Gly Ala Tyr Ala Ser Val Gly Phe
20 25 30 
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Lys Ala Leu Leu His Ser Ser Phe Phe Leu Pro Leu Phe Leu Ser Phe 
5 10 15 

TGT ATC GCT GAA GAA AAT GGG GCG TAT GCG AGC GTG GGT TTT GAA TAT 152 
Cys Ile Ala Glu Glu Asn Gly Ala Tyr Ala Ser Val Gly Phe Glu Tyr
20 25 30

TCC ATT AGT CAT GCC GTT GAA CAC AAT AAC CCC TTT TTA AAT CAA GAA 200
Ser Ile Ser His Ala Val Glu His Asn Asn Pro Phe Leu Asn Gln Glu
35 40 45 50

CGC ATC CAA ATC ATT TCT AAC GCT CAA AAT AAA ATC TAT AAA CTC CAT 248
Arg Ile Gln Ile Ile Ser Asn Ala Gln Asn Lys Ile Tyr Lys Leu His
55 60 65

CAA GTT AAA AAT GAA ATC ACA AGC ATG CCT AAA ACC TTT GCA TAT ATC 296
Gln Val Lys Asn Glu Ile Thr Ser Met Pro Lys Thr Phe Ala Tyr Ile
70 75 80

AAC AAC GCT TTA AAA AAC AAC TCC AAA TTA ACC CCC ACT GAA ATG CAA 344
Asn Asn Ala Leu Lys Asn Asn Ser Lys Leu Thr Pro Thr Glu Met Gln
85 90 95

GCC GAA CAA TAC TAC CTC CAA TCC ACC TTT CAA AAC ATT GAA AAA ATA 392
Ala Glu Gln Tyr Tyr Leu Gln Ser Thr Phe Gln Asn Ile Glu Lys Ile
100 105 110

GTA ATG CTT AGC GGT GGC GTT TCA TCT AAC CCA CAA TTA GTC CAA GCG 440
Val Met Leu Ser Gly Gly Val Ser Ser Asn Pro Gln Leu Val Gln Ala
115 120 125 130

TTG GAA AAA ATG CAA GAA CCC ATT ACT AAC CCT TTA GAA TTT GAA GAA 488
Leu Glu Lys Met Gln Glu Pro Ile Thr Asn Pro Leu Glu Phe Glu Glu
135 140 145

AAC TTA AGA AAT TTA GAA GTG CAA TTT GCT CAA TCT CAA AAC CGC ATG 536
Asn Leu Arg Asn Leu Glu Val Gln Phe Ala Gln Ser Gln Asn Arg Met
150 155 160

CTT TCT TCT TTA TCT TCT CAA ATC GCT GCC ATT TCA AAT TCC TTA AAC 584
Leu Ser Ser Leu Ser Ser Gln Ile Ala Ala Ile Ser Asn Ser Leu Asn
165 170 175

GCG CTT GAT CCT AAC TCT TAT TCT AAA AAC ATT TCA AGC ATG TAT GGG 632
Ala Leu Asp Pro Asn Ser Tyr Ser Lys Asn Ile Ser Ser Met Tyr Gly
180 185 190

GTG AGT TTG AGC GTA GGT TAT AAG CAT TTC TTT ACC AAG AAA AAA AAT 680
Val Ser Leu Ser Val Gly Tyr Lys His Phe Phe Thr Lys Lys Lys Asn
195 200 205 210

CAA GGG TTG CGC TAT TAC TTG TTT TAT GAC TAT GGT TAC ACT AAT TTT 728
Gln Gly Leu Arg Tyr Tyr Leu Phe Tyr Asp Tyr Gly Tyr Thr Asn Phe
215 220 225 
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TTATCCTACA AG 412 
(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

Met Phe Leu Gln Val Val Ala Arg Thr Leu Arg Lys Asn Val Asn Ile
1 5 10 15

Leu Glu Glu Gln Gly Phe Ile Glu Val Ile Lys Gly Lys Gln Arg Tyr
20 25 30

Leu Tyr Val Tyr Leu Lys Asp Tyr Arg Glu Leu Glu Gly Tyr Asn Ser
35 40 45

Val Gly Ala Asn Gln Lys Asn Asn Ile Pro Ser Pro Phe Phe Leu Gln
50 55 60

Ile Met Arg Phe Leu Glu Lys Phe Ala Lys Glu Ile Glu Arg Val Lys
65 70 75 80

Ile Thr Thr Lys Asn Val Leu Cys Ile Phe Leu Ala Lys Ser Leu Cys
85 90 95

Lys Glu Leu Ile Met Leu Phe
100

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1151 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

TTCTATTAAA ATTAGTGTAT GATTGAGATT ATTTTTGATT AGGATCAACC ATG CAA 56 

Met Gln
1 

AAA GCC TTA TTA CAT TCA TCA TTC TTT TTA CCT TTA TTT TTA TCT TTT 104
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Lys Arg 
385 

(2) INFORMATION FOR SEQ ID NO: 185:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 359 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CATTTAATGC TAAGTCTAAT AAGATTGCCC TAGATAGACA TTACGCCAAA ATG TTT 56 

Met Phe 
1 

TTG CAA GTT GTA GCA AGA ACT CTA AGA AAG AAT GTC AAT ATA TTA GAA 104
Leu Gln Val Val Ala Arg Thr Leu Arg Lys Asn Val Asn Ile Leu Glu
5 10 15

GAG CAA GGT TTT ATT GAA GTC ATT AAA GGA AAA CAA AGA TAC TTG TAT 152
Glu Gln Gly Phe Ile Glu Val Ile Lys Gly Lys Gln Arg Tyr Leu Tyr
20 25 30

GTG TAT CTT AAA GAT TAC AGA GAA TTA GAG GGC TAT AAC TCC GTA GGA 200
Val Tyr Leu Lys Asp Tyr Arg Glu Leu Glu Gly Tyr Asn Ser Val Gly
35 40 45 50

GCT AAT CAA AAG AAC AAT ATC CCA TCG CCT TTT TTC TTA CAG ATT ATG 248
Ala Asn Gln Lys Asn Asn Ile Pro Ser Pro Phe Phe Leu Gln Ile Met
55 60 65

CGT TTC TTA GAA AAG TTT GCC AAA GAA ATT GAG AGA GTA AAA ATA ACA 296
Arg Phe Leu Glu Lys Phe Ala Lys Glu Ile Glu Arg Val Lys Ile Thr
70 75 80

ACA AAG AAT GTG TTA TGC ATA TTC CTA GCC AAG AGC TTA TGC AAA GAG 344
Thr Lys Asn Val Leu Cys Ile Phe Leu Ala Lys Ser Leu Cys Lys Glu
85 90 95

TTA ATA ATG TTG TTT TAAAATTCAC GCCTATTTCT AATCCTAATA CCACTTACAC T 400
Leu Ile Met Leu Phe
100 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

Met Ala Glu Lys Leu Glu Glu Thr Asn Pro Glu Trp Phe Glu Lys Trp
1 5 10 15

Arg Glu Lys Gln Tyr Thr Gln Thr Gly Glu Ser Lys Pro Ser Lys Arg
20 25 30

Ile Lys Val Phe Lys Asn Phe Thr Ala Phe Asp Asp Arg Leu Tyr Thr
35 40 45

Ile Glu Cys Asn Leu Lys Asn Leu Asp Thr His Gln Lys Lys Phe Glu
50 55 60

Ile Cys Gly Ala Leu Tyr Asp Ile Tyr Glu Gln Ile Phe Asp Glu Thr
65 70 75 80

Pro Ser Leu Lys Gly Arg Asp Leu Glu Thr Tyr Lys Ala Gln Asp Leu
85 90 95

Ser Lys Lys Phe Met His Leu Gly Phe Glu Gln Ile Ser Lys Asp Leu
100 105 110

Asn Asp Ser Arg Leu Asn Ala Leu Leu Cys Tyr Glu Glu Lys Val Met
115 120 125

Gln Ala Leu Ala Lys Lys Tyr Pro Ser Phe Leu Gln Asp Leu His Asp
130 135 140

Ile Lys Lys Tyr Arg Asn Lys Asp Lys His Gly Glu Lys Pro Gln Asp
145 150 155 160

Gly Ser Ser Leu Thr Arg Val Glu Leu Glu Arg Tyr Arg Asp Gly Ile
165 170 175

Tyr Phe Leu Val Glu Asn Leu Leu Lys Asn Pro Leu Ile Lys Glu Arg
180 185 190

Glu Asn Ala Gln Glu Glu Lys His Tyr Lys Lys Asn Ala Glu Ile Asp
195 200 205

Asp Arg Ser Gln Leu Ser Asn Leu Asn Ala Pro Lys Pro Leu Phe Glu
210 215 220

Cys Phe Val Gly Val Asn Leu Ala Lys Ala Lys Tyr Tyr Ser Lys Lys
225 230 235 240

Glu Glu Arg Glu Lys Glu Lys Met Ile Leu Asn Phe Cys Lys Ile Phe
245 250 255

Glu Ile Ile Leu Phe Glu Ala Ile Gln Lys Gln Pro Lys Pro Asp Phe
260 265 270

Lys Asn Lys Asp Glu Leu Leu Gly Asp Tyr Pro Asn Leu Lys Asn Leu
275 280 285

Asp Ser Leu Arg Glu Val Arg Glu Asp Phe Leu Lys Arg Ala Phe Lys
290 295 300

Asn Asp Glu Ala Ser Leu Gly Ala Tyr Val Leu Val Leu Leu Ser Cys
305 310 315 320

Lys Tyr Phe Glu Ser Val Phe Glu Lys Val Gln Glu Trp Leu Asp Phe
325 330 335

Ile Ala Arg Leu Ile Ala Leu Arg Gly His Val His Lys Ile Thr Lys
340 345 350

Glu Leu Glu Arg Leu Glu Glu Glu Asp Leu Glu Lys Leu Glu Lys Gln
355 360 365

Ala Leu Glu Tyr Phe Asn Lys Ile Ala Asn Lys Ile Tyr Leu Lys Glu
370 375 380
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Ala Gln Glu Glu Lys His Tyr Lys Lys Asn Ala Glu Ile Asp Asp Arg
195 200 205 210 

TCC CAG CTA TCA AAC TTA AAC GCA CCC AAA CCC TTA TTT GAA TGT TTT 728
Ser Gln Leu Ser Asn Leu Asn Ala Pro Lys Pro Leu Phe Glu Cys Phe
215 220 225

GTA GGA GTT AAT CTG GCC AAA GCC AAA TAT TAT TCT AAA AAA GAA GAA 776
Val Gly Val Asn Leu Ala Lys Ala Lys Tyr Tyr Ser Lys Lys Glu Glu
230 235 240

AGA GAA AAA GAA AAG ATG ATC TTG AAT TTT TGT AAG ATA TTT GAA ATT 824
Arg Glu Lys Glu Lys Met Ile Leu Asn Phe Cys Lys Ile Phe Glu Ile
245 250 255

ATT CTT TTT GAA GCT ATC CAA AAA CAA CCA AAG CCT GAT TTT AAA AAT 872
Ile Leu Phe Glu Ala Ile Gln Lys Gln Pro Lys Pro Asp Phe Lys Asn
260 265 270

AAA GAC GAG CTT TTA GGG GAT TAT CCT AAT CTT AAA AAT TTA GAT TCT 920
Lys Asp Glu Leu Leu Gly Asp Tyr Pro Asn Leu Lys Asn Leu Asp Ser
275 280 285 290

TTA AGA GAA GTG AGG GAA GAC TTT TTG AAA AGA GCG TTT AAG AAT GAT 968
Leu Arg Glu Val Arg Glu Asp Phe Leu Lys Arg Ala Phe Lys Asn Asp
295 300 305

GAA GCG AGT TTG GGA GCG TAT GTG TTA GTG TTG CTT AGC TGT AAG TAT 1016
Glu Ala Ser Leu Gly Ala Tyr Val Leu Val Leu Leu Ser Cys Lys Tyr
310 315 320

TTT GAG AGC GTG TTT GAA AAA GTT CAA GAA TGG CTA GAT TTT ATC GCT 1064
Phe Glu Ser Val Phe Glu Lys Val Gln Glu Trp Leu Asp Phe Ile Ala
325 330 335

AGG CTT ATT GCT TTG AGA GGC CAT GTG CAC AAG ATA ACT AAA GAA CTT 1112
Arg Leu Ile Ala Leu Arg Gly His Val His Lys Ile Thr Lys Glu Leu
340 345 350

GAA AGA TTA GAA GAA GAG GAT TTA GAA AAA TTG GAA AAA CAA GCA CTA 1160
Glu Arg Leu Glu Glu Glu Asp Leu Glu Lys Leu Glu Lys Gln Ala Leu
355 360 365 370

GAA TAT TTT AAT AAA ATA GCA AAT AAA ATA TAT CTA AAG GAG AAA CGA T 1209
Glu Tyr Phe Asn Lys Ile Ala Asn Lys Ile Tyr Leu Lys Glu Lys Arg
375 380 385 

GAGCGGGAAT GAAGAATTGG AGCTAAGAGC CAGAGAAACT GAGTTGGATA AA 1261 

(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

ATGATAGTAA GGAAATAAGA GTGGAATGCA AGAATCACCC TATTGAAAAG ATG GCA 56 

Met Ala 
1 

GAA AAA TTA GAG GAA ACT AAT CCT GAA TGG TTT GAA AAA TGG AGG GAA 104 
Glu Lys Leu Glu Glu Thr Asn Pro Glu Trp Phe Glu Lys Trp Arg Glu 
5 10 15 

AAA CAA TAC ACC CAA ACT GGC GAA TCT AAG CCA TCA AAA CGA ATC AAA 152 
Lys Gln Tyr Thr Gln Thr Gly Glu Ser Lys Pro Ser Lys Arg Ile Lys
20 25 30

GTT TTT AAA AAC TTT ACG GCA TTT GAT GAC AGA TTG TAT ACA ATT GAA 200
Val Phe Lys Asn Phe Thr Ala Phe Asp Asp Arg Leu Tyr Thr Ile Glu
35 40 45 50

TGT AAT TTA AAA AAT CTG GAT ACC CAT CAA AAA AAG TTT GAA ATT TGT 248
Cys Asn Leu Lys Asn Leu Asp Thr His Gln Lys Lys Phe Glu Ile Cys
55 60 65

GGG GCT CTG TAT GAC ATT TAT GAA CAA ATT TTT GAT GAA ACA CCA AGC 296
Gly Ala Leu Tyr Asp Ile Tyr Glu Gln Ile Phe Asp Glu Thr Pro Ser
70 75 80

TTG AAA GGG CGC GAT TTA GAA ACA TAC AAA GCA CAA GAT TTG TCA AAG 344
Leu Lys Gly Arg Asp Leu Glu Thr Tyr Lys Ala Gln Asp Leu Ser Lys
85 90 95

AAA TTC ATG CAT TTA GGT TTT GAA CAG ATC TCA AAA GAT TTA AAC GAC 392
Lys Phe Met His Leu Gly Phe Glu Gln Ile Ser Lys Asp Leu Asn Asp
100 105 110

TCT AGA TTG AAC GCT TTA TTG TGC TAT GAG GAA AAA GTC ATG CAA GCT 440
Ser Arg Leu Asn Ala Leu Leu Cys Tyr Glu Glu Lys Val Met Gln Ala
115 120 125 130

TTG GCT AAA AAA TAC CCT AGT TTT TTA CAA GAT TTG CAT GAT ATA AAA 488
Leu Ala Lys Lys Tyr Pro Ser Phe Leu Gln Asp Leu His Asp Ile Lys
135 140 145

AAA TAC AGG AAT AAA GAT AAA CAC GGC GAG AAA CCA CAA GAT GGG TCT 536
Lys Tyr Arg Asn Lys Asp Lys His Gly Glu Lys Pro Gln Asp Gly Ser
150 155 160

TCT TTA ACG AGA GTG GAA TTA GAA AGA TAC AGA GAT GGA ATT TAT TTT 584
Ser Leu Thr Arg Val Glu Leu Glu Arg Tyr Arg Asp Gly Ile Tyr Phe
165 170 175

CTA GTA GAA AAT CTT TTA AAA AAC CCC TTG ATT AAA GAG AGA GAA AAT 632
Leu Val Glu Asn Leu Leu Lys Asn Pro Leu Ile Lys Glu Arg Glu Asn
180 185 190 

GCT CAA GAA GAA AAA CAT TAT AAG AAA AAT GCA GAG ATT GAC GAC CGA 680 
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Ile Glu Ala Phe Val Leu Asp Ile Gln Val Leu Glu Gly Met Ile Glu
70 75 80 

ATT GTG GTG TTT GAT AAA AAG CCT TTT AAA TTC AAT TTT AAA GAA GAG T 345
Ile Val Val Phe Asp Lys Lys Pro Phe Lys Phe Asn Phe Lys Glu Glu
85 90 95

AGTTAAATGG TTATTGTTTT AGTCGTGGAT AGTTTTAAAG ACACCAGTAA TG 397

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

Met Val Tyr Glu Glu Asn Phe Val His Ala Gly Phe Val Leu Ile Ala
1 5 10 15

Cys Asn Tyr Ala Ala Leu Cys Ala Leu Asn Lys Arg His Ser Val Val
20 25 30

Val Ser Asn Asn Ile Asn Phe Tyr Ala Pro Leu Glu Leu Asn Gln Glu
35 40 45

Ala Leu Ile Lys Ala Gln Val Ile Gln Asp Gly Val Lys Lys Ala Glu
50 55 60

Ile Lys Ile Glu Ala Phe Val Leu Asp Ile Gln Val Leu Glu Gly Met
65 70 75 80

Ile Glu Ile Val Val Phe Asp Lys Lys Pro Phe Lys Phe Asn Phe Lys
85 90 95

Glu Glu
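
As an illustrative cross-check (not part of the filed listing), the protein LENGTH stated for a SEQ ID NO can be recomputed from the LOCATION of the corresponding coding sequence. The Python sketch below uses a hypothetical helper, `cds_residue_count`, and assumes that LOCATION gives inclusive, 1-based coordinates that exclude the stop codon. That assumption is consistent with the entries in this listing, for example LOCATION 51...344 for SEQ ID NO: 181 and 98 amino acids for SEQ ID NO: 182.

```python
def cds_residue_count(start, end):
    """Number of codons in an inclusive, 1-based coding-sequence range."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {start}...{end} is not a whole number of codons")
    return length // 3

# Values copied from the listing: (CDS start, CDS end, stated protein length).
examples = {
    "SEQ ID NO: 181/182": (51, 344, 98),
    "SEQ ID NO: 183/184": (51, 1208, 386),
    "SEQ ID NO: 185/186": (51, 359, 103),
}

for name, (start, end, stated) in examples.items():
    # True when the recomputed codon count matches the stated LENGTH.
    print(name, cds_residue_count(start, end) == stated)
```

A mismatch from a check like this would suggest a transcription error in either the LOCATION or the LENGTH field, without indicating which of the two is wrong.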



(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1261 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1208 
(D) OTHER INFORMATION: 
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595 600 605 

Lys Val Phe Ala Ser Ser Phe Gly Val Phe Gly Gly Leu Arg Gly Leu
610 615 620

Tyr Asn Ser Tyr Tyr Val Phe Asn Gln Val Lys Gly Ser Gly Asn Leu
625 630 635 640

Asp Ile Val Thr Gly Phe Asn Tyr Arg Tyr Lys His Ser Lys Tyr Ser
645 650 655

Val Gly Ile Ser Val Pro Leu Ile Gln Ser Gly Ile Lys Ile Ala Ser
660 665 670

Asn Asn Gly Ile Tyr Ala Asn Ser Val Val Leu Asn Glu Gly Gly Ser
675 680 685

His Phe Lys Val Phe Phe Asn Tyr Gly Trp Ile Phe
690 695 700

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 344 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

TCGTTTTGGA AAAAGATATA GCCCATGCGC GTTTCAAGGG TAATGAAAGC ATG GTG 56 

Met Val 
1 

TAT GAA GAA AAT TTT GTG CAT GCC GGG TTT GTG CTT ATT GCG TGC AAT 104 
Tyr Glu Glu Asn Phe Val His Ala Gly Phe Val Leu Ile Ala Cys Asn
5 10 15

TAT GCG GCC TTG TGC GCG TTG AAT AAA AGA CAC AGC GTG GTG GTT TCT 152
Tyr Ala Ala Leu Cys Ala Leu Asn Lys Arg His Ser Val Val Val Ser
20 25 30

AAT AAC ATC AAT TTT TAT GCC CCC CTA GAA TTG AAT CAA GAA GCA CTC 200
Asn Asn Ile Asn Phe Tyr Ala Pro Leu Glu Leu Asn Gln Glu Ala Leu
35 40 45 50

ATT AAA GCG CAA GTG ATT CAA GAT GGC GTG AAA AAA GCT GAA ATA AAA 248
Ile Lys Ala Gln Val Ile Gln Asp Gly Val Lys Lys Ala Glu Ile Lys
55 60 65

ATA GAG GCG TTT GTG TTA GAC ATT CAG GTT TTA GAG GGA ATG ATA GAA 296

WO 98/43478



PCT/US98/06371 



Glu Thr Ile Pro Lys Gln Ser Gln Ile Thr Leu Pro Ala Ser Leu Phe
165 170 175

Asn Asp Ser Glu Phe Glu Gln Ala Asp Ser Phe Asn Tyr Gln Gln Leu
180 185 190

Gln Ala Thr Ala Thr Gln Phe Ser Asp Ala Asn Thr Gln Ser Leu Phe
195 200 205

Gln Lys Leu Ser Lys Ile Thr Thr Asn Val Thr Met Ser Tyr Glu Asn
210 215 220

Ala Asp Thr Asn Asn Phe Lys Gly Asn Cys His Asp Cys Val Ser Asp
225 230 235 240

Phe Thr Pro Gln Thr Ala Glu Glu Leu Thr Asn Leu Met Leu Asp Met
245 250 255

Ile Ala Val Phe Asp Ser Lys Ser Trp Glu Glu Ala Val Leu Asn Ala
260 265 270

Pro Phe Gln Phe Ser Asn Ser Ser Ser Glu Cys Gly Ser Asp Phe Pro
275 280 285

Lys Cys Val Asn Pro Phe Asn Asn Gly Arg Val Ala Pro Ile Tyr Glu
290 295 300

Lys Tyr Val Leu Thr Pro Gln Ser Val Ile Asp Ala Phe Arg Arg Thr
305 310 315 320

Ile Asn Leu Glu Val Asn Ile Leu Lys Ser Gly Phe Val Gly Leu Gly
325 330 335

Tyr Glu Leu Asp Asp Asn Asp Gly Asn Leu Gly Ile Glu Ala Ser Ala
340 345 350

Leu Asn Pro Glu Lys Leu Phe Gly Lys Thr Leu Asn Lys Val Asp Ile
355 360 365

Val Glu Leu Arg Asp Ile Ile His Glu Phe Ser His Thr Lys Gly Tyr
370 375 380

Thr His Asn Gly Asn Met Thr Tyr Gln Arg Val Arg Leu Cys Gln Glu
385 390 395 400

Asn Gly Gly Ala Ile Gln Glu Cys Glu Gly Gly Lys Glu Glu Leu Val
405 410 415

Asn Gly Lys Glu Glu Leu Lys Phe Thr Asn Gly Lys Glu Val Lys Asp
420 425 430

Gln Asp Gly Tyr Thr Tyr Asp Val Cys Ser Phe Tyr Lys Asp Asn His
435 440 445

Gln Val Tyr Thr Ala Ser Asn Tyr Pro Asn Ser Ile Tyr Thr Asn Cys
450 455 460

Ala Gln Val Pro Ala Gly Leu Ile Gly Val Thr Thr Ala Val Trp Gln
465 470 475 480

Gln Leu Ile Asn Gln Asn Ala Leu Pro Ile Asn Phe Ala Asn Leu Asn
485 490 495

Ser Pro Thr Asn His Leu Asn Ala Gly Leu Asn Ala Gln Asn Phe Ala
500 505 510

Thr Ser Ile Val Ser Ala Ile Ala Gln Asn Phe Ser Thr Thr Ser Thr
515 520 525

Thr Thr Tyr Arg Ser Ser Ser Lys Asn Phe Arg Ser Pro Ile Leu Gly
530 535 540

Val Asn Val Lys Ile Gly Tyr Gln His Tyr Phe Asn Asp Tyr Ile Gly
545 550 555 560

Leu Ala Tyr Tyr Gly Ile Ile Lys Tyr Asn Tyr Ala Lys Thr Asn Asp
565 570 575

Glu Lys Ile Gln Gln Leu Ser Tyr Gly Gly Gly Met Asp Val Leu Phe
580 585 590

Asp Phe Ile Thr Thr Tyr Ala Asn Lys Lys Gln Asp Asn Pro Thr Lys
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AGC TAT TAT GTC TTC AAC CAA GTC AAA GGA AGC GGT AAT TTA GAT ATA 1976 
Ser Tyr Tyr Val Phe Asn Gln Val Lys Gly Ser Gly Asn Leu Asp Ile
630 635 640

GTT ACT GGG TTT AAT TAC CGC TAC AAG CAT TCT AAA TAT TCT GTA GGC 2024
Val Thr Gly Phe Asn Tyr Arg Tyr Lys His Ser Lys Tyr Ser Val Gly
645 650 655

ATT AGC GTT CCT TTA ATC CAA AGC GGT ATT AAA ATC GCT TCT AAT AAT 2072
Ile Ser Val Pro Leu Ile Gln Ser Gly Ile Lys Ile Ala Ser Asn Asn
660 665 670

GGC ATC TAT GCG AAC TCC GTT GTT TTG AAT GAA GGG GGC AGT CAT TTT 2120
Gly Ile Tyr Ala Asn Ser Val Val Leu Asn Glu Gly Gly Ser His Phe
675 680 685 690

AAA GTG TTT TTT AAT TAC GGG TGG ATT TTT TAGGATTTAA AATCCCCAAT AAC 2173
Lys Val Phe Phe Asn Tyr Gly Trp Ile Phe
695 700

CCCCTAAACT TGTGCGATAC TCGCTACAAA 2203

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:

Met Leu Arg Phe Val Ser Lys Thr Ile Cys Leu Ser Leu Ile Gly Leu
1 5 10 15

Phe Asn Pro Leu Glu Ala Phe Gln Lys His Gln Lys Asp Gly Phe Phe
20 25 30

Ile Glu Ala Gly Phe Glu Thr Gly Leu Leu Glu Gly Thr Gln Thr Lys
35 40 45

Glu Glu Val Ile Thr Thr Gln Lys Ile Tyr Glu Asn Pro Leu Thr His
50 55 60

Pro Gln Thr Lys Glu Gln Pro Lys Glu Gln Asn Lys Ser Asp Thr Ala
65 70 75 80

Thr Pro Gln Ser Ala Tyr Gly Lys Tyr Tyr Ile Pro Gln Ser Thr Ile
85 90 95

Leu Lys Asn Ala Thr Ala Leu Phe Thr Thr Asp Lys Ile Glu Asn Gly
100 105 110

Leu Thr Phe Tyr Ser Gln Asn Pro Val Tyr Ala Asn Met Val Asn Gly
115 120 125

Ser Val Thr Ile Gln Asn Phe Leu Pro Tyr Asn Leu Asn Asn Val Glu
130 135 140

Leu Ser Phe Lys Asp Ala Gln Gly Lys Val Val Asn Leu Gly Val Ile
145 150 155 160






GGA GCC ATA CAA GAA TGT GAG GGT GGG AAA GAA GAG TTA GTC AAT GGA 1304
Gly Ala Ile Gln Glu Cys Glu Gly Gly Lys Glu Glu Leu Val Asn Gly
405 410 415

AAA GAA GAA CTA AAA TTT ACA AAT GGG AAA GAA GTG AAA GAT CAG GAT 1352
Lys Glu Glu Leu Lys Phe Thr Asn Gly Lys Glu Val Lys Asp Gln Asp
420 425 430

GGT TAC ACC TAT GAT GTA TGT TCT TTT TAT AAG GAC AAC CAC CAA GTC 1400
Gly Tyr Thr Tyr Asp Val Cys Ser Phe Tyr Lys Asp Asn His Gln Val
435 440 445 450

TAT ACA GCG AGC AAT TAC CCC AAT TCC ATT TAT ACG AAT TGC GCT CAA 1448
Tyr Thr Ala Ser Asn Tyr Pro Asn Ser Ile Tyr Thr Asn Cys Ala Gln
455 460 465

GTC CCT GCT GGG CTT ATA GGG GTT ACC ACC GCT GTC TGG CAA CAG CTC 1496
Val Pro Ala Gly Leu Ile Gly Val Thr Thr Ala Val Trp Gln Gln Leu
470 475 480

ATC AAT CAA AAC GCT CTG CCC ATT AAT TTC GCT AAT CTA AAT AGC CCA 1544
Ile Asn Gln Asn Ala Leu Pro Ile Asn Phe Ala Asn Leu Asn Ser Pro
485 490 495

ACC AAC CAC TTA AAC GCC GGG TTG AAC GCA CAA AAT TTT GCA ACC TCT 1592
Thr Asn His Leu Asn Ala Gly Leu Asn Ala Gln Asn Phe Ala Thr Ser
500 505 510

ATA GTC AGC GCG ATC GCG CAA AAT TTT TCC ACC ACT TCC ACC ACC ACT 1640
Ile Val Ser Ala Ile Ala Gln Asn Phe Ser Thr Thr Ser Thr Thr Thr
515 520 525 530

TAC CGC TCT TCA AGT AAG AAT TTT AGA AGC CCT ATT TTA GGG GTT AAT 1688
Tyr Arg Ser Ser Ser Lys Asn Phe Arg Ser Pro Ile Leu Gly Val Asn
535 540 545

GTT AAA ATA GGC TAC CAA CAT TAT TTC AAT GAC TAC ATA GGG TTA GCC 1736
Val Lys Ile Gly Tyr Gln His Tyr Phe Asn Asp Tyr Ile Gly Leu Ala
550 555 560

TAT TAC GGC ATT ATC AAA TAC AAT TAC GCC AAA ACT AAC GAT GAA AAA 1784
Tyr Tyr Gly Ile Ile Lys Tyr Asn Tyr Ala Lys Thr Asn Asp Glu Lys
565 570 575

ATC CAG CAA TTA AGC TAT GGT GGG GGA ATG GAT GTG TTG TTT GAT TTC 1832
Ile Gln Gln Leu Ser Tyr Gly Gly Gly Met Asp Val Leu Phe Asp Phe
580 585 590

ATC ACC ACT TAC GCT AAC AAA AAG CAA GAC AAC CCA ACT AAA AAA GTT 1880
Ile Thr Thr Tyr Ala Asn Lys Lys Gln Asp Asn Pro Thr Lys Lys Val
595 600 605 610

TTT GCT TCC TCT TTT GGG GTG TTT GGG GGG TTA AGG GGC TTA TAC AAT 1928
Phe Ala Ser Ser Phe Gly Val Phe Gly Gly Leu Arg Gly Leu Tyr Asn
615 620 625






Ser Glu Phe Glu Gln Ala Asp Ser Phe Asn Tyr Gln Gln Leu Gln Ala
180 185 190

ACT GCC ACA CAA TTT TCT GAC GCT AAC ACG CAA AGT TTG TTT CAA AAG 680
Thr Ala Thr Gln Phe Ser Asp Ala Asn Thr Gln Ser Leu Phe Gln Lys
195 200 205 210

CTC AGC AAG ATC ACA ACC AAT GTA ACA ATG AGT TAT GAA AAC GCC GAT 728
Leu Ser Lys Ile Thr Thr Asn Val Thr Met Ser Tyr Glu Asn Ala Asp
215 220 225

ACC AAC AAT TTT AAA GGT AAT TGC CAT GAT TGT GTG TCA GAT TTC ACC 776
Thr Asn Asn Phe Lys Gly Asn Cys His Asp Cys Val Ser Asp Phe Thr
230 235 240

CCA CAA ACC GCA GAA GAA TTG ACC AAT TTA ATG CTA GAT ATG ATT GCG 824
Pro Gln Thr Ala Glu Glu Leu Thr Asn Leu Met Leu Asp Met Ile Ala
245 250 255

GTG TTT GAC TCT AAA TCG TGG GAA GAA GCC GTT TTA AAC GCT CCT TTC 872
Val Phe Asp Ser Lys Ser Trp Glu Glu Ala Val Leu Asn Ala Pro Phe
260 265 270

CAA TTT TCT AAC AGC TCA TCA GAG TGC GGC TCT GAC TTT CCT AAG TGC 920
Gln Phe Ser Asn Ser Ser Ser Glu Cys Gly Ser Asp Phe Pro Lys Cys
275 280 285 290

GTG AAT CCT TTC AAT AAC GGG CGT GTC GCT CCC ATC TAT GAA AAA TAC 968
Val Asn Pro Phe Asn Asn Gly Arg Val Ala Pro Ile Tyr Glu Lys Tyr
295 300 305

GTG CTA ACC CCA CAA TCC GTT ATA GAT GCG TTT AGA AGA ACG ATC AAT 1016
Val Leu Thr Pro Gln Ser Val Ile Asp Ala Phe Arg Arg Thr Ile Asn
310 315 320

CTT GAA GTG AAT ATC CTA AAA TCA GGG TTT GTA GGG CTA GGG TAT GAA 1064
Leu Glu Val Asn Ile Leu Lys Ser Gly Phe Val Gly Leu Gly Tyr Glu
325 330 335

CTT GAT GAT AAT GAT GGT AAT CTG GGG ATA GAA GCT TCT GCC TTA AAT 1112
Leu Asp Asp Asn Asp Gly Asn Leu Gly Ile Glu Ala Ser Ala Leu Asn
340 345 350

CCT GAA AAA TTG TTT GGT AAA ACT TTG AAC AAA GTT GAT ATT GTG GAA 1160
Pro Glu Lys Leu Phe Gly Lys Thr Leu Asn Lys Val Asp Ile Val Glu
355 360 365 370

TTA AGA GAC ATT ATC CAT GAA TTT AGC CAC ACT AAA GGC TAT ACG CAT 1208
Leu Arg Asp Ile Ile His Glu Phe Ser His Thr Lys Gly Tyr Thr His
375 380 385

AAT GGG AAC ATG ACT TAT CAA AGA GTG CGC TTG TGT CAA GAA AAC GGC 1256
Asn Gly Asn Met Thr Tyr Gln Arg Val Arg Leu Cys Gln Glu Asn Gly
390 395 400
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 9: 

GAGTTACACA CTCTTTGAGA ACAAAACGCC AAACCATTTA GGAAATTACC ATG CTA 56 

Met Leu 
1 

AGA TTC GTT AGT AAA ACG ATT TGC TTG TCT TTA ATC GGC TTG TTC AAC 104
Arg Phe Val Ser Lys Thr Ile Cys Leu Ser Leu Ile Gly Leu Phe Asn
5 10 15

CCT TTA GAA GCC TTT CAA AAA CAC CAA AAA GAC GGC TTT TTT ATA GAA 152
Pro Leu Glu Ala Phe Gln Lys His Gln Lys Asp Gly Phe Phe Ile Glu
20 25 30

GCT GGG TTT GAA ACT GGG TTA TTA GAA GGA ACG CAA ACT AAA GAA GAA 200
Ala Gly Phe Glu Thr Gly Leu Leu Glu Gly Thr Gln Thr Lys Glu Glu
35 40 45 50

GTC ATA ACC ACC CAA AAA ATC TAT GAA AAC CCC CTA ACC CAC CCA CAA 248
Val Ile Thr Thr Gln Lys Ile Tyr Glu Asn Pro Leu Thr His Pro Gln
55 60 65

ACT AAA GAA CAG CCT AAA GAA CAA AAT AAA AGC GAT ACG GCC ACC CCA 296
Thr Lys Glu Gln Pro Lys Glu Gln Asn Lys Ser Asp Thr Ala Thr Pro
70 75 80

CAA AGC GCT TAC GGA AAA TAC TAC ATA CCC CAA AGC ACC ATT TTA AAA 344
Gln Ser Ala Tyr Gly Lys Tyr Tyr Ile Pro Gln Ser Thr Ile Leu Lys
85 90 95

AAT GCA ACG GCT TTA TTC ACC ACG GAC AAG ATA GAA AAT GGC TTA ACT 392
Asn Ala Thr Ala Leu Phe Thr Thr Asp Lys Ile Glu Asn Gly Leu Thr
100 105 110

TTT TAT TCT CAA AAC CCT GTG TAT GCG AAT ATG GTT AAT GGG AGC GTA 440
Phe Tyr Ser Gln Asn Pro Val Tyr Ala Asn Met Val Asn Gly Ser Val
115 120 125 130

ACC ATA CAA AAC TTT CTG CCT TAT AAT TTA AAC AAT GTT GAA CTG AGT 488
Thr Ile Gln Asn Phe Leu Pro Tyr Asn Leu Asn Asn Val Glu Leu Ser
135 140 145

TTT AAA GAC GCT CAA GGC AAG GTG GTC AAT TTA GGC GTG ATA GAG ACC 536
Phe Lys Asp Ala Gln Gly Lys Val Val Asn Leu Gly Val Ile Glu Thr
150 155 160

ATC CCT AAA CAA TCT CAA ATT ACC TTG CCT GCA AGC TTG TTT AAT GAT 584
Ile Pro Lys Gln Ser Gln Ile Thr Leu Pro Ala Ser Leu Phe Asn Asp
165 170 175

TCA GAA TTT GAA CAA GCT GAT AGC TTT AAT TAC CAA CAA CTT CAA GCC 632
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lie Ser Arg Leu Ser Ala Gin lie lie Tyr Trp Leu Ser Lys Tyr Phe 
115 120 125 130 

AGG AGT GAG TAT GGA AAA TGATGTTAAA GAAGATCTAG AGCAAGCAAG ACCAAAGT 496
Arg Ser Glu Tyr Gly Lys
135

TAGAGCCAGA AAAGC 511

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 136 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

Met Leu Lys Ser Thr Ile Lys Glu Asp Tyr Leu Met Leu Met Ser Arg
1 5 10 15

Glu Val Ser Ala Phe Val Gly Thr Leu Phe Phe Ile Gly Leu Ser Cys
20 25 30

Tyr Ala Ile Tyr His Gly Asn Met Pro Asp Tyr Leu Arg Pro Ala Leu
35 40 45

Ile Asp Thr Ile Lys Ala Ala Ser Asp Ser Ile Tyr Ser Ser Cys Asp
50 55 60

Tyr Met Asp Tyr Phe Leu Lys Ala Arg Lys Met Leu Glu Gly Phe Ala
65 70 75 80

Trp Trp Ser Met Phe Lys Ala Glu Ser Met Gly Leu Asn Lys Gly Phe
85 90 95

Met Val Ala Gly Trp Val Ala Phe Ile Ile Tyr Asn Ala Leu Ser Gly
100 105 110

Ile Ala Ile Ser Arg Leu Ser Ala Gln Ile Ile Tyr Trp Leu Ser Lys
115 120 125

Tyr Phe Arg Ser Glu Tyr Gly Lys
130 135

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 2150 
(D) OTHER INFORMATION: 
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130 

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 458 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TTTTTGCACT ATCGTTGTTT GCGCTGTGGT GTTTGGCACG CTTGAAAAAA ATG CTC 56 

Met Leu 
1 

AAG AGT ACC ATC AAA GAA GAT TAT TTG ATG CTG ATG TCT AGA GAA GTG 104
Lys Ser Thr Ile Lys Glu Asp Tyr Leu Met Leu Met Ser Arg Glu Val
5 10 15

AGT GCT TTT GTG GGG ACT CTT TTC TTC ATT GGC TTG AGT TGC TAT GCG 152
Ser Ala Phe Val Gly Thr Leu Phe Phe Ile Gly Leu Ser Cys Tyr Ala
20 25 30

ATC TAT CAT GGC AAC ATG CCC GAT TAT TTG AGA CCG GCT TTG ATA GAC 200
Ile Tyr His Gly Asn Met Pro Asp Tyr Leu Arg Pro Ala Leu Ile Asp
35 40 45 50

ACT ATT AAG GCA GCG AGT GAT TCC ATC TAT TCC AGC TGC GAC TAC ATG 248
Thr Ile Lys Ala Ala Ser Asp Ser Ile Tyr Ser Ser Cys Asp Tyr Met
55 60 65

GAT TAT TTT TTG AAG GCT AGA AAG ATG TTA GAG GGG TTT GCT TGG TGG 296
Asp Tyr Phe Leu Lys Ala Arg Lys Met Leu Glu Gly Phe Ala Trp Trp
70 75 80

AGC ATG TTC AAA GCG GAG AGC ATG GGC TTA AAT AAG GGG TTT ATG GTT 344
Ser Met Phe Lys Ala Glu Ser Met Gly Leu Asn Lys Gly Phe Met Val
85 90 95

GCG GGC TGG GTA GCG TTT ATC ATC TAT AAC GCT CTT AGC GGG ATA GCC 392
Ala Gly Trp Val Ala Phe Ile Ile Tyr Asn Ala Leu Ser Gly Ile Ala
100 105 110

ATC AGC AGG CTG AGC GCT CAA ATC ATT TAT TGG TTA TCA AAA TAT TTT 440
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Val Lys Lys Arg Leu Asp Leu Arg Lys Gin Asn Ser lie Glu Lys Glu 
55 60 65

GAA AAG CAA AAA TTC CAA AAA TTC GCC CTG GGC TTG GAA ATG TCT TTC 296
Glu Lys Gln Lys Phe Gln Lys Phe Ala Leu Gly Leu Glu Met Ser Phe
70 75 80

AAT GTG TGG CGT TTA GGA GGG TAT GGG GTT TTA CTA GGC ATT TTA GGA 344
Asn Val Trp Arg Leu Gly Gly Tyr Gly Val Leu Leu Gly Ile Leu Gly
85 90 95

ACG CTT TTA TTC TTG CAT CTT TTT AAC GGG TTA ATC TTT CTT ATT GGC 392
Thr Leu Leu Phe Leu His Leu Phe Asn Gly Leu Ile Phe Leu Ile Gly
100 105 110

GTG TTT GTG AGC TCG CTC TCT AGC GCG TTA TTA CGA TTT TTG AAT AAT 440
Val Phe Val Ser Ser Leu Ser Ser Ala Leu Leu Arg Phe Leu Asn Asn
115 120 125 130

AAT GGT AAG TTT TGACACAAAC TCACATGGAT TTTAACCCCT TTAATCCTCT TTTAA 497
Asn Gly Lys Phe

TTTTTAAT 505

(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 134 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:



Met Cys Gln Ile Gln Cys Leu Leu Ile Leu Leu Ser Ile Asn Ile Val
1 5 10 15

Ser Ala Ile Ile Val Tyr Phe Phe Gln Ala Phe Gln Gly Val Leu Asn
20 25 30

Phe Glu Gly Gly Phe Leu Gly Phe Phe Ile Val Ala Leu Ser Ser Tyr
35 40 45

Tyr Gly Val Lys Lys Arg Leu Asp Leu Arg Lys Gln Asn Ser Ile Glu
50 55 60

Lys Glu Glu Lys Gln Lys Phe Gln Lys Phe Ala Leu Gly Leu Glu Met
65 70 75 80

Ser Phe Asn Val Trp Arg Leu Gly Gly Tyr Gly Val Leu Leu Gly Ile
85 90 95

Leu Gly Thr Leu Leu Phe Leu His Leu Phe Asn Gly Leu Ile Phe Leu
100 105 110

Ile Gly Val Phe Val Ser Ser Leu Ser Ser Ala Leu Leu Arg Phe Leu
115 120 125

Asn Asn Asn Gly Lys Phe
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Gly Val Gly Thr Asp Phe Leu Tyr Asn Val Tyr Glu Arg Ser Arg Arg 

355 360 365 

Arg Glu Arg Thr Thr Ile Gly Leu Phe Phe Gly Ala Gln Ile Ala Gly
370 375 380

Gln Thr Trp Ser Thr Asn Val Thr Asn Leu Leu Ser Gly Gln Arg Pro
385 390 395 400

Asp Val Lys Ser Ser Ser Phe Gln Phe Leu Phe Asp Leu Gly Val Arg
405 410 415

Thr Asn Phe Ala Lys Thr Asn Phe Asn Lys His Arg Leu Asp Gln Gly
420 425 430

Ile Glu Phe Gly Val Lys Ile Pro Val Ile Ala His Lys Tyr Phe Ala
435 440 445

Thr Gln Gly Ser Ser Ala Ser Tyr Met Arg Asn Phe Ser Phe Tyr Val
450 455 460

Gly Tyr Ser Val Gly Phe 
465 470 



(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51... 452
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

AAGACCCTAA ATACACACAA AATAAAACAA AATTAATCCA ATCAAATCCC ATG TGC 56
Met Cys
1

CAA ATC CAA TGC TTG CTT ATT TTA CTT TCT ATC AAT ATA GTT AGC GCG 104
Gln Ile Gln Cys Leu Leu Ile Leu Leu Ser Ile Asn Ile Val Ser Ala
5 10 15

ATC ATC GTT TAT TTT TTC CAA GCA TTT CAA GGG GTT TTG AAT TTT GAA 152
Ile Ile Val Tyr Phe Phe Gln Ala Phe Gln Gly Val Leu Asn Phe Glu
20 25 30

GGG GGT TTT TTA GGG TTT TTT ATC GTG GCG TTG TCT TCG TAT TAC GGC 200
Gly Gly Phe Leu Gly Phe Phe Ile Val Ala Leu Ser Ser Tyr Tyr Gly
35 40 45 50

GTT AAA AAG CGT TTG GAT TTA AGG AAA CAA AAT TCA ATA GAA AAA GAA 248
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:

Met Ser Leu Cys Arg Ala Glu Glu Asp Gly Ala Phe Phe Val Ile Asp
1 5 10 15

Tyr Gln Thr Ser Leu Ala Arg Gln Glu Leu Lys Asn Pro Gly Phe Thr
20 25 30

Gln Ala Gln Glu Leu Arg Gln Leu Ile Arg Asp Gly Ala Val Arg Leu
35 40 45

Gln Thr Ser Ala Ile Pro Leu Ser Tyr Tyr Leu Asp Ile Leu Gly Asn
50 55 60

Lys Thr Ala Thr Leu Leu Arg Glu Ser Leu Lys Asn Asn Ala Gln Pro
65 70 75 80

Ser Gln Pro Asn Ala Gln Pro Pro Gln Gln Asn Gly Pro Ser Asn Gln
85 90 95

Ala Leu Ala Asn Leu Glu Gln Ser Leu Gly Ile Leu Gly Lys Leu Leu
100 105 110

Asp Leu Ser Gln Gln Tyr Ala Ser Gln Gly Val Ile Lys Pro Leu Val
115 120 125

Val Asp Val Gly Lys Glu Gln Ile Gly Ile Thr Asp Ser Met Leu Leu
130 135 140

Val Ala Gln Asn Ile Val Leu Ala Leu Gly Gln Val Asp Leu Ser Lys
145 150 155 160

Ile Gln Gln Asn Asn Asn Glu Gln Leu Tyr Glu Asn Ile Met Lys Val
165 170 175

Met Leu Leu Gly Ala Gly Gly Thr Asn Gly Ala Tyr Asn Gly Val Ser
180 185 190

Val Gly Asp Ile Ala Thr Gly Met Gln Asn Phe Ser Ser Gln Thr Gly
195 200 205

Leu Ile Gly Ala Asn Ser Thr Val Ser Glu Leu Asn Ala Leu Ile Lys
210 215 220

Ser Gly Ile Ser Leu Asp Arg Glu Thr Leu Gly Leu Gly Ser Phe Ile
225 230 235 240

Glu Lys Asn Ile Cys Ser Gly Ala Ser Ser Cys Phe Ser Gly Asn Gln
245 250 255

Leu Ile Tyr Lys Lys Gly Leu Asp Arg Thr Ile Asn Ile Ile Asn Thr
260 265 270

Val Leu Gly Gln Phe Glu Ser Ser Ala Ser Ser Leu Tyr Lys Ile Ser
275 280 285

Tyr Ile Pro Asn Leu Phe Ser Leu Lys Asp Tyr Gln Ser Ala Ser Met
290 295 300

Asn Gly Phe Gly Ala Lys Met Gly Tyr Lys Gln Phe Phe Thr His Lys
305 310 315 320

Lys Asn Val Gly Leu Arg Tyr Tyr Gly Phe Leu Asp Tyr Gly Tyr Ala 

325 330 335 

Asn Phe Gly Asp Thr Asn Leu Lys Val Gly Ala Asn Leu Val Thr Tyr 
340 345 350 
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Gly Gin Phe Glu Ser Ser Ala Ser Ser Leu Tyr Lys lie Ser Tyr lie 
275 280 285 290 

CCT AAC CTC TTT TCG CTC AAG GAT TAC CAG TCA GCG AGC ATG AAC GGC 968
Pro Asn Leu Phe Ser Leu Lys Asp Tyr Gln Ser Ala Ser Met Asn Gly
295 300 305

TTT GGG GCT AAG ATG GGC TAT AAA CAA TTT TTC ACC CAT AAG AAA AAT 1016
Phe Gly Ala Lys Met Gly Tyr Lys Gln Phe Phe Thr His Lys Lys Asn
310 315 320 

GTT GGC TTA AGG TAT TAC GGG TTT TTG GAT TAT GGC TAT GCG AAC TTT 1064 
Val Gly Leu Arg Tyr Tyr Gly Phe Leu Asp Tyr Gly Tyr Ala Asn Phe 
325 330 335 

GGC GAT ACG AAT TTA AAA GTG GGG GCG AAT CTT GTT ACT TAT GGG GTA 1112 
Gly Asp Thr Asn Leu Lys Val Gly Ala Asn Leu Val Thr Tyr Gly Val 
340 345 350 

GGA ACG GAT TTT TTA TAC AAT GTG TAT GAA CGC TCT AGA AGG AGG GAA 1160 
Gly Thr Asp Phe Leu Tyr Asn Val Tyr Glu Arg Ser Arg Arg Arg Glu 
355 360 365 370 

AGG ACT ACG ATC GGT CTT TTC TTT GGC GCT CAA ATT GCA GGG CAA ACT 1208
Arg Thr Thr Ile Gly Leu Phe Phe Gly Ala Gln Ile Ala Gly Gln Thr
375 380 385

TGG AGC ACT AAT GTA ACG AAC TTA TTG AGC GGG CAA AGG CCT GAT GTC 1256
Trp Ser Thr Asn Val Thr Asn Leu Leu Ser Gly Gln Arg Pro Asp Val
390 395 400

AAG TCC AGT TCG TTC CAA TTC TTG TTT GAT TTG GGC GTG CGC ACC AAC 1304
Lys Ser Ser Ser Phe Gln Phe Leu Phe Asp Leu Gly Val Arg Thr Asn
405 410 415

TTT GCA AAA ACC AAT TTC AAT AAG CAC AGG CTA GAC CAA GGG ATA GAA 1352
Phe Ala Lys Thr Asn Phe Asn Lys His Arg Leu Asp Gln Gly Ile Glu
420 425 430

TTT GGG GTG AAA ATC CCT GTT ATC GCT CAT AAA TAT TTT GCA ACC CAA 1400
Phe Gly Val Lys Ile Pro Val Ile Ala His Lys Tyr Phe Ala Thr Gln
435 440 445 450

GGC TCA AGC GCG AGC TAT ATG AGG AAT TTT AGC TTC TAT GTG GGC TAT 1448
Gly Ser Ser Ala Ser Tyr Met Arg Asn Phe Ser Phe Tyr Val Gly Tyr
455 460 465 

TCA GTC GGT TTT TAAGGAAGGC TCTTGATGAA AAATACCAAT ACAAAAGAGA TAAAG 1505 
Ser Val Gly Phe 
470 

AATACAAG 1513 
(2) INFORMATION FOR SEQ ID NO: 174: 
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55 60 65 

GCA ACT CTT TTG CGT GAA AGC CTG AAA AAC AAT GCA CAA CCA TCA CAA 296 
Ala Thr Leu Leu Arg Glu Ser Leu Lys Asn Asn Ala Gln Pro Ser Gln
70 75 80

CCA AAC GCG CAA CCA CCG CAA CAA AAC GGA CCA TCA AAC CAA GCC TTA 344
Pro Asn Ala Gln Pro Pro Gln Gln Asn Gly Pro Ser Asn Gln Ala Leu
85 90 95

GCC AAT TTA GAG CAA TCT CTA GGG ATT TTA GGA AAA CTA TTG GAT CTA 392
Ala Asn Leu Glu Gln Ser Leu Gly Ile Leu Gly Lys Leu Leu Asp Leu
100 105 110

TCC CAA CAA TAC GCT AGT CAG GGT GTC ATT AAG CCT TTG GTG GTG GAT 440
Ser Gln Gln Tyr Ala Ser Gln Gly Val Ile Lys Pro Leu Val Val Asp
115 120 125 130

GTG GGG AAA GAA CAA ATC GGT ATC ACG GAT AGC ATG CTC TTG GTG GCT 488
Val Gly Lys Glu Gln Ile Gly Ile Thr Asp Ser Met Leu Leu Val Ala
135 140 145

CAA AAC ATC GTT TTA GCT TTA GGG CAA GTG GAT TTG AGC AAA ATC CAA 536
Gln Asn Ile Val Leu Ala Leu Gly Gln Val Asp Leu Ser Lys Ile Gln
150 155 160

CAA AAC AAT AAC GAA CAG CTA TAC GAA AAT ATT ATG AAA GTC ATG CTT 584
Gln Asn Asn Asn Glu Gln Leu Tyr Glu Asn Ile Met Lys Val Met Leu
165 170 175

TTA GGC GCG GGC GGG ACT AAT GGG GCG TAT AAT GGC GTG AGT GTG GGC 632
Leu Gly Ala Gly Gly Thr Asn Gly Ala Tyr Asn Gly Val Ser Val Gly
180 185 190

GAC ATT GCC ACG GGC ATG CAA AAT TTX TCT TCG CAA ACG GGC TTG ATA 680
Asp Ile Ala Thr Gly Met Gln Asn Phe Ser Ser Gln Thr Gly Leu Ile
195 200 205 210

GGG GCT AAT TCT ACG GTT AGC GAG CTG AAT GCT TTG ATT AAG AGC GGG 728
Gly Ala Asn Ser Thr Val Ser Glu Leu Asn Ala Leu Ile Lys Ser Gly
215 220 225

ATT TCT TTG GAT CGT GAG ACT TTG GGG TTA GGG AGT TTT ATT GAA AAA 776
Ile Ser Leu Asp Arg Glu Thr Leu Gly Leu Gly Ser Phe Ile Glu Lys
230 235 240

AAT ATC TGT AGC GGT GCA TCG TCT TGT TTT AGT GGG AAT CAG CTT ATC 824
Asn Ile Cys Ser Gly Ala Ser Ser Cys Phe Ser Gly Asn Gln Leu Ile
245 250 255

TAT AAG AAA GGG CTA GAC AGA ACC ATA AAC ATC ATT AAT ACG GTA TTA 872
Tyr Lys Lys Gly Leu Asp Arg Thr Ile Asn Ile Ile Asn Thr Val Leu
260 265 270 

GGT CAG TTT GAA TCT TCG GCT AGT TCT CTT TAT AAG ATT TCT TAT ATC 920 
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Asn Met Gin Val Leu Ala Lys Ala Leu Lys Glu Lys Asn His Lys Leu 

290 295 300 

Val Ser Gly Gly Thr Ser Asn His Leu Leu Leu Met Asp Phe Leu Asp 
305 310 315 320 

Lys Pro Tyr Ser Gly Lys Asp Ala Asp Ile Ala Leu Gly Asn Ala Gly
325 330 335

Ile Thr Val Asn Lys Asn Thr Ile Pro Gly Glu Thr Arg Ser Pro Phe
340 345 350

Val Thr Ser Gly Ile Arg Ile Gly Ser Ala Ala Leu Ser Ala Arg Gly
355 360 365

Met Gly Ala Lys Glu Phe Glu Ile Ile Gly Asn Lys Ile Ser Asp Ile
370 375 380

Leu Asn Asp Ile Asn Asn Val Ser Leu Gln Leu His Val Lys Glu Glu
385 390 395 400

Leu Lys Ala Met Val Asn Gln Phe Pro Val Tyr His Gln Pro Ile Phe
405 410 415 



(2) INFORMATION FOR SEQ ID NO: 173: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1513 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1460 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

TGAAAAACCA CTCCTTTAAA AAAACGATCG CTCTTTCCTT ACTAGCGAGC ATG TCT 56 

Met Ser 
1 

TTG TGC AGG GCT GAA GAA GAT GGG GCG TTT TTT GTC ATA GAT TAC CAG 104 
Leu Cys Arg Ala Glu Glu Asp Gly Ala Phe Phe Val Ile Asp Tyr Gln
5 10 15

ACG AGT TTG GCC AGA CAG GAA TTG AAA AAT CCA GGC TTC ACC CAA GCG 152
Thr Ser Leu Ala Arg Gln Glu Leu Lys Asn Pro Gly Phe Thr Gln Ala
20 25 30

CAA GAA TTA AGG CAG TTG ATC AGA GAT GGG GCT GTG AGG TTG CAA ACT 200
Gln Glu Leu Arg Gln Leu Ile Arg Asp Gly Ala Val Arg Leu Gln Thr
35 40 45 50

TCT GCC ATT CCC TTA TCC TAC TAC TTG GAT ATT TTA GGG AAT AAA ACA 248
Ser Ala Ile Pro Leu Ser Tyr Tyr Leu Asp Ile Leu Gly Asn Lys Thr
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GCC ATG GTC AAT CAA TTC CCT GTG TAG CAC CAA CCT ATT TTT TAAGGGAGT 13 07 
Ala Met Val Asn Gln Phe Pro Val Tyr His Gln Pro Ile Phe
405 410 415

CAAGATGACA GAAATGGAAT TAAAGCTCAT TAAGATAGAC ACAA 1351

(2) INFORMATION FOR SEQ ID NO: 172:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 416 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

Met Ala Tyr Phe Leu Glu Gln Thr Asp Ser Glu Ile Phe Glu Leu Ile
1 5 10 15

Phe Glu Glu Tyr Lys Arg Gln Asn Glu His Leu Glu Met Ile Ala Ser
20 25 30

Glu Asn Tyr Thr Phe Ala Ser Val Met Glu Ala Met Gly Ser Val Leu
35 40 45

Thr Asn Lys Tyr Ala Glu Gly Tyr Pro Asn Lys Arg Tyr Tyr Gly Gly
50 55 60

Cys Glu Val Val Asp Lys Ile Glu Ser Leu Ala Ile Glu Arg Ala Lys
65 70 75 80

Lys Leu Phe Asn Cys Gln Phe Ala Asn Val Gln Ala His Ser Gly Ser
85 90 95

Gln Ala Asn Asn Ala Val Tyr His Ala Leu Leu Lys Pro Tyr Asp Lys
100 105 110

Ile Leu Gly Met Asp Leu Ser Cys Gly Gly His Leu Thr His Gly Ala
115 120 125

Lys Val Ser Leu Thr Gly Lys His Tyr Gln Ser Phe Ser Tyr Gly Val
130 135 140

Asn Leu Asp Gly Tyr Ile Asp Tyr Glu Glu Ala Leu Lys Ile Ala Gln
145 150 155 160

Ser Val Lys Pro Glu Ile Ile Val Cys Gly Phe Ser Ala Tyr Pro Arg
165 170 175

Glu Ile Asp Phe Lys Lys Phe Arg Glu Ile Ala Asp Glu Val Gly Ala
180 185 190

Leu Leu Leu Gly Asp Ile Ala His Val Ala Gly Leu Val Val Thr Gly
195 200 205

Glu His Ala His Pro Phe Pro His Cys His Val Val Ser Ser Thr Thr
210 215 220

His Lys Thr Leu Arg Gly Pro Arg Gly Gly Ile Ile Leu Thr Asn Asp
225 230 235 240

Glu Glu Ile Ala Ala Lys Ile Asp Lys Ala Ile Phe Pro Gly Thr Gln
245 250 255

Gly Gly Pro Leu Met His Val Ile Ala Ala Lys Ala Val Gly Phe Lys
260 265 270

Glu Asn Leu Lys Pro Glu Phe Lys Ala Tyr Ala Gln Leu Val Lys Ser
275 280 285
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Asp Phe Lys Lys Phe Arg Glu lie Ala Asp Glu Val Gly Ala Leu Leu 
180 185 190 

TTA GGC GAT ATA GCC CAT GTG GCA GGG CTT GTG GTA ACC GGT GAG CAT 680 
Leu Gly Asp Ile Ala His Val Ala Gly Leu Val Val Thr Gly Glu His
195 200 205 210

GCC CAT CCT TTC CCG CAT TGC CAT GTG GTT TCA AGC ACC ACT CAT AAG 728
Ala His Pro Phe Pro His Cys His Val Val Ser Ser Thr Thr His Lys
215 220 225

ACC TTA AGA GGG CCT AGA GGG GGG ATT ATT TTA ACT AAT GAT GAA GAG 776
Thr Leu Arg Gly Pro Arg Gly Gly Ile Ile Leu Thr Asn Asp Glu Glu
230 235 240

ATA GCG GCT AAG ATT GAC AAA GCG ATT TTT CCA GGA ACT CAA GGC GGG 824
Ile Ala Ala Lys Ile Asp Lys Ala Ile Phe Pro Gly Thr Gln Gly Gly
245 250 255

CCT TTG ATG CAT GTG ATT GCT GCT AAA GCG GTG GGT TTT AAA GAG AAT 872
Pro Leu Met His Val Ile Ala Ala Lys Ala Val Gly Phe Lys Glu Asn
260 265 270

CTA AAA CCA GAA TTT AAA GCT TAT GCA CAA TTA GTG AAA TCT AAC ATG 920
Leu Lys Pro Glu Phe Lys Ala Tyr Ala Gln Leu Val Lys Ser Asn Met
275 280 285 290

CAA GTT TTG GCT AAA GCG TTA AAA GAA AAA AAC CAT AAG TTA GTG AGT 968
Gln Val Leu Ala Lys Ala Leu Lys Glu Lys Asn His Lys Leu Val Ser
295 300 305

GGT GGC ACT TCT AAC CAT TTG CTT TTA ATG GAT TTT TTA GAT AAG CCT 1016
Gly Gly Thr Ser Asn His Leu Leu Leu Met Asp Phe Leu Asp Lys Pro
310 315 320

TAT AGC GGG AAA GAC GCT GAT ATT GCA TTA GGG AAT GCC GGA ATC ACC 1064
Tyr Ser Gly Lys Asp Ala Asp Ile Ala Leu Gly Asn Ala Gly Ile Thr
325 330 335

GTG AAT AAA AAC ACC ATT CCT GGT GAA ACG CGC AGC CCT TTT GTA ACG 1112
Val Asn Lys Asn Thr Ile Pro Gly Glu Thr Arg Ser Pro Phe Val Thr
340 345 350

AGC GGG ATA AGG ATT GGC TCA GCG GCA TTG AGC GCA AGG GGC ATG GGA 1160
Ser Gly Ile Arg Ile Gly Ser Ala Ala Leu Ser Ala Arg Gly Met Gly
355 360 365 370

GCT AAG GAA TTT GAA ATC ATA GGG AAT AAA ATA TCA GAT ATT TTG AAT 1208
Ala Lys Glu Phe Glu Ile Ile Gly Asn Lys Ile Ser Asp Ile Leu Asn
375 380 385

GAT ATT AAT AAT GTT AGT TTG CAA TTG CAT GTG AAA GAA GAA TTG AAA 1256
Asp Ile Asn Asn Val Ser Leu Gln Leu His Val Lys Glu Glu Leu Lys
390 395 400 
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(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

CCAGCGATGC GTCCTGTTAA AAACGATTTT AATGTGGAGA GTGAAGAATA ATG GCG 56
Met Ala
1

TAT TTT TTA GAA CAA ACG GAT AGT GAA ATT TTT GAG CTT ATC TTT GAA 104
Tyr Phe Leu Glu Gln Thr Asp Ser Glu Ile Phe Glu Leu Ile Phe Glu
5 10 15

GAA TAC AAG CGG CAA AAT GAG CAT TTA GAA ATG ATA GCG AGC GAG AAT 152
Glu Tyr Lys Arg Gln Asn Glu His Leu Glu Met Ile Ala Ser Glu Asn
20 25 30

TAC ACT TTT GCA AGC GTT ATG GAG GCT ATG GGG AGT GTT TTA ACG AAT 200
Tyr Thr Phe Ala Ser Val Met Glu Ala Met Gly Ser Val Leu Thr Asn
35 40 45 50

AAA TAC GCT GAA GGC TAC CCT AAC AAG CGC TAT TAT GGA GGC TGT GAA 248
Lys Tyr Ala Glu Gly Tyr Pro Asn Lys Arg Tyr Tyr Gly Gly Cys Glu
55 60 65

GTG GTG GAT AAA ATA GAA AGC CTA GCC ATA GAA AGG GCT AAA AAG CTT 296
Val Val Asp Lys Ile Glu Ser Leu Ala Ile Glu Arg Ala Lys Lys Leu
70 75 80

TTT AAT TGC CAG TTC GCT AAC GTG CAA GCG CAT TCA GGC TCA CAA GCC 344
Phe Asn Cys Gln Phe Ala Asn Val Gln Ala His Ser Gly Ser Gln Ala
85 90 95

AAT AAC GCT GTC TAT CAC GCT CTT TTA AAG CCT TAT GAC AAG ATT TTA 392
Asn Asn Ala Val Tyr His Ala Leu Leu Lys Pro Tyr Asp Lys Ile Leu
100 105 110

GGC ATG GAT TTA AGC TGT GGA GGG CAT TTA ACG CAT GGC GCT AAA GTG 440
Gly Met Asp Leu Ser Cys Gly Gly His Leu Thr His Gly Ala Lys Val
115 120 125 130

AGT TTA ACC GGC AAG CAT TAT CAG AGC TTT TCT TAT GGC GTG AAT TTG 488
Ser Leu Thr Gly Lys His Tyr Gln Ser Phe Ser Tyr Gly Val Asn Leu
135 140 145

GAT GGC TAT ATT GAT TAT GAA GAG GCG CTA AAA ATC GCT CAA AGC GTT 536
Asp Gly Tyr Ile Asp Tyr Glu Glu Ala Leu Lys Ile Ala Gln Ser Val
150 155 160

AAG CCA GAA ATC ATC GTG TGC GGG TTT TCA GCC TAT CCA AGG GAG ATT 584
Lys Pro Glu Ile Ile Val Cys Gly Phe Ser Ala Tyr Pro Arg Glu Ile
165 170 175

GAT TTT AAG AAA TTT AGA GAA ATC GCT GAT GAA GTG GGG GCG TTA CTA 632 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 9: 

CTTTTACTTT ATAATTATCG TTGGCATTTT AATATTCAAA GGAGCTTGAA ATG AGA 56
Met Arg
1

ATT TCT CTT TTA GCT GTA ATT TTA GCG TTA TTG TTT GTG GCT TGC CAC 104
Ile Ser Leu Leu Ala Val Ile Leu Ala Leu Leu Phe Val Ala Cys His
5 10 15

GAA ACT AAA AAA CAA ATC TTA CAA AAC GAA GCC GAT AGC ACC CCT TCA 152
Glu Thr Lys Lys Gln Ile Leu Gln Asn Glu Ala Asp Ser Thr Pro Ser
20 25 30

GAA AAA ACC ATT TGG CAA CCT GAA CAA AAA TAAAAATTGT AAAAATACTC AAA 205
Glu Lys Thr Ile Trp Gln Pro Glu Gln Lys
35 40

GGCATTTTTT AAAATAAACG CAATAAAAAA 235

(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

Met Arg Ile Ser Leu Leu Ala Val Ile Leu Ala Leu Leu Phe Val Ala
1 5 10 15

Cys His Glu Thr Lys Lys Gln Ile Leu Gln Asn Glu Ala Asp Ser Thr
20 25 30

Pro Ser Glu Lys Thr Ile Trp Gln Pro Glu Gln Lys
35 40 

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51... 1298
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Gin Gly Thr Pro 

Ala His Ile Gln
180

Leu Val Tyr Met
195

Asp Tyr Val Met
210

Gly Asp Phe Tyr
225

Pro Gln Ser Gln

Ala Tyr Gly Phe
260

Val Ile Leu Glu
275

Ser Glu Asn Met
290

Gly Trp Gly Gly
305

Asp Ser Phe Ala

Gln Ser Arg Tyr
340

Lys Glu Lys Glu
355

Leu Ile Tyr Leu
370

His Gln Ala Ile
385

His Phe Lys Thr

Leu Trp Leu Ala
420

Leu Ile Asp Arg
435 

Lys Thr Ser Ser 
450 



Val Leu Ile Thr
165

Ser Ser Phe Val

His Asp Lys Asp
200

Pro Ile Lys Asp
215

Thr Asn Ala Arg
230

Lys Thr Pro Gln
245

Leu Arg Asp Ala

Glu Lys Asp Phe
280

Ala Tyr Phe Ile
295

Leu Leu Gly Asn
310

Asn Phe Gly Ile
325

Ala Asn Asn Tyr

Asp Tyr Ile Leu
360

Lys Gly His Ile
375 

Val Ala His Ser 
390 

Leu Ser His Lys 
405 

Glu Glu His Asn 

Val Leu Gly Met 
440 

Pro Leu Asn Ala 
455 



His Phe Asn Leu 
170 

Tyr Gly Trp Ile
185

Ile Glu Leu Leu

Lys Ile Pro Leu
220

Val Gly Glu Leu
235

Lys Pro Gln Lys
250

Lys Gly Tyr Ala
265

Phe Val Phe Pro

Asp Thr Met Leu
300

Arg Asp Cys Ser
315

Leu Leu Pro Arg
330

Val Asp Leu Ser
345

Lys Asn Ala Thr

Met Leu Tyr Leu
380

Ile Trp Ser Val
395

Ile Gly Gly Val
410 

Gly Ala Phe Ser 
425 

Ser Asp Leu Lys 
Asn 



Asp Lys Thr Tyr 
175 

Lys Val Ser Asp 
190 

Thr His Leu Lys 
205 

Tyr Thr Asp Tyr 

Phe Ala Leu Ile
240

Lys Glu Leu Lys
255

Ala Leu Gln Ser
270

Lys Ala Phe Asn
285

Gly Gln Lys Tyr

Ala Phe Thr Arg
320

Asn Ser Tyr Ala
335

Ser Met Lys Ala
350

Pro Phe Gly Thr
365

Gly Ala His Asn

Gln Thr Gln Lys
400

Val Ile Thr Ser
415 

Lys Lys Lys Leu 
430 

Asp Phe Val Asn 
445 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 235 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 182 
(D) OTHER INFORMATION: 
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GTC GCT CAC AGC ATT TGG TCG GTG CAA ACC CAA AAG CAT TTT AAA ACC 12 5 7 

Val Ala His Ser lie Trp Ser Val Gin Thr Gin Lys His Phe Lys Thr 
390 395 400 

TTG AGC CAT AAA ATA GGA GGC GTG GTG ATC ACT TCG TTA TGG TTA GCT 13 05 

Leu Ser His Lys lie Gly Gly Val Val lie Thr Ser Leu Trp Leu Ala 
405 410 415 420 

GAA GAG CAT AAT GGG GCG TTT TCT AAA AAG AAA TTA TTG ATT GAT AGG 13 53 

Glu Glu His Asn Gly Ala Phe Ser Lys Lys Lys Leu Leu lie Asp Arg 
425 430 435 

GTG CTT GGA ATG AGC GAT TTG AAA GAT TTT GTC AAT AAA ACT TCA AGC 14 01 

Val Leu Gly Met Ser Asp Leu Lys Asp Phe Val Asn Lys Thr Ser Ser 
440 445 450 

CCT TTA AAT GCG AAT TGATTTTCTT ATATTATGAT TACGATTTAT CAATTTAAAA C 14 57 
Pro Leu Asn Ala Asn 
455 

ATTTGGAGAA AG A 147 0 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Met Arg Tyr Phe Leu Val Val Phe Leu Phe Leu Phe Val Gly Cys Thr 

1 5 10 15

Lys Lys Asp Phe Thr Leu Lys Asp Leu Ser Leu Pro Gln Glu Ala Ser

20 25 30

Ser Tyr Leu Ala Ser Ser Gln Asn Gly Ser Asn Asn Asn Gln Ser Ile

35 40 45

Asp Pro Gln Ala Leu Arg Glu Asn Leu Lys Glu Ser Tyr Leu Lys Ala

50 55 60 

Trp Tyr Ser Pro Trp Leu Asp Met Lys Val Lys Ser Asn Lys Lys Glu 
65 70 75 80 

Val Phe Trp Ile Leu Lys Glu Met Asn Lys Ser Thr Gly Tyr Gly Glu

85 90 95

Asp Leu Lys Pro Asn Ala Lys Ala Phe Asn Asp Ala Leu Ile Lys Ser

100 105 110

Met Asp Ile Glu His Tyr Pro Ser Val Lys Ile Arg Ala Val Val Ala

115 120 125 

Arg Asp Ser Asp Val Arg Ala Val Pro Thr Asn Lys Pro Tyr Tyr Leu 

130 135 140 

Ser Gln Lys Gly Tyr Pro Phe Asp Arg Tyr Gln Asn Ser Leu Ile Phe
145 150 155 160 
GTT TTA ATC ACG CAT TTT AAT CTA GAT AAA ACT TAT GCC CAC ATT CAA 585
Val Leu Ile Thr His Phe Asn Leu Asp Lys Thr Tyr Ala His Ile Gln
165 170 175 180 

AGC AGT TTT GTT TAT GGC TGG ATC AAA GTT AGC GAT TTA GTC TAC ATG 633 
Ser Ser Phe Val Tyr Gly Trp Ile Lys Val Ser Asp Leu Val Tyr Met
185 190 195 

CAC GAT AAA GAC ATA GAG CTT TTA ACC CAT CTT AAA GAT TAT GTC ATG 681 
His Asp Lys Asp Ile Glu Leu Leu Thr His Leu Lys Asp Tyr Val Met
200 205 210 

CCT ATA AAA GAT AAA ATC CCC CTT TAT ACA GAC TAT GGG GAT TTT TAC 729

Pro Ile Lys Asp Lys Ile Pro Leu Tyr Thr Asp Tyr Gly Asp Phe Tyr
215 220 225 

ACC AAC GCC AGA GTG GGC GAA TTG TTC GCT CTC ATC CCC CAA AGT CAA 777 
Thr Asn Ala Arg Val Gly Glu Leu Phe Ala Leu Ile Pro Gln Ser Gln
230 235 240 

AAA ACA CCT CAA AAA CCC CAA AAA AAG GAA TTG AAA GCC TAT GGT TTT 825 
Lys Thr Pro Gln Lys Pro Gln Lys Lys Glu Leu Lys Ala Tyr Gly Phe
245 250 255 260 

TTG AGA GAC GCT AAG GGT TAT GCA GCT TTA CAA AGC GTG ATC TTA GAA 873 
Leu Arg Asp Ala Lys Gly Tyr Ala Ala Leu Gln Ser Val Ile Leu Glu
265 270 275 

GAA AAG GAT TTT TTT GTT TTC CCT AAG GCT TTT AAC AGC GAG AAC ATG 921
Glu Lys Asp Phe Phe Val Phe Pro Lys Ala Phe Asn Ser Glu Asn Met 
280 285 290 

GCG TAT TTT ATA GAC ACC ATG TTA GGG CAA AAA TAC GGC TGG GGC GGG 969 
Ala Tyr Phe Ile Asp Thr Met Leu Gly Gln Lys Tyr Gly Trp Gly Gly
295 300 305 

CTA TTG GGT AAT AGG GAT TGC TCG GCT TTC ACC AGA GAT AGT TTT GCT 1017 
Leu Leu Gly Asn Arg Asp Cys Ser Ala Phe Thr Arg Asp Ser Phe Ala 
310 315 320 

AAT TTT GGT ATT TTG CTC CCC AGA AAT TCC TAT GCG CAA AGC CGT TAT 1065

Asn Phe Gly Ile Leu Leu Pro Arg Asn Ser Tyr Ala Gln Ser Arg Tyr
325 330 335 340 

GCG AAC AAT TAT GTG GAT TTA AGC TCT ATG AAA GCC AAA GAA AAA GAA 1113 
Ala Asn Asn Tyr Val Asp Leu Ser Ser Met Lys Ala Lys Glu Lys Glu 
345 350 355 

GAC TAC ATC CTT AAA AAC GCC ACG CCT TTT GGA ACG CTC ATC TAT TTA 1161 
Asp Tyr Ile Leu Lys Asn Ala Thr Pro Phe Gly Thr Leu Ile Tyr Leu
360 365 370 

AAA GGG CAT ATC ATG CTT TAT TTA GGC GCA CAC AAC CAT CAA GCG ATA 1209
Lys Gly His Ile Met Leu Tyr Leu Gly Ala His Asn His Gln Ala Ile
375 380 385
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 46...1416
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

TTTAAGTU^C TATTTGCGCA TTTGATGTTA AGGTTTCTCT AAAGC ATG CGT TAT TTT 57 

Met Arg Tyr Phe 
1 

CTT GTA GTT TTC TTG TTT TTG TTT GTG GGT TGC ACA AAA AAG GAT TTC 105
Leu Val Val Phe Leu Phe Leu Phe Val Gly Cys Thr Lys Lys Asp Phe 
5 10 15 20 

ACG CTC AAA GAT TTA TCC TTG CCC CAA GAG GCT TCA AGC TAT CTT GCA 153 
Thr Leu Lys Asp Leu Ser Leu Pro Gln Glu Ala Ser Ser Tyr Leu Ala
25 30 35 

AGC TCT CAA AAT GGC AGT AAC AAC AAC CAA AGC ATT GAC CCC CAA GCG 201

Ser Ser Gln Asn Gly Ser Asn Asn Asn Gln Ser Ile Asp Pro Gln Ala
40 45 50 

TTA AGA GAA AAT CTG AAA GAG AGC TAT CTC AAA GCG TGG TAT TCC CCA 249

Leu Arg Glu Asn Leu Lys Glu Ser Tyr Leu Lys Ala Trp Tyr Ser Pro 
55 60 65 

TGG CTA GAT ATG AAA GTC AAA AGC AAT AAA AAA GAA GTG TTT TGG ATC 297

Trp Leu Asp Met Lys Val Lys Ser Asn Lys Lys Glu Val Phe Trp Ile
70 75 80 

CTT AAG GAG ATG AAT AAA TCC ACC GGT TAT GGC GAA GAT CTA AAA CCC 345 
Leu Lys Glu Met Asn Lys Ser Thr Gly Tyr Gly Glu Asp Leu Lys Pro 
85 90 95 100 

AAC GCA AAA GCT TTC AAT GAC GCA CTC ATT AAG AGC ATG GAT ATT GAG 393 
Asn Ala Lys Ala Phe Asn Asp Ala Leu Ile Lys Ser Met Asp Ile Glu
105 110 115 

CAT TAC CCT AGC GTT AAG ATT AGG GCT GTT GTA GCG CGA GAT AGC GAT 441 
His Tyr Pro Ser Val Lys Ile Arg Ala Val Val Ala Arg Asp Ser Asp
120 125 130 

GTG AGG GCT GTG CCT ACT AAC AAA CCT TAT TAT CTT TCT CAA AAA GGC 489 
Val Arg Ala Val Pro Thr Asn Lys Pro Tyr Tyr Leu Ser Gln Lys Gly
135 140 145 

TAT CCT TTT GAT AGG TAT CAA AAT TCG CTG ATT TTT CAA GGC ACG CCG 537

Tyr Pro Phe Asp Arg Tyr Gln Asn Ser Leu Ile Phe Gln Gly Thr Pro
150 155 160 
Met Val Asn Asn

Leu Ser Gly Val 
260 

Leu Leu Asn Ser 
275 

Gly Leu Asn Ser 
250 

Lys Gly Tyr Leu 
305 

Gly Ser Asn Ala 

Asp Met Ala Leu 
340 

Ala Val Thr Leu 
355 

Ala lie Asn Phe 

370 

Asp Ala Phe Leu 
385 

Ala Gin Glu Thr 

Gin Met Tyr Gly 
420 

Lys Thr Lys Arg 
435 

His Ala Asn Leu 
450 

Ala Ser Gin Val 
465 

Tyr Asn Phe Tyr 

Leu Gly Phe Gly 
500 

Tyr Leu Lys Ser 
515 

Ala Ser Met Asn 
530 

Arg Ser Asn Phe 
545 

Pro Leu Phe Thr 

Val Asp Val Phe 
580 

lie Asn Phe 
595 



Leu Lys Ser Val 
245 

Glu Thr Met Ser 

Ser Thr Asp Leu 
280 

Ala Phe Ser Thr 
295 

Asn Ser Ser Ser 
310 

Val Val Gly Ala 
325 

Ala Asp Leu Ser 

Ala Thr Ala Ser 
360 

Leu Asn Ala Leu 
375 

Asn Val His Lys 
390 

Tyr Thr Pro Ser 
405 

Val Asp Ala Met 

Phe Gly Phe Arg 
440 

Ser Phe Val Gly 
455 

Asn Asn Phe Thr 
470 

Glu Ser Lys Glu 
485 

Leu Gly Gly Asp 

Gin Met His lie 
520 

Thr Ser Tyr Phe 
535 

Ser Lys His Ser 
550 

Asn Gin Phe Tyr 
565 

Tyr Lys Arg Asn 



Ser Val Ser Ala 
250 

Gin Gin Thr Ala 
265 

Ser Ser Val lie 

Leu Glu Ser Ala 
300 

Ala Thr lie Gly 
315 

Leu Asp Lys Ala 
330 

Ala Ala Asp Thr 
345 

Asp Ser Pro Thr 

Lys Ser Asn Leu 
380 

Asn lie Gin Thr 
395 

Val lie Asn Thr 
410 

Ala Gly Tyr Lys 
425 

Ser Tyr Gly Tyr 

Ser Gin Leu Gly 
460 

Tyr Gly Val Gly 
475 

Gly Tyr Asn Thr 
490 

Ser Phe He Val 
505 

Cys Asn Asn Thr 

Gin Met Pro Val 
540 

Gly He Glu Val 
555 

Lys Glu Arg Gly 
570 

Phe Ser He Tyr 
585 



Met Asn Thr Thr 
255 

Thr He Gly Asn 
270 

Pro Asn Ala Gin 
285 

Gin Asn Thr Leu 

Gin Leu Thr Asn 
320 

He Asn Gin Val 
335 

Gin Lys Thr Gin 
350 

Thr Thr Thr Asp 
365 

Met Ala Gin Lys 

Ala Val Ala Gin 
400 

Asn Asn Tyr Gly 
415 

Trp Phe Phe Gly 
430 

Tyr Ser Tyr Asn 
445 

He Met Glu Gly 

Phe Asp Val Leu 
480 

Ala Gly Leu Phe 
495 

Gin Gly Glu Ser 
510 

Ala Gly Cys Ser 
525 

Glu Phe Gly Phe 

Gly Phe Lys Leu 
560 

Val Asp Gly Ser 
575 

Phe Asn Tyr Met 
590 



(2) INFORMATION FOR SEQ ID NO: 167: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
TTC ACC AAC CAA TTC TAT AAA GAA AGG GGC GTA GAT GGA TCG GTA GAT 1784

Phe Thr Asn Gln Phe Tyr Lys Glu Arg Gly Val Asp Gly Ser Val Asp
565 570 575 

GTG TTC TAT AAA AGG AAT TTC TCT ATT TAT TTT AAC TAC ATG ATC AAC 1832

Val Phe Tyr Lys Arg Asn Phe Ser Ile Tyr Phe Asn Tyr Met Ile Asn
580 585 590 

TTC TAAGCCTTTC TATTCTTTCC AATAGAGGGT TTTCTCTCTG TTGGTTTCTT TTT 1888 

Phe 

595 

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 595 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Met Lys Lys Ser Phe Lys Lys Leu Gly Phe Val Ser Leu Ala Ala Ser 

1 5 10 15 

Gly Val Leu Leu Gly Ser Met Asn Ala Thr Asp Leu Glu Thr Tyr Ala 

20 25 30 

Ala Leu Gln Lys Ser Ser His Val Phe Gly Asn Tyr Ala Glu Lys Asp

35 40 45

Lys Asp Ser Lys Leu Thr Ser Asp Ser Pro Thr Gln Gln Gln Asp Gln

50 55 60

Lys Val Ala Gln Asn Thr Ala Ser Asn Asp Ser Gln Glu Ala Thr Thr
65 70 75 80 

Leu Glu Asn Thr Ala Ser Thr Asp Asn Thr Thr Ala Thr Thr Asp Glu 

85 90 95 

Thr Tyr Thr Lys Ser Thr Asp Thr Thr Val Ala Gly Ala Ala Gln Lys

100 105 110

Val Glu Thr Asp Asn Thr Ala Val Gln Ser Ala Glu Gln Thr Leu Lys

115 120 125

Thr Asp Val Ala Lys Val Gln Ala Asp Ala Ser Ala Lys Asp Phe Asp

130 135 140

Glu Thr Thr Phe Gln Ala Asp Gln Ala Ala Glu Gln Thr Ala Glu Lys
145 150 155 160

Ala Leu Gln Gln Ala Glu Ser Lys Leu Asn Thr Asp Gln Gln Thr Leu

165 170 175

Asn Thr Ala Leu Gln Asp Gln Thr Lys Thr Pro Thr Pro Ser Thr Pro

180 185 190 

Pro Thr Lys Glu Glu Pro Lys His Thr Ala Ser Ser Gly Thr Pro Pro 

195 200 205 

Ala Pro Glu Ser Pro Pro Ala Lys Lys Asp Glu Thr Ser Gly Thr Pro 

210 215 220 

Ser Ala Ser Gly Ser Ser Val Ala Ser Gln Leu Thr Lys Asp Thr Thr
225 230 235 240 
Ala Leu Ala Asp Leu Ser Ala Ala Asp Thr Gln Lys Thr Gln Ala Val
340 345 350 

ACG CTT GCA ACT GCT AGT GAT AGT CCA ACG ACA ACG ACA GAT GCC ATC 1160 
Thr Leu Ala Thr Ala Ser Asp Ser Pro Thr Thr Thr Thr Asp Ala Ile
355 360 365 370 

AAT TTC TTA AAC GCG CTA AAA AGC AAT CTA ATG GCT CAA AAA GAC GCT 1208

Asn Phe Leu Asn Ala Leu Lys Ser Asn Leu Met Ala Gln Lys Asp Ala
375 380 385 

TTT TTG AAT GTG CAT AAA AAC ATT CAA ACC GCT GTC GCT CAA GCC CAG 1256

Phe Leu Asn Val His Lys Asn Ile Gln Thr Ala Val Ala Gln Ala Gln
390 395 400 

GAA ACC TAC ACG CCA AGC GTG ATC AAC ACC AAT AAT TAC GGG CAA ATG 1304

Glu Thr Tyr Thr Pro Ser Val Ile Asn Thr Asn Asn Tyr Gly Gln Met
405 410 415 

TAT GGG GTA GAT GCG ATG GCA GGG TAT AAG TGG TTC TTT GGC AAA ACC 1352

Tyr Gly Val Asp Ala Met Ala Gly Tyr Lys Trp Phe Phe Gly Lys Thr 
420 425 430 

AAA CGC TTT GGC TTT AGG TCT TAT GGA TAC TAC AGC TAT AAC CAT GCG 1400

Lys Arg Phe Gly Phe Arg Ser Tyr Gly Tyr Tyr Ser Tyr Asn His Ala 
435 440 445 450 

AAT TTA AGC TTT GTG GGG AGC CAG CTT GGA ATC ATG GAG GGC GCG TCT 1448

Asn Leu Ser Phe Val Gly Ser Gln Leu Gly Ile Met Glu Gly Ala Ser
455 460 465 

CAA GTG AAT AAC TTC ACT TAT GGC GTG GGC TTT GAT GTG CTC TAT AAC 1496

Gln Val Asn Asn Phe Thr Tyr Gly Val Gly Phe Asp Val Leu Tyr Asn
470 475 480 

TTC TAT GAA AGC AAA GAG GGC TAT AAC ACA GCA GGG TTG TTC TTA GGC 1544 
Phe Tyr Glu Ser Lys Glu Gly Tyr Asn Thr Ala Gly Leu Phe Leu Gly 
485 490 495 

TTT GGG TTA GGA GGG GAT TCG TTT ATC GTT CAA GGA GAG AGC TAC TTG 1592 
Phe Gly Leu Gly Gly Asp Ser Phe Ile Val Gln Gly Glu Ser Tyr Leu
500 505 510 

AAA TCT CAA ATG CAC ATT TGC AAC AAC ACC GCC GGC TGT TCA GCG AGC 1640

Lys Ser Gln Met His Ile Cys Asn Asn Thr Ala Gly Cys Ser Ala Ser
515 520 525 530 

ATG AAC ACA AGC TAC TTC CAA ATG CCT GTT GAA TTT GGT TTT AGG AGC 1688

Met Asn Thr Ser Tyr Phe Gln Met Pro Val Glu Phe Gly Phe Arg Ser
535 540 545 

AAT TTC TCT AAA CAC AGC GGG ATT GAA GTG GGC TTT AAA TTG CCT TTA 1736

Asn Phe Ser Lys His Ser Gly Ile Glu Val Gly Phe Lys Leu Pro Leu
550 555 560 
115 120 125 130

GTA GCT AAA GTT CAA GCT GAT GCT AGT GCT AAA GAT TTT GAT GAA ACC 488

Val Ala Lys Val Gln Ala Asp Ala Ser Ala Lys Asp Phe Asp Glu Thr
135 140 145 

ACT TTT CAA GCC GAT CAA GCA GCA GAG CAA ACC GCT GAA AAA GCT TTA 536 
Thr Phe Gln Ala Asp Gln Ala Ala Glu Gln Thr Ala Glu Lys Ala Leu
150 155 160 

CAA CAG GCT GAG AGC AAA CTC AAC ACC GAT CAA CAG ACT TTA AAC ACA 584 
Gln Gln Ala Glu Ser Lys Leu Asn Thr Asp Gln Gln Thr Leu Asn Thr
165 170 175 

GCG TTA CAA GAT CAG ACG AAA ACA CCA ACC CCA TCA ACC CCA CCA ACT 632 
Ala Leu Gln Asp Gln Thr Lys Thr Pro Thr Pro Ser Thr Pro Pro Thr
180 185 190 

AAA GAG GAA CCA AAA CAC ACC GCT TCA AGC GGC ACA CCA CCA GCT CCA 680 
Lys Glu Glu Pro Lys His Thr Ala Ser Ser Gly Thr Pro Pro Ala Pro 
195 200 205 210 

GAA AGC CCA CCA GCT AAA AAA GAT GAA ACA AGT GGC ACA CCA AGT GCT 728

Glu Ser Pro Pro Ala Lys Lys Asp Glu Thr Ser Gly Thr Pro Ser Ala 
215 220 225 

AGT GGG AGT TCT GTG GCA AGC CAG CTA ACC AAA GAT ACC ACT ATG GTT 776

Ser Gly Ser Ser Val Ala Ser Gln Leu Thr Lys Asp Thr Thr Met Val
230 235 240 

AAT AAT CTT AAG AGT GTG AGC GTG AGC GCG ATG AAC ACC ACT TTA AGT 824 
Asn Asn Leu Lys Ser Val Ser Val Ser Ala Met Asn Thr Thr Leu Ser 
245 250 255 

GGA GTA GAA ACC ATG TCT CAA CAA ACT GCA ACG ATT GGC AAC CTT TTG 872 
Gly Val Glu Thr Met Ser Gln Gln Thr Ala Thr Ile Gly Asn Leu Leu
260 265 270 

AAT AGT AGC ACC GAT TTA AGC AGT GTG ATT CCC AAC GCT CAA GGG CTA 920

Asn Ser Ser Thr Asp Leu Ser Ser Val Ile Pro Asn Ala Gln Gly Leu
275 280 285 290 

AAC AGC GCG TTT AGC ACA TTA GAA AGC GCT CAA AAC ACT CTA AAA GGC 968 
Asn Ser Ala Phe Ser Thr Leu Glu Ser Ala Gln Asn Thr Leu Lys Gly
295 300 305 

TAT TTA AAT TCT TCT AGC GCG ACG ATT GGG CAA TTG ACA AAC GGA TCT 1016 
Tyr Leu Asn Ser Ser Ser Ala Thr Ile Gly Gln Leu Thr Asn Gly Ser
310 315 320 

AAT GCG GTT GTG GGC GCG TTA GAT AAA GCT ATC AAT CAA GTG GAT ATG 1064

Asn Ala Val Val Gly Ala Leu Asp Lys Ala Ile Asn Gln Val Asp Met
325 330 335 

GCT TTG GCC GAT CTT AGT GCA GCT GAT ACG CAA AAA ACG CAA GCC GTT 1112 
145

(2) INFORMATION FOR SEQ ID NO:165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1888 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...1835
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: 

CACTAAAGTC AATCCAAGCG CAAGTTGGAT GAAAAAATAA GAAGGAAGTT ATG AAA 56 

Met Lys 
1 

AAG TCA TTC AAA AAA TTA GGC TTT GTC TCT TTA GCG GCT AGT GGC GTG 104 
Lys Ser Phe Lys Lys Leu Gly Phe Val Ser Leu Ala Ala Ser Gly Val 
5 10 15 

CTT TTA GGG AGC ATG AAC GCT ACC GAT TTA GAA ACC TAC GCA GCA TTG 152 
Leu Leu Gly Ser Met Asn Ala Thr Asp Leu Glu Thr Tyr Ala Ala Leu 
20 25 30 

CAA AAA TCA TCG CAT GTT TTT GGT AAT TAT GCT GAA AAG GAT AAG GAT 200

Gln Lys Ser Ser His Val Phe Gly Asn Tyr Ala Glu Lys Asp Lys Asp
35 40 45 50 

AGT AAA TTA ACA AGC GAT TCA CCA ACG CAA CAA CAA GAT CAA AAA GTA 248

Ser Lys Leu Thr Ser Asp Ser Pro Thr Gln Gln Gln Asp Gln Lys Val
55 60 65 

GCC CAA AAC ACC GCT TCA AAC GAC AGC CAA GAA GCG ACA ACA CTT GAA 296

Ala Gln Asn Thr Ala Ser Asn Asp Ser Gln Glu Ala Thr Thr Leu Glu
70 75 80 

AAC ACC GCT TCT ACT GAC AAC ACA ACC GCC ACA ACT GAT GAA ACT TAT 344 
Asn Thr Ala Ser Thr Asp Asn Thr Thr Ala Thr Thr Asp Glu Thr Tyr 
85 90 95 

ACA AAA AGC ACT GAC ACT ACT GTA GCT GGT GCG GCT CAA AAA GTA GAA 392

Thr Lys Ser Thr Asp Thr Thr Val Ala Gly Ala Ala Gln Lys Val Glu
100 105 110 

ACC GAT AAC ACA GCC GTT CAA AGC GCT GAA CAA ACT TTA AAA ACA GAT 440

Thr Asp Asn Thr Ala Val Gln Ser Ala Glu Gln Thr Leu Lys Thr Asp
His Leu Lys Thr Leu Gly Val Leu Ala Leu Leu Phe Leu Ile Tyr His
70 75 80 

CAA ATC GCC TAT AAA AAC TCT TTA AAG CTT TTT AAT GAC GGC TTT TTA 346

Gln Ile Ala Tyr Lys Asn Ser Leu Lys Leu Phe Asn Asp Gly Phe Leu
85 90 95 

TTC AAA ACT TTG CAT GTT TTT TTG GTT TAT TAC CTT TAT TTA TCG CGC 394

Phe Lys Thr Leu His Val Phe Leu Val Tyr Tyr Leu Tyr Leu Ser Arg 
100 105 110 

TTT TTT TCG ATG TCT TTG AGT TTG AAA ATA CTC GGC TTT CTC GCT CTT 442 
Phe Phe Ser Met Ser Leu Ser Leu Lys Ile Leu Gly Phe Leu Ala Leu
115 120 125 130 

TTT GCT TTA ATA GAA AGC GCT TTG TGG GGT TTG TAT GAA AAA TCT TCG 490

Phe Ala Leu Ile Glu Ser Ala Leu Trp Gly Leu Tyr Glu Lys Ser Ser
135 140 145 

CTA TAAGCTTTTG CTCTTTGTTT TTATAGGGTT TTGGGGGTTA CTAGCCT 540

Leu 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Met Leu Ser Leu Lys Gln Asp Ser Phe Phe Phe Leu Cys Leu Gly Ile

1 5 10 15

Leu Gly Phe Tyr Phe Tyr Ser Leu Leu Arg Asp Leu Met Pro Phe Leu 

20 25 30 

Pro Pro Met Ile Gly Phe Leu Phe Leu Phe Tyr Ala Lys Lys Tyr Asp

35 40 45 

His Phe Leu Pro Ser Leu Ser Val Phe Gly Cys Leu Phe Trp Phe Glu 

50 55 60 

Ser Met His Leu Lys Thr Leu Gly Val Leu Ala Leu Leu Phe Leu Ile
65 70 75 80

Tyr His Gln Ile Ala Tyr Lys Asn Ser Leu Lys Leu Phe Asn Asp Gly

85 90 95 

Phe Leu Phe Lys Thr Leu His Val Phe Leu Val Tyr Tyr Leu Tyr Leu 

100 105 110 

Ser Arg Phe Phe Ser Met Ser Leu Ser Leu Lys Ile Leu Gly Phe Leu

115 120 125

Ala Leu Phe Ala Leu Ile Glu Ser Ala Leu Trp Gly Leu Tyr Glu Lys

130 135 140 

Ser Ser Leu 
Gln Thr Ser Arg Pro Met Trp Asn Ser Asn Leu Pro Lys Pro Asn Ala

130 135 140 

Ile Tyr Val Tyr Gly Val Ala Asn Ala Asn Ile Thr Phe Phe Lys Gly
145 150 155 160

Ser Asp Ile Leu Ser Tyr Glu Thr Arg Glu Val Leu Leu Lys Tyr Phe

165 170 175

Asp Ile Leu Asp Lys Asp Glu Arg Ser Leu Lys Asn Ala Leu Lys Asp

180 185 190

Leu Glu Asn Pro Phe Gly Phe Ala Pro Tyr Ile Arg Lys Ala Tyr Glu

195 200 205 

His Lys Arg Asn Phe Leu Thr Thr Thr Arg Leu Lys Ala Ser Phe Arg 

210 215 220 

Pro Thr Thr Phe 
225 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 53...493
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:163:

CCAAACCCTT TTGAAACACT TGCTCACTAA CCCATTATAA GCCGCAAAAA CC ATG CTC 58 

Met Leu 
1 

TCT TTA AAA CAA GAT TCC TTT TTT TTC TTA TGT TTA GGA ATC CTG GGG 106 
Ser Leu Lys Gln Asp Ser Phe Phe Phe Leu Cys Leu Gly Ile Leu Gly
5 10 15 

TTT TAT TTT TAT AGC CTT TTG AGG GAT TTA ATG CCT TTT TTA CCC CCA 154 
Phe Tyr Phe Tyr Ser Leu Leu Arg Asp Leu Met Pro Phe Leu Pro Pro 
20 25 30 

ATG ATT GGG TTT TTA TTC TTG TTT TAT GCG AAA AAA TAC GAT CAT TTT 202

Met Ile Gly Phe Leu Phe Leu Phe Tyr Ala Lys Lys Tyr Asp His Phe
35 40 45 50 

TTA CCC AGT TTG AGC GTG TTT GGT TGT TTG TTT TGG TTT GAG AGC ATG 250

Leu Pro Ser Leu Ser Val Phe Gly Cys Leu Phe Trp Phe Glu Ser Met 
55 60 65 

CAT TTA AAG ACT TTA GGC GTT TTA GCT TTA TTG TTT TTA ATC TAC CAT 298
GTG TAT GGA GTC GCT AAT GCA AAC ATC ACT TTT TTT AAA GGC TCA GAT 536

Val Tyr Gly Val Ala Asn Ala Asn Ile Thr Phe Phe Lys Gly Ser Asp
150 155 160 

ATT TTG AGT TAT GAA ACC AGA GAG GTC TTG CTC AAG TAT TTT GAT ATT 584 
Ile Leu Ser Tyr Glu Thr Arg Glu Val Leu Leu Lys Tyr Phe Asp Ile
165 170 175 

TTA GAT AAA GAT GAA AGA AGT TTG AAA AAC GCC TTA AAG GAT TTA GAA 632 
Leu Asp Lys Asp Glu Arg Ser Leu Lys Asn Ala Leu Lys Asp Leu Glu 
180 185 190 

AAC CCT TTT GGG TTT GCC CCC TAC ATC AGA AAA GCT TAT GAG CAT AAA 680

Asn Pro Phe Gly Phe Ala Pro Tyr Ile Arg Lys Ala Tyr Glu His Lys
195 200 205 210 

AGG AAT TTT CTA ACC ACC ACC AGA TTG AAA GCT TCT TTT CGC CCA ACC 728

Arg Asn Phe Leu Thr Thr Thr Arg Leu Lys Ala Ser Phe Arg Pro Thr 
215 220 225 

ACA TTT TAAGAGAGCG GAATGTCTTG GAATTTTTGA AAACGCTCAC TCATTAGCGT AT 786
Thr Phe 



T 787 
(2) INFORMATION FOR SEQ ID NO: 162:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Met Leu Glu Lys Val Phe Gln Glu Ile Thr Asn Lys Arg Lys Phe Phe

1 5 10 15

Ala Ser Ser Ser Thr Gly Glu Gln Phe Glu Asn Gln Phe Arg Asn Glu

20 25 30

Leu Lys Lys His Phe Ser Glu Ile Asn Gly Asp Leu Thr Glu Glu Leu

35 40 45

Ser His Ile Glu Glu Lys Pro Asn Lys Glu Ile Lys Thr Thr Phe Asn

50 55 60 

Gln Leu Lys Lys Gln Val Leu Glu Lys Asn His Pro His Thr Leu Lys
65 70 75 80

Asn Pro Phe Ser Asn Leu Thr Ser His Phe Leu Tyr Gln Pro Phe Gly

85 90 95

Ser Gln Asn Tyr Pro Asp Phe Leu Val Phe Ile Phe Asp Tyr Val Val

100 105 110

Gly Ile Glu Ile Lys Phe Ser Lys Asn Asp Lys Gly Glu Lys Asn Leu
115 120 125 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...734
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GTTTTCTACT TATGATTTTG TGGAAGAATA TTGCAAATTA AAGGAAATGC ATG CTT 56 

Met Leu 
1 

GAA AAA GTG TTT CAA GAA ATT ACC AAT AAA AGA AAG TTT TTT GCA AGT 104 
Glu Lys Val Phe Gln Glu Ile Thr Asn Lys Arg Lys Phe Phe Ala Ser
5 10 15 

TCT AGC ACA GGG GAG CAG TTT GAA AAC CAA TTT AGG AAT GAA TTA AAA 152 
Ser Ser Thr Gly Glu Gln Phe Glu Asn Gln Phe Arg Asn Glu Leu Lys
20 25 30 

AAA CAC TTT AGC GAA ATC AAT GGC GAT TTA ACA GAA GAA TTA AGC CAT 200 
Lys His Phe Ser Glu Ile Asn Gly Asp Leu Thr Glu Glu Leu Ser His
35 40 45 50 

ATT GAA GAA AAG CCT AAT AAA GAA ATC AAA ACC ACT TTT AAC CAA CTC 248

Ile Glu Glu Lys Pro Asn Lys Glu Ile Lys Thr Thr Phe Asn Gln Leu
55 60 65 

AAA AAG CAA GTT TTA GAA AAA AAT CAC CCG CAC ACC CTT AAA AAC CCT 296

Lys Lys Gln Val Leu Glu Lys Asn His Pro His Thr Leu Lys Asn Pro
70 75 80 

TTT TCA AAC CTT ACA AGC CAT TTT TTA TAC CAG CCT TTT GGC TCA CAA 344 
Phe Ser Asn Leu Thr Ser His Phe Leu Tyr Gln Pro Phe Gly Ser Gln
85 90 95 

AAT TAC CCT GAT TTT TTG GTT TTT ATT TTT GAC TAT GTG GTG GGG ATT 392

Asn Tyr Pro Asp Phe Leu Val Phe Ile Phe Asp Tyr Val Val Gly Ile
100 105 110 

GAA ATC AAG TTT TCT AAA AAC GAT AAG GGT GAA AAA AAT CTT CAA ACA 440

Glu Ile Lys Phe Ser Lys Asn Asp Lys Gly Glu Lys Asn Leu Gln Thr
115 120 125 130 

TCT CGC CCC ATG TGG AAT TCA AAC CTG CCT AAA CCC AAT GCG ATT TAT 488 
Ser Arg Pro Met Trp Asn Ser Asn Leu Pro Lys Pro Asn Ala Ile Tyr
135 140 145 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

TGTAGAATGA AATCCTAGCC AGTGAGCTAG AATTTAAATT TTTAATCAAA GGAGTCATCA 60 
TGGCACACC ATG AAG AAC AAC ACG GCG GGC ACC ACC ACC ACC ATC ACC ACA 111 
Met Lys Asn Asn Thr Ala Gly Thr Thr Thr Thr Ile Thr Thr
1 5 10

CAC ACC ACC ACC ACT ATC ATG GCG GTG AAC ACC ACC ATC ACC ACC ACA 159 
His Thr Thr Thr Thr Ile Met Ala Val Asn Thr Thr Ile Thr Thr Thr
15 20 25 30 

GCT CTC ATC ATG AAG AAG GTT GTT GCA GCA CTA GCG ACA GTC ATC ATC 207

Ala Leu Ile Met Lys Lys Val Val Ala Ala Leu Ala Thr Val Ile Ile
35 40 45 

AAG AAG AAG GTT GCT GCC ACG GGC ATC ACG AGT AAT ATC GGT GTG GCT 255

Lys Lys Lys Val Ala Ala Thr Gly Ile Thr Ser Asn Ile Gly Val Ala
50 55 60 

AGG GGC AAC TTG ACT AGG GTT GTC TCT GGC TTT TGACTTTAAA ATACAATCAT 308
Arg Gly Asn Leu Thr Arg Val Val Ser Gly Phe 
65 70 

TCCATTCTAA CCCATTCTGA TCAAACCCGT T 339

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:

Met Lys Asn Asn Thr Ala Gly Thr Thr Thr Thr Ile Thr Thr His Thr

1 5 10 15

Thr Thr Thr Ile Met Ala Val Asn Thr Thr Ile Thr Thr Thr Ala Leu

20 25 30

Ile Met Lys Lys Val Val Ala Ala Leu Ala Thr Val Ile Ile Lys Lys

35 40 45

Lys Val Ala Ala Thr Gly Ile Thr Ser Asn Ile Gly Val Ala Arg Gly

50 55 60 

Asn Leu Thr Arg Val Val Ser Gly Phe 
65 70 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 787 base pairs
595

Leu Lys Thr His 
610 

Asp Tyr Gin Asn 
625 

Glu Gin His Tyr 

Asn Phe Ser Asp 
660 

Ser Pro Leu Asp 
675 

Gly Val Asn Ser 
690 

lie Leu Met Ala 
705 

Ala Cys Lys Asp 

Gin Ala lie Gin 
740 

Phe Leu Ala Pro 
755 

Ser Gin Ala Leu 
770 

Lys Asp lie Ser 
785 

Pro Glu Asn Phe 

Glu Leu Asp Arg 
820 

Cys Ala lie Glu 
835 

Glu Leu Leu Met 
850 

Glu Thr Pro Ala 
865 

Arg Cys Trp Arg 

Cys Glu Gin Val 
900 



600 

Gly Ser Asp Val 
615 

Asp Leu Arg Val 
630 

Lys Lys Phe Arg 
645 

Met Asp Leu Lys 

His Phe Met Leu 
680 

Ala Phe Glu Glu 
695 

Phe Val Thr Asn 
710 

Ser Leu Tyr Cys 
725 

Met Val Leu Leu 

lie Leu Thr His 
760 

Arg lie Phe Leu 
775 

Val Ser Glu Lys 
790 

Glu Ala Val Leu 
805 

Leu Lys Lys Glu 

Val Lys Glu Lys 
840 

Val Ser Phe Val 
855 

Phe Thr Leu Phe 
870 

Phe Lys Ser Glu 
885 

Leu Lys Glu Arg 



Val Arg Leu Trp 
620 

Ser Gin Thr Phe 
635 

Asn Thr Leu Lys 
650 

Asn Leu Glu Arg 
665 

Glu Thr Leu Glu 

His Asp Phe Val 
700 

Glu Leu Ser Gly 
715 

Asp Ser Lys Asn 
730 

Ala Thr Ala Ser 
745 

Thr He Glu Glu 

Gin Ala Lys Asp 
780 

Leu His Leu Lys 
795 

Ala Leu Arg Ser 
810 

Gly Val lie Lys 
825 

Ala Leu Asp Glu 

Gly He Ala Lys 
860 

Lys Ala Pro Phe 
875 

Leu Glu Asn Thr 
890 



605 

Val Ala Phe Asn 

Phe Thr Gin Thr 
640 

Phe Leu Leu Ala 
655 

Pro His Asn Phe 
670 

Thr He Ser Ala 
685 

Lys Gly Leu Asn 

He Tyr Leu Asp 
720 

Asn Glu Lys Arg 
735 

Lys Leu Cys Tyr 
750 

Val Leu Glu His 
765 

Val Phe Asp Leu 

Glu Phe Lys Lys 
800 

Ala Phe Asn Glu 
815 

Asn Ser Leu Glu 
830 

Asn Leu Val Glu 
845 

Glu Lys Leu Ser 

Tyr Lys Cys Pro 
880 

Pro Cys Lys Arg 
895 



(2) INFORMATION FOR SEQ ID NO: 159: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70...288
(D) OTHER INFORMATION: 
Gly Leu Leu Lys

Glu Ser Ala Leu 
180 

Pro Ser lie Phe 
195 

Leu Lys Val Lys 
210 

Thr Leu Tyr Ala 
225 

Ala Leu Thr Gin 

Leu Ala Ala Leu 
260 

Ser Asn Asp Leu 
275 

Asp Ser Leu Val 
290 

Gly Ala Val His 
305 

Gly Leu Arg Tyr 

Cys Tyr Asp Glu 
340 

Leu Gly Glu His 
355 

Gly Asp Ser Leu 
370 

Cys Trp Arg Thr 
385 

Phe lie Leu Met 

Thr Leu Arg Glu 
420 

Pro Ser Ser Gly 
435 

Asp Trp Cys Leu 
450 

Phe lie Asp Lys 
465 

Glu His Val Ala 

Glu Tyr Ser Val 
500 

Lys His Tyr Glu 
515 

Gly Ser Thr Phe 
530 

Gin Ser Pro Ser 
545 

Trp Phe Gin Ser 

Pro Phe Lys Lys 
580 

Glu Lys Met Ser 



Glu Arg His Lys 
165 

Ala Glu Ala Glu 

Val Ala Phe Gly 
200 

Lys Ala Ser Leu 
215 

Asn Val Ala lie 
230 

Lys Gly Tyr Leu 
245 

Gly Val Val Asp 

Glu Tyr Leu Val 
280 

Ala Leu Gly Glu 
295 

Thr Ala Pro Gly 
310 

Asn Leu Glu Val 
325 

Gly lie lie His 

Val Phe Lys Ala 
360 

Leu Leu Glu Gin 
375 

His Lys Pro Val 
390 

Asp Glu Pro Phe 
405 

Val Ala Leu Asp 

Lys Asn Arg Leu 
440 

Ser Arg Gin Arg 
455 

Arg Thr Asn Lys 
470 

Asn Leu Phe Glu 
485 

Lys Asp Leu Leu 

Lys lie Met His 
520 

Lys Ala Val Leu 
535 

Asp Val lie Leu 
550 

Ser Leu Leu lie 
565 

Val lie Thr His 
Lys Ser Lys Gly 



Pro lie Tyr Trp 
170 

Val Glu Tyr Lys 
185 

Leu Lys Lys Glu 

Val lie Trp Thr 
220 

Ala Leu Lys Lys 
235 

Val Ala Lys Ala 
250 

Asn Glu lie Thr 
265 

Ala Thr Asn Pro 

His Val Gly Leu 
300 

His Gly Glu Glu 
315 

Leu Met Ser Val 
330 

Asn Gin Leu Leu 
345 

Gin Lys Arg lie 

Glu lie Glu His 
380 

lie Tyr Arg Ala 
395 

lie Gin Asn Asp 
410 

Ala lie Glu Lys 
425 

Lys Thr Met lie 

Lys Trp Gly Val 
460 

Pro Cys Phe Glu 
475 

Lys Lys Gly Cys 
490 

Pro Pro Ser Tyr 
505 

lie Leu Asp Val 

Glu Asp Tyr His 
540 

Glu Gly Ser Asp 
555 

Gly Cys Val Leu 
570 

Gly Phe lie Val 
585 

Asn Val Val Ser 
Ser Tyr Ala Cys
175 

Met Lys Lys Ser 
190 

Ser Leu Glu Lys 
205 

Thr Thr Pro Trp 

Asp Ala Val Tyr 
240 

Leu His Glu Lys 
255 

His Glu Phe Asn 
270 

Leu Asn Gin Arg 
285 

Glu Asp Gly Thr 

Asp Tyr Tyr Leu 
320 

Asp Glu Lys Gly 
335 

Asp Glu Ser Tyr 
350 

lie Glu Gin Leu 
365 

Ser Tyr Pro His 

Thr Thr Gin Trp 
400 

Gly Ser Gin Lys 
415 

Val Glu Phe Val 
430 

Glu Asn Arg Pro 
445 

Pro Leu Ala Phe 

Ser Glu Val Leu 
480 

Asp Val Trp Trp 
495 

Gin Glu Asp Ala 
510 

Trp Phe Asp Ser 
525 

Gly Glu Lys Gly 

Gin His Arg Gly 
560 

Asn Asn Gin Ala 
575 

Asp Glu Lys Gly 
590 

Leu Asp Lys Leu 
GCT ATT GAA GTA AAA GAA AAA GCG TTG GAT GAA AAT TTA GTA GAA GAG 2604

Ala Ile Glu Val Lys Glu Lys Ala Leu Asp Glu Asn Leu Val Glu Glu
835 840 845 

TTG CTG ATG GTA AGC TTT GTG GGG ATT GCA AAA GAA AAA TTG AGT GAA 2652

Leu Leu Met Val Ser Phe Val Gly Ile Ala Lys Glu Lys Leu Ser Glu
850 855 860 865 

ACG CCA GCA TTC ACG CTC TTT AAA GCC CCC TTT TAT AAA TGC CCC AGG 2700

Thr Pro Ala Phe Thr Leu Phe Lys Ala Pro Phe Tyr Lys Cys Pro Arg 
870 875 880 

TGT TGG CGT TTT AAA AGC GAG CTA GAA AAC ACC CCT TGC AAG CGT TGC 2748

Cys Trp Arg Phe Lys Ser Glu Leu Glu Asn Thr Pro Cys Lys Arg Cys 
885 890 895 

GAA CAG GTT TTA AAA GAG CGA TGATAAAAGG ATAGGGCTTT TGAAAACTTT ACAA 2803
Glu Gln Val Leu Lys Glu Arg
900 

ACCCATAGAG TTTTACAA 2821

(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:158: 

Met Lys Gly Asn Leu Ser Val Asn Glu Pro Lys Thr Tyr Ala Lys Trp 

1 5 10 15

Gln Glu Gln Gln Ala Phe Lys Arg Met Gln Ala Arg Lys Asp Asn His

20 25 30 

Gly Asp Phe Thr Leu His Asp Gly Pro Pro Tyr Ala Asn Gly His Leu 

35 40 45 

His Leu Gly His Ala Leu Asn Lys Ile Leu Lys Asp Ile Val Val Lys

50 55 60

Arg Glu Tyr Phe Lys Gly Lys Lys Ile Tyr Tyr Thr Pro Gly Trp Asp
65 70 75 80

Cys His Gly Leu Pro Ile Glu Gln Gln Ile Leu Glu Arg Leu Glu Lys

85 90 95 

Glu Lys Thr Ser Leu Glu Asn Pro Thr Leu Phe Arg Glu Lys Cys Arg 

100 105 110 

Asp His Ala Lys Lys Phe Leu Glu Ile Gln Lys Asn Glu Phe Leu Gln

115 120 125 

Leu Gly Val Leu Gly Asp Phe Glu Asp Pro Tyr Lys Thr Met Asp Phe 

130 135 140 

Lys Phe Glu Ala Ser Ile Tyr Arg Ala Leu Val Glu Val Ala Lys Lys
145 150 155 160 
AAA ACG CAT GGG AGC GAT GTG GTG CGT TTG TGG GTA GCG TTT AAT GAG 1932
Lys Thr His Gly Ser Asp Val Val Arg Leu Trp Val Ala Phe Asn Asp 
610 615 620 625 

TAT CAA AAC GAT TTG AGA GTC TCT CAA ACC TTT TTC ACT CAA ACA GAA 1980 
Tyr Gln Asn Asp Leu Arg Val Ser Gln Thr Phe Phe Thr Gln Thr Glu
630 635 640 

CAA CAT TAT AAA AAA TTC CGC AAC ACC CTG AAA TTC TTA CTC GCT AAT 2028
Gln His Tyr Lys Lys Phe Arg Asn Thr Leu Lys Phe Leu Leu Ala Asn
645 650 655 

TTT AGC GAT ATG GAT CTC AAG AAT TTA GAA CGC CCC CAT AAC TTC AGC 2076 
Phe Ser Asp Met Asp Leu Lys Asn Leu Glu Arg Pro His Asn Phe Ser 
660 665 670 

CCT TTA GAT CAT TTT ATG TTA GAG ACT TTA GAA ACC ATA AGC GCT GGA 2124 
Pro Leu Asp His Phe Met Leu Glu Thr Leu Glu Thr Ile Ser Ala Gly
675 680 685 

GTC AAT AGC GCG TTT GAA GAG CAT GAT TTT GTG AAA GGC TTG AAT ATT 2172 
Val Asn Ser Ala Phe Glu Glu His Asp Phe Val Lys Gly Leu Asn Ile
690 695 700 705 

TTA ATG GCG TTT GTT ACC AAT GAA TTG AGC GGG ATT TAT TTA GAC GCT 2220
Leu Met Ala Phe Val Thr Asn Glu Leu Ser Gly Ile Tyr Leu Asp Ala
710 715 720 

TGC AAG GAT AGC TTG TAT TGC GAT AGC AAA AAC AAT GAA AAA CGC CAA 2268
Cys Lys Asp Ser Leu Tyr Cys Asp Ser Lys Asn Asn Glu Lys Arg Gln
725 730 735 

GCC ATT CAA ATG GTT TTA CTC GCT ACA GCT AGT AAG TTG TGC TAC TTT 2316

Ala Ile Gln Met Val Leu Leu Ala Thr Ala Ser Lys Leu Cys Tyr Phe
740 745 750 

TTA GCC CCG ATT TTA ACG CAC ACG ATT GAA GAA GTT TTA GAG CAT AGC 2364
Leu Ala Pro Ile Leu Thr His Thr Ile Glu Glu Val Leu Glu His Ser
755 760 765 

CAA GCG CTT CGC ATT TTT TTA CAA GCC AAA GAT GTG TTT GAT TTA AAA 2412 
Gln Ala Leu Arg Ile Phe Leu Gln Ala Lys Asp Val Phe Asp Leu Lys
770 775 780 785 

GAC ATT AGC GTT TCA GAA AAA CTC CAC CTC AAA GAG TTT AAA AAA CCA 2460
Asp Ile Ser Val Ser Glu Lys Leu His Leu Lys Glu Phe Lys Lys Pro
790 795 800 

GAA AAT TTT GAA GCC GTT TTA GCC TTG CGT TCT GCC TTT AAT GAA GAG 2508
Glu Asn Phe Glu Ala Val Leu Ala Leu Arg Ser Ala Phe Asn Glu Glu 
805 810 815 

TTA GAC CGA TTG AAA AAA GAA GGC GTC ATT AAA AAT TCG TTA GAG TGC 2 556 

Leu Asp Arg Leu Lys Lys Glu Gly Val lie Lys Asn Ser Leu Glu Cys 
820 825 830 
TGG AGG ACG CAC AAG CCT GTG ATT TAC AGA GCG ACT ACG CAA TGG TTT 1260
Trp Arg Thr His Lys Pro Val Ile Tyr Arg Ala Thr Thr Gln Trp Phe
390 395 400

ATT TTA ATG GAT GAG CCT TTT ATC CAA AAT GAT GGC TCT CAA AAA ACC 1308
Ile Leu Met Asp Glu Pro Phe Ile Gln Asn Asp Gly Ser Gln Lys Thr
405 410 415

TTA AGA GAA GTG GCT TTA GAT GCG ATT GAA AAG GTG GAA TTT GTG CCA 1356
Leu Arg Glu Val Ala Leu Asp Ala Ile Glu Lys Val Glu Phe Val Pro
420 425 430

AGC AGC GGG AAA AAC CGC CTA AAA ACC ATG ATA GAA AAC CGC CCT GAT 1404
Ser Ser Gly Lys Asn Arg Leu Lys Thr Met Ile Glu Asn Arg Pro Asp
435 440 445

TGG TGC TTG AGC CGG CAA AGA AAA TGG GGC GTG CCA CTG GCC TTT TTC 1452
Trp Cys Leu Ser Arg Gln Arg Lys Trp Gly Val Pro Leu Ala Phe Phe
450 455 460 465

ATA GAC AAA CGC ACG AAT AAG CCT TGT TTT GAA AGC GAA GTT TTA GAG 1500
Ile Asp Lys Arg Thr Asn Lys Pro Cys Phe Glu Ser Glu Val Leu Glu
470 475 480

CAT GTG GCC AAT CTT TTT GAG AAA AAA GGC TGT GAT GTG TGG TGG GAG 1548
His Val Ala Asn Leu Phe Glu Lys Lys Gly Cys Asp Val Trp Trp Glu
485 490 495

TAT AGC GTG AAA GAT TTA TTG CCC CCT AGC TAT CAA GAG GAC GCC AAG 1596
Tyr Ser Val Lys Asp Leu Leu Pro Pro Ser Tyr Gln Glu Asp Ala Lys
500 505 510

CAT TAT GAG AAA ATC ATG CAC ATT TTA GAC GTG TGG TTT GAT AGT GGT 1644
His Tyr Glu Lys Ile Met His Ile Leu Asp Val Trp Phe Asp Ser Gly
515 520 525

AGC ACC TTT AAG GCG GTT TTA GAA GAC TAT CAT GGA GAA AAA GGG CAA 1692
Ser Thr Phe Lys Ala Val Leu Glu Asp Tyr His Gly Glu Lys Gly Gln
530 535 540 545

AGC CCT AGC GAT GTG ATC TTA GAA GGG AGC GAT CAG CAT AGG GGG TGG 1740
Ser Pro Ser Asp Val Ile Leu Glu Gly Ser Asp Gln His Arg Gly Trp
550 555 560

TTT CAA AGC TCG CTT CTA ATC GGT TGT GTT TTA AAC AAC CAA GCC CCT 1788
Phe Gln Ser Ser Leu Leu Ile Gly Cys Val Leu Asn Asn Gln Ala Pro
565 570 575

TTT AAA AAG GTC ATT ACG CAT GGC TTT ATC GTA GAT GAA AAG GGC GAA 1836
Phe Lys Lys Val Ile Thr His Gly Phe Ile Val Asp Glu Lys Gly Glu
580 585 590 

AAA ATG AGT AAA TCT AAG GGC AAT GTG GTG TCT TTG GAC AAG CTG CTC 1884 
Lys Met Ser Lys Ser Lys Gly Asn Val Val Ser Leu Asp Lys Leu Leu 
595 600 605 
Leu Leu Lys Glu Arg His Lys Pro Ile Tyr Trp Ser Tyr Ala Cys Glu
165 170 175 

AGC GCT TTA GCG GAA GCT GAA GTG GAA TAC AAA ATG AAA AAA TCG CCC 636
Ser Ala Leu Ala Glu Ala Glu Val Glu Tyr Lys Met Lys Lys Ser Pro
180 185 190 

TCC ATT TTC GTG GCG TTT GGT TTG AAA AAG GAG AGT TTA GAA AAA TTA 684 
Ser Ile Phe Val Ala Phe Gly Leu Lys Lys Glu Ser Leu Glu Lys Leu
195 200 205

AAA GTC AAA AAA GCG AGC TTG GTG ATT TGG ACG ACC ACG CCT TGG ACT 732
Lys Val Lys Lys Ala Ser Leu Val Ile Trp Thr Thr Thr Pro Trp Thr
210 215 220 225

TTG TAT GCG AAT GTA GCG ATC GCT TTG AAA AAA GAC GCT GTT TAT GCG 780
Leu Tyr Ala Asn Val Ala Ile Ala Leu Lys Lys Asp Ala Val Tyr Ala
230 235 240

CTC ACC CAA AAA GGC TAT TTA GTC GCT AAA GCC TTG CAT GAA AAA TTA 828
Leu Thr Gln Lys Gly Tyr Leu Val Ala Lys Ala Leu His Glu Lys Leu
245 250 255

GCC GCT TTA GGG GTG GTG GAT AAT GAG ATC ACA CAT GAA TTC AAT TCC 876
Ala Ala Leu Gly Val Val Asp Asn Glu Ile Thr His Glu Phe Asn Ser
260 265 270

AAT GAT TTA GAA TAT TTA GTG GCT ACA AAC CCG CTC AAT CAA AGG GAT 924
Asn Asp Leu Glu Tyr Leu Val Ala Thr Asn Pro Leu Asn Gln Arg Asp
275 280 285 

TCG CTG GTG GCT TTA GGA GAG CAT GTC GGT TTA GAA GAT GGC ACA GGA 972 
Ser Leu Val Ala Leu Gly Glu His Val Gly Leu Glu Asp Gly Thr Gly 
290 295 300 305 

GCC GTG CAT ACC GCA CCT GGG CAT GGT GAA GAG GAC TAT TAT TTA GGC 1020 
Ala Val His Thr Ala Pro Gly His Gly Glu Glu Asp Tyr Tyr Leu Gly 
310 315 320 

TTA AGA TAT AAT TTA GAA GTG TTA ATG TCT GTA GAT GAG AAA GGT TGC 1068 
Leu Arg Tyr Asn Leu Glu Val Leu Met Ser Val Asp Glu Lys Gly Cys 
325 330 335 

TAT GAT GAG GGC ATT ATC CAT AAC CAA CTA TTA GAT GAA AGC TAT CTG 1116 
Tyr Asp Glu Gly Ile Ile His Asn Gln Leu Leu Asp Glu Ser Tyr Leu
340 345 350

GGC GAG CAT GTT TTT AAG GCT CAA AAA CGC ATT ATA GAG CAA TTG GGC 1164
Gly Glu His Val Phe Lys Ala Gln Lys Arg Ile Ile Glu Gln Leu Gly
355 360 365

GAT TCT TTA TTG CTA GAG CAA GAG ATT GAG CAT TCT TAT CCG CAT TGC 1212
Asp Ser Leu Leu Leu Glu Gln Glu Ile Glu His Ser Tyr Pro His Cys
370 375 380 385 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 58...2769
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

GATTAATCAG TGGAAGAATA CAAAGACACC CTAAACTTAA ACACAACCAC CTTTTCT ATG 60 

Met 
1 

AAG GGG AAT TTG AGC GTT AAT GAG CCT AAA ACT TAC GCC AAA TGG CAA 108
Lys Gly Asn Leu Ser Val Asn Glu Pro Lys Thr Tyr Ala Lys Trp Gln
5 10 15

GAG CAA CAA GCG TTT AAA CGC ATG CAA GCT AGG AAA GAC AAC CAT GGG 156
Glu Gln Gln Ala Phe Lys Arg Met Gln Ala Arg Lys Asp Asn His Gly
20 25 30 

GAT TTC ACT TTG CAT GAC GGG CCG CCT TAT GCG AAC GGG CAT TTG CAT 204
Asp Phe Thr Leu His Asp Gly Pro Pro Tyr Ala Asn Gly His Leu His
35 40 45

TTG GGG CAT GCC TTA AAT AAA ATT TTA AAA GAC ATT GTC GTT AAA AGA 252
Leu Gly His Ala Leu Asn Lys Ile Leu Lys Asp Ile Val Val Lys Arg
50 55 60 65

GAA TAT TTT AAG GGG AAG AAA ATC TAT TAC ACG CCC GGT TGG GAT TGC 300
Glu Tyr Phe Lys Gly Lys Lys Ile Tyr Tyr Thr Pro Gly Trp Asp Cys
70 75 80

CAT GGT TTG CCC ATT GAG CAG CAA ATT TTA GAG CGA TTA GAA AAA GAA 348
His Gly Leu Pro Ile Glu Gln Gln Ile Leu Glu Arg Leu Glu Lys Glu
85 90 95

AAA ACA AGC CTA GAA AAC CCC ACG CTG TTT AGA GAA AAG TGC CGA GAT 396
Lys Thr Ser Leu Glu Asn Pro Thr Leu Phe Arg Glu Lys Cys Arg Asp
100 105 110

CAT GCG AAG AAA TTT TTA GAA ATC CAA AAG AAT GAA TTT TTG CAA TTG 444
His Ala Lys Lys Phe Leu Glu Ile Gln Lys Asn Glu Phe Leu Gln Leu
115 120 125

GGT GTT TTG GGG GAT TTT GAA GAT CCT TAT AAA ACC ATG GAT TTT AAA 492
Gly Val Leu Gly Asp Phe Glu Asp Pro Tyr Lys Thr Met Asp Phe Lys
130 135 140 145

TTT GAA GCG AGC ATT TAT AGA GCC TTA GTG GAA GTG GCT AAA AAA GGG 540
Phe Glu Ala Ser Ile Tyr Arg Ala Leu Val Glu Val Ala Lys Lys Gly
150 155 160

CTT TTG AAA GAG CGC CAC AAG CCT ATT TAT TGG AGT TAT GCA TGC GAG 588
TTA AAT CCC AAA CTC TTA GAG GTG GAA CAA CGC CTT ATT TAAAAGGGCG CT 687
Leu Asn Pro Lys Leu Leu Glu Val Glu Gln Arg Leu Ile
195 200

GAGATGGGTA GCGCTCGCTG AAGAAAGGTC GATGCGTT 725

(2) INFORMATION FOR SEQ ID NO: 156: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:156: 



Met Thr Leu Gly Ile Asp Glu Ala Gly Arg Gly Cys Leu Ala Gly Ser
1 5 10 15

Leu Phe Val Ala Gly Val Ala Cys Asn Glu Lys Thr Ala Leu Glu Phe
20 25 30

Leu Lys Met Gly Leu Lys Asp Ser Lys Lys Leu Ser Leu Lys Lys Arg
35 40 45

Phe Phe Leu Glu Tyr Lys Ile Lys Thr His Gly Glu Val Gly Phe Phe
50 55 60

Val Val Lys Lys Ser Ala Asn Glu Ile Asp Ser Leu Gly Leu Gly Ala
65 70 75 80

Cys Leu Lys Leu Ala Val Gln Glu Ile Leu Glu Asn Gly Cys Ser Leu
85 90 95

Val Asp Glu Ile Lys Ile Asp Gly Asn Thr Ala Phe Gly Leu Asn Lys
100 105 110

Arg Tyr Pro His Ile Gln Thr Ile Ile Lys Gly Asp Glu Thr Ile Ala
115 120 125

Gln Ile Ala Met Ala Ser Val Leu Ala Lys Ala Phe Lys Asp Arg Glu
130 135 140

Met Leu Glu Leu His Ala Leu Phe Lys Glu Tyr Gly Trp Asp Lys Asn
145 150 155 160

Cys Gly Tyr Gly Thr Lys Gln His Ile Glu Ala Ile Ile Lys Leu Gly
165 170 175

Ala Thr Pro Phe His Arg His Ser Phe Thr Leu Lys Asn Arg Ile Leu
180 185 190

Asn Pro Lys Leu Leu Glu Val Glu Gln Arg Leu Ile
195 200



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2821 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

GAAAACAGGA TAACGCATGA AACATGTGAG TAGGGATTTT GATACCGGTT GGGTTGCGTA 60

TCA ATG ACT CTA GGC ATT GAT GAA GCG GGT AGG GGG TGT TTG GCC GGT 108
Met Thr Leu Gly Ile Asp Glu Ala Gly Arg Gly Cys Leu Ala Gly
1 5 10 15

TCG CTT TTT GTG GCT GGG GTG GCG TGT AAT GAA AAA ACA GCC TTA GAA 156 
Ser Leu Phe Val Ala Gly Val Ala Cys Asn Glu Lys Thr Ala Leu Glu 
20 25 30 

TTT CTA AAA ATG GGT TTA AAA GAC AGC AAG AAG CTC AGC CTA AAA AAG 204
Phe Leu Lys Met Gly Leu Lys Asp Ser Lys Lys Leu Ser Leu Lys Lys
35 40 45

CGC TTT TTC TTA GAA TAT AAG ATC AAA ACG CAT GGT GAG GTG GGG TTT 252
Arg Phe Phe Leu Glu Tyr Lys Ile Lys Thr His Gly Glu Val Gly Phe
50 55 60

TTC GTG GTT AAA AAA AGC GCA AAT GAA ATT GAT AGC TTG GGC TTA GGG 300
Phe Val Val Lys Lys Ser Ala Asn Glu Ile Asp Ser Leu Gly Leu Gly
65 70 75

GCG TGT TTG AAA CTC GCT GTG CAA GAA ATT TTA GAA AAT GGT TGC TCT 348
Ala Cys Leu Lys Leu Ala Val Gln Glu Ile Leu Glu Asn Gly Cys Ser
80 85 90 95

TTA GTT GAT GAA ATA AAA ATA GAC GGC AAC ACG GCG TTT GGC TTG AAC 396
Leu Val Asp Glu Ile Lys Ile Asp Gly Asn Thr Ala Phe Gly Leu Asn
100 105 110

AAA CGC TAC CCC CAT ATA CAA ACC ATC ATC AAG GGC GAT GAA ACA ATC 444
Lys Arg Tyr Pro His Ile Gln Thr Ile Ile Lys Gly Asp Glu Thr Ile
115 120 125

GCT CAA ATC GCT ATG GCG TCT GTT TTG GCG AAA GCT TTT AAG GAC AGA 492
Ala Gln Ile Ala Met Ala Ser Val Leu Ala Lys Ala Phe Lys Asp Arg
130 135 140

GAA ATG CTA GAG TTG CAC GCT TTG TTT AAG GAA TAC GGC TGG GAT AAG 540
Glu Met Leu Glu Leu His Ala Leu Phe Lys Glu Tyr Gly Trp Asp Lys
145 150 155

AAT TGC GGG TAT GGG ACT AAA CAA CAT ATA GAA GCG ATC ATT AAG CTA 588
Asn Cys Gly Tyr Gly Thr Lys Gln His Ile Glu Ala Ile Ile Lys Leu
160 165 170 175

GGG GCT ACG CCT TTT CAT CGG CAT AGC TTC ACG CTT AAA AAC CGC ATC 636
Gly Ala Thr Pro Phe His Arg His Ser Phe Thr Leu Lys Asn Arg Ile
180 185 190 
80 85 90

GCT ATT GAC ACC ATG AAA AAG ATT GAA GCG CAA ATT TAT ATG AGC AAG 520
Ala Ile Asp Thr Met Lys Lys Ile Glu Ala Gln Ile Tyr Met Ser Lys
95 100 105 

TAT AAA ACT TTA AGC GCG GTA GGC TTT TAATGGCAAT CTCTATTAAA AGCCCAA 574 
Tyr Lys Thr Leu Ser Ala Val Gly Phe 
110 115 

AAGAAATCAA AGCTCTAAGA AAAGCCG 601 

(2) INFORMATION FOR SEQ ID NO: 154:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

Met Phe Leu Leu Ile Ile Phe Phe Ala Tyr Phe Tyr Ser Ser Ile Val
1 5 10 15

Phe Asn Ser Lys Asp Ile Ala Asp Asn Leu Arg Arg Asn Gly Gly Tyr
20 25 30

Ile Pro Gly Leu Arg Pro Gly Glu Gly Thr Ser Ser Phe Leu Asn Ser
35 40 45

Val Ala Ser Lys Leu Thr Leu Trp Gly Ser Leu Tyr Leu Ala Leu Ile
50 55 60

Ser Thr Val Pro Trp Ile Leu Val Lys Ala Met Gly Val Pro Phe Tyr
65 70 75 80

Phe Gly Gly Thr Ala Val Leu Ile Val Val Gln Val Ala Ile Asp Thr
85 90 95

Met Lys Lys Ile Glu Ala Gln Ile Tyr Met Ser Lys Tyr Lys Thr Leu
100 105 110

Ser Ala Val Gly Phe
115

(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 64...675
325 330 335

Leu Cys Gln Lys Glu Gly Ile Ile Pro Ala Leu Glu Ser Ser His Ala
340 345 350

Leu Ala Tyr Ala Leu Lys Leu Ala Gln Lys Cys Glu Glu Glu Ser Ile
355 360 365

Ile Val Val Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val
370 375 380

Tyr Asn Ala Leu Lys Gly Gly Leu Lys 
385 390 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 197...547
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:153: 

TGGTGATGCA AAACCAAAAC AAGCGCATCA TGAATTACAT TCCTATTAAG TTGAATTTAA 60

GTGGGGTGAT CCCCCCTATT TTCGCTTCAG CTTTGCTCGT GTTCCCTTCT ACGATTTTGC 120

AGCAAGCCAC AAGCAACAAA ACCTTGCAAG CGGTTGCGNA TTTTTTAAGC CCGCAAGGTA 180

TGCGTATAAT ATTTTG ATG TTC TTG CTC ATC ATC TTT TTT GCT TAC TTT TAT 232
Met Phe Leu Leu Ile Ile Phe Phe Ala Tyr Phe Tyr
1 5 10

TCT TCT ATT GTG TTC AAT TCT AAG GAT ATT GCG GAT AAT TTG AGG CGT 280
Ser Ser Ile Val Phe Asn Ser Lys Asp Ile Ala Asp Asn Leu Arg Arg
15 20 25

AAT GGC GGG TAT ATT CCA GGG CTT AGG CCT GGA GAG GGG ACT TCA TCG 328
Asn Gly Gly Tyr Ile Pro Gly Leu Arg Pro Gly Glu Gly Thr Ser Ser
30 35 40 

TTT TTA AAT TCT GTA GCG AGT AAG CTC ACT TTG TGG GGT TCA TTG TAT 376 
Phe Leu Asn Ser Val Ala Ser Lys Leu Thr Leu Trp Gly Ser Leu Tyr 
45 50 55 60 

TTA GCG CTC ATT TCT ACC GTG CCT TGG ATT TTG GTT AAG GCT ATG GGC 424 
Leu Ala Leu Ile Ser Thr Val Pro Trp Ile Leu Val Lys Ala Met Gly
65 70 75

GTG CCT TTT TAC TTT GGA GGC ACA GCG GTG CTG ATT GTG GTT CAA GTC 472
Val Pro Phe Tyr Phe Gly Gly Thr Ala Val Leu Ile Val Val Gln Val
(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 393 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 



Met Asn Lys Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser
1 5 10 15

Glu Leu Leu Val Pro Ala Leu Arg Glu Leu Glu Gln Ala Phe Asp Ala
20 25 30

Cys Leu Lys Asp Glu Lys Phe Gln Lys Glu Tyr Phe Arg Leu Leu Lys
35 40 45

Asp Phe Val Gly Arg Pro Ser Pro Leu Thr Leu Cys Gln Asn Ile Val
50 55 60

Ser Asn Pro Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu Ile His
65 70 75 80

Gly Gly Ala His Lys Thr Asn Gln Ala Leu Gly Gln Ala Leu Leu Ala
85 90 95

Lys Lys Met Gly Lys Thr Arg Ile Ile Ala Glu Thr Gly Ala Gly Gln
100 105 110

His Gly Val Ala Thr Ala Ile Ala Cys Ala Leu Leu Asn Leu Lys Cys
115 120 125

Val Val Phe Met Gly Ser Lys Asp Ile Lys Arg Gln Glu Met Asn Val
130 135 140

Phe Arg Met His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly
145 150 155 160

Ser Ala Thr Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala
165 170 175

Ser Ser Tyr Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro
180 185 190

His Pro Tyr Pro Thr Met Val Lys Thr Phe Gln Lys Met Ile Gly Asp
195 200 205

Glu Val Lys Ser Gln Ile Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr
210 215 220

Val Ile Ala Cys Val Gly Gly Gly Ser Asn Ala Ile Gly Ile Phe Ser
225 230 235 240

Ala Phe Leu Asn Asp Lys Glu Val Lys Leu Ile Gly Val Glu Pro Ala
245 250 255

Gly Leu Gly Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly
260 265 270

Arg Val Gly Ile Leu His Gly Asn Lys Thr Tyr Leu Leu Gln Asp Asp
275 280 285

Glu Gly Gln Ile Ala Glu Ser His Ser Ile Ser Ala Gly Leu Asp Tyr
290 295 300

Pro Gly Val Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala
305 310 315 320

Val Tyr Glu Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu
AAG GAC ACG CAT TAT TTG CTA GGC ACA GCC GCC GGG CCA CAC CCT TAC 632
Lys Asp Thr His Tyr Leu Leu Gly Thr Ala Ala Gly Pro His Pro Tyr 
180 185 190 195 

CCC ACA ATG GTT AAA ACC TTT CAA AAA ATG ATA GGC GAT GAG GTT AAA 680
Pro Thr Met Val Lys Thr Phe Gln Lys Met Ile Gly Asp Glu Val Lys
200 205 210

AGC CAG ATT TTA GAA AAA GAA AAC CGC TTG CCT GAT TAT GTG ATC GCA 728
Ser Gln Ile Leu Glu Lys Glu Asn Arg Leu Pro Asp Tyr Val Ile Ala
215 220 225

TGC GTT GGA GGG GGG TCT AAC GCT ATA GGG ATA TTC AGC GCA TTT TTA 776
Cys Val Gly Gly Gly Ser Asn Ala Ile Gly Ile Phe Ser Ala Phe Leu
230 235 240

AAC GAC AAA GAA GTT AAA CTC ATA GGC GTA GAG CCG GCG GGT TTA GGG 824
Asn Asp Lys Glu Val Lys Leu Ile Gly Val Glu Pro Ala Gly Leu Gly
245 250 255 

CTA GAA ACC AAT AAG CAT GGG GCG ACT TTG AAT AAG GGG CGT GTG GGG 872 
Leu Glu Thr Asn Lys His Gly Ala Thr Leu Asn Lys Gly Arg Val Gly 
260 265 270 275 

ATT TTG CAT GGG AAT AAA ACC TAT CTT TTA CAA GAT GAT GAA GGC CAG 920
Ile Leu His Gly Asn Lys Thr Tyr Leu Leu Gln Asp Asp Glu Gly Gln
280 285 290

ATT GCA GAA AGC CAT AGC ATT AGC GCC GGG CTT GAT TAT CCA GGA GTG 968
Ile Ala Glu Ser His Ser Ile Ser Ala Gly Leu Asp Tyr Pro Gly Val
295 300 305

GGG CCA GAA CAC AGC TAT TTA AAA GAA AGT GGG CGT GCG GTT TAT GAA 1016
Gly Pro Glu His Ser Tyr Leu Lys Glu Ser Gly Arg Ala Val Tyr Glu
310 315 320

AGC GCA AGC GAT GCT GAA GCG CTA GAA GCC TTC AAG TTG TTG TGC CAA 1064
Ser Ala Ser Asp Ala Glu Ala Leu Glu Ala Phe Lys Leu Leu Cys Gln
325 330 335

AAA GAA GGC ATT ATC CCA GCG CTA GAA AGC TCA CAC GCC TTA GCG TAT 1112
Lys Glu Gly Ile Ile Pro Ala Leu Glu Ser Ser His Ala Leu Ala Tyr
340 345 350 355

GCC TTA AAG CTC GCT CAA AAA TGC GAA GAA GAA AGC ATC ATC GTA GTG 1160
Ala Leu Lys Leu Ala Gln Lys Cys Glu Glu Glu Ser Ile Ile Val Val
360 365 370

AAT TTA AGC GGC AGA GGG GAT AAG GAT TTA AGC ACC GTT TAT AAC GCT 1208
Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Ser Thr Val Tyr Asn Ala
375 380 385

TTA AAA GGA GGT TTA AAA TGAGGTATCA AAACATGTTT GAAACCTTAA AAA 1259
Leu Lys Gly Gly Leu Lys
390 
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

TAAGGATAAA ATCAAGCGAT TAGCCCGAAT TTTAAGAGAG TATTAAG ATG AAT AAA 56

Met Asn Lys 
1 

AAA GCG TAT TTT GGG GAG TTT GGA GGG AGT TTT GTT TCG GAG TTG TTA 104 
Lys Ala Tyr Phe Gly Glu Phe Gly Gly Ser Phe Val Ser Glu Leu Leu 
5 10 15 

GTG CCT GCA TTA AGA GAA TTA GAA CAG GCG TTT GAT GCG TGT TTG AAA 152 
Val Pro Ala Leu Arg Glu Leu Glu Gln Ala Phe Asp Ala Cys Leu Lys
20 25 30 35

GAT GAA AAA TTC CAA AAA GAA TAT TTT CGT CTT TTA AAG GAT TTT GTG 200
Asp Glu Lys Phe Gln Lys Glu Tyr Phe Arg Leu Leu Lys Asp Phe Val
40 45 50

GGC CGT CCT AGC CCT TTA ACC TTG TGT CAA AAT ATC GTT TCT AAC CCT 248
Gly Arg Pro Ser Pro Leu Thr Leu Cys Gln Asn Ile Val Ser Asn Pro
55 60 65

AAA GTC AAG CTT TAT TTA AAA CGA GAG GAT TTA ATC CAT GGC GGG GCG 296
Lys Val Lys Leu Tyr Leu Lys Arg Glu Asp Leu Ile His Gly Gly Ala
70 75 80

CAT AAG ACT AAT CAA GCC TTA GGG CAA GCC CTT TTA GCG AAA AAA ATG 344
His Lys Thr Asn Gln Ala Leu Gly Gln Ala Leu Leu Ala Lys Lys Met
85 90 95

GGT AAA ACA AGG ATC ATC GCT GAA ACA GGC GCC GGT CAG CAT GGC GTG 392
Gly Lys Thr Arg Ile Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val
100 105 110 115

GCG ACG GCT ATC GCT TGC GCA TTA TTG AAC TTA AAA TGC GTG GTT TTT 440
Ala Thr Ala Ile Ala Cys Ala Leu Leu Asn Leu Lys Cys Val Val Phe
120 125 130

ATG GGA TCT AAA GAC ATC AAG CGC CAG GAA ATG AAT GTT TTT AGA ATG 488
Met Gly Ser Lys Asp Ile Lys Arg Gln Glu Met Asn Val Phe Arg Met
135 140 145 

CAC TTA TTA GGC GCT GAA GTG AGA GAG GTT AAT TCA GGG AGC GCG ACG 536 
His Leu Leu Gly Ala Glu Val Arg Glu Val Asn Ser Gly Ser Ala Thr 
150 155 160 

CTT AAA GAC GCT GTG AAT GAA GCC TTA AGA GAT TGG GCG AGC AGT TAC 584
Leu Lys Asp Ala Val Asn Glu Ala Leu Arg Asp Trp Ala Ser Ser Tyr 
165 170 175 
900 905 910

Ile Pro Thr Asn Gly Val Ser Asp Thr Asn Asn Leu Ile Asn Leu Leu
915 920 925

Thr Glu Phe Ile Lys Thr Ala Gly Phe Ile Gln Asn Asn Asp Ser Ser
930 935 940

Val Ser Thr Ser Leu Thr Ser Ala Phe Gln Ala Ile Thr Ser Ala Ile
945 950 955 960

Ser Gln Gly Phe Gln Ala Leu Gln Asn Asp Ile Ser Pro Asn Ala Ile
965 970 975

Leu Thr Leu Leu Gln Glu Ile Thr Ser Asn Thr Thr Thr Ile Gln Ser
980 985 990

Phe Ser Gln Thr Leu Arg Gln Leu Leu Gly Asp Lys Thr Phe Phe Met
995 1000 1005

Ala Gln Gln Lys Leu Ile Asp Ala Met Ile Asn Ala Arg Asn Gln Val
1010 1015 1020

Gln Asn Ala Gln Asn Gln Ala Asn Asn Tyr Gly Ser Gln Pro Val Leu
1025 1030 1035 1040

Ser Gln Tyr Ala Ala Ala Lys Ser Thr Gln His Gly Met Ser Asn Gly
1045 1050 1055

Leu Gly Val Gly Leu Gly Tyr Lys Tyr Phe Phe Gly Lys Ala Arg Lys
1060 1065 1070

Leu Gly Leu Arg His Tyr Phe Phe Phe Asp Tyr Gly Phe Ser Glu Ile
1075 1080 1085

Gly Leu Ala Asn Gln Ser Val Lys Ala Asn Ile Phe Ala Tyr Gly Val
1090 1095 1100

Gly Thr Asp Phe Leu Trp Asn Leu Phe Arg Arg Thr Tyr Asn Thr Lys
1105 1110 1115 1120

Ala Leu Asn Phe Gly Leu Phe Ala Gly Val Gln Leu Gly Gly Ala Thr
1125 1130 1135

Trp Leu Ser Ser Leu Arg Gln Gln Ile Ile Asp Asn Trp Gly Ser Ala
1140 1145 1150

Asn Asp Ile His Ser Thr Asn Phe Gln Val Ala Leu Asn Phe Gly Val
1155 1160 1165

Arg Thr Asn Phe Ala Glu Phe Lys Arg Phe Ala Lys Lys Phe His Asn
1170 1175 1180

Gln Gly Val Ile Ser Gln Lys Ser Val Glu Phe Gly Ile Lys Val Pro
1185 1190 1195 1200

Leu Ile Asn Gln Ala Tyr Leu Asn Ser Ala Gly Ala Asp Val Ser Tyr
1205 1210 1215

Arg Arg Leu Tyr Thr Phe Tyr Ile Asn Tyr Ile Met Gly Phe
1220 1225 1230 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1259 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 48...1226
Val Gln Thr Ile Asn Gly Lys Glu Gln Ile Gly Val Asn Ser Phe Asn
465 470 475 480

Leu Val Ser Gln Val Trp Ser Val Tyr Asn Ser Leu Lys Thr Ser Glu
485 490 495

Glu Asn Leu Gln Lys Asn Ala Asn Ile Leu Cys Ala Asn Gly Thr Gln
500 505 510

Ser Gly Thr Ser Ser Cys Asn Ser Ser Ser Gly Gly Leu Ser Ile Ser
515 520 525

Gly Asn Ala Gln Leu Gln Asn Ile Leu Ser Pro Thr Ser Gly Thr Thr
530 535 540

Thr Asn Thr Gln Ala Lys Ser Asn Ala Pro Lys Leu Lys Ala Met Val
545 550 555 560

Val Val Asn Asn Glu Glu Glu Ala Lys Thr Ala Asn Leu Ala Gln Ser
565 570 575

Ser Gly Thr Thr Thr Gln Ser Pro Asn Ser Thr Val Met Gly Ala Leu
580 585 590

Asn Thr Val Leu Gln Asn Val Ser Asn Phe Gln Gln Ser Ile Gln Asn
595 600 605

Ala Phe Gln Asn Gln Glu Ser Asn Ile Gln Ala Trp Ala Asn Ala Ile
610 615 620

Tyr Asn Thr Asn Gly Ser Gln Ser Gln Glu Met Thr Pro Asn Asn Asn
625 630 635 640

Gln Asp Leu Arg Ile Gln Leu Arg Ala Asn Phe Tyr Gln Leu Ile Asn
645 650 655

Thr Ile Asn Gln Gln Val Pro Thr Asp Met Asn Ala Leu Ile Asn Gln
660 665 670

Ser Gln Gln Thr Gln Gln Thr Ser Gly Ser Ala Ser Asn Asn Asn Ala
675 680 685

Cys Ala Ser Gly Met Ser Gly Ser Asn Gly Asn Trp Cys Tyr Gln Gln
690 695 700

Trp Ser Asp Ser Lys Ala Tyr Tyr Ser Gly Leu Gln Ser Ala Leu Gly
705 710 715 720

Tyr Gln Thr Gln Ala Thr Thr Gln Ser Gly Ser Asn Gly Gly Asn Ser
725 730 735

Ile Thr Tyr Asn Val Gln Gln Ile Thr Leu Thr Ser Asn Gly Leu Leu
740 745 750

Asn Gln Ile Ile Thr Asn Leu Lys Ser Val Asn Gly Gly Asn Gly Ala
755 760 765

Ser Gly Thr Gly Ser Gly Asn Gly Thr Ser Gln Ile Asn Thr Ala Tyr
770 775 780

Gln Met Leu Thr Asp Ala Ser Asp Gly Lys Leu Gly Thr Tyr Ser Ser
785 790 795 800

Ser Ser Gly Ser Asn Asn Gly Tyr Thr Pro Cys Asn Ser Thr Asn Gly
805 810 815

Ser Asn Lys Thr Ser Gly Asn Asn Cys Tyr Glu Pro Asn Lys Gln Gln
820 825 830

Asn Ala Thr Thr Ala Thr Ala Thr Thr Asp Ser Asn Leu Gln Lys Val
835 840 845

Tyr Asn Asp Ala Gln Lys Ile Ala Asn Ile Ile Ala Ser Ser Gly Asn
850 855 860

Asn Lys Gly Val Glu Asn Gly Leu Lys Gln Phe Phe Glu Ala Leu Lys
865 870 875 880

Asn Asn Ser Ser Ser Leu Ser Asn Leu Cys Gly Asn Gly Ser Ser Gly
885 890 895

Ser Ser Gly Thr Thr Cys Ser Gly Trp Leu Ile Asn Leu Leu Gly Ala
20 25 30

Gly Gly Thr Gln Gln Phe Ile Asn Asn Lys Gln Leu Leu Glu Asn Gln
35 40 45

Asn Ile Ile Asn Ser Val Thr Gln Ser Ala Ile Asn Ile Ala Gly Pro
50 55 60

Thr Thr Gly Leu Ile Thr Leu Ser Ser Gln Thr Val Ile Asp Ala Leu
65 70 75 80

Gly Tyr Gly Val Ser Asn Thr Val Gly Asn Gln Leu Glu Gly Ile Ser
85 90 95

Asn Ile Leu Asn Gln Ile Gly Lys Arg Lys Asp Phe Tyr Ser Ser Arg
100 105 110

Gln Ile Ser Ser Ile Ser Gln Gln Ile Ile Gly Leu Lys Gly Ser Ser
115 120 125

Asp Pro Leu Lys Ala His Ser Ser Gln Ile Thr Ala Lys Leu Leu Ser
130 135 140

Asn Thr Gln Ser Ala Phe Asp Gln Gly Ile Ala Leu Ser Thr Asn Ile
145 150 155 160

Ile Ser Ser Ile Asn Ser Leu Asn Pro Ser Asn Asn Thr Gln Glu Val
165 170 175

Lys Lys Gln Leu Gln Asn Thr Ala Gln Ser Met Thr Glu Leu Leu Gln
180 185 190

Gln Ile Glu His Ser Ile Thr Lys Thr Thr Ser Thr Thr Tyr Ala Gln
195 200 205

Ser Leu Leu Ser Asn Leu Thr Asp Ala Val Asn Ala Ser Ser Asn Asn
210 215 220

Thr Ala Tyr Val Ser Ala Leu Val Asn Ala Leu Asn Thr Leu Gly Val
225 230 235 240

Gly Val Phe Pro Thr Thr Thr Thr Thr His Val Val Leu Asn Pro Pro
245 250 255

Gly Gln Val Val Phe Tyr Pro Thr Asn Ser Ile Leu Gly Ser Thr Ser
260 265 270

Ser Asn Ser Asn Asn Gln Gln Gln Tyr Asn Asn Thr Leu Leu Met Asn
275 280 285

Thr Leu Gln Gly Thr Leu Ser Ala Asn Thr Gln Asn Asn Pro Asn Gly
290 295 300

Cys Ala Asn Gln Val Gln Cys Leu Glu Gln Phe Ile Gln Asn Leu Ala
305 310 315 320

Pro Leu Ala Ala Thr Pro Thr Ser Asn Asn Gln Ala Asn Gln Gln Val
325 330 335

Gln Ala Ile Ala Gln Lys Leu Gln Ser Val Ala Ile Asn Thr Leu Asp
340 345 350

Asn Asn Ala Ile Asn Asn Thr Thr Tyr Asn Leu Asn Asn Leu His Asn
355 360 365

Ala Leu Asn Phe Gln Ala Tyr Glu Ser Thr Ile Glu Gln Tyr Asn Asn
370 375 380

Ala Leu Lys Gln Ile Ser Trp Ile Ser Phe Thr Glu Pro Lys Asn Leu
385 390 395 400

Leu Lys Asn Thr Ser Asn Asn Tyr Gln Ile Gly Thr Val Thr Asn Ala
405 410 415

Gln Gly Gln Asn Ile Ser Ala Tyr Asp Cys Met Thr Ala Thr Gly Ser
420 425 430

Leu Ser Ser Asn Ala Ser Ser Gly Ile Ser Cys Ser Ala Thr Ser Ser
435 440 445

Thr Ser Ser Thr Asn Ser Phe Asp Asn Ser Leu Val Ala Thr Ser Lys 
450 455 460 
GCC AAT CAA AGC GTG AAA GCG AAT ATC TTT GCT TAT GGG GTA GGC ACG 3368
Ala Asn Gln Ser Val Lys Ala Asn Ile Phe Ala Tyr Gly Val Gly Thr
1095 1100 1105 

GAT TTT TTA TGG AAC TTA TTC AGG AGG ACT TAC AAC ACT AAA GCG TTG 3416
Asp Phe Leu Trp Asn Leu Phe Arg Arg Thr Tyr Asn Thr Lys Ala Leu
1110 1115 1120 

AAT TTT GGG CTA TTT GCT GGG GTC CAA CTG GGC GGC GCA ACC TGG CTT 3464
Asn Phe Gly Leu Phe Ala Gly Val Gln Leu Gly Gly Ala Thr Trp Leu
1125 1130 1135

AGC TCC TTA AGG CAA CAA ATC ATT GAC AAC TGG GGG AGT GCT AAT GAC 3512
Ser Ser Leu Arg Gln Gln Ile Ile Asp Asn Trp Gly Ser Ala Asn Asp
1140 1145 1150

ATC CAT TCA ACG AAT TTT CAA GTG GCG CTG AAT TTT GGG GTG CGC ACC 3560
Ile His Ser Thr Asn Phe Gln Val Ala Leu Asn Phe Gly Val Arg Thr
1155 1160 1165 1170

AAC TTC GCG GAG TTT AAG CGT TTT GCT AAG AAA TTC CAC AAT CAA GGG 3608
Asn Phe Ala Glu Phe Lys Arg Phe Ala Lys Lys Phe His Asn Gln Gly
1175 1180 1185

GTC ATC AGC CAA AAG AGC GTG GAA TTT GGG ATC AAA GTG CCT CTC ATC 3656
Val Ile Ser Gln Lys Ser Val Glu Phe Gly Ile Lys Val Pro Leu Ile
1190 1195 1200

AAT CAA GCG TAT TTG AAT AGC GCT GGA GCT GAT GTG AGT TAC AGG AGG 3704
Asn Gln Ala Tyr Leu Asn Ser Ala Gly Ala Asp Val Ser Tyr Arg Arg
1205 1210 1215

CTT TAT ACT TTT TAT ATC AAT TAC ATC ATG GGG TTT TAAAAAAGGG TGTGTC 3756
Leu Tyr Thr Phe Tyr Ile Asn Tyr Ile Met Gly Phe
1220 1225 1230 

ATGGAAATCT TACAATTCAT CGGCTATGGG AATATGG 3793 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Met Ile Lys Lys Ala Arg Lys Phe Ile Pro Phe Phe Leu Ile Gly Ser
1 5 10 15

Leu Leu Ala Glu Asp Asn Gly Trp Tyr Met Ser Val Gly Tyr Gln Ile
GGC GTT GAA AAC GGC TTA AAA CAA TTC TTT GAA GCG TTA AAA AAT AAT 2696
Gly Val Glu Asn Gly Leu Lys Gln Phe Phe Glu Ala Leu Lys Asn Asn
870 875 880 

AGC AGC AGT CTC AGT AAT TTA TGT GGT AAT GGT AGT AGC GGT AGT AGT 2744
Ser Ser Ser Leu Ser Asn Leu Cys Gly Asn Gly Ser Ser Gly Ser Ser
885 890 895

GGC ACT ACT TGC TCC GGT TGG CTT ATC AAC CTT TTA GGG GCA ATC CCC 2792
Gly Thr Thr Cys Ser Gly Trp Leu Ile Asn Leu Leu Gly Ala Ile Pro
900 905 910

ACC AAT GGA GTG AGC GAT ACG AAT AAT TTA ATT AAT CTG CTC ACT GAA 2840
Thr Asn Gly Val Ser Asp Thr Asn Asn Leu Ile Asn Leu Leu Thr Glu
915 920 925 930

TTC ATT AAA ACC GCC GGG TTT ATC CAA AAT AAT GAT AGT AGT GTA TCT 2888
Phe Ile Lys Thr Ala Gly Phe Ile Gln Asn Asn Asp Ser Ser Val Ser
935 940 945

ACT AGT CTT ACA AGC GCT TTT CAA GCC ATT ACG AGC GCT ATT TCT CAA 2936
Thr Ser Leu Thr Ser Ala Phe Gln Ala Ile Thr Ser Ala Ile Ser Gln
950 955 960

GGG TTT CAA GCC TTA CAA AAC GAT ATT AGC CCT AAT GCG ATT TTA ACC 2984
Gly Phe Gln Ala Leu Gln Asn Asp Ile Ser Pro Asn Ala Ile Leu Thr
965 970 975

TTG CTC CAA GAG ATT ACT TCT AAC ACC ACC ACC ATT CAG TCA TTC TCG 3032
Leu Leu Gln Glu Ile Thr Ser Asn Thr Thr Thr Ile Gln Ser Phe Ser
980 985 990

CAA ACC TTA CGG CAG CTT TTA GGG GAT AAA ACA TTC TTT ATG GCG CAA 3080
Gln Thr Leu Arg Gln Leu Leu Gly Asp Lys Thr Phe Phe Met Ala Gln
995 1000 1005 1010

CAA AAG CTC ATT GAT GCG ATG ATT AAC GCC AGA AAT CAG GTT CAA AAC 3128
Gln Lys Leu Ile Asp Ala Met Ile Asn Ala Arg Asn Gln Val Gln Asn
1015 1020 1025

GCG CAA AAT CAA GCC AAT AAC TAC GGC TCT CAA CCC GTT TTA AGC CAG 3176
Ala Gln Asn Gln Ala Asn Asn Tyr Gly Ser Gln Pro Val Leu Ser Gln
1030 1035 1040

TAT GCG GCC GCT AAA AGC ACC CAA CAT GGC ATG AGC AAT GGT TTA GGG 3224
Tyr Ala Ala Ala Lys Ser Thr Gln His Gly Met Ser Asn Gly Leu Gly
1045 1050 1055

GTT GGT TTG GGC TAT AAA TAC TTC TTT GGT AAA GCG AGA AAA TTA GGC 3272
Val Gly Leu Gly Tyr Lys Tyr Phe Phe Gly Lys Ala Arg Lys Leu Gly
1060 1065 1070

CTT AGG CAT TAT TTT TTC TTT GAT TAC GGC TTT AGT GAA ATA GGC CTA 3320
Leu Arg His Tyr Phe Phe Phe Asp Tyr Gly Phe Ser Glu Ile Gly Leu
1075 1080 1085 1090 
TTA CGC ATC CAA TTG AGG GCG AAT TTT TAC CAG CTC ATC AAT ACC ATT 2024
Leu Arg Ile Gln Leu Arg Ala Asn Phe Tyr Gln Leu Ile Asn Thr Ile
645 650 655

AAC CAG CAA GTG CCT ACA GAC ATG AAT GCT TTA ATT AAT CAA AGC CAA 2072
Asn Gln Gln Val Pro Thr Asp Met Asn Ala Leu Ile Asn Gln Ser Gln
660 665 670 

CAA ACC CAA CAA ACA AGC GGA TCA GCA AGC AAT AAT AAC GCA TGC GCG 2120
Gln Thr Gln Gln Thr Ser Gly Ser Ala Ser Asn Asn Asn Ala Cys Ala
675 680 685 690

AGT GGA ATG AGT GGG AGT AAT GGT AAC TGG TGC TAT CAG CAA TGG TCC 2168
Ser Gly Met Ser Gly Ser Asn Gly Asn Trp Cys Tyr Gln Gln Trp Ser
695 700 705 

GAT TCT AAG GCT TAT TAC AGC GGG TTG CAA AGC GCT TTA GGG TAT CAA 2216
Asp Ser Lys Ala Tyr Tyr Ser Gly Leu Gln Ser Ala Leu Gly Tyr Gln
710 715 720

ACG CAA GCG ACA ACT CAA AGC GGG AGC AAT GGT GGG AAC AGC ATC ACC 2264
Thr Gln Ala Thr Thr Gln Ser Gly Ser Asn Gly Gly Asn Ser Ile Thr
725 730 735

TAC AAT GTC CAA CAA ATC ACG CTC ACT AGT AAT GGT TTG CTC AAC CAA 2312
Tyr Asn Val Gln Gln Ile Thr Leu Thr Ser Asn Gly Leu Leu Asn Gln
740 745 750

ATC ATC ACA AAT CTT AAG AGC GTT AAT GGA GGG AAT GGG GCG AGT GGT 2360
Ile Ile Thr Asn Leu Lys Ser Val Asn Gly Gly Asn Gly Ala Ser Gly
755 760 765 770

ACA GGC AGT GGG AAT GGC ACC AGT CAA ATC AAC ACA GCC TAC CAG ATG 2408
Thr Gly Ser Gly Asn Gly Thr Ser Gln Ile Asn Thr Ala Tyr Gln Met
775 780 785 

CTC ACA GAC GCC AGC GAT GGG AAA TTA GGG ACT TAT AGT AGT AGT AGT 2456 
Leu Thr Asp Ala Ser Asp Gly Lys Leu Gly Thr Tyr Ser Ser Ser Ser 
790 795 800 

GGC AGT AAT AAC GGC TAT ACG CCA TGC AAT AGC ACC AAT GGG AGC AAT 2504 
Gly Ser Asn Asn Gly Tyr Thr Pro Cys Asn Ser Thr Asn Gly Ser Asn 
805 810 815 

AAA ACG AGT GGG AAC AAT TGT TAT GAA CCC AAC AAA CAA CAA AAC GCC 2552 
Lys Thr Ser Gly Asn Asn Cys Tyr Glu Pro Asn Lys Gin Gin Asn Ala 
820 825 830 

ACC ACC GCA ACC GCC ACA ACC GAC AGC AAT TTA CAA AAA GTC TAT AAT 2600 
Thr Thr Ala Thr Ala Thr Thr Asp Ser Asn Leu Gln Lys Val Tyr Asn
835 840 845 850 

GAC GCC CAA AAA ATA GCC AAC ATT ATC GCC AGC TCT GGG AAC AAT AAA 2648
Asp Ala Gln Lys Ile Ala Asn Ile Ile Ala Ser Ser Gly Asn Asn Lys
855 860 865 
CAA AAT ATC AGC GCC TAT GAT TGC ATG ACT GCT ACC GGA AGC CTT TCT 1352
Gln Asn Ile Ser Ala Tyr Asp Cys Met Thr Ala Thr Gly Ser Leu Ser
420 425 430

AGC AAT GCT TCT AGC GGG ATT TCA TGC TCA GCC ACA AGC TCC ACA AGT 1400
Ser Asn Ala Ser Ser Gly Ile Ser Cys Ser Ala Thr Ser Ser Thr Ser
435 440 445 450

TCC ACA AAT AGC TTT GAC AAT TCT TTA GTC GCT ACC TCC AAA GTC CAA 1448
Ser Thr Asn Ser Phe Asp Asn Ser Leu Val Ala Thr Ser Lys Val Gln
455 460 465

ACC ATC AAC GGC AAA GAG CAG ATC GGC GTG AAT TCT TTT AAC CTT GTC 1496
Thr Ile Asn Gly Lys Glu Gln Ile Gly Val Asn Ser Phe Asn Leu Val
470 475 480

TCT CAA GTG TGG AGC GTT TAT AAT TCT TTA AAA ACT TCA GAA GAA AAT 1544
Ser Gln Val Trp Ser Val Tyr Asn Ser Leu Lys Thr Ser Glu Glu Asn
485 490 495

TTG CAA AAA AAC GCC AAT ATT TTA TGC GCT AAT GGG ACG CAA TCT GGG 1592
Leu Gln Lys Asn Ala Asn Ile Leu Cys Ala Asn Gly Thr Gln Ser Gly
500 505 510

ACA AGC TCA TGC AAT AGC TCT TCA GGG GGT TTG AGC ATC AGC GGG AAC 1640
Thr Ser Ser Cys Asn Ser Ser Ser Gly Gly Leu Ser Ile Ser Gly Asn
515 520 525 530

GCC CAA TTG CAA AAT ATT TTA AGC CCT ACT AGT GGG ACT ACC ACT AAT 1688
Ala Gln Leu Gln Asn Ile Leu Ser Pro Thr Ser Gly Thr Thr Thr Asn
535 540 545

ACT CAA GCT AAA AGC AAC GCT CCC AAA CTA AAA GCG ATG GTG GTG GTG 1736
Thr Gln Ala Lys Ser Asn Ala Pro Lys Leu Lys Ala Met Val Val Val
550 555 560

AAT AAT GAA GAA GAA GCT AAA ACG GCC AAT TTA GCC CAA AGC AGC GGG 1784
Asn Asn Glu Glu Glu Ala Lys Thr Ala Asn Leu Ala Gln Ser Ser Gly
565 570 575

ACA ACC ACA CAA TCT CCT AAC AGC ACG GTG ATG GGA GCT TTA AAC ACC 1832
Thr Thr Thr Gln Ser Pro Asn Ser Thr Val Met Gly Ala Leu Asn Thr
580 585 590

GTG TTG CAA AAT GTC AGC AAT TTC CAA CAA AGC ATT CAA AAC GCT TTT 1880
Val Leu Gln Asn Val Ser Asn Phe Gln Gln Ser Ile Gln Asn Ala Phe
595 600 605 610

CAA AAC CAA GAA AGT AAT ATC CAA GCT TGG GCG AAT GCG ATT TAT AAC 1928
Gln Asn Gln Glu Ser Asn Ile Gln Ala Trp Ala Asn Ala Ile Tyr Asn
615 620 625

ACT AAT GGG AGT CAG TCG CAA GAG ATG ACA CCT AAC AAT AAC CAA GAT 1976
Thr Asn Gly Ser Gln Ser Gln Glu Met Thr Pro Asn Asn Asn Gln Asp
630 635 640 
Glu His Ser Ile Thr Lys Thr Thr Ser Thr Thr Tyr Ala Gln Ser Leu
195 200 205 210 

CTC TCC AAT CTA ACC GAT GCG GTG AAT GCC TCT AGC AAT AAT ACC GCT 728
Leu Ser Asn Leu Thr Asp Ala Val Asn Ala Ser Ser Asn Asn Thr Ala
215 220 225

TAT GTG AGC GCT CTT GTT AAC GCT TTA AAC ACT TTA GGG GTA GGG GTT 776
Tyr Val Ser Ala Leu Val Asn Ala Leu Asn Thr Leu Gly Val Gly Val
230 235 240

TTC CCC ACC ACA ACC ACA ACG CAT GTG GTG TTA AAC CCA CCG GGA CAA 824
Phe Pro Thr Thr Thr Thr Thr His Val Val Leu Asn Pro Pro Gly Gln
245 250 255

GTC GTA TTC TAT CCA ACC AAT TCC ATT TTA GGC TCT ACT TCT TCA AAC 872
Val Val Phe Tyr Pro Thr Asn Ser Ile Leu Gly Ser Thr Ser Ser Asn
260 265 270

AGC AAT AAC CAA CAA CAA TAC AAC AAC ACC CTT TTA ATG AAC ACC TTA 920
Ser Asn Asn Gln Gln Gln Tyr Asn Asn Thr Leu Leu Met Asn Thr Leu
275 280 285 290

CAA GGG ACA TTA AGC GCT AAT ACT CAA AAT AAC CCC AAT GGT TGC GCC 968
Gln Gly Thr Leu Ser Ala Asn Thr Gln Asn Asn Pro Asn Gly Cys Ala
295 300 305

AAT CAA GTC CAG TGT TTG GAG CAA TTC ATC CAA AAT TTA GCC CCT TTA 1016
Asn Gln Val Gln Cys Leu Glu Gln Phe Ile Gln Asn Leu Ala Pro Leu
310 315 320

GCC GCA ACC CCC ACT TCA AAC AAC CAG GCC AAC CAG CAA GTC CAA GCC 1064
Ala Ala Thr Pro Thr Ser Asn Asn Gln Ala Asn Gln Gln Val Gln Ala
325 330 335

ATC GCT CAA AAG CTT CAA AGC GTT GCT ATC AAC ACT TTA GAC AAC AAT 1112
Ile Ala Gln Lys Leu Gln Ser Val Ala Ile Asn Thr Leu Asp Asn Asn
340 345 350

GCG ATC AAC AAC ACC ACC TAT AAT TTA AAC AAT TTG CAC AAC GCT TTG 1160
Ala Ile Asn Asn Thr Thr Tyr Asn Leu Asn Asn Leu His Asn Ala Leu
355 360 365 370

AAT TTC CAA GCC TAT GAA AGC ACG ATA GAA CAA TAC AAT AAC GCT TTA 1208
Asn Phe Gln Ala Tyr Glu Ser Thr Ile Glu Gln Tyr Asn Asn Ala Leu
375 380 385

AAA CAA ATT TCT TGG ATC AGT TTT ACT GAG CCT AAA AAC TTA CTC AAA 1256
Lys Gln Ile Ser Trp Ile Ser Phe Thr Glu Pro Lys Asn Leu Leu Lys
390 395 400

AAC ACT TCC AAT AAC TAC CAA ATC GGC ACC GTT ACC AAC GCT CAA GGG 1304
Asn Thr Ser Asn Asn Tyr Gln Ile Gly Thr Val Thr Asn Ala Gln Gly
405 410 415 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

TTATGTGGGC TACAACATAG GCTTTTGATT AAACAAAATA AGGGAAAAAT ATG ATA 56
Met Ile
1

AAA AAA GCT AGA AAA TTC ATA CCA TTC TTT TTA ATT GGC TCC CTC TTA 104
Lys Lys Ala Arg Lys Phe Ile Pro Phe Phe Leu Ile Gly Ser Leu Leu
5 10 15

GCT GAA GAC AAT GGC TGG TAT ATG TCT GTA GGC TAT CAA ATC GGT GGC 152
Ala Glu Asp Asn Gly Trp Tyr Met Ser Val Gly Tyr Gln Ile Gly Gly
20 25 30

ACG CAA CAA TTC ATC AAT AAC AAA CAA CTT TTA GAA AAT CAA AAT ATC 200
Thr Gln Gln Phe Ile Asn Asn Lys Gln Leu Leu Glu Asn Gln Asn Ile
35 40 45 50

ATC AAC AGC GTA ACC CAA AGC GCG ATC AAC ATT GCA GGG CCT ACT ACC 248
Ile Asn Ser Val Thr Gln Ser Ala Ile Asn Ile Ala Gly Pro Thr Thr
55 60 65

GGC CTT ATC ACT TTA AGC TCT CAA ACC GTC ATT GAC GCT TTA GGC TAT 296
Gly Leu Ile Thr Leu Ser Ser Gln Thr Val Ile Asp Ala Leu Gly Tyr
70 75 80

GGC GTG AGT AAC ACT GTT GGC AAC CAA TTA GAG GGC ATT TCT AAT ATC 344
Gly Val Ser Asn Thr Val Gly Asn Gln Leu Glu Gly Ile Ser Asn Ile
85 90 95

TTG AAT CAA ATT GGC AAA AGA AAA GAC TTT TAT TCT AGC CGT CAA ATC 392
Leu Asn Gln Ile Gly Lys Arg Lys Asp Phe Tyr Ser Ser Arg Gln Ile
100 105 110

TCT AGC ATT TCC CAA CAA ATC ATA GGG CTT AAA GGA AGC TCT GAT CCC 440
Ser Ser Ile Ser Gln Gln Ile Ile Gly Leu Lys Gly Ser Ser Asp Pro
115 120 125 130

TTA AAA GCC CAT TCT TCA CAG ATC ACA GCC AAA CTC CTT TCC AAC ACC 488
Leu Lys Ala His Ser Ser Gln Ile Thr Ala Lys Leu Leu Ser Asn Thr
135 140 145

CAA AGC GCG TTT GAT CAG GGC ATC GCG CTA AGC ACT AAC ATC ATT AGC 536
Gln Ser Ala Phe Asp Gln Gly Ile Ala Leu Ser Thr Asn Ile Ile Ser
150 155 160

TCT ATC AAT AGC CTA AAC CCT AGC AAC AAC ACC CAA GAG GTT AAA AAA 584
Ser Ile Asn Ser Leu Asn Pro Ser Asn Asn Thr Gln Glu Val Lys Lys
165 170 175

CAG CTC CAA AAC ACC GCG CAA TCC ATG ACA GAA TTG TTG CAA CAA ATT 632
Gln Leu Gln Asn Thr Ala Gln Ser Met Thr Glu Leu Leu Gln Gln Ile
180 185 190

GAA CAC AGC ATC ACT AAA ACC ACT AGC ACC ACT TAC GCG CAA TCC TTA 680




Thr Met Val Ser Ile Lys Pro Ile Val Asn Glu Ile Asn Thr Leu Leu
165 170 175

Lys Gly Tyr Gly Phe Ala Asn Phe Ser Leu Ala Cys Thr Glu Asp Glu
180 185 190

Lys Phe Tyr Arg Ile Gln Arg Glu Asp Gly Gln Leu Val Gly Glu Thr
195 200 205

Leu Ser Glu Gly Glu Val Thr Phe Ile Thr Phe Leu Tyr Tyr Tyr His
210 215 220

Leu Ala Lys Gly Ser Leu Glu Glu Asn Asp Ile Ser Lys Asn Lys Val
225 230 235 240

Leu Val Ile Asp Asp Pro Ile Ser Ser Leu Asp Ser Asn Ile Leu Phe
245 250 255

Ile Val Ser Val Leu Val Lys Asp Leu Met Lys Glu Ala Met Glu Glu
260 265 270

Lys Thr Asn Ile Lys Gln Val Ile Ile Leu Thr His Asn Thr Tyr Phe
275 280 285

Tyr Lys Glu Ile Thr Leu Glu Cys Asp Leu Lys Arg Tyr Gln Gly Lys
290 295 300

Tyr Ser Phe Trp Ile Ile Lys Lys Asp Asn Asn Val Ser Lys Ile Lys
305 310 315 320

Asp Tyr Lys Glu Asn Pro Ile Lys Asn Ser Tyr Glu Leu Leu Trp Gln
325 330 335

Glu Val Lys Gln Ala Lys Glu Asn Asn Ala Ser Trp Val Ser Leu Gln
340 345 350

Asn Val Met Arg Arg Ile Ile Glu Tyr Tyr Phe Arg Ile Leu Gly Gly
355 360 365

Phe Lys His Asn Asp Ser Leu Ser Glu Cys Phe Glu Asn Ile Glu Glu
370 375 380

Lys Arg Val Cys Asn Ser Phe Ile Ser Trp Phe Asn Asp Gly Ser His
385 390 395 400

Gly Ile Ser Asp Asp Leu Phe Met Gln Ser Gln Asp Thr Ser Ile Glu
405 410 415

Thr Tyr Leu Lys Val Phe Glu Lys Ile Phe Lys Glu Thr Gly His Glu
420 425 430

Ala His Tyr Lys Met Met Met Arg Met Lys
435 440 

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3793 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51...3740
(D) OTHER INFORMATION: 
GGT TTT AAA CAT AAT GAT AGC TTG AGT GAA TGT TTT GAA AAT ATT GAA 2114
Gly Phe Lys His Asn Asp Ser Leu Ser Glu Cys Phe Glu Asn Ile Glu
370 375 380

GAA AAA CGA GTG TGT AAT TCT TTC ATT TCA TGG TTT AAT GAT GGC TCT 2162
Glu Lys Arg Val Cys Asn Ser Phe Ile Ser Trp Phe Asn Asp Gly Ser
385 390 395

CAT GGG ATT TCA GAT GAT TTG TTT ATG CAA AGT CAA GAT ACA AGT ATT 2210
His Gly Ile Ser Asp Asp Leu Phe Met Gln Ser Gln Asp Thr Ser Ile
400 405 410 415

GAG ACA TAT TTA AAA GTT TTT GAA AAA ATA TTT AAA GAA ACC GGT CAT 2258
Glu Thr Tyr Leu Lys Val Phe Glu Lys Ile Phe Lys Glu Thr Gly His
420 425 430 

GAA GCT CAT TAT AAA ATG ATG ATG AGA ATG AAG TAATTGAATT AAAAACAAGG 2311 
Glu Ala His Tyr Lys Met Met Met Arg Met Lys 
435 440 

AATAACATGC GAATCGTATT TATGGGAACG 2341 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

Met Glu Asp Tyr Ala Ser Arg Thr Ala Gly Ala Leu Glu Arg Leu Asp
1 5 10 15

Lys Ile Val Glu Thr Glu Gln Lys Asn Gln Gln Thr Lys Leu Asp Thr
20 25 30

Glu Asn Leu Lys Ile Ile Ile Glu Thr Leu Arg Ser Lys Ile Asn Gly
35 40 45

Asn Gln Gln Lys Met Leu Asp Lys Ser Lys Glu Met Ser Arg Asn Phe
50 55 60

Lys Leu Asp Ser Thr Lys Asn Glu Ile Asp Ala Ile Lys Asp Leu Ile
65 70 75 80

Lys Lys Ala Asn Glu Gln Ile Ala Asn Tyr Asn Glu Met Ile Lys Asp
85 90 95

Ile Glu Lys Gln Lys Lys Ser Cys Lys Glu Gln Thr Trp Lys Phe Leu
100 105 110

Val Asn Glu Phe Lys Ser Asp Ile Gln Glu Tyr Asn Lys Lys Tyr Cys
115 120 125

Gly Leu Glu Lys Gly Ile Asn Asn Leu Glu Lys Ala Ile Ser Glu Asn
130 135 140

Gln Glu Glu Val Lys Lys Leu Glu Asn Glu Ile Lys Glu Leu Glu Lys
145 150 155 160 
AAT CAA GAA GAG GTA AAG AAA TTA GAA AAT GAA ATT AAG GAA TTA GAA 1442
Asn Gln Glu Glu Val Lys Lys Leu Glu Asn Glu Ile Lys Glu Leu Glu
145 150 155

AAA ACT ATG GTA AGC ATA AAG CCC ATT GTC AAT GAA ATC AAT ACG CTT 1490
Lys Thr Met Val Ser Ile Lys Pro Ile Val Asn Glu Ile Asn Thr Leu
160 165 170 175

TTA AAA GGG TAT GGA TTC GCG AAT TTT AGT TTG GCA TGC ACT GAA GAT 1538
Leu Lys Gly Tyr Gly Phe Ala Asn Phe Ser Leu Ala Cys Thr Glu Asp
180 185 190

GAA AAA TTT TAT CGT ATT CAA AGA GAA GAT GGT CAA TTA GTA GGA GAA 1586
Glu Lys Phe Tyr Arg Ile Gln Arg Glu Asp Gly Gln Leu Val Gly Glu
195 200 205

ACA CTG AGC GAG GGT GAA GTT ACT TTC ATC ACT TTC TTA TAT TAT TAT 1634
Thr Leu Ser Glu Gly Glu Val Thr Phe Ile Thr Phe Leu Tyr Tyr Tyr
210 215 220

CAT TTA GCA AAA GGC TCT TTA GAA GAG AAC GAT ATA TCA AAA AAT AAG 1682
His Leu Ala Lys Gly Ser Leu Glu Glu Asn Asp Ile Ser Lys Asn Lys
225 230 235

GTT TTA GTG ATT GAT GAC CCC ATT TCA AGT TTG GAT AGC AAT ATA TTG 1730
Val Leu Val Ile Asp Asp Pro Ile Ser Ser Leu Asp Ser Asn Ile Leu
240 245 250 255

TTT ATA GTG AGT GTT TTA GTT AAA GAT CTT ATG AAA GAA GCC ATG GAA 1778
Phe Ile Val Ser Val Leu Val Lys Asp Leu Met Lys Glu Ala Met Glu
260 265 270

GAA AAA ACA AAC ATC AAG CAA GTT ATT ATA CTA ACC CAC AAC ACA TAT 1826
Glu Lys Thr Asn Ile Lys Gln Val Ile Ile Leu Thr His Asn Thr Tyr
275 280 285

TTT TAC AAG GAA ATT ACA TTA GAA TGT GAT TTA AAA CGC TAT CAA GGG 1874
Phe Tyr Lys Glu Ile Thr Leu Glu Cys Asp Leu Lys Arg Tyr Gln Gly
290 295 300

AAA TAT TCT TTT TGG ATA ATT AAA AAG GAT AAT AAT GTT TCA AAA ATT 1922
Lys Tyr Ser Phe Trp Ile Ile Lys Lys Asp Asn Asn Val Ser Lys Ile
305 310 315

AAA GAT TAT AAA GAA AAT CCC ATT AAA AAT TCC TAT GAA TTG CTA TGG 1970
Lys Asp Tyr Lys Glu Asn Pro Ile Lys Asn Ser Tyr Glu Leu Leu Trp
320 325 330 335

CAA GAA GTA AAA CAA GCA AAA GAA AAT AAT GCT TCT TGG GTA TCT TTA 2018
Gln Glu Val Lys Gln Ala Lys Glu Asn Asn Ala Ser Trp Val Ser Leu
340 345 350

CAA AAT GTT ATG CGA AGA ATT ATT GAG TAT TAC TTT AGG ATT TTA GGC 2066
Gln Asn Val Met Arg Arg Ile Ile Glu Tyr Tyr Phe Arg Ile Leu Gly
355 360 365 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

ATTGTGAATT AGGAGTGAGC GTGAATAGTA ATGGCAATAA AGACAAACAA CAGCAGAATG 60 

TAAGCAGTGG GATTTCTCAA ATCTCATTAA AAAAGGTGGC AACTTTTGAT GAAAATGGGG 120 

CGAGTTTTGA GAATTTAAAT TCTATCAACT TTATTTATGG GGCTAATGGG AGCGGTAAGA 180

CAACCACTTC TAGTTTTTTA AAAAATCTAG CTGAAAATGG GATTGAAGAC AAGTTTGCTA 240

ATAGTAAAAT AGCATGGTAT AACAATGAGA GTTTAAAGAT TGAAGTTTAT AACAAGCAAT 300

TTAAAGAAGA GCAATTGAGA AACTCTCAAG TTAAAGGCAT TTTTACGCTC GGTAAAAAAA 360

CGAACGAGAA TTTAGAAAAA ATTGAAAGCA AGAAAGAATC AATAAACAAA GAGAATGAAA 420

AGAAAATAAA AAATGAAGCA AGCTTGCAAG TTTTAACACA AAAAAAGGAA AAGGAAGAAA 480

AGGATTTTGC TGATAGGTGT TGGGAAAAAC TTTATAAGAA AAATGAAGAG GATTTTAAAG 540

AAACGCTAGA AGGCTTTAAG CGTAAAGAGA AGTTTAAAGA AAAAATCCTT AAGGAATTTG 600

AAAACGATAA ATACAATCAA AGCGAAATAG TAGGGTTAGA AAAATTAAAG GAAAAAATTG 660

AGATTGTTTT TGGTGAAAAC CAAACAGAAT TGGCACTATT GGAATGCAAT TTAACAGATT 720

TTGATTTTAT TGAAAATCAT TCTATTTGGG AACAAAAAAT TGTAGGGAGT GGTGATGCAG 780

CCATTGCAGA TTTAATAAAA AGATTAAGCA ATGAAGATTG GGTAGCTCAA GGTAGAGAAT 840

ATATAAAAGA TAATAGTATA TGCCCTTTCT GTCAAAAAGA AACCATTACC GAAGAATTTA 900

AAAAACAACT AGAATCTTAT TTTGATACAA GTTATCAAGA ATCTATTGAA ACGATCAAGG 960

AAAAG ATG GAA GAC TAC GCA AGC AGA ACC GCT GGA GCA CTG GAG CGA CTT 1010 
Met Glu Asp Tyr Ala Ser Arg Thr Ala Gly Ala Leu Glu Arg Leu 
1 5 10 15

GAT AAG ATT GTT GAA ACA GAA CAG AAG AAT CAA CAA ACT AAA TTG GAC 1058
Asp Lys Ile Val Glu Thr Glu Gln Lys Asn Gln Gln Thr Lys Leu Asp
20 25 30

ACA GAA AAT TTG AAA ATA ATT ATT GAA ACT TTG AGA AGT AAA ATC AAT 1106
Thr Glu Asn Leu Lys Ile Ile Ile Glu Thr Leu Arg Ser Lys Ile Asn
35 40 45

GGG AAT CAG CAA AAG ATG CTT GAT AAA AGT AAA GAA ATG AGC AGA AAT 1154
Gly Asn Gln Gln Lys Met Leu Asp Lys Ser Lys Glu Met Ser Arg Asn
50 55 60

TTT AAG CTT GAT AGC ACT AAA AAC GAG ATA GAC GCA ATT AAA GAT TTG 1202
Phe Lys Leu Asp Ser Thr Lys Asn Glu Ile Asp Ala Ile Lys Asp Leu
65 70 75

ATT AAA AAG GCT AAT GAG CAA ATA GCC AAT TAT AAT GAG ATG ATA AAG 1250
Ile Lys Lys Ala Asn Glu Gln Ile Ala Asn Tyr Asn Glu Met Ile Lys
80 85 90 95

GAT ATT GAA AAA CAG AAA AAG AGT TGT AAG GAA CAA ACT TGG AAA TTT 1298
Asp Ile Glu Lys Gln Lys Lys Ser Cys Lys Glu Gln Thr Trp Lys Phe
100 105 110

CTA GTC AAT GAA TTT AAA AGT GAT ATA CAA GAA TAT AAT AAA AAG TAT 1346
Leu Val Asn Glu Phe Lys Ser Asp Ile Gln Glu Tyr Asn Lys Lys Tyr
115 120 125

TGC GGT TTG GAG AAA GGA ATA AAC AAT TTA GAG AAA GCA ATT AGT GAA 1394
Cys Gly Leu Glu Lys Gly Ile Asn Asn Leu Glu Lys Ala Ile Ser Glu
130 135 140
165 170
GTCAGCATTT AGTAAATAGC AGAGAA 616 
(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

Met Gly Leu Lys Asn Leu Ser Thr Leu Leu Val Phe Leu Phe Phe Cys
1 5 10 15

Leu Gly Cys Val Ser Asn Phe Asn Glu Asp Thr Tyr Thr Leu Asp Leu
20 25 30

Val Leu Glu Lys Lys Ile Gln Ala Ser Arg Lys Gly Glu Ile Thr Gln
35 40 45

Asp Asn Val Pro Ile Ile Thr Ala Ile Ala Thr His Leu Asn Asp Val
50 55 60

Asp Ser Gly Thr Tyr Tyr Asp His Glu Tyr Phe Leu Val Glu Ile Phe
65 70 75 80

Thr Gln Asn Asn Asp Trp Ile Asp Asp Gly Tyr Ile Ser Tyr Glu Leu
85 90 95

Phe Gly Thr Lys Pro Ile Gly Ser Glu Pro Leu Trp Val Arg Glu Ile
100 105 110

Thr Lys Asp Glu Phe Asp Gly Ile Leu Glu Thr Thr Asn Arg Trp Ser
115 120 125

Arg Ala Phe Leu Leu Ala Phe Asn Lys Leu Asp Tyr Leu Ala Val Gln
130 135 140

Glu Ala Lys Leu Glu Leu Asp Ala Tyr Ser Leu Gly Lys Ile Val Phe
145 150 155 160

Asn Phe Ala Tyr Gln Val Pro Leu Pro Gln Phe
165 170 

(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 966...2291
(D) OTHER INFORMATION: 
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...563
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:

TATAATATAG ATTTTATTTT AGCTAAAAAT GGCATGGGTT TTAGCAAGGA ATG GGC 56
Met Gly
1

TTG AAA AAT CTC TCA ACA CTT CTG GTG TTT TTA TTC TTT TGT TTA GGG 104
Leu Lys Asn Leu Ser Thr Leu Leu Val Phe Leu Phe Phe Cys Leu Gly
5 10 15

TGT GTG AGC AAT TTT AAT GAA GAC ACT TAC ACG CTA GAC TTA GTT TTA 152
Cys Val Ser Asn Phe Asn Glu Asp Thr Tyr Thr Leu Asp Leu Val Leu
20 25 30

GAA AAA AAG ATC CAA GCC AGC AGG AAA GGT GAA ATC ACC CAA GAT AAT 200
Glu Lys Lys Ile Gln Ala Ser Arg Lys Gly Glu Ile Thr Gln Asp Asn
35 40 45 50

GTG CCT ATC ATC ACG GCT ATC GCT ACG CAT TTA AAC GAT GTG GAT AGC 248
Val Pro Ile Ile Thr Ala Ile Ala Thr His Leu Asn Asp Val Asp Ser
55 60 65

GGC ACT TAC TAT GAC CAT GAG TAT TTT TTA GTG GAG ATT TTC ACG CAA 296
Gly Thr Tyr Tyr Asp His Glu Tyr Phe Leu Val Glu Ile Phe Thr Gln
70 75 80

AAT AAC GAC TGG ATA GAT GAT GGC TAT ATT TCT TAT GAA CTT TTT GGC 344
Asn Asn Asp Trp Ile Asp Asp Gly Tyr Ile Ser Tyr Glu Leu Phe Gly
85 90 95

ACA AAA CCT ATA GGC TCA GAG CCT TTA TGG GTG CGA GAA ATC ACA AAA 392
Thr Lys Pro Ile Gly Ser Glu Pro Leu Trp Val Arg Glu Ile Thr Lys
100 105 110

GAT GAA TTT GAT GGC ATT TTA GAA ACC ACG AAC AGG TGG AGC AGA GCT 440
Asp Glu Phe Asp Gly Ile Leu Glu Thr Thr Asn Arg Trp Ser Arg Ala
115 120 125 130

TTT TTG CTC GCT TTT AAC AAA TTG GAT TAT TTA GCG GTT CAA GAA GCC 488
Phe Leu Leu Ala Phe Asn Lys Leu Asp Tyr Leu Ala Val Gln Glu Ala
135 140 145

AAA CTA GAG CTT GAT GCC TAT AGT TTG GGC AAG ATT GTT TTT AAT TTC 536
Lys Leu Glu Leu Asp Ala Tyr Ser Leu Gly Lys Ile Val Phe Asn Phe
150 155 160

GCT TAT CAA GTC CCC CTA CCT CAA TTT TAATGCGCTT AGATTACGCC TTATTCA 590
Ala Tyr Gln Val Pro Leu Pro Gln Phe
Gln Asp Leu Leu Leu Phe His Pro Tyr Glu Ser Phe Glu Pro Val Ile
290 295 300

Asp Leu Ile Glu Gln Ala Ala Ser Asp Pro Ala Thr Leu Ser Ile Lys
305 310 315 320

Met Thr Leu Tyr Arg Val Gly Lys His Ser Pro Ile Val Lys Ala Leu
325 330 335

Ile Glu Ala Ala Ser Lys Ile Gln Val Ser Val Leu Val Glu Leu Lys
340 345 350

Ala Arg Phe Asp Glu Glu Ser Asn Leu His Trp Ala Lys Ala Leu Glu
355 360 365

Arg Ala Gly Ala Leu Val Val Tyr Gly Val Phe Lys Leu Lys Val His
370 375 380

Ala Lys Met Leu Leu Ile Thr Lys Lys Thr Asp Asn Gln Leu Arg His
385 390 395 400

Phe Thr His Leu Ser Thr Gly Asn Tyr Asn Pro Leu Ser Ala Lys Val
405 410 415

Tyr Thr Asp Val Ser Phe Phe Ser Ala Lys Asn Glu Ile Ala Asn Asp
420 425 430

Ile Ile Lys Leu Phe His Ser Leu Leu Thr Ser Ser Ala Thr Asn Ser
435 440 445

Ala Leu Glu Thr Leu Phe Met Ala Pro Lys Gln Ile Lys Pro Lys Ile
450 455 460

Ile Glu Leu Ile Gln Asn Glu Met Asn His Gln Gln Glu Gly Tyr Ile
465 470 475 480

Ile Leu Lys Ala Asn Ala Leu Val Asp Ser Glu Ile Ile Glu Trp Leu
485 490 495

Tyr Gln Ala Ser Gln Lys Gly Val Lys Ile Asp Leu Ile Ile Arg Gly
500 505 510

Ile Cys Cys Leu Lys Pro Gln Val Lys Gly Leu Ser Glu Asn Ile Arg
515 520 525

Val Tyr Ser Ile Val Gly Lys Tyr Leu Glu His Ala Arg Ile Tyr Tyr
530 535 540

Phe Lys His Glu Asn Ile Tyr Phe Ser Ser Ala Asp Leu Met Pro Arg
545 550 555 560

Asn Leu Glu Arg Arg Val Glu Leu Leu Ile Pro Ala Thr Asn Pro Lys
565 570 575

Ile Ala His Lys Leu Leu His Ile Leu Glu Ile Gln Leu Lys Asp Thr
580 585 590

Leu Lys Arg Tyr Glu Leu Asn Ser Lys Gly Arg Tyr Ile Lys Val Ser
595 600 605

Asn Pro Asn Asp Pro Leu Asn Ser Gln Asp Tyr Phe Glu Lys Gln Ala
610 615 620

Leu Lys Thr Phe
625



(2) INFORMATION FOR SEQ ID NO: 145:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:
ACC TTT TAAGGGTTAT CGTTCAAATC ATAAAAGATA AGGATTTAAA TGCTTTATTC AT 1986
Thr Phe 



T 1987 
(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 628 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

Met Ile Arg Val Ala Gly Leu Lys Gln Leu Tyr Glu His Lys Ile Ala
1 5 10 15

Ser Lys Gly Ile Asp Gly Ala Ser Pro Glu Glu Gln Leu Glu Lys Ile
20 25 30

Lys His Tyr Leu Ala His Glu Ile Glu Glu Arg Glu Leu Glu Phe Gln
35 40 45

Lys Ile Gln Ala Leu Leu Phe Lys Lys Gly Leu Cys Ile Thr Pro Tyr
50 55 60

Asn Glu Leu Asn Leu Glu Gln Lys Ala Lys Ala Lys Thr Tyr Phe Lys
65 70 75 80

Glu Gln Leu Tyr Ala Leu Val Leu Pro Phe Lys Leu Asp Ser Ser His
85 90 95

Thr Phe Pro Pro Leu Ala Asn Leu Thr Phe Ala Leu Phe Ala Arg Ile
100 105 110

Lys Asp Lys Glu Thr Gln Ile Ile Ser Tyr Ala Leu Ile Lys Leu Pro
115 120 125

Ser Phe Ile Phe Arg Phe Val Glu Leu Glu Lys Gly Leu Phe Val Leu
130 135 140

Ala Glu Glu Ile Val Glu Ala His Leu Glu Glu Leu Phe Leu Glu His
145 150 155 160

Glu Ile Leu Asp Cys Met Ala Phe Arg Val Thr Cys Asp Ala Asp Ile
165 170 175

Ala Ile Thr Glu Asp Glu Ala His Asp Tyr Ala Asp Leu Met Ser Lys
180 185 190

Ser Leu Arg Lys Arg Asn Gln Gly Glu Ile Val Arg Leu Gln Thr Gln
195 200 205

Lys Gly Ser Gln Glu Leu Leu Lys Thr Leu Leu Ala Ser Leu Arg Ser
210 215 220

Phe Gln Thr His Ser Tyr Lys Lys His Lys Leu Thr Gly Met His Ile
225 230 235 240

Tyr Lys Ser Ala Ile Met Leu Asn Leu Gly Asp Leu Trp Glu Leu Val
245 250 255

Asn His Ser Asp Phe Lys Ala Leu Lys Ser Pro Asn Phe Thr Pro Lys
260 265 270

Ile His Pro His Phe Asn Glu Asn Asp Leu Phe Lys Ser Ile Glu Lys
275 280 285 
CAT TTA AGC ACG GGC AAT TAC AAC CCT TTG AGC GCT AAA GTC TAT ACC 1304
His Leu Ser Thr Gly Asn Tyr Asn Pro Leu Ser Ala Lys Val Tyr Thr
405 410 415

GAT GTG AGT TTT TTT AGC GCT AAA AAT GAA ATC GCT AAC GAC ATT ATC 1352
Asp Val Ser Phe Phe Ser Ala Lys Asn Glu Ile Ala Asn Asp Ile Ile
420 425 430

AAG CTT TTC CAT TCC TTG CTC ACT AGC AGC GCG ACT AAT AGC GCA TTA 1400
Lys Leu Phe His Ser Leu Leu Thr Ser Ser Ala Thr Asn Ser Ala Leu
435 440 445 450

GAA ACG CTT TTT ATG GCA CCC AAA CAA ATC AAG CCT AAA ATC ATT GAA 1448
Glu Thr Leu Phe Met Ala Pro Lys Gln Ile Lys Pro Lys Ile Ile Glu
455 460 465

CTC ATT CAA AAT GAA ATG AAT CAC CAA CAA GAA GGC TAT ATC ATT TTA 1496
Leu Ile Gln Asn Glu Met Asn His Gln Gln Glu Gly Tyr Ile Ile Leu
470 475 480

AAA GCC AAC GCC CTA GTG GAT AGC GAA ATC ATT GAA TGG CTC TAT CAA 1544
Lys Ala Asn Ala Leu Val Asp Ser Glu Ile Ile Glu Trp Leu Tyr Gln
485 490 495

GCC TCT CAA AAA GGG GTT AAA ATT GAT CTC ATT ATT AGA GGG ATT TGC 1592
Ala Ser Gln Lys Gly Val Lys Ile Asp Leu Ile Ile Arg Gly Ile Cys
500 505 510

TGT TTA AAG CCC CAA GTC AAG GGC TTG AGC GAA AAT ATC AGG GTG TAT 1640
Cys Leu Lys Pro Gln Val Lys Gly Leu Ser Glu Asn Ile Arg Val Tyr
515 520 525 530

TCT ATC GTG GGG AAA TAT TTA GAA CAT GCA CGC ATT TAT TAT TTT AAA 1688
Ser Ile Val Gly Lys Tyr Leu Glu His Ala Arg Ile Tyr Tyr Phe Lys
535 540 545

CAT GAA AAT ATT TAT TTT TCT AGC GCG GAT TTA ATG CCC AGG AAT TTA 1736
His Glu Asn Ile Tyr Phe Ser Ser Ala Asp Leu Met Pro Arg Asn Leu
550 555 560

GAA AGG CGC GTG GAA TTG CTC ATT CCA GCC ACA AAC CCA AAG ATC GCT 1784
Glu Arg Arg Val Glu Leu Leu Ile Pro Ala Thr Asn Pro Lys Ile Ala
565 570 575

CAT AAA TTG TTG CAT ATT TTA GAA ATC CAA CTC AAA GAC ACC TTA AAA 1832
His Lys Leu Leu His Ile Leu Glu Ile Gln Leu Lys Asp Thr Leu Lys
580 585 590

CGC TAC GAG TTA AAT TCT AAA GGC CGT TAC ATT AAA GTT TCA AAC CCT 1880
Arg Tyr Glu Leu Asn Ser Lys Gly Arg Tyr Ile Lys Val Ser Asn Pro
595 600 605 610

AAC GAT CCT TTA AAT TCG CAG GAT TAT TTT GAA AAA CAA GCC CTT AAA 1928
Asn Asp Pro Leu Asn Ser Gln Asp Tyr Phe Glu Lys Gln Ala Leu Lys
615 620 625 
ACT GAA GAT GAA GCG CAT GAT TAT GCA GAT TTG ATG AGT AAG AGT TTG 632
Thr Glu Asp Glu Ala His Asp Tyr Ala Asp Leu Met Ser Lys Ser Leu 
180 185 190 

AGG AAA CGC AAT CAA GGC GAA ATC GTG CGC TTG CAA ACC CAA AAA GGG 680
Arg Lys Arg Asn Gln Gly Glu Ile Val Arg Leu Gln Thr Gln Lys Gly
195 200 205 210

AGT CAA GAG CTT TTA AAA ACC CTC TTA GCG TCT TTA AGG AGT TTT CAA 728
Ser Gln Glu Leu Leu Lys Thr Leu Leu Ala Ser Leu Arg Ser Phe Gln
215 220 225

ACC CAC TCT TAC AAA AAG CAC AAA CTC ACC GGC ATG CAT ATC TAT AAA 776
Thr His Ser Tyr Lys Lys His Lys Leu Thr Gly Met His Ile Tyr Lys
230 235 240

AGC GCG ATC ATG CTC AAT TTA GGG GAT TTG TGG GAA TTA GTC AAT CAT 824
Ser Ala Ile Met Leu Asn Leu Gly Asp Leu Trp Glu Leu Val Asn His
245 250 255

AGC GAT TTT AAA GCG CTC AAA TCG CCC AAT TTC ACA CCC AAA ATC CAC 872
Ser Asp Phe Lys Ala Leu Lys Ser Pro Asn Phe Thr Pro Lys Ile His
260 265 270

CCT CAT TTC AAT GAA AAC GAT CTT TTC AAA TCT ATA GAA AAA CAG GAT 920
Pro His Phe Asn Glu Asn Asp Leu Phe Lys Ser Ile Glu Lys Gln Asp
275 280 285 290

CTG TTG CTG TTT CAT CCT TAT GAA AGT TTT GAG CCT GTG ATT GAT TTA 968
Leu Leu Leu Phe His Pro Tyr Glu Ser Phe Glu Pro Val Ile Asp Leu
295 300 305

ATA GAG CAA GCC GCT AGC GAT CCA GCC ACC CTT TCT ATC AAA ATG ACG 1016
Ile Glu Gln Ala Ala Ser Asp Pro Ala Thr Leu Ser Ile Lys Met Thr
310 315 320

CTT TAT CGT GTG GGC AAG CAT TCC CCC ATT GTC AAA GCT TTG ATT GAA 1064
Leu Tyr Arg Val Gly Lys His Ser Pro Ile Val Lys Ala Leu Ile Glu
325 330 335

GCG GCG AGC AAG ATT CAA GTG AGC GTT TTA GTG GAA TTA AAA GCG CGC 1112
Ala Ala Ser Lys Ile Gln Val Ser Val Leu Val Glu Leu Lys Ala Arg
340 345 350

TTT GAT GAA GAG AGC AAT CTG CAC TGG GCA AAA GCT TTA GAA AGG GCG 1160
Phe Asp Glu Glu Ser Asn Leu His Trp Ala Lys Ala Leu Glu Arg Ala
355 360 365 370

GGC GCG TTA GTC GTT TAT GGC GTT TTC AAA CTC AAA GTG CAT GCT AAA 1208
Gly Ala Leu Val Val Tyr Gly Val Phe Lys Leu Lys Val His Ala Lys
375 380 385

ATG CTA TTG ATC ACT AAA AAA ACA GAC AAC CAA TTA CGC CAT TTC ACC 1256
Met Leu Leu Ile Thr Lys Lys Thr Asp Asn Gln Leu Arg His Phe Thr
390 395 400 
(B) LOCATION: 51...1934
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

AGCGCTTGAA ATTTTTAGCC ATTTATGACA CGAATTTAGA CGAATTTTAC ATG ATA 56
Met Ile
1

AGA GTG GCA GGG CTT AAA CAA CTC TAT GAG CAT AAA ATC GCC TCT AAA 104
Arg Val Ala Gly Leu Lys Gln Leu Tyr Glu His Lys Ile Ala Ser Lys
5 10 15

GGC ATT GAT GGC GCA AGC CCT GAA GAA CAA TTA GAA AAA ATC AAG CAT 152
Gly Ile Asp Gly Ala Ser Pro Glu Glu Gln Leu Glu Lys Ile Lys His
20 25 30

TAT TTA GCG CAT GAA ATT GAA GAA AGG GAG TTA GAA TTC CAA AAA ATC 200
Tyr Leu Ala His Glu Ile Glu Glu Arg Glu Leu Glu Phe Gln Lys Ile
35 40 45 50

CAA GCC CTA CTC TTT AAA AAA GGG CTT TGT ATC ACC CCC TAT AAT GAA 248
Gln Ala Leu Leu Phe Lys Lys Gly Leu Cys Ile Thr Pro Tyr Asn Glu
55 60 65

TTG AAT TTA GAG CAA AAA GCG AAG GCT AAA ACC TAT TTT AAA GAG CAG 296
Leu Asn Leu Glu Gln Lys Ala Lys Ala Lys Thr Tyr Phe Lys Glu Gln
70 75 80

CTT TAC GCG TTA GTT TTG CCT TTT AAA TTG GAT TCT TCA CAC ACT TTC 344
Leu Tyr Ala Leu Val Leu Pro Phe Lys Leu Asp Ser Ser His Thr Phe
85 90 95

CCG CCT TTA GCG AAT TTG ACT TTC GCG CTT TTT GCC CGC ATC AAA GAC 392
Pro Pro Leu Ala Asn Leu Thr Phe Ala Leu Phe Ala Arg Ile Lys Asp
100 105 110

AAA GAA ACC CAA ATT ATC TCC TAT GCG CTC ATC AAA CTC CCC TCT TTT 440
Lys Glu Thr Gln Ile Ile Ser Tyr Ala Leu Ile Lys Leu Pro Ser Phe
115 120 125 130

ATC TTC CGT TTT GTA GAG CTA GAA AAA GGC TTG TTT GTG TTA GCT GAA 488
Ile Phe Arg Phe Val Glu Leu Glu Lys Gly Leu Phe Val Leu Ala Glu
135 140 145

GAA ATC GTG GAA GCG CAT TTA GAA GAA TTG TTT TTA GAG CAT GAG ATT 536
Glu Ile Val Glu Ala His Leu Glu Glu Leu Phe Leu Glu His Glu Ile
150 155 160

TTA GAT TGC ATG GCG TTT AGG GTA ACT TGC GAT GCG GAT ATT GCT ATC 584
Leu Asp Cys Met Ala Phe Arg Val Thr Cys Asp Ala Asp Ile Ala Ile
165 170 175 
Glu Val Asn Asp Pro Leu Gly Met Ser Gly Thr Arg Leu Glu Val Phe
165 170 175

Ile His Ile Val Tyr Thr Glu Lys Asn Asn Ile Glu Asn Leu Glu Lys
180 185 190

Ile Met Ile Gln Ser Gly Val Glu Ile Glu Asn Ile Val Ile Asn Ser
195 200 205

Tyr Ala Ala Ser Ile Ala Thr Leu Ser Asn Asp Glu Arg Glu Leu Gly
210 215 220

Val Ala Cys Val Asp Met Gly Gly Glu Thr Cys Asn Leu Thr Ile Tyr
225 230 235 240

Ser Gly Asn Ser Ile Arg Tyr Asn Lys Tyr Leu Pro Val Gly Ser His
245 250 255

His Leu Thr Thr Asp Leu Ser His Met Leu Asn Thr Pro Phe Pro Tyr
260 265 270

Ala Glu Glu Val Lys Ile Lys Tyr Gly Asp Leu Ser Phe Glu Gly Gly
275 280 285

Glu Glu Thr Pro Ser Gln Asn Val Gln Ile Pro Thr Thr Gly Ser Asp
290 295 300

Gly His Glu Ser His Ile Val Pro Leu Ser Glu Ile Gln Thr Ile Met
305 310 315 320

Arg Glu Arg Ala Leu Glu Thr Phe Lys Ile Ile His Arg Ser Ile Gln
325 330 335

Asp Ser Gly Leu Glu Glu His Leu Gly Gly Gly Val Val Leu Thr Gly
340 345 350

Gly Met Ala Leu Met Lys Gly Ile Lys Glu Leu Ala Arg Thr His Phe
355 360 365

Thr Asn Tyr Pro Val Arg Leu Ala Ala Pro Val Glu Lys Tyr Asn Ile
370 375 380

Met Gly Met Phe Glu Asp Leu Lys Asp Pro Arg Phe Ser Val Val Val
385 390 395 400

Gly Leu Ile Leu Tyr Lys Ala Gly Gly His Thr Asn Tyr Glu Arg Asp
405 410 415

Ser Lys Gly Val Ile Arg Tyr His Glu Ser Asp Asp Tyr Thr Arg Thr
420 425 430

Ala His Gln Ser Ser Pro Thr Pro His Ile His Ser Ser Pro Thr Glu
435 440 445

Arg Asn Leu Ser Asp Leu Lys Ala Pro Ser Ala Pro Leu Asn Thr Ala
450 455 460

Lys Asn Asp Asp Phe Leu Pro Ile Lys Pro Thr Glu Gln Lys Gly Phe
465 470 475 480

Phe Lys Ser Phe Leu Asp Lys Ile Ser Lys Phe Phe
485 490



(2) INFORMATION FOR SEQ ID NO: 143: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 
GGG GTT ATC CGC TAC CAT GAA AGC GAT
Gly Val Ile Arg Tyr His Glu Ser Asp
420 425

CAA TCA AGC CCT ACC CCC CAT ATC CAT
Gln Ser Ser Pro Thr Pro His Ile His
435 440

TTG AGC GAT TTA AAA GCC CCT AGT GCT
Leu Ser Asp Leu Lys Ala Pro Ser Ala
455

GAT GAC TTT TTA CCT ATA AAA CCC ACC
Asp Asp Phe Leu Pro Ile Lys Pro Thr
470 475

AGT TTC CTT GAT AAG ATT TCT AAA TTC
Ser Phe Leu Asp Lys Ile Ser Lys Phe
485 490 

CGATAAAAAC GCCTTGATGG TTATCAAAAG 

(2) INFORMATION FOR SEQ ID 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



Met Glu His Lys Glu Ile Val Ile Gly Val Asp Leu Gly Ser Arg Lys
1 5 10 15

Ile Cys Ala Ile Val Ala Glu Phe Lys Glu Gly Ile Leu Arg Ile Ile
20 25 30

Gly Thr Ala His Gln Asp Ser Lys Glu Ile Asn Ser Lys Ala Ile Lys
35 40 45

Arg Gly Arg Ile Asn Ser Leu Ala His Ala Ser Asn Ala Ile Lys Glu
50 55 60

Val Ile Asn Ser Ala Lys Lys Met Ala Gly Leu Asn Ala Asp Glu Asp
65 70 75 80

Arg Asn Asn Pro Met Pro His Phe Gly Glu Tyr His Pro Lys Thr Lys
85 90 95

Ala Ile Val Ser Phe Ser Gly Ala Tyr Thr Glu Ser Ile Arg Asp Val
100 105 110

Thr Gly Val Ala Ser Thr Lys Asp Asn Val Val Thr Ile Asp Glu Ile
115 120 125

Asn Arg Ala Ile Asn Ser Ala Cys Ala Lys Ala Gly Leu Asp Asn Asp
130 135 140

Lys His Ile Leu His Ala Leu Pro Tyr Arg Phe Thr Leu Asp Lys Gln
145 150 155 160
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GAT TAC ACA AGA ACA GCC CAT 13 52 

Asp Tyr Thr Arg Thr Ala His 
430 

TCA TCG CCC ACA GAA AGG AAT 1400
Ser Ser Pro Thr Glu Arg Asn
445 450

CCT TTA AAC ACC GCT AAA AAC 1448
Pro Leu Asn Thr Ala Lys Asn
460 465

GAA CAA AAA GGT TTT TTT AAA 1496
Glu Gln Lys Gly Phe Phe Lys
480

TTT TAAGATACAG CCATTTCTTT ATG 1549
Phe



1579 

NO: 142 : 



WO 98/43478
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ATC CAA TCT GGG GTA GAG ATT GAA AAC ATC GTG ATC AAT TCT TAT GCA 680 
Ile Gln Ser Gly Val Glu Ile Glu Asn Ile Val Ile Asn Ser Tyr Ala
195 200 205 210

GCC TCG ATT GCC ACC TTA TCT AAT GAT GAA AGG GAA TTG GGC GTG GCT 728
Ala Ser Ile Ala Thr Leu Ser Asn Asp Glu Arg Glu Leu Gly Val Ala
215 220 225

TGC GTG GAT ATG GGC GGA GAG ACA TGC AAC CTT ACG ATT TAT AGC GGC 776
Cys Val Asp Met Gly Gly Glu Thr Cys Asn Leu Thr Ile Tyr Ser Gly
230 235 240

AAT TCC ATA CGC TAT AAC AAA TAT TTG CCC GTA GGC TCT CAC CAT TTA 824
Asn Ser Ile Arg Tyr Asn Lys Tyr Leu Pro Val Gly Ser His His Leu
245 250 255

ACC ACG GAT TTA TCG CAC ATG CTC AAC ACC CCA TTC CCT TAC GCT GAA 872
Thr Thr Asp Leu Ser His Met Leu Asn Thr Pro Phe Pro Tyr Ala Glu
260 265 270

GAA GTT AAG ATC AAA TAC GGG GAT CTT TCT TTT GAA GGC GGC GAA GAA 920
Glu Val Lys Ile Lys Tyr Gly Asp Leu Ser Phe Glu Gly Gly Glu Glu
275 280 285 290

ACG CCC TCT CAA AAT GTC CAA ATC CCT ACC ACC GGC TCG GAT GGC CAT 968
Thr Pro Ser Gln Asn Val Gln Ile Pro Thr Thr Gly Ser Asp Gly His
295 300 305

GAA AGC CAT ATT GTG CCG CTT AGT GAA ATC CAA ACT ATC ATG AGA GAA 1016
Glu Ser His Ile Val Pro Leu Ser Glu Ile Gln Thr Ile Met Arg Glu
310 315 320

AGG GCT TTA GAA ACT TTT AAA ATC ATC CAC AGG AGC ATT CAA GAT AGC 1064
Arg Ala Leu Glu Thr Phe Lys Ile Ile His Arg Ser Ile Gln Asp Ser
325 330 335

GGC TTA GAA GAG CAT TTG GGC GGA GGC GTT GTG TTA ACC GGT GGG ATG 1112
Gly Leu Glu Glu His Leu Gly Gly Gly Val Val Leu Thr Gly Gly Met
340 345 350

GCT TTA ATG AAA GGG ATC AAA GAA TTA GCC AGA ACC CAT TTC ACT AAT 1160
Ala Leu Met Lys Gly Ile Lys Glu Leu Ala Arg Thr His Phe Thr Asn
355 360 365 370

TAC CCG GTG CGT TTG GCA GCC CCT GTG GAA AAA TAC AAT ATC ATG GGC 1208
Tyr Pro Val Arg Leu Ala Ala Pro Val Glu Lys Tyr Asn Ile Met Gly
375 380 385

ATG TTT GAA GAT TTG AAA GAC CCT CGC TTT TCA GTC GTA GTT GGC TTG 1256
Met Phe Glu Asp Leu Lys Asp Pro Arg Phe Ser Val Val Val Gly Leu
390 395 400

ATT TTA TAC AAA GCA GGG GGG CAT ACC AAT TAT GAA AGA GAC TCT AAA 1304
Ile Leu Tyr Lys Ala Gly Gly His Thr Asn Tyr Glu Arg Asp Ser Lys
405 410 415
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

AAAAACGCTA TAAGATAGTC AAATACATTC AATAAATGCA AGGGGAAATC ATG GAA 56
Met Glu
1

CAT AAA GAA ATC GTT ATA GGG GTT GAT CTA GGC TCT AGA AAG ATT TGC 104
His Lys Glu Ile Val Ile Gly Val Asp Leu Gly Ser Arg Lys Ile Cys
5 10 15

GCG ATA GTG GCT GAA TTT AAA GAA GGG ATT TTG CGC ATC ATT GGC ACG 152
Ala Ile Val Ala Glu Phe Lys Glu Gly Ile Leu Arg Ile Ile Gly Thr
20 25 30

GCC CAT CAA GAC TCC AAA GAA ATC AAT TCA AAA GCC ATT AAA AGA GGG 200
Ala His Gln Asp Ser Lys Glu Ile Asn Ser Lys Ala Ile Lys Arg Gly
35 40 45 50

CGT ATC AAT AGC CTT GCT CAC GCT TCC AAC GCC ATT AAA GAA GTG ATT 248
Arg Ile Asn Ser Leu Ala His Ala Ser Asn Ala Ile Lys Glu Val Ile
55 60 65

AAT AGC GCT AAA AAA ATG GCA GGT TTG AAC GCT GAT GAA GAC AGA AAT 296
Asn Ser Ala Lys Lys Met Ala Gly Leu Asn Ala Asp Glu Asp Arg Asn
70 75 80

AAC CCC ATG CCC CAT TTT GGG GAA TAC CAC CCT AAA ACT AAG GCG ATT 344
Asn Pro Met Pro His Phe Gly Glu Tyr His Pro Lys Thr Lys Ala Ile
85 90 95

GTT TCT TTT TCT GGG GCT TAT ACT GAA AGC ATT AGA GAT GTT ACC GGT 392
Val Ser Phe Ser Gly Ala Tyr Thr Glu Ser Ile Arg Asp Val Thr Gly
100 105 110

GTA GCG AGC ACC AAA GAT AAT GTG GTA ACC ATT GAT GAA ATC AAT CGC 440
Val Ala Ser Thr Lys Asp Asn Val Val Thr Ile Asp Glu Ile Asn Arg
115 120 125 130

GCT ATC AAT AGT GCA TGC GCT AAA GCA GGC TTA GAT AAC GAC AAA CAT 488
Ala Ile Asn Ser Ala Cys Ala Lys Ala Gly Leu Asp Asn Asp Lys His
135 140 145

ATT TTG CAT GCT CTC CCC TAT CGC TTC ACT TTA GAC AAA CAA GAA GTG 536
Ile Leu His Ala Leu Pro Tyr Arg Phe Thr Leu Asp Lys Gln Glu Val
150 155 160

AAT GAC CCT TTA GGG ATG AGC GGG ACT CGC TTG GAA GTC TTT ATC CAC 584
Asn Asp Pro Leu Gly Met Ser Gly Thr Arg Leu Glu Val Phe Ile His
165 170 175

ATT GTC TAT ACA GAA AAA AAC AAC ATT GAA AAT TTA GAA AAA ATC ATG 632
Ile Val Tyr Thr Glu Lys Asn Asn Ile Glu Asn Leu Glu Lys Ile Met
180 185 190
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. 100 

105 110

Gly Trp Leu Ser Ile Asp Met Tyr His Gln Met Leu Lys Asp Gly Met
115 120 125

Asp Phe Trp Tyr Lys Glu Asp Gly Leu Leu Met Ile Tyr Thr Leu Glu
130 135 140

Glu Ser Phe Glu Lys Lys Leu Lys Thr Cys Asp Asn Ser Gly Ala Tyr
145 150 155 160

Lys Ile Leu Ser Ala Lys Glu Thr Lys Glu Tyr Met Pro Val Val Asn
165 170 175

Asp Asn Ile Cys Gly Ser Val Leu Leu Thr Glu Asn Ala His Val Asp
180 185 190

Pro Gly Glu Val Met His Ser Leu Gln Glu Tyr Leu Gln Asn Val Gly
195 200 205

Val Glu Phe Leu Tyr Asn Glu Glu Val Ile Asp Phe Glu Phe Lys Asn
210 215 220

Asn Leu Ile Glu Gly Val Ile Thr His Lys Glu Lys Ile Gln Ala Glu
225 230 235 240

Thr Ile Ile Leu Ala Thr Gly Ala Asn Pro Thr Leu Ile Lys Lys Thr
245 250 255

Lys Asn Asp Phe Leu Met Met Gly Ala Lys Gly Tyr Ser Ile Thr Phe
260 265 270

Lys Met Pro Glu Glu Leu Lys Pro Lys Thr Ser Ser Leu Phe Ala Asp
275 280 285

Ile Phe Met Ala Met Thr Pro Arg Arg Asp Thr Val Arg Ile Thr Ser
290 295 300

Lys Leu Glu Leu Asn Thr Asn Asn Ala Leu Ile Asp Lys Glu Gln Ile
305 310 315 320

Ala Asn Met Lys Lys Asn Leu Ala Ala Phe Thr Gln Pro Phe Glu Met
325 330 335

Lys Asp Ala Ile Glu Trp Cys Gly Phe Arg Pro Leu Thr Pro Asn Asp
340 345 350

Ile Pro Tyr Leu Gly Tyr Asp Lys Arg Tyr Lys Asn Leu Ile His Ala
355 360 365

Thr Gly Leu Gly Trp Leu Gly Ile Thr Phe Gly Pro Ala Ile Gly Lys
370 375 380

Ile Ile Ala Asn Leu Ser Gln Asp Gly Ala Asn Glu Lys Asn Ala Asp
385 390 395 400

Ile Met Leu Phe Ser Ala Phe Phe Arg Asp
405 410



(2) INFORMATION FOR SEQ ID NO: 141: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1579 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:



(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51... 1526 
(D) OTHER INFORMATION: 
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Leu Glu Leu Asn Thr Asn Asn Ala Leu lie Asp Lys Glu Gin lie Ala 
310 315 320 

AAC ATG AAA AAG AAT TTA GCC GCT TTC ACG CAG CCT TTT GAA ATG AAA 1065
Asn Met Lys Lys Asn Leu Ala Ala Phe Thr Gln Pro Phe Glu Met Lys
325 330 335

GAG GCC ATA GAG TGG TGC GGT TTC AGA CCC TTA ACC CCT AAT GAT ATT 1113
Asp Ala Ile Glu Trp Cys Gly Phe Arg Pro Leu Thr Pro Asn Asp Ile
340 345 350

CCT TAT TTG GGC TAT GAC AAA CGC TAT AAA AAC TTA ATC CAT GCG ACA 1161
Pro Tyr Leu Gly Tyr Asp Lys Arg Tyr Lys Asn Leu Ile His Ala Thr
355 360 365

GGG CTA GGG TGG CTT GGC ATC ACT TTT GGC CCA GCC ATT GGT AAA ATC 1209
Gly Leu Gly Trp Leu Gly Ile Thr Phe Gly Pro Ala Ile Gly Lys Ile
370 375 380 385

ATC GCC AAT TTG AGC CAA GAC GGA GCG AAT GAA AAA AAT GCC GAT ATT 1257
Ile Ala Asn Leu Ser Gln Asp Gly Ala Asn Glu Lys Asn Ala Asp Ile
390 395 400

ATG CTT TTT TCT GCA TTT TTT AGG GAT TAAGGAATTT CTTTTTTAAA CCCTAGT 1311
Met Leu Phe Ser Ala Phe Phe Arg Asp
405 410

TTATTAAGGA GTTTTTATGG AAAC 1335

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

Met Lys Lys Glu Val Val Val Ile Gly Gly Gly Ile Val Gly Leu Ser
1 5 10 15

Cys Ala Tyr Ser Met His Lys Leu Gly His Lys Val Cys Val Ile Glu
20 25 30

Lys Asn Asp Gly Ala Asn Gly Thr Ser Phe Gly Asn Ala Gly Leu Ile
35 40 45

Ser Ala Phe Lys Lys Ala Pro Leu Ser Cys Pro Gly Val Val Leu Asp
50 55 60

Thr Leu Lys Leu Met Leu Lys Asn Gln Ala Pro Leu Lys Phe His Phe
65 70 75 80

Gly Leu Asn Leu Lys Leu Tyr Gln Trp Ile Leu Lys Phe Val Lys Ser
85 90 95

Ala Asn Ala Lys Ser Thr His Arg Thr Met Ala Leu Phe Glu Arg Tyr
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85 90 95 

AAC GCC AAA TCC ACG CAC CGC ACC ATG GCG TTG TTT GAA CGC TAC GGG 393
Asn Ala Lys Ser Thr His Arg Thr Met Ala Leu Phe Glu Arg Tyr Gly
100 105 110

TGG CTG AGT ATT GAT ATG TAT CAT CAA ATG CTA AAA GAG GGC ATG GAG 441
Trp Leu Ser Ile Asp Met Tyr His Gln Met Leu Lys Asp Gly Met Asp
115 120 125

TTT TGG TAT AAA GAA GAT GGG CTT TTA ATG ATC TAC ACT CTA GAA GAA 489
Phe Trp Tyr Lys Glu Asp Gly Leu Leu Met Ile Tyr Thr Leu Glu Glu
130 135 140 145

AGT TTT GAA AAA AAG CTT AAA ACT TGC GAT AAC AGC GGC GCT TAT AAA 537
Ser Phe Glu Lys Lys Leu Lys Thr Cys Asp Asn Ser Gly Ala Tyr Lys
150 155 160

ATC CTT AGC GCT AAA GAG ACC AAA GAA TAC ATG CCC GTT GTT AAT GAC 585
Ile Leu Ser Ala Lys Glu Thr Lys Glu Tyr Met Pro Val Val Asn Asp
165 170 175

AAT ATC TGC GGG AGC GTG CTT TTA ACC GAA AAC GCG CAT GTG GAT CCG 633
Asn Ile Cys Gly Ser Val Leu Leu Thr Glu Asn Ala His Val Asp Pro
180 185 190

GGC GAA GTG ATG CAC TCT TTG CAA GAA TAT TTA CAA AAT GTT GGC GTG 681
Gly Glu Val Met His Ser Leu Gln Glu Tyr Leu Gln Asn Val Gly Val
195 200 205

GAG TTC CTT TAT AAT GAA GAA GTG ATC GAT TTT GAG TTT AAA AAT AAC 729
Glu Phe Leu Tyr Asn Glu Glu Val Ile Asp Phe Glu Phe Lys Asn Asn
210 215 220 225

CTC ATT GAG GGC GTT ATC ACG CAC AAG GAA AAA ATC CAA GCA GAA ACA 777
Leu Ile Glu Gly Val Ile Thr His Lys Glu Lys Ile Gln Ala Glu Thr
230 235 240

ATC ATT CTA GCC ACT GGG GCT AAC CCC ACT CTC ATT AAA AAA ACC AAG 825
Ile Ile Leu Ala Thr Gly Ala Asn Pro Thr Leu Ile Lys Lys Thr Lys
245 250 255

AAC GAT TTT TTA ATG ATG GGG GCT AAA GGA TAT AGC ATC ACC TTT AAA 873
Asn Asp Phe Leu Met Met Gly Ala Lys Gly Tyr Ser Ile Thr Phe Lys
260 265 270

ATG CCT GAA GAA TTA AAA CCC AAA ACC TCT TCT TTA TTT GCG GAT ATT 921
Met Pro Glu Glu Leu Lys Pro Lys Thr Ser Ser Leu Phe Ala Asp Ile
275 280 285

TTC ATG GCG ATG ACC CCA CGA AGA GAC ACT GTA AGG ATC ACT TCT AAA 969
Phe Met Ala Met Thr Pro Arg Arg Asp Thr Val Arg Ile Thr Ser Lys
290 295 300 305

TTA GAA TTA AAC ACC AAC AAC GCT CTC ATT GAT AAA GAG CAA ATC GCT 1017
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Thr Phe Lys Val Asn Ala Ala His Asn Pro Leu Ser Thr His Ala Arg 

1235 1240 1245 

Val Met Met Gly Gly Glu Leu Lys Leu Ala Lys Glu Val Phe Leu Asn 

1250 1255 1260 

Leu Gly Phe Val Tyr Leu His Asn Leu Ile Ser Asn Ile Gly His Phe
1265 1270 1275 1280

Ala Ser Asn Leu Gly Met Arg Tyr Ser Phe 
1285 1290 

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1335 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 55... 1284 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

TTAGTAGAAA TTGAAGCGAT AGCCATTAAG TAATTTATTA AAGGGACTAT CAGC ATG 57
Met
1

AAA AAA GAG GTC GTG GTC ATA GGC GGT GGG ATT GTA GGG CTT TCT TGT 105
Lys Lys Glu Val Val Val Ile Gly Gly Gly Ile Val Gly Leu Ser Cys
5 10 15

GCG TAT TCT ATG CAC AAG TTA GGG CAT AAG GTC TGC GTG ATA GAA AAA 153
Ala Tyr Ser Met His Lys Leu Gly His Lys Val Cys Val Ile Glu Lys
20 25 30

AAC GAT GGC GCA AAC GGC ACT TCT TTT GGG AAT GCT GGG CTT ATT TCT 201
Asn Asp Gly Ala Asn Gly Thr Ser Phe Gly Asn Ala Gly Leu Ile Ser
35 40 45

GCG TTT AAA AAA GCC CCA CTC TCA TGC CCT GGT GTG GTG TTA GAC ACC 249
Ala Phe Lys Lys Ala Pro Leu Ser Cys Pro Gly Val Val Leu Asp Thr
50 55 60 65

CTG AAG CTC ATG CTC AAA AAC CAA GCC CCT TTA AAA TTC CAT TTC GGG 297
Leu Lys Leu Met Leu Lys Asn Gln Ala Pro Leu Lys Phe His Phe Gly
70 75 80

CTT AAT TTA AAG CTC TAT CAA TGG ATT TTA AAA TTT GTA AAA AGC GCG 345
Leu Asn Leu Lys Leu Tyr Gln Trp Ile Leu Lys Phe Val Lys Ser Ala
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785 790 795 800 

Tyr Lys Tyr Leu Ile Gly Lys Ala Trp Lys Asn Ile Gly Ile Ser Lys
805 810 815

Thr Ala Asn Gly Ser Lys Ile Ser Val Tyr Tyr Leu Gly Asn Ser Thr
820 825 830

Pro Thr Glu Lys Gly Gly Asn Thr Thr Asn Leu Pro Thr Asn Thr Thr
835 840 845

Ser Asn Val Arg Ser Ala Asn Asn Ala Leu Ala Gln Asn Ala Pro Phe
850 855 860

Ala Gln Pro Ser Ala Thr Pro Asn Leu Val Ala Ile Asn Gln His Asp
865 870 875 880

Phe Gly Thr Ile Glu Ser Val Phe Glu Leu Ala Asn Arg Ser Lys Asp
885 890 895

Ile Asp Thr Leu Tyr Ala Asn Ser Gly Ala Gln Gly Arg Asp Leu Leu
900 905 910

Gln Thr Leu Leu Ile Asp Ser His Asp Ala Gly Tyr Ala Arg Gln Met
915 920 925

Ile Asp Asn Thr Ser Thr Gly Glu Ile Thr Lys Gln Leu Asn Ala Ala
930 935 940

Thr Thr Thr Leu Asn Asn Ile Ala Ser Leu Glu His Lys Thr Ser Ser
945 950 955 960

Leu Gln Thr Leu Ser Leu Ser Asn Ala Met Ile Leu Asn Ser Arg Leu
965 970 975

Val Asn Leu Ser Arg Arg His Thr Asn Asn Ile Asp Ser Phe Ala Gln
980 985 990

Arg Leu Gln Ala Leu Lys Asp Gln Lys Phe Ala Ser Leu Glu Ser Ala
995 1000 1005

Ala Glu Val Leu Tyr Gln Phe Ala Pro Lys Tyr Glu Lys Pro Thr Asn
1010 1015 1020

Val Trp Ala Asn Ala Ile Gly Gly Thr Ser Leu Asn Asn Gly Gly Asn
1025 1030 1035 1040

Ala Ser Leu Tyr Gly Thr Ser Ala Gly Val Asp Ala Tyr Leu Asn Gly
1045 1050 1055

Glu Val Glu Ala Ile Val Gly Gly Phe Gly Ser Tyr Gly Tyr Ser Ser
1060 1065 1070

Phe Asn Asn Gln Ala Asn Ser Leu Asn Ser Gly Ala Asn Asn Thr Asn
1075 1080 1085

Phe Gly Val Tyr Ser Arg Ile Phe Ala Asn Gln His Glu Phe Asp Phe
1090 1095 1100

Glu Ala Gln Gly Ala Leu Gly Ser Asp Gln Ser Ser Leu Asn Phe Lys
1105 1110 1115 1120

Ser Ala Leu Leu Arg Asp Leu Asn Gln Ser Tyr Asn Tyr Leu Ala Tyr
1125 1130 1135

Ser Ala Ala Thr Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg
1140 1145 1150

Asn Ala Leu Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu
1155 1160 1165

Gly Ser Thr Asn Phe Lys Ser Asn Ser Asn Gln Val Ala Leu Lys Asn
1170 1175 1180

Gly Ser Ser Ser Gln His Leu Phe Asn Ala Ser Ala Asn Val Glu Ala
1185 1190 1195 1200

Arg Tyr Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr Met Asn Ala Gly Val
1205 1210 1215

Leu Gln Glu Phe Ala Asn Phe Gly Ser Ser Asn Ala Val Ser Leu Asn
1220 1225 1230
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Asn Ser Ala Gin 
355 

Ser Ala Gln Lys
370 

Phe Ala Gly Gly 
385 

Asn Ala Asp Gly 

Thr Asn Ala Ala 
420 

Asn Gln Ala Ser
435 

Ile Thr Val Asp
450 

Ala Leu Ala Gly 
465 

Thr Lys Asn Gly 

Phe Val Asn Leu 
500 

Asp Thr Gly Asn 
515 

Gly Lys Val Asn 
530 

Val Lys Asn Phe 
545 

Ser Val Gly Glu 

Arg Ile Asn Thr
580 

Gly Gly Val Lys 
595 

Tyr Tyr Ser Pro 
610 

Glu Ile Thr Arg
625 

Thr Ser Lys Leu 

Met Asp Tyr Ser 
660 

Asn Asn Gln Gly
675 

Thr Leu Asn Val 
690 

Asp Ser Ala Thr 
705 

Gln Asp Leu Ile

Ile Gly Tyr Gly
740

Asn Leu Ile Glu
755 

Asn Arg Met Asp 
770 

Cys Gly Thr Ala 
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Asn Asn Ser Asn 
360 

Thr Glu Ile Gln
375 

Lys Asp Thr Val 
390 

Thr Ile Lys Val
405

His Leu His Ile

Gly Arg Thr Leu 
440 

Gly Pro Leu Arg 
455 

Ser Ser Ala Asn 
470 

Thr Ala Thr Phe 
485 

Lys Val Asp Ala 

Gly Gly Phe Asn 
520 

Ile Asn Lys Leu
535

Asn Ile Asn Glu
550 

Tyr Thr His Phe 
565 

Val Arg Leu Glu 

Phe Lys Ser Gly 
600 

Trp Asn Tyr Phe 
615 

Lys Phe Ala Ser 
630 

Met Phe Asn Asn 
645 

Gln Phe Ser Asn

Thr Ile Asn Tyr
680 

Gly Asn Ala Ala 
695 

Gly Phe Tyr Gln
710 

Lys Asn Lys Glu 
725 

Asn Val Ser Leu 

Gln Phe Lys Glu
760

Ile Cys Val Val
775

Ile Gly Asn Gln



Thr Gln Val Ile

Pro Thr Gln Val
380

Val Asn Ile Asp
395 

Gly Gly Tyr Lys 
410 

Gly Lys Gly Gly 
425 

Leu Val Glu Asn 

Val Asn Asn Gln
460 

Phe Glu Phe Lys 
475 

Asn Asn Asp Ile
490 

His Thr Ala Asn 
505 

Thr Leu Asp Phe 

Ile Thr Ala Ser
540 

Leu Val Val Lys 
555 

Ser Glu Asp Ile
570 

Thr Gly Thr Arg 
585 

Glu Lys Leu Val 

Asp Ala Arg Asn 
620 

Ser Thr Pro Glu 
635 

Leu Thr Leu Gly 
650 

Leu Thr Ile Gln
665 

Leu Val Arg Gly 

Ala Met Phe Phe 
700 

Pro Leu Met Lys 
715 

His Val Leu Leu 
730 

Gly Thr Asn Ser 
745 

Arg Leu Ala Leu 

Arg Asn Thr Asp 
780 

Ser Met Val Asn 
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Asn Pro Pro Asn 
365 

Ile Asp Gly Pro

Arg Ile Asn Thr
400 

Ala Ser Leu Thr 
415 

Ile Asn Leu Ser
430 

Leu Thr Gly Asn 
445 

Val Gly Gly Tyr 

Ala Gly Thr Asp 
480 

Ser Leu Gly Arg 
495 

Phe Lys Gly Ile
510 

Ser Gly Val Thr 
525 

Thr Asn Val Ala 

Thr Asn Gly Val 
560 

Gly Ser Gln Ser
575

Ser Ile Phe Ser
590

Ile Asp Glu Phe
605

Ile Lys Asn Val

Asn Pro Trp Gly 
640 

Gln Asn Ala Val
655

Gly Asp Phe Ile
670

Gly Gln Val Ala
685 

Ser Asn Asn Val 

Ile Asn Ser Ala
720

Lys Ala Lys Ile
735

Ile Ser Asn Val
750 

Tyr Asn Asn Asn 
765 

Asp Ile Lys Ala
Asn Pro Asp Asn 



WO 98/43478 PCT/US98/06371

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1290 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

Met Glu Ile Gln Gln Thr His Arg Lys Ile Asn Arg Pro Leu Val Ser
1 5 10 15

Leu Ala Leu Val Gly Ala Leu Val Ser Ile Thr Pro Gln Gln Ser His
20 25 30

Ala Ala Phe Phe Thr Thr Val Ile Ile Pro Ala Ile Val Gly Gly Ile
35 40 45

Ala Thr Gly Ala Ala Val Gly Thr Val Ser Gly Leu Leu Gly Trp Gly
50 55 60

Leu Lys Gln Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys Pro Asp Lys
65 70 75 80

Val Trp Arg Ile Gln Ala Gly Lys Gly Phe Asn Glu Phe Pro Asn Lys
85 90 95

Glu Tyr Asp Leu Tyr Arg Ser Leu Leu Ser Ser Lys Ile Asp Gly Gly
100 105 110

Trp Asp Trp Gly Asn Ala Ala Thr His Tyr Trp Val Lys Gly Gly Gln
115 120 125

Trp Asn Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly Thr Tyr Asn
130 135 140

Leu Ser Gly Leu Arg Asn Phe Thr Gly Gly Asp Leu Asp Val Asn Met
145 150 155 160

Gln Lys Ala Thr Leu Arg Leu Gly Gln Phe Asn Gly Asn Ser Phe Thr
165 170 175

Ser Tyr Lys Asp Ser Ala Asp Arg Thr Thr Arg Val Asp Phe Asn Ala
180 185 190

Lys Asn Ile Leu Ile Asp Asn Phe Leu Glu Ile Asn Asn Arg Val Gly
195 200 205

Ser Gly Ala Gly Arg Lys Ala Ser Ser Thr Val Leu Thr Leu Gln Ala
210 215 220

Ser Glu Gly Ile Thr Ser Ser Lys Asn Ala Glu Ile Ser Leu Tyr Asp
225 230 235 240

Gly Ala Thr Leu Asn Leu Ala Ser Asn Ser Val Lys Leu Met Gly Asn
245 250 255

Val Trp Met Gly Arg Leu Gln Tyr Val Gly Ala Tyr Leu Ala Pro Ser
260 265 270

Tyr Ser Thr Ile Asn Thr Ser Lys Val Thr Gly Glu Val Asn Phe Asn
275 280 285

His Leu Thr Val Gly Asp His Asn Ala Ala Gln Ala Gly Ile Ile Ala
290 295 300

Ser Asn Lys Thr His Ile Gly Thr Leu Asp Leu Trp Gln Ser Ala Gly
305 310 315 320

Leu Asn Ile Ile Ala Pro Pro Glu Gly Gly Tyr Lys Asp Lys Pro Lys
325 330 335

Asp Lys Pro Ser Asn Thr Thr Gln Asn Asn Ala Asn Asn Asn Gln Gln
340 345 350
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Val Tyr Ser Arg lie Phe Ala Asn Gin His Glu Phe Asp Phe Glu Ala 
1095 1100 1105 

CAA GGG GCG CTA GGG AGT GAT CAA TCA AGC TTG AAT TTC AAA AGC GCT 3416
Gln Gly Ala Leu Gly Ser Asp Gln Ser Ser Leu Asn Phe Lys Ser Ala
1110 1115 1120

CTA TTG CGA GAT TTG AAT CAA AGC TAT AAT TAC TTA GCC TAT AGC GCT 3464
Leu Leu Arg Asp Leu Asn Gln Ser Tyr Asn Tyr Leu Ala Tyr Ser Ala
1125 1130 1135

GCA ACA AGA GCG AGC TAT GGT TAT GAC TTT GCA TTT TTT AGG AAC GCT 3512
Ala Thr Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg Asn Ala
1140 1145 1150

TTG GTG TTA AAA CCA AGC GTG GGC GTG AGC TAT AAC CAT TTA GGT TCA 3560
Leu Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu Gly Ser
1155 1160 1165 1170

ACC AAC TTT AAA AGC AAT AGC AAT CAA GTG GCT TTG AAA AAT GGC TCT 3608
Thr Asn Phe Lys Ser Asn Ser Asn Gln Val Ala Leu Lys Asn Gly Ser
1175 1180 1185

AGC AGT CAG CAT TTA TTC AAC GCT AGC GCT AAT GTG GAA GCG CGC TAT 3656
Ser Ser Gln His Leu Phe Asn Ala Ser Ala Asn Val Glu Ala Arg Tyr
1190 1195 1200

TAT TAT GGG GAC ACT TCA TAC TTC TAT ATG AAC GCT GGA GTT TTA CAA 3704
Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr Met Asn Ala Gly Val Leu Gln
1205 1210 1215

GAA TTT GCT AAC TTT GGT TCT AGC AAT GCG GTA TCT TTA AAC ACC TTT 3752
Glu Phe Ala Asn Phe Gly Ser Ser Asn Ala Val Ser Leu Asn Thr Phe
1220 1225 1230

AAA GTG AAT GCC GCA CAC AAT CCT TTA AGT ACC CAT GCC AGA GTG ATG 3800
Lys Val Asn Ala Ala His Asn Pro Leu Ser Thr His Ala Arg Val Met
1235 1240 1245 1250

ATG GGT GGG GAA TTA AAA TTA GCT AAA GAA GTG TTT TTG AAT TTG GGC 3848
Met Gly Gly Glu Leu Lys Leu Ala Lys Glu Val Phe Leu Asn Leu Gly
1255 1260 1265

TTT GTT TAT TTG CAC AAT TTG ATT TCC AAT ATA GGC CAT TTC GCT TCC 3896
Phe Val Tyr Leu His Asn Leu Ile Ser Asn Ile Gly His Phe Ala Ser
1270 1275 1280

AAT TTA GGA ATG AGG TAT AGT TTC TAAATACCGC TCTTAAACCC ATGCTCAAAG 3950
Asn Leu Gly Met Arg Tyr Ser Phe
1285 1290

CATGGGTTTG AAATCTTACA AAA 3973

(2) INFORMATION FOR SEQ ID NO: 138: 
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870 875 880 

ACC ATT GAA AGC GTG TTT GAA TTG GCT AAC CGC TCT AAA GAT ATT GAC 2744
Thr Ile Glu Ser Val Phe Glu Leu Ala Asn Arg Ser Lys Asp Ile Asp
885 890 895

ACG CTT TAT GCT AAC TCA GGC GCG CAA GGC AGG GAT CTC TTA CAA ACC 2792
Thr Leu Tyr Ala Asn Ser Gly Ala Gln Gly Arg Asp Leu Leu Gln Thr
900 905 910

TTA TTG ATT GAT AGC CAT GAT GCG GGT TAT GCC AGA CAA ATG ATT GAT 2840
Leu Leu Ile Asp Ser His Asp Ala Gly Tyr Ala Arg Gln Met Ile Asp
915 920 925 930

AAC ACA AGC ACC GGT GAA ATC ACC AAG CAA TTG AAT GCG GCC ACT ACC 2888
Asn Thr Ser Thr Gly Glu Ile Thr Lys Gln Leu Asn Ala Ala Thr Thr
935 940 945

ACT TTA AAC AAC ATA GCC AGT TTA GAG CAT AAG ACA AGC AGC TTA CAA 2936
Thr Leu Asn Asn Ile Ala Ser Leu Glu His Lys Thr Ser Ser Leu Gln
950 955 960

ACT TTG AGC TTG AGT AAT GCG ATG ATC TTA AAT TCT CGT TTA GTC AAT 2984
Thr Leu Ser Leu Ser Asn Ala Met Ile Leu Asn Ser Arg Leu Val Asn
965 970 975

CTC TCC AGA AGG CAC ACC AAT AAT ATT GAC TCA TTC GCC CAA CGC TTA 3032
Leu Ser Arg Arg His Thr Asn Asn Ile Asp Ser Phe Ala Gln Arg Leu
980 985 990

CAA GCT TTA AAA GAC CAA AAA TTC GCT TCT TTA GAA AGC GCG GCG GAA 3080
Gln Ala Leu Lys Asp Gln Lys Phe Ala Ser Leu Glu Ser Ala Ala Glu
995 1000 1005 1010

GTG TTG TAT CAA TTT GCC CCT AAA TAT GAA AAA CCT ACC AAT GTT TGG 3128
Val Leu Tyr Gln Phe Ala Pro Lys Tyr Glu Lys Pro Thr Asn Val Trp
1015 1020 1025

GCT AAC GCT ATT GGG GGA ACG AGC TTG AAT AAT GGC GGC AAC GCT TCA 3176
Ala Asn Ala Ile Gly Gly Thr Ser Leu Asn Asn Gly Gly Asn Ala Ser
1030 1035 1040

TTG TAT GGC ACA AGT GCG GGC GTA GAT GCC TAC CTT AAT GGG GAA GTG 3224
Leu Tyr Gly Thr Ser Ala Gly Val Asp Ala Tyr Leu Asn Gly Glu Val
1045 1050 1055

GAA GCC ATT GTG GGC GGT TTT GGA AGC TAT GGT TAT AGC TCT TTT AAT 3272
Glu Ala Ile Val Gly Gly Phe Gly Ser Tyr Gly Tyr Ser Ser Phe Asn
1060 1065 1070

AAT CAA GCG AAC TCT CTT AAC TCT GGA GCC AAT AAC ACT AAT TTT GGC 3320
Asn Gln Ala Asn Ser Leu Asn Ser Gly Ala Asn Asn Thr Asn Phe Gly
1075 1080 1085 1090

GTG TAT AGC CGT ATC TTT GCT AAC CAG CAT GAA TTT GAT TTT GAA GCT 3368
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TAT AGT CAA TTT TCA AAT TTA ACC ATT CAG GGG GAT TTC ATC AAC AAT 2 072 
Tyr Ser Gln Phe Ser Asn Leu Thr Ile Gln Gly Asp Phe Ile Asn Asn
660 665 670

CAA GGC ACT ATC AAT TAT TTG GTC CGA GGC GGG CAA GTA GCC ACC TTG 2120
Gln Gly Thr Ile Asn Tyr Leu Val Arg Gly Gly Gln Val Ala Thr Leu
675 680 685 690

AAT GTA GGC AAT GCG GCA GCT ATG TTC TTT AGT AAT AAT GTG GAT AGC 2168
Asn Val Gly Asn Ala Ala Ala Met Phe Phe Ser Asn Asn Val Asp Ser
695 700 705

GCG ACT GGG TTT TAC CAA CCG CTC ATG AAG ATT AAC AGC GCT CAA GAT 2216
Ala Thr Gly Phe Tyr Gln Pro Leu Met Lys Ile Asn Ser Ala Gln Asp
710 715 720

CTC ATT AAA AAT AAA GAA CAT GTC TTA TTG AAA GCG AAA ATC ATC GGT 2264
Leu Ile Lys Asn Lys Glu His Val Leu Leu Lys Ala Lys Ile Ile Gly
725 730 735

TAT GGC AAT GTT TCT TTA GGC ACT AAC AGC ATT AGT AAT GTT AAT CTA 2312
Tyr Gly Asn Val Ser Leu Gly Thr Asn Ser Ile Ser Asn Val Asn Leu
740 745 750

ATA GAG CAA TTC AAA GAG CGC CTA GCC CTT TAC AAC AAC AAT AAC CGC 2360
Ile Glu Gln Phe Lys Glu Arg Leu Ala Leu Tyr Asn Asn Asn Asn Arg
755 760 765 770

ATG GAT ATT TGT GTG GTG CGA AAT ACT GAT GAC ATT AAA GCA TGC GGG 2408
Met Asp Ile Cys Val Val Arg Asn Thr Asp Asp Ile Lys Ala Cys Gly
775 780 785

ACG GCT ATC GGC AAT CAA AGC ATG GTG AAT AAC CCC GAC AAT TAC AAG 2456
Thr Ala Ile Gly Asn Gln Ser Met Val Asn Asn Pro Asp Asn Tyr Lys
790 795 800

TAT CTT ATC GGT AAA GCA TGG AAG AAC ATA GGG ATC AGC AAA ACA GCT 2504
Tyr Leu Ile Gly Lys Ala Trp Lys Asn Ile Gly Ile Ser Lys Thr Ala
805 810 815

AAT GGC TCT AAA ATT TCG GTG TAT TAT TTA GGC AAT TCT ACG CCT ACT 2552
Asn Gly Ser Lys Ile Ser Val Tyr Tyr Leu Gly Asn Ser Thr Pro Thr
820 825 830

GAG AAA GGT GGC AAT ACC ACA AAT TTA CCT ACA AAC ACC ACT AGC AAT 2600
Glu Lys Gly Gly Asn Thr Thr Asn Leu Pro Thr Asn Thr Thr Ser Asn
835 840 845 850

GTG CGT TCT GCC AAC AAC GCC CTT GCG CAA AAC GCT CCT TTC GCT CAA 2648
Val Arg Ser Ala Asn Asn Ala Leu Ala Gln Asn Ala Pro Phe Ala Gln
855 860 865

CCT AGC GCC ACT CCT AAT TTA GTC GCT ATC AAT CAG CAT GAT TTT GGC 2696
Pro Ser Ala Thr Pro Asn Leu Val Ala Ile Asn Gln His Asp Phe Gly
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GCG AGC GGG CGC ACC CTT TTA GTG GAA AAT CTA ACC GGG AAT ATC ACC 14 00 

Ala Ser Gly Arg Thr Leu Leu Val Glu Asn Leu Thr Gly Asn Ile Thr
435 440 445 450

GTT GAT GGG CCT TTA AGA GTG AAT AAT CAA GTG GGT GGT TAT GCT TTG 1448
Val Asp Gly Pro Leu Arg Val Asn Asn Gln Val Gly Gly Tyr Ala Leu
455 460 465

GCA GGA TCA AGC GCG AAT TTT GAG TTT AAG GCT GGT ACG GAT ACC AAA 1496
Ala Gly Ser Ser Ala Asn Phe Glu Phe Lys Ala Gly Thr Asp Thr Lys
470 475 480

AAC GGC ACA GCC ACT TTT AAT AAC GAT ATT AGT TTG GGA AGA TTT GTG 1544
Asn Gly Thr Ala Thr Phe Asn Asn Asp Ile Ser Leu Gly Arg Phe Val
485 490 495

AAT TTA AAA GTG GAT GCT CAT ACA GCT AAT TTT AAA GGT ATT GAT ACT 1592
Asn Leu Lys Val Asp Ala His Thr Ala Asn Phe Lys Gly Ile Asp Thr
500 505 510

GGT AAT GGT GGT TTC AAC ACC TTA GAT TTT AGT GGC GTT ACA GGT AAG 1640
Gly Asn Gly Gly Phe Asn Thr Leu Asp Phe Ser Gly Val Thr Gly Lys
515 520 525 530

GTC AAT ATC AAC AAG CTC ATT ACG GCT TCC ACT AAT GTG GCC GTT AAA 1688
Val Asn Ile Asn Lys Leu Ile Thr Ala Ser Thr Asn Val Ala Val Lys
535 540 545

AAC TTC AAC ATT AAT GAA TTG GTT GTT AAG ACC AAT GGG GTG AGT GTG 1736
Asn Phe Asn Ile Asn Glu Leu Val Val Lys Thr Asn Gly Val Ser Val
550 555 560

GGG GAA TAC ACT CAT TTT AGC GAA GAT ATA GGC AGT CAA TCG CGC ATC 1784
Gly Glu Tyr Thr His Phe Ser Glu Asp Ile Gly Ser Gln Ser Arg Ile
565 570 575

AAT ACC GTG CGT TTG GAA ACT GGC ACT AGG TCA ATC TTT TCT GGG GGT 1832
Asn Thr Val Arg Leu Glu Thr Gly Thr Arg Ser Ile Phe Ser Gly Gly
580 585 590

GTC AAA TTT AAA AGC GGT GAA AAA CTG GTT ATA GAT GAG TTT TAC TAT 1880
Val Lys Phe Lys Ser Gly Glu Lys Leu Val Ile Asp Glu Phe Tyr Tyr
595 600 605 610

AGC CCT TGG AAT TAT TTT GAC GCT AGG AAT ATT AAA AAT GTT GAA ATC 1928
Ser Pro Trp Asn Tyr Phe Asp Ala Arg Asn Ile Lys Asn Val Glu Ile
615 620 625

ACC AGA AAA TTC GCT TCT TCA ACC CCA GAA AAC CCT TGG GGC ACA TCA 1976
Thr Arg Lys Phe Ala Ser Ser Thr Pro Glu Asn Pro Trp Gly Thr Ser
630 635 640

AAG CTT ATG TTT AAT AAT CTA ACC CTG GGT CAA AAT GCG GTC ATG GAC 2024
Lys Leu Met Phe Asn Asn Leu Thr Leu Gly Gln Asn Ala Val Met Asp
645 650 655
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GCC GGG AGG AAA GCC AGC TCT ACG GTT TTA ACT TTG CAA OCT TCA GAA 72 8 

Ala Gly Arg Lys Ala Ser Ser Thr Val Leu Thr Leu Gln Ala Ser Glu
215 220 225

GGG ATT ACT AGC AGT AAA AAT GCG GAA ATT TCT CTT TAT GAT GGC GCC 776
Gly Ile Thr Ser Ser Lys Asn Ala Glu Ile Ser Leu Tyr Asp Gly Ala
230 235 240

ACG CTC AAT TTG GCT TCA AAC AGC GTT AAA TTA ATG GGT AAT GTG TGG 824
Thr Leu Asn Leu Ala Ser Asn Ser Val Lys Leu Met Gly Asn Val Trp
245 250 255

ATG GGC CGT TTG CAA TAT GTG GGA GCG TAT TTG GCC CCT TCA TAC AGC 872
Met Gly Arg Leu Gln Tyr Val Gly Ala Tyr Leu Ala Pro Ser Tyr Ser
260 265 270

ACG ATA AAC ACT TCA AAA GTG ACA GGG GAA GTG AAT TTT AAC CAT CTC 920
Thr Ile Asn Thr Ser Lys Val Thr Gly Glu Val Asn Phe Asn His Leu
275 280 285 290

ACT GTG GGC GAT CAC AAC GCC GCT CAA GCA GGC ATT ATC GCT AGT AAC 968
Thr Val Gly Asp His Asn Ala Ala Gln Ala Gly Ile Ile Ala Ser Asn
295 300 305

AAG ACT CAT ATT GGC ACA CTG GAT TTG TGG CAA AGC GCG GGA CTA AAC 1016
Lys Thr His Ile Gly Thr Leu Asp Leu Trp Gln Ser Ala Gly Leu Asn
310 315 320

ATT ATC GCC CCT CCA GAA GGC GGT TAT AAG GAT AAA CCT AAG GAT AAA 1064
Ile Ile Ala Pro Pro Glu Gly Gly Tyr Lys Asp Lys Pro Lys Asp Lys
325 330 335

CCT AGT AAC ACC ACG CAA AAT AAT GCT AAC AAC AAC CAA CAA AAC AGC 1112
Pro Ser Asn Thr Thr Gln Asn Asn Ala Asn Asn Asn Gln Gln Asn Ser
340 345 350

GCT CAA AAC AAT AGT AAC ACT CAG GTT ATT AAC CCA CCC AAT AGC GCG 1160
Ala Gln Asn Asn Ser Asn Thr Gln Val Ile Asn Pro Pro Asn Ser Ala
355 360 365 370

CAA AAA ACA GAA ATT CAA CCC ACG CAA GTC ATT GAT GGG CCT TTT GCT 1208
Gln Lys Thr Glu Ile Gln Pro Thr Gln Val Ile Asp Gly Pro Phe Ala
375 380 385

GGT GGC AAA GAC ACG GTT GTC AAT ATT GAT CGC ATC AAC ACT AAC GCT 1256
Gly Gly Lys Asp Thr Val Val Asn Ile Asp Arg Ile Asn Thr Asn Ala
390 395 400

GAT GGC ACG ATT AAA GTG GGA GGG TAT AAA GCT TCT CTT ACC ACC AAT 1304
Asp Gly Thr Ile Lys Val Gly Gly Tyr Lys Ala Ser Leu Thr Thr Asn
405 410 415

GCG GCT CAT TTG CAT ATC GGC AAA GGC GGT ATC AAT CTG TCC AAT CAA 1352
Ala Ala His Leu His Ile Gly Lys Gly Gly Ile Asn Leu Ser Asn Gln
420 425 430
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AAAGTCGCAC CCTTTGTGCA AAAATCGTTT TACAAGAAGA AAGGAAAAAA ATG GAA 5 6 

Met Glu 
1 

ATA CAA CAA ACA CAC CGC AAA ATC AAT CGC CCT TTG GTT TCT CTC GCT 104
Ile Gln Gln Thr His Arg Lys Ile Asn Arg Pro Leu Val Ser Leu Ala
5 10 15

TTA GTA GGA GCG TTA GTC AGC ATC ACA CCG CAA CAA AGT CAT GCC GCC 152
Leu Val Gly Ala Leu Val Ser Ile Thr Pro Gln Gln Ser His Ala Ala
20 25 30

TTT TTC ACA ACC GTG ATC ATT CCA GCC ATT GTT GGG GGG ATT GCT ACA 200
Phe Phe Thr Thr Val Ile Ile Pro Ala Ile Val Gly Gly Ile Ala Thr
35 40 45 50

GGC GCT GCT GTA GGA ACG GTC TCA GGG CTT CTT GGC TGG GGG CTA AAA 248
Gly Ala Ala Val Gly Thr Val Ser Gly Leu Leu Gly Trp Gly Leu Lys
55 60 65

CAA GCC GAA GAA GCC AAT AAA ACC CCA GAT AAA CCC GAT AAA GTT TGG 296
Gln Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys Pro Asp Lys Val Trp
70 75 80

CGC ATT CAA GCA GGA AAA GGC TTT AAT GAA TTC CCT AAC AAG GAA TAC 344
Arg Ile Gln Ala Gly Lys Gly Phe Asn Glu Phe Pro Asn Lys Glu Tyr
85 90 95

GAC TTA TAC AGA TCC CTA CTA TCT AGT AAG ATT GAT GGA GGC TGG GAT 392
Asp Leu Tyr Arg Ser Leu Leu Ser Ser Lys Ile Asp Gly Gly Trp Asp
100 105 110

TGG GGG AAT GCC GCT ACG CAT TAT TGG GTC AAA GGC GGG CAA TGG AAC 440
Trp Gly Asn Ala Ala Thr His Tyr Trp Val Lys Gly Gly Gln Trp Asn
115 120 125 130

AAG CTT GAA GTG GAT ATG AAA GAC GCT GTA GGG ACT TAT AAT CTC TCA 488
Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly Thr Tyr Asn Leu Ser
135 140 145

GGG CTA AGA AAC TTT ACT GGT GGG GAT TTA GAT GTC AAT ATG CAA AAA 536
Gly Leu Arg Asn Phe Thr Gly Gly Asp Leu Asp Val Asn Met Gln Lys
150 155 160

GCC ACT TTG CGC TTG GGC CAA TTC AAT GGC AAT TCT TTC ACA AGC TAT 584
Ala Thr Leu Arg Leu Gly Gln Phe Asn Gly Asn Ser Phe Thr Ser Tyr
165 170 175

AAG GAT AGC GCT GAT CGC ACC ACG AGA GTG GAT TTC AAC GCT AAA AAT 632
Lys Asp Ser Ala Asp Arg Thr Thr Arg Val Asp Phe Asn Ala Lys Asn
180 185 190

ATC TTA ATT GAT AAT TTT TTA GAA ATC AAT AAT CGT GTG GGT TCT GGA 680
Ile Leu Ile Asp Asn Phe Leu Glu Ile Asn Asn Arg Val Gly Ser Gly
195 200 205 210
Gly Phe Asp Lys Ala Thr Ser Leu Ser Phe Ile Gln Ala Trp Met Thr
260 265 270

Ser Leu Ser Tyr Ser Phe Gln Leu Tyr Phe Asp Phe Ser Gly Tyr Cys
275 280 285

Asp Met Ala Ile Gly Ile Gly Leu Phe Phe Asn Ile Lys Leu Pro Ile
290 295 300

Asn Phe Asn Ser Pro Tyr Lys Ala Leu Asn Ile Gln Asp Phe Trp Arg
305 310 315 320

Arg Trp His Ile Thr Leu Ser Arg Phe Leu Lys Glu Tyr Leu Tyr Ile
325 330 335

Pro Leu Gly Gly Asn Arg Val Lys Glu Leu Ile Val Tyr Arg Asn Leu
340 345 350

Ile Leu Val Phe Leu Ile Gly Gly Phe Trp His Gly Ala Gly Trp Thr
355 360 365

Phe Ile Ile Trp Gly Leu Leu His Gly Ile Ala Leu Ser Val His Arg
370 375 380

Ala Tyr Ser His Ala Thr Arg Lys Phe His Phe Thr Met Pro Lys Ile
385 390 395 400

Leu Ala Trp Leu Ile Thr Phe Asn Phe Ile Asn Leu Ala Trp Val Phe
405 410 415

Phe Arg Ala Lys Asn Leu Glu Ser Ala Leu Lys Val Leu Lys Gly Met
420 425 430

Val Gly Leu Asn Gly Val Ser Leu Cys His Leu Ser Lys Glu Ala Ser
435 440 445

Glu Phe Leu Asn Arg Val Asn Asp Asn Met Ile Met His Thr Ile Met
450 455 460

Tyr Ala Ser Pro Thr Phe Lys Met Cys Val Leu Met Ile Ile Ile Ser
465 470 475 480

Phe Cys Leu Lys Asn Ser Ser His Leu Tyr Gln Ser Asn Gln Met Asp
485 490 495

Trp Ile Lys Thr Thr Ser Ala Cys Leu Leu Leu Ser Ile Gly Phe Leu
500 505 510

Phe Ile Phe Ala Ser Ser Gln Ser Val Phe Leu Tyr Phe Asn Phe
515 520 525



(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3973 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic RNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...3920
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:
AAA ACA ACA AGC GCT TGT TTG TTG CTC TCT ATA GGT TTT TTA TTT ATT 1592
Lys Thr Thr Ser Ala Cys Leu Leu Leu Ser Ile Gly Phe Leu Phe Ile
500 505 510

TTT GCC AGT TCT CAA TCG GTA TTT TTG TAT TTT AAT TTT TAGGACACTG CT 1643
Phe Ala Ser Ser Gln Ser Val Phe Leu Tyr Phe Asn Phe
515 520 525

ATGGAATTTT ATAAAAAACA AACTTTAATC ATTGTTTCTT T 1684

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Met Leu Ala Ser Ile Ile Ser Ile Leu Arg Val Phe Val Leu Leu Phe
1 5 10 15

Asn Thr Pro Leu Phe Ile Phe Ala Phe Leu Pro Val Gly Phe Leu Gly
20 25 30

Tyr Phe Ile Leu Gln Ala Tyr Ala Lys Asn Pro Leu Phe Pro Lys Leu
35 40 45

Trp Leu Val Leu Ala Ser Leu Phe Phe Tyr Ala Phe Trp Asn Val Lys
50 55 60

Tyr Leu Pro Leu Leu Val Gly Ser Ile Val Phe Asn Tyr Phe Val Ala
65 70 75 80

Leu Lys Ile His Gln Thr Gln Pro Asn Ala Tyr Lys Arg Leu Trp Leu
85 90 95

Ile Leu Gly Leu Ile Ala Asn Val Ser Leu Leu Gly Phe Phe Lys Tyr
100 105 110

Thr Asp Phe Phe Leu Thr Asn Phe Asn Leu Ile Trp Lys Ser His Phe
115 120 125

Glu Thr Leu His Leu Ile Leu Pro Leu Ala Ile Ser Phe Phe Thr Leu
130 135 140

Gln Gln Ile Ala Tyr Leu Met Asp Thr Tyr Lys Gln Asn Gln Ile Met
145 150 155 160

Gln Pro Lys Met Arg Glu Arg Val Ser Glu Asn Ala Pro Ile Leu Leu
165 170 175

Asn Pro Pro Thr Ser Phe Phe Ser Leu Ser His Phe Leu Asp Tyr Ala
180 185 190

Leu Phe Val Ser Phe Phe Pro Gln Leu Ile Ala Gly Pro Ile Val His
195 200 205

His Ser Glu Met Met Pro Gln Phe Lys Asp Lys Asn Asn Gln Tyr Leu
210 215 220

Asn Tyr Arg Asn Ile Ala Leu Gly Leu Phe Ile Phe Ser Ile Gly Leu
225 230 235 240

Phe Lys Lys Val Val Ile Ala Asp Asn Thr Ala His Phe Ala Asp Phe
245 250 255
TCT TAT TCG TTC CAG CTG TAT TTT GAT TTT AGC GGT TAT TGC GAT ATG 920
Ser Tyr Ser Phe Gln Leu Tyr Phe Asp Phe Ser Gly Tyr Cys Asp Met
275 280 285 290

GCT ATA GGC ATT GGC CTC TTT TTT AAC ATC AAA CTC CCT ATC AAT TTT 968
Ala Ile Gly Ile Gly Leu Phe Phe Asn Ile Lys Leu Pro Ile Asn Phe
295 300 305

AAT AGC CCC TAT AAG GCT TTG AAT ATC CAA GAT TTT TGG AGG AGG TGG 1016
Asn Ser Pro Tyr Lys Ala Leu Asn Ile Gln Asp Phe Trp Arg Arg Trp
310 315 320

CAT ATC ACT TTG AGC CGC TTC TTA AAA GAG TAT TTG TAT ATC CCT TTA 1064
His Ile Thr Leu Ser Arg Phe Leu Lys Glu Tyr Leu Tyr Ile Pro Leu
325 330 335

GGG GGT AAT AGG GTG AAA GAA TTA ATC GTG TAT AGG AAT TTA ATT TTA 1112
Gly Gly Asn Arg Val Lys Glu Leu Ile Val Tyr Arg Asn Leu Ile Leu
340 345 350

GTG TTT TTG ATT GGG GGG TTT TGG CAT GGG GCT GGT TGG ACT TTT ATC 1160
Val Phe Leu Ile Gly Gly Phe Trp His Gly Ala Gly Trp Thr Phe Ile
355 360 365 370

ATT TGG GGG CTA TTG CAT GGG ATT GCT TTG AGC GTT CAT AGA GCG TAT 1208
Ile Trp Gly Leu Leu His Gly Ile Ala Leu Ser Val His Arg Ala Tyr
375 380 385

TCT CAT GCC ACT AGA AAA TTC CAT TTC ACT ATG CCA AAG ATT TTA GCA 1256
Ser His Ala Thr Arg Lys Phe His Phe Thr Met Pro Lys Ile Leu Ala
390 395 400

TGG CTC ATC ACT TTT AAT TTT ATC AAT CTC GCA TGG GTG TTT TTT AGA 1304
Trp Leu Ile Thr Phe Asn Phe Ile Asn Leu Ala Trp Val Phe Phe Arg
405 410 415

GCC AAA AAT TTA GAA AGC GCT TTG AAG GTT TTA AAG GGG ATG GTT GGT 1352
Ala Lys Asn Leu Glu Ser Ala Leu Lys Val Leu Lys Gly Met Val Gly
420 425 430

TTG AAT GGT GTT TCG CTT TGT CAT CTT TCA AAA GAG GCA TCA GAG TTT 1400
Leu Asn Gly Val Ser Leu Cys His Leu Ser Lys Glu Ala Ser Glu Phe
435 440 445 450

TTA AAT CGT GTC AAT GAT AAC ATG ATC ATG CAC ACC ATA ATG TAT GCA 1448
Leu Asn Arg Val Asn Asp Asn Met Ile Met His Thr Ile Met Tyr Ala
455 460 465

TCC CCC ACA TTT AAA ATG TGT GTT TTG ATG ATA ATC ATC TCT TTT TGT 1496
Ser Pro Thr Phe Lys Met Cys Val Leu Met Ile Ile Ile Ser Phe Cys
470 475 480

TTA AAA AAT AGT TCC CAT TTA TAG CAA TCC AAT CAA ATG GAT TGG ATT 1544
Leu Lys Asn Ser Ser His Leu Tyr Gln Ser Asn Gln Met Asp Trp Ile
485 490 495
GTA TTG GCT AGT TTG TTT TTT TAT GCT TTT TGG AAT GTG AAG TAT TTG 248
Val Leu Ala Ser Leu Phe Phe Tyr Ala Phe Trp Asn Val Lys Tyr Leu
55 60 65

CCC TTA TTG GTT GGC TCT ATT GTT TTT AAT TAT TTT GTG GCT TTG AAA 296
Pro Leu Leu Val Gly Ser Ile Val Phe Asn Tyr Phe Val Ala Leu Lys
70 75 80

ATC CAT CAA ACC CAG CCA AAT GCA TAT AAA AGA TTA TGG CTT ATT TTG 344
Ile His Gln Thr Gln Pro Asn Ala Tyr Lys Arg Leu Trp Leu Ile Leu
85 90 95

GGC TTG ATC GCT AAT GTT TCA CTT TTA GGA TTT TTC AAA TAC ACT GAT 392
Gly Leu Ile Ala Asn Val Ser Leu Leu Gly Phe Phe Lys Tyr Thr Asp
100 105 110

TTT TTC TTA ACC AAT TTC AAT CTA ATA TGG AAG AGC CAT TTT GAA ACC 440
Phe Phe Leu Thr Asn Phe Asn Leu Ile Trp Lys Ser His Phe Glu Thr
115 120 125 130

TTG CAT TTA ATC TTG CCT TTA GCG ATC AGC TTT TTC ACT TTG CAA CAA 488
Leu His Leu Ile Leu Pro Leu Ala Ile Ser Phe Phe Thr Leu Gln Gln
135 140 145

ATC GCT TAC TTG ATG GAC ACT TAT AAG CAA AAT CAA ATC ATG CAG CCC 536
Ile Ala Tyr Leu Met Asp Thr Tyr Lys Gln Asn Gln Ile Met Gln Pro
150 155 160

AAA ATG AGA GAG AGA GTG AGT GAA AAC GCT CCT ATT TTA TTA AAT CCT 584
Lys Met Arg Glu Arg Val Ser Glu Asn Ala Pro Ile Leu Leu Asn Pro
165 170 175

CCC ACT TCA TTT TTT TCA CTT TCG CAT TTT TTA GAT TAC GCT TTA TTT 632
Pro Thr Ser Phe Phe Ser Leu Ser His Phe Leu Asp Tyr Ala Leu Phe
180 185 190

GTG AGT TTC TTC CCT CAA CTC ATT GCA GGG CCT ATT GTG CAT CAT AGC 680
Val Ser Phe Phe Pro Gln Leu Ile Ala Gly Pro Ile Val His His Ser
195 200 205 210

GAG ATG ATG CCT CAA TTT AAA GAT AAA AAC AAT CAA TAT TTG AAT TAC 728
Glu Met Met Pro Gln Phe Lys Asp Lys Asn Asn Gln Tyr Leu Asn Tyr
215 220 225

AGA AAT ATC GCT TTA GGC TTG TTT ATC TTT TCT ATC GGT TTG TTT AAA 776
Arg Asn Ile Ala Leu Gly Leu Phe Ile Phe Ser Ile Gly Leu Phe Lys
230 235 240

AAG GTC GTG ATT GCA GAT AAT ACC GCT CAT TTT GCT GAT TTT GGA TTT 824
Lys Val Val Ile Ala Asp Asn Thr Ala His Phe Ala Asp Phe Gly Phe
245 250 255

GAT AAG GCG ACT AGC TTA AGT TTT ATT CAA GCA TGG ATG ACT TCT TTA 872
Asp Lys Ala Thr Ser Leu Ser Phe Ile Gln Ala Trp Met Thr Ser Leu
260 265 270
1 5 10 15

Ser Leu Val Ala Arg Glu Lys Asp Ala Ser Ser Asn Leu Phe Asp Leu
20 25 30

Ile Asp Lys Gly Ile Asn Arg Glu Gln Glu Leu Lys Glu Gln Glu Gln
35 40 45

Lys Thr Arg Leu Lys Leu Ala Gln Ser Pro Leu Val Ala Leu Glu Ile
50 55 60

Val Pro Gln Glu Thr Pro Tyr Leu Glu Trp Gln Gly Ala Arg Glu Ser
65 70 75 80

Tyr Tyr Leu Lys Val Ser Ala Val Val Glu Ser Val Val Ile Leu Lys
85 90 95

Ile Asp Ile Asn Gln Gly Arg Ser Cys Ser Leu Tyr Pro Thr Pro Lys
100 105 110

Ser Val Ser Leu Val Arg Asn Gln Ser Val Ala Tyr Glu Ile Leu Cys
115 120 125

Glu Asn Gln Pro Leu Trp Ile Glu Val Ser Thr Asn Leu Gly Lys Arg
130 135 140

Thr Phe Gln Phe
145



(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1684 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1631
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

CCTTTCTTTA TTCTCATTGC ATCTTGTGGG ATCATTTTTT TTTTTTATTA ATG CTA 56
Met Leu
1

GCT TCC ATC ATC TCA ATT TTA AGG GTT TTT GTT TTG TTA TTC AAC ACG 104
Ala Ser Ile Ile Ser Ile Leu Arg Val Phe Val Leu Leu Phe Asn Thr
5 10 15

CCG TTA TTC ATC TTT GCT TTT TTG CCT GTT GGT TTT TTA GGG TAT TTT 152
Pro Leu Phe Ile Phe Ala Phe Leu Pro Val Gly Phe Leu Gly Tyr Phe
20 25 30

ATC TTG CAA GCT TAT GCT AAA AAT CCC CTG TTC CCT AAA CTA TGG CTA 200
Ile Leu Gln Ala Tyr Ala Lys Asn Pro Leu Phe Pro Lys Leu Trp Leu
35 40 45 50
CGC ATG AAT AAA AAT TAT CTT TTA ATC TTT TTG TTG TTA GCG AGT CTT 104
Arg Met Asn Lys Asn Tyr Leu Leu Ile Phe Leu Leu Leu Ala Ser Leu
5 10 15

GTT GCT AGA GAG AAG GAC GCT TCT TCA AAC CTT TTT GAT TTG ATT GAT 152
Val Ala Arg Glu Lys Asp Ala Ser Ser Asn Leu Phe Asp Leu Ile Asp
20 25 30

AAG GGG ATC AAC AGA GAA CAA GAA TTA AAA GAG CAG GAG CAA AAA ACG 200
Lys Gly Ile Asn Arg Glu Gln Glu Leu Lys Glu Gln Glu Gln Lys Thr
35 40 45 50

CGC TTA AAA CTG GCT CAA AGC CCT TTA GTA GCG TTA GAG ATT GTC CCC 248
Arg Leu Lys Leu Ala Gln Ser Pro Leu Val Ala Leu Glu Ile Val Pro
55 60 65

CAA GAA ACG CCC TAT TTA GAA TGG CAA GGG GCT AGG GAG TCG TAT TAT 296
Gln Glu Thr Pro Tyr Leu Glu Trp Gln Gly Ala Arg Glu Ser Tyr Tyr
70 75 80

TTA AAG GTG AGC GCT GTA GTG GAG AGC GTG GTT ATC TTA AAA ATT GAC 344
Leu Lys Val Ser Ala Val Val Glu Ser Val Val Ile Leu Lys Ile Asp
85 90 95

ATC AAT CAA GGG CGT TCT TGC TCG CTC TAC CCC ACG CCT AAA AGC GTT 392
Ile Asn Gln Gly Arg Ser Cys Ser Leu Tyr Pro Thr Pro Lys Ser Val
100 105 110

TCT TTA GTG AGG AAT CAA AGC GTA GCC TAT GAA ATT TTA TGC GAA AAC 440
Ser Leu Val Arg Asn Gln Ser Val Ala Tyr Glu Ile Leu Cys Glu Asn
115 120 125 130

CAA CCC CTA TGG ATA GAA GTA AGC ACC AAT TTA GGC AAA CGC ACC TTT 488
Gln Pro Leu Trp Ile Glu Val Ser Thr Asn Leu Gly Lys Arg Thr Phe
135 140 145

CAG TTT TAACCTGCAA CCAACATTAA AGAATGCCTT TAGCATTTTA AAACCCCTTT AT 546
Gln Phe

C 547

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 148 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

Met Asn Arg Met Asn Lys Asn Tyr Leu Leu Ile Phe Leu Leu Leu Ala
Ser Val Leu Val Ile Gly Ala Gly Gly Leu Gly Ser Ala Val Leu Met
20 25 30

Tyr Leu Cys Ala Ala Gly Ile Gly Lys Ile Gly Ile Val Asp Phe Asp
35 40 45

Val Val Asp Met Ser Asn Leu Gln Arg Gln Ile Ile His Ser Gln Asp
50 55 60

Phe Leu Asn Gln Ser Lys Ala Ser Ser Ala Lys Ala Arg Leu Lys Gln
65 70 75 80

Leu Asn Ala Gly Ile Glu Ile Glu Ala Phe Glu Glu Arg Phe Lys Ala
85 90 95

His Asn Ala Leu Ser Leu Ile Glu Pro Tyr Asp Phe Ile Ile Asp Ala
100 105 110

Thr Asp Asn Phe Asn Ala Lys Phe Leu Ile Asn Asp Ala Cys Val Leu
115 120 125

Ala Gln Lys Pro Tyr Ser His Ala Gly Val Leu Glu Tyr Arg Gly Gln
130 135 140

Ser Met Ser Val Leu Pro His Ser Ala Cys Leu Ala Cys Val Phe Asp
145 150 155 160

Lys Pro Pro Lys Lys Gly Leu Asn Pro Ile Ser Gly Leu Phe Gly Val
165 170 175

Leu Pro Gly Val Leu Gly Cys Ile Gln Ala Ser Glu Cys Leu Lys Tyr
180 185 190

Phe Leu Gly Phe Glu Thr Leu Leu Ile Asn Thr Leu Leu Ile Ala Asp
195 200 205

Ile Lys Thr Met Asp Phe Lys Lys Ile Gln Ala Pro Lys Asn Pro Glu
210 215 220

Cys Arg Val Cys Gly Thr His Lys Ile Thr His Leu Gln Asp Tyr Glu
225 230 235 240

Ile



(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 547 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...494
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

AAAATTCATT CTATTTTAGA TAATGAGTTC AATCCCCACA AACAGCAAGA ATG AAT 56 

Met Asn 
1 
85 90 95

GCT CTT TCT CTC ATA GAG CCT TAT GAT TTT ATC ATA GAG GCC ACG GAC 392
Ala Leu Ser Leu Ile Glu Pro Tyr Asp Phe Ile Ile Asp Ala Thr Asp
100 105 110

AAT TTT AAC GCT AAA TTT TTG ATC AAT GAC GCT TGC GTG TTA GCC CAA 440
Asn Phe Asn Ala Lys Phe Leu Ile Asn Asp Ala Cys Val Leu Ala Gln
115 120 125 130

AAA CCC TAT TCG CAT GCC GGG GTT TTA GAA TAC AGG GGG CAA AGC ATG 488
Lys Pro Tyr Ser His Ala Gly Val Leu Glu Tyr Arg Gly Gln Ser Met
135 140 145

AGC GTT TTA CCC CAT AGC GCA TGC TTA GCG TGC GTT TTT GAT AAG CCC 536
Ser Val Leu Pro His Ser Ala Cys Leu Ala Cys Val Phe Asp Lys Pro
150 155 160

CCT AAA AAG GGA TTA AAT CCC ATT TCA GGG CTT TTT GGG GTC TTA CCC 584
Pro Lys Lys Gly Leu Asn Pro Ile Ser Gly Leu Phe Gly Val Leu Pro
165 170 175

GGA GTT TTA GGG TGT ATC CAA GCG AGC GAA TGC CTT AAA TAT TTT TTA 632
Gly Val Leu Gly Cys Ile Gln Ala Ser Glu Cys Leu Lys Tyr Phe Leu
180 185 190

GGG TTT GAA ACT TTA CTT ATA AAT ACT TTA CTT ATA GCC GAT ATT AAA 680
Gly Phe Glu Thr Leu Leu Ile Asn Thr Leu Leu Ile Ala Asp Ile Lys
195 200 205 210

ACG ATG GAT TTT AAA AAA ATT CAA GCA CCC AAA AAC CCT GAA TGT AGG 728
Thr Met Asp Phe Lys Lys Ile Gln Ala Pro Lys Asn Pro Glu Cys Arg
215 220 225

GTT TGT GGC ACG CAT AAA ATC ACG CAT TTA CAG GAT TAT GAA ATT TAGAT 778
Val Cys Gly Thr His Lys Ile Thr His Leu Gln Asp Tyr Glu Ile
230 235 240

TAAGGGGTAA GTTTTGGATT TATCAACCAT ATTAGGCTTG GTATTGGC 826

(2) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Met Leu Glu Asp Val Gly Glu Glu Gly Gln Leu Lys Leu Leu Lys Ser
1 5 10 15
1 5 10 15

Phe Leu Asp Arg Phe Gly Glu Lys Met Glu Glu Ser Lys Glu Arg Lys
20 25 30

Gln Leu Leu Ile Ala Ser Leu Ala Pro Leu Ala Gly Phe Ala Ala Arg
35 40 45

Ile Ser Pro Gly Leu Leu Ser Leu Leu Gly Leu Met Leu Ala Met Gly
50 55 60

Cys Ala Asn Phe Trp Ile
65 70

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...773
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

TGGCTTAATT GTTAAGCCGG CTAGAAAAAG AGCGTTATTT GCGCCATATC ATG CTA 56
Met Leu
1

GAA GAT GTG GGC GAA GAG GGT CAA TTG AAG CTT TTA AAA TCT AGC GTT 104
Glu Asp Val Gly Glu Glu Gly Gln Leu Lys Leu Leu Lys Ser Ser Val
5 10 15

TTA GTC ATT GGG GCT GGG GGT CTT GGA TCG GCG GTT TTG ATG TAT TTG 152
Leu Val Ile Gly Ala Gly Gly Leu Gly Ser Ala Val Leu Met Tyr Leu
20 25 30

TGT GCC GCT GGG ATA GGA AAA ATC GGT ATT GTA GAT TTT GAT GTA GTA 200
Cys Ala Ala Gly Ile Gly Lys Ile Gly Ile Val Asp Phe Asp Val Val
35 40 45 50

GAT ATG AGT AAT TTG CAA CGC CAA ATC ATC CAT TCA CAG GAT TTT TTA 248
Asp Met Ser Asn Leu Gln Arg Gln Ile Ile His Ser Gln Asp Phe Leu
55 60 65

AAC CAA TCT AAA GCC TCT AGC GCG AAA GCG CGC TTA AAA CAA CTC AAT 296
Asn Gln Ser Lys Ala Ser Ser Ala Lys Ala Arg Leu Lys Gln Leu Asn
70 75 80

GCG GGT ATT GAA ATA GAG GCT TTT GAA GAA CGC TTT AAG GCT CAT AAC 344
Ala Gly Ile Glu Ile Glu Ala Phe Glu Glu Arg Phe Lys Ala His Asn
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 52...261
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

GTAAGCTAGA GTTTATGCAA AGGGAGATGA GTAGCCTAGA AGCTAAGCAT T ATG ATT 57
Met Ile
1

CAG TTA AAA TCA AAT TTG GAT TGG TAC GCA GAT TAT TTG AAT TTT TTA 105
Gln Leu Lys Ser Asn Leu Asp Trp Tyr Ala Asp Tyr Leu Asn Phe Leu
5 10 15

GAT CGC TTT GGG GAA AAA ATG GAA GAA TCC AAA GAG CGA AAA CAA CTC 153
Asp Arg Phe Gly Glu Lys Met Glu Glu Ser Lys Glu Arg Lys Gln Leu
20 25 30

CTG ATC GCT TCC CTT GCA CCT CTT GCG GGC TTT GCT GCA AGA ATA TCG 201
Leu Ile Ala Ser Leu Ala Pro Leu Ala Gly Phe Ala Ala Arg Ile Ser
35 40 45 50

CCG GGA TTA TTG AGC TTA TTG GGT TTG ATG CTG GCA ATG GGG TGT GCA 249
Pro Gly Leu Leu Ser Leu Leu Gly Leu Met Leu Ala Met Gly Cys Ala
55 60 65

AAT TTT TGG ATT TAGAAACCAA TCTGTGCAAG ATTTATGAAT CGCGCCCGTT AA 303
Asn Phe Trp Ile
70



(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

Met Ile Gln Leu Lys Ser Asn Leu Asp Trp Tyr Ala Asp Tyr Leu Asn
Asn Phe Asn Ile Ala Ser Asp Pro Asp Met Ala Arg Leu Val Leu Ser
100 105 110

Leu Pro Leu Val Ala Met Asp Ser Val Gly Ala Gly Ala Gly Ser Phe
115 120 125

Val Arg Ile Asp Pro His Ser Arg Ser Val Lys Leu Gly Pro Asp Ser
130 135 140

Ala Gly Tyr Arg Val Gly Thr Cys Trp Lys Asp Ser Gly Leu Asp Thr
145 150 155 160

Val Ser Val Thr Asp Cys His Ile Val Leu Gly Tyr Leu Asn Pro Asp
165 170 175

Asn Phe Leu Gly Gly Leu Ile Lys Leu Asp Val Asp Arg Ala Lys Lys
180 185 190

His Ile Lys Glu Gln Ile Ala Asp Pro Leu Gly Ile Ser Val Glu Asp
195 200 205

Ala Ala Ala Gly Val Ile Glu Leu Leu Asp Leu Glu Leu Lys Glu Tyr
210 215 220

Leu Arg Ser Asn Ile Ser Ala Lys Gly Tyr Ser Pro Ser Asp Phe Val
225 230 235 240

Cys Phe Ser Tyr Gly Gly Ala Gly Pro Val His Thr Tyr Gly Tyr Thr
245 250 255

Glu Gly Leu Gly Phe Lys Asp Val Val Val Pro Ala Trp Ala Ala Gly
260 265 270

Phe Ser Ala Phe Gly Cys Ala Cys Ala Asp Phe Glu Tyr Arg Tyr Asp
275 280 285

Lys Ser Val Asp Ile Ala Ile Pro Gln Tyr Ser Ser Asp Lys Ser Lys
290 295 300

Ile Asp Ala Cys Lys Ile Ile Gln Asp Ala Trp Asp Glu Leu Thr Leu
305 310 315 320

Lys Val Ile Glu Glu Phe Lys Ile Asn Gly Phe Ser Gln Lys Asp Val
325 330 335

Ile Leu Arg Pro Gly Tyr Arg Met Gln Tyr Met Gly Gln Leu Asn Asp
340 345 350

Leu Glu Ile Thr Ser Pro Val Ser Lys Ala Ala Ser Val Ala Asp Trp
355 360 365

Glu Glu Ile Val Lys Glu Tyr Glu Lys Thr Tyr Ala Arg Val Tyr Ser
370 375 380

Glu Ser Ala Cys Ser Pro Glu Leu Gly Phe Ser Val Thr Gly Val Ile
385 390 395 400

Met Arg Gly Val Val Ala Thr Gln Lys Pro Val Ile Pro Val Glu Lys
405 410 415

Glu His Gly Ala Thr Pro Pro Lys Glu Ala Lys Ile Gly Val Arg Lys
420 425 430

Phe Tyr Arg His Lys Lys Trp Val Asp Ala Asp Val Trp Gln Met Glu
435 440 445

Lys Leu Leu Pro Gly Asn Glu Val Ile Gly Pro Ala Ile Val Glu Ser
450 455 460

Asp Ala Thr Thr Phe Val Ile Pro Lys Gly Phe Ala Thr Arg Leu Asp
465 470 475 480

Lys His Arg Leu Phe His Leu Lys Glu Ile Lys
485 490

(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 303 base pairs 
GCG TGT TCT CCA GAG CTT GGT TTT AGC GTG ACT GGC GTG ATC ATG CGT 1256
Ala Cys Ser Pro Glu Leu Gly Phe Ser Val Thr Gly Val Ile Met Arg
390 395 400

GGT GTT GTG GCT ACG CAA AAA CCT GTG ATT CCG GTT GAA AAA GAG CAT 1304
Gly Val Val Ala Thr Gln Lys Pro Val Ile Pro Val Glu Lys Glu His
405 410 415

GGT GCT ACG CCC CCA AAA GAA GCC AAA ATA GGC GTT AGA AAA TTC TAT 1352
Gly Ala Thr Pro Pro Lys Glu Ala Lys Ile Gly Val Arg Lys Phe Tyr
420 425 430

CGG CAT AAA AAA TGG GTG GAT GCA GAT GTG TGG CAA ATG GAA AAA TTA 1400
Arg His Lys Lys Trp Val Asp Ala Asp Val Trp Gln Met Glu Lys Leu
435 440 445 450

CTG CCT GGA AAT GAA GTC ATA GGA CCT GCG ATC GTG GAA TCA GAT GCG 1448
Leu Pro Gly Asn Glu Val Ile Gly Pro Ala Ile Val Glu Ser Asp Ala
455 460 465

ACC ACT TTC GTG ATA CCC AAA GGC TTT GCG ACA AGA CTA GAC AAA CAC 1496
Thr Thr Phe Val Ile Pro Lys Gly Phe Ala Thr Arg Leu Asp Lys His
470 475 480

CGA TTG TTC CAC TTG AAA GAA ATT AAA TAAAGGAGTT CAAAATGGCA AATTTAT 1550
Arg Leu Phe His Leu Lys Glu Ile Lys
485 490

TGAAAAACGG CAAAACTTTA AAACAA 1576

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Met Asn Thr Thr Ile Leu Glu Ala Tyr Ala Ala Glu Pro Ser Arg Gln
1 5 10 15

Thr Leu Ser Lys Val Ser Asn Arg Phe Lys Glu His Gly Ala Lys Phe
20 25 30

Asp Leu Arg Val Met Ala Thr His Gly Gly Thr Ile Ser Trp Lys Ala
35 40 45

Lys Glu Leu Ala Arg Thr Ile Val Ser Gly Pro Ile Gly Gly Val Ile
50 55 60

Gly Ser Lys Leu Leu Gly Glu Thr Leu Gly Tyr Asp Asn Ile Ala Cys
65 70 75 80

Ser Asp Ile Xaa Gly Thr Ser Phe Asp Met Ala Leu Ile Val Lys Ser
85 90 95
GTA ACC GAT TGC CAT ATT GTT TTA GGC TAT TTG AAC CCG GAT AAT TTC 584
Val Thr Asp Cys His Ile Val Leu Gly Tyr Leu Asn Pro Asp Asn Phe
165 170 175

TTA GGC GGT TTG ATC AAA TTA GAT GTG GAT AGG GCT AAA AAA CAC ATT 632
Leu Gly Gly Leu Ile Lys Leu Asp Val Asp Arg Ala Lys Lys His Ile
180 185 190

AAA GAA CAA ATC GCT GAT CCG CTA GGC ATT AGC GTA GAA GAT GCG GCT 680
Lys Glu Gln Ile Ala Asp Pro Leu Gly Ile Ser Val Glu Asp Ala Ala
195 200 205 210

GCT GGT GTG ATT GAA TTG CTT GAT TTG GAG CTT AAA GAA TAC TTG CGA 728
Ala Gly Val Ile Glu Leu Leu Asp Leu Glu Leu Lys Glu Tyr Leu Arg
215 220 225

TCC AAC ATT AGC GCT AAA GGG TAT AGC CCA TCT GAT TTT GTG TGC TTT 776
Ser Asn Ile Ser Ala Lys Gly Tyr Ser Pro Ser Asp Phe Val Cys Phe
230 235 240

TCA TAT GGT GGC GCA GGA CCT GTG CAT ACC TAT GGC TAT ACA GAA GGA 824
Ser Tyr Gly Gly Ala Gly Pro Val His Thr Tyr Gly Tyr Thr Glu Gly
245 250 255

TTA GGG TTT AAG GAT GTG GTA GTG CCT GCG TGG GCG GCT GGA TTT AGC 872
Leu Gly Phe Lys Asp Val Val Val Pro Ala Trp Ala Ala Gly Phe Ser
260 265 270

GCT TTT GGT TGT GCT TGC GCT GAT TTT GAA TAC AGA TAC GAC AAG AGC 920
Ala Phe Gly Cys Ala Cys Ala Asp Phe Glu Tyr Arg Tyr Asp Lys Ser
275 280 285 290

GTG GAT ATT GCC ATT CCG CAG TAT TCT TCA GAC AAG TCA AAA ATA GAC 968
Val Asp Ile Ala Ile Pro Gln Tyr Ser Ser Asp Lys Ser Lys Ile Asp
295 300 305

GCA TGC AAA ATC ATT CAA GAC GCA TGG GAT GAA TTG ACT TTG AAA GTG 1016
Ala Cys Lys Ile Ile Gln Asp Ala Trp Asp Glu Leu Thr Leu Lys Val
310 315 320

ATT GAA GAG TTC AAG ATC AAT GGA TTT TCT CAA AAA GAT GTG ATC TTA 1064
Ile Glu Glu Phe Lys Ile Asn Gly Phe Ser Gln Lys Asp Val Ile Leu
325 330 335

AGA CCT GGA TAC AGG ATG CAG TAT ATG GGG CAA TTG AAT GAT TTA GAG 1112
Arg Pro Gly Tyr Arg Met Gln Tyr Met Gly Gln Leu Asn Asp Leu Glu
340 345 350

ATC ACT TCT CCT GTG TCA AAA GCT GCA AGC GTG GCT GAT TGG GAA GAG 1160
Ile Thr Ser Pro Val Ser Lys Ala Ala Ser Val Ala Asp Trp Glu Glu
355 360 365 370

ATT GTC AAA GAA TAT GAA AAA ACC TAC GCT CGC GTT TAT TCT GAA TCA 1208
Ile Val Lys Glu Tyr Glu Lys Thr Tyr Ala Arg Val Tyr Ser Glu Ser
375 380 385
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1523
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

CTGTATTCGC TTCTGTGGAT TACTACCCTC AAAGAAAAGA AAGCCACAGA ATG AAC 56
Met Asn
1

ACC ACC ATC TTA GAA GCT TAT GCG GCT GAG CCA AGC AGG CAA ACC CTC 104
Thr Thr Ile Leu Glu Ala Tyr Ala Ala Glu Pro Ser Arg Gln Thr Leu
5 10 15

TCT AAA GTC AGC AAC CGA TTC AAA GAG CAT GGC GCT AAA TTT GAT CTT 152
Ser Lys Val Ser Asn Arg Phe Lys Glu His Gly Ala Lys Phe Asp Leu
20 25 30

CGT GTG ATG GCA ACG CAT GGA GGC ACC ATT AGT TGG AAA GCT AAA GAA 200
Arg Val Met Ala Thr His Gly Gly Thr Ile Ser Trp Lys Ala Lys Glu
35 40 45 50

CTC GCT AGG ACT ATT GTG AGC GGC CCT ATT GGA GGC GTG ATT GGA TCT 248
Leu Ala Arg Thr Ile Val Ser Gly Pro Ile Gly Gly Val Ile Gly Ser
55 60 65

AAA TTG CTA GGC GAA ACG CTT GGT TAT GAC AAT ATT GCA TGC AGT GAT 296
Lys Leu Leu Gly Glu Thr Leu Gly Tyr Asp Asn Ile Ala Cys Ser Asp
70 75 80

ATT GGK GGC ACG AGC TTT GAT ATG GCG CTT ATC GTT AAG AGC AAT TTT 344
Ile Xaa Gly Thr Ser Phe Asp Met Ala Leu Ile Val Lys Ser Asn Phe
85 90 95

AAC ATC GCT TCT GAC CCT GAT ATG GCA CGC CTT GTT TTA TCT CTA CCG 392
Asn Ile Ala Ser Asp Pro Asp Met Ala Arg Leu Val Leu Ser Leu Pro
100 105 110

CTT GTG GCT ATG GAT TCT GTT GGC GCA GGT GCT GGG AGT TTT GTG CGC 440
Leu Val Ala Met Asp Ser Val Gly Ala Gly Ala Gly Ser Phe Val Arg
115 120 125 130

ATT GAT CCA CAC AGC CGA TCT GTC AAA CTA GGG CCT GAC AGC GCG GGG 488
Ile Asp Pro His Ser Arg Ser Val Lys Leu Gly Pro Asp Ser Ala Gly
135 140 145

TAT AGA GTT GGC ACT TGT TGG AAA GAC AGC GGG TTA GAC ACG GTT TCA 536
Tyr Arg Val Gly Thr Cys Trp Lys Asp Ser Gly Leu Asp Thr Val Ser
150 155 160
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...161
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

GCATTGCTAA TTTGGGAATA CTTGTTTATG CCAGTGAAAT AGGAGCGGCT ATG ATG 56
Met Met
1

TGG CGT AGT CTC ARG GTG GCT TTT ACG ATC ACT GAT ATT AGT AAA ACC 104
Trp Arg Ser Leu Xaa Val Ala Phe Thr Ile Thr Asp Ile Ser Lys Thr
5 10 15

TTT CAA TCC CAG CCT AAG CAC CAT CAA ATC GGC ACT TTA GAA TTG AAT 152
Phe Gln Ser Gln Pro Lys His His Gln Ile Gly Thr Leu Glu Leu Asn
20 25 30

TTC GCC TTT TGATTTAATA TCAGTTTAAT ATTTTTCTTC CTATATGATA TTTATATGA 210
Phe Ala Phe
35

TATT 214

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Met Met Trp Arg Ser Leu Xaa Val Ala Phe Thr Ile Thr Asp Ile Ser
1 5 10 15

Lys Thr Phe Gln Ser Gln Pro Lys His His Gln Ile Gly Thr Leu Glu
20 25 30

Leu Asn Phe Ala Phe
35

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
GAT CAT ATC GCT TTA GAA TGT ATC GCT GAA CTT TCT TAT AAG GGT TTT T 585
Asp His Ile Ala Leu Glu Cys Ile Ala Glu Leu Ser Tyr Lys Gly Phe
165 170 175

GAATTGGTTT TTGTTTTTCT TTTTAAATGC GTTAATGAAG AAACAAGCCT GA 637

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Met Ser Gln Lys Ile Leu Ile Leu Gly Ile Gly Asn Ile Leu Phe Gly
1 5 10 15

Asp Glu Gly Ile Gly Val His Leu Ala His Tyr Leu Lys Lys Asn Phe
20 25 30

Ser Phe Phe Pro Ser Val Asp Ile Ile Asp Gly Gly Thr Met Ala Gln
35 40 45

Gln Leu Ile Pro Leu Ile Thr Ser Tyr Glu Lys Val Leu Ile Leu Asp
50 55 60

Cys Val Ser Ala Glu Gly Val Glu Ile Gly Ser Val Tyr Ala Phe Asp
65 70 75 80

Phe Lys Asp Ala Pro Lys Glu Ile Thr Trp Ala Gly Ser Ala His Glu
85 90 95

Val Glu Met Leu His Thr Leu Arg Leu Thr Glu Phe Leu Gly Asp Leu
100 105 110

Pro Lys Thr Phe Ile Val Gly Leu Val Pro Phe Val Ile Gly Ser Glu
115 120 125

Thr Thr Phe Lys Leu Ser Ser Lys Ile Leu Asn Ala Leu Glu Thr Ala
130 135 140

Leu Lys Ala Ile Glu Thr Gln Leu Asn Ala Trp Gly Val Lys Met Gln
145 150 155 160

Arg Thr Asp His Ile Ala Leu Glu Cys Ile Ala Glu Leu Ser Tyr Lys
165 170 175

Gly Phe



(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 214 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...584
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

GAAAGTTCGG GGGCGGATTC TATGATTAAT GGCTATGGTT ATACCAAAGA ATG AGT 56
Met Ser
1

CAA AAA ATC CTA ATT CTA GGT ATT GGC AAT ATC CTT TTT GGC GAT GAA 104
Gln Lys Ile Leu Ile Leu Gly Ile Gly Asn Ile Leu Phe Gly Asp Glu
5 10 15

GGG ATT GGG GTG CAT TTA GCC CAC TAC CTC AAA AAA AAT TTT TCT TTT 152
Gly Ile Gly Val His Leu Ala His Tyr Leu Lys Lys Asn Phe Ser Phe
20 25 30

TTC CCT AGC GTG GAT ATT ATA GAT GGG GGG ACA ATG GCC CAG CAG CTC 200
Phe Pro Ser Val Asp Ile Ile Asp Gly Gly Thr Met Ala Gln Gln Leu
35 40 45 50

ATT CCT TTA ATC ACT TCG TAT GAA AAG GTT TTG ATT TTG GAT TGC GTG 248
Ile Pro Leu Ile Thr Ser Tyr Glu Lys Val Leu Ile Leu Asp Cys Val
55 60 65

AGC GCT GAA GGC GTT GAG ATA GGA TCA GTC TAT GCT TTT GAT TTT AAG 296
Ser Ala Glu Gly Val Glu Ile Gly Ser Val Tyr Ala Phe Asp Phe Lys
70 75 80

GAC GCT CCT AAA GAA ATC ACA TGG GCT GGG AGC GCT CAT GAA GTG GAA 344
Asp Ala Pro Lys Glu Ile Thr Trp Ala Gly Ser Ala His Glu Val Glu
85 90 95

ATG CTA CAC ACT TTA AGG CTC ACG GAG TTT TTA GGG GAT TTG CCT AAA 392
Met Leu His Thr Leu Arg Leu Thr Glu Phe Leu Gly Asp Leu Pro Lys
100 105 110

ACT TTT ATC GTG GGG CTT GTG CCT TTT GTG ATA GGG AGC GAG ACC ACT 440
Thr Phe Ile Val Gly Leu Val Pro Phe Val Ile Gly Ser Glu Thr Thr
115 120 125 130

TTC AAG CTT TCA AGC AAA ATT TTA AAC GCT TTA GAA ACC GCC TTA AAA 488
Phe Lys Leu Ser Ser Lys Ile Leu Asn Ala Leu Glu Thr Ala Leu Lys
135 140 145

GCC ATA GAA ACC CAA CTC AAC GCA TGG GGG GTT AAA ATG CAA CGC ACC 536
Ala Ile Glu Thr Gln Leu Asn Ala Trp Gly Val Lys Met Gln Arg Thr
150 155 160
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



Met Thr Leu Asn Thr Phe Leu Asp Thr Cys Phe Leu Leu Phe Ile Ser
1 5 10 15

Ile Leu Phe Tyr Leu Ser Ile Pro Ile Tyr Pro Asn Lys Val Val Val
20 25 30

Val Pro Gln Gly Ser Leu Lys Lys Val Phe Phe Ser Leu Lys Glu Gln
35 40 45

Gly Val Asp Met Asn Ala Leu Asp Leu Leu Phe Leu Arg Leu Met Gly
50 55 60

Met Pro Lys Lys Gly Tyr Ile Asp Met Gly Asp Gly Ala Leu Arg Lys
65 70 75 80

Gly Asp Phe Leu Val Arg Leu Ile Lys Ala Lys Ala Ala Gln Lys Ser
85 90 95

Ala Thr Leu Ile Pro Gly Glu Ser Arg Tyr Phe Phe Thr Gln Ile Leu
100 105 110

Ser Glu Thr Tyr Gln Leu Glu Thr Ser Asp Leu Asn Gln Ala Tyr Glu
115 120 125

Ser Ile Ala Pro Arg Leu Asn Gly Glu Val Ile Glu Asp Gly Val Ile
130 135 140

Trp Pro Asp Thr Tyr His Leu Pro Leu Gly Glu Asp Ala Phe Lys Ile
145 150 155 160

Met Gln Thr Leu Ile Gly Gln Ser Met Lys Lys His Glu Ala Leu Ser
165 170 175

Lys Gln Trp Leu Gly Tyr Tyr His Lys Glu Glu Trp Phe Glu Lys Ile
180 185 190

Ile Leu Ala Ser Ile Val Gln Lys Glu Ala Ala Asn Val Glu Glu Met
195 200 205

Pro Leu Ile Ala Ser Val Ile Phe Asn Arg Leu Lys Lys Gly Met Pro
210 215 220

Leu Gln Met Asp Gly Ala Leu Asn Tyr Gln Glu Phe Ser His Ala Lys
225 230 235 240

Val Thr Lys Glu Arg Ile Lys Thr Asp Asn Thr Pro Tyr Asn Thr Tyr
245 250 255

Lys Phe Lys Gly Leu Pro Lys Asn Pro Val Gly Ser Val Ser Leu Glu
260 265 270

Ala Ile Arg Ala Val Ile Phe Pro Lys Lys Thr Asp Phe Leu Tyr Phe
275 280 285

Val Lys Met Pro Asp Lys Lys His Ala Phe Ser Ala Thr Tyr Lys Glu
290 295 300

His Leu Lys Asn Ile Asn Leu Ser Asn Asn His Phe
305 310 315



(2) INFORMATION FOR SEQ ID NO: 123:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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Ala Pro Arg Leu Asn Gly Glu Val lie Glu Asp Gly Val lie Trp Pro 
135 140 145 

GAC ACT TAT CAT TTG CCT TTA GGG GAG GAC GCT TTT AAA ATC ATG CAA 536
Asp Thr Tyr His Leu Pro Leu Gly Glu Asp Ala Phe Lys Ile Met Gln
150 155 160

ACT TTG ATT GGT CAA TCC ATG AAA AAA CAC GAA GCC TTA AGC AAA CAA 584
Thr Leu Ile Gly Gln Ser Met Lys Lys His Glu Ala Leu Ser Lys Gln
165 170 175

TGG CTT GGA TAG TAG CAT AAA GAA GAG TGG TTT GAA AAA ATC ATT CTC 632
Trp Leu Gly Tyr Tyr His Lys Glu Glu Trp Phe Glu Lys Ile Ile Leu
180 185 190

GCT TCT ATT GTG CAA AAA GAA GCC GCT AAT GTT GAA GAA ATG CCC TTG 680
Ala Ser Ile Val Gln Lys Glu Ala Ala Asn Val Glu Glu Met Pro Leu
195 200 205 210

ATT GCG AGC GTG ATT TTT AAC CGC TTG AAA AAA GGC ATG CCT TTA CAA 728
Ile Ala Ser Val Ile Phe Asn Arg Leu Lys Lys Gly Met Pro Leu Gln
215 220 225

ATG GAT GGG GCT TTG AAT TAT CAG GAA TTT TCA CAC GCT AAA GTA ACC 776
Met Asp Gly Ala Leu Asn Tyr Gln Glu Phe Ser His Ala Lys Val Thr
230 235 240

AAA GAG CGC ATT AAA ACC GAT AAC ACC CCC TAG AAT ACC TAT AAA TTT 824
Lys Glu Arg Ile Lys Thr Asp Asn Thr Pro Tyr Asn Thr Tyr Lys Phe
245 250 255

AAG GGT TTG CCT AAA AAT CCT GTA GGG AGC GTG AGC CTA GAA GCG ATT 872
Lys Gly Leu Pro Lys Asn Pro Val Gly Ser Val Ser Leu Glu Ala Ile
260 265 270

AGA GCC GTG ATC TTC CCT AAA AAA ACG GAT TTC TTG TAT TTT GTG AAA 920
Arg Ala Val Ile Phe Pro Lys Lys Thr Asp Phe Leu Tyr Phe Val Lys
275 280 285 290

ATG CCG GAT AAA AAA CAT GCT TTC AGC GCG ACT TAT AAA GAG CAT TTA 968
Met Pro Asp Lys Lys His Ala Phe Ser Ala Thr Tyr Lys Glu His Leu
295 300 305

AAA AAC ATT AAT CTT TCT AAT AAT CAT TTT TAAGATTAAG GTAAATGGGG CGT 1021
Lys Asn Ile Asn Leu Ser Asn Asn His Phe
310 315

TTTTTCTTTT GAATTGAGTA AAAAGTGTTT 1051

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 



WO 98/43478



PCT/US98/06371 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 998 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

GGGTTTTGTG AATGACGACT AAAAGAGTGA ATACTGCCAC AAACAAGATA ATG ACA 56
Met Thr
1

TTA AAT ACT TTC TTG GAT ACA TGT TTT CTT TTA TTC ATC AGT ATT CTT 104
Leu Asn Thr Phe Leu Asp Thr Cys Phe Leu Leu Phe Ile Ser Ile Leu
5 10 15

TTT TAT TTA AGT ATA CCA ATT TAT CCT AAC AAA GTG GTG GTT GTC CCG 152
Phe Tyr Leu Ser Ile Pro Ile Tyr Pro Asn Lys Val Val Val Val Pro
20 25 30

CAA GGT TCG CTC AAA AAA GTG TTT TTT TCT TTA AAA GAG CAA GGC GTG 200
Gln Gly Ser Leu Lys Lys Val Phe Phe Ser Leu Lys Glu Gln Gly Val
35 40 45 50

GAT ATG AAC GCT TTG GAT TTG CTT TTT TTA CGC CTG ATG GGC ATG CCT 248
Asp Met Asn Ala Leu Asp Leu Leu Phe Leu Arg Leu Met Gly Met Pro
55 60 65

AAA AAA GGT TAT ATT GAT ATG GGC GAT GGG GCT TTA AGG AAG GGG GAT 296
Lys Lys Gly Tyr Ile Asp Met Gly Asp Gly Ala Leu Arg Lys Gly Asp
70 75 80

TTT TTA GTC CGT TTG ATT AAG GCA AAA GCG GCA CAA AAA AGT GCG ACT 344
Phe Leu Val Arg Leu Ile Lys Ala Lys Ala Ala Gln Lys Ser Ala Thr
85 90 95

CTA ATC CCT GGG GAA AGC CGC TAT TTT TTC ACG CAA ATT TTG AGC GAG 392
Leu Ile Pro Gly Glu Ser Arg Tyr Phe Phe Thr Gln Ile Leu Ser Glu
100 105 110

ACT TAC CAA CTA GAA ACA AGC GAT CTC AAT CAG GCT TAT GAA AGC ATC 440
Thr Tyr Gln Leu Glu Thr Ser Asp Leu Asn Gln Ala Tyr Glu Ser Ile
115 120 125 130

GCT CCA CGA TTG AAT GGC GAA GTG ATA GAA GAT GGG GTG ATA TGG CCA 488
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

CCCCACAAAT CCTAGCGATA GTGAAATGCC CTAATTCATG GACAAAGATT AAAAACGCCA 60

GCATCAAAAC CGCTAC AAT GAA CAT CAT ACC CCT GCA GGC TCT TTG GTG TTA 112
Asn Glu His His Thr Pro Ala Gly Ser Leu Val Leu
1 5 10

GGA TCT TTT ATC ATC GGC TCT TTT AAA GGC GTG GGT GCT ATA GGG GGC 160
Gly Ser Phe Ile Ile Gly Ser Phe Lys Gly Val Gly Ala Ile Gly Gly
15 20 25

GTG GGT GCT GTG GTT TTT GGG ATT TCT TTA TTT TCT TTT GGG GGT TTT 208
Val Gly Ala Val Val Phe Gly Ile Ser Leu Phe Ser Phe Gly Gly Phe
30 35 40

TGC CAC AAC TCT GTC AAA GCC GCC GCT TTT TTA GGA TCC ATT TTG GCT 256
Cys His Asn Ser Val Lys Ala Ala Ala Phe Leu Gly Ser Ile Leu Ala
45 50 55 60

AAA ATT TTA CCG AGT TCT TGG GGT TTT AGC GCC ATT AAA ATT TCT AAT 304
Lys Ile Leu Pro Ser Ser Trp Gly Phe Ser Ala Ile Lys Ile Ser Asn
65 70 75

GCG TTT TGAGTGGGTA AATTTTCTAA AATCAGAGCC GATTTAGAAT CTTTCATTT 359
Ala Phe



(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Asn Glu His His Thr Pro Ala Gly Ser Leu Val Leu Gly Ser Phe Ile
1 5 10 15

Ile Gly Ser Phe Lys Gly Val Gly Ala Ile Gly Gly Val Gly Ala Val
20 25 30

Val Phe Gly Ile Ser Leu Phe Ser Phe Gly Gly Phe Cys His Asn Ser
35 40 45

Val Lys Ala Ala Ala Phe Leu Gly Ser Ile Leu Ala Lys Ile Leu Pro
50 55 60

Ser Ser Trp Gly Phe Ser Ala Ile Lys Ile Ser Asn Ala Phe
65 70 75

(2) INFORMATION FOR SEQ ID NO: 121: 
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TCT GGG GCG TCT TCG CGC ACT AAA AAA TAGCCAAACC CTATAGCGTA TAGGGCA 543 
Ser Gly Ala Ser Ser Arg Ser Lys Lys 
150 

AT^GAGATTA AAAAGCTAAC ACTCGT 569

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



Ala Ser Glu Val Ala Pro Ser Glu Val Leu Leu Asp Ser Ser Cys Leu
1 5 10 15

Ser Phe Ser Leu Thr Ile Ser Leu Val Val Thr Cys Leu Gly Ala Leu
20 25 30

Phe Ser Leu Ala Ser Ser Leu Ala Ser Ser Phe Leu Gly Ser Ser Leu
35 40 45

Gly Ser Ser Phe Leu Thr Ser Ser Thr Leu Gly Ser Gly Leu Gly Ser
50 55 60

Gly Phe Gly Ser Gly Leu Gly Ser Gly Leu Gly Phe Gly Phe Gly Phe
65 70 75 80

Gly Leu Gly Leu Gly Leu Gly Leu Gly Phe Val Thr Ser Phe Leu Gly
85 90 95

Ser Ser Phe Phe Gly Ser Ser Phe Leu Gly Phe Ser Leu Gly Ser Ser
100 105 110

Leu Gly Leu Ala Asp Ser Ala Leu Val Phe Val Leu Glu Leu Val Leu
115 120 125

Met Leu Ala Lys Leu Met Val Thr Leu Val Val Pro Ala Cys Ala Lys
130 135 140

Gly Ser Gly Ala Ser Ser Arg Ser Lys Lys
145 150



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 77... 310 
(D) OTHER INFORMATION: 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 55... 516 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

GCGGTTTGGA TCTTCATTAA AAATTCGTTG CTCACCCCTG GGTTATAAGC TTGA GCT 57
Ala
1

TCA GAA GTG GCC CCC TCA GAG GTT TTG TTG GAT TCT TCT TGC TTG TCT 105
Ser Glu Val Ala Pro Ser Glu Val Leu Leu Asp Ser Ser Cys Leu Ser
5 10 15

TTT TCT TTG ACT ATA TCC TTA GTT GTT ACT TGT TTA GGA GCG CTT TTT 153
Phe Ser Leu Thr Ile Ser Leu Val Val Thr Cys Leu Gly Ala Leu Phe
20 25 30

TCT TTA GCT TCC TCT TTA GCT TCT TCT TTT TTG GGC TCT TCT TTA GGC 201
Ser Leu Ala Ser Ser Leu Ala Ser Ser Phe Leu Gly Ser Ser Leu Gly
35 40 45

TCT TCT TTT TTA ACC TCT TCA ACT TTA GGC TCA GGC TTA GGC TCG GGT 249
Ser Ser Phe Leu Thr Ser Ser Thr Leu Gly Ser Gly Leu Gly Ser Gly
50 55 60 65

TTT GGT TCA GGT TTG GGT TCA GGC TTA GGT TTT GGT TTT GGC TTT GGC 297
Phe Gly Ser Gly Leu Gly Ser Gly Leu Gly Phe Gly Phe Gly Phe Gly
70 75 80

TTG GGT TTA GGC TTA GGT TTA GGC TTT GTA ACC TCC TTT TTG GGT TCT 345
Leu Gly Leu Gly Leu Gly Leu Gly Phe Val Thr Ser Phe Leu Gly Ser
85 90 95

TCT TTT TTT GGC TCT TCT TTC TTG GGT TTT TCT TTA GGC TCT TCT TTG 393
Ser Phe Phe Gly Ser Ser Phe Leu Gly Phe Ser Leu Gly Ser Ser Leu
100 105 110

GGT TTA GCC GAC TCA GCA TTA GTC TTT GTA TTG GAA TTA GTG TTG ATG 441
Gly Leu Ala Asp Ser Ala Leu Val Phe Val Leu Glu Leu Val Leu Met
115 120 125

CTG GCT AAA CTC ATG GTA ACC TTA GTG GTC CCG GCT TGC GCT AAA GGC 489
Leu Ala Lys Leu Met Val Thr Leu Val Val Pro Ala Cys Ala Lys Gly
130 135 140 145






AAA GAT ATT CTT CTC ATC GCT TCG CAA ATT CTC AAA GAA AGA CTT TTT 152
Lys Asp Ile Leu Leu Ile Ala Ser Gln Ile Leu Lys Glu Arg Leu Phe
20 25 30

GCC CAC AAA AAT GAG ATT TTC TTT TGC CCT AGA AAT AGC TAG ATT CAA 200
Ala His Lys Asn Glu Ile Phe Phe Cys Pro Arg Asn Ser Tyr Ile Gln
35 40 45

GCG TTT AGA ATC TAT CAA GAA AGA AAG ATT ACC ATA AGT TTT CAC GGT 248
Ala Phe Arg Ile Tyr Gln Glu Arg Lys Ile Thr Ile Ser Phe His Gly
50 55 60 65

GGA ATA AAT AAT AAT ATC TGC CTT CTC GCC TTG AAA GGC ATC CAC AGT 296
Gly Ile Asn Asn Asn Ile Cys Leu Leu Ala Leu Lys Gly Ile His Ser
70 75 80

GTC TAT TTT GAG CTC ATC AAA ATT CTT GAA GCC GTA TTT TTC CAC TTC T 345
Val Tyr Phe Glu Leu Ile Lys Ile Leu Glu Ala Val Phe Phe His Phe
85 90 95

GATCGCAAGC ATCTTTTTTG GGCATTATAA GGTGTGATAA T 386

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Met Ile Ile Phe Gly Lys Asp Tyr Leu Ser Thr Asp Leu Gln Asn Ser
1 5 10 15

Ala Lys Asp Ile Leu Leu Ile Ala Ser Gln Ile Leu Lys Glu Arg Leu
20 25 30

Phe Ala His Lys Asn Glu Ile Phe Phe Cys Pro Arg Asn Ser Tyr Ile
35 40 45

Gln Ala Phe Arg Ile Tyr Gln Glu Arg Lys Ile Thr Ile Ser Phe His
50 55 60

Gly Gly Ile Asn Asn Asn Ile Cys Leu Leu Ala Leu Lys Gly Ile His
65 70 75 80

Ser Val Tyr Phe Glu Leu Ile Lys Ile Leu Glu Ala Val Phe Phe His
85 90 95

Phe



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs

(B) TYPE: nucleic acid 






ACATTTAAAA TCTTA 367 
(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 89 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

Met Ile Gln Ser Asp Ala Val Phe Lys Ile Asn Phe Cys Leu Ala Leu
1 5 10 15

Leu Val Phe Val Lys Arg Gly Leu Ser Asp Ile Asn Met Pro Leu Phe
20 25 30

Asn Gln Arg Ala Gln Ile Thr Ile Glu Lys Ser His Gln Gln Gly Leu
35 40 45

Asp Met Ala Pro Ile His Ile Ser Ile Gly His Asp Asn Asp Phe Met
50 55 60

Ile Ala Gln Ser Phe Tyr Ile Lys Thr Leu Leu Asn Ala Ala Pro Lys
65 70 75 80

Ser Arg Asp His Val Phe Asn Phe Phe
85

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 54...344
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:

TAAAAAGCAC TTTAGAGAGA TTTACGAAAG TGTTTTGAAG CGAAGAATGT CTG ATG 56
Met
1

ATT ATC TTT GGA AAA GAT TAC CTA TCT ACA GAC TTG CAA AAT AGC GCT 104
Ile Ile Phe Gly Lys Asp Tyr Leu Ser Thr Asp Leu Gln Asn Ser Ala
5 10 15
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50 55 60 

Ala Tyr Lys Asn Trp Asp Arg Leu Gly Glu Ile Gly Trp Lys Cys Val
65 70 75 80

Gly Ile Ile Ile Ile Thr Ala Ala Ile Ser Asn Ala Lys Thr Leu Ser
85 90 95

Gln Trp Leu Phe
100

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52... 318 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

CTTGCCAATC CCTTCAATAT CCACCGAATT GATGCCATGC TCAATTAAAA A ATG ATC 57
Met Ile
1

CAA AGC GAC GCT GTC TTT AAG ATA AAT TTC TGT CTT GCC CTT CTT GTA 105
Gln Ser Asp Ala Val Phe Lys Ile Asn Phe Cys Leu Ala Leu Leu Val
5 10 15

TTT GTA AAG AGG GGC TTG AGC GAT ATA AAC ATG CCC TTG TTC AAT CAG 153
Phe Val Lys Arg Gly Leu Ser Asp Ile Asn Met Pro Leu Phe Asn Gln
20 25 30

CGG GCG CAA ATA ACG ATA GAA AAA AGT CAT CAG CAA GGT TTG GAT ATG 201
Arg Ala Gln Ile Thr Ile Glu Lys Ser His Gln Gln Gly Leu Asp Met
35 40 45 50

GCT CCC ATC CAC ATC AGC ATC GGT CAT GAT AAT GAT TTT ATG ATA GCG 249
Ala Pro Ile His Ile Ser Ile Gly His Asp Asn Asp Phe Met Ile Ala
55 60 65

CAA TCT TTC TAT ATC AAA ACT CTC TTG AAT GCC GCA CCC AAA AGC CGT 297
Gln Ser Phe Tyr Ile Lys Thr Leu Leu Asn Ala Ala Pro Lys Ser Arg
70 75 80

GAT CAT GTT TTT AAT TTC TTC TGATTTTAGG ATTTTTGATA AATGGCTTTT TTCC 352
Asp His Val Phe Asn Phe Phe
85
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CTAATTCTGT CTATTACACC AACAATCAAT CTCAAAACAA AGGACATGAA AG ATG AAA 58
Met Lys
1

ACA AAA CAT AAA GGA ATA AGA ATG TTT AAG CAA ATT CGT AGA ATG ATG 106
Thr Lys His Lys Gly Ile Arg Met Phe Lys Gln Ile Arg Arg Met Met
5 10 15

AGT TTG GCA ATA TTA ATG CCT AGT TTT TTA TTG GCG GCA CCA GAT TAC 154
Ser Leu Ala Ile Leu Met Pro Ser Phe Leu Leu Ala Ala Pro Asp Tyr
20 25 30

AAA CAA AAA TTC ACT CAA ATA TTG GAT TTC ATA AGC AAT GAC TTT ATC 202
Lys Gln Lys Phe Thr Gln Ile Leu Asp Phe Ile Ser Asn Asp Phe Ile
35 40 45 50

AAG GCT ATT GGT GGT CTA ATC ATT GTT GGG ACT TGC ATT TAC GCC TAT 250
Lys Ala Ile Gly Gly Leu Ile Ile Val Gly Thr Cys Ile Tyr Ala Tyr
55 60 65

AAA AAT TGG GAC AGG CTT GGA GAA ATT GGT TGG AAA TGC GTT GGG ATT 298
Lys Asn Trp Asp Arg Leu Gly Glu Ile Gly Trp Lys Cys Val Gly Ile
70 75 80

ATC ATT ATA ACC GCT GCT ATT TCT AAT GCT AAA ACT TTA AGT CAA TGG 346
Ile Ile Ile Thr Ala Ala Ile Ser Asn Ala Lys Thr Leu Ser Gln Trp
85 90 95

TTA TTT TAGATGGCAT TGCATATTGT TTGTGTTGAA AGTATCAACA TTAGA 397
Leu Phe
100



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Met Lys Thr Lys His Lys Gly Ile Arg Met Phe Lys Gln Ile Arg Arg
1 5 10 15

Met Met Ser Leu Ala Ile Leu Met Pro Ser Phe Leu Leu Ala Ala Pro
20 25 30

Asp Tyr Lys Gln Lys Phe Thr Gln Ile Leu Asp Phe Ile Ser Asn Asp
35 40 45

Phe Ile Lys Ala Ile Gly Gly Leu Ile Ile Val Gly Thr Cys Ile Tyr
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Ser Ser Asp Phe 
Lys Gly Asn Lys Ala Ile Ser Asp Thr Thr Leu Thr
225 230 235 240

Ser Pro Leu Ala Asn Phe Lys Ala Leu Lys Ser Glu Tyr Leu Phe Ser
245 250 255

Ile Leu Lys Leu Asn Ala Pro Tyr Thr Leu Glu Ile Pro Asn Leu Ala
260 265 270

Lys Leu Tyr Asn Ile Thr Asn His Pro Leu Lys Gly Ser Leu Thr Leu
275 280 285

Lys Gly Ala Ile Glu Gln Ser Pro Lys Leu Leu Lys Val Ser Gly His
290 295 300

Ser Asn Leu Leu Asp Gly Ala Leu Asp Phe Thr Leu Leu Asn Lys Asp
305 310 315 320

Leu Lys Gly Arg Phe Ser Asn Ile Ser Thr Leu Lys Ala Leu Asp Leu
325 330 335

Phe His Tyr Pro Lys Phe Phe Gln Ser Val Ala Asp Ala Asn Leu Asp
340 345 350

Tyr Asp Leu Ile Ala Lys Gln Gly Val Leu Lys Ala Arg Leu Lys Asn
355 360 365

Ala Arg Phe Leu Lys Asn Ala Phe Ser Asp Phe Leu Tyr Ser Ile Ser
370 375 380

Lys Phe Asp Ile Thr Lys Glu Ile Tyr Asn Asp Ala Asn Leu Val Ser
385 390 395 400

Gln Ile Asn Gln Gln Arg Leu Leu Ser Asp Leu Ser Leu Lys Ser Pro
405 410 415

Lys Thr Gln Leu Lys Ile His Asn Gly Leu Leu Asp Leu Asn Thr Lys
420 425 430

Gln Met Asn Met Leu Met Asp Ala Glu Ile Leu Lys Phe Ile Phe Lys
435 440 445

Met Lys Leu Gln Gly Asn Met His Gln Pro Lys Phe Ser Leu Ile Leu
450 455 460

Asn Glu Lys Ala Ile Gln Gln Asn Leu Gln Gln Gly Leu Lys Glu Ile
465 470 475 480

Leu Lys Asn Asp Thr Leu Lys Lys Gly Leu Asp His Leu Leu Lys Asp
485 490 495

Asp Lys Leu Lys Glu Lys Leu Glu Lys Gly Leu Lys Gly Leu Phe
500 505 510



(2) INFORMATION FOR SEQ ID NO: 111: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 53... 352 
(D) OTHER INFORMATION: 
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GGC TTG AAA GAA ATC TTA AAA AAC GAC ACC CTT AAA AAA GGT TTA GAT 1551 
Gly Leu Lys Glu Ile Leu Lys Asn Asp Thr Leu Lys Lys Gly Leu Asp
480 485 490

CAT TTG CTT AAA GAT GAT AAG CTC AAA GAA AAG CTT GAA AAA GGG CTT 1599
His Leu Leu Lys Asp Asp Lys Leu Lys Glu Lys Leu Glu Lys Gly Leu
495 500 505

AAG GGG CTT TTT TAAAATTTTA AAGGATAGAA ATGGCGCACA TTTTAGTTAG CGGGG 1656
Lys Gly Leu Phe
510

CGACT 1661
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Met Lys Lys Leu Leu Tyr Thr Ile Leu Ala Leu Leu Leu Ile Gly Leu
1 5 10 15

Leu Thr Ile Tyr Leu Ile Leu Phe Thr Glu Trp Gly Asn Lys Ile Ile
20 25 30

Ala Ser Tyr Ile Glu Lys Lys Ile Asn Pro Asn Glu His Tyr Leu Ser
35 40 45

Val Lys Thr Phe Lys Leu Arg Phe Asn Ser Leu Asp Phe Lys Ala Gln
50 55 60

Ala Asn Asp Asp Ser Thr Leu Ile Leu Lys Gly Asp Phe Ser Leu Leu
65 70 75 80

Lys Gln Ser Val Asn Leu Asn Tyr His Ile Asp Ile Lys Asp Leu Arg
85 90 95

Ser Phe Lys Glu Trp Ile Pro Tyr Pro Leu Arg Gly Ala Val Ile Thr
100 105 110

Ser Gly Asn Ile Lys Gly His Arg Lys Ala Leu Met Ile Gln Gly Val
115 120 125

Ser Asn Val Ala Gln Ser His Thr Ala Tyr Asn Ala Leu Leu Asp Asp
130 135 140

Phe Lys Leu Ser Arg Leu Asn Leu Asn Ala Gln Asp Ala Asn Leu Glu
145 150 155 160

Asp Leu Leu Tyr Leu Ile Asn Arg Pro Ala Tyr Ala Asn Ala Lys Val
165 170 175

Ser Leu Gln Ala Asp Phe Asn Ser Leu Lys Pro Leu Glu Gly His Leu
180 185 190

Ile Leu Thr Ala Asn Asn Ala Leu Ile Asn Asn Ala Leu Ile Asn Gln
195 200 205

Ile Phe His Leu Asn Leu Lys Asp Thr Leu Val Phe Ser Leu Ser His
210 215 220
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GAA TAG CTT TTC TCT ATT TTA AAA CTC AAC GCC CCC TAG ACT TTA GAA 879 
Glu Tyr Leu Phe Ser Ile Leu Lys Leu Asn Ala Pro Tyr Thr Leu Glu
255 260 265

ATC CCC AAT CTA GCC AAA CTC TAT AAC ATT ACC AAC CAC CCC TTA AAA 927
Ile Pro Asn Leu Ala Lys Leu Tyr Asn Ile Thr Asn His Pro Leu Lys
270 275 280

GGG AGC TTG ACT TTA AAA GGC GCT ATA GAA GAA AGG CCC AAA CTT TTA 975
Gly Ser Leu Thr Leu Lys Gly Ala Ile Glu Gln Ser Pro Lys Leu Leu
285 290 295

AAA GTC AGC GGC CAT TCA AAT TTA CTA GAC GGC GGG CTG GAT TTC AGG 1023
Lys Val Ser Gly His Ser Asn Leu Leu Asp Gly Ala Leu Asp Phe Thr
300 305 310 315

CTT TTA AAT AAA GAT TTG AAA GGG CGT TTT TCC AAT ATT TCC ACT TTA 1071
Leu Leu Asn Lys Asp Leu Lys Gly Arg Phe Ser Asn Ile Ser Thr Leu
320 325 330

AAA GCT TTA GAT TTA TTC CAT TAC CCT AAG TTT TTC C7^ TCC GTT GCA 1119
Lys Ala Leu Asp Leu Phe His Tyr Pro Lys Phe Phe Gln Ser Val Ala
335 340 345

GAC GCT AAT TTG GAT TAT GAT CTT ATC GCT AAG GAA GGC GTA TTG AAA 1167
Asp Ala Asn Leu Asp Tyr Asp Leu Ile Ala Lys Gln Gly Val Leu Lys
350 355 360

GCC GGC CTA AAA AAC GCA AGA TTC CTC AAA AAT GCA TTC AGC GAT TTT 1215
Ala Arg Leu Lys Asn Ala Arg Phe Leu Lys Asn Ala Phe Ser Asp Phe
365 370 375

GTC TAC TCC ATT TCT AAA TTT GAT ATT AGA AAA GAA ATT TAT AAG GAT 1263
Leu Tyr Ser Ile Ser Lys Phe Asp Ile Thr Lys Glu Ile Tyr Asn Asp
380 385 390 395

GCC AAT CTG GTA AGC GAA ATC AAC CAG GAA GGC CTG CTC TCT GAT CTG 1311
Ala Asn Leu Val Ser Gln Ile Asn Gln Gln Arg Leu Leu Ser Asp Leu
400 405 410

AGT TTA AAA AGG CCC AAA AGC CAA TTG AAA ATC GAT AAC GGT TTG TTG 1359
Ser Leu Lys Ser Pro Lys Thr Gln Leu Lys Ile His Asn Gly Leu Leu
415 420 425

GAT TTA AAC ACC AAA CAA ATG AAC ATG GTC ATG GAT GGG GAA ATT TTA 1407
Asp Leu Asn Thr Lys Gln Met Asn Met Leu Met Asp Ala Glu Ile Leu
430 435 440

AAA TTC ATT TTT AAA ATG AAA CTT CAA GGC AAC ATG CAC CAG CCA AAA 1455
Lys Phe Ile Phe Lys Met Lys Leu Gln Gly Asn Met His Gln Pro Lys
445 450 455

TTT TCT CTC ATT TTA AAC GAA AAA GCC ATT CAG CAA AAC TTG CAA CAA 1503
Phe Ser Leu Ile Leu Asn Glu Lys Ala Ile Gln Gln Asn Leu Gln Gln
460 465 470 475
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GGG AAT AAG ATC ATC GCT TCG TAT ATA GAG AAA AAA ATC AAC CCG AAC 2 07 

Gly Asn Lys Ile Ile Ala Ser Tyr Ile Glu Lys Lys Ile Asn Pro Asn
30 35 40

GAG CAC TAG TTG AGC GTT AAA ACC TTT AAA TTG AGA TTC AAC TCT TTG 255
Glu His Tyr Leu Ser Val Lys Thr Phe Lys Leu Arg Phe Asn Ser Leu
45 50 55

GAT TTT AAA GCT CAA GCC AAC GAT GAT TCC ACG CTC ATT CTT AAG GGG 303
Asp Phe Lys Ala Gln Ala Asn Asp Asp Ser Thr Leu Ile Leu Lys Gly
60 65 70 75

GAT TTT TCA CTT TTA AAG CAA AGC GTA AAT TTG AAT TAG CAT ATA GAT 351
Asp Phe Ser Leu Leu Lys Gln Ser Val Asn Leu Asn Tyr His Ile Asp
80 85 90

ATT AAA GAT TTA CGC TCT TTC AAA GAA TGG ATA CCC TAG CCT TTA AGG 399
Ile Lys Asp Leu Arg Ser Phe Lys Glu Trp Ile Pro Tyr Pro Leu Arg
95 100 105

GGG GCT GTT ATC ACT TCT GGG AAT ATT AAA GGG CAT AGA AAA GCC CTT 447
Gly Ala Val Ile Thr Ser Gly Asn Ile Lys Gly His Arg Lys Ala Leu
110 115 120

ATG ATT CAA GGC GTC TCT AAT GTG GCT CAA TCC CAC ACT GCC TAG AAT 495
Met Ile Gln Gly Val Ser Asn Val Ala Gln Ser His Thr Ala Tyr Asn
125 130 135

GCC CTT TTA GAT GAT TTC AAG CTT TCT CGC TTA AAT TTG AAC GCA CAA 543
Ala Leu Leu Asp Asp Phe Lys Leu Ser Arg Leu Asn Leu Asn Ala Gln
140 145 150 155

GAC GCC AAT TTA GAA GAT TTG CTT TAT TTA ATC AAT CGC CCC GCT TAT 591
Asp Ala Asn Leu Glu Asp Leu Leu Tyr Leu Ile Asn Arg Pro Ala Tyr
160 165 170

GCG AAC GCA AAA GTG TCC TTA GAG GCG GAT TTT AAC TCT CTA AAG CCT 639
Ala Asn Ala Lys Val Ser Leu Gln Ala Asp Phe Asn Ser Leu Lys Pro
175 180 185

TTA GAG GGG CAT TTG ATC CTA ACA GCT AAT AAC GCT TTA ATC AAT AAC 687
Leu Glu Gly His Leu Ile Leu Thr Ala Asn Asn Ala Leu Ile Asn Asn
190 195 200

GCC CTA ATC AAT CAA ATT TTT CAT TTA AAC CTT AAA GAC ACG CTT GTT 735
Ala Leu Ile Asn Gln Ile Phe His Leu Asn Leu Lys Asp Thr Leu Val
205 210 215

TTC AGC CTC TCG CAT TCA AGC GAC TTT AAA GGA AAC AAA GCC ATC AGC 783
Phe Ser Leu Ser His Ser Ser Asp Phe Lys Gly Asn Lys Ala Ile Ser
220 225 230 235

GAT ACC ACC CTG ACT AGC CCT TTA GCC AAT TTC AAA GCC CTA AAA AGC 831
Asp Thr Thr Leu Thr Ser Pro Leu Ala Asn Phe Lys Ala Leu Lys Ser
240 245 250
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50 55 60 

Ile Ala Pro Asn Phe Arg Val Val Ser Ile Ile Ala Asn Asn Gln Arg
65 70 75 80

Asn Phe Gln Ala Leu Arg Pro Ile Asn His Ile Ser Phe Ile Pro Arg
85 90 95

Ile Pro Thr Phe Asn Arg Ala Pro Arg Gln Asp Phe Ala Val Phe Leu
100 105 110

His Asp Leu Thr Leu Ile Ile Asp Lys Asn Gln Ser Val Ile Gly Ile
115 120 125

Leu Phe Gly Leu Leu Val Phe Phe Pro Cys Gln Arg Glu His Ser Pro
130 135 140

Asn Leu Val Phe Leu Thr Ser Phe Ser Lys Asp Arg Gly Phe Phe Ser
145 150 155 160

Arg Asn Ala Cys Gly Cys Ile Lys His Phe Leu Ser Val Ile His Asn
165 170 175

Pro Met Arg Ala Val Phe Arg Glu Asn Asn Gln Ile Gln Pro Arg Gln
180 185 190

Thr Leu Phe Asp Pro Thr Asn His Leu Ser Asp Ile Ala Thr Ile Phe
195 200 205

Gln His Leu Ile Leu Ser Val Glu Ser Arg His Leu Ile Ile Asn Tyr
210 215 220

Arg Tyr Thr His Ser Ile Trp Ala Ala Thr Asn Ile Ser Met Ser His
225 230 235 240

Ile Met Phe Leu Val Phe
245



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 79... 1611 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGCTTTATAA AAAATGTTAG AAACCCTTAC AAAACAAGCT AATATATTCT ATTCAATTTG 60
CCTCAAGGAC AAACAAAC ATG AAA AAA CTT CTT TAT ACC ATA CTC GCG CTT 111
Met Lys Lys Leu Leu Tyr Thr Ile Leu Ala Leu
1 5 10

CTT TTA ATC GGC CTT TTA ACA ATC TAT CTC ATC CTT TTT ACA GAA TGG 159
Leu Leu Ile Gly Leu Leu Thr Ile Tyr Leu Ile Leu Phe Thr Glu Trp
15 20 25
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ATA GGG ATT CTT TTT GGC TTG CTC GTG TTT TTC CCC TGT CAA AGA GAA 7 95 

Ile Gly Ile Leu Phe Gly Leu Leu Val Phe Phe Pro Cys Gln Arg Glu
130 135 140

CAC TCC CCA AAC CTT GTT TTT CTT ACA AGC TTC AGC AAA GAT CGC GGT 843
His Ser Pro Asn Leu Val Phe Leu Thr Ser Phe Ser Lys Asp Arg Gly
145 150 155

TTC TTC TCC AGG AAC GCT TGC GGC TGT ATC AAA CAT TTC TTG TCT GTC 891
Phe Phe Ser Arg Asn Ala Cys Gly Cys Ile Lys His Phe Leu Ser Val
160 165 170

ATA CAT AAT CCC ATG CGT GCT GTA TTC AGG GAA AAT AAT CAG ATC CAA 939
Ile His Asn Pro Met Arg Ala Val Phe Arg Glu Asn Asn Gln Ile Gln
175 180 185

CCC AGG CAA ACC CTG TTT GAC CCC ACC AAT CAC CTT AGC GAT ATT GCG 987
Pro Arg Gln Thr Leu Phe Asp Pro Thr Asn His Leu Ser Asp Ile Ala
190 195 200 205

ACA ATT TTC CAA CAC CTC ATT CTT AGT GTG GAG TCT AGG CAT CTT ATA 1035
Thr Ile Phe Gln His Leu Ile Leu Ser Val Glu Ser Arg His Leu Ile
210 215 220

ATT AAC TAC CGC TAG ACC CAC AGT ATC TGG GCT GCT ACT AAT ATC TCC 1083
Ile Asn Tyr Arg Tyr Thr His Ser Ile Trp Ala Ala Thr Asn Ile Ser
225 230 235

ATG TCT CAT ATT ATG TTC CTT GTT TTT TGATGAGAGT TCCTACAAAC CCTCTAC 1137
Met Ser His Ile Met Phe Leu Val Phe
240 245

TTGAATTTAT AAAATAATTG TGT 1160



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Met Pro Val Lys Thr Asn Asn Ala Arg Met Pro Lys Ile Gly Ile His
1 5 10 15

Pro Ile Lys Thr Gly Arg Ile Arg Tyr Arg Tyr Ile Thr Leu Ile Gly
20 25 30

Pro Arg His Ser Phe Tyr Tyr Cys Asn Leu Leu Leu Leu Ser Arg Ile
35 40 45

His Val Thr Leu Thr Ala His Asn Glu Phe Cys Pro Thr His Arg Ala






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 373... 1110 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

GAGATATTAA AAAGAGATAT TAAAATGGCT TTTAAGCTTC TATGAAGCCC CCCCCCCTCC 60

TCCTTTTGCC CTTTATTTCG TGGGCAAATC GCCCACAGGA CAAGCGGCCA CACCCACGCT 120

TGGGCGGGTG ATTTTTTCTA CATTTTCTTG AGCTTTTTTG GGGTCATTCA CCCAAGTTTT 180

ATAGAACTCA AAAGGGCATT CCGCCACACC CTTATCCCCA TCGCCACTAG CAAAAACCCC 240

ACTATAACCT CTGTGCAAGA GTTTGAAGAG TTGGTTTTGG CTTTGGTCGT ATTTTCTCGC 300

ATCACGGATT TGTTGCACAG AAAGTTGAGC GTATTGAAGA CCATTTTCTT CTTCCCCGCA 360

TTCGCCCAAA GT ATG CCC GTC AAA ACC AAT AAT GCT AGA ATG CCC AAA ATA 411
Met Pro Val Lys Thr Asn Asn Ala Arg Met Pro Lys Ile
1 5 10

GGA ATA CAC CCC ATC AAA ACC GGT CGC ATT CGC TAC CGC TAC ATA ACA 459
Gly Ile His Pro Ile Lys Thr Gly Arg Ile Arg Tyr Arg Tyr Ile Thr
15 20 25

TTG ATT GGC CCA CGC CAT AGC TTT TAC TAT TGC AAT TTG TTG CTC CTT 507
Leu Ile Gly Pro Arg His Ser Phe Tyr Tyr Cys Asn Leu Leu Leu Leu
30 35 40 45

AGC CGG ATA CAT GTA ACC TTG ACA GCG CAC AAT GAG TTC TGC CCC ACG 555
Ser Arg Ile His Val Thr Leu Thr Ala His Asn Glu Phe Cys Pro Thr
50 55 60

CAT CGC GCA ATC GCG CCA AAT TTC AGG GTA GTT TCC ATC ATC GCA AAT 603
His Arg Ala Ile Ala Pro Asn Phe Arg Val Val Ser Ile Ile Ala Asn
65 70 75

AAT CAA AGA AAC TTT CAA GCC CTT AGG CCC ATC AAC CAC ATA AGT TTT 651
Asn Gln Arg Asn Phe Gln Ala Leu Arg Pro Ile Asn His Ile Ser Phe
80 85 90

ATC CCC AGG ATA CCA ACA TTC AAT AGG GCA CCA AGG CAA GAT TTT GCG 699
Ile Pro Arg Ile Pro Thr Phe Asn Arg Ala Pro Arg Gln Asp Phe Ala
95 100 105

GTA TTT TTG CAC GAT CTC ACC CTT ATC ATT GAC AAG AAT CAA AGT GTT 747
Val Phe Leu His Asp Leu Thr Leu Ile Ile Asp Lys Asn Gln Ser Val
110 115 120 125
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TTT AAA GAA GAA ATC AGC CTG GCC TTT AGC GCC ATT AGC AAA ATC TTT T 828 
Phe Lys Glu Glu lie Ser Leu Ala Phe Ser Ala lie Ser Lys lie Phe 
240 245 250 255 

AAAGGATAAA CATGATTAGT TTTAAAGAAG CTCTAAAAAT CCATTCT 875 



(2) INFORMATION FOR SEQ ID NO: 106: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



Met Leu Asp Phe Ile Gln Glu Leu Ser Thr Pro His Val Arg Asp Phe
1 5 10 15

Phe Leu Leu Phe Leu Arg Val Ser Gly Val Leu Ser Phe Phe Pro Phe
20 25 30

Phe Glu Asn His Leu Val Pro Leu Ser Val Arg Gly Ala Leu Ser Leu
35 40 45

Tyr Val Ser Ala Ile Phe Tyr Pro Thr Leu Glu Phe Ser Asn Ala Ala
50 55 60

Tyr Thr Pro Glu Gly Phe Ile Ile Ala Cys Leu Cys Glu Leu Phe Leu
65 70 75 80

Gly Val Cys Ala Ser Val Phe Leu Gln Ile Val Phe Ala Ser Leu Val
85 90 95

Phe Ala Thr Asp Ser Ile Ser Phe Ser Met Gly Leu Thr Met Ala Ser
100 105 110

Ala Tyr Asp Pro Ile Ser Gly Ser Gln Lys Pro Ile Val Gly Gln Ala
115 120 125

Leu Leu Leu Leu Ala Ile Leu Ile Leu Leu Asp Leu Ser Phe His His
130 135 140

Gln Ile Ile Leu Phe Val Asp His Ser Leu Lys Ala Val Pro Leu Gly
145 150 155 160

Gln Phe Val Phe Glu Pro Ala Leu Ala Lys Asn Ile Val Lys Ala Phe
165 170 175

Ser His Leu Phe Val Ile Gly Phe Ser Met Ala Phe Pro Ile Leu Cys
180 185 190

Leu Val Leu Leu Ser Asp Ile Ile Phe Gly Met Ile Met Lys Thr His
195 200 205

Pro Gln Phe Asn Leu Leu Ala Ile Gly Phe Pro Val Lys Ile Ala Ile
210 215 220

Gly Phe Val Gly Ile Ile Leu Ile Ala Ser Ala Ile Met Gly Arg Phe
225 230 235 240

Lys Glu Glu Ile Ser Leu Ala Phe Ser Ala Ile Ser Lys Ile Phe
245 250 255



(2) INFORMATION FOR SEQ ID NO: 107: 
Phe Phe Leu Leu Phe Leu Arg Val Ser Gly Val Leu Ser Phe Phe Pro
20 25 30

TTT TTT GAA AAC CAT TTA GTG CCT TTG TCG GTG CGT GGG GCT TTG AGT 203
Phe Phe Glu Asn His Leu Val Pro Leu Ser Val Arg Gly Ala Leu Ser
35 40 45

TTG TAT GTG AGC GCG ATT TTT TAC CCC ACT TTA GAA TTT TCA AAC GCC 251
Leu Tyr Val Ser Ala Ile Phe Tyr Pro Thr Leu Glu Phe Ser Asn Ala
50 55 60

GCT TAC ACG CCA GAG GGT TTT ATC ATT GCT TGC TTG TGC GAA TTG TTT 299
Ala Tyr Thr Pro Glu Gly Phe Ile Ile Ala Cys Leu Cys Glu Leu Phe
65 70 75

TTA GGG GTG TGC GCG TCT GTC TTT TTA CAA ATC GTC TTT GCA AGC TTA 347
Leu Gly Val Cys Ala Ser Val Phe Leu Gln Ile Val Phe Ala Ser Leu
80 85 90 95

GTG TTT GCA ACC GAT AGC ATC AGC TTT TCT ATG GGG CTT ACG ATG GCG 395
Val Phe Ala Thr Asp Ser Ile Ser Phe Ser Met Gly Leu Thr Met Ala
100 105 110

AGC GCG TAT GAT CCT ATT TCA GGA TCG CAA AAA CCC ATT GTG GGG CAA 443
Ser Ala Tyr Asp Pro Ile Ser Gly Ser Gln Lys Pro Ile Val Gly Gln
115 120 125

GCC CTT TTA TTG TTA GCG ATT TTA ATT TTA TTG GAT TTA TCG TTC CAC 491
Ala Leu Leu Leu Leu Ala Ile Leu Ile Leu Leu Asp Leu Ser Phe His
130 135 140

CAT CAA ATC ATT TTG TTT GTG GAT CAC AGC TTA AAA GCC GTC CCT TTA 539
His Gln Ile Ile Leu Phe Val Asp His Ser Leu Lys Ala Val Pro Leu
145 150 155

GGG CAA TTT GTC TTT GAG CCA GCG TTG GCT AAA AAC ATC GTT AAA GCC 587
Gly Gln Phe Val Phe Glu Pro Ala Leu Ala Lys Asn Ile Val Lys Ala
160 165 170 175

TTT TCG CAC CTC TTT GTC ATA GGG TTT TCT ATG GCG TTC CCT ATT TTA 635
Phe Ser His Leu Phe Val Ile Gly Phe Ser Met Ala Phe Pro Ile Leu
180 185 190

TGC TTG GTG TTA TTG AGC GAT ATT ATT TTT GGC ATG ATC ATG AAA ACC 683
Cys Leu Val Leu Leu Ser Asp Ile Ile Phe Gly Met Ile Met Lys Thr
195 200 205

CAC CCT CAG TTC AAC CTG CTC GCT ATT GGG TTT CCG GTT AAA ATT GCG 731
His Pro Gln Phe Asn Leu Leu Ala Ile Gly Phe Pro Val Lys Ile Ala
210 215 220

ATC GGG TTT GTG GGC ATT ATC TTA ATC GCT TCG GCT ATC ATG GGG CGT 779
Ile Gly Phe Val Gly Ile Ile Leu Ile Ala Ser Ala Ile Met Gly Arg
225 230 235
Lys Glu Phe Val Leu Ala Ile Trp Leu Thr Leu Pro Val Leu Val Phe
195 200 205

Ala Phe Asp His Ser Pro Ile Ile Ser Thr Phe Thr Gln Asn Val Gly
210 215 220

Lys Glu Tyr Gly Val Phe Lys Glu Tyr Lys Leu Asn Gln Ile Glu Leu
225 230 235 240

Gly Thr Ser Leu Met Leu Leu Gly Phe Val Met Phe Phe Val Phe Ser
245 250 255

Cys Val Met Cys Leu Asn Ala Asp Asp Phe Val Lys Ala Arg Glu Gln
260 265 270

Asn Ile Pro Ile Leu Ser Tyr Leu Ala Asn Thr Leu Asn Asn Pro Leu
275 280 285

Ile Asn Tyr Ala Gly Pro Val Val Ala Phe Leu Ala Ile Phe Ser Ser
290 295 300

Phe Phe Gly His Tyr Tyr Gly Ala Lys Glu Gly Leu Glu Gly Ile Ile
305 310 315 320

Ile Gln Ser Leu Lys Leu Lys Lys Ala Ser Lys Pro Leu Ser Val Ser
325 330 335

Val Thr Ile Phe Leu Trp Leu Thr Ile Thr Leu Val Ala Tyr Ile Asn
340 345 350

Pro Asn Ile Leu Asp Phe Ile Glu Asn Leu Gly Gly Pro Ile Ile Ala
355 360 365

Leu Ile Leu Phe Val Met Pro Met Ile Ala Phe Tyr Ser Val Ser Ser
370 375 380

Leu Lys Arg Phe Arg Asn Phe Lys Val Asp Ile Phe Val Phe Val Phe
385 390 395 400

Gly Ser Leu Thr Ala Leu Ser Val Phe Leu Gly Leu Phe
405 410



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 63...827
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

TTTGCCACCT TATTGCAAAC CCTAATGCTA ACGCACTACT TTTTTATCTT TAAAGAGAAA 60 

GA ATG CTA GAT TTT ATT CAA GAG CTT AGC ACC CCC CAT GTT AGG GAT 107 
Met Leu Asp Phe Ile Gln Glu Leu Ser Thr Pro His Val Arg Asp
1 5 10 15

TTT TTC TTG TTG TTT TTA AGG GTT AGC GGC GTG CTG TCT TTC TTC CCT 155
AAT TTA GGC GGC CCC ATT ATC GCG CTC ATT CTG TTT GTG ATG CCC ATG 1218
Asn Leu Gly Gly Pro Ile Ile Ala Leu Ile Leu Phe Val Met Pro Met
365 370 375

ATA GCT TTT TAT AGT GTT TCT AGT TTG AAG CGT TTT AGA AAT TTC AAA 1266
Ile Ala Phe Tyr Ser Val Ser Ser Leu Lys Arg Phe Arg Asn Phe Lys
380 385 390

GTG GAT ATT TTT GTG TTT GTC TTT GGG AGC TTG ACG GCT TTG AGC GTG 1314
Val Asp Ile Phe Val Phe Val Phe Gly Ser Leu Thr Ala Leu Ser Val
395 400 405

TTT TTA GGA CTA TTT TAATGGCTAG TTTTTCTATT TTATCTATTT TTAAAATCGG C 1370
Phe Leu Gly Leu Phe
410

GTGGGGCCTA GC 1382

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

Met Ala Gln Glu Lys Ala Val Pro Arg Asp Pro Lys Lys Leu Asn Ala
1 5 10 15

Phe Asp Leu Arg Trp Met Val Ser Leu Phe Gly Thr Ala Val Gly Ala
20 25 30

Gly Ile Leu Phe Leu Pro Ile Arg Ala Gly Gly His Gly Val Trp Ala
35 40 45

Ile Val Val Met Ser Ala Ile Ile Phe Pro Leu Thr Tyr Leu Gly His
50 55 60

Arg Ala Leu Ala Tyr Phe Ile Gly Ser Lys Asp Lys Glu Asp Ile Thr
65 70 75 80

Met Val Val Arg Ser His Phe Gly Ala Gln Trp Gly Phe Leu Ile Thr
85 90 95

Leu Leu Tyr Phe Leu Ala Ile Tyr Pro Ile Cys Leu Val Tyr Gly Val
100 105 110

Gly Ile Thr Asn Val Phe Asp His Phe Phe Thr Asn Gln Leu His Leu
115 120 125

Ala Pro Phe His Arg Gly Leu Leu Ala Val Ala Leu Val Ser Leu Met
130 135 140

Met Leu Val Met Val Phe Asn Ala Thr Ile Val Thr Arg Ile Cys Asn
145 150 155 160

Ala Leu Val Tyr Pro Leu Cys Leu Ile Leu Leu Leu Phe Ser Leu Tyr
165 170 175

Leu Ile Pro Tyr Trp Gln Gly Ala Asn Leu Phe Val Val Pro Ser Phe
180 185 190
Ala Val Ala Leu Val Ser Leu Met Met Leu Val Met Val Phe Asn Ala
140 145 150

ACG ATT GTT ACG CGC ATT TGT AAC GCT TTA GTG TAT CCT TTA TGC TTG 594
Thr Ile Val Thr Arg Ile Cys Asn Ala Leu Val Tyr Pro Leu Cys Leu
155 160 165

ATT TTA TTG CTT TTT TCT TTG TAT CTT ATC CCT TAT TGG CAA GGC GCT 642
Ile Leu Leu Leu Phe Ser Leu Tyr Leu Ile Pro Tyr Trp Gln Gly Ala
170 175 180

AAT CTT TTT GTG GTG CCG AGT TTT AAA GAA TTT GTG TTA GCG ATT TGG 690
Asn Leu Phe Val Val Pro Ser Phe Lys Glu Phe Val Leu Ala Ile Trp
185 190 195 200

CTA ACC TTA CCG GTG CTT GTG TTT GCA TTC GAC CAT AGC CCC ATC ATT 738
Leu Thr Leu Pro Val Leu Val Phe Ala Phe Asp His Ser Pro Ile Ile
205 210 215

TCA ACC TTC ACT CAA AAT GTG GGA AAA GAA TAC GGC GTT TTC AAA GAA 786
Ser Thr Phe Thr Gln Asn Val Gly Lys Glu Tyr Gly Val Phe Lys Glu
220 225 230

TAC AAA CTC AAT CAA ATT GAA TTA GGG ACA TCG CTG ATG CTT TTA GGG 834
Tyr Lys Leu Asn Gln Ile Glu Leu Gly Thr Ser Leu Met Leu Leu Gly
235 240 245

TTT GTG ATG TTT TTT GTG TTT TCG TGC GTC ATG TGC TTG AAT GCT GAT 882
Phe Val Met Phe Phe Val Phe Ser Cys Val Met Cys Leu Asn Ala Asp
250 255 260

GAT TTT GTG AAA GCA AGG GAA CAA AAT ATC CCC ATT TTA AGC TAT TTG 930
Asp Phe Val Lys Ala Arg Glu Gln Asn Ile Pro Ile Leu Ser Tyr Leu
265 270 275 280

GCT AAC ACT TTA AAC AAC CCT TTA ATC AAT TAT GCG GGG CCT GTG GTG 978
Ala Asn Thr Leu Asn Asn Pro Leu Ile Asn Tyr Ala Gly Pro Val Val
285 290 295

GCT TTT TTA GCG ATT TTT TCA TCT TTT TTT GGG CAT TAT TAT GGG GCT 1026
Ala Phe Leu Ala Ile Phe Ser Ser Phe Phe Gly His Tyr Tyr Gly Ala
300 305 310

AAG GAG GGT TTA GAA GGC ATT ATT ATT CAA AGC TTA AAA TTG AAA AAA 1074
Lys Glu Gly Leu Glu Gly Ile Ile Ile Gln Ser Leu Lys Leu Lys Lys
315 320 325

GCT TCT AAA CCC TTG AGC GTT AGC GTA ACG ATT TTT TTA TGG CTG ACT 1122
Ala Ser Lys Pro Leu Ser Val Ser Val Thr Ile Phe Leu Trp Leu Thr
330 335 340

ATC ACG CTT GTG GCT TAT ATT AAC CCC AAT ATC TTG GAT TTT ATT GAA 1170
Ile Thr Leu Val Ala Tyr Ile Asn Pro Asn Ile Leu Asp Phe Ile Glu
345 350 355 360
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1382 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 91...1329
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

ACCACCCCTT AATCTCAAAA AACCCCAATC ATAAAAAGCT TTATGCTACA ATGAAAGCTC 60 
TTTAACACGA TAAAAGGGCG GTTTAATAGC ATG GCA CAA GAA AAA GCA GTT CCA 114 

Met Ala Gln Glu Lys Ala Val Pro
1 5

AGA GAT CCT AAA AAA CTC AAT GCG TTT GAT TTG CGT TGG ATG GTG TCC 162
Arg Asp Pro Lys Lys Leu Asn Ala Phe Asp Leu Arg Trp Met Val Ser
10 15 20

TTA TTT GGC ACG GCG GTG GGG GCT GGG ATT TTA TTT TTG CCT ATT AGA 210
Leu Phe Gly Thr Ala Val Gly Ala Gly Ile Leu Phe Leu Pro Ile Arg
25 30 35 40

GCC GGT GGG CAT GGG GTA TGG GCT ATT GTG GTA ATG AGC GCG ATC ATT 258
Ala Gly Gly His Gly Val Trp Ala Ile Val Val Met Ser Ala Ile Ile
45 50 55

TTC CCT TTA ACT TAT CTA GGG CAT AGA GCT TTA GCT TAT TTC ATA GGA 306
Phe Pro Leu Thr Tyr Leu Gly His Arg Ala Leu Ala Tyr Phe Ile Gly
60 65 70

TCT AAA GAC AAA GAA GAC ATT ACC ATG GTC GTT CGC TCT CAT TTT GGC 354
Ser Lys Asp Lys Glu Asp Ile Thr Met Val Val Arg Ser His Phe Gly
75 80 85

GCT CAA TGG GGT TTT CTT ATC ACT TTG CTT TAT TTC TTA GCG ATT TAT 402
Ala Gln Trp Gly Phe Leu Ile Thr Leu Leu Tyr Phe Leu Ala Ile Tyr
90 95 100

CCT ATT TGC TTG GTT TAT GGG GTG GGT ATC ACT AAC GTG TTT GAT CAT 450
Pro Ile Cys Leu Val Tyr Gly Val Gly Ile Thr Asn Val Phe Asp His
105 110 115 120

TTT TTC ACT AAC CAG TTG CAT TTA GCG CCT TTT CAT CGG GGA TTA TTG 498
Phe Phe Thr Asn Gln Leu His Leu Ala Pro Phe His Arg Gly Leu Leu
125 130 135

GCT GTA GCG TTA GTT TCT TTA ATG ATG TTG GTG ATG GTT TTT AAC GCT 546
(A) LENGTH: 240 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 59...196
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
AACAATTCTT TTTTAAGCAA AAACAAAACA AAATTAAGGC ATAATCACTC TTTTTAAA 58 



ATG AAA GGT CGC GTA GCT CAG TTG GTA GAG CAC TAC CTT GAC ATG GTA 106
Met Lys Gly Arg Val Ala Gln Leu Val Glu His Tyr Leu Asp Met Val
1 5 10 15

GTG GCC GCT GGT TCA AGT CCA GTC GTG GCC ACC ATT ATC ACT CCA ATT 154
Val Ala Ala Gly Ser Ser Pro Val Val Ala Thr Ile Ile Thr Pro Ile
20 25 30

TTA ATT CTC ATT TTT TTG CGA GTT TTT GAT CTT TAT AAA TTC TAAAGGGGTA 206
Leu Ile Leu Ile Phe Leu Arg Val Phe Asp Leu Tyr Lys Phe
35 40 45

TTAAACGCAC TTCTAATAAC GATTTTATAG CGCT 240
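
The pairing of each codon line with its three-letter translation can be spot-checked mechanically. The following is a minimal Python sketch, assuming the standard genetic code applies to these coding sequences; the names CODON_TABLE, THREE_LETTER and translate are illustrative and are not part of the listing. It translates the last coding line of SEQ ID NO: 101 (nucleotides 155 to 196).

```python
# Illustrative helper for cross-checking a codon line against its translation.
# Assumption: the standard genetic code; names here are hypothetical.
BASES = "TCAG"
# One-letter residues for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}


def translate(dna):
    """Translate an in-frame DNA string into three-letter residues.

    Whitespace between codons is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, len(seq), 3)]


# Last coding line of SEQ ID NO: 101 as printed in the listing.
line = "TTA ATT CTC ATT TTT TTG CGA GTT TTT GAT CTT TAT AAA TTC"
print(" ".join(translate(line)))
# Leu Ile Leu Ile Phe Leu Arg Val Phe Asp Leu Tyr Lys Phe
```

A codon line whose computed translation disagrees with the printed residues is a candidate transcription error in either the nucleotide or the amino acid line.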

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Met Lys Gly Arg Val Ala Gln Leu Val Glu His Tyr Leu Asp Met Val
1 5 10 15

Val Ala Ala Gly Ser Ser Pro Val Val Ala Thr Ile Ile Thr Pro Ile
20 25 30

Leu Ile Leu Ile Phe Leu Arg Val Phe Asp Leu Tyr Lys Phe
35 40 45
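
The stated protein lengths can be checked against the coding-sequence locations given for the corresponding nucleotide records. The Python sketch below assumes that each LOCATION range is 1-based and inclusive and excludes the stop codon, which is how the ranges in this listing appear to be written; the function name cds_residue_count is illustrative.

```python
def cds_residue_count(start, end):
    """Return the number of codons in a coding sequence spanning start..end.

    Positions are taken as 1-based and inclusive, with the stop codon excluded.
    """
    length = end - start + 1
    if length % 3:
        raise ValueError("coding sequence length is not a multiple of three")
    return length // 3


# (CDS start, CDS end, stated protein length) as given in the listing.
records = {
    "SEQ ID NO: 101 / 102": (59, 196, 46),
    "SEQ ID NO: 103 / 104": (91, 1329, 413),
    "SEQ ID NO: 105 / 106": (63, 827, 255),
}

for name, (start, end, stated) in records.items():
    computed = cds_residue_count(start, end)
    print(f"{name}: computed {computed}, stated {stated}")
```

For these three records the computed residue count equals the stated length in amino acids.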

(2) INFORMATION FOR SEQ ID NO: 103: 
ATTACAAGGG AATGA 1010
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Met Lys Thr Ser Lys Thr Lys Thr Pro Lys Ser Val Leu Ile Ala Gly
1 5 10 15

Pro Cys Val Ile Glu Ser Leu Glu Asn Leu Arg Ser Ile Ala Thr Lys
20 25 30

Leu Gln Pro Leu Ala Asn Asn Glu Arg Leu Asp Phe Tyr Phe Lys Ala
35 40 45

Ser Phe Asp Lys Ala Asn Arg Thr Ser Leu Glu Ser Tyr Arg Gly Pro
50 55 60

Gly Leu Glu Lys Gly Leu Glu Met Leu Gln Thr Ile Lys Glu Glu Phe
65 70 75 80

Gly Tyr Lys Ile Leu Thr Asp Val His Glu Ser Tyr Gln Ala Ser Val
85 90 95

Ala Ala Lys Val Ala Asp Ile Leu Gln Ile Pro Ala Phe Leu Cys Arg
100 105 110

Gln Thr Asp Leu Ile Val Glu Val Ser Gln Thr Asn Ala Ile Val Asn
115 120 125

Ile Lys Lys Gly Gln Phe Met Asn Pro Lys Asp Met Gln Tyr Ser Val
130 135 140

Leu Lys Ala Leu Lys Thr Arg Asp Lys Ser Ile Gln Ser Pro Thr Tyr
145 150 155 160

Glu Thr Ala Leu Lys Asn Gly Val Trp Leu Cys Glu Arg Gly Ser Ser
165 170 175

Phe Gly Tyr Gly Asn Leu Val Val Asp Met Arg Ser Leu Lys Ile Met
180 185 190

Arg Glu Phe Ala Pro Val Ile Phe Asp Ala Thr His Ser Val Gln Met
195 200 205

Pro Gly Gly Ala Asn Gly Lys Ser Ser Gly Asp Ser Ser Phe Ala Pro
210 215 220

Ile Leu Ala Arg Ala Ala Ala Ala Val Gly Ile Asp Gly Leu Phe Ala
225 230 235 240

Glu Thr His Val Asp Pro Lys Asn Ala Leu Ser Asp Gly Ala Asn Met
245 250 255

Leu Lys Pro Asp Glu Leu Glu Gln Leu Val Thr Asp Met Leu Lys Ile
260 265 270

Gln Asn Leu Phe
275

(2) INFORMATION FOR SEQ ID NO: 101: 
(i) SEQUENCE CHARACTERISTICS: 
GGG CCT GGT TTA GAA AAA GGC CTA GAA ATG TTA CAA ACG ATC AAA GAG 363
Gly Pro Gly Leu Glu Lys Gly Leu Glu Met Leu Gln Thr Ile Lys Glu
65 70 75

GAA TTT GGT TAT AAA ATC TTA ACC GAT GTG CAT GAG AGT TAT CAA GCA 411
Glu Phe Gly Tyr Lys Ile Leu Thr Asp Val His Glu Ser Tyr Gln Ala
80 85 90

AGC GTG GCA GCC AAA GTG GCG GAT ATT TTA CAA ATC CCG GCG TTT TTG 459
Ser Val Ala Ala Lys Val Ala Asp Ile Leu Gln Ile Pro Ala Phe Leu
95 100 105 110

TGC CGC CAA ACG GAT CTG ATT GTA GAA GTG AGC CAG ACT AAC GCT ATT 507
Cys Arg Gln Thr Asp Leu Ile Val Glu Val Ser Gln Thr Asn Ala Ile
115 120 125

GTC AAT ATC AAA AAA GGG CAA TTC ATG AAC CCA AAA GAC ATG CAA TAT 555
Val Asn Ile Lys Lys Gly Gln Phe Met Asn Pro Lys Asp Met Gln Tyr
130 135 140

TCT GTT CTA AAG GCC CTT AAA ACG AGA GAT AAA AGC ATT CAA AGC CCC 603
Ser Val Leu Lys Ala Leu Lys Thr Arg Asp Lys Ser Ile Gln Ser Pro
145 150 155

ACT TAT GAA ACA GCG TTA AAA AAT GGC GTG TGG CTG TGT GAA AGG GGG 651
Thr Tyr Glu Thr Ala Leu Lys Asn Gly Val Trp Leu Cys Glu Arg Gly
160 165 170

AGC AGC TTT GGG TAT GGG AAT TTA GTG GTG GAT ATG CGC TCT TTA AAA 699
Ser Ser Phe Gly Tyr Gly Asn Leu Val Val Asp Met Arg Ser Leu Lys
175 180 185 190

ATC ATG CGA GAA TTT GCC CCT GTG ATT TTT GAC GCT ACC CAT AGC GTG 747
Ile Met Arg Glu Phe Ala Pro Val Ile Phe Asp Ala Thr His Ser Val
195 200 205

CAA ATG CCA GGG GGA GCG AAC GGG AAA AGT TCA GGA GAC AGC TCT TTT 795
Gln Met Pro Gly Gly Ala Asn Gly Lys Ser Ser Gly Asp Ser Ser Phe
210 215 220

GCC CCT ATT TTA GCG AGA GCT GCG GCG GCG GTG GGG ATT GAT GGG TTG 843
Ala Pro Ile Leu Ala Arg Ala Ala Ala Ala Val Gly Ile Asp Gly Leu
225 230 235

TTT GCT GAA ACG CAT GTT GAT CCT AAA AAC GCC CTA AGC GAT GGA GCA 891
Phe Ala Glu Thr His Val Asp Pro Lys Asn Ala Leu Ser Asp Gly Ala
240 245 250

AAC ATG CTA AAA CCT GAC GAG CTA GAA CAA TTA GTA ACC GAC ATG TTA 939
Asn Met Leu Lys Pro Asp Glu Leu Glu Gln Leu Val Thr Asp Met Leu
255 260 265 270

AAA ATC CAA AAT TTA TTT TAAAGGAATT TCATGCTU^T CATAGAAGGG AAATTGCA 995
Lys Ile Gln Asn Leu Phe
275
(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 130...957
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

AAGCGAGCAA GAATGAATTA AAAATTTTTG GTTGGCACTA CATCATAGAA ACAGGCAGGA 60 
TTTATAATTA TAATTTTGAA AGCCATTTTT TTGAGCCGAT TGGAGAAACC ATTAAACAAA 120
GGAAAAGTC ATG AAA ACT TCT AAA ACA AAA ACC CCT AAA TCC GTT TTA ATC 171
Met Lys Thr Ser Lys Thr Lys Thr Pro Lys Ser Val Leu Ile
1 5 10

GCT GGG CCA TGC GTC ATT GAG AGC TTA GAA AAT CTA AGA AGT ATC GCC 219
Ala Gly Pro Cys Val Ile Glu Ser Leu Glu Asn Leu Arg Ser Ile Ala
15 20 25 30

ACT AAA TTG CAA CCC CTA GCC AAC AAC GAG CGG TTG GAT TTT TAT TTT 267
Thr Lys Leu Gln Pro Leu Ala Asn Asn Glu Arg Leu Asp Phe Tyr Phe
35 40 45

AAA GCG AGT TTT GAT AAG GCG AAC CGC ACG AGT TTA GAG AGT TAC AGA 315 
Lys Ala Ser Phe Asp Lys Ala Asn Arg Thr Ser Leu Glu Ser Tyr Arg 
50 55 60 
80 85 90 95

GAG TCT CCT AAG GCC AAA CGG CAG CAG GAT TTG TAT TTC TTC CCT AAT 396
Glu Ser Pro Lys Ala Lys Arg Gln Gln Asp Leu Tyr Phe Phe Pro Asn
100 105 110

GGG GTT ACT TAT AAA AGA AGC GAT GAT TCC AGT TTT TGG AGT GAA ACA 444
Gly Val Thr Tyr Lys Arg Ser Asp Asp Ser Ser Phe Trp Ser Glu Thr
115 120 125

GGG ATT TAT AAC CAT AAG GAG CAA AAT TTT AAA GGC AAG GGC CGT TTC 492
Gly Ile Tyr Asn His Lys Glu Gln Asn Phe Lys Gly Lys Gly Arg Phe
130 135 140

ATT CTC ACT TCA AAG GAC AGC AAG ATT GAA GGG CTT GAC ATT TCT TAT 540
Ile Leu Thr Ser Lys Asp Ser Lys Ile Glu Gly Leu Asp Ile Ser Tyr
145 150 155

TCG CAT GCA TTA GCT ATT ATT GAA GCT CAA AGC ATT CAA GCG CAT TTA 588
Ser His Ala Leu Ala Ile Ile Glu Ala Gln Ser Ile Gln Ala His Leu
160 165 170 175

TTC TTA GAT GAA ATC AAA CAA AGC CAA AAA GAA AAG AAA AAA TTC CCC 636
Phe Leu Asp Glu Ile Lys Gln Ser Gln Lys Glu Lys Lys Lys Phe Pro
180 185 190

ACT TTC AAA GGA GGT TTT TAATGCGTTG GTGGTGTTTT TTGGTGTGTT GTTTTGGT 692
Thr Phe Lys Gly Gly Phe
195

ATTTTAAGCG TGAT 706 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Met Lys Arg Ser Ser Phe Thr Ser Asn Ser Val Leu Asn Phe Phe Val
1 5 10 15

Val Leu Ser Phe Ile Thr Ile Gly Leu Val Phe Phe Phe Leu Arg Ser
20 25 30

Gln Pro Thr Ser Val Val Ser Lys Glu Asn Ile Pro Lys Ile Glu Leu
35 40 45

Glu Asn Phe Lys Ala Phe Gln Ile Asn Asp Lys Ile Leu Asp Leu Ser
50 55 60

Ile Glu Gly Lys Lys Ala Leu Gln Tyr Asp Asp His Glu Ile Phe Phe
65 70 75 80
435 440 445

Asn Ala Ser Arg Ala Ile Phe Asp Ser Asn Ile Thr Ser Leu Ile Ala
450 455 460

Ser Val Leu Leu Tyr Ala Tyr Gly Thr Gly Ala Ile Lys Gly Phe Ala
465 470 475 480

Leu Thr Thr Gly Ile Gly Ile Leu Ala Ser Ile Ile Thr Ala Ile Val
485 490 495

Gly Thr Gln Gly Ile Tyr Gln Ala Leu Leu Pro Lys Leu Thr Gln Thr
500 505 510

Lys Ser Leu Tyr Phe Trp Phe Gly Val Asn Lys Arg Ala
515 520 525

(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 64...654
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



CAGGGGGCAA GGGGGCTGTT AGGGAAGCGA TTGATTATCT TTTAACATTA GAAGGCTTGC 60 

AAG ATG AAG CGC TCA AGC TTT ACC TCT AAT AGC GTT TTA AAC TTT TTT 108
Met Lys Arg Ser Ser Phe Thr Ser Asn Ser Val Leu Asn Phe Phe
1 5 10 15

GTA GTT TTG TCT TTC ATT ACG ATA GGA TTA GTG TTT TTC TTT TTG CGT 156
Val Val Leu Ser Phe Ile Thr Ile Gly Leu Val Phe Phe Phe Leu Arg
20 25 30

TCC CAA CCC ACT AGC GTA GTT TCT AAA GAA AAT ATC CCT AAA ATT GAA 204
Ser Gln Pro Thr Ser Val Val Ser Lys Glu Asn Ile Pro Lys Ile Glu
35 40 45

TTA GAA AAT TTT AAA GCG TTT CAA ATC AAC GAT AAA ATC CTT GAT CTG 252
Leu Glu Asn Phe Lys Ala Phe Gln Ile Asn Asp Lys Ile Leu Asp Leu
50 55 60

TCC ATA GAG GGC AAA AAA GCC CTA CAA TAC GAT GAT CAT GAA ATC TTT 300
Ser Ile Glu Gly Lys Lys Ala Leu Gln Tyr Asp Asp His Glu Ile Phe
65 70 75

TTT GAT TCC AAA ATC AAG CGC TAT GAT GAA GAC ACC ATT GAA AGC GTT 348
Phe Asp Ser Lys Ile Lys Arg Tyr Asp Glu Asp Thr Ile Glu Ser Val
Met Lys Leu Phe Asn Ala Arg Leu Ile Val Phe Ile Gly Ala Leu Leu
1 5 10 15

Leu Gly Val Gly Phe Ser Val Pro Ser Leu Leu Glu Thr Lys Gly Pro
20 25 30

Lys Ile Thr Leu Gly Leu Asp Leu Arg Gly Gly Leu Asn Met Leu Leu
35 40 45

Gly Val Gln Thr Asp Glu Ala Leu Lys Asn Lys Tyr Leu Ser Leu Ala
50 55 60

Ser Ala Leu Glu Tyr Asn Ala Lys Lys Gln Asn Ile Leu Leu Lys Asp
65 70 75 80

Ile Lys Ser Asn Leu Glu Gly Ile Ser Phe Glu Leu Leu Asp Glu Asp
85 90 95

Glu Ala Lys Lys Leu Asp Ala Leu Leu Leu Glu Leu Gln Gly His Ser
100 105 110

Gln Phe Glu Ile Lys Lys Glu Ala Gly Phe Tyr Ser Val Asn Leu Thr
115 120 125

Pro Leu Glu Gln Glu Glu Leu Arg Lys Asn Thr Ile Leu Gln Val Ile
130 135 140

Gly Ile Ile Arg Asn Arg Leu Asp Gln Phe Gly Leu Ala Glu Pro Val
145 150 155 160

Val Ile Gln Gln Gly Lys Glu Glu Ile Ser Val Gln Leu Pro Gly Ile
165 170 175

Lys Thr Leu Glu Glu Glu Arg Arg Ala Lys Asp Leu Ile Ser Arg Ser
180 185 190

Ala His Leu Gln Met Met Ala Val Asp Glu Glu His Asn Lys Asp Ala
195 200 205

Met Lys Met Thr Asp Leu Glu Ala Gln Lys Leu Gly Ser Val Leu Leu
210 215 220

Ser Asp Val Glu Met Gly Gly Lys Ile Leu Leu Lys Ala Ile Pro Ile
225 230 235 240

Leu Asp Gly Glu Met Leu Thr Asp Ala Lys Val Val Tyr Asp Gln Asn
245 250 255

Asn Gln Pro Val Val Ser Phe Thr Leu Asp Ala Gln Gly Ala Lys Ile
260 265 270

Phe Gly Asp Phe Ser Gly Ala Asn Val Gly Lys Arg Met Ala Ile Val
275 280 285

Leu Asp Asn Lys Val Tyr Ser Ala Pro Val Ile Arg Glu Arg Ile Gly
290 295 300

Gly Gly Ser Gly Gln Ile Ser Gly Asn Phe Ser Val Ala Gln Ala Ser
305 310 315 320

Asp Leu Ala Ile Ala Leu Arg Ser Gly Ala Met Ser Ala Pro Ile Gln
325 330 335

Val Leu Glu Lys Arg Ile Ile Gly Pro Ser Leu Gly Lys Asp Ser Val
340 345 350

Lys Thr Ser Ile Ile Ala Leu Val Gly Gly Phe Ile Leu Val Met Gly
355 360 365

Phe Met Val Leu Tyr Tyr Ser Met Ala Gly Val Ile Ala Cys Leu Ala
370 375 380

Leu Val Val Asn Leu Phe Leu Ile Val Ala Val Met Ala Ile Phe Gly
385 390 395 400

Ala Thr Leu Thr Leu Pro Gly Met Ala Gly Ile Val Leu Thr Val Gly
405 410 415

Ile Ala Val Asp Ala Asn Ile Ile Ile Asn Glu Arg Ile Arg Glu Val
420 425 430

Leu Arg Glu Asn Glu Gly Ile Ala Lys Ala Ile His Leu Gly Tyr Ile
CTT TAT TAC TCT ATG GCG GGG GTG ATC GCT TGT TTG GCG TTA GTG GTC 1210
Leu Tyr Tyr Ser Met Ala Gly Val Ile Ala Cys Leu Ala Leu Val Val
375 380 385

AAT CTT TTT TTG ATT GTG GCG GTC ATG GCG ATT TTT GGA GCG ACG CTG 1258
Asn Leu Phe Leu Ile Val Ala Val Met Ala Ile Phe Gly Ala Thr Leu
390 395 400

ACT TTA CCG GGA ATG GCG GGG ATT GTT TTA ACC GTG GGG ATT GCC GTG 1306
Thr Leu Pro Gly Met Ala Gly Ile Val Leu Thr Val Gly Ile Ala Val
405 410 415

GAT GCT AAT ATC ATC ATC AAC GAG CGC ATT AGA GAA GTC TTA AGA GAG 1354
Asp Ala Asn Ile Ile Ile Asn Glu Arg Ile Arg Glu Val Leu Arg Glu
420 425 430 435

AAT GAG GGC ATC GCT AAA GCG ATC CAT TTA GGC TAT ATC AAT GCG AGC 1402
Asn Glu Gly Ile Ala Lys Ala Ile His Leu Gly Tyr Ile Asn Ala Ser
440 445 450

CGG GCG ATT TTT GAT TCT AAT ATC ACT TCT TTG ATC GCT TCA GTG TTA 1450
Arg Ala Ile Phe Asp Ser Asn Ile Thr Ser Leu Ile Ala Ser Val Leu
455 460 465

TTA TAC GCT TAT GGC ACA GGA GCG ATT AAA GGC TTT GCC CTA ACT ACA 1498
Leu Tyr Ala Tyr Gly Thr Gly Ala Ile Lys Gly Phe Ala Leu Thr Thr
470 475 480

GGC ATT GGG ATT TTA GCC TCT ATT ATC ACC GCT ATT GTT GGC ACG CAA 1546
Gly Ile Gly Ile Leu Ala Ser Ile Ile Thr Ala Ile Val Gly Thr Gln
485 490 495

GGG ATT TAT CAA GCC CTT TTA CCT AAA CTC ACT CAA ACA AAA AGC CTT 1594
Gly Ile Tyr Gln Ala Leu Leu Pro Lys Leu Thr Gln Thr Lys Ser Leu
500 505 510 515

TAC TTT TGG TTT GGC GTG AAT AAA AGA GCT TAGGAGGTTT TATGGAATTA TTC 1647
Tyr Phe Trp Phe Gly Val Asn Lys Arg Ala
520 525

AAACGAACTA GAATCTTAAG CTTC 1671

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 
CGT AAC CGC TTG GAT CAA TTT GGT TTG GCA GAG CCT GTA GTC ATT CAG 538
Arg Asn Arg Leu Asp Gln Phe Gly Leu Ala Glu Pro Val Val Ile Gln
150 155 160

CAA GGT AAA GAA GAA ATT TCG GTG CAA TTG CCT GGC ATT AAG ACT TTA 586
Gln Gly Lys Glu Glu Ile Ser Val Gln Leu Pro Gly Ile Lys Thr Leu
165 170 175

GAA GAA GAA CGG CGC GCT AAA GAC TTG ATT TCA AGA TCC GCT CAT TTG 634
Glu Glu Glu Arg Arg Ala Lys Asp Leu Ile Ser Arg Ser Ala His Leu
180 185 190 195

CAG ATG ATG GCG GTG GAT GAA GAA CAC AAT AAA GAT GCG ATG AAA ATG 682
Gln Met Met Ala Val Asp Glu Glu His Asn Lys Asp Ala Met Lys Met
200 205 210

ACG GAT TTA GAG GCT CAA AAA TTA GGC AGC GTG TTG TTG TCT GAT GTG 730
Thr Asp Leu Glu Ala Gln Lys Leu Gly Ser Val Leu Leu Ser Asp Val
215 220 225

GAA ATG GGG GGT AAA ATC TTG CTC AAA GCG ATC CCC ATT TTA GAT GGC 778
Glu Met Gly Gly Lys Ile Leu Leu Lys Ala Ile Pro Ile Leu Asp Gly
230 235 240

GAA ATG CTT ACA GAT GCG AAA GTG GTG TAT GAC CAA AAC AAC CAG CCG 826
Glu Met Leu Thr Asp Ala Lys Val Val Tyr Asp Gln Asn Asn Gln Pro
245 250 255

GTG GTG AGC TTC ACG CTG GAT GCG CAA GGG GCT AAG ATT TTT GGG GAT 874
Val Val Ser Phe Thr Leu Asp Ala Gln Gly Ala Lys Ile Phe Gly Asp
260 265 270 275

TTC TCA GGT GCG AAT GTG GGC AAA CGC ATG GCG ATT GTT TTA GAC AAT 922
Phe Ser Gly Ala Asn Val Gly Lys Arg Met Ala Ile Val Leu Asp Asn
280 285 290

AAG GTC TAT TCA GCC CCG GTG ATT AGG GAG CGT ATC GGT GGG GGG AGC 970
Lys Val Tyr Ser Ala Pro Val Ile Arg Glu Arg Ile Gly Gly Gly Ser
295 300 305

GGG CAG ATT AGC GGG AAT TTT AGC GTG GCT CAA GCG AGC GAT TTA GCG 1018
Gly Gln Ile Ser Gly Asn Phe Ser Val Ala Gln Ala Ser Asp Leu Ala
310 315 320

ATC GCT TTA AGG AGT GGG GCG ATG AGC GCT CCC ATT CAG GTT TTA GAA 1066
Ile Ala Leu Arg Ser Gly Ala Met Ser Ala Pro Ile Gln Val Leu Glu
325 330 335

AAA AGA ATT ATA GGC CCA AGT TTA GGG AAA GAC AGC GTT AAA ACT TCC 1114
Lys Arg Ile Ile Gly Pro Ser Leu Gly Lys Asp Ser Val Lys Thr Ser
340 345 350 355

ATT ATC GCT CTA GTT GGG GGC TTT ATT TTA GTG ATG GGC TTT ATG GTG 1162
Ile Ile Ala Leu Val Gly Gly Phe Ile Leu Val Met Gly Phe Met Val
360 365 370
(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 50...1624
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

CAAAATTATC TGGTGCTAAG ACTTTGAAAC AACGCCAAAT AACAACTGA ATG AAA CTT 58 

Met Lys Leu 
1 

TTT AAC GCT CGT TTA ATC GTT TTT ATT GGC GCG CTT CTT TTA GGG GTA 106
Phe Asn Ala Arg Leu Ile Val Phe Ile Gly Ala Leu Leu Leu Gly Val
5 10 15

GGG TTT TCT GTG CCT TCT TTA CTA GAA ACT AAA GGC CCT AAA ATC ACT 154
Gly Phe Ser Val Pro Ser Leu Leu Glu Thr Lys Gly Pro Lys Ile Thr
20 25 30 35

TTA GGT TTG GAT TTA AGG GGG GGG TTG AAC ATG CTT TTA GGG GTA CAA 202
Leu Gly Leu Asp Leu Arg Gly Gly Leu Asn Met Leu Leu Gly Val Gln
40 45 50

ACC GAT GAG GCT TTA AAA AAC AAG TAT TTA AGC TTG GCG TCC GCT TTA 250
Thr Asp Glu Ala Leu Lys Asn Lys Tyr Leu Ser Leu Ala Ser Ala Leu
55 60 65

GAA TAC AAC GCT AAA AAG CAA AAT ATC TTG CTT AAA GAT ATT AAA TCC 298
Glu Tyr Asn Ala Lys Lys Gln Asn Ile Leu Leu Lys Asp Ile Lys Ser
70 75 80

AAT TTA GAA GGG ATC AGT TTT GAG CTT TTA GAT GAA GAT GAA GCG AAA 346
Asn Leu Glu Gly Ile Ser Phe Glu Leu Leu Asp Glu Asp Glu Ala Lys
85 90 95

AAA TTA GAC GCG CTT TTA TTG GAA TTG CAA GGC CAT AGC CAG TTT GAA 394
Lys Leu Asp Ala Leu Leu Leu Glu Leu Gln Gly His Ser Gln Phe Glu
100 105 110 115

ATC AAA AAG GAA GCG GGG TTT TAT AGC GTG AAT CTC ACC CCT TTA GAG 442
Ile Lys Lys Glu Ala Gly Phe Tyr Ser Val Asn Leu Thr Pro Leu Glu
120 125 130

CAA GAA GAA TTG CGT AAA AAC ACG ATT TTG CAA GTG ATA GGG ATC ATT 490
Gln Glu Glu Leu Arg Lys Asn Thr Ile Leu Gln Val Ile Gly Ile Ile
135 140 145
Val Leu Leu Leu Ala Tyr His Phe Tyr Cys Lys Lys Cys Met Arg Glu
100 105 110

CTG GAA AAA GAC CCC ACA AGG AGA AAC GCA AGG TTT TAT CGC GTG TTT 443
Leu Glu Lys Asp Pro Thr Arg Arg Asn Ala Arg Phe Tyr Arg Val Phe
115 120 125

AAT GAG GCG CCA ACG ATT TTA ATG ATC CTC ATT GTG ATT TTA GTG GTT 491
Asn Glu Ala Pro Thr Ile Leu Met Ile Leu Ile Val Ile Leu Val Val
130 135 140

GTG AAG CCT TTT TAAAGACAAG CCATGAAAAA AGAAAAGTCA TGAAAAAAGA AAAGCA 549
Val Lys Pro Phe
145

TCTCAAGC 557
(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 148 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

Met Gly Phe Leu Asn Gly Tyr Phe Leu Trp Val Lys Ala Phe His Val
1 5 10 15

Ile Ala Val Ile Ser Trp Met Ala Ala Leu Phe Tyr Leu Pro Arg Leu
20 25 30

Phe Val Tyr His Ala Glu Asn Ala His Lys Lys Glu Phe Val Gly Val
35 40 45

Val Gln Ile Gln Glu Lys Lys Leu Tyr Ser Phe Ile Ala Ser Pro Ala
50 55 60

Met Gly Phe Thr Leu Ile Thr Gly Ile Leu Met Leu Leu Ile Glu Pro
65 70 75 80

Thr Leu Phe Lys Ser Gly Gly Trp Leu His Ala Lys Leu Ala Leu Val
85 90 95

Val Leu Leu Leu Ala Tyr His Phe Tyr Cys Lys Lys Cys Met Arg Glu
100 105 110

Leu Glu Lys Asp Pro Thr Arg Arg Asn Ala Arg Phe Tyr Arg Val Phe
115 120 125

Asn Glu Ala Pro Thr Ile Leu Met Ile Leu Ile Val Ile Leu Val Val
130 135 140

Val Lys Pro Phe
145

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1671 base pairs 
Phe Glu Val Lys Gly Asp Thr Glu Ser Ile Gly Arg Leu Thr Thr Ile
340 345 350

Ser Val Val Asn Ala Leu Phe Trp Ile Ile Phe Leu Asp Ala Ile Phe
355 360 365

Ser Ile Ile Phe Ser Lys Leu Asn Ile
370 375

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 60...503
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:
CACAAGAGAA AATTTGCAAG CGTTTTTACA ACAAAATAAG ATAAATTAGG GAGAGTGGT 59 



ATG GGA TTT TTG AAT GGG TAT TTT TTA TGG GTT AAG GCT TTC CAT GTG 107 
Met Gly Phe Leu Asn Gly Tyr Phe Leu Trp Val Lys Ala Phe His Val 
1 5 10 15

ATA GCG GTC ATT TCG TGG ATG GCA GCG TTG TTT TAT TTG CCG CGC CTT 155
Ile Ala Val Ile Ser Trp Met Ala Ala Leu Phe Tyr Leu Pro Arg Leu
20 25 30

TTT GTC TAT CAT GCA GAA AAC GCG CAT AAA AAA GAG TTT GTA GGA GTG 203
Phe Val Tyr His Ala Glu Asn Ala His Lys Lys Glu Phe Val Gly Val
35 40 45

GTT CAA ATC CAA GAA AAA AAG CTT TAT TCC TTT ATC GCT TCA CCG GCT 251
Val Gln Ile Gln Glu Lys Lys Leu Tyr Ser Phe Ile Ala Ser Pro Ala
50 55 60

ATG GGT TTT ACG CTT ATT ACA GGG ATT TTA ATG CTG TTG ATA GAG CCT 299
Met Gly Phe Thr Leu Ile Thr Gly Ile Leu Met Leu Leu Ile Glu Pro
65 70 75 80

ACG CTC TTT AAA AGT GGG GGT TGG TTG CAT GCT AAA TTG GCT TTA GTG 347
Thr Leu Phe Lys Ser Gly Gly Trp Leu His Ala Lys Leu Ala Leu Val
85 90 95

GTT TTA CTT TTA GCC TAT CAT TTT TAT TGC AAA AAA TGC ATG CGC GAG 395 
(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

Met Lys Thr Glu Lys Gln Lys Phe Leu Glu Met Arg Lys Asp Gly Ala
1 5 10 15

Asn Ser Val Leu Ile Leu Arg Gly Asp Trp Asp Phe Lys Thr Ser Val
20 25 30

Phe Arg Leu Asp Glu Leu Lys Lys Asn Leu Leu Asp His Gln Gly Pro
35 40 45

Leu Lys Met Asp Phe Ser Gly Cys Gln Lys Val Asp Phe Val Phe Gly
50 55 60

Met Phe Leu Phe Asp Leu Val Lys Glu Arg Ser Leu Asn Ile Glu Leu
65 70 75 80

Cys Asn Val Ser Glu Asn Asn Ala Cys Ala Leu Lys Val Val Lys Asp
85 90 95

Trp Leu Glu Lys Glu Glu Asp Leu Glu Ser Lys Lys Ala Gly Lys His
100 105 110

Tyr Glu Leu Leu Ile Thr Lys Leu Gly Lys Ser Ile Val Glu Thr Tyr
115 120 125

Asn Thr Phe Leu Asn Ala Phe Asn Phe Cys Gly Met Ile Leu Phe Tyr
130 135 140

Phe Ile Lys Ser Val Phe Asn Pro Lys Arg Phe Cys Ile Thr Pro Leu
145 150 155 160

Leu Tyr His Ile Asn Glu Ser Gly Phe Lys Val Leu Pro Val Ser Ile
165 170 175

Leu Thr Val Phe Ile Val Gly Phe Ala Val Ala Leu Gln Gly Ala Leu
180 185 190

Gln Leu Gln Asp Met Gly Ala Pro Leu Met Ser Val Glu Met Thr Ala
195 200 205

Lys Leu Ala Leu Arg Glu Ile Gly Pro Phe Ile Leu Thr Leu Val Val
210 215 220

Ala Gly Arg Ser Ala Ser Ser Phe Thr Ala Gln Ile Gly Val Met Lys
225 230 235 240

Ile Thr Glu Glu Leu Asp Ala Met Lys Thr Met Gly Phe Asn Pro Phe
245 250 255

Glu Phe Leu Val Leu Pro Arg Val Leu Ala Leu Val Ile Val Leu Pro
260 265 270

Leu Leu Val Phe Ile Ala Asp Ala Phe Ala Ile Leu Gly Gly Met Phe
275 280 285

Ala Ile Lys Tyr Gln Leu Asp Leu Gly Phe Pro Ser Tyr Ile Asp Arg
290 295 300

Phe His Asp Thr Val Gly Trp Asn His Phe Leu Val Gly Ile Val Lys
305 310 315 320

Ala Pro Phe Trp Gly Phe Ala Ile Ala Met Val Gly Cys Met Arg Gly
325 330 335
155 160 165 170

GTT TTG CCA GTG AGT ATT TTA ACG GTG TTT ATC GTG GGG TTT GCC GTT 641
Val Leu Pro Val Ser Ile Leu Thr Val Phe Ile Val Gly Phe Ala Val
175 180 185

GCT TTA CAA GGG GCT TTA CAA TTA CAA GAG ATG GGC GCG CCT TTA ATG 689
Ala Leu Gln Gly Ala Leu Gln Leu Gln Asp Met Gly Ala Pro Leu Met
190 195 200

TCG GTG GAA ATG ACG GCT AAA CTC GCT TTA AGA GAA ATC GGC CCT TTT 737
Ser Val Glu Met Thr Ala Lys Leu Ala Leu Arg Glu Ile Gly Pro Phe
205 210 215

ATT TTA ACC CTT GTG GTG GCC GGG AGG AGC GCG AGC AGT TTT ACC GCG 785
Ile Leu Thr Leu Val Val Ala Gly Arg Ser Ala Ser Ser Phe Thr Ala
220 225 230

CAA ATT GGG GTG ATG AAG ATC ACT GAG GAA TTA GAC GCG ATG AAA ACC 833
Gln Ile Gly Val Met Lys Ile Thr Glu Glu Leu Asp Ala Met Lys Thr
235 240 245 250

ATG GGC TTT AAC CCT TTT GAA TTT TTA GTG TTG CCT AGG GTG TTA GCC 881
Met Gly Phe Asn Pro Phe Glu Phe Leu Val Leu Pro Arg Val Leu Ala
255 260 265

TTA GTG ATT GTT TTG CCT TTA TTG GTG TTT ATT GCC GAT GCG TTC GCC 929
Leu Val Ile Val Leu Pro Leu Leu Val Phe Ile Ala Asp Ala Phe Ala
270 275 280

ATT CTT GGG GGC ATG TTT GCG ATT AAA TAC CAA TTG GAT TTA GGC TTC 977
Ile Leu Gly Gly Met Phe Ala Ile Lys Tyr Gln Leu Asp Leu Gly Phe
285 290 295

CCG AGC TAT ATT GAC AGA TTC CAT GAC ACA GTG GGT TGG AAC CAT TTT 1025
Pro Ser Tyr Ile Asp Arg Phe His Asp Thr Val Gly Trp Asn His Phe
300 305 310

TTG GTA GGG ATT GTC AAA GCC CCT TTT TGG GGG TTT GCG ATT GCG ATG 1073
Leu Val Gly Ile Val Lys Ala Pro Phe Trp Gly Phe Ala Ile Ala Met
315 320 325 330

GTA GGG TGC ATG CGC GGG TTT GAA GTC AAG GGG GAT ACT GAG AGC ATT 1121
Val Gly Cys Met Arg Gly Phe Glu Val Lys Gly Asp Thr Glu Ser Ile
335 340 345

GGG CGC TTG ACC ACT ATT AGC GTC GTG AAC GCT TTG TTT TGG ATC ATT 1169
Gly Arg Leu Thr Thr Ile Ser Val Val Asn Ala Leu Phe Trp Ile Ile
350 355 360

TTC TTA GAC GCT ATT TTT TCT ATC ATC TTT TCT AAG TTG AAC ATA TAATG 1219
Phe Leu Asp Ala Ile Phe Ser Ile Ile Phe Ser Lys Leu Asn Ile
365 370 375

AACGCTACTA ACAATCAAGT CTTAATTGAA GTGAAGGATC TCCATAGCGC 1269
(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 84...1214
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

TTATCATTGT GTTAAAATAG TCGTTTTAAC AAACAAAATT TTGTTAATAG ATTTTACCTA 60
ATCTGAGAGA GAATTATATT TTA ATG AAG ACA GAG AAA CAA AAA TTT TTA GAG 113
Met Lys Thr Glu Lys Gln Lys Phe Leu Glu
1 5 10

ATG CGT AAA GAT GGG GCG AAC TCT GTG CTG ATT TTA AGA GGG GAT TGG 161
Met Arg Lys Asp Gly Ala Asn Ser Val Leu Ile Leu Arg Gly Asp Trp
15 20 25

GAT TTT AAA ACG AGC GTG TTT CGT TTA GAT GAG TTG AAA AAA AAT TTA 209
Asp Phe Lys Thr Ser Val Phe Arg Leu Asp Glu Leu Lys Lys Asn Leu
30 35 40

TTA GAT CAT CAA GGG CCT TTA AAA ATG GAT TTT TCA GGG TGC CAA AAA 257
Leu Asp His Gln Gly Pro Leu Lys Met Asp Phe Ser Gly Cys Gln Lys
45 50 55

GTG GAT TTT GTT TTT GGC ATG TTT TTA TTT GAT TTA GTT AAG GAG CGT 305
Val Asp Phe Val Phe Gly Met Phe Leu Phe Asp Leu Val Lys Glu Arg
60 65 70

TCT TTA AAC ATT GAA TTG TGT AAC GTG AGT GAG AAT AAC GCA TGC GCT 353
Ser Leu Asn Ile Glu Leu Cys Asn Val Ser Glu Asn Asn Ala Cys Ala
75 80 85 90

TTG AAA GTG GTT AAA GAC TGG CTT GAA AAA GAA GAG GAT TTA GAG TCT 401
Leu Lys Val Val Lys Asp Trp Leu Glu Lys Glu Glu Asp Leu Glu Ser
95 100 105

AAA AAA GCG GGC AAA CAC TAC GAA CTT TTG ATC ACT AAA TTG GGT AAG 449
Lys Lys Ala Gly Lys His Tyr Glu Leu Leu Ile Thr Lys Leu Gly Lys
110 115 120

AGT ATC GTA GAG ACT TAT AAT ACC TTT TTA AAC GCA TTC AAT TTT TGC 497
Ser Ile Val Glu Thr Tyr Asn Thr Phe Leu Asn Ala Phe Asn Phe Cys
125 130 135

GGC ATG ATT TTA TTC TAC TTC ATT AAA AGC GTT TTC AAC CCC AAA CGC 545
Gly Met Ile Leu Phe Tyr Phe Ile Lys Ser Val Phe Asn Pro Lys Arg
140 145 150

TTT TGT ATC ACT CCT TTG CTC TAT CAT ATC AAT GAA TCC GGG TTT AAG 593
Phe Cys Ile Thr Pro Leu Leu Tyr His Ile Asn Glu Ser Gly Phe Lys
ATG TTT TAAATTTATA TTTGAAAGGA TGAACAATGA AAAATCAAGT TAAAAAAATT TT 740
Met Phe
210



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



Met Leu Leu Lys Thr Lys Leu Lys Ile Ile Ser Ser Val Ile Leu Ser
1 5 10 15

Ala Leu Leu Trp Val Gly Cys Ser Ser Glu Met Ala Thr Tyr Gln Asn
20 25 30

Val Asn Asp Ala Thr Lys Asn Thr Thr Ala Ser Ile Asn Ser Thr Asp
35 40 45

Leu Leu Leu Thr Ala Asn Ala Met Leu Asp Ser Met Phe Ser Asp Pro
50 55 60

Asn Phe Glu Gln Leu Lys Gly Lys His Leu Ile Glu Val Ser Asp Val
65 70 75 80

Ile Asn Asp Thr Thr Gln Pro Asn Leu Asp Met Asn Leu Leu Thr Thr
85 90 95

Glu Ile Ala Arg Gln Leu Arg Leu Arg Ser Asn Gly Arg Phe Asn Ile
100 105 110

Thr Arg Ala Ser Gly Gly Ser Gly Ile Ala Ala Asp Ser Arg Met Val
115 120 125

Lys Gln Arg Glu Lys Glu Arg Glu Ser Glu Glu Tyr Asn Gln Asp Thr
130 135 140

Thr Val Glu Lys Gly Thr Leu Lys Ala Ala Asp Leu Ser Leu Ser Gly
145 150 155 160

Lys Val Ser Ser Ile Ala Ala Ser Ile Ser Ser Ser Arg Gln Arg Leu
165 170 175

Asp Tyr Asp Phe Thr Leu Ser Leu Thr Asn Arg Lys Thr Gly Glu Glu
180 185 190

Val Trp Ser Asp Val Lys Pro Ile Val Lys Asn Ala Ser Asn Lys Arg
195 200 205

Met Phe
210



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
TTTAAAATTA GAAACAGATG TATCTGTTTT AAATTTTGAA TAGGGAGTTT CTATCATT 58

ATG TTA TTG AAA ACA AAA TTA AAA ATT ATA AGC TCG GTG ATT TTG AGC 106
Met Leu Leu Lys Thr Lys Leu Lys Ile Ile Ser Ser Val Ile Leu Ser
1 5 10 15

GCT TTA TTG TGG GTG GGT TGC TCA AGC GAA ATG GCA ACT TAT CAA AAC 154
Ala Leu Leu Trp Val Gly Cys Ser Ser Glu Met Ala Thr Tyr Gln Asn
20 25 30

GTG AAT GAT GCC ACT AAA AAT ACG ACT GCA AGC ATT AAT AGC ACG GAT 202
Val Asn Asp Ala Thr Lys Asn Thr Thr Ala Ser Ile Asn Ser Thr Asp
35 40 45

TTA TTG CTA ACC GCT AAC GCG ATG TTA GAT TCC ATG TTT AGC GAC CCT 250
Leu Leu Leu Thr Ala Asn Ala Met Leu Asp Ser Met Phe Ser Asp Pro
50 55 60

AAT TTT GAG CAA CTC AAG GGC AAG CAT TTG ATT GAA GTT TCA GAT GTG 298
Asn Phe Glu Gln Leu Lys Gly Lys His Leu Ile Glu Val Ser Asp Val
65 70 75 80

ATT AAC GAC ACC ACG CAG CCC AAT TTG GAC ATG AAT CTT TTG ACG ACT 346
Ile Asn Asp Thr Thr Gln Pro Asn Leu Asp Met Asn Leu Leu Thr Thr
85 90 95

GAA ATC GCG CGG CAG TTG CGG TTG CGA TCT AAT GGG AGG TTC AAT ATC 394
Glu Ile Ala Arg Gln Leu Arg Leu Arg Ser Asn Gly Arg Phe Asn Ile
100 105 110

ACA AGG GCG AGC GGA GGG AGT GGC ATT GCA GCC GAT AGC AGA ATG GTG 442
Thr Arg Ala Ser Gly Gly Ser Gly Ile Ala Ala Asp Ser Arg Met Val
115 120 125

AAA CAG CGC GAA AAA GAA CGA GAG AGC GAA GAG TAT AAT CAA GAC ACC 490
Lys Gln Arg Glu Lys Glu Arg Glu Ser Glu Glu Tyr Asn Gln Asp Thr
130 135 140

ACT GTA GAA AAA GGC ACT TTA AAA GCC GCT GAT TTA TCT TTA AGT GGT 538
Thr Val Glu Lys Gly Thr Leu Lys Ala Ala Asp Leu Ser Leu Ser Gly
145 150 155 160

AAA GTA TCT AGT ATC GCA GCC TCT ATT AGT AGT TCT AGG CAG CGC TTG 586
Lys Val Ser Ser Ile Ala Ala Ser Ile Ser Ser Ser Arg Gln Arg Leu
165 170 175

GAC TAT GAC TTC ACC CTA AGC CTT ACC AAC AGG AAA ACG GGT GAA GAG 634
Asp Tyr Asp Phe Thr Leu Ser Leu Thr Asn Arg Lys Thr Gly Glu Glu
180 185 190

GTA TGG AGC GAT GTT AAG CCT ATT GTG AAG AAC GCT AGC AAT AAG CGT 682
Val Trp Ser Asp Val Lys Pro Ile Val Lys Asn Ala Ser Asn Lys Arg
195 200 205
GAC GCT CCC ATA AGA AGC GGT AAA ATA AAA TAG GGA TCC ATG ATG GAT 357
Asp Ala Pro Ile Arg Ser Gly Lys Ile Lys Tyr Gly Ser Met Met Asp
70 75 80 

AAA TCA TGAATCCATA AGATCCACTC TGAGCTTTTC AATTCCACAG CGTTATAAAG CA 415 
Lys Ser 
85 

CTCTATAA 423 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Met Arg Phe Leu Phe Ser Lys Thr Leu Leu Met Met Ser Cys Cys Asn
1 5 10 15

Thr Glu Arg Met Leu Phe Val Val Gln Tyr Lys Thr Asn Pro Ala Gly
20 25 30

Lys Val Ile Lys Lys Ile Val Asn Asn Arg Gly Lys Ser Leu Lys Ile
35 40 45

Phe Ala Cys Met Gly Ser Val Met Val Phe Gly Val Thr Leu Trp Cys
50 55 60

Gln Tyr Ile Asp Ala Pro Ile Arg Ser Gly Lys Ile Lys Tyr Gly Ser
65 70 75 80

Met Met Asp Lys Ser 
85 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 59... 688 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
180 185 190

Phe Asn Leu Val Asn Thr Gly Val Ile Asn Ile Leu Met Leu Ile Gly
195 200 205

Ile Val Leu Ile Val Leu Ala Thr Ile Phe Ala Ile Ile Tyr Val Glu
210 215 220

Leu Ala Glu Arg Arg Ile Pro Ile Ser Tyr Ala Arg Lys Val Val Met
225 230 235 240

Gln Asn Gln Asn Lys Arg Ile Met Asn Tyr Ile Pro Ile Lys Leu Asn
245 250 255

Leu Ser Gly Val Ile Pro Pro Ile Phe Ala Ser Ala Leu Leu Val Phe
260 265 270

Pro Ser Thr Ile Leu Gln Gln Ala Thr Ser Asn Lys Thr Leu Gln Ala
275 280 285

Val Ala Xaa Phe Leu Ser Pro Gln Gly Met Arg Ile Ile Phe
290 295 300

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 109... 363 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

AATGGGGCAA TTCAAGGCGA TAGAAGCTTG AATGAGGGCT TCTTCTAAGG TTTTGGCTTT 60 
GATTTCAATA AAATTTTGCA TCAATGTTCC TTTTTGTTTT GCGCATGC ATG CGT TTT 117 

Met Arg Phe 
1 

TTA TTC TCT AAG ACT TTA TTG ATG ATG AGT TGT TGC AAC ACC GAA AGG 165 
Leu Phe Ser Lys Thr Leu Leu Met Met Ser Cys Cys Asn Thr Glu Arg 
5 10 15 

ATG TTG TTC GTG GTC CAA TAC AAG ACT AAC CCT GCC GGG AAA GTG ATT 213
Met Leu Phe Val Val Gln Tyr Lys Thr Asn Pro Ala Gly Lys Val Ile
20 25 30 35

AAA AAG ATT GTG AAT AAT AGG GGT AAG AGT TTA AAA ATC TTT GCT TGC 261
Lys Lys Ile Val Asn Asn Arg Gly Lys Ser Leu Lys Ile Phe Ala Cys
40 45 50

ATG GGA TCG GTC ATG GTG TTT GGC GTA ACG CTT TGG TGC CAA TAC ATA 309
Met Gly Ser Val Met Val Phe Gly Val Thr Leu Trp Cys Gln Tyr Ile
55 60 65 
AAC AAG CGC ATC ATG AAT TAC ATT CCT ATT AAG TTG AAT TTA AGT GGG 825
Asn Lys Arg Ile Met Asn Tyr Ile Pro Ile Lys Leu Asn Leu Ser Gly
245 250 255

GTG ATC CCC CCT ATT TTC GCT TCA GCT TTG CTC GTG TTC CCT TCT ACG 873
Val Ile Pro Pro Ile Phe Ala Ser Ala Leu Leu Val Phe Pro Ser Thr
260 265 270 275

ATT TTG CAG CAA GCC ACA AGC AAC AAA ACC TTG CAA GCG GTT GCG NAT 921
Ile Leu Gln Gln Ala Thr Ser Asn Lys Thr Leu Gln Ala Val Ala Xaa
280 285 290

TTT TTA AGC CCG CAA GGT ATG CGT ATA ATA TTT TGATGTTCTT GCTCATCATC 974
Phe Leu Ser Pro Gln Gly Met Arg Ile Ile Phe
295 300



TTTTTTGCTT ACTTTTATTC TTCTATTGTG TTCAATTCTA AGGATATTGC GGATAATTTG 1034 
AGGCGTAATG GCGGGTATAT TCCAGGGCTT AGGCCTGGAG AGGGGAC 1081 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Met Asn Lys Ala Ile Ala Ser Lys Ile Leu Ile Thr Leu Gly Phe Leu
1 5 10 15

Phe Leu Tyr Arg Val Leu Ala Tyr Ile Pro Ile Pro Gly Val Asp Leu
20 25 30

Ala Ala Ile Lys Ala Phe Phe Asp Ser Asn Ser Asn Asn Ala Leu Gly
35 40 45

Leu Phe Asn Met Phe Ser Gly Asn Ala Val Ser Arg Leu Ser Ile Ile
50 55 60

Ser Leu Gly Ile Met Pro Tyr Ile Thr Ser Ser Ile Ile Met Glu Leu
65 70 75 80

Leu Ser Ala Thr Phe Pro Asn Leu Ala Lys Met Lys Lys Glu Arg Asp
85 90 95

Gly Met Gln Lys Tyr Met Gln Ile Val Arg Tyr Leu Thr Ile Leu Ile
100 105 110

Thr Leu Ile Gln Ala Val Ser Val Ser Val Gly Leu Arg Ser Ile Ser
115 120 125

Gly Gly Ala Asn Gly Ala Ile Met Ile Asp Met Gln Val Phe Met Ile
130 135 140

Val Ser Ala Phe Ser Met Leu Thr Gly Thr Met Leu Leu Met Trp Ile
145 150 155 160

Gly Glu Gln Ile Thr Gln Arg Gly Val Gly Asn Gly Ile Ser Leu Ile
165 170 175

Ile Phe Ala Gly Ile Val Ser Gly Ile Pro Ser Ala Ile Ser Gly Thr
Arg Val Leu Ala Tyr Ile Pro Ile Pro Gly Val Asp Leu Ala Ala Ile
20 25 30 35

AAG GCT TTT TTT GAC AGC AAT TCC AAC AAC GCT TTG GGG TTG TTT AAT 201
Lys Ala Phe Phe Asp Ser Asn Ser Asn Asn Ala Leu Gly Leu Phe Asn
40 45 50

ATG TTT AGC GGG AAT GCG GTT TCT CGC TTG AGC ATC ATC TCG TTG GGT 249
Met Phe Ser Gly Asn Ala Val Ser Arg Leu Ser Ile Ile Ser Leu Gly
55 60 65

ATC ATG CCC TAT ATC ACT TCT TCA ATT ATC ATG GAG CTT TTG AGC GCG 297
Ile Met Pro Tyr Ile Thr Ser Ser Ile Ile Met Glu Leu Leu Ser Ala
70 75 80

ACT TTC CCT AAC CTG GCT AAA ATG AAA AAA GAG CGG GAT GGC ATG CAA 345
Thr Phe Pro Asn Leu Ala Lys Met Lys Lys Glu Arg Asp Gly Met Gln
85 90 95

AAA TAC ATG CAA ATC GTG CGT TAT TTG ACC ATT TTA ATC ACC CTA ATC 393
Lys Tyr Met Gln Ile Val Arg Tyr Leu Thr Ile Leu Ile Thr Leu Ile
100 105 110 115

CAA GCG GTG AGC GTT TCA GTA GGC TTA AGG AGC ATT AGT GGA GGA GCC 441
Gln Ala Val Ser Val Ser Val Gly Leu Arg Ser Ile Ser Gly Gly Ala
120 125 130

AAT GGG GCG ATC ATG ATT GAT ATG CAA GTT TTT ATG ATC GTT TCA GCG 489
Asn Gly Ala Ile Met Ile Asp Met Gln Val Phe Met Ile Val Ser Ala
135 140 145

TTT TCT ATG CTT ACA GGA ACG ATG CTA CTC ATG TGG ATA GGG GAG CAA 537
Phe Ser Met Leu Thr Gly Thr Met Leu Leu Met Trp Ile Gly Glu Gln
150 155 160

ATC ACG CAA AGG GGC GTG GGG AAT GGG ATC AGT CTC ATT ATT TTT GCC 585
Ile Thr Gln Arg Gly Val Gly Asn Gly Ile Ser Leu Ile Ile Phe Ala
165 170 175

GGG ATT GTT TCA GGG ATC CCA TCA GCT ATT TCA GGC ACA TTC AAT TTG 633
Gly Ile Val Ser Gly Ile Pro Ser Ala Ile Ser Gly Thr Phe Asn Leu
180 185 190 195

GTC AAT ACG GGC GTT ATT AAT ATC TTA ATG CTC ATT GGT ATT GTG CTG 681
Val Asn Thr Gly Val Ile Asn Ile Leu Met Leu Ile Gly Ile Val Leu
200 205 210

ATT GTT TTA GCG ACT ATT TTT GCG ATT ATC TAT GTG GAA TTA GCT GAG 729
Ile Val Leu Ala Thr Ile Phe Ala Ile Ile Tyr Val Glu Leu Ala Glu
215 220 225

CGC AGG ATC CCT ATT TCT TAT GCG CGT AAA GTG GTG ATG CAA AAC CAA 777
Arg Arg Ile Pro Ile Ser Tyr Ala Arg Lys Val Val Met Gln Asn Gln
230 235 240
Ala Ser His Leu Arg Ala Leu Lys His Ala Ser Met Ala Lys Asn Phe
65 70 75 80

Gln Leu Leu Lys Lys Ile Arg Asp Tyr Asn His Asn Ile Pro Ile Gly
85 90 95

Leu Leu Ala Tyr Ala Asn Leu Ile Phe Ser Tyr Gly Val Asp Gly Phe
100 105 110

Tyr Ala Gln Ala Lys Glu Cys Gly Ile Asp Ser Val Leu Ile Ala Asp
115 120 125

Met Pro Leu Ile Glu Lys Glu Leu Val Ile Lys Ser Ala Gln Lys His
130 135 140

Gln Ile Lys Gln Ile Phe Ile Ala Ser Pro Asn Ala Ser Ser Lys Asp
145 150 155 160

Leu Glu Gln Val Ala Thr His Ser Gln Gly Tyr Ile Tyr Ala Leu Ala
165 170 175

Arg Ser Gly Val Thr Gly Ala Ser Arg Ile Leu Glu Asn Asp Ser Ser
180 185 190

Ala Ile Ile Lys Thr Leu Lys Ala Phe Ser Pro Thr Pro Ala Leu Leu
195 200 205

Gly Phe Gly Ile Ser Lys Lys Glu His Ile Thr Asn Ala Lys Gly Met
210 215 220

Gly Ala Asp Gly Val Ile Cys Gly Ser Ala Leu Val Lys Ile Ile Glu
225 230 235 240

Glu Asn Leu Asn Asn Glu Asn Ala Met Leu Glu Lys Ile Lys Gly Phe
245 250 255

Ile Gly Gly Met Ile Phe
260

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1081 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 954 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

AAGTAATGCC CCTGTTGTAT CAGCTTGATT TAAGAGGAAT AAGTTATT ATG AAT AAA 57
Met Asn Lys
1

GCT ATT GCT AGT AAG ATA CTC ATC ACT TTG GGT TTT TTA TTT CTC TAC 105
Ala Ile Ala Ser Lys Ile Leu Ile Thr Leu Gly Phe Leu Phe Leu Tyr
5 10 15

AGA GTC TTA GCT TAT ATC CCC ATT CCT GGC GTA GAT TTA GCA GCG ATC 153
AAA CAC CAA ATC AAG CAA ATC TTT ATC GCC AGC CCC AAT GCG AGC AGT 540
Lys His Gln Ile Lys Gln Ile Phe Ile Ala Ser Pro Asn Ala Ser Ser
145 150 155

AAA GAT TTA GAA CAA GTC GCT ACG CAT TCG CAA GGC TAT ATC TAC GCT 588
Lys Asp Leu Glu Gln Val Ala Thr His Ser Gln Gly Tyr Ile Tyr Ala
160 165 170

TTA GCC AGG AGT GGG GTT ACA GGG GCG AGC CGT ATT TTA GAG AAT GAT 636
Leu Ala Arg Ser Gly Val Thr Gly Ala Ser Arg Ile Leu Glu Asn Asp
175 180 185 190

TCG AGT GCT ATT ATT AAA ACC TTA AAA GCT TTT AGC CCT ACC CCA GCC 684
Ser Ser Ala Ile Ile Lys Thr Leu Lys Ala Phe Ser Pro Thr Pro Ala
195 200 205

TTA CTG GGC TTT GGC ATT TCC AAA AAA GAA CAC ATC ACA AAC GCT AAA 732
Leu Leu Gly Phe Gly Ile Ser Lys Lys Glu His Ile Thr Asn Ala Lys
210 215 220

GGC ATG GGT GCT GAT GGC GTG ATT TGC GGA TCA GCG TTA GTC AAA ATC 780
Gly Met Gly Ala Asp Gly Val Ile Cys Gly Ser Ala Leu Val Lys Ile
225 230 235

ATA GAA GAA AAT TTA AAC AAT GAA AAC GCC ATG CTG GAA AAA ATT AAA 828
Ile Glu Glu Asn Leu Asn Asn Glu Asn Ala Met Leu Glu Lys Ile Lys
240 245 250

GGG TTT ATA GGA GGA ATG ATT TTT TAAGGCTTTT AGGCTTTGTT GCGTTAAAAA 882
Gly Phe Ile Gly Gly Met Ile Phe
255 260

TTAAAGATCA CAGATTAAC 901 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Met Arg Tyr Gln Asn Met Phe Glu Thr Leu Lys Lys His Glu Lys Met
1 5 10 15

Ala Phe Ile Pro Phe Val Thr Leu Gly Asp Pro Asn Tyr Glu Leu Ser
20 25 30

Phe Glu Ile Ile Lys Thr Leu Ile Ile Ser Gly Val Ser Ala Leu Glu
35 40 45

Leu Gly Leu Ala Phe Ser Asp Pro Val Ala Asp Gly Ile Thr Ile Gln
50 55 60
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 67...852
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

GAATTTAAGC GGCAGAGGGG ATAAGGATTT AAGCACCGTT TATAACGCTT TAAAAGGAGG 60
TTTAAA ATG AGG TAT CAA AAC ATG TTT GAA ACC TTA AAA AAA CAC GAA 108
Met Arg Tyr Gln Asn Met Phe Glu Thr Leu Lys Lys His Glu
1 5 10

AAA ATG GCG TTT ATC CCG TTT GTA ACC TTG GGC GAT CCT AAT TAT GAA 156
Lys Met Ala Phe Ile Pro Phe Val Thr Leu Gly Asp Pro Asn Tyr Glu
15 20 25 30

TTG AGT TTT GAA ATC ATT AAA ACC CTA ATT ATT AGC GGG GTG AGC GCT 204
Leu Ser Phe Glu Ile Ile Lys Thr Leu Ile Ile Ser Gly Val Ser Ala
35 40 45

TTA GAA TTG GGT CTT GCT TTT TCT GAT CCT GTG GCG GAT GGC ATT ACC 252
Leu Glu Leu Gly Leu Ala Phe Ser Asp Pro Val Ala Asp Gly Ile Thr
50 55 60

ATA CAA GCG AGC CAT TTA AGG GCG TTA AAA CAC GCT AGC ATG GCT AAA 300
Ile Gln Ala Ser His Leu Arg Ala Leu Lys His Ala Ser Met Ala Lys
65 70 75

AAT TTC CAG CTT TTA AAA AAG ATT AGA GAT TAC AAC CAC AAT ATT CCC 348
Asn Phe Gln Leu Leu Lys Lys Ile Arg Asp Tyr Asn His Asn Ile Pro
80 85 90

ATA GGG CTT TTA GCG TAT GCG AAT TTA ATT TTT TCT TAT GGC GTT GAT 396
Ile Gly Leu Leu Ala Tyr Ala Asn Leu Ile Phe Ser Tyr Gly Val Asp
95 100 105 110

GGC TTT TAC GCT CAA GCT AAA GAA TGC GGT ATA GAT AGC GTT TTA ATA 444
Gly Phe Tyr Ala Gln Ala Lys Glu Cys Gly Ile Asp Ser Val Leu Ile
115 120 125

GCG GAC ATG CCC CTA ATA GAA AAA GAA TTA GTC ATC AAA TCC GCT CAA 492
Ala Asp Met Pro Leu Ile Glu Lys Glu Leu Val Ile Lys Ser Ala Gln
130 135 140
85 90 95

TTT AGG ATT AAG ATT ATT GAT ATT TAT TTG GCA GAT TGC CCC AGT AGG 393
Phe Arg Ile Lys Ile Ile Asp Ile Tyr Leu Ala Asp Cys Pro Ser Arg
100 105 110 115

GCT ATT TTA TTG AAG CGT TTA GGG TTA AAG ATC GTG GTT TTT CTA TGC 441
Ala Ile Leu Leu Lys Arg Leu Gly Leu Lys Ile Val Val Phe Leu Cys
120 125 130

CCC TTT TTA TGG TTT GTT GCG TTT AAA AAC CCC TAT CAT AGG GCG TGG 489
Pro Phe Leu Trp Phe Val Ala Phe Lys Asn Pro Tyr His Arg Ala Trp
135 140 145

CAT GAA GAA AAA AGC AAA AGT CTT TTG GTA TTG TTT TAATCATGAT TTATTG 541
His Glu Glu Lys Ser Lys Ser Leu Leu Val Leu Phe
150 155

GTTGTATTTG GCGGTCTTTT TTTTGTTGAG CGCATTAGAC 581

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Met His Ser Pro Asn Leu Glu Lys Glu Glu Thr Glu Ile Ile Glu Thr
1 5 10 15

Leu Leu Met Arg Glu Lys Met Arg Leu Cys Pro Leu Tyr Trp Arg Ile
20 25 30

Leu Ala Phe Leu Thr Asp Gly Leu Leu Val Ala Phe Leu Leu Ser Asp
35 40 45

Leu Leu Asp Ala Cys Asp Phe Leu His Ser Leu Tyr Trp Leu Ala Asn
50 55 60

Pro Ile Tyr His Ser Ala Phe Val Ala Met Gly Phe Ile Ile Leu Tyr
65 70 75 80

Gly Val Tyr Glu Ile Phe Phe Val Cys Leu Cys Lys Met Ser Leu Ala
85 90 95

Lys Leu Val Phe Arg Ile Lys Ile Ile Asp Ile Tyr Leu Ala Asp Cys
100 105 110

Pro Ser Arg Ala Ile Leu Leu Lys Arg Leu Gly Leu Lys Ile Val Val
115 120 125

Phe Leu Cys Pro Phe Leu Trp Phe Val Ala Phe Lys Asn Pro Tyr His 

130 135 140 

Arg Ala Trp His Glu Glu Lys Ser Lys Ser Leu Leu Val Leu Phe 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 83: 
Thr Lys Lys Gly Tyr Glu Lys Asp Ala Arg Val Leu Ser Ala Tyr Glu
945 950 955 960

Glu Glu Lys Ala Lys Leu Asp Met Glu His Phe Asp Arg Leu Thr Met
965 970 975

Leu Asn Arg Glu Glu Leu Leu Arg Val Thr Arg Ser Phe Leu Lys Arg
980 985 990

Phe


(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 525 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

AGGTTAAAGT TTAAGACAAA CCAAAGAGTT TGTCTTGTTT GTTTTTGA ATG CAC TCT 57 

Met His Ser 
1 

CCA AAT TTA GAA AAA GAA GAA ACC GAA ATC ATA GAA ACA CTC CTT ATG 105
Pro Asn Leu Glu Lys Glu Glu Thr Glu Ile Ile Glu Thr Leu Leu Met
5 10 15

CGT GAA AAA ATG CGT TTA TGC CCC TTG TAT TGG CGC ATC TTA GCG TTT 153
Arg Glu Lys Met Arg Leu Cys Pro Leu Tyr Trp Arg Ile Leu Ala Phe
20 25 30 35

TTA ACC GAT GGT TTG TTA GTG GCG TTT TTA TTG AGC GAT CTT TTA GAC 201
Leu Thr Asp Gly Leu Leu Val Ala Phe Leu Leu Ser Asp Leu Leu Asp
40 45 50

GCA TGC GAT TTC TTG CAT TCT TTA TAT TGG CTA GCT AAC CCT ATT TAT 249
Ala Cys Asp Phe Leu His Ser Leu Tyr Trp Leu Ala Asn Pro Ile Tyr
55 60 65

CAC AGC GCA TTT GTT GCG ATG GGT TTT ATC ATC TTG TAT GGC GTT TAT 297
His Ser Ala Phe Val Ala Met Gly Phe Ile Ile Leu Tyr Gly Val Tyr
70 75 80

GAA ATC TTT TTT GTG TGT TTG TGC AAG ATG AGC TTG GCT AAA CTG GTT 345
Glu Ile Phe Phe Val Cys Leu Cys Lys Met Ser Leu Ala Lys Leu Val
500 505 510

Thr Ser Thr Ile Met Glu Phe Phe Met Gly Gly Gln Leu Ser Gln Phe
515 520 525

Met Asp Gln Thr Asn Pro Leu Ser Glu Val Thr His Lys Arg Arg Leu
530 535 540

Ser Ala Leu Gly Glu Gly Gly Leu Val Lys Asp Arg Val Gly Phe Glu
545 550 555 560

Ala Arg Asp Val His Pro Thr His Tyr Gly Arg Ile Cys Pro Ile Glu
565 570 575

Thr Pro Glu Gly Gln Asn Ile Gly Leu Ile Asn Thr Leu Ser Thr Phe
580 585 590

Thr Arg Val Asn Asp Leu Gly Phe Ile Glu Ala Pro Tyr Lys Lys Val
595 600 605

Val Asp Gly Lys Val Val Gly Glu Thr Ile Tyr Leu Thr Ala Ile Gln
610 615 620

Glu Asp Ser His Ile Ile Ala Pro Ala Ser Thr Pro Ile Asp Glu Glu
625 630 635 640

Gly Asn Ile Leu Gly Asp Leu Ile Glu Thr Arg Val Glu Gly Glu Ile
645 650 655

Val Leu Asn Glu Lys Ser Lys Val Thr Leu Met Asp Leu Ser Ser Ser
660 665 670

Met Leu Val Gly Val Ala Ala Ser Leu Ile Pro Phe Leu Glu His Asp
675 680 685

Asp Ala Asn Arg Ala Leu Met Gly Thr Asn Met Gln Arg Gln Ala Val
690 695 700

Pro Leu Leu Arg Ser Asp Ala Pro Ile Val Gly Thr Gly Ile Glu Lys
705 710 715 720

Ile Ile Ala Arg Asp Ser Trp Gly Ala Ile Lys Ala Asn Arg Ala Gly
725 730 735

Val Val Glu Lys Ile Asp Ser Lys Asn Ile Tyr Ile Leu Gly Glu Ser
740 745 750

Lys Glu Glu Ala Tyr Ile Asp Ala Tyr Ser Leu Gln Lys Asn Leu Arg
755 760 765

Thr Asn Gln Asn Thr Ser Phe Asn Gln Val Pro Ile Val Lys Val Gly
770 775 780

Asp Lys Val Gly Ala Gly Gln Ile Ile Ala Asp Gly Pro Ser Met Asp
785 790 795 800

Arg Gly Glu Leu Ala Leu Gly Lys Asn Val Arg Val Ala Phe Met Pro
805 810 815

Trp Asn Gly Tyr Asn Phe Glu Asp Ala Ile Val Val Ser Glu Cys Ile
820 825 830

Thr Lys Asp Asp Ile Phe Thr Ser Thr His Ile Tyr Glu Lys Glu Val
835 840 845

Asp Ala Arg Glu Leu Lys His Gly Val Glu Glu Phe Thr Ala Asp Ile
850 855 860

Pro Asp Val Lys Glu Glu Ala Leu Ala His Leu Asp Glu Ser Gly Ile
865 870 875 880

Val Lys Val Gly Thr Tyr Val Ser Ala Gly Met Ile Leu Val Gly Lys
885 890 895

Thr Ser Pro Lys Gly Glu Ile Lys Ser Thr Pro Glu Glu Arg Leu Leu
900 905 910

Arg Ala Ile Phe Gly Asp Lys Ala Gly His Val Val Asn Lys Ser Leu
915 920 925

Tyr Cys Pro Pro Ser Leu Glu Gly Thr Val Ile Asp Val Lys Val Phe
930 935 940
Arg Ile Thr Leu Glu Tyr Ala Gly Cys Glu Phe Gly Lys Ser Lys Tyr
65 70 75 80

Thr Val Arg Glu Ala Met Glu Arg Gly Ile Thr Tyr Ser Ile Pro Leu
85 90 95

Lys Ile Lys Val Arg Leu Ile Leu Trp Glu Lys Asp Thr Lys Ser Gly
100 105 110

Glu Lys Asn Gly Ile Lys Asp Ile Lys Glu Gln Ser Ile Phe Ile Arg
115 120 125

Glu Ile Pro Leu Met Thr Glu Arg Thr Ser Phe Ile Ile Asn Gly Val
130 135 140

Glu Arg Val Val Val Asn Gln Leu His Arg Ser Pro Gly Val Ile Phe
145 150 155 160

Lys Glu Glu Glu Ser Ser Thr Ser Leu Asn Lys Leu Ile Tyr Thr Gly
165 170 175

Gln Ile Ile Pro Asp Arg Gly Ser Trp Leu Tyr Phe Glu Tyr Asp Ser
180 185 190

Lys Asp Val Leu Tyr Ala Arg Ile Asn Lys Arg Arg Lys Val Pro Val
195 200 205

Thr Ile Leu Phe Arg Ala Met Asp Tyr Gln Lys Gln Asp Ile Ile Lys
210 215 220

Met Phe Tyr Pro Leu Val Lys Val Arg Tyr Glu Asn Asp Lys Tyr Leu
225 230 235 240

Ile Pro Phe Ala Ser Leu Asp Ala Asn Gln Arg Met Glu Phe Asp Leu
245 250 255

Lys Asp Pro Gln Gly Lys Val Ile Leu Leu Ala Gly Lys Lys Leu Thr
260 265 270

Ser Arg Lys Ile Lys Glu Leu Lys Glu Asn His Leu Glu Trp Val Glu
275 280 285

Tyr Pro Met Asp Ile Leu Leu Asn Arg His Leu Ala Glu Pro Val Met
290 295 300

Val Gly Lys Glu Val Leu Leu Asp Met Leu Thr Gln Leu Asp Lys Asn
305 310 315 320

Lys Leu Glu Lys Ile His Asp Leu Gly Val Gln Glu Phe Val Ile Ile
325 330 335

Asn Asp Leu Ala Leu Gly His Asp Ala Ser Ile Ile Gln Ser Phe Ser
340 345 350

Ala Asp Ser Glu Ser Leu Lys Leu Leu Lys Gln Thr Glu Lys Ile Asp
355 360 365

Asp Glu Asn Ala Leu Ala Ala Ile Arg Ile His Lys Val Met Lys Pro
370 375 380

Gly Asp Pro Val Thr Thr Glu Val Ala Lys Gln Phe Val Lys Lys Leu
385 390 395 400

Phe Phe Asp Pro Glu Arg Tyr Asp Leu Thr Met Val Gly Arg Met Lys
405 410 415

Met Asn His Lys Leu Gly Leu His Val Pro Asp Tyr Ile Thr Thr Leu
420 425 430

Thr His Glu Asp Ile Ile Thr Thr Val Lys Tyr Leu Met Lys Ile Lys
435 440 445

Asn Asn Gln Gly Lys Ile Asp Asp Arg Asp His Leu Gly Asn Arg Arg
450 455 460

Ile Arg Ala Val Gly Glu Leu Leu Ala Asn Glu Leu His Ser Gly Leu
465 470 475 480

Val Lys Met Gln Lys Thr Ile Lys Asp Lys Leu Thr Thr Met Ser Gly
485 490 495

Ala Phe Asp Ser Leu Met Pro His Asp Leu Val Asn Ser Lys Met Ile



WO 98/43478



PCT/US98/06371 



AAA GAA GAA GCG CTC GCT CAT CTT GAT GAA AGC GGG ATC GTT AAA GTC 2697
Lys Glu Glu Ala Leu Ala His Leu Asp Glu Ser Gly Ile Val Lys Val
870 875 880

GGT ACT TAT GTG AGC GCT GGC ATG ATT TTG GTG GGC AAA ACT TCT CCT 2745
Gly Thr Tyr Val Ser Ala Gly Met Ile Leu Val Gly Lys Thr Ser Pro
885 890 895

AAA GGC GAG ATT AAA AGC ACG CCT GAA GAG CGG CTT TTA AGG GCT ATT 2793
Lys Gly Glu Ile Lys Ser Thr Pro Glu Glu Arg Leu Leu Arg Ala Ile
900 905 910 915

TTT GGG GAT AAA GCC GGG CAT GTG GTC AAT AAG AGT TTG TAT TGC CCT 2841
Phe Gly Asp Lys Ala Gly His Val Val Asn Lys Ser Leu Tyr Cys Pro
920 925 930

CCC AGT TTG GAA GGC ACG GTG ATT GAT GTG AAA GTC TTC ACT AAA AAA 2889
Pro Ser Leu Glu Gly Thr Val Ile Asp Val Lys Val Phe Thr Lys Lys
935 940 945

GGC TAT GAG AAA GAC GCG CGA GTT TTG AGC GGG TAT GAA GAA GAA AAA 2937
Gly Tyr Glu Lys Asp Ala Arg Val Leu Ser Ala Tyr Glu Glu Glu Lys
950 955 960

GCC AAG CTT GAT ATG GAG CAT TTT GAT CGC TTG ACC ATG CTC AAT AGA 2985
Ala Lys Leu Asp Met Glu His Phe Asp Arg Leu Thr Met Leu Asn Arg
965 970 975

GAA GAA TTG TTG CGC GTT ACT CGC TCC TTT CTC AAG CGA TTT TAGAAGAGC 3036
Glu Glu Leu Leu Arg Val Thr Arg Ser Phe Leu Lys Arg Phe
980 985 990

CTTTCAGCCA TAACGGCAAG GATTATAAAG AAGGCGATCA AATCCCTA 3084
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Met Ser Lys Lys Ile Pro Leu Lys Asn Arg Leu Arg Ala Asp Phe Thr
1 5 10 15

Lys Thr Pro Thr Asp Leu Glu Val Pro Asn Leu Leu Leu Leu Gln Arg
20 25 30

Asp Ser Tyr Asp Ser Phe Leu Tyr Ser Lys Glu Gly Lys Glu Ser Gly
35 40 45

Ile Glu Lys Val Phe Lys Ser Ile Phe Pro Ile Gln Asp Glu His Asn
50 55 60 
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TTG GGC GAT TTG ATT GAA ACG CGC GTG GAA GGC GAG ATC GTT TTA AAC 2 025 

Leu Gly Asp Leu lie Glu Thr Arg Val Glu Gly Glu lie Val Leu Asn 
645 650 655 

GAA AAA AGC AAA GTA ACC TTA ATG GAT TTA AGC TCT AGC ATG CTA GTG 2073

Glu Lys Ser Lys Val Thr Leu Met Asp Leu Ser Ser Ser Met Leu Val 
660 665 670 675 

GGG GTA GCC GCA TCG CTC ATT CCT TTC TTA GAG CAT GAT GAG GCC AAC 2121 
Gly Val Ala Ala Ser Leu Ile Pro Phe Leu Glu His Asp Asp Ala Asn
680 685 690 

CGT GCC TTA ATG GGG ACT AAC ATG CAG CGC CAA GCG GTG CCC TTA TTA 2169 
Arg Ala Leu Met Gly Thr Asn Met Gln Arg Gln Ala Val Pro Leu Leu
695 700 705 

AGA AGC GAC GCT CCC ATT GTA GGC ACG GGG ATT GAA AAA ATT ATT GCT 2217
Arg Ser Asp Ala Pro Ile Val Gly Thr Gly Ile Glu Lys Ile Ile Ala
710 715 720 

AGG GAT TCT TGG GGA GCG ATC AAA GCC AAT CGC GCA GGC GTT GTA GAA 2265
Arg Asp Ser Trp Gly Ala Ile Lys Ala Asn Arg Ala Gly Val Val Glu
725 730 735 

AAA ATT GAT TCT AAA AAT ATT TAT ATT TTA GGC GAA AGC AAA GAA GAA 2313 
Lys Ile Asp Ser Lys Asn Ile Tyr Ile Leu Gly Glu Ser Lys Glu Glu
740 745 750 755 

GCC TAT ATT GAT GCG TAT TCT TTG CAA AAA AAC TTG CGC ACC AAC CAA 2361
Ala Tyr Ile Asp Ala Tyr Ser Leu Gln Lys Asn Leu Arg Thr Asn Gln
760 765 770 

AAC ACC AGT TTC AAT CAA GTC CCT ATC GTT AAA GTG GGC GAT AAA GTG 2409
Asn Thr Ser Phe Asn Gln Val Pro Ile Val Lys Val Gly Asp Lys Val
775 780 785 

GGA GCC GGG CAA ATC ATC GCT GAT GGC CCT AGC ATG GAT AGA GGC GAG 2457
Gly Ala Gly Gln Ile Ile Ala Asp Gly Pro Ser Met Asp Arg Gly Glu
790 795 800 

TTG GCG TTA GGG AAA AAT GTG CGC GTG GCG TTC ATG CCT TGG AAT GGC 2505

Leu Ala Leu Gly Lys Asn Val Arg Val Ala Phe Met Pro Trp Asn Gly 
805 810 815 

TAT AAC TTT GAA GAC GCG ATC GTG GTG AGT GAG TGC ATC ACT AAA GAT 2553
Tyr Asn Phe Glu Asp Ala Ile Val Val Ser Glu Cys Ile Thr Lys Asp
820 825 830 835 

GAT ATT TTC ACT TCC ACC CAC ATT TAT GAA AAA GAA GTG GAT GCT AGG 2601
Asp Ile Phe Thr Ser Thr His Ile Tyr Glu Lys Glu Val Asp Ala Arg
840 845 850 

GAG CTT AAG CAT GGT GTG GAA GAA TTT ACC GCT GAT ATT CCT GAT GTG 2649
Glu Leu Lys His Gly Val Glu Glu Phe Thr Ala Asp Ile Pro Asp Val
855 860 865 
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AAG TTA GGC TTG CAT GTG CCT GAT TAC ATT ACG ACT TTA ACG CAT GAA 1353
Lys Leu Gly Leu His Val Pro Asp Tyr Ile Thr Thr Leu Thr His Glu
420 425 430 435 

GAT ATT ATC ACC ACC GTT AAA TAC CTC ATG AAG ATC AAA AAC AAT CAA 1401
Asp Ile Ile Thr Thr Val Lys Tyr Leu Met Lys Ile Lys Asn Asn Gln
440 445 450 

GGC AAG ATT GAT GAC AGG GAC CAC TTG GGC AAT CGT AGG ATT AGG GCG 1449
Gly Lys Ile Asp Asp Arg Asp His Leu Gly Asn Arg Arg Ile Arg Ala
455 460 465 

GTA GGG GAA TTG TTG GCC AAT GAA TTG CAT TCA GGT TTA GTG AAA ATG 1497

Val Gly Glu Leu Leu Ala Asn Glu Leu His Ser Gly Leu Val Lys Met 
470 475 480 

CAA AAG ACC ATT AAA GAC AAG CTC ACT ACC ATG AGC GGG GCT TTT GAT 1545
Gln Lys Thr Ile Lys Asp Lys Leu Thr Thr Met Ser Gly Ala Phe Asp
485 490 495 

TCG CTC ATG CCC CAT GAC TTG GTC AAT TCT AAA ATG ATC ACA AGC ACC 1593
Ser Leu Met Pro His Asp Leu Val Asn Ser Lys Met Ile Thr Ser Thr
500 505 510 515 

ATC ATG GAA TTT TTC ATG GGC GGT CAG CTC TCG CAA TTT ATG GAT CAA 1641 
Ile Met Glu Phe Phe Met Gly Gly Gln Leu Ser Gln Phe Met Asp Gln
520 525 530 

ACG AAT CCC TTG AGT GAG GTT ACG CAC AAG CGC CGC CTT TCA GCG CTC 1689

Thr Asn Pro Leu Ser Glu Val Thr His Lys Arg Arg Leu Ser Ala Leu 
535 540 545 

GGC GAA GGG GGG TTG GTG AAA GAC AGA GTG GGG TTT GAA GCC AGG GAT 1737 
Gly Glu Gly Gly Leu Val Lys Asp Arg Val Gly Phe Glu Ala Arg Asp 
550 555 560 

GTG CAC CCC ACG CAT TAT GGC CGA ATT TGT CCC ATT GAG ACC CCA GAA 1785
Val His Pro Thr His Tyr Gly Arg Ile Cys Pro Ile Glu Thr Pro Glu
565 570 575 

GGT CAA AAT ATC GGT CTG ATC AAC ACC CTT TCC ACT TTC ACA AGA GTG 1833
Gly Gln Asn Ile Gly Leu Ile Asn Thr Leu Ser Thr Phe Thr Arg Val
580 585 590 595 

AAT GAT TTA GGC TTT ATT GAA GCC CCT TAT AAA AAG GTT GTG GAT GGC 1881 
Asn Asp Leu Gly Phe Ile Glu Ala Pro Tyr Lys Lys Val Val Asp Gly
600 605 610 

AAG GTC GTG GGT GAG ACG ATT TAT TTG ACC GCT ATT CAA GAA GAC AGC 1929
Lys Val Val Gly Glu Thr Ile Tyr Leu Thr Ala Ile Gln Glu Asp Ser
615 620 625 

CAC ATC ATC GCT CCC GCA AGC ACC CCC ATT GAT GAA GAG GGG AAT ATT 1977
His Ile Ile Ala Pro Ala Ser Thr Pro Ile Asp Glu Glu Gly Asn Ile
630 635 640
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Leu Tyr Ala Arg Ile Asn Lys Arg Arg Lys Val Pro Val Thr Ile Leu
200 205 210 

TTC AGG GCG ATG GAT TAT CAA AAA CAA GAC ATC ATC AAA ATG TTC TAC 729
Phe Arg Ala Met Asp Tyr Gln Lys Gln Asp Ile Ile Lys Met Phe Tyr
215 220 225 

CCG CTT GTT AAA GTG CGT TAT GAA AAC GAT AAA TAT TTG ATC CCG TTT 777
Pro Leu Val Lys Val Arg Tyr Glu Asn Asp Lys Tyr Leu Ile Pro Phe
230 235 240 

GCT TCA TTA GAC GCC AAT CAA AGA ATG GAA TTT GAC TTG AAA GAT CCT 825 
Ala Ser Leu Asp Ala Asn Gln Arg Met Glu Phe Asp Leu Lys Asp Pro
245 250 255 

CAA GGC AAG GTT ATT CTT TTA GCG GGT AAA AAG CTC ACT TCA AGA AAG 873
Gln Gly Lys Val Ile Leu Leu Ala Gly Lys Lys Leu Thr Ser Arg Lys
260 265 270 275 

ATT AAA GAG CTT AAA GAA AAC CAT TTA GAA TGG GTG GAA TAC CCT ATG 921 
Ile Lys Glu Leu Lys Glu Asn His Leu Glu Trp Val Glu Tyr Pro Met
280 285 290 

GAT ATT TTA CTC AAT CGC CAT TTA GCT GAG CCT GTT ATG GTA GGG AAA 969 
Asp Ile Leu Leu Asn Arg His Leu Ala Glu Pro Val Met Val Gly Lys
295 300 305 

GAA GTC TTA TTG GAC ATG CTC ACT CAG CTA GAT AAA AAC AAA TTA GAA 1017 
Glu Val Leu Leu Asp Met Leu Thr Gln Leu Asp Lys Asn Lys Leu Glu
310 315 320 

AAA ATC CAC GAT TTA GGC GTG CAA GAA TTT GTG ATC ATC AAC GAT CTG 1065 
Lys Ile His Asp Leu Gly Val Gln Glu Phe Val Ile Ile Asn Asp Leu
325 330 335 

GCG TTA GGG CAT GAC GCT TCC ATT ATC CAA TCT TTT TCA GCC GAT TCT 1113 
Ala Leu Gly His Asp Ala Ser Ile Ile Gln Ser Phe Ser Ala Asp Ser
340 345 350 355 

GAG TCT TTG AAA TTA CTC AAG CAA ACC GAA AAA ATT GAT GAT GAA AAC 1161 
Glu Ser Leu Lys Leu Leu Lys Gln Thr Glu Lys Ile Asp Asp Glu Asn
360 365 370 

GCT CTA GCG GCG ATT CGT ATC CAT AAG GTT ATG AAA CCA GGC GAT CCC 1209
Ala Leu Ala Ala Ile Arg Ile His Lys Val Met Lys Pro Gly Asp Pro
375 380 385 

GTT ACG ACT GAA GTG GCT AAG CAG TTT GTC AAA AAA CTT TTC TTT GAT 1257 
Val Thr Thr Glu Val Ala Lys Gln Phe Val Lys Lys Leu Phe Phe Asp
390 395 400 

CCA GAA CGC TAT GAT TTG ACC ATG GTG GGC CGC ATG AAA ATG AAT CAC 1305

Pro Glu Arg Tyr Asp Leu Thr Met Val Gly Arg Met Lys Met Asn His 
405 410 415 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

GAATTAATGC ATTAAATAAC TCAAAATTTT TGATCAAAGG CTTGAAAT ATG TCA AAA 57 

Met Ser Lys 
1 

AAA ATT CCC CTA AAA AAC CGC TTG AGA GCT GAT TTT ACA AAA ACC CCA 105 
Lys Ile Pro Leu Lys Asn Arg Leu Arg Ala Asp Phe Thr Lys Thr Pro
5 10 15 

ACA GAT TTA GAA GTC CCT AAT TTA TTA TTA TTA CAA CGA GAC AGC TAT 153 
Thr Asp Leu Glu Val Pro Asn Leu Leu Leu Leu Gln Arg Asp Ser Tyr
20 25 30 35 

GAT TCT TTC TTG TAT TCT AAA GAG GGT AAA GAG AGC GGG ATT GAA AAG 201
Asp Ser Phe Leu Tyr Ser Lys Glu Gly Lys Glu Ser Gly Ile Glu Lys
40 45 50 

GTT TTT AAA TCC ATT TTC CCT ATC CAA GAT GAG CAT AAC CGC ATC ACT 249
Val Phe Lys Ser Ile Phe Pro Ile Gln Asp Glu His Asn Arg Ile Thr
55 60 65 

TTA GAA TAC GCG GGT TGC GAA TTT GGC AAG TCT AAA TAC ACC GTT AGA 297

Leu Glu Tyr Ala Gly Cys Glu Phe Gly Lys Ser Lys Tyr Thr Val Arg 
70 75 80 

GAA GCG ATG GAG AGG GGC ATT ACC TAC TCT ATC CCT CTC AAA ATT AAG 345
Glu Ala Met Glu Arg Gly Ile Thr Tyr Ser Ile Pro Leu Lys Ile Lys
85 90 95 

GTG CGC TTG ATC TTG TGG GAA AAA GAT ACC AAG AGT GGC GAA AAG AAC 393
Val Arg Leu Ile Leu Trp Glu Lys Asp Thr Lys Ser Gly Glu Lys Asn
100 105 110 115 

GGC ATT AAG GAT ATT AAA GAA CAA AGC ATT TTC ATT CGT GAG ATC CCT 441 
Gly Ile Lys Asp Ile Lys Glu Gln Ser Ile Phe Ile Arg Glu Ile Pro
120 125 130 

TTG ATG ACA GAA CGC ACT TCA TTT ATT ATT AAT GGG GTG GAG CGC GTG 489
Leu Met Thr Glu Arg Thr Ser Phe Ile Ile Asn Gly Val Glu Arg Val
135 140 145 

GTG GTC AAT CAA CTC CAC AGA AGC CCC GGT GTG ATT TTC AAA GAA GAA 537 
Val Val Asn Gln Leu His Arg Ser Pro Gly Val Ile Phe Lys Glu Glu
150 155 160 

GAG TCT AGC ACT TCT TTA AAC AAG CTC ATT TAC ACA GGG CAA ATC ATT 585 
Glu Ser Ser Thr Ser Leu Asn Lys Leu Ile Tyr Thr Gly Gln Ile Ile
165 170 175 

CCT GAT AGG GGT TCG TGG TTG TAT TTT GAA TAC GAT TCT AAA GAT GTT 633

Pro Asp Arg Gly Ser Trp Leu Tyr Phe Glu Tyr Asp Ser Lys Asp Val 
180 185 190 195 

TTA TAC GCT CGT ATC AAT AAA CGC CGT AAA GTG CCT GTT ACC ATT TTA 681 
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(v) FRAGMENT TYPE : internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

Met Gln Lys Thr Ser Asn Thr Leu Ala Leu Gly Ser Leu Thr Ala Leu

1 5 10 15

Phe Phe Leu Met Gly Phe Ile Thr Val Leu Asn Asp Ile Leu Ile Pro

20 25 30 

His Leu Lys Pro Ile Phe Asp Leu Thr Tyr Phe Glu Ala Ser Leu Ile

35 40 45 

Gln Phe Cys Phe Phe Gly Ala Tyr Phe Ile Met Gly Gly Val Phe Gly

50 55 60 

Asn Val Ile Ser Lys Ile Gly Tyr Pro Phe Gly Val Val Leu Gly Phe
65 70 75 80 

Val Ile Thr Ala Thr Gly Cys Ala Leu Phe Tyr Pro Ala Ala His Phe

85 90 95 

Gly Ser Tyr Gly Phe Phe Leu Gly Ala Leu Phe Ile Leu Ala Ser Gly

100 105 110 

Ile Val Cys Leu Gln Thr Ala Gly Asn Pro Phe Val Thr Leu Leu Ser

115 120 125 

Lys Gly Lys Glu Ala Arg Asn Leu Val Leu Val Gln Ala Phe Asn Ser

130 135 140 

Leu Gly Thr Thr Leu Gly Pro Ile Phe Gly Ser Leu Leu Ile Phe Ser
145 150 155 160 

Thr Thr Lys Met Gly Asp Asn Ala Ser Leu Ile Asp Lys Leu Ala Asp

165 170 175 

Ala Lys Ser Val Gln Met Pro Tyr Leu Gly Leu Ala Val Phe Ser Leu

180 185 190 

Leu Leu Ala Leu Ile Met Tyr Leu Leu Lys Leu Pro Asp Val Glu Lys

195 200 205 

Glu Met Pro Lys Glu Thr Thr Gln Lys Ser Leu Phe Ser His Lys His

210 215 220 

Phe Val Phe Gly Ala Trp Gly Ser Phe Phe Met Trp Gly Glu Xaa Trp 
225 230 235 240 

Arg Leu Ala His Ser Trp Cys 
245 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3084 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 49... 3027 
(D) OTHER INFORMATION: 
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GTG ATC ACA GCG ACG GGG TGC GCG TTG TTT TAT CCG GCG GCG CAT TTT 408
Val Ile Thr Ala Thr Gly Cys Ala Leu Phe Tyr Pro Ala Ala His Phe
85 90 95 

GGA TCC TAT GGG TTT TTT TTA GGA GCG TTG TTT ATT TTA GCG AGC GGG 456
Gly Ser Tyr Gly Phe Phe Leu Gly Ala Leu Phe Ile Leu Ala Ser Gly
100 105 110 

ATT GTG TGC TTG CAA ACC GCT GGT AAT CCC TTT GTA ACC TTG CTT TCT 504
Ile Val Cys Leu Gln Thr Ala Gly Asn Pro Phe Val Thr Leu Leu Ser
115 120 125 

AAA GGT AAA GAA GCC AGA AAT TTG GTT TTA GTC CAG GCG TTC AAT TCG 552 
Lys Gly Lys Glu Ala Arg Asn Leu Val Leu Val Gln Ala Phe Asn Ser
130 135 140 

CTT GGC ACA ACT TTA GGG CCT ATT TTT GGG AGC TTG TTG ATT TTT AGC 600
Leu Gly Thr Thr Leu Gly Pro Ile Phe Gly Ser Leu Leu Ile Phe Ser
145 150 155 160 

ACG ACT AAA ATG GGC GAT AAT GCA AGT TTG ATA GAT AAA TTA GCG GAC 648
Thr Thr Lys Met Gly Asp Asn Ala Ser Leu Ile Asp Lys Leu Ala Asp
165 170 175 

GCT AAA AGC GTT CAA ATG CCT TAT TTG GGC TTG GCG GTG TTT TCG CTT 696
Ala Lys Ser Val Gln Met Pro Tyr Leu Gly Leu Ala Val Phe Ser Leu
180 185 190 

CTT TTA GCG CTC ATC ATG TAT CTT TTG AAA TTG CCT GAT GTG GAA AAA 744 
Leu Leu Ala Leu Ile Met Tyr Leu Leu Lys Leu Pro Asp Val Glu Lys
195 200 205 

GAA ATG CCC AAA GAG ACG ACT CAA AAA AGC TTG TTT TCG CAC AAA CAC 792
Glu Met Pro Lys Glu Thr Thr Gln Lys Ser Leu Phe Ser His Lys His
210 215 220 

TTT GTT TTT GGG GCT TGG GGA TCT TTT TTT ATG TGG GGG GAG AAN TGG 840

Phe Val Phe Gly Ala Trp Gly Ser Phe Phe Met Trp Gly Glu Xaa Trp 
225 230 235 240 

CGA TTG GCT CAT TCT TGG TGC TAAGCTTTGA AAAGCTTTTG AATTTAGACT CTCA 895
Arg Leu Ala His Ser Trp Cys 
245 

ATCAAGCGCG CATTAC 911 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
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595 600 605

Lys Ser Asp Asn Ser Val Leu Asn Asn Val Ser His Val Ala Gln Met
610 615 620

Gln Ile Ile Lys Glu Lys Met Ala Thr Thr Thr Asp Glu Lys Ala Leu
625 630 635 640

Lys Thr Leu Gln Lys Ser Phe Phe Asn His Ala Ile Asp Ala Ile Leu
645 650 655

Cys Val Phe Phe Met Leu Val Ala Leu Leu Val Leu Ile Val Ser Val
660 665 670

Arg Ile Cys Ser Asn Ala Tyr Phe Lys Asn Lys Ile Tyr Pro Pro Leu
675 680 685

Ala Glu Thr Pro Tyr Ile Lys Ala Ser
690 695

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 121... 861 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



TAAGGGGCTT TGCATTTTTT ACTCATTTCA TGCCTCTTTT TCTTTATTTA GACAGATTAT 60
TATCTTAAAA TAATTGTAAT ATCATTATTA TTATATCAAC TCAATAAAAA AGGAGAAGGT 120
ATG CAA AAA ACT TCT AAC ACT CTG GCG CTG GGG AGT TTG ACA GCG CTA 168
Met Gln Lys Thr Ser Asn Thr Leu Ala Leu Gly Ser Leu Thr Ala Leu
1 5 10 15



TTC TTT CTA ATG GGT TTT ATC ACG GTT TTA AAC GAT ATT TTA ATC CCA 216 
Phe Phe Leu Met Gly Phe Ile Thr Val Leu Asn Asp Ile Leu Ile Pro
20 25 30 



CAC TTA AAG CCC ATT TTT GAC TTG ACC TAT TTT GAA GCT TCA CTC ATT 264 
His Leu Lys Pro Ile Phe Asp Leu Thr Tyr Phe Glu Ala Ser Leu Ile
35 40 45 



CAA TTT TGC TTT TTT GGG GCG TAT TTC ATC ATG GGA GGA GTT TTT GGG 312 
Gln Phe Cys Phe Phe Gly Ala Tyr Phe Ile Met Gly Gly Val Phe Gly
50 55 60 



AAT GTG ATC AGT AAA ATC GGC TAC CCT TTT GGC GTG GTG CTT GGT TTT 360 
Asn Val Ile Ser Lys Ile Gly Tyr Pro Phe Gly Val Val Leu Gly Phe
65 70 75 80 
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Ile Lys Leu Glu Met Gly Gln Phe Val Gly Met Ile Ala Ser Leu Gly
165 170 175

Ile Leu Gly Ile Met Leu Ile Ile Ile Ala Ile Leu Ala Met Val Val
180 185 190

Val Lys Ala Leu Ala His Ser Pro Trp Gly Phe Phe Thr Ile Ala Met
195 200 205

Thr Ile Pro Ile Ala Ile Leu Met Gly Leu Tyr Met Arg Phe Phe Arg
210 215 220

Pro His Lys Ile Leu Glu Val Ser Val Ile Gly Phe Ile Leu Leu Ile
225 230 235 240

Ile Ala Ile Tyr Ala Gly Lys Tyr Val Ser Leu Asp Pro Lys Leu Ala
245 250 255

Ser Ile Phe Thr Phe Glu Ala Ser Ser Leu Ala Trp Met Ile Met Gly
260 265 270

Tyr Gly Phe Val Ala Ser Ile Leu Pro Val Trp Phe Leu Leu Ala Pro
275 280 285

Arg Asp Tyr Leu Ser Thr Phe Leu Lys Ile Gly Val Ile Gly Val Leu
290 295 300

Val Val Ala Ile Ile Phe Val Ala Pro Pro Leu Gln Ile Pro Lys Ile
305 310 315 320

Thr Pro Phe Val Asp Gly Ser Gly Pro Val Phe Ala Gly Ser Val Phe
325 330 335

Pro Phe Leu Phe Ile Thr Val Ala Cys Gly Thr Ile Ser Gly Phe His
340 345 350

Ala Leu Ile Ser Ser Gly Thr Thr Pro Lys Met Leu Ala Lys Glu Ser
355 360 365

Asp Ala Arg Leu Val Gly Tyr Gly Ser Met Val Met Glu Ser Val Val
370 375 380

Ala Leu Met Ala Leu Val Cys Ala Gly Ile Leu His Pro Gly Leu Tyr
385 390 395 400

Phe Ala Ile Asn Ser Pro Glu Val Ser Ile Gly Lys Asp Ile Ala Asp
405 410 415

Ala Ala Ser Val Ile Ser Ser Trp Gly Phe Asn Ile Ser Ala Glu Glu
420 425 430

Ile Arg Glu Met Thr Lys Asn Ile Gly Glu Ser Ser Ile Leu Ser Arg
435 440 445

Thr Gly Gly Ala Pro Thr Phe Ala Ile Gly Leu Ala Met Ile Val Tyr
450 455 460

His Ile Leu Gly Asp Pro Ser Val Met Ala Phe Trp Tyr His Phe Ala
465 470 475 480

Ile Leu Phe Glu Ala Leu Phe Ile Leu Thr Ala Val Asp Ala Gly Thr
485 490 495

Arg Thr Ala Arg Phe Met Ile Gln Asp Leu Leu Gly Asn Val Tyr Lys
500 505 510

Pro Leu Gly Asp Leu Ser Ser Tyr Lys Ala Gly Ile Phe Ala Thr Leu
515 520 525

Leu Cys Val Ala Gly Trp Gly Tyr Phe Leu Tyr Gln Gly Thr Ile Asp
530 535 540

Pro Lys Gly Gly Ile Tyr Thr Leu Trp Pro Leu Phe Gly Val Ser Asn
545 550 555 560

Gln Met Leu Ala Gly Met Ala Leu Leu Leu Val Thr Val Val Leu Phe
565 570 575

Lys Met Gly Arg Phe Lys Gly Ala Met Ile Ser Ala Leu Pro Ala Val
580 585 590

Leu Ile Leu Ser Ile Thr Phe Tyr Ser Gly Ile Leu Lys Val Val Pro
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GTG GCG CAA ATG CAA ATC ATC AAA GAA AAA ATG GCT ACC ACT ACC GAT 1984
Val Ala Gln Met Gln Ile Ile Lys Glu Lys Met Ala Thr Thr Thr Asp
625 630 635 

GAA AAA GCG CTC AAA ACG CTC CAA AAA TCC TTT TTT AAC CAC GCT ATT 2032
Glu Lys Ala Leu Lys Thr Leu Gln Lys Ser Phe Phe Asn His Ala Ile
640 645 650 

GAT GCG ATT TTG TGC GTG TTT TTC ATG CTT GTG GCG CTA TTG GTT TTA 2080
Asp Ala Ile Leu Cys Val Phe Phe Met Leu Val Ala Leu Leu Val Leu
655 660 665 

ATC GTG AGC GTT AGG ATT TGC TCA AAC GCT TAT TTT AAA AAC AAA ATT 2128
Ile Val Ser Val Arg Ile Cys Ser Asn Ala Tyr Phe Lys Asn Lys Ile
670 675 680 

TAC CCA CCG CTG GCT GAA ACG CCC TAC ATC AAA GCC TCT TGAATAAAAA AG 2179
Tyr Pro Pro Leu Ala Glu Thr Pro Tyr Ile Lys Ala Ser
685 690 695 

GGGTTTTAAC CCCCTTTAAA TCCATAGAAA AAAGTTTGA 2218 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 697 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Met Ile Lys Gln Ser Leu Asn Gly Glu Asp Met Gln Lys Ser Leu Val

1 5 10 15

Ser Leu Ala Trp Val Phe Val Ala Ile Leu Gly Ala Ile Cys Leu Gly

20 25 30 

Val Leu Ala Leu His Lys Gly Glu Ser Ile Asn Thr Leu Trp Leu Val

35 40 45 

Val Ala Ser Ala Cys Ile Tyr Ser Ile Gly Tyr Arg Phe Tyr Ser His

50 55 60 

Phe Ile Ala Tyr Lys Val Leu Lys Leu Asp Asp Ser Arg Ala Thr Pro
65 70 75 80 

Ala Cys Val Arg Asn Asp Gly Lys Asp Phe Val Pro Thr Asp Lys Ala 

85 90 95 

Ile Thr Phe Gly His His Phe Ala Ala Ile Ala Gly Ala Gly Pro Leu

100 105 110 

Val Gly Pro Ile Leu Ala Ala Gln Met Gly Tyr Leu Pro Ser Ile Leu

115 120 125 

Trp Ile Leu Ile Gly Ser Val Leu Gly Gly Cys Val His Asp Phe Val

130 135 140 

Val Leu Phe Ala Ser Ile Arg Arg Asp Gly Lys Ser Leu Gly Glu Met
145 150 155 160 
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CCA GGG CTT TAT TTC GCT ATC AAT TCG CCA GAA GTG AGC ATC GGT AAA 1312
Pro Gly Leu Tyr Phe Ala Ile Asn Ser Pro Glu Val Ser Ile Gly Lys
400 405 410 

GAT ATA GCT GAT GCG GCT TCA GTG ATT AGC TCA TGG GGG TTT AAT ATC 1360
Asp Ile Ala Asp Ala Ala Ser Val Ile Ser Ser Trp Gly Phe Asn Ile
415 420 425

AGC GCT GAA GAA ATT CGT GAG ATG ACT AAA AAC ATC GGC GAA AGC TCC 1408
Ser Ala Glu Glu Ile Arg Glu Met Thr Lys Asn Ile Gly Glu Ser Ser
430 435 440 

ATT TTG AGC CGC ACC GGT GGG GCG CCC ACT TTT GCG ATC GGT TTA GCG 1456
Ile Leu Ser Arg Thr Gly Gly Ala Pro Thr Phe Ala Ile Gly Leu Ala
445 450 455 460 

ATG ATT GTG TAT CAC ATT TTA GGG GAT CCA AGC GTG ATG GCG TTT TGG 1504
Met Ile Val Tyr His Ile Leu Gly Asp Pro Ser Val Met Ala Phe Trp
465 470 475 

TAT CAT TTT GCG ATT TTG TTT GAA GCT TTG TTC ATT TTA ACC GCT GTG 1552
Tyr His Phe Ala Ile Leu Phe Glu Ala Leu Phe Ile Leu Thr Ala Val
480 485 490 

GAT GCT GGC ACA CGA ACC GCT CGT TTC ATG ATT CAA GAT TTG CTC GGT 1600
Asp Ala Gly Thr Arg Thr Ala Arg Phe Met Ile Gln Asp Leu Leu Gly
495 500 505 

AAT GTT TAT AAG CCT TTG GGC GAT CTT AGC TCT TAT AAG GCT GGG ATT 1648
Asn Val Tyr Lys Pro Leu Gly Asp Leu Ser Ser Tyr Lys Ala Gly Ile
510 515 520 

TTT GCC ACT CTT TTG TGC GTG GCA GGG TGG GGG TAT TTC TTG TAT CAA 1696 
Phe Ala Thr Leu Leu Cys Val Ala Gly Trp Gly Tyr Phe Leu Tyr Gln
525 530 535 540 

GGC ACG ATT GAT CCT AAA GGG GGG ATT TAT ACG CTA TGG CCT TTA TTT 1744 
Gly Thr Ile Asp Pro Lys Gly Gly Ile Tyr Thr Leu Trp Pro Leu Phe
545 550 555 

GGC GTG AGC AAT CAG ATG TTA GCG GGC ATG GCG TTG TTG TTG GTC ACG 1792
Gly Val Ser Asn Gln Met Leu Ala Gly Met Ala Leu Leu Leu Val Thr
560 565 570 

GTG GTG TTG TTT AAA ATG GGG CGT TTT AAG GGG GCG ATG ATA AGC GCC 1840
Val Val Leu Phe Lys Met Gly Arg Phe Lys Gly Ala Met Ile Ser Ala
575 580 585 

TTA CCG GCA GTT TTG ATT TTA TCC ATC ACT TTT TAT AGC GGT ATT TTA 1888
Leu Pro Ala Val Leu Ile Leu Ser Ile Thr Phe Tyr Ser Gly Ile Leu
590 595 600 

AAG GTG GTG CCA AAG AGC GAT AAC AGC GTG CTG AAT AAT GTT TCC CAT 1936

Lys Val Val Pro Lys Ser Asp Asn Ser Val Leu Asn Asn Val Ser His 
605 610 615 620 
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GCA AGT CTG GGC ATT TTA GGG ATC ATG CTC ATT ATC ATT GCG ATT TTA 640
Ala Ser Leu Gly Ile Leu Gly Ile Met Leu Ile Ile Ile Ala Ile Leu
175 180 185 

GCG ATG GTG GTG GTG AAG GCT TTA GCG CAT TCG CCT TGG GGC TTT TTT 688 
Ala Met Val Val Val Lys Ala Leu Ala His Ser Pro Trp Gly Phe Phe 
190 195 200 

ACG ATC GCA ATG ACT ATT CCC ATT GCG ATT CTT ATG GGG CTT TAC ATG 736
Thr Ile Ala Met Thr Ile Pro Ile Ala Ile Leu Met Gly Leu Tyr Met
205 210 215 220 

CGG TTT TTC AGG CCA CAC AAG ATT TTA GAG GTT TCT GTT ATT GGC TTT 784
Arg Phe Phe Arg Pro His Lys Ile Leu Glu Val Ser Val Ile Gly Phe
225 230 235 

ATC CTA TTG ATT ATA GCG ATT TAT GCG GGT AAA TAC GTT TCT TTA GAT 832 
Ile Leu Leu Ile Ile Ala Ile Tyr Ala Gly Lys Tyr Val Ser Leu Asp
240 245 250 

CCT AAA TTA GCG TCA ATA TTC ACT TTT GAG GCC AGT TCT TTA GCG TGG 880
Pro Lys Leu Ala Ser Ile Phe Thr Phe Glu Ala Ser Ser Leu Ala Trp
255 260 265 

ATG ATC ATG GGC TAT GGG TTT GTG GCT TCT ATT TTA CCG GTA TGG TTT 928
Met Ile Met Gly Tyr Gly Phe Val Ala Ser Ile Leu Pro Val Trp Phe
270 275 280 

TTA CTC GCT CCA CGA GAT TAT CTA AGC ACT TTT TTA AAA ATT GGC GTT 976
Leu Leu Ala Pro Arg Asp Tyr Leu Ser Thr Phe Leu Lys Ile Gly Val
285 290 295 300 

ATA GGG GTG TTG GTT GTG GCC ATT ATT TTT GTC GCT CCG CCT TTA CAA 1024 
Ile Gly Val Leu Val Val Ala Ile Ile Phe Val Ala Pro Pro Leu Gln
305 310 315 

ATC CCT AAA ATC ACG CCC TTT GTA GAT GGC AGT GGG CCT GTG TTT GCA 1072 
Ile Pro Lys Ile Thr Pro Phe Val Asp Gly Ser Gly Pro Val Phe Ala
320 325 330 

GGA AGC GTG TTC CCT TTC TTG TTT ATC ACG GTG GCT TGC GGG ACG ATT 1120 
Gly Ser Val Phe Pro Phe Leu Phe Ile Thr Val Ala Cys Gly Thr Ile
335 340 345 

AGC GGA TTC CAT GCT TTA ATT TCT TCA GGC ACG ACC CCT AAA ATG CTC 1168 
Ser Gly Phe His Ala Leu Ile Ser Ser Gly Thr Thr Pro Lys Met Leu
350 355 360 

GCT AAA GAA AGC GAC GCA AGG CTA GTG GGC TAT GGC TCT ATG GTG ATG 1216 
Ala Lys Glu Ser Asp Ala Arg Leu Val Gly Tyr Gly Ser Met Val Met 
365 370 375 380 

GAG AGC GTT GTG GCT CTT ATG GCG TTG GTG TGC GCA GGG ATC TTG CAC 1264
Glu Ser Val Val Ala Leu Met Ala Leu Val Cys Ala Gly Ile Leu His
385 390 395 
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(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 77... 2167
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

GAAGTTTATG AGCCGTTTTG CCACTATTCA AAGAAATTTT GGATTATAAT AAAAAAACTG 60

GCTGAAATTA ACAACA ATG ATT AAA CAA TCA TTA AAT GGA GAG GAC ATG CAA 112 

Met Ile Lys Gln Ser Leu Asn Gly Glu Asp Met Gln
1 5 10

AAA AGT TTA GTT TCT TTG GCT TGG GTT TTT GTA GCT ATT TTA GGG GCG 160
Lys Ser Leu Val Ser Leu Ala Trp Val Phe Val Ala Ile Leu Gly Ala
15 20 25 

ATC TGT TTA GGG GTG TTA GCC TTA CAC AAG GGT GAG AGC ATT AAC ACG 208 
Ile Cys Leu Gly Val Leu Ala Leu His Lys Gly Glu Ser Ile Asn Thr
30 35 40 

CTA TGG CTT GTA GTA GCG AGC GCT TGT ATT TAT AGC ATA GGC TAT CGT 256
Leu Trp Leu Val Val Ala Ser Ala Cys Ile Tyr Ser Ile Gly Tyr Arg
45 50 55 60 

TTT TAT AGC CAT TTT ATC GCT TAT AAG GTG TTA AAG CTA GAT GAT AGC 304
Phe Tyr Ser His Phe Ile Ala Tyr Lys Val Leu Lys Leu Asp Asp Ser
65 70 75 

AGA GCC ACG CCC GCA TGC GTA AGG AAT GAT GGC AAG GAT TTT GTG CCA 352

Arg Ala Thr Pro Ala Cys Val Arg Asn Asp Gly Lys Asp Phe Val Pro 
80 85 90 

ACC GAT AAA GCG ATC ACT TTT GGG CAC CAT TTC GCC GCT ATT GCT GGG 400 
Thr Asp Lys Ala Ile Thr Phe Gly His His Phe Ala Ala Ile Ala Gly
95 100 105 

GCT GGC CCT TTA GTA GGC CCG ATA CTA GCC GCT CAA ATG GGT TAC TTG 448
Ala Gly Pro Leu Val Gly Pro Ile Leu Ala Ala Gln Met Gly Tyr Leu
110 115 120 

CCC TCT ATC TTA TGG ATT TTG ATA GGC TCG GTT TTA GGG GGT TGC GTG 496
Pro Ser Ile Leu Trp Ile Leu Ile Gly Ser Val Leu Gly Gly Cys Val
125 130 135 140 

CAT GAT TTT GTG GTG CTT TTT GCT TCT ATT AGG CGC GAT GGC AAG TCT 544
His Asp Phe Val Val Leu Phe Ala Ser Ile Arg Arg Asp Gly Lys Ser
145 150 155 

TTA GGC GAA ATG ATC AAA CTT GAA ATG GGC CAA TTT GTA GGC ATG ATC 592 
Leu Gly Glu Met Ile Lys Leu Glu Met Gly Gln Phe Val Gly Met Ile
160 165 170 
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35 40 45 

Ser Phe Cys Val Leu Val Ala Glu Tyr Pro Ser Gly Val Leu Ala Asp 

50 55 60 

Leu Met Ser Arg Lys Asn Leu Phe Leu Val Ser Asn Ala Phe Leu Ile
65 70 75 80 

Ala Ser Phe Ser Phe Val Leu Phe Phe Asp Ser Phe Ile Phe Met Leu

85 90 95 

Leu Ala Trp Gly Leu Tyr Gly Leu Tyr Ser Ala Cys Ser Ser Gly Thr 

100 105 110 

Ile Glu Ala Ser Leu Ile Thr Asp Ile Lys Glu Asn Lys Lys Asp Leu

115 120 125 

Ser Lys Phe Leu Ala Lys Asn Asn Gln Ile Thr Tyr Leu Gly Met Ile

130 135 140 

Ile Gly Ser Ser Leu Gly Ser Phe Leu Tyr Leu Lys Val His Ala Met
145 150 155 160 

Leu Tyr Ile Val Gly Ile Phe Leu Ile Met Leu Cys Val Leu Thr Ile

165 170 175 

Ile Phe Tyr Phe Lys Glu Lys Glu Gly Asp Phe Lys Ser Gln Lys Ser

180 185 190 

Leu Lys Leu Leu Lys Glu Gln Val Lys Gly Ser Leu Lys Glu Leu Lys

195 200 205 

Asp Asn Pro Lys Leu Lys Ile Leu Leu Val Gly His Leu Ile Thr Pro

210 215 220 

Val Phe Phe Met Ser His Phe Gln Met Trp Gln Ala Tyr Phe Leu Lys
225 230 235 240 

Gln Gly Val Lys Glu Gln Tyr Leu Phe Val Phe Tyr Ile Ala Phe Gln

245 250 255 

Val Ile Ser Ile Leu Ile His Phe Leu Lys Ala Ser Ser Tyr Ser Gln

260 265 270 

Lys Ile Ala Leu Ser Ser Leu Val Val Leu Leu Gly Val Ser Pro Leu

275 280 285 

Leu Leu Ser Asn Ile Pro Tyr Cys Phe Ile Gly Val Tyr Ala Leu Met

290 295 300 

Val Ala Phe Phe Thr Tyr Met Ser Tyr Cys Leu Asn Tyr Gln Phe Ser
305 310 315 320 

Lys Phe Val Ser Lys Asn Asn Ile Ser Ser Leu Ser Ser Leu Leu Ser

325 330 335 

Ser Cys Val Arg Val Val Ser Val Leu Ile Leu Ser Leu Ser Ser Leu

340 345 350 

Glu Leu Arg Tyr Phe Ser Pro Leu Thr Ile Ile Thr Met His Phe Ala

355 360 365 

Leu Thr Leu Ile Ile Leu Phe Phe Phe Leu Tyr Lys Ala Lys Pro Phe

370 375 380 

Asp Glu 
385 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
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Phe Gln Val Ile Ser Ile Leu Ile His Phe Leu Lys Ala Ser Ser Tyr
255 260 265 270 

AGC CAA AAA ATC GCC TTG ACT TCG CTT GTG GTG TTG TTA GGC GTT AGC 927 
Ser Gln Lys Ile Ala Leu Ser Ser Leu Val Val Leu Leu Gly Val Ser
275 280 285 

CCC TTA TTG CTT AGC AAT ATC CCT TAT TGT TTC ATA GGG GTG TAT GCG 975 
Pro Leu Leu Leu Ser Asn Ile Pro Tyr Cys Phe Ile Gly Val Tyr Ala
290 295 300 

CTC ATG GTG GCG TTT TTC ACT TAC ATG AGC TAT TGC TTA AAC TAT CAA 1023
Leu Met Val Ala Phe Phe Thr Tyr Met Ser Tyr Cys Leu Asn Tyr Gln
305 310 315 

TTC TCC AAA TTC GTT TCT AAA AAC AAC ATT TCC TCG CTC TCA TCG CTT 1071
Phe Ser Lys Phe Val Ser Lys Asn Asn Ile Ser Ser Leu Ser Ser Leu
320 325 330 

TTA TCA AGC TGT GTG CGC GTG GTC TCT GTG CTA ATC TTA TCG CTC AGC 1119 
Leu Ser Ser Cys Val Arg Val Val Ser Val Leu Ile Leu Ser Leu Ser
335 340 345 350 

AGT CTG GAA CTG CGT TAC TTC TCA CCC CTA ACT ATC ATA ACC ATG CAT 1167 
Ser Leu Glu Leu Arg Tyr Phe Ser Pro Leu Thr Ile Ile Thr Met His
355 360 365 

TTT GCC TTG ACG CTT ATC ATC CTC TTT TTC TTT TTG TAT AAG GCT AAG 1215 
Phe Ala Leu Thr Leu Ile Ile Leu Phe Phe Phe Leu Tyr Lys Ala Lys
370 375 380 

CCG TTT GAT GAG TGAGCGGCTT TAAGAGTGCA ACCTTTTAGC GATTTCTATA GCAAC 1272
Pro Phe Asp Glu 
385 

ATCATAGCC 1281
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Met Leu Arg Lys Asn Ile Leu Ala Tyr Tyr Gly Ala Asn Phe Leu Leu

1 5 10 15

Ile Ile Ala Gln Ser Leu Pro His Ala Ile Leu Thr Pro Leu Leu Leu

20 25 30 

Ser Lys Gly Leu Ser Leu Ser Glu Ile Leu Leu Val Gln Thr Phe Phe
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35 40 45 

TTT TTT AGC TTT TGC GTG CTA GTG GCT GAA TAC CCA AGC GGC GTT TTA 255

Phe Phe Ser Phe Cys Val Leu Val Ala Glu Tyr Pro Ser Gly Val Leu 
50 55 60 

GCG GAT TTG ATG AGC CGA AAA AAT TTA TTC CTG GTT TCT AAT GCC TTT 303

Ala Asp Leu Met Ser Arg Lys Asn Leu Phe Leu Val Ser Asn Ala Phe 
65 70 75 

TTA ATC GCT AGT TTT TCG TTT GTG CTG TTT TTT GAT AGC TTT ATT TTC 351 
Leu Ile Ala Ser Phe Ser Phe Val Leu Phe Phe Asp Ser Phe Ile Phe
80 85 90 

ATG CTT TTA GCG TGG GGG TTG TAT GGT TTG TAT AGC GCA TGC TCT AGC 399

Met Leu Leu Ala Trp Gly Leu Tyr Gly Leu Tyr Ser Ala Cys Ser Ser 
95 100 105 110 

GGC ACG ATT GAA GCT TCA CTC ATC ACA GAC ATT AAG GAA AAC AAA AAA 447
Gly Thr Ile Glu Ala Ser Leu Ile Thr Asp Ile Lys Glu Asn Lys Lys
115 120 125 

GAT TTA TCC AAG TTT TTA GCC AAA AAC AAT CAA ATT ACT TAT TTA GGC 495
Asp Leu Ser Lys Phe Leu Ala Lys Asn Asn Gln Ile Thr Tyr Leu Gly
130 135 140 

ATG ATT ATA GGG AGT TCT TTG GGA TCG TTT TTG TAT CTC AAA GTC CAT 543
Met Ile Ile Gly Ser Ser Leu Gly Ser Phe Leu Tyr Leu Lys Val His
145 150 155 

GCG ATG CTG TAT ATT GTG GGG ATT TTT TTA ATC ATG CTC TGT GTG CTA 591 
Ala Met Leu Tyr Ile Val Gly Ile Phe Leu Ile Met Leu Cys Val Leu
160 165 170 

ACG ATC ATT TTT TAT TTT AAA GAG AAA GAA GGG GAT TTT AAA AGC CAA 639
Thr Ile Ile Phe Tyr Phe Lys Glu Lys Glu Gly Asp Phe Lys Ser Gln
175 180 185 190 

AAA AGC CTG AAA CTC CTT AAA GAG CAA GTC AAA GGC AGT CTT AAA GAG 687
Lys Ser Leu Lys Leu Leu Lys Glu Gln Val Lys Gly Ser Leu Lys Glu
195 200 205 

CTT AAA GAT AAC CCC AAA CTT AAA ATT CTG TTA GTG GGG CAT TTG ATT 735
Leu Lys Asp Asn Pro Lys Leu Lys Ile Leu Leu Val Gly His Leu Ile
210 215 220 

ACG CCC GTC TTT TTT ATG AGC CAT TTT CAA ATG TGG CAA GCG TAT TTT 783
Thr Pro Val Phe Phe Met Ser His Phe Gln Met Trp Gln Ala Tyr Phe
225 230 235 

TTA AAA CAA GGC GTT AAA GAG CAA TAC CTT TTT GTG TTT TAT ATC GCT 831 
Leu Lys Gln Gly Val Lys Glu Gln Tyr Leu Phe Val Phe Tyr Ile Ala
240 245 250 

TTT CAA GTG ATT TCT ATT CTC ATT CAT TTT TTA AAA GCC TCT AGT TAT 879

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Met Ile Phe Val Asn Lys Tyr Leu Tyr Gly Ile Lys Ser Val Val Pro

1 5 10 15

Leu Ala Val Gly Phe Ser Lys Tyr Pro Leu Lys Lys Phe Leu Trp Leu 

20 25 30 

Asn Val Phe Ser Ser Phe Leu Trp Ala Leu Ile Val Gly Ser Val Ser

35 40 45 

Phe Gln Ala Ser Asp Trp Val Lys Thr Leu Tyr Glu Arg Leu Ser His

50 55 60 

Tyr Thr Ser Phe Phe Ile Ile Ser Phe Val Leu Ile Ala Leu Leu Ile

65 70 75 80 

Trp Phe Leu Leu Lys Arg Tyr Ser Arg Lys Met Gly Phe 
85 90 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70... 1227 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

TAGCATCAAT ACCCCTTAAA TAAAAGATAT AATGCTGTAT TATAAGCTAG TTTTAATTAC 60

AATTTTCAA ATG TTA AGG AAA AAC ATT TTA GCT TAC TAT GGG GCG AAT TTT 111
Met Leu Arg Lys Asn Ile Leu Ala Tyr Tyr Gly Ala Asn Phe
1 5 10

CTC TTA ATC ATC GCT CAA AGC TTA CCC CAT GCG ATT TTA ACC CCC TTG 159
Leu Leu Ile Ile Ala Gln Ser Leu Pro His Ala Ile Leu Thr Pro Leu
15 20 25 30 

TTG CTT TCT AAA GGG CTT AGT TTG AGT GAA ATC TTG CTC GTG CAA ACC 207
Leu Leu Ser Lys Gly Leu Ser Leu Ser Glu Ile Leu Leu Val Gln Thr
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(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 103... 381 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

AGTCTTTGCC CGCTATCAAA AAAGAGATTT TTTGCAATAT TTTCATAAGC ACCGAAGAAA 60
GCTTGCTTTA GCGAGTTTGT GGGTGAAACG TACGCCTTGC TC ATG ATT TTT GTC 114
Met Ile Phe Val
1

AAT AAA TAT CTC TAT GGG ATT AAA AGC GTT GTG CCT TTG GCG GTT GGT 162
Asn Lys Tyr Leu Tyr Gly Ile Lys Ser Val Val Pro Leu Ala Val Gly
5 10 15 20

TTT AGC AAA TAC CCT TTA AAA AAG TTT TTA TGG CTT AAT GTT TTT TCC 210
Phe Ser Lys Tyr Pro Leu Lys Lys Phe Leu Trp Leu Asn Val Phe Ser
25 30 35

AGT TTT TTG TGG GCG CTC ATC GTG GGG AGC GTT TCT TTT CAA GCG AGC 258
Ser Phe Leu Trp Ala Leu Ile Val Gly Ser Val Ser Phe Gln Ala Ser
40 45 50

GAT TGG GTG AAA ACG CTG TAT GAA AGG CTT TCT CAT TAC ACT TCG TTT 306
Asp Trp Val Lys Thr Leu Tyr Glu Arg Leu Ser His Tyr Thr Ser Phe
55 60 65

TTT ATC ATA AGT TTT GTT CTT ATA GCG CTT TTA ATA TGG TTT TTA TTG 354
Phe Ile Ile Ser Phe Val Leu Ile Ala Leu Leu Ile Trp Phe Leu Leu
70 75 80

AAA CGA TAT TCG CGC AAA ATG GGT TTT TAAGCAAGAT GTTTAATTAA ATGCGCT 408
Lys Arg Tyr Ser Arg Lys Met Gly Phe
85 90

AGACTACGCC CACAAGCATT CGC 431
(2) INFORMATION FOR SEQ ID NO: 72: 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 



Met Leu Ile Ser Ile Ala Phe Leu Leu Val Leu Tyr Leu Leu Asn Tyr
1 5 10 15

Ser Ser Phe Arg Met Leu Lys Ser Phe Leu Thr Leu Lys Lys Ile Ser
20 25 30

Gln Tyr Ala Tyr Leu Trp Phe Phe Ile Leu Leu Ser Ile Gly Glu Ala
35 40 45

Ala Phe Val Phe Tyr Arg Asn Ile Met Pro Ser His Leu Phe Val Leu
50 55 60

Thr Ser Ala Cys Ser Phe Val Ser Phe Ile Ile Phe Ile Leu Ser Leu
65 70 75 80

Ser Phe Tyr Gly Phe Ser Tyr Ser Ile Glu Lys Ile Asp Phe Leu His
85 90 95

Ser Arg Arg Lys Ser Leu Lys Asn Phe Leu Lys Leu Gly Phe Tyr Leu
100 105 110

Ala Leu Leu Gly Tyr Phe Trp Arg Gly Phe Tyr Glu Gly Leu Ala Arg
115 120 125

Pro Lys Ile Lys Glu Thr Pro Ile Tyr Leu Asp Lys Leu Asp Lys Glu
130 135 140

Leu Lys Ile Ile Leu Leu Thr Asp Met His Val Gly Ser Leu Leu Gln
145 150 155 160

Lys Asp Phe Val Asp Tyr Ile Val Glu Glu Val Asn Gln Lys Glu Val
165 170 175

Asp Met Val Leu Ile Gly Gly Asp Leu Val Asp Glu Ser Ile Glu Lys
180 185 190

Val Lys Ser Phe Leu Leu Pro Leu Asn Asn Leu Lys Ser Thr His Gly
195 200 205

Thr Phe Tyr Val Pro Gly Asn His Glu Tyr Tyr His Gly Ile Glu Pro
210 215 220

Ile Leu Ser Phe Leu Asp Thr Leu Asn Leu Thr Ile Leu Gly Asn Glu
225 230 235 240

Cys Val His Leu Gly Gly Ile Asn Leu Cys Gly Val Tyr Asp Tyr Phe
245 250 255

Ala Arg Lys Arg Gln Asn Phe Ala Pro Asp Ile Asp Lys Ala Leu Lys
260 265 270

Lys Arg Asn Glu Ser Lys Pro Thr Ile Leu Leu Ala His Gln Pro Lys
275 280 285

Gln Ile Arg Ser Leu Lys Glu Ser His Ser Val Asp Leu Val Leu Ser
290 295 300

Gly His Thr His Ala Gly Gln Ile Phe Pro Phe Ser Leu Leu Val Lys
305 310 315 320

Leu Ala Gln Thr Tyr Leu His Gly Leu Tyr Lys His Ser Pro Thr Thr
325 330 335

Gln Ile Tyr Val Ser Ser Gly Ala Gly Tyr Trp Gly Ile Pro Leu Arg
340 345 350

Phe Leu Ala Pro Ser Glu Ile Ala Tyr Leu Arg Leu Leu Pro Lys Asn
355 360 365

Gln Ala
370
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TTA GTG GAT GAA AGC ATT GAA AAA GTC AAA TCT TTT TTA CTG CCT TTA 688 
Leu Val Asp Glu Ser Ile Glu Lys Val Lys Ser Phe Leu Leu Pro Leu
185 190 195 200

AAC AAC CTT AAA AGC ACG CAT GGC ACT TTT TAT GTG CCA GGA AAT CAT 736
Asn Asn Leu Lys Ser Thr His Gly Thr Phe Tyr Val Pro Gly Asn His
205 210 215

GAG TAT TAT CAT GGC ATA GAG CCG ATT TTA TCG TTT CTT GAC ACG CTT 784
Glu Tyr Tyr His Gly Ile Glu Pro Ile Leu Ser Phe Leu Asp Thr Leu
220 225 230

AAT TTG ACG ATT TTA GGG AAT GAG TGC GTG CAT TTA GGG GGG ATC AAT 832
Asn Leu Thr Ile Leu Gly Asn Glu Cys Val His Leu Gly Gly Ile Asn
235 240 245

TTG TGC GGC GTG TAT GAT TAT TTC GCA AGG AAG CGT CAA AAT TTT GCC 880
Leu Cys Gly Val Tyr Asp Tyr Phe Ala Arg Lys Arg Gln Asn Phe Ala
250 255 260

CCT GAT ATT GAC AAA GCT TTA AAA AAG CGC AAT GAG AGT AAG CCC ACG 928
Pro Asp Ile Asp Lys Ala Leu Lys Lys Arg Asn Glu Ser Lys Pro Thr
265 270 275 280

ATC CTT TTG GCC CAC CAA CCT AAA CAA ATT AGA AGC CTC AAA GAA AGC 976
Ile Leu Leu Ala His Gln Pro Lys Gln Ile Arg Ser Leu Lys Glu Ser
285 290 295

CAC TCT GTA GAT TTA GTC CTT TCA GGG CAT ACC CAT GCA GGG CAA ATC 1024
His Ser Val Asp Leu Val Leu Ser Gly His Thr His Ala Gly Gln Ile
300 305 310

TTT CCC TTT AGC CTT TTA GTC AAG TTG GCG CAA ACC TAT TTA CAT GGT 1072
Phe Pro Phe Ser Leu Leu Val Lys Leu Ala Gln Thr Tyr Leu His Gly
315 320 325

TTA TAC AAG CAC AGC CCC ACC ACT CAA ATT TAT GTG AGC AGT GGG GCA 1120
Leu Tyr Lys His Ser Pro Thr Thr Gln Ile Tyr Val Ser Ser Gly Ala
330 335 340

GGG TAT TGG GGG ATT CCT TTA AGG TTT TTA GCC CCT AGC GAG ATC GCA 1168
Gly Tyr Trp Gly Ile Pro Leu Arg Phe Leu Ala Pro Ser Glu Ile Ala
345 350 355 360

TAC CTT AGG CTT TTA CCT AAA AAT CAA GCT TAGTTAAACA AAATCTTAAA ATC 1221
Tyr Leu Arg Leu Leu Pro Lys Asn Gln Ala
365 370

TTAATCGTAA TCAAGCGGTT AAAAATAAGA A 1252

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids 

(B) TYPE: amino acid 
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(B) LOCATION : 89 . . . 1198 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

AAAAAGGAAA TAGCACGATG AAACCTAAAG GGGATATTAG CGTTAATATG CTAATATAGT 60

AGAACATTAT GACTACAAAA AGGGTGAT ATG CTG ATC TCC ATA GCG TTT TTA 112
Met Leu Ile Ser Ile Ala Phe Leu
1 5

TTG GTT TTA TAT CTT TTG AAT TAT AGT TCT TTC AGG ATG TTG AAA TCG 160
Leu Val Leu Tyr Leu Leu Asn Tyr Ser Ser Phe Arg Met Leu Lys Ser
10 15 20

TTT TTA ACC TTA AAG AAA ATC TCT CAA TAC GCT TAT TTA TGG TTT TTT 208
Phe Leu Thr Leu Lys Lys Ile Ser Gln Tyr Ala Tyr Leu Trp Phe Phe
25 30 35 40

ATC CTT TTG AGC ATA GGC GAG GCG GCT TTT GTT TTT TAT AGA AAT ATT 256
Ile Leu Leu Ser Ile Gly Glu Ala Ala Phe Val Phe Tyr Arg Asn Ile
45 50 55

ATG CCT AGC CAT TTG TTT GTT TTG ACT TCA GCG TGT TCG TTT GTG TCT 304
Met Pro Ser His Leu Phe Val Leu Thr Ser Ala Cys Ser Phe Val Ser
60 65 70

TTT ATT ATT TTT ATC CTT TCT TTA AGT TTT TAC GGG TTT TCC TAT TCC 352
Phe Ile Ile Phe Ile Leu Ser Leu Ser Phe Tyr Gly Phe Ser Tyr Ser
75 80 85

ATA GAA AAA ATA GAT TTT TTG CAT TCA AGG CGT AAA AGT TTA AAA AAC 400
Ile Glu Lys Ile Asp Phe Leu His Ser Arg Arg Lys Ser Leu Lys Asn
90 95 100

TTT TTA AAA TTG GGG TTT TAT CTG GCG TTA TTA GGG TAT TTT TGG CGT 448
Phe Leu Lys Leu Gly Phe Tyr Leu Ala Leu Leu Gly Tyr Phe Trp Arg
105 110 115 120

GGG TTT TAT GAA GGG TTG GCC CGC CCT AAA ATC AAA GAA ACC CCT ATT 496
Gly Phe Tyr Glu Gly Leu Ala Arg Pro Lys Ile Lys Glu Thr Pro Ile
125 130 135

TAT TTG GAT AAG CTG GAT AAA GAA TTA AAG ATT ATT TTA CTC ACA GAC 544
Tyr Leu Asp Lys Leu Asp Lys Glu Leu Lys Ile Ile Leu Leu Thr Asp
140 145 150

ATG CAT GTG GGG AGT TTG TTG CAA AAA GAT TTT GTT GAT TAC ATT GTA 592
Met His Val Gly Ser Leu Leu Gln Lys Asp Phe Val Asp Tyr Ile Val
155 160 165

GAA GAA GTC AAT CAA AAA GAA GTG GAT ATG GTG CTG ATT GGG GGG GAT 640
Glu Glu Val Asn Gln Lys Glu Val Asp Met Val Leu Ile Gly Gly Asp
170 175 180
170 175 180 
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Ser Leu Thr Tyr Ala Cys Gly Ala Leu Cys Leu Leu Phe Ala Ser Cys 
65 70 75 80 

Leu Arg Ile Lys Leu Asn Gln Lys Ile Leu Phe Tyr Ser Met Ala Val
85 90 95

Ala Asn Phe Ile Asn Gly Leu Leu Ser Leu Val Gln Lys Ile Tyr Phe
100 105 110

Asn Met Pro Arg Ala Gln Gly Phe Ser Thr Val Lys Glu Tyr Val Val
115 120 125

Leu Val Ser Val Ser Ile Leu Gly Cys Tyr Ile Tyr Ala Leu Tyr Ser
130 135 140

His Asn Gln Lys Glu Lys Leu Phe Phe Thr Leu Ser Val Phe Val Gly
145 150 155 160

Phe Leu Val Val Ile Leu Ser Ala Thr Arg Ser Ala Thr Ile Ala Phe
165 170 175

Val Ile Thr Phe Leu Ile Leu Ser Cys Phe Ile Leu Tyr Ala Lys Lys
180 185 190

Ser Leu Lys Pro Leu Gly Tyr Met Val Val Val Ser Leu Ile Leu Ser
195 200 205

Ala Leu Tyr Val Gly Ser Asn Ala Leu Glu Lys Lys Gly Ala Ile Glu
210 215 220

Gln Ser Arg Val Gln Asn Gln Ser Phe Glu Glu Asp Leu Lys Arg Tyr
225 230 235 240

Ala Lys Lys Asp Ala Asp Ser Ser Ile Gly Trp Arg Leu Glu Arg Trp
245 250 255

Lys Glu Ala Leu Thr Val Leu Arg Leu Arg Pro Phe Phe Gly Met Ala
260 265 270

Ala Ser Glu Lys Cys Gln Arg Leu Glu Glu Ile Leu Ser Leu Ser Lys
275 280 285

Ser Tyr Arg Ala Lys Asp Leu Ile Leu Cys Tyr Glu Arg Tyr Asp Asn
290 295 300

Gln Ile Ile His Ile Leu Ala Thr Arg Gly Ile Ile Gly Phe Leu Ile
305 310 315 320

Trp Leu Phe Phe Leu Leu Val Ile Val Lys Ile Phe Trp Ser Gly Ile
325 330 335

Lys Gln Asn Ser Leu Ile Ser Phe Phe Ile Leu Met Thr Leu Ala Phe
340 345 350

Tyr Leu Ile Phe Gly Ile Gly Phe Asp Pro Phe Asp Phe Phe Ile Thr
355 360 365

Gly Ser Phe Phe Val Gly Met Ile Met Met Ala Val Phe Leu Lys Lys
370 375 380

Asp Lys Ser Ala Phe
385

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1252 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 
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TTT TTT GGT ATG GCC GCT AGC GAG AAA TGC CAG AGG TTA GAA GAG ATT 104 8 

Phe Phe Gly Met Ala Ala Ser Glu Lys Cys Gln Arg Leu Glu Glu Ile
270 275 280

TTA TCC TTA TCA AAG TCT TAT AGG GCC AAA GAT TTG ATT CTC TGT TAT 1096
Leu Ser Leu Ser Lys Ser Tyr Arg Ala Lys Asp Leu Ile Leu Cys Tyr
285 290 295

GAA AGA TAC GAC AAT CAA ATC ATT CAC ATT TTA GCC ACT AGG GGG ATC 1144
Glu Arg Tyr Asp Asn Gln Ile Ile His Ile Leu Ala Thr Arg Gly Ile
300 305 310 315

ATA GGC TTT TTG ATC TGG CTC TTT TTT TTA TTA GTT ATT GTA AAG ATT 1192
Ile Gly Phe Leu Ile Trp Leu Phe Phe Leu Leu Val Ile Val Lys Ile
320 325 330

TTT TGG AGC GGG ATA AAG CAA AAC TCT TTA ATA TCG TTT TTT ATA CTA 1240
Phe Trp Ser Gly Ile Lys Gln Asn Ser Leu Ile Ser Phe Phe Ile Leu
335 340 345

ATG ACA CTC GCC TTT TAC CTC ATT TTT GGC ATT GGG TTT GAC CCC TTT 1288
Met Thr Leu Ala Phe Tyr Leu Ile Phe Gly Ile Gly Phe Asp Pro Phe
350 355 360

GAT TTC TTC ATT ACG GGA AGT TTT TTT GTA GGA ATG ATC ATG ATG GCT 1336
Asp Phe Phe Ile Thr Gly Ser Phe Phe Val Gly Met Ile Met Met Ala
365 370 375

GTT TTT TTA AAA AAA GAT AAA AGT GCT TTT TAGCATCAAG GGGTTTGACA TTA 1389
Val Phe Leu Lys Lys Asp Lys Ser Ala Phe
380 385

GTCAAGCGGT AGTTTCTTGT GATTCGTTCT T 1420

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

Met Val Phe Leu Phe Phe Arg Cys Tyr Phe Gln Ala Ser Leu Lys Glu
1 5 10 15

Thr Phe Ala Ile Asn His Leu Lys Thr Met Ser Phe Lys Trp Leu Thr
20 25 30

Leu Ala Phe Leu Gly Val Phe Leu Ser Ile Phe Pro Asn Met Phe Asn
35 40 45

Met His Asp Ser Gln Thr Phe Arg Tyr Asn Leu Phe Ala Leu Asn Met
50 55 60
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CCT AAC ATG TTT AAC ATG CAT GAT AGO CAA ACT TTC CGC TAG AAT TTA 3 76 

Pro Asn Met Phe Asn Met His Asp Ser Gln Thr Phe Arg Tyr Asn Leu
45 50 55

TTC GCT CTA AAC ATG TCC TTA ACT TAT GCT TGC GGG GCG TTA TGC TTG 424
Phe Ala Leu Asn Met Ser Leu Thr Tyr Ala Cys Gly Ala Leu Cys Leu
60 65 70 75

CTT TTT GCC AGT TGC TTA AGA ATC AAA TTG AAT CAA AAA ATC CTT TTT 472
Leu Phe Ala Ser Cys Leu Arg Ile Lys Leu Asn Gln Lys Ile Leu Phe
80 85 90

TAC AGC ATG GCT GTT GCA AAT TTC ATC AAC GGC TTG CTC TCA TTG GTG 520
Tyr Ser Met Ala Val Ala Asn Phe Ile Asn Gly Leu Leu Ser Leu Val
95 100 105

CAA AAA ATT TAT TTT AAC ATG CCC AGA GCG CAA GGG TTT AGC ACG GTT 568
Gln Lys Ile Tyr Phe Asn Met Pro Arg Ala Gln Gly Phe Ser Thr Val
110 115 120

AAG GAG TAT GTG GTT TTA GTG AGC GTG TCC ATT TTA GGC TGT TAT ATT 616
Lys Glu Tyr Val Val Leu Val Ser Val Ser Ile Leu Gly Cys Tyr Ile
125 130 135

TAT GCG CTT TAT TCG CAC AAT CAA AAA GAA AAA CTT TTT TTC ACG CTT 664
Tyr Ala Leu Tyr Ser His Asn Gln Lys Glu Lys Leu Phe Phe Thr Leu
140 145 150 155

TCT GTT TTT GTG GGG TTT TTA GTC GTT ATT TTA AGC GCC ACA AGG AGC 712
Ser Val Phe Val Gly Phe Leu Val Val Ile Leu Ser Ala Thr Arg Ser
160 165 170

GCG ACA ATC GCT TTT GTT ATT ACT TTT TTA ATC CTT TCT TGC TTT ATT 760
Ala Thr Ile Ala Phe Val Ile Thr Phe Leu Ile Leu Ser Cys Phe Ile
175 180 185

TTA TAC GCC AAA AAA TCG CTC AAA CCA TTG GGT TAT ATG GTG GTC GTG 808
Leu Tyr Ala Lys Lys Ser Leu Lys Pro Leu Gly Tyr Met Val Val Val
190 195 200

AGT CTT ATT TTG AGC GCT TTG TAT GTG GGG AGT AAC GCT TTA GAA AAA 856
Ser Leu Ile Leu Ser Ala Leu Tyr Val Gly Ser Asn Ala Leu Glu Lys
205 210 215

AAG GGG GCA ATA GAG CAA TCT AGA GTT CAA AAT CAA AGC TTT GAA GAA 904
Lys Gly Ala Ile Glu Gln Ser Arg Val Gln Asn Gln Ser Phe Glu Glu
220 225 230 235

GAT CTG AAA CGC TAC GCT AAA AAG GAC GCT GAT AGC AGT ATC GGA TGG 952
Asp Leu Lys Arg Tyr Ala Lys Lys Asp Ala Asp Ser Ser Ile Gly Trp
240 245 250

CGT TTG GAG CGT TGG AAA GAA GCC CTA ACG GTT TTG CGT TTA AGG CCC 1000
Arg Leu Glu Arg Trp Lys Glu Ala Leu Thr Val Leu Arg Leu Arg Pro
255 260 265
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290 

295 300

Asn Glu Ser Glu Phe Glu Glu Leu Ser Ala Phe Leu Asp Glu Phe Gln
305 310 315 320

Phe Asp Arg Leu Asn Ile Phe Ala Phe Ser Ala Glu Glu Asn Thr His
325 330 335

Ala Tyr Ser Leu Glu Lys Val Pro Lys Lys Thr Ile Asn Ala Arg Ile
340 345 350

Lys Ala Leu Asn Lys Ile Ala Leu Lys His Gln Asn His Ser Phe Lys
355 360 365

Ala Leu Leu Asn Lys Pro Ile Lys Ala Leu Val Glu Asn Lys Glu Gly
370 375 380

Glu Tyr Phe Tyr Lys Ala Arg Asp Leu Arg Trp Ala Pro Glu Val Asp
385 390 395 400

Gly Glu Ile Leu Ile Asn Asp Ser Glu Leu Thr Thr Pro Leu Lys Pro
405 410 415

Gly His Tyr Thr Ile Ala Pro Ser Glu Phe Lys Asp Asn Ile Leu Leu
420 425 430

Ala Lys Val Leu Ser Pro Phe
435



(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 200...1366
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TGCGCGTTAT RTGGTTTTGA TTGAAAAGTT AGGCATTAAA GACAGATAAT TTTTATCGGT 60 

TTTAGGGTGT GGCGAGTTTG TGTTGAAAGA GCGTTTGAAG GCCTTTTTTA GTGCGGACTC 120

TGTTTTCACT TTAATTTTTG CCCTTTTCTT TCTCACTTCG TTTAAAAAAC CTTTAACTCA 180

AGTCTTGTTG ATTGTTTTA ATG GTT TTT TTG TTT TTT AGG TGT TAT TTC CAA 232
Met Val Phe Leu Phe Phe Arg Cys Tyr Phe Gln
1 5 10

GCG TCT TTG AAA GAA ACT TTC GCA ATT AAT CAT TTA AAA ACA ATG TCT 280
Ala Ser Leu Lys Glu Thr Phe Ala Ile Asn His Leu Lys Thr Met Ser
15 20 25

TTT AAA TGG CTC ACT CTG GCT TTT TTG GGC GTG TTT TTA AGC ATC TTC 328
Phe Lys Trp Leu Thr Leu Ala Phe Leu Gly Val Phe Leu Ser Ile Phe
30 35 40
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TTA AGC CCT TTT TAAAAGTTAG CCATAAGGCT AAAAGCACGG CTAAAGCGT 14 0 5 

Leu Ser Pro Phe 



(2) INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



Met Gln Val Lys Glu Asn Lys Gln Leu Cys Leu Ile Ser Leu Gly Cys
1 5 10 15

Ser Lys Asn Leu Val Asp Ser Glu Val Met Leu Gly Lys Leu Tyr Asn
20 25 30

Tyr Thr Leu Thr Asn Asp Ala Lys Ser Ala Asp Val Ile Leu Ile Asn
35 40 45

Thr Cys Gly Phe Ile Glu Ser Ala Lys Gln Glu Ser Ile Gln Thr Ile
50 55 60

Leu Asn Ala Ala Lys Asp Lys Lys Glu Gly Ala Ile Leu Ile Ala Ser
65 70 75 80

Gly Cys Leu Ser Glu Arg Tyr Lys Asp Glu Ile Lys Glu Leu Ile Pro
85 90 95

Glu Val Asp Ile Phe Thr Gly Val Gly Asp Tyr Asp Lys Ile Asp Ile
100 105 110

Met Ile Ala Lys Lys Gln Asn Gln Phe Ser Glu Gln Val Phe Leu Ser
115 120 125

Glu His Tyr Asn Ala Arg Ile Ile Thr Gly Ser Ser Val His Ala Tyr
130 135 140

Val Lys Ile Ser Glu Gly Cys Asn Gln Lys Cys Ser Phe Cys Ala Ile
145 150 155 160

Pro Ser Phe Lys Gly Lys Leu Gln Ser Arg Glu Leu Asp Ser Ile Leu
165 170 175

Lys Glu Val Glu Asn Leu Ala Leu Lys Gly Tyr Thr Asp Met Thr Phe
180 185 190

Ile Ala Gln Asp Ser Ser Ser Phe Leu Tyr Asp Lys Gly Gln Lys Asp
195 200 205

Gly Leu Ile Gln Leu Ile Arg Ala Ile Asp Lys Gln Gln Ala Leu Lys
210 215 220

Ser Ala Arg Ile Leu Tyr Leu Tyr Pro Ser Ser Thr Thr Leu Glu Leu
225 230 235 240

Ile Gly Ala Ile Glu Ser Ser Pro Ile Phe Gln Asn Tyr Phe Asp Met
245 250 255

Pro Ile Gln His Ile Ser Asp Ser Met Leu Lys Lys Met Arg Arg Asn
260 265 270

Ser Ser Gln Ala His His Leu Lys Leu Leu Asp Ala Met Lys Gln Val
275 280 285

Lys Glu Ser Phe Ile Arg Ser Thr Ile Ile Val Gly His Pro Glu Glu
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CAG CTC ATT AGA GCG ATT GAT AAA CAG CAA GCC TTA AAG AGC GCG CGT 73 0 

Gln Leu Ile Arg Ala Ile Asp Lys Gln Gln Ala Leu Lys Ser Ala Arg
215 220 225

ATT TTA TAT CTC TAC CCC TCT AGC ACC ACG CTA GAG CTT ATT GGC GCG 778
Ile Leu Tyr Leu Tyr Pro Ser Ser Thr Thr Leu Glu Leu Ile Gly Ala
230 235 240

ATT GAA AGT TCG CCC ATT TTT CAA AAT TAT TTT GAC ATG CCC ATC CAG 826
Ile Glu Ser Ser Pro Ile Phe Gln Asn Tyr Phe Asp Met Pro Ile Gln
245 250 255

CAC ATC AGC GAC TCC ATG CTC AAA AAG ATG CGG CGC AAC TCT AGC CAA 874
His Ile Ser Asp Ser Met Leu Lys Lys Met Arg Arg Asn Ser Ser Gln
260 265 270 275

GCG CAC CAT TTA AAG CTT TTA GAT GCC ATG AAG CAG GTT AAA GAA AGC 922
Ala His His Leu Lys Leu Leu Asp Ala Met Lys Gln Val Lys Glu Ser
280 285 290

TTT ATC AGA AGC ACG ATC ATT GTA GGG CAT CCG GAA GAA AAT GAG AGC 970
Phe Ile Arg Ser Thr Ile Ile Val Gly His Pro Glu Glu Asn Glu Ser
295 300 305

GAA TTT GAA GAA TTG AGC GCG TTT TTA GAC GAG TTC CAG TTT GAT AGA 1018
Glu Phe Glu Glu Leu Ser Ala Phe Leu Asp Glu Phe Gln Phe Asp Arg
310 315 320

TTG AAT ATT TTT GCT TTC AGC GCT GAA GAA AAC ACG CAT GCC TAT TCT 1066
Leu Asn Ile Phe Ala Phe Ser Ala Glu Glu Asn Thr His Ala Tyr Ser
325 330 335

TTA GAA AAA GTG CCT AAA AAA ACC ATC AAC GCT CGC ATC AAA GCC TTG 1114
Leu Glu Lys Val Pro Lys Lys Thr Ile Asn Ala Arg Ile Lys Ala Leu
340 345 350 355

AAT AAA ATC GCT TTA AAG CAC CAA AAC CAT TCC TTT AAG GCT TTG TTG 1162
Asn Lys Ile Ala Leu Lys His Gln Asn His Ser Phe Lys Ala Leu Leu
360 365 370

AAT AAG CCC ATT AAG GCG TTA GTG GAA AAT AAA GAG GGC GAG TAT TTT 1210
Asn Lys Pro Ile Lys Ala Leu Val Glu Asn Lys Glu Gly Glu Tyr Phe
375 380 385

TAC AAA GCA AGG GAT CTC AGA TGG GCG CCT GAA GTG GAT GGG GAA ATC 1258
Tyr Lys Ala Arg Asp Leu Arg Trp Ala Pro Glu Val Asp Gly Glu Ile
390 395 400

TTG ATC AAT GAT AGC GAA CTA ACC ACC CCC TTA AAA CCC GGG CAT TAT 1306
Leu Ile Asn Asp Ser Glu Leu Thr Thr Pro Leu Lys Pro Gly His Tyr
405 410 415

ACG ATT GCA CCT AGC GAA TTT AAA GAT AAT ATC CTA CTC GCT AAG GTT 1354
Thr Ile Ala Pro Ser Glu Phe Lys Asp Asn Ile Leu Leu Ala Lys Val
420 425 430 435
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ATTGATTTCT TTTGGGAAGT GGATTTGAAA AACTTAAAAA GCCATTAAC ATG CAA GTT 58 

Met Gln Val
1

AAA GAA AAC AAA CAA CTC TGC TTA ATT TCA TTA GGT TGC TCT AAA AAT 106
Lys Glu Asn Lys Gln Leu Cys Leu Ile Ser Leu Gly Cys Ser Lys Asn
5 10 15

TTG GTG GAT TCA GAG GTG ATG TTA GGC AAG CTT TAT AAT TAC ACG CTC 154
Leu Val Asp Ser Glu Val Met Leu Gly Lys Leu Tyr Asn Tyr Thr Leu
20 25 30 35

ACT AAT GAC GCT AAG AGC GCT GAT GTG ATT TTG ATC AAC ACT TGC GGG 202
Thr Asn Asp Ala Lys Ser Ala Asp Val Ile Leu Ile Asn Thr Cys Gly
40 45 50

TTT ATT GAA AGC GCT AAA CAA GAG AGT ATC CAA ACC ATT CTC AAC GCC 250
Phe Ile Glu Ser Ala Lys Gln Glu Ser Ile Gln Thr Ile Leu Asn Ala
55 60 65

GCC AAA GAC AAA AAA GAG GGA GCG ATT TTG ATT GCG AGC GGG TGC TTG 298
Ala Lys Asp Lys Lys Glu Gly Ala Ile Leu Ile Ala Ser Gly Cys Leu
70 75 80

AGC GAA CGC TAT AAA GAT GAA ATC AAA GAA TTG ATC CCT GAA GTG GAT 346
Ser Glu Arg Tyr Lys Asp Glu Ile Lys Glu Leu Ile Pro Glu Val Asp
85 90 95

ATT TTT ACC GGC GTG GGG GAT TAT GAC AAG ATC GAT ATA ATG ATT GCT 394
Ile Phe Thr Gly Val Gly Asp Tyr Asp Lys Ile Asp Ile Met Ile Ala
100 105 110 115

AAA AAA CAA AAC CAG TTC AGC GAG CAA GTG TTT TTA AGC GAG CAT TAC 442
Lys Lys Gln Asn Gln Phe Ser Glu Gln Val Phe Leu Ser Glu His Tyr
120 125 130

AAC GCA CGC ATC ATC ACG GGA TCG AGC GTG CAT GCG TAT GTG AAA ATT 490
Asn Ala Arg Ile Ile Thr Gly Ser Ser Val His Ala Tyr Val Lys Ile
135 140 145

TCT GAG GGT TGC AAT CAA AAA TGT TCT TTT TGC GCT ATC CCT AGC TTT 538
Ser Glu Gly Cys Asn Gln Lys Cys Ser Phe Cys Ala Ile Pro Ser Phe
150 155 160

AAG GGG AAA TTG CAA AGC AGG GAA TTG GAC TCC ATT TTA AAA GAA GTG 586
Lys Gly Lys Leu Gln Ser Arg Glu Leu Asp Ser Ile Leu Lys Glu Val
165 170 175

GAA AAT CTC GCG CTT AAA GGC TAT ACG GAT ATG ACT TTT ATC GCT CAA 634
Glu Asn Leu Ala Leu Lys Gly Tyr Thr Asp Met Thr Phe Ile Ala Gln
180 185 190 195

GAC TCT AGC TCC TTT TTA TAC GAT AAG GGG CAA AAA GAC GGC TTG ATC 682
Asp Ser Ser Ser Phe Leu Tyr Asp Lys Gly Gln Lys Asp Gly Leu Ile
200 205 210
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(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

Met Leu Ser Pro Ala Thr Phe Lys Gln Ile Thr Leu Ala Leu Ile Ala
1 5 10 15

Ser Arg Leu Ile Val Val Ile Leu Tyr Ala Phe Ile Phe Ile Val Leu
20 25 30

Ser Phe Tyr Met Leu Asn Ile Ile Thr Ile Leu Asn Phe Lys Ala Leu
35 40 45

Ile Leu Gly Phe Val Ser Val Phe Ser Ser Ala Leu Phe Cys Phe Cys
50 55 60

Leu Ala Ile Phe Val Ala Arg Ile Phe Gln Asn Glu Gln Ser Ile Leu
65 70 75 80

Gly Phe Cys Asn Ile Ile Asn Leu Tyr Ala Leu Met Ser Cys Asn Val
85 90 95

Phe Val Pro Leu Glu Tyr Leu Pro Ser Ile Gly Gln Leu Phe Ile Lys
100 105 110

Thr Ser Ile Phe Tyr Tyr Leu Asn Gln Leu Leu Ile Lys Ala Phe Gln
115 120 125

Gly Ile Asp Thr Ile Leu Val Leu Ala Thr Ser Thr Phe Phe Ile Ile
130 135 140

Gly Gly Ile Ile Leu Phe Leu Leu Ser Ala Asn Arg Met Leu Leu Thr
145 150 155 160

Pro Lys Glu Arg Met Arg
165

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 50...1366
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 70...567
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CTAGCGCGAT CTTTGGTCTC ACACAAGCTA TAACAAGCTT GAGAATCGCA TAATATATTC 60

TTGTTCTAC ATG CTA TCA CCA GCA ACT TTC AAA CAA ATA ACT CTA GCA TTA 111
Met Leu Ser Pro Ala Thr Phe Lys Gln Ile Thr Leu Ala Leu
1 5 10

ATC GCT TCA AGA CTA ATC GTT GTA ATC CTA TAT GCT TTT ATC TTT ATT 159
Ile Ala Ser Arg Leu Ile Val Val Ile Leu Tyr Ala Phe Ile Phe Ile
15 20 25 30

GTT CTC TCT TTT TAT ATG CTC AAT ATC ATC ACT ATT CTT AAT TTT AAA 207
Val Leu Ser Phe Tyr Met Leu Asn Ile Ile Thr Ile Leu Asn Phe Lys
35 40 45

GCG CTT ATT TTG GGG TTT GTT AGT GTT TTT TCA AGC GCA TTG TTT TGT 255
Ala Leu Ile Leu Gly Phe Val Ser Val Phe Ser Ser Ala Leu Phe Cys
50 55 60

TTT TGC TTG GCA ATT TTT GTA GCT AGA ATT TTT CAA AAC GAA CAA AGC 303
Phe Cys Leu Ala Ile Phe Val Ala Arg Ile Phe Gln Asn Glu Gln Ser
65 70 75

ATC TTA GGA TTT TGT AAT ATC ATC AAT CTC TAT GCG CTA ATG TCT TGT 351
Ile Leu Gly Phe Cys Asn Ile Ile Asn Leu Tyr Ala Leu Met Ser Cys
80 85 90

AAT GTT TTT GTT CCT TTA GAA TAC CTA CCT AGT ATT GGT CAA TTA TTT 399
Asn Val Phe Val Pro Leu Glu Tyr Leu Pro Ser Ile Gly Gln Leu Phe
95 100 105 110

ATC AAA ACA TCT ATT TTT TAC TAC CTT AAT CAA CTT CTA ATC AAA GCT 447
Ile Lys Thr Ser Ile Phe Tyr Tyr Leu Asn Gln Leu Leu Ile Lys Ala
115 120 125

TTT CAA GGG ATT GAT ACT ATA CTG GTT TTA GCA ACT TCA ACA TTT TTC 495
Phe Gln Gly Ile Asp Thr Ile Leu Val Leu Ala Thr Ser Thr Phe Phe
130 135 140

ATT ATT GGT GGC ATT ATT TTA TTT TTA CTA AGC GCT AAT CGC ATG TTA 543
Ile Ile Gly Gly Ile Ile Leu Phe Leu Leu Ser Ala Asn Arg Met Leu
145 150 155

CTA ACA CCA AAA GAA CGC ATG CGT TAAAGGCTTA GTCCCACCAT TGATTTATTT 597
Leu Thr Pro Lys Glu Arg Met Arg
160 165

AATGGCTCAA AAAAGGGGTA AGC 620
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Arg Leu Leu Phe Leu Leu Leu Ser Ala Ala Phe Met Leu Leu Ala
1 5 10 15

Glu Glu Lys Ile Ser Leu Asn Asp Asp Ala Pro Ile Lys Leu Val His
20 25 30

Trp Gln Asn Ala Leu Lys Glu Val Gln Pro Asp Ser Asn Ala Pro Ala
35 40 45

Thr Pro Pro Ile Lys Ala Val Gln Thr Thr Leu Thr Phe Glu Thr Pro
50 55 60

Phe Asn Lys Thr Pro Lys Ile Met Glu Val Glu Gly Gln Lys Val Ile
65 70 75 80

Val Leu Lys Asn Ala Lys Leu Asp Ser Lys Lys Thr Met Asp Phe Lys
85 90 95

Glu Ala Ser Leu Asn Ala Leu Glu Met Phe Ser Tyr Gln Asn Asp Ile
100 105 110

Tyr Leu Leu Ser Lys Lys Ala Lys Val Glu Leu Glu Ile Gln Ala Ser
115 120 125

Asn Ser Lys Asp Lys Lys Arg Leu Arg Phe Leu Phe Leu Pro Lys Gly
130 135 140

Phe His Leu Ala Pro Pro Pro Asn Leu Lys Glu Lys Ser Gln Gln Thr
145 150 155 160

Asn Leu Ala Gln Lys Asp Thr Asn Glu Gln Pro Gln Ser Pro Leu Asn
165 170 175

Thr Leu Glu Leu Lys Pro Pro Leu Asn Leu Ser His Ala Tyr Lys Ala
180 185 190

Leu Ala Val Ile Ala Ala Leu Leu Leu Ile Leu Tyr Val Ile Lys Lys
195 200 205

Lys Ile Val Pro Thr Gln Gly Ser Phe Ser Ala Lys Asp Phe Lys Leu
210 215 220

Glu Ile Ser Val Leu Gly Arg Val Asp Ala Asn His Lys Ile Ile Ser
225 230 235 240

Ile Glu Thr Asn Lys Glu Arg Tyr Leu Val Leu Leu Ser Asp Lys Tyr
245 250 255

Gly Leu Leu Leu Asp Lys Ile Ser Pro Lys Thr Ser Lys Glu Glu Leu
260 265 270

Ile Lys Glu Ala Glu Asn Asn Ile Lys Asn Ser Lys Leu Gly Asn Leu
275 280 285

Tyr Ala Gly Lys Phe
290

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
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TTG TCT AAA AAA GCT AAA GTG GAA TTA GAA ATC CAA GCT TCA AAC AGC 442 
Leu Ser Lys Lys Ala Lys Val Glu Leu Glu Ile Gln Ala Ser Asn Ser
115 120 125 130

AAG GAT AAA AAA CGG CTC CGC TTT CTC TTT TTA CCC AAA GGT TTT CAT 490
Lys Asp Lys Lys Arg Leu Arg Phe Leu Phe Leu Pro Lys Gly Phe His
135 140 145

TTA GCC CCA CCG CCT AAC CTG AAA GAA AAA TCT CAG CAA ACT AAC CTT 538
Leu Ala Pro Pro Pro Asn Leu Lys Glu Lys Ser Gln Gln Thr Asn Leu
150 155 160

GCA CAA AAA GAC ACC AAC GAG CAA CCC CAA AGC CCT TTA AAC ACT CTA 586
Ala Gln Lys Asp Thr Asn Glu Gln Pro Gln Ser Pro Leu Asn Thr Leu
165 170 175

GAG TTA AAA CCC CCA CTA AAT TTA AGC CAT GCT TAT AAG GCG CTA GCG 634
Glu Leu Lys Pro Pro Leu Asn Leu Ser His Ala Tyr Lys Ala Leu Ala
180 185 190

GTT ATT GCT GCC TTA CTC TTA ATA TTG TAT GTC ATC AAA AAA AAA ATT 682
Val Ile Ala Ala Leu Leu Leu Ile Leu Tyr Val Ile Lys Lys Lys Ile
195 200 205 210

GTT CCC ACA CAA GGG TCT TTT TCT GCA AAA GAT TTT AAG TTA GAA ATT 730
Val Pro Thr Gln Gly Ser Phe Ser Ala Lys Asp Phe Lys Leu Glu Ile
215 220 225

AGC GTT TTG GGT CGT GTT GAT GCG AAC CAT AAA ATC ATT TCA ATA GAA 778
Ser Val Leu Gly Arg Val Asp Ala Asn His Lys Ile Ile Ser Ile Glu
230 235 240

ACC AAT AAG GAG CGT TAC TTG GTC TTA CTA AGC GAT AAA TAC GGC CTG 826
Thr Asn Lys Glu Arg Tyr Leu Val Leu Leu Ser Asp Lys Tyr Gly Leu
245 250 255

CTT TTA GAC AAA ATA AGC CCA AAA ACA TCT AAA GAA GAA CTG ATT AAA 874
Leu Leu Asp Lys Ile Ser Pro Lys Thr Ser Lys Glu Glu Leu Ile Lys
260 265 270

GAA GCT GAA AAT AAT ATA AAG AAT TCA AAA TTA GGA AAT TTA TAT GCC 922
Glu Ala Glu Asn Asn Ile Lys Asn Ser Lys Leu Gly Asn Leu Tyr Ala
275 280 285 290

GGA AAA TTC TAAACTACAA CCTGCTAAGT TAGGGAAAAA TTTTGACCCT GTGGATCAT 980
Gly Lys Phe



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
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Gin Asp Thr Lys Asp Asp Glu Asp Leu Thr Lys Lys Leu Lys Ala Tyr 

195 200 205 

Ile Lys Tyr Lys Thr Asn Leu Ser Lys Ser Lys Ser Asn

210 215 220 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 980 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 53...931
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

TGAATGAAAT CCTAGATTCT AACGCCATTG TATATTATCT CGCTAAAAAT TC ATG AGA 58 

Met Arg 
1 

TTG TTA TTC TTG TTA TTG AGT GCT GCT TTT ATG TTA CTG GCT GAA GAA 106 
Leu Leu Phe Leu Leu Leu Ser Ala Ala Phe Met Leu Leu Ala Glu Glu 
5 10 15 

AAA ATA TCT TTA AAC GAT GAC GCC CCC ATT AAA CTA GTG CAT TGG CAA 154 
Lys Ile Ser Leu Asn Asp Asp Ala Pro Ile Lys Leu Val His Trp Gln
20 25 30

AAT GCA TTA AAA GAA GTC CAA CCT GAT TCA AAC GCT CCA GCA ACA CCA 202
Asn Ala Leu Lys Glu Val Gln Pro Asp Ser Asn Ala Pro Ala Thr Pro
35 40 45 50

CCT ATA AAA GCC GTG CAA ACC ACG CTC ACT TTT GAA ACG CCT TTT AAC 250
Pro Ile Lys Ala Val Gln Thr Thr Leu Thr Phe Glu Thr Pro Phe Asn
55 60 65

AAA ACG CCT AAA ATC ATG GAA GTT GAA GGG CAA AAG GTG ATC GTC TTA 298
Lys Thr Pro Lys Ile Met Glu Val Glu Gly Gln Lys Val Ile Val Leu
70 75 80

AAA AAC GCT AAA CTG GAT TCT AAA AAA ACC ATG GAT TTT AAA GAA GCC 346
Lys Asn Ala Lys Leu Asp Ser Lys Lys Thr Met Asp Phe Lys Glu Ala
85 90 95

TCT TTG AAT GCT TTA GAA ATG TTT TCC TAC CAA AAT GAC ATC TAC CTC 394
Ser Leu Asn Ala Leu Glu Met Phe Ser Tyr Gln Asn Asp Ile Tyr Leu
100 105 110

AAA ACC GGC AAA ACC ATC TAT GAA TTG CCC GGC TAT ATG CCC TCT ACG 584 
Lys Thr Gly Lys Thr Ile Tyr Glu Leu Pro Gly Tyr Met Pro Ser Thr
165 170 175

CAA TTT TTA GCC GTG TTA GAA TTT ATC GGC GAT GGG AAG TAT CAA GAC 632
Gln Phe Leu Ala Val Leu Glu Phe Ile Gly Asp Gly Lys Tyr Gln Asp
180 185 190

ACA AAA GAC GAT GAG GAT CTC ACT AAA AAA TTA AAG GCT TAC ATC AAG 680
Thr Lys Asp Asp Glu Asp Leu Thr Lys Lys Leu Lys Ala Tyr Ile Lys
195 200 205 210

TAT AAA ACC AAC CTT TCT AAA AGC AAG TCT AAC TAGGAAAGCC TAATGAAGAA 733
Tyr Lys Thr Asn Leu Ser Lys Ser Lys Ser Asn
215 220

TCTCAAAAGC CTGCTTTCTT TTTTGCTGGC TTC 766

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Met Phe Ser Leu Ser Tyr Val Ser Lys Lys Phe Leu Ser Val Leu Leu
1 5 10 15

Leu Ile Ser Leu Phe Leu Ser Ala Cys Lys Ser Asn Asn Lys Asp Lys
20 25 30

Leu Asp Glu Asn Leu Leu Ser Ser Gly Ser Gln Ser Ser Lys Glu Leu
35 40 45

Asn Asp Glu Arg Asp Asn Ile Asp Lys Lys Ser Tyr Ala Gly Leu Glu
50 55 60

Asp Val Phe Ser Asp Asn Lys Ser Ile Ser Pro Asn Asp Lys Tyr Met
65 70 75 80

Leu Leu Val Phe Gly Arg Asn Gly Cys Ser Tyr Cys Glu Arg Phe Lys
85 90 95

Lys Asp Leu Lys Asn Val Lys Glu Leu Arg Asp Tyr Ile Lys Glu His
100 105 110

Phe Ser Ala Tyr Tyr Val Asn Ile Ser Tyr Ser Lys Glu His Asp Phe
115 120 125

Lys Val Gly Asp Lys Asn Asn Glu Lys Glu Ile Lys Met Ser Thr Glu
130 135 140

Glu Leu Ala Gln Ile Tyr Ala Val Gln Ser Thr Pro Thr Ile Val Leu
145 150 155 160

Ser Asp Lys Thr Gly Lys Thr Ile Tyr Glu Leu Pro Gly Tyr Met Pro
165 170 175

Ser Thr Gln Phe Leu Ala Val Leu Glu Phe Ile Gly Asp Gly Lys Tyr
180 185 190
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...713
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

TGTAGTAAGA TACCTAGTTT TCAAATCTAT TAAAAGATAA AGGTTATTAC ATG TTT 56 

Met Phe 
1 

TCA CTT TCT TAT GTT TCC AAG AAA TTT TTA AGC GTT TTA TTA TTG ATT 104
Ser Leu Ser Tyr Val Ser Lys Lys Phe Leu Ser Val Leu Leu Leu Ile
5 10 15

TCG CTG TTT TTA AGC GCT TGC AAA TCC AAC AAT AAA GAC AAG TTA GAC 152
Ser Leu Phe Leu Ser Ala Cys Lys Ser Asn Asn Lys Asp Lys Leu Asp
20 25 30

GAA AAT CTT TTA AGC TCT GGC TCT CAA AGC TCC AAA GAA TTA AAC GAT 200
Glu Asn Leu Leu Ser Ser Gly Ser Gln Ser Ser Lys Glu Leu Asn Asp
35 40 45 50

GAG CGA GAC AAT ATA GAC AAA AAG AGT TAC GCT GGT TTA GAA GAT GTT 248
Glu Arg Asp Asn Ile Asp Lys Lys Ser Tyr Ala Gly Leu Glu Asp Val
55 60 65

TTT TCA GAC AAT AAG TCC ATT AGT CCT AAC GAT AAA TAC ATG CTT TTA 296
Phe Ser Asp Asn Lys Ser Ile Ser Pro Asn Asp Lys Tyr Met Leu Leu
70 75 80

GTT TTT GGC CGT AAT GGT TGC TCC TAT TGC GAA AGG TTT AAA AAA GAT 344
Val Phe Gly Arg Asn Gly Cys Ser Tyr Cys Glu Arg Phe Lys Lys Asp
85 90 95

CTC AAA AAT GTC AAA GAA TTG CGC GAC TAC ATT AAA GAG CAT TTT AGC 392
Leu Lys Asn Val Lys Glu Leu Arg Asp Tyr Ile Lys Glu His Phe Ser
100 105 110

GCT TAC TAT GTC AAT ATC AGC TAC TCC AAA GAG CAT GAT TTT AAA GTC 440
Ala Tyr Tyr Val Asn Ile Ser Tyr Ser Lys Glu His Asp Phe Lys Val
115 120 125 130

GGC GAT AAA AAT AAT GAA AAA GAA ATC AAA ATG TCC ACA GAA GAA TTA 488
Gly Asp Lys Asn Asn Glu Lys Glu Ile Lys Met Ser Thr Glu Glu Leu
135 140 145

GCG CAA ATT TAT GCC GTC CAA TCC ACC CCT ACG ATT GTT TTA TCC GAT 536
Ala Gln Ile Tyr Ala Val Gln Ser Thr Pro Thr Ile Val Leu Ser Asp
150 155 160
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Gly Phe Phe Val Gly Ala Leu Leu Phe Tyr Trp Cys Ala Leu Ser Phe
65 70 75 80

Arg Tyr Ser Asp Phe Thr Tyr Leu Leu Pro Leu Ile Ile Val Leu Ile
85 90 95

Ala Leu Val Tyr Gly Val Leu Phe Tyr Leu Leu Leu Tyr Phe Glu Asn
100 105 110

Pro Tyr Phe Arg Leu Leu Ser Phe Leu Gly Ser Ser Phe Ile His Pro
115 120 125

Phe Gly Phe Asp Trp Leu Val Pro Asp Ser Phe Phe Ser Tyr Ser Val
130 135 140

Phe Arg Val Asp Lys Leu Ser Leu Gly Leu Val Phe Leu Ala Cys Ile
145 150 155 160

Phe Leu Ser Thr Lys Pro Leu Lys Lys Tyr Arg Ile Ile Gly Val Leu
165 170 175

Leu Leu Leu Gly Ala Leu Asp Phe Asn Gly Phe Lys Thr Ser Asp Leu
180 185 190

Lys Lys Val Gly Asn Ile Glu Leu Val Ser Thr Lys Thr Pro Gln Asp
195 200 205

Leu Lys Phe Asp Ser Ser Tyr Leu Asn Asp Ile Glu Asn Asn Ile Leu
210 215 220

Lys Glu Ile Lys Leu Ala Gln Ser Lys Gln Lys Thr Leu Ile Val Phe
225 230 235 240

Pro Glu Thr Ala Tyr Pro Ile Ala Leu Glu Asn Ser Pro Phe Lys Ala
245 250 255

Lys Leu Glu Asp Leu Ser Asp Asn Ile Ala Ile Leu Ile Gly Thr Leu
260 265 270

Arg Thr Gln Gly Tyr Asn Leu Tyr Asn Ser Ser Phe Leu Phe Ser Lys
275 280 285

Glu Ser Val Gln Ile Ala Asp Lys Val Ile Leu Ala Pro Phe Gly Glu
290 295 300

Thr Met Pro Leu Pro Glu Phe Leu Gln Lys Pro Leu Glu Lys Leu Phe
305 310 315 320

Phe Gly Glu Ser Thr Tyr Leu Tyr Arg Asn Ala Pro His Phe Ser Asp
325 330 335

Phe Thr Leu Asp Asp Phe Thr Phe Arg Pro Leu Ile Cys Tyr Glu Gly
340 345 350

Thr Ser Lys Pro Ala Tyr Ser Asn Ser Pro Ser Lys Ile Phe Ile Val
355 360 365

Met Ser Asn Asn Ala Trp Phe Ser Pro Ser Ile Glu Pro Thr Leu Gln
370 375 380

Arg Thr Leu Leu Lys Tyr Tyr Ala Arg Arg Tyr Asp Lys Ile Ile Leu
385 390 395 400

His Ser Ala Asn Phe Ser Thr Ser Tyr Ile Leu Ser Pro Ser Leu Leu
405 410 415

Gly Asp Ile Leu Phe Arg Lys Arg Ser
420 425

















(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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TTT CTT CAA AAA CCC CTT GAA AAG CTC TTT TTT GGC GAG AGC ACT TAT 1074 
Phe Leu Gln Lys Pro Leu Glu Lys Leu Phe Phe Gly Glu Ser Thr Tyr
315 320 325

TTA TAC CGC AAT GCT CCT CAT TTC AGC GAT TTT ACA TTA GAC GAT TTT 1122
Leu Tyr Arg Asn Ala Pro His Phe Ser Asp Phe Thr Leu Asp Asp Phe
330 335 340

ACT TTT CGC CCC CTG ATT TGC TAT GAA GGC ACT TCC AAA CCC GCT TAT 1170
Thr Phe Arg Pro Leu Ile Cys Tyr Glu Gly Thr Ser Lys Pro Ala Tyr
345 350 355

TCA AAC AGC CCT TCA AAA ATT TTT ATC GTG ATG AGC AAT AAC GCA TGG 1218
Ser Asn Ser Pro Ser Lys Ile Phe Ile Val Met Ser Asn Asn Ala Trp
360 365 370

TTT AGC CCA AGC ATT GAA CCC ACC TTA CAA AGA ACG CTT TTA AAA TAC 1266
Phe Ser Pro Ser Ile Glu Pro Thr Leu Gln Arg Thr Leu Leu Lys Tyr
375 380 385 390

TAC GCA AGG CGT TAT GAT AAG ATC ATC TTG CAC AGC GCG AAC TTT TCA 1314
Tyr Ala Arg Arg Tyr Asp Lys Ile Ile Leu His Ser Ala Asn Phe Ser
395 400 405

ACT TCT TAC ATC TTA AGC CCT AGT TTA TTA GGC GAT ATT CTT TTT AGG 1362
Thr Ser Tyr Ile Leu Ser Pro Ser Leu Leu Gly Asp Ile Leu Phe Arg
410 415 420

AAA CGA TCA TGATTAAAGC GATTAATATT TCTCATGCTT TTGAAAAGCC TCTTTATAA 1420
Lys Arg Ser
425

TGGCG 1425

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Met Arg Leu Leu Leu Phe Asn Gln Asn Ala Phe Leu Leu Ala Cys Met
1 5 10 15

Phe Val Ser Ser Val Tyr Val Asn Ala Val Leu Asp Ala Tyr Ala Ile
20 25 30

Glu Asn Pro Tyr Ile Ser Ile Thr Leu Thr Ser Leu Leu Ala Pro Leu
35 40 45

Ser Met Leu Ala Phe Leu Lys Thr Pro Arg Asn Ser Ala Phe Ala Leu
50 55 60






Tyr Leu Leu Pro Leu Ile Ile Val Leu Ile Ala Leu Val Tyr Gly Val
90 95 100

TTA TTT TAT TTG TTG CTC TAT TTT GAA AAC CCC TAT TTC AGG CTT TTG 450
Leu Phe Tyr Leu Leu Leu Tyr Phe Glu Asn Pro Tyr Phe Arg Leu Leu
105 110 115

ACT TTT TTA GGC TCT AGT TTT ATC CAC CCC TTT GGA TTT GAT TGG TTA 498
Ser Phe Leu Gly Ser Ser Phe Ile His Pro Phe Gly Phe Asp Trp Leu
120 125 130

GTC CCA GAT AGC TTT TTT TCT TAT AGC GTG TTT AGA GTG GAT AAA TTA 546
Val Pro Asp Ser Phe Phe Ser Tyr Ser Val Phe Arg Val Asp Lys Leu
135 140 145 150

TCG CTA GGG CTT GTT TTT TTG GCT TGC ATT TTT TTG AGC ACT AAA CCA 594
Ser Leu Gly Leu Val Phe Leu Ala Cys Ile Phe Leu Ser Thr Lys Pro
155 160 165

TTG AAA AAA TAT AGG ATC ATA GGG GTT TTA TTG TTA CTT GGC GCG TTG 642
Leu Lys Lys Tyr Arg Ile Ile Gly Val Leu Leu Leu Leu Gly Ala Leu
170 175 180

GAT TTT AAT GGT TTC AAA ACA AGC GAT TTA AAA AAG GTT GGA AAT ATT 690
Asp Phe Asn Gly Phe Lys Thr Ser Asp Leu Lys Lys Val Gly Asn Ile
185 190 195

GAA TTA GTC TCT ACA AAA ACG CCC CAA GAT TTG AAA TTT GAC TCA AGT 738
Glu Leu Val Ser Thr Lys Thr Pro Gln Asp Leu Lys Phe Asp Ser Ser
200 205 210

TAC CTT AAT GAT ATT GAA AAC AAC ATT CTT AAA GAA ATC AAG CTC GCT 786
Tyr Leu Asn Asp Ile Glu Asn Asn Ile Leu Lys Glu Ile Lys Leu Ala
215 220 225 230

CAA AGC AAG CAA AAA ACC TTG ATT GTT TTT CCA GAA ACC GCC TAC CCC 834
Gln Ser Lys Gln Lys Thr Leu Ile Val Phe Pro Glu Thr Ala Tyr Pro
235 240 245

ATC GCT TTA GAA AAC TCC CCC TTT AAA GCG AAG CTA GAA GAT TTA AGC 882
Ile Ala Leu Glu Asn Ser Pro Phe Lys Ala Lys Leu Glu Asp Leu Ser
250 255 260

GAT AAT ATT GCT ATT TTA ATA GGG ACA TTA CGG ACT CAA GGC TAT AAT 930
Asp Asn Ile Ala Ile Leu Ile Gly Thr Leu Arg Thr Gln Gly Tyr Asn
265 270 275

CTT TAT AAC AGC TCG TTT TTA TTT TCT AAA GAA AGC GTT CAG ATC GCT 978
Leu Tyr Asn Ser Ser Phe Leu Phe Ser Lys Glu Ser Val Gln Ile Ala
280 285 290

GAT AAA GTA ATT TTA GCC CCC TTT GGC GAG ACC ATG CCT TTA CCG GAA 1026
Asp Lys Val Ile Leu Ala Pro Phe Gly Glu Thr Met Pro Leu Pro Glu
295 300 305 310
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Arg Gln Ala Leu Lys Ile Leu Ala Ile Asn Tyr Leu Lys Asn Asn Thr
180 185 190

Leu Asn Val Glu Ser Lys Ala Lys Gly Gly Phe Val Val Val Asp Thr
195 200 205

Lys Asn Leu Lys Thr Pro Gly Val Val Val Val Lys Val Phe Leu Glu
210 215 220

Asp Glu Ile His Thr Phe Lys Ile Asp Ile Ser Lys Met
225 230 235



(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 97...1371
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

TAAAAAAAAC TACCCTTAAA AAAATCAATC TAAAATTCTT AAATTAAAAT ATAGCTATAA 60 

TACACTAAAA CAATCTCAAG GTTTCAAAAT TTAGCC ATG CGT CTT CTT CTG TTC 114 

Met Arg Leu Leu Leu Phe 
1 5 

AAT CAA AAC GCT TTT TTA TTA GCG TGC ATG TTT GTT TCA AGC GTG TAT 162
Asn Gln Asn Ala Phe Leu Leu Ala Cys Met Phe Val Ser Ser Val Tyr
10 15 20

GTG AAC GCT GTC TTA GAC GCT TAT GCA ATT GAA AAC CCC TAT ATT TCT 210
Val Asn Ala Val Leu Asp Ala Tyr Ala Ile Glu Asn Pro Tyr Ile Ser
25 30 35

ATC ACA CTC ACA AGC CTA TTA GCC CCT TTA AGC ATG CTA GCG TTT TTA 258
Ile Thr Leu Thr Ser Leu Leu Ala Pro Leu Ser Met Leu Ala Phe Leu
40 45 50

AAA ACC CCT AGA AAT AGT GCT TTT GCT TTG GGG TTT TTT GTG GGG GCG 306
Lys Thr Pro Arg Asn Ser Ala Phe Ala Leu Gly Phe Phe Val Gly Ala
55 60 65 70

TTA TTG TTT TAT TGG TGC GCT TTA AGC TTT CGC TAC TCG GAT TTC ACT 354
Leu Leu Phe Tyr Trp Cys Ala Leu Ser Phe Arg Tyr Ser Asp Phe Thr
75 80 85

TAT TTA TTG CCC TTA ATC ATT GTT TTA ATA GCG TTA GTT TAT GGG GTT 402
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165 170 175 

GCT TTA AAA ATC CTA GCG ATC AAT TAC CTT AAA AAC AAC ACC CTT AAT 632
Ala Leu Lys Ile Leu Ala Ile Asn Tyr Leu Lys Asn Asn Thr Leu Asn
180 185 190 

GTT GAG AGT AAG GCT AAG GGA GGG TTT GTG GTG GTG GAT ACC AAA AAC 680 
Val Glu Ser Lys Ala Lys Gly Gly Phe Val Val Val Asp Thr Lys Asn 
195 200 205 210 

CTT AAA ACC CCG GGT GTG GTG GTG GTT AAA GTC TTT TTA GAA GAT GAA 728

Leu Lys Thr Pro Gly Val Val Val Val Lys Val Phe Leu Glu Asp Glu 
215 220 225 

ATC CAC ACC TTT AAA ATT GAT ATT TCT AAG ATG TAATCGCCCC CTTTAATAAA 781 
Ile His Thr Phe Lys Ile Asp Ile Ser Lys Met
230 235 

AGCCTTTGGG CCATCCACCT AAAGGTTTTT 811 

(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

Met Lys Lys Ile Val Leu Val Ala Ile Ala Leu Leu Met Ser Ala Cys
1 5 10 15

Ala Ser Tyr Lys Ile Thr Pro Glu His Val Thr Ser Tyr Asn Asn Gly
20 25 30

Ile Gln Val Met Thr Ser Thr Gln Ala Lys Ser Lys Val Gln Leu Glu
35 40 45

Ile Ala Gln Ser Lys Leu Lys Gly Leu Asn Glu Ser Pro Leu Val Leu
50 55 60

Tyr Val Ala Ala Gln Val Ile Glu Gly Ser Pro Val Val Phe Ser Arg
65 70 75 80

Lys Ala Ile Ser Val Ser Ile Asn Gln Thr Asn Leu Pro Val Leu Ser
85 90 95

Leu Arg Gln Val Met Lys Ser Ser Phe Asp Phe Glu Gly Ile Leu Gln
100 105 110

Ser Phe Asn Ile Ala Val Pro Thr Thr Pro Ile Asp Asn Val Asn Met
115 120 125

Ile Thr Pro Pro Met Phe Tyr Tyr Gly Gln Gly Gly Phe Leu Ala Tyr
130 135 140

Asn Gly Met Met Tyr Gly Gly Met Gly Met Tyr Gly Pro Gly Phe Gly
145 150 155 160

Met Met Met Met Asp Asp Val Glu Glu Gln Glu Val Met Gln Glu Ser
165 170 175
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(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 761 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

TATAATAAGG AAATTCTAAA CGAAAATTAA ACTGAATGAA AGGAGTTTGA ATG AAA 56
Met Lys
1

AAA ATC GTT TTA GTA GCG ATA GCC TTA TTG ATG AGC GCT TGC GCG AGC 104
Lys Ile Val Leu Val Ala Ile Ala Leu Leu Met Ser Ala Cys Ala Ser
5 10 15

TAT AAG ATC ACG CCT GAA CAT GTT ACT TCC TAT AAT AAT GGG ATT CAA 152
Tyr Lys Ile Thr Pro Glu His Val Thr Ser Tyr Asn Asn Gly Ile Gln
20 25 30

GTG ATG ACT TCC ACG CAA GCC AAA TCT AAA GTC CAG CTA GAA ATC GCT 200
Val Met Thr Ser Thr Gln Ala Lys Ser Lys Val Gln Leu Glu Ile Ala
35 40 45 50

CAA AGC AAG TTG AAA GGC TTG AAC GAG TCC CCC TTA GTG CTG TAT GTA 248
Gln Ser Lys Leu Lys Gly Leu Asn Glu Ser Pro Leu Val Leu Tyr Val
55 60 65

GCG GCG CAA GTT ATA GAG GGA AGT CCT GTG GTG TTT AGC CGT AAA GCC 296
Ala Ala Gln Val Ile Glu Gly Ser Pro Val Val Phe Ser Arg Lys Ala
70 75 80

ATT TCA GTG TCT ATC AAC CAA ACG AAT TTA CCG GTC TTA AGC CTG AGA 344
Ile Ser Val Ser Ile Asn Gln Thr Asn Leu Pro Val Leu Ser Leu Arg
85 90 95

CAG GTG ATG AAA TCC AGT TTT GAT TTT GAG GGT ATT TTA CAA AGT TTT 392
Gln Val Met Lys Ser Ser Phe Asp Phe Glu Gly Ile Leu Gln Ser Phe
100 105 110

AAT ATC GCC GTG CCG ACC ACC CCT ATT GAT AAT GTC AAT ATG ATC ACC 440
Asn Ile Ala Val Pro Thr Thr Pro Ile Asp Asn Val Asn Met Ile Thr
115 120 125 130

CCG CCT ATG TTT TAT TAC GGG CAA GGG GGA TTT TTA GCT TAT AAC GGC 488
Pro Pro Met Phe Tyr Tyr Gly Gln Gly Gly Phe Leu Ala Tyr Asn Gly
135 140 145

ATG ATG TAT GGG GGA ATG GGC ATG TAT GGG CCA GGC TTT GGC ATG ATG 536
Met Met Tyr Gly Gly Met Gly Met Tyr Gly Pro Gly Phe Gly Met Met
150 155 160

ATG ATG GAT GAT GTA GAA GAG CAA GAA GTC ATG CAA GAA AGC CGC CAA 584
Met Met Asp Asp Val Glu Glu Gln Glu Val Met Gln Glu Ser Arg Gln
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(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:



Met Gly Leu Ala Leu Glu Lys Val Cys Phe Leu Gly Val Ile Phe Leu
1 5 10 15

Ile Ser Ala Cys Thr Val Lys Lys Glu Gly Val Lys Asn Leu Ser Tyr
20 25 30

Lys His Glu Ser Leu Arg Ala Tyr Glu Asn Ala Lys Asp Tyr Asp Pro
35 40 45

Thr Thr Lys Lys Ala Ala Tyr Lys Arg Asn Phe Phe Glu Arg His Phe
50 55 60

Lys Arg Tyr Ser Asp Ser Gln Asp Ser Asn Thr Lys Asp Gln Pro Leu
65 70 75 80

Asp Asn Gly Met Arg Asp Ser Ser Ser Ile Gln Arg Ala Thr Met Arg
85 90 95

Pro Tyr Gln Val Gly Gly Lys Trp Tyr Tyr Pro Thr Lys Val Asp Leu
100 105 110

Gly Glu Lys Phe Asp Gly Val Ala Ser Trp Tyr Gly Pro Asn Phe His
115 120 125

Ala Lys Lys Thr Ser Asn Gly Glu Ile Tyr Asn Met Tyr Ala His Thr
130 135 140

Ala Ala His Lys Thr Leu Pro Met Asn Thr Val Val Lys Val Ile Asn
145 150 155 160

Val Asp Asn Asn Leu Ser Thr Ile Val Arg Ile Asn Asp Arg Gly Pro
165 170 175

Phe Val Ser Asp Arg Ile Ile Asp Leu Ser Asn Ala Ala Ala Arg Asp
180 185 190

Ile Asp Met Val Lys Lys Gly Thr Ala Ser Val Arg Leu Ile Val Leu
195 200 205

Gly Phe Gly Gly Val Ile Ser Thr Gln Tyr Glu Gln Ser Phe Asn Ala
210 215 220

Ser Ser Ser Lys Ile Leu His Lys Glu Phe Lys Val Gly Glu Ser Glu
225 230 235 240

Lys Ser Val Ser Gly Gly Lys Phe Ser Leu Gln Met Gly Ala Phe Arg
245 250 255

Asn Gln Ile Gly Ala Gln Thr Leu Ala Asp Lys Leu Gln Ala Glu Asn
260 265 270

Pro Asn Tyr Ser Val Lys Val Ala Phe Lys Asp Asp Leu Tyr Lys Val
275 280 285

Leu Val Gln Gly Phe Gln Ser Glu Glu Glu Ala Arg Asp Phe Met Lys
290 295 300

Lys Tyr Asn Gln Asn Ala Val Leu Thr Arg Glu
305 310 315



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
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GCC GCG CAC AAA ACT TTA CCC ATG AAC ACC GTG GTG AAA GTC ATC AAT 600

Ala Ala His Lys Thr Leu Pro Met Asn Thr Val Val Lys Val Ile Asn
145 150 155 160

GTT GAT AAT AAC TTA AGC ACC ATT GTG CGC ATC AAC GAT AGA GGG CCT 648
Val Asp Asn Asn Leu Ser Thr Ile Val Arg Ile Asn Asp Arg Gly Pro
165 170 175

TTT GTG AGC GAT CGC ATC ATT GAT TTG TCT AAT GCG GCC GCT AGG GAT 696
Phe Val Ser Asp Arg Ile Ile Asp Leu Ser Asn Ala Ala Ala Arg Asp
180 185 190

ATT GAC ATG GTT AAA AAA GGC ACA GCC AGC GTG CGT CTC ATT GTT TTG 744
Ile Asp Met Val Lys Lys Gly Thr Ala Ser Val Arg Leu Ile Val Leu
195 200 205

GGC TTT GGT GGG GTT ATC TCC ACG CAA TAC GAA CAA TCC TTT AAC GCC 792
Gly Phe Gly Gly Val Ile Ser Thr Gln Tyr Glu Gln Ser Phe Asn Ala
210 215 220

AGC TCT TCA AAG ATC TTG CAC AAG GAA TTT AAA GTC GGC GAG AGC GAA 840
Ser Ser Ser Lys Ile Leu His Lys Glu Phe Lys Val Gly Glu Ser Glu
225 230 235 240

AAA AGC GTG AGC GGA GGG AAA TTT TCT TTG CAA ATG GGG GCT TTT AGA 888
Lys Ser Val Ser Gly Gly Lys Phe Ser Leu Gln Met Gly Ala Phe Arg
245 250 255

AAC CAA ATA GGT GCT CAA ACT TTA GCG GAT AAA TTG CAA GCA GAA AAT 936
Asn Gln Ile Gly Ala Gln Thr Leu Ala Asp Lys Leu Gln Ala Glu Asn
260 265 270

CCA AAT TAC AGC GTC AAG GTT GCT TTT AAA GAC GAT TTG TAT AAA GTT 984
Pro Asn Tyr Ser Val Lys Val Ala Phe Lys Asp Asp Leu Tyr Lys Val
275 280 285

TTA GTT CAA GGG TTT CAA AGC GAA GAA GAG GCT AGG GAT TTT ATG AAA 1032
Leu Val Gln Gly Phe Gln Ser Glu Glu Glu Ala Arg Asp Phe Met Lys
290 295 300

AAA TAC AAC CAG AAT GCG GTT TTA ACG AGA GAA TGATTAAGTT ATTGCTTTTA 1085
Lys Tyr Asn Gln Asn Ala Val Leu Thr Arg Glu
305 310 315

GATGTGGATG GCACGCTCAC AGACGGATCG TTGTAT 1121
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 121... 1065 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

CTATCAAGTC AGTATTTCCA ATATCCAATT AGCCAATGAT CTCAAAGATT CTAATATTTT 60 
TATCCACCAG CGTTTAATCA TCCCCACCAA CAAAAAATTA CTCGCTACAA GGGAATTTTA 120

ATG GGT TTG GCG TTG GAA AAA GTT TGT TTT TTA GGC GTT ATT TTT TTG 168
Met Gly Leu Ala Leu Glu Lys Val Cys Phe Leu Gly Val Ile Phe Leu
1 5 10 15

ATT AGC GCT TGC ACG GTT AAA AAA GAG GGG GTA AAG AAT TTG TCT TAC 216
Ile Ser Ala Cys Thr Val Lys Lys Glu Gly Val Lys Asn Leu Ser Tyr
20 25 30

AAG CAT GAA AGC TTG CGC GCT TAT GAA AAC GCT AAA GAT TAT GAT CCG 264
Lys His Glu Ser Leu Arg Ala Tyr Glu Asn Ala Lys Asp Tyr Asp Pro
35 40 45

ACA ACC AAA AAA GCC GCC TAT AAA CGC AAT TTT TTT GAA CGC CAT TTC 312
Thr Thr Lys Lys Ala Ala Tyr Lys Arg Asn Phe Phe Glu Arg His Phe
50 55 60

AAA CGC TAC TCC GAT TCG CAA GAT AGC AAC ACA AAA GAT CAG CCA CTA 360
Lys Arg Tyr Ser Asp Ser Gln Asp Ser Asn Thr Lys Asp Gln Pro Leu
65 70 75 80

GAT AAC GGC ATG CGC GAT TCT AGC TCG ATC CAA AGA GCC ACC ATG CGC 408
Asp Asn Gly Met Arg Asp Ser Ser Ser Ile Gln Arg Ala Thr Met Arg
85 90 95

CCT TAT CAA GTG GGG GGC AAG TGG TAT TAC CCC ACT AAA GTG GAT TTA 456
Pro Tyr Gln Val Gly Gly Lys Trp Tyr Tyr Pro Thr Lys Val Asp Leu
100 105 110

GGC GAA AAA TTT GAT GGC GTT GCG AGT TGG TAT GGC CCT AAC TTC CAT 504
Gly Glu Lys Phe Asp Gly Val Ala Ser Trp Tyr Gly Pro Asn Phe His
115 120 125

GCC AAA AAA ACC AGT AAT GGG GAA ATT TAT AAC ATG TAT GCC CAC ACC 552
Ala Lys Lys Thr Ser Asn Gly Glu Ile Tyr Asn Met Tyr Ala His Thr
130 135 140
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Asn Thr Lys Ile Thr Lys Asn Asn Pro Ile Asn Gln Ala Asn Asn Asp
35 40 45 50 



ATA AGA AAA ATT GAG CAT GAA GAA GAA GAT GAA AAA GCC ACC AAA GAA 248
Ile Arg Lys Ile Glu His Glu Glu Glu Asp Glu Lys Ala Thr Lys Glu
55 60 65

GTG AAC GAT TTG ATC AAT AAC GAA AAT AAA ATT GAT GAA ATC AAT AAT 296
Val Asn Asp Leu Ile Asn Asn Glu Asn Lys Ile Asp Glu Ile Asn Asn
70 75 80

GAA GAA AAC GCT GAT CCT TCG CAA AAA AGA ACG AAC AAC GTT TTG CAA 344
Glu Glu Asn Ala Asp Pro Ser Gln Lys Arg Thr Asn Asn Val Leu Gln
85 90 95

CGA GCC ACT AAC CAC CAA GAC AAT CTC AAT TCC CCA CTC AAC AGG AAG 392
Arg Ala Thr Asn His Gln Asp Asn Leu Asn Ser Pro Leu Asn Arg Lys
100 105 110

TAT TAAAGTGTGA AACTTTTTTC AAAGGATTTA TTTATAAAAG TAACCCCTTT ATT 448
Tyr
115



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:



Met Lys Ser Lys Ile Thr His Phe Ile Ala Ile Ser Phe Val Leu Ser
1 5 10 15

Leu Phe Ser Ala Cys Lys Asp Glu Pro Lys Lys Ser Ser Gln Ser His
20 25 30

Gln Asn Asn Thr Lys Ile Thr Lys Asn Asn Pro Ile Asn Gln Ala Asn
35 40 45

Asn Asp Ile Arg Lys Ile Glu His Glu Glu Glu Asp Glu Lys Ala Thr
50 55 60

Lys Glu Val Asn Asp Leu Ile Asn Asn Glu Asn Lys Ile Asp Glu Ile
65 70 75 80

Asn Asn Glu Glu Asn Ala Asp Pro Ser Gln Lys Arg Thr Asn Asn Val
85 90 95

Leu Gln Arg Ala Thr Asn His Gln Asp Asn Leu Asn Ser Pro Leu Asn
100 105 110

Arg Lys Tyr
115



(2) INFORMATION FOR SEQ ID NO: 53: 
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65 70 75 80

Ile Thr His Gly Lys His Gln Lys Trp Ile Asp Leu Pro Arg Asn Met
85 90 95

Leu Lys Thr Met Phe Met Gln Glu Ala Gln Lys Ala Cys Leu Gly Val
100 105 110

Ala Leu Pro Pro Tyr Gly Ala Gly Ala Pro Thr Tyr Ala Val Arg Phe
115 120 125

Thr Ile Leu Ser Phe Ser Leu Leu Glu Lys Glu Asn Ser Thr Tyr Arg
130 135 140

Ala Glu Phe Ala Leu Gly Tyr Asp Ile Ser Val Lys Gly Asp Ser His
145 150 155 160

Ser Gly Val Ile Ile Lys His Glu Asn Ile Ser Ser Leu Glu Asn Lys
165 170 175

Thr Thr Lys Thr Ser Lys Asn Gly Asn Gln Asp Phe Gln Glu Ser Ala
180 185 190

Ile Gln Ser Leu Gln His Val Ser Val Gln Ala Ile Gln Glu Ala Val
195 200 205

Ser Leu Ile Lys Lys Ala Ile Glu Ala Gln Ser Val Ser Pro Leu Lys
210 215 220

Lys
225



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 395 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

TTTTATGATA AGATAGTCAA ATTATACATT GACTTAAGGA AATTTAATTG ATG AAA 56 

Met Lys 
1 

TCT AAA ATC ACT CAT TTT ATC GCT ATC TCT TTT GTT TTA AGC CTG TTT 104
Ser Lys Ile Thr His Phe Ile Ala Ile Ser Phe Val Leu Ser Leu Phe
5 10 15

AGC GCA TGC AAA GAC GAG CCT AAA AAA TCG TCT CAA TCG CAC CAA AAC 152
Ser Ala Cys Lys Asp Glu Pro Lys Lys Ser Ser Gln Ser His Gln Asn
20 25 30

AAC ACT AAA ATC ACT AAA AAC AAT CCA ATC AAT CAA GCG AAT AAT GAT 200
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Cys Leu Gly Val Ala Leu Pro Pro Tyr Gly Ala Gly Ala Pro Thr Tyr 
110 115 120 

GCG GTT CGT TTT ACG ATT TTA TCG TTT TCT CTT TTA GAA AAA GAA AAT 494
Ala Val Arg Phe Thr Ile Leu Ser Phe Ser Leu Leu Glu Lys Glu Asn
125 130 135 140

TCT ACC TAT AGG GCG GAA TTT GCA CTA GGC TAT GAC ATT AGC GTG AAA 542
Ser Thr Tyr Arg Ala Glu Phe Ala Leu Gly Tyr Asp Ile Ser Val Lys
145 150 155

GGC GAT TCG CAT TCT GGG GTG ATC ATT AAG CAT GAA AAT ATT TCT AGC 590
Gly Asp Ser His Ser Gly Val Ile Ile Lys His Glu Asn Ile Ser Ser
160 165 170

TTG GAA AAT AAA ACG ACC AAA ACG AGT AAA AAT GGC AAT CAA GAT TTT 638
Leu Glu Asn Lys Thr Thr Lys Thr Ser Lys Asn Gly Asn Gln Asp Phe
175 180 185

CAA GAA AGC GCG ATA CAA TCT CTC CAA CAT GTA AGC GTG CAA GCG ATT 686
Gln Glu Ser Ala Ile Gln Ser Leu Gln His Val Ser Val Gln Ala Ile
190 195 200

CAA GAA GCG GTT TCT TTG ATT AAA AAA GCC ATT GAA GCG CAA AGC GTA 734
Gln Glu Ala Val Ser Leu Ile Lys Lys Ala Ile Glu Ala Gln Ser Val
205 210 215 220

AGC CCG TTA AAA AAA TAAAAAATAA GGAGGAATTG TTTGATTTTA CGATTGGCTG G 790
Ser Pro Leu Lys Lys
225

AGCAAGCGTT T 801
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Arg Ala Thr Ala Ile Lys Ile Phe Ser Leu Ser Ser Ala Leu Ala
1 5 10 15

Leu Leu Leu His Gly Cys Leu Ser Ile Asn Leu Lys Gln Met Leu Pro
20 25 30

Glu Ile Arg Thr Tyr Asp Leu Asn Ala Ser Ser Phe Glu Ile Thr Gln
35 40 45

Cys Ala Lys Pro Leu Thr Glu Val Arg Leu Ile Ser Ile Leu Ser Ala
50 55 60

Asp Leu Phe Asn Thr Lys Glu Ile Val Phe Lys Ala Lys Asp Gly Gln
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Leu Glu Arg Ile Asp Glu Thr Leu Glu Lys Glu Thr Lys Pro Lys Pro

195 200 205 

Ser His Met Pro Trp Tyr Val Leu Ile Phe Asp Trp
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 801 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 75... 749 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GAAAAAGGCT CTGCTTTGAT AGATAAATTT GACGCTAACC CCTATAAAAC GATTTTTGGA 60 
GAAAGGAAAT AATC ATG AGA GCT ACG GCG ATA AAA ATC TTT TCA CTC TCA 110 
Met Arg Ala Thr Ala Ile Lys Ile Phe Ser Leu Ser
1 5 10

TCA GCA TTA GCC CTA TTG CTT CAT GGT TGC TTG AGC ATC AAT TTA AAA 158
Ser Ala Leu Ala Leu Leu Leu His Gly Cys Leu Ser Ile Asn Leu Lys
15 20 25

CAA ATG CTA CCA GAG ATC AGA ACT TAC GAT TTG AAT GCG AGT TCT TTT 206
Gln Met Leu Pro Glu Ile Arg Thr Tyr Asp Leu Asn Ala Ser Ser Phe
30 35 40

GAA ATC ACG CAA TGC GCT AAA CCT TTG ACT GAA GTG AGG CTC ATT AGT 254
Glu Ile Thr Gln Cys Ala Lys Pro Leu Thr Glu Val Arg Leu Ile Ser
45 50 55 60

ATT TTG AGC GCG GAT TTA TTC AAC ACT AAA GAG ATC GTT TTT AAA GCC 302
Ile Leu Ser Ala Asp Leu Phe Asn Thr Lys Glu Ile Val Phe Lys Ala
65 70 75

AAA GAC GGG CAG ATC ACG CAT GGG AAG CAC CAA AAA TGG ATA GAC TTG 350
Lys Asp Gly Gln Ile Thr His Gly Lys His Gln Lys Trp Ile Asp Leu
80 85 90

CCT CGC AAC ATG CTA AAA ACC ATG TTC ATG CAA GAA GCG CAA AAA GCA 398
Pro Arg Asn Met Leu Lys Thr Met Phe Met Gln Glu Ala Gln Lys Ala
95 100 105

TGC TTA GGC GTG GCT TTG CCT CCT TAT GGC GCG GGT GCA CCC ACT TAT 446
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ATC AAA TTC ATT TTA GGG CAA AAT GAG CTC AAT CGC GCG ATC GCG AAT 5 84 

Ile Lys Phe Ile Leu Gly Gln Asn Glu Leu Asn Arg Ala Ile Ala Asn
165 170 175

GTC TAT AAA AAA CGC CAC AAG CCT GAG GGC GTG AAA CGC TAT TTA GAA 632
Val Tyr Lys Lys Arg His Lys Pro Glu Gly Val Lys Arg Tyr Leu Glu
180 185 190

AGG ATA GAT GAG ACT TTA GAA AAA GAG ACT AAA CCC AAA CCA TCG CAC 680
Arg Ile Asp Glu Thr Leu Glu Lys Glu Thr Lys Pro Lys Pro Ser His
195 200 205 210

ATG CCT TGG TAT GTG TTA ATT TTT GAT TGG TAGGATATTT CAAAACCATA CAC 733
Met Pro Trp Tyr Val Leu Ile Phe Asp Trp
215 220 

ATTATAACAG AGAGATGAAA AATGACTGAA 763 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Met Arg Leu Lys His Phe Lys Thr Phe Leu Phe Ile Thr Met Ala Ile
1 5 10 15

Ile Val Ile Gly Thr Gly Cys Ala Asn Lys Lys Lys Lys Lys Asp Glu
20 25 30

Tyr Asn Lys Pro Ala Ile Phe Trp Tyr Gln Gly Ile Leu Arg Glu Ile
35 40 45

Leu Phe Ala Asn Leu Glu Thr Ala Asp Asn Tyr Tyr Ser Ser Leu Gln
50 55 60

Ser Glu His Ile Asn Ser Pro Leu Val Pro Glu Ala Met Leu Ala Leu
65 70 75 80

Gly Gln Ala His Met Lys Lys Lys Glu Tyr Val Leu Ala Ser Phe Tyr
85 90 95

Phe Asp Glu Tyr Ile Lys Arg Phe Gly Thr Lys Asp Asn Val Asp Tyr
100 105 110

Leu Thr Phe Leu Lys Leu Gln Ser His Tyr Tyr Ala Phe Lys Asn His
115 120 125

Ser Lys Asp Gln Glu Phe Ile Ser Asn Ser Ile Val Ser Leu Gly Glu
130 135 140

Phe Ile Glu Lys Tyr Pro Asn Ser Arg Tyr Arg Pro Tyr Val Glu Tyr
145 150 155 160

Met Gln Ile Lys Phe Ile Leu Gly Gln Asn Glu Leu Asn Arg Ala Ile
165 170 175

Ala Asn Val Tyr Lys Lys Arg His Lys Pro Glu Gly Val Lys Arg Tyr
180 185 190
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(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51... 710 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

AAAAAATAAC CATGAGTTAT TCAAAAATTT AACTTTATAA GACAGGTGGC ATG CGT 56 

Met Arg 
1 

TTA AAA CAT TTT AAA ACT TTC CTT TTT ATC ACA ATG GCA ATC ATT GTA 104
Leu Lys His Phe Lys Thr Phe Leu Phe Ile Thr Met Ala Ile Ile Val
5 10 15

ATA GGT ACC GGT TGC GCG AAC AAA AAG AAA AAA AAA GAC GAA TAC AAC 152
Ile Gly Thr Gly Cys Ala Asn Lys Lys Lys Lys Lys Asp Glu Tyr Asn
20 25 30

AAA CCG GCG ATC TTT TGG TAT CAA GGG ATT TTG AGA GAA ATC CTT TTT 200
Lys Pro Ala Ile Phe Trp Tyr Gln Gly Ile Leu Arg Glu Ile Leu Phe
35 40 45 50

GCT AAT TTA GAA ACA GCG GAC AAT TAC TAT TCT TCT TTA CAA AGC GAA 248
Ala Asn Leu Glu Thr Ala Asp Asn Tyr Tyr Ser Ser Leu Gln Ser Glu
55 60 65

CAC ATC AAT TCC CCC CTT GTC CCA GAA GCG ATG CTA GCT TTA GGG CAA 296
His Ile Asn Ser Pro Leu Val Pro Glu Ala Met Leu Ala Leu Gly Gln
70 75 80

GCG CAC ATG AAA AAG AAA GAG TAT GTT TTA GCG TCT TTT TAC TTT GAT 344
Ala His Met Lys Lys Lys Glu Tyr Val Leu Ala Ser Phe Tyr Phe Asp
85 90 95

GAA TAC ATC AAG CGC TTT GGG ACT AAG GAC AAT GTG GAT TAT TTG ACT 392
Glu Tyr Ile Lys Arg Phe Gly Thr Lys Asp Asn Val Asp Tyr Leu Thr
100 105 110

TTT TTA AAA TTG CAA TCG CAT TAT TAC GCT TTC AAA AAC CAT TCT AAA 440
Phe Leu Lys Leu Gln Ser His Tyr Tyr Ala Phe Lys Asn His Ser Lys
115 120 125 130

GAC CAG GAA TTT ATC TCT AAT TCT ATT GTG AGT TTA GGC GAA TTT ATA 488
Asp Gln Glu Phe Ile Ser Asn Ser Ile Val Ser Leu Gly Glu Phe Ile
135 140 145

GAA AAA TAC CCT AAC AGC CGT TAC CGC CCC TAT GTA GAA TAC ATG CAA 536
Glu Lys Tyr Pro Asn Ser Arg Tyr Arg Pro Tyr Val Glu Tyr Met Gln
150 155 160
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AAT GCA GAG TGG CTT TAAGGCATGA AAAAGATTGC ATTTTTTATT TTTGTCATTT T 74 5 
Asn Ala Glu Trp Leu 
205 

GTTTTCGGTA GGGATTTATT TAATTTGGCA TGTTTTATTG GAAAAAGCCC TAGAATTGAA 805
ATTAGCAACC TCAGCTAATG ATTTGCTTTT 835

(2) INFORMATION FOR SEQ ID NO: 46:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



Met Leu Arg Val Leu Ser Val Gly Val Ala Phe Ile Leu Leu Gly Cys
1 5 10 15

Gln Phe Phe Asn Lys Thr Thr Leu His Leu Lys Tyr Lys Asp Tyr Pro
20 25 30

Lys Asn Ser Ala Leu Lys Thr Ala Phe Thr Leu Thr Pro Pro Lys Ile
35 40 45

Phe Phe Asn Ala Arg Phe Val Pro Pro Phe Tyr Gln Lys Glu Phe Lys
50 55 60

Lys Ala Ile Thr Gln Gln Ile Ala Tyr Phe Leu Lys Asp Lys Ser Ala
65 70 75 80

Phe Ile Leu Asn Val Ser Gly Asn Val Phe Phe Ser Phe Glu Glu Asn
85 90 95

Pro Lys Asp Leu Lys Ala Ile Lys Glu Arg Leu Lys Lys Thr Ile Glu
100 105 110

Pro Asn Ala Asp Pro Lys Ala Val Met Arg Phe Leu Asn Leu Gln Ala
115 120 125

Ser Leu Ile Leu Glu Cys Val Pro Gln Thr Thr Cys Pro Phe Asp Thr
130 135 140

Leu Leu Ile Pro Thr Ala Phe Ser Val Pro Val Tyr Tyr Ala Asn Arg
145 150 155 160

Leu Gly Asp Asn Pro Ser Leu Phe Ser Gln Glu Asp Lys Thr Tyr His
165 170 175

Asn Ala Leu Ile Lys Ala Leu Asn Lys Ala Tyr Tyr Ser Leu Met Glu
180 185 190

Gly Leu Glu Lys Arg Leu Asn Ala Ile Lys Asn Ala Glu Trp Leu
195 200 205



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

TTGATCATTC TTATTTCGCA CAACCCAAGC ACGCTAAAAT TAGCCACTAA GCATGTGAAA 60 
TTAGAGCATG GGCGTTTGAC AGA ATG CTA AGG GTT TTA AGC GTT GGT GTT GCT 113 

Met Leu Arg Val Leu Ser Val Gly Val Ala 
1 5 10

TTT ATT TTA CTA GGG TGT CAG TTT TTC AAC AAA ACG ACG CTG CAT TTA 161
Phe Ile Leu Leu Gly Cys Gln Phe Phe Asn Lys Thr Thr Leu His Leu
15 20 25

AAA TAT AAA GAT TAC CCC AAA AAT AGC GCT TTA AAA ACC GCT TTC ACT 209
Lys Tyr Lys Asp Tyr Pro Lys Asn Ser Ala Leu Lys Thr Ala Phe Thr
30 35 40

TTA ACC CCC CCT AAA ATC TTT TTT AAC GCC CGT TTT GTG CCG CCC TTT 257
Leu Thr Pro Pro Lys Ile Phe Phe Asn Ala Arg Phe Val Pro Pro Phe
45 50 55

TAC CAA AAA GAA TTT AAA AAA GCG ATC ACC CAA CAA ATC GCT TAT TTT 305
Tyr Gln Lys Glu Phe Lys Lys Ala Ile Thr Gln Gln Ile Ala Tyr Phe
60 65 70

TTA AAA GAT AAA AGT GCT TTT ATT CTC AAT GTT TCA GGC AAT GTT TTT 353
Leu Lys Asp Lys Ser Ala Phe Ile Leu Asn Val Ser Gly Asn Val Phe
75 80 85 90

TTT TCT TTT GAA GAG AAT CCT AAA GAT TTA AAA GCC ATT AAA GAA AGG 401
Phe Ser Phe Glu Glu Asn Pro Lys Asp Leu Lys Ala Ile Lys Glu Arg
95 100 105

CTT AAA AAG ACG ATT GAG CCT AAC GCT GAC CCA AAA GCC GTC ATG CGT 449
Leu Lys Lys Thr Ile Glu Pro Asn Ala Asp Pro Lys Ala Val Met Arg
110 115 120

TTT TTA AAC CTT CAA GCG AGC TTG ATT TTA GAA TGC GTC CCG CAA ACC 497
Phe Leu Asn Leu Gln Ala Ser Leu Ile Leu Glu Cys Val Pro Gln Thr
125 130 135

ACT TGC CCG TTT GAC ACC CTT TTA ATC CCC ACC GCT TTC AGC GTG CCT 545
Thr Cys Pro Phe Asp Thr Leu Leu Ile Pro Thr Ala Phe Ser Val Pro
140 145 150

GTT TAT TAC GCT AAT CGT TTG GGC GAT AAC CCC TCT CTT TTT TCC CAA 593
Val Tyr Tyr Ala Asn Arg Leu Gly Asp Asn Pro Ser Leu Phe Ser Gln
155 160 165 170

GAG GAT AAA ACC TAT CAT AAC GCT TTG ATC AAA GCC CTT AAT AAG GCT 641
Glu Asp Lys Thr Tyr His Asn Ala Leu Ile Lys Ala Leu Asn Lys Ala
175 180 185

TAC TAT TCT CTT ATG GAG GGT TTA GAA AAG CGT TTG AAC GCT ATA AAA 689
Tyr Tyr Ser Leu Met Glu Gly Leu Glu Lys Arg Leu Asn Ala Ile Lys
190 195 200



100 105 110

Ile Ile Gln Lys Gln Glu Arg Arg Phe Ser Cys Ser Asp Phe His Asn
115 120 125

Pro Glu Leu Lys Glu Gln Cys Met Asp Lys Thr Asn Ala Tyr Glu Lys
130 135 140

Gln Lys Asp Arg Gln Lys Arg Leu Ile Asn Leu Val Gln Leu Glu Ala
145 150 155 160

Phe Glu Lys Glu Tyr Ala Gln Tyr Lys Pro Tyr Ile Ile Pro Tyr Phe
165 170 175

Thr Lys Glu Cys Val Lys Asn Ala Pro His Leu Ala Asn Lys Glu Arg
180 185 190

Leu Cys Gln Lys Glu Val His Glu Lys Phe Asp Asp Pro Tyr Ser Ser
195 200 205

Ser Lys Glu Leu Ser Val Gln Ser Ala Ile Ser Phe Cys Ile Lys Lys
210 215 220

Val Asp Ala Lys Leu Glu Lys Ala Ala Leu Met Asn Gly Val Tyr Ile
225 230 235 240

Ser Pro Tyr Lys Lys Ser Thr His Cys Gln Arg Thr His Leu Glu Asn
245 250 255

Lys Ser Leu Lys Glu Ile Ala Leu Asn Met Asn Pro Lys Leu Glu Lys
260 265 270

Gln Ser Pro Phe Ile Asp Ala Asp Lys Met Ala Met Gln Ser Ala Gly
275 280 285

Leu Leu Arg Lys Asn Lys Gly Val Leu Ile Ala Phe Ala Thr Asp Ile
290 295 300

Cys Met Glu Arg Asn Glu His Lys Lys Glu Glu Phe Ile Ser Leu Lys
305 310 315 320

Asp Ser Cys Thr Gln Ser Gln Ala Lys Ile Tyr Asn Asn Lys Glu Arg
325 330 335

Phe Asp Lys Phe Ile Gln Asp Tyr Gln Lys Asp Leu Lys Thr Cys Leu
340 345 350

Leu Asp Thr Ser Asn Thr Lys Glu Glu Val Glu Gln Asn Phe Ser Gln
355 360 365

Cys Gln Lys Glu Gln Leu Arg Asp Asp Asn Lys Gly Leu Gly Phe Thr
370 375 380

Leu Glu Glu Leu Val Lys Lys Tyr Ala Lys
385 390



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic RNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 84... 704 
(D) OTHER INFORMATION: 
Arg Lys Asn Lys Gly Val Leu Ile Ala Phe Ala Thr Asp Ile Cys Met
295 300 305

GAG CGT AAC GAA CAT AAA AAA GAA GAG TTT ATC AGC CTT AAA GAT AGT 1016
Glu Arg Asn Glu His Lys Lys Glu Glu Phe Ile Ser Leu Lys Asp Ser
310 315 320

TGC ACC CAA TCG CAA GCC AAA ATC TAT AAC AAC AAG GAG CGC TTT GAC 1064
Cys Thr Gln Ser Gln Ala Lys Ile Tyr Asn Asn Lys Glu Arg Phe Asp
325 330 335

AAA TTC ATA CAA GAT TAC CAA AAA GAC TTA AAA ACT TGT CTT TTA GAC 1112
Lys Phe Ile Gln Asp Tyr Gln Lys Asp Leu Lys Thr Cys Leu Leu Asp
340 345 350

ACT TCT AAC ACT AAA GAA GAA GTG GAG CAA AAT TTT TCA CAA TGC CAA 1160
Thr Ser Asn Thr Lys Glu Glu Val Glu Gln Asn Phe Ser Gln Cys Gln
355 360 365 370

AAA GAG CAA TTG AGA GAT GAT AAC AAA GGC TTG GGT TTC ACT TTA GAA 1208
Lys Glu Gln Leu Arg Asp Asp Asn Lys Gly Leu Gly Phe Thr Leu Glu
375 380 385

GAA TTG GTT AAA AAA TAC GCT AAG TAAAGTTATT TAATTTTATG GATGGTTTTA 1262
Glu Leu Val Lys Lys Tyr Ala Lys
390

AAAATCCATT CCATAGTTAT TGT 1285

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Lys Glu Thr Arg Leu Leu Lys Leu Arg Ala Leu Ser Leu Ala Cys
1 5 10 15

Leu Met Gly Leu Gly Val Ser Gly Cys Ala Phe Leu Asp Lys Gln Ile
20 25 30

Leu Asn Asp His Leu Thr Lys Ala Lys Asn Asn Pro Lys Tyr Asp Cys
35 40 45

Gln Lys Glu Met Trp Ser Phe Pro Lys Lys Tyr Asp Gly Ile Asn Gln
50 55 60

Cys Leu Lys Ala Gln Glu Glu Leu Ile Glu Pro Ile Ile Thr Lys Lys
65 70 75 80

Ile Asp Gln Tyr Gln Cys Asp Asp Phe Thr Asn Glu Gly Leu Lys Asp
85 90 95

Lys Cys Phe Lys Arg Asn Asp Ala Tyr Leu Asn Thr Leu Leu Thr Pro 
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70 75 80 

CAG TAT CAA TGC GAT GAT TTC ACT AAT GAA GGC TTA AAA GAT AAG TGT 344
Gln Tyr Gln Cys Asp Asp Phe Thr Asn Glu Gly Leu Lys Asp Lys Cys
85 90 95

TTC AAA AGA AAC GAT GCC TAC TTA AAC ACC CTT TTA ACG CCC ATC ATT 392
Phe Lys Arg Asn Asp Ala Tyr Leu Asn Thr Leu Leu Thr Pro Ile Ile
100 105 110

CAA AAA CAA GAG CGT CGT TTT AGC TGC TCT GAT TTC CAT AAC CCA GAG 440
Gln Lys Gln Glu Arg Arg Phe Ser Cys Ser Asp Phe His Asn Pro Glu
115 120 125 130

CTA AAA GAA CAA TGC ATG GAT AAA ACT AAC GCT TAT GAA AAG CAA AAA 488
Leu Lys Glu Gln Cys Met Asp Lys Thr Asn Ala Tyr Glu Lys Gln Lys
135 140 145

GAC CGA CAA AAA AGA CTA ATT AAT CTC GTG CAA TTA GAA GCG TTT GAA 536
Asp Arg Gln Lys Arg Leu Ile Asn Leu Val Gln Leu Glu Ala Phe Glu
150 155 160

AAA GAA TAC GCG CAA TAT AAA CCA TAC ATT ATC CCT TAC TTC ACC AAA 584
Lys Glu Tyr Ala Gln Tyr Lys Pro Tyr Ile Ile Pro Tyr Phe Thr Lys
165 170 175

GAA TGC GTT AAA AAT GCG CCC CAT TTA GCC AAC AAG GAA AGA CTA TGC 632
Glu Cys Val Lys Asn Ala Pro His Leu Ala Asn Lys Glu Arg Leu Cys
180 185 190

CAA AAA GAA GTG CAT GAA AAA TTT GAC GAC CCT TAT TCT AGC TCT AAA 680
Gln Lys Glu Val His Glu Lys Phe Asp Asp Pro Tyr Ser Ser Ser Lys
195 200 205 210

GAA TTG AGC GTT CAA TCG GCT ATT TCT TTT TGC ATT AAA AAA GTT GAT 728
Glu Leu Ser Val Gln Ser Ala Ile Ser Phe Cys Ile Lys Lys Val Asp
215 220 225

GCT AAA TTA GAA AAA GCC GCT CTT ATG AAT GGC GTT TAT ATA AGC CCT 776
Ala Lys Leu Glu Lys Ala Ala Leu Met Asn Gly Val Tyr Ile Ser Pro
230 235 240

TAT AAA AAA TCC ACC CAT TGC CAA AGA ACG CAT TTG GAA AAT AAG AGC 824
Tyr Lys Lys Ser Thr His Cys Gln Arg Thr His Leu Glu Asn Lys Ser
245 250 255

TTG AAA GAA ATC GCT TTA AAT ATG AAC CCT AAA TTA GAA AAG CAA AGC 872
Leu Lys Glu Ile Ala Leu Asn Met Asn Pro Lys Leu Glu Lys Gln Ser
260 265 270

CCT TTT ATT GAT GCG GAT AAA ATG GCT ATG CAA TCT GCG GGG TTA TTG 920
Pro Phe Ile Asp Ala Asp Lys Met Ala Met Gln Ser Ala Gly Leu Leu
275 280 285 290

AGA AAG AAT AAA GGT GTC TTG ATT GCT TTT GCT ACA GAT ATT TGC ATG 968
Thr Asn Ala Gln Leu Ile Pro Pro Ser Ala Asn Gly Leu Gln Ala Pro
35 40 45

Ile Tyr Pro Pro Thr Asn Phe Thr Pro Arg Lys Ser Ile Gln Pro Leu
50 55 60

Pro Ser Pro Arg Leu Glu Asn Asn Asp Gln Pro Val Ile Ser Ser Asn
65 70 75 80

Pro Thr Asn Ala Ile Pro Asn Thr Pro Ile Leu Thr Pro Asn Asn Val
85 90 95

Ile Glu Leu Asn Ala Trp Ala Trp Ala Trp Leu Gln Asn Pro Pro Phe

100 105 110 

His Pro Leu Lys Pro Trp Leu 
115 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 1232
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

GTTGTGATTT TATTGTGTTT TCATATCAAT TTTCATATCA AGGAGTTTAA ATG AAA 56 

Met Lys 
1 

GAA ACA AGA CTT TTA AAA TTG AGA GCG TTG AGC TTA GCA TGT TTA ATG 104
Glu Thr Arg Leu Leu Lys Leu Arg Ala Leu Ser Leu Ala Cys Leu Met
5 10 15

GGA TTA GGC GTG AGT GGG TGC GCG TTT TTA GAT AAG CAA ATC TTA AAC 152
Gly Leu Gly Val Ser Gly Cys Ala Phe Leu Asp Lys Gln Ile Leu Asn
20 25 30

GAC CAT TTG ACT AAA GCT AAA AAT AAC CCA AAA TAC GAT TGC CAA AAA 200
Asp His Leu Thr Lys Ala Lys Asn Asn Pro Lys Tyr Asp Cys Gln Lys
35 40 45 50

GAA ATG TGG TCT TTC CCT AAA AAA TAC GAT GGG ATA AAT CAG TGT TTA 248
Glu Met Trp Ser Phe Pro Lys Lys Tyr Asp Gly Ile Asn Gln Cys Leu
55 60 65

AAG GCT CAA GAA GAG CTT ATT GAA CCA ATC ATC ACT AAA AAG ATC GAT 296
Lys Ala Gln Glu Glu Leu Ile Glu Pro Ile Ile Thr Lys Lys Ile Asp
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AATCCCTTCA AAAATGATAT AATAGACTTG ATGAACTCAT TTTAAGGAAA ATG CCC 56 

Met Pro 
1 

ATG CGT TTG CAC ACT GCC TTT TTT GGT ATT AAT TCA TTG CTT GTT GCC 104
Met Arg Leu His Thr Ala Phe Phe Gly Ile Asn Ser Leu Leu Val Ala
5 10 15

TCT CTT TTG ATA AGC GGT TGC AGT CTC TTT AAA AAG CGT AAC ACT AAC 152
Ser Leu Leu Ile Ser Gly Cys Ser Leu Phe Lys Lys Arg Asn Thr Asn
20 25 30

GCC CAG CTA ATC CCC CCT TCA GCT AAT GGC TTG CAA GCC CCC ATT TAT 200
Ala Gln Leu Ile Pro Pro Ser Ala Asn Gly Leu Gln Ala Pro Ile Tyr
35 40 45 50

CCC CCA ACC AAT TTC ACC CCT AGA AAG AGC ATT CAG CCT CTC CCA AGC 248
Pro Pro Thr Asn Phe Thr Pro Arg Lys Ser Ile Gln Pro Leu Pro Ser
55 60 65

CCT CGC CTT GAG AAT AAC GAT CAG CCC GTC ATT AGT TCT AAC CCC ACT 296
Pro Arg Leu Glu Asn Asn Asp Gln Pro Val Ile Ser Ser Asn Pro Thr
70 75 80

AAC GCT ATC CCT AAC ACC CCC ATT CTC ACG CCT AAT AAT GTC ATT GAA 344
Asn Ala Ile Pro Asn Thr Pro Ile Leu Thr Pro Asn Asn Val Ile Glu
85 90 95

TTG AAC GCA TGG GCA TGG GCG TGG CTC CAG AAT CCA CCA TTT CAC CCT 392
Leu Asn Ala Trp Ala Trp Ala Trp Leu Gln Asn Pro Pro Phe His Pro
100 105 110

CTC AAG CCC TGG CTC TAGCCAAGCG GGCGGCTATC GTTGATGGCT ACCGCCAGTT G 448
Leu Lys Pro Trp Leu
115

GGTGAAAAAA TG 460

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Met Pro Met Arg Leu His Thr Ala Phe Phe Gly Ile Asn Ser Leu Leu
1 5 10 15

Val Ala Ser Leu Leu Ile Ser Gly Cys Ser Leu Phe Lys Lys Arg Asn
20 25 30
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(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

Met Arg Ile Lys Ala Tyr Phe Leu Arg Phe Ile Ala Leu Val Leu Ile
1 5 10 15

Val Leu Leu Gly Phe Ser Ala Cys Lys Asn Ser Gln Lys Ser Gln Asp
20 25 30

Ser Gln Asn Asn Thr Pro Gln Gln Asp Ser Pro Lys Thr Tyr Thr Ala
35 40 45

Met Asp Leu Asn Asn Gln Glu Tyr Thr Ile Thr Gly Asp Leu Asp Ser
50 55 60

Leu Asn Ile Ser Pro Asp Ser Asn Thr Pro Thr Leu Leu Val Leu Ser
65 70 75 80

Ala Leu Asp Asn Ser Leu Lys Asp Tyr Ala Pro Ser Phe Asn Ile Leu
85 90 95

Lys Lys Thr Phe Lys Asp Arg Leu Arg Val Leu Ile Leu Leu Asn Lys
100 105 110

Pro Tyr Ser Ser Asp Ala Ile Lys Asp Phe Ser Ala His Phe Gln Ala
115 120 125

Asp Leu Met Ile Leu Asn Pro Lys Asp Thr Ala Leu Phe Asp His Leu
130 135 140

Lys Tyr Asp Ala Leu Asn His Ser Phe Asn Met Leu Leu Tyr His Lys
145 150 155 160

His Gln Leu Ile Lys Met Tyr Gln Gly Ile Val Pro Ile Glu Met Leu
165 170 175

Gln Phe Asp Ile Ser Asn Leu Lys Asp
180 185 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 407 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
ATT AAG GCT TAT TTT TTG CGT TTT ATC GCG CTG GTT TTG ATT GTT TTG 104
Ile Lys Ala Tyr Phe Leu Arg Phe Ile Ala Leu Val Leu Ile Val Leu
5 10 15

TTA GGT TTT AGT GCT TGT AAA AAT TCT CAA AAA TCT CAA GAT TCT CAA 152
Leu Gly Phe Ser Ala Cys Lys Asn Ser Gln Lys Ser Gln Asp Ser Gln
20 25 30

AAC AAT ACC CCC CAA CAA GAT AGC CCT AAA ACC TAC ACC GCT ATG GAT 200
Asn Asn Thr Pro Gln Gln Asp Ser Pro Lys Thr Tyr Thr Ala Met Asp
35 40 45 50

TTG AAT AAC CAA GAA TAC ACC ATC ACA GGC GAT TTA GAT TCT CTC AAT 248
Leu Asn Asn Gln Glu Tyr Thr Ile Thr Gly Asp Leu Asp Ser Leu Asn
55 60 65

ATC AGC CCG GAT TCC AAC ACC CCT ACC CTA TTA GTT TTA AGC GCT TTA 296
Ile Ser Pro Asp Ser Asn Thr Pro Thr Leu Leu Val Leu Ser Ala Leu
70 75 80

GAT AAT TCT TTA AAA GAT TAC GCC CCC AGC TTT AAC ATC TTA AAA AAA 344
Asp Asn Ser Leu Lys Asp Tyr Ala Pro Ser Phe Asn Ile Leu Lys Lys
85 90 95

ACT TTT AAA GAT CGT TTG AGG GTG CTT ATT TTA CTC AAT AAA CCC TAT 392
Thr Phe Lys Asp Arg Leu Arg Val Leu Ile Leu Leu Asn Lys Pro Tyr
100 105 110

TCA AGC GAT GCA ATC AAA GAC TTT AGC GCG CAT TTT CAA GCT GAT TTG 440
Ser Ser Asp Ala Ile Lys Asp Phe Ser Ala His Phe Gln Ala Asp Leu
115 120 125 130

ATG ATT TTA AAC CCT AAA GAT ACC GCT CTT TTT GAT CAT TTA AAG TAT 488
Met Ile Leu Asn Pro Lys Asp Thr Ala Leu Phe Asp His Leu Lys Tyr
135 140 145

GAC GCT TTA AAC CAT TCT TTT AAC ATG CTC TTA TAC CAC AAA CAC CAA 536
Asp Ala Leu Asn His Ser Phe Asn Met Leu Leu Tyr His Lys His Gln
150 155 160

TTG ATC AAA ATG TAT CAA GGG ATC GTG CCA ATA GAA ATG CTC CAA TTT 584
Leu Ile Lys Met Tyr Gln Gly Ile Val Pro Ile Glu Met Leu Gln Phe
165 170 175

GAT ATT TCC AAT TTA AAG GAT TAAAAAAAAC CATGTTTAAT TTTTTCAAAA AAAT 639
Asp Ile Ser Asn Leu Lys Asp
180 185

TGTCAATAAA ATTAAGGGT 658

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
Phe Gly Arg Tyr Tyr Thr Ser Gly Lys Ala Leu Met Val Ala Asp Leu
225 230 235 240

Lys Tyr Glu Lys Asp Asn Leu Lys Ile Asn Pro Tyr Phe Tyr Ala Ile
245 250 255

Phe Gln Arg Met Tyr Ala Pro Gly Ile Asn Ile Thr Tyr Asp Thr Asn
260 265 270

Pro Asn Phe Asn Asn Lys Gly Phe Arg Phe Val Gly Thr Phe Val Gly
275 280 285

Phe Phe Pro Ile Phe Ala Thr Pro Ala Asn Gln Asn Asp Ile Ile Leu
290 295 300

Phe Gln Gln Val Pro Leu Gly Lys Ser Gly Gln Thr Tyr Phe Phe Arg
305 310 315 320

Thr Arg Phe Tyr Tyr Asn Lys Trp Gln Phe Gly Gly Ser Val Tyr Lys
325 330 335

Asn Ile Gly Asn Ala Asn Gly Asp Ile Gly Ile Tyr Gly Asp Pro Leu
340 345 350

Gly Tyr Asn Ile Trp Thr Asn Ser Ile Tyr Asp Ala Glu Ile Asn Asn
355 360 365

Ile Val Gly Ala Asp Val Ile Asn Gly Phe Leu Tyr Val Gly Ser Gln
370 375 380

Tyr Arg Gly Phe Ser Trp Lys Ile Leu Gly Arg Trp Thr Asp Ser Pro
385 390 395 400

Arg Ala Asp Glu Arg Ser Leu Ala Leu Phe Leu Ser Tyr Phe Ser Asn
405 410 415

Lys Tyr Asn Ile Arg Met Asp Leu Lys Leu Glu Tyr Tyr Gly Asn Ile
420 425 430

Thr Lys Lys Gly Tyr Cys Ile Gly Tyr Cys Gly Met Tyr Val Pro Val
435 440 445

Asp Pro Asn Gly Pro Gly Thr Gln Pro Leu Thr His Asn Val Tyr Ser
450 455 460

Asp Arg Ser His Ile Met Phe Asn Ile Ala Tyr Gly Phe Arg Ile Tyr
465 470 475 480 

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 605 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AATTTTAGGT TATTAGTTAC CATTTTATTA TTCTTAAGGA TGTGTTTATA ATG AGA 56 

Met Arg 
AAA GGC TAT TGT ATT GGG TAT TGT GGC ATG TAT GTT CCA GTC GAT CCT 1401
Lys Gly Tyr Cys Ile Gly Tyr Cys Gly Met Tyr Val Pro Val Asp Pro
435 440 445 450

AAC GGG CCT GGG ACA CAG CCT TTA ACG CAC AAT GTG TAT TCT GAC AGG 1449
Asn Gly Pro Gly Thr Gln Pro Leu Thr His Asn Val Tyr Ser Asp Arg
455 460 465

AGC CAT ATA ATG TTT AAC ATT GCT TAC GGT TTT AGG ATT TAC TAGCATTTT 1500
Ser His Ile Met Phe Asn Ile Ala Tyr Gly Phe Arg Ile Tyr
470 475 480 

ATCCTTAATG GATATTTTTG ATTAGCCTTT TTAAAATATT GAAA 1544 

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 480 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Pro Phe Cys Ile Phe Ile Leu Ile Ser Leu Gly Val Arg Val Leu
1 5 10 15

Glu Ile Lys Lys Tyr Phe Ser Tyr Ser Leu Phe Phe Leu Leu Phe Ser
20 25 30

Ser Leu Phe Leu Ser Lys Leu Gln Ala Tyr Lys Phe Asn Met Ser Ile
35 40 45

Val Gly Lys Val Ser Ser Tyr Thr Lys Phe Gly Phe Asn Asn Gln Arg
50 55 60

Tyr Gln Pro Ser Lys Asp Ile Tyr Pro Thr Gly Ser Tyr Thr Ser Leu
65 70 75 80

Leu Gly Glu Leu Asn Leu Ser Met Gly Leu Tyr Lys Gly Leu Arg Ala
85 90 95

Glu Val Gly Ala Met Met Ala Ala Leu Pro Tyr Asp Ser Thr Ala Tyr
100 105 110

Gln Gly Asn Asn Ile Pro Asn Gly Gln Pro Gly Ser Arg Thr Asp Pro
115 120 125

Phe Gly Ala Gly Ile Phe Trp Gln Tyr Ile Gly Trp Tyr Ala Gly His
130 135 140

Ser Gly Leu Gln Val Gln Lys Pro Arg Leu Ala Met Val His Asn Ala
145 150 155 160

Phe Leu Ser Tyr Asn Tyr Lys Lys Asp Lys Phe Ser Phe Gly Val Lys
165 170 175

Gly Gly Arg Tyr Asp Ala Glu Glu Tyr Asp Trp Phe Thr Ser Tyr Thr
180 185 190

Gln Gly Val Glu Gly Phe Val Lys Tyr Lys Asp Thr Arg Phe Arg Val
195 200 205

Met Tyr Ser Asp Ala Arg Ala Ser Ala Ser Ser Asp Trp Phe Trp Tyr
210 215 220 
TCA GAC GCT AGG GCT TCA GCG TCA AGC GAC TGG TTT TGG TAT TTT GGG 729
Ser Asp Ala Arg Ala Ser Ala Ser Ser Asp Trp Phe Trp Tyr Phe Gly
215 220 225

CGT TAC TAT ACA AGC GGT AAG GCT CTA ATG GTA GCT GAT TTG AAA TAT 777
Arg Tyr Tyr Thr Ser Gly Lys Ala Leu Met Val Ala Asp Leu Lys Tyr
230 235 240

GAA AAA GAC AAC CTA AAA ATC AAC CCT TAT TTT TAT GCG ATC TTT CAA 825
Glu Lys Asp Asn Leu Lys Ile Asn Pro Tyr Phe Tyr Ala Ile Phe Gln
245 250 255

AGA ATG TAT GCG CCA GGC ATT AAT ATC ACT TAT GAC ACC AAC CCT AAT 873
Arg Met Tyr Ala Pro Gly Ile Asn Ile Thr Tyr Asp Thr Asn Pro Asn
260 265 270

TTC AAC AAT AAG GGT TTT CGT TTT GTA GGC ACT TTC GTA GGG TTT TTC 921
Phe Asn Asn Lys Gly Phe Arg Phe Val Gly Thr Phe Val Gly Phe Phe
275 280 285 290

CCC ATT TTT GCC ACT CCG GCT AAT CAA AAT GAT ATT ATC CTC TTC CAA 969
Pro Ile Phe Ala Thr Pro Ala Asn Gln Asn Asp Ile Ile Leu Phe Gln
295 300 305

CAA GTG CCA TTA GGC AAG AGT GGG CAA ACT TAT TTC TTC CGC ACT CGT 1017
Gln Val Pro Leu Gly Lys Ser Gly Gln Thr Tyr Phe Phe Arg Thr Arg
310 315 320

TTT TAC TAT AAT AAG TGG CAA TTT GGG GGC AGC GTC TAT AAA AAC ATC 1065
Phe Tyr Tyr Asn Lys Trp Gln Phe Gly Gly Ser Val Tyr Lys Asn Ile
325 330 335

GGT AAC GCT AAT GGT GAT ATA GGT ATT TAT GGC GAC CCT TTG GGG TAT 1113
Gly Asn Ala Asn Gly Asp Ile Gly Ile Tyr Gly Asp Pro Leu Gly Tyr
340 345 350

AAC ATT TGG ACG AAT AGT ATT TAT GAC GCA GAA ATT AAC AAT ATT GTT 1161
Asn Ile Trp Thr Asn Ser Ile Tyr Asp Ala Glu Ile Asn Asn Ile Val
355 360 365 370

GGC GCT GAT GTT ATT AAC GGG TTT TTG TAT GTA GGC TCA CAA TAT AGA 1209
Gly Ala Asp Val Ile Asn Gly Phe Leu Tyr Val Gly Ser Gln Tyr Arg
375 380 385

GGG TTT AGT TGG AAA ATT TTA GGC CGT TGG ACG GAT AGC CCA AGG GCT 1257
Gly Phe Ser Trp Lys Ile Leu Gly Arg Trp Thr Asp Ser Pro Arg Ala
390 395 400

GAT GAA AGG AGT CTC GCG CTC TTT TTG AGT TAT TTT TCT AAT AAG TAT 1305
Asp Glu Arg Ser Leu Ala Leu Phe Leu Ser Tyr Phe Ser Asn Lys Tyr
405 410 415

AAT ATT AGA ATG GAT TTA AAA CTA GAA TAT TAT GGC AAT ATC ACC AAA 1353
Asn Ile Arg Met Asp Leu Lys Leu Glu Tyr Tyr Gly Asn Ile Thr Lys
420 425 430
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Met Pro 
1 

TTT TGT ATT TTT ATT TTA ATA TCT TTG GGA GTT AGG GTT TTG GAA ATT 105
Phe Cys Ile Phe Ile Leu Ile Ser Leu Gly Val Arg Val Leu Glu Ile
5 10 15

AAG AAA TAT TTT TCT TAC TCT CTA TTT TTT TTG CTT TTT TCT AGT CTC 153
Lys Lys Tyr Phe Ser Tyr Ser Leu Phe Phe Leu Leu Phe Ser Ser Leu
20 25 30

TTT TTA TCC AAA CTT CAA GCT TAT AAA TTC AAC ATG AGC ATT GTT GGA 201
Phe Leu Ser Lys Leu Gln Ala Tyr Lys Phe Asn Met Ser Ile Val Gly
35 40 45 50

AAG GTG AGC AGC TAT ACC AAG TTT GGC TTT AAC AAC CAA AGA TAC CAG 249
Lys Val Ser Ser Tyr Thr Lys Phe Gly Phe Asn Asn Gln Arg Tyr Gln
55 60 65

CCT TCT AAA GAC ATT TAT CCT ACA GGT AGC TAC ACT TCT TTA CTC GGC 297
Pro Ser Lys Asp Ile Tyr Pro Thr Gly Ser Tyr Thr Ser Leu Leu Gly
70 75 80

GAA TTG AAT TTG AGC ATG GGT TTA TAC AAG GGT TTG AGA GCG GAA GTG 345
Glu Leu Asn Leu Ser Met Gly Leu Tyr Lys Gly Leu Arg Ala Glu Val
85 90 95

GGG GCT ATG ATG GCA GCG CTC CCC TAT GAC TCT ACC GCC TAT CAA GGC 393
Gly Ala Met Met Ala Ala Leu Pro Tyr Asp Ser Thr Ala Tyr Gln Gly
100 105 110

AAC AAT ATC CCT AAC GGC CAG CCC GGC TCT AGG ACC GAT CCT TTT GGG 441
Asn Asn Ile Pro Asn Gly Gln Pro Gly Ser Arg Thr Asp Pro Phe Gly
115 120 125 130

GCG GGT ATC TTT TGG CAA TAT ATT GGT TGG TAT GCG GGG CAT AGT GGT 489
Ala Gly Ile Phe Trp Gln Tyr Ile Gly Trp Tyr Ala Gly His Ser Gly
135 140 145

TTG CAA GTG CAA AAA CCT CGT TTA GCC ATG GTG CAT AAC GCT TTT TTG 537
Leu Gln Val Gln Lys Pro Arg Leu Ala Met Val His Asn Ala Phe Leu
150 155 160

AGC TAC AAC TAC AAA AAA GAC AAA TTC AGT TTT GGC GTG AAA GGG GGG 585
Ser Tyr Asn Tyr Lys Lys Asp Lys Phe Ser Phe Gly Val Lys Gly Gly
165 170 175

CGC TAT GAC GCT GAA GAG TAT GAT TGG TTC ACT TCT TAC ACT CAA GGG 633
Arg Tyr Asp Ala Glu Glu Tyr Asp Trp Phe Thr Ser Tyr Thr Gln Gly
180 185 190

GTT GAA GGC TTT GTC AAA TAT AAA GAC ACC AGA TTC AGG GTG ATG TAT 681
Val Glu Gly Phe Val Lys Tyr Lys Asp Thr Arg Phe Arg Val Met Tyr
195 200 205 210
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20 25 30 

GAT GAT GAA GTT GAG AGT GAT GGG TTT TTT AGT AGA ATA TGG GAT AAA 200
Asp Asp Glu Val Glu Ser Asp Gly Phe Phe Ser Arg Ile Trp Asp Lys
35 40 45 50

TTC GTT GAA TAT TTC GGC TAT ACT CTA GTT ACT ATA TAATGTTTTT TCCTTA 252
Phe Val Glu Tyr Phe Gly Tyr Thr Leu Val Thr Ile
55 60 

TATAATTGGA CCAGTTATCG CTTTAATTTT TATATTT 289 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Phe Arg Asp Ile Val Asp Ile Leu Ile Ser Val Val Ile Ile Gly
1 5 10 15

Leu Val Leu Thr Ala Ile Arg Ala Thr Ile Met Ala Phe Lys Gly Asp
20 25 30

Thr Asp Asp Asp Glu Val Glu Ser Asp Gly Phe Phe Ser Arg Ile Trp
35 40 45

Asp Lys Phe Val Glu Tyr Phe Gly Tyr Thr Leu Val Thr Ile
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 52... 1491 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
GACACACATT AGTTATAGTT TCTAAGAGAG TTCTCCCCCT ATCTCTTAGA T ATG CCT 57 
Thr Arg Gly Tyr Lys Gly Leu Ser Asn Gln Gln Tyr Ala Ile Asn Gly
100 105 110

Phe Gly Phe Val Val Gly Tyr Lys His Phe Phe Lys Lys Ser Pro Gln
115 120 125

Phe Gly Met Arg Tyr Tyr Gly Phe Phe Asp Phe Ala Ser Ser Tyr Tyr
130 135 140

Lys Tyr Tyr Thr Tyr Asn Asp Tyr Gly Met Arg Asp Ala Arg Lys Gly
145 150 155 160

Ser Gln Ser Phe Met Phe Gly Tyr Gly Ala Gly Thr Asp Val Leu Phe
165 170 175

Asn Pro Ala Ile Phe Asn Arg Glu Asn Leu His Phe Gly Phe Phe Leu
180 185 190

Gly Val Ala Ile Gly Gly Thr Ser Trp Gly Pro Thr Asn Tyr Tyr Phe
195 200 205

Lys Asp Leu Ala Asp Glu Tyr Arg Gly Ser Phe His Pro Ser Asn Phe
210 215 220

Gln Val Leu Val Asn Gly Gly Ile Arg Leu Gly Thr Lys His Gln Gly
225 230 235 240

Phe Glu Ile Gly Leu Lys Ile Gln Thr Ile Arg Asn Asn Tyr Tyr Thr
245 250 255

Ala Ser Ala Asp Asn Val Pro Glu Gly Thr Thr Tyr Arg Phe Thr Phe
260 265 270

His Arg Pro Tyr Ala Phe Tyr Trp Arg Tyr Ile Val Ser Phe
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 236
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GGGATTTTAT TTCTTATAGC AGAAATTATT TTTAAAGTAA AAGACAAATC ATG TTT 56

Met Phe
1

AGA GAT ATA GTA GAT ATT TTA ATA TCT GTT GTT ATT ATT GGA TTA GTA 104
Arg Asp Ile Val Asp Ile Leu Ile Ser Val Val Ile Ile Gly Leu Val
5 10 15

TTA ACA GCT ATT AGA GCT ACT ATA ATG GCG TTT AAA GGC GAT ACT GAT 152
Leu Thr Ala Ile Arg Ala Thr Ile Met Ala Phe Lys Gly Asp Thr Asp
GCT ATT TTC AAT CGT GAG AAC TTG CAT TTT GGG TTT TTC TTG GGC GTT 632
Ala Ile Phe Asn Arg Glu Asn Leu His Phe Gly Phe Phe Leu Gly Val
180 185 190

GCG ATC GGT GGC ACC TCT TGG GGT CCA ACA AAC TAT TAT TTT AAG GAC 680
Ala Ile Gly Gly Thr Ser Trp Gly Pro Thr Asn Tyr Tyr Phe Lys Asp
195 200 205 210

TTG GCT GAT GAG TAT AGA GGG AGT TTC CAC CCA TCA AAT TTC CAG GTC 728
Leu Ala Asp Glu Tyr Arg Gly Ser Phe His Pro Ser Asn Phe Gln Val
215 220 225

TTA GTT AAT GGT GGG ATT CGC TTA GGC ACT AAA CAC CAA GGT TTT GAA 776
Leu Val Asn Gly Gly Ile Arg Leu Gly Thr Lys His Gln Gly Phe Glu
230 235 240

ATT GGC TTG AAA ATC CAA ACC ATC CGC AAC AAT TAC TAC ACC GCT AGT 824
Ile Gly Leu Lys Ile Gln Thr Ile Arg Asn Asn Tyr Tyr Thr Ala Ser
245 250 255

GCG GAT AAT GTG CCT GAA GGG ACT ACT TAT AGA TTC ACT TTC CAC CGC 872
Ala Asp Asn Val Pro Glu Gly Thr Thr Tyr Arg Phe Thr Phe His Arg
260 265 270

CCC TAT GCC TTT TAT TGG CGT TAC ATT GTA AGC TTT TAAGGTGTTT TAGGGC 924
Pro Tyr Ala Phe Tyr Trp Arg Tyr Ile Val Ser Phe
275 280 285

TAATCTTATG GGGGCATAGA AAAGGGCTTT TGCTCTT 961

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Lys Lys Ser Val Ile Val Gly Ala Ile Ser Leu Ala Met Thr Ser
1 5 10 15

Leu Leu Ser Ala Glu Thr Pro Lys Gln Glu Lys Ala Ile Lys Thr Ser
20 25 30

Pro Thr Lys Lys Gly Glu Arg Asn Ala Ala Phe Ile Gly Ile Asp Tyr
35 40 45

Gln Leu Gly Met Leu Ser Thr Thr Ala Gln Asn Cys Ser His Gly Asn
50 55 60

Cys Asn Gly Asn Gln Ser Gly Ala Tyr Gly Ser Asn Thr Pro Asn Met
65 70 75 80

Pro Thr Ala Ser Asn Pro Thr Gly Gly Phe Thr His Gly Ala Leu Gly 
85 90 95 
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(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GAATGTAGCA TTTAGAACTC AAGTAGAGAA AATGTAGAAG GAAGGAATAC ATG AAG 56 

Met Lys 
1 

AAA TCT GTT ATA GTA GGT GCT ATC TCT CTA GCA ATG ACA AGC TTA TTG 104
Lys Ser Val Ile Val Gly Ala Ile Ser Leu Ala Met Thr Ser Leu Leu
5 10 15

TCA GCA GAG ACC CCT AAG CAA GAA AAA GCT ATT AAG ACT AGC CCT ACC 152
Ser Ala Glu Thr Pro Lys Gln Glu Lys Ala Ile Lys Thr Ser Pro Thr
20 25 30

AAA AAA GGT GAA AGA AAT GCT GCT TTT ATA GGG ATT GAT TAC CAG TTG 200
Lys Lys Gly Glu Arg Asn Ala Ala Phe Ile Gly Ile Asp Tyr Gln Leu
35 40 45 50

GGT ATG CTC AGC ACT ACC GCT CAA AAT TGT TCC CAT GGG AAT TGC AAT 248
Gly Met Leu Ser Thr Thr Ala Gln Asn Cys Ser His Gly Asn Cys Asn
55 60 65

GGT AAT CAA AGT GGG GCT TAC GGC TCT AAT ACG CCT AAC ATG CCT ACA 296
Gly Asn Gln Ser Gly Ala Tyr Gly Ser Asn Thr Pro Asn Met Pro Thr
70 75 80

GCG TCA AAC CCA ACA GGA GGG TTT ACT CAT GGC GCT CTA GGG ACT CGT 344
Ala Ser Asn Pro Thr Gly Gly Phe Thr His Gly Ala Leu Gly Thr Arg
85 90 95

GGG TAT AAA GGC TTA AGC AAC CAA CAA TAC GCT ATC AAT GGT TTT GGT 392
Gly Tyr Lys Gly Leu Ser Asn Gln Gln Tyr Ala Ile Asn Gly Phe Gly
100 105 110

TTT GTT GTA GGG TAT AAG CAT TTT TTC AAG AAA TCC CCG CAA TTT GGA 440
Phe Val Val Gly Tyr Lys His Phe Phe Lys Lys Ser Pro Gln Phe Gly
115 120 125 130

ATG CGT TAT TAC GGA TTC TTT GAT TTT GCA AGC TCT TAT TAT AAG TAT 488
Met Arg Tyr Tyr Gly Phe Phe Asp Phe Ala Ser Ser Tyr Tyr Lys Tyr
135 140 145

TAC ACT TAT AAT GAT TAT GGC ATG AGA GAC GCT CGC AAG GGT TCT CAA 536
Tyr Thr Tyr Asn Asp Tyr Gly Met Arg Asp Ala Arg Lys Gly Ser Gln
150 155 160

AGT TTC ATG TTT GGC TAT GGG GCT GGC ACA GAT GTG TTG TTT AAC CCG 584
Ser Phe Met Phe Gly Tyr Gly Ala Gly Thr Asp Val Leu Phe Asn Pro
165 170 175
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35 40 45 50 

GAC ACT TCA ATG TTT GGC AGC AAC AAA ACA GCT CCA GCC CAA GAA ACG 248
Asp Thr Ser Met Phe Gly Ser Asn Lys Thr Ala Pro Ala Gln Glu Thr
55 60 65

CAA AGC GCT TCC AAA CCG GAC ACT AAA GTC AAT CCA AGC GCA AGT TGG 296
Gln Ser Ala Ser Lys Pro Asp Thr Lys Val Asn Pro Ser Ala Ser Trp
70 75 80

ATG AAA AAA TAAGAAGGAA GTTATGAAAA AGTCATTCAA AAAATTAGGC TTTGTCTCT 354
Met Lys Lys
85

TTA 357
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Lys Lys Val Phe Leu Gly Met Ala Leu Ala Phe Ser Val Ser Met
1 5 10 15

Ala Glu Lys Ser Gly Ala Phe Leu Gly Gly Gly Phe Gln Tyr Ser Asn
20 25 30

Leu Glu Asn Gln Asn Thr Thr Arg Thr Pro Gly Ala Asn Asn Asn Thr
35 40 45

Pro Ile Asp Thr Ser Met Phe Gly Ser Asn Lys Thr Ala Pro Ala Gln
50 55 60

Glu Thr Gin Ser Ala Ser Lys Pro Asp Thr Lys Val Asn Pro Ser Ala 
65 70 75 80 

Ser Trp Met Lys Lys 
85 

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 961 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51... 908
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Met Arg Lys Val Leu Tyr Ala Leu Val Gly Phe Leu Leu Ala Phe Ser
1 5 10 15

Ala Leu Lys Ala Asp Asp Phe Leu Glu Glu Ala Asn Glu Thr Ala Pro
20 25 30

Ala His Leu Asn His Pro Met Gln Asp Leu Asn Ala Ile Gln Gly Ser
35 40 45

Phe Phe Asp Lys Asn Arg Ser Lys Met Ser Asn Thr Leu Asn Ile Asp
50 55 60

Tyr Phe Gln Gly Gln Thr Tyr Lys Ile Pro Leu Ala Leu Cys Asp Gly
65 70 75 80

Xaa Leu Ile Val Phe Phe Lys Thr His
85



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 357 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 305 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATGCAATAAA AAAAGAAATT CTTAGGATTT CTCACATTAA GGAGTTTTAA ATG AAA 56 

Met Lys 
1 

AAG GTT TTT TTA GGT ATG GCA TTA GCC TTT AGT GTG TCC ATG GCA GAA 104 
Lys Val Phe Leu Gly Met Ala Leu Ala Phe Ser Val Ser Met Ala Glu 
5 10 15 

AAA AGT GGC GCG TTT TTA GGA GGG GGG TTT CAA TAT TCT AAT TTA GAA 152 
Lys Ser Gly Ala Phe Leu Gly Gly Gly Phe Gln Tyr Ser Asn Leu Glu
20 25 30 

AAC CAA AAC ACC ACC CGC ACC CCA GGC GCT AAC AAT AAC ACC CCG ATA 200 
Asn Gln Asn Thr Thr Arg Thr Pro Gly Ala Asn Asn Asn Thr Pro Ile
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 317 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TTTAAAATCA CCCGTTACAG CATCACTGAA ATCACTAATA GGGGTGATTG ATG CGT 56 

Met Arg 
1 

AAG GTT TTA TAC GCT CTT GTG GGC TTT TTG TTG GCT TTT AGC GCT TTA 104 
Lys Val Leu Tyr Ala Leu Val Gly Phe Leu Leu Ala Phe Ser Ala Leu 
5 10 15 

AAA GCC GAT GAT TTT TTA GAA GAA GCG AAC GAA ACA GCC CCG GCG CAT 152 
Lys Ala Asp Asp Phe Leu Glu Glu Ala Asn Glu Thr Ala Pro Ala His 
20 25 30 

TTA AAC CAC CCT ATG CAG GAT TTA AAC GCC ATT CAA GGG AGC TTT TTT 200
Leu Asn His Pro Met Gln Asp Leu Asn Ala Ile Gln Gly Ser Phe Phe
35 40 45 50

GAC AAA AAC CGC TCA AAA ATG TCC AAC ACT TTG AAC ATT GAT TAC TTT 248
Asp Lys Asn Arg Ser Lys Met Ser Asn Thr Leu Asn Ile Asp Tyr Phe
55 60 65

CAA GGG CAA ACT TAT AAA ATC CCG CTT GCG TTA TGC GAT GGC GMC CTT 296
Gln Gly Gln Thr Tyr Lys Ile Pro Leu Ala Leu Cys Asp Gly Xaa Leu
70 75 80

ATT GTT TTT TTC AAA ACC CAT TAGCGATTTT GTTTTAGGGG ATAAGGTGGG TTTT 351
Ile Val Phe Phe Lys Thr His
85

GATGCGAAAA TTTTAGAAA 370

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids

(B) TYPE: amino acid 






WO 98/43478



PCT/US98/06371 



435 440 445

Gly Asn Ala Asp Ala Glu Gly Thr Leu Ala Arg Arg Ile Phe Leu Ile
450 455 460

Asn Ser Gly Val Asn Phe Lys Val Thr His Pro Ile Ser Glu Asp Tyr
465 470 475 480

Gly Asn Val Phe Glu Tyr Gly Met Ile Tyr Gln Asn Leu Ser Val Phe
485 490 495

Ser Gly Leu Asp Lys Gly Lys Asn Gly Tyr Tyr Lys Asn Asn Ile Asp
500 505 510

Pro Asn Asp Pro Asn Gly Pro Gly Leu Pro Tyr Arg His Tyr Tyr Thr
515 520 525

Asp Gln Ser Ser Gln Tyr Pro Gln Asn Leu Asn Thr Pro Asn Pro Leu
530 535 540

Tyr Arg Asn Met Pro Gln Asn Ser His Ala Ile Gly Asn Ile Ile Gly
545 550 555 560

Gly Phe Met Gln Ala Asn Tyr Asn Ile Leu Ser Asn Val Ile Val Gly
565 570 575

Ala Gly Thr Arg Tyr Asp Ile Tyr Thr Leu Leu Asp Lys Asn Gly Arg
580 585 590

Thr His Val Thr Ser Gly Phe Ser Pro Ser Ala Thr Val Leu Tyr Asn
595 600 605

Pro Ile Glu Ser Ile Gly Leu Lys Val Ser Tyr Ala Tyr Val Thr Lys
610 615 620

Gly Ala Leu Pro Gly Asp Gly Val Leu Met Arg Asp Pro Thr Val Ile
625 630 635 640

Tyr Gln Arg Asn Leu Arg Pro Ala Ile Gly Gln Asn Val Glu Phe Asn
645 650 655

Val Asp Phe Asn Ser Lys Tyr Phe Asn Val Arg Gly Ala Ala Phe Tyr
660 665 670

Gln Val Ile Asn Asn Phe Ile Asn Ser Tyr Gly Gln Asp Thr Ser Lys
675 680 685

Asn Gly Gly Gly Asn Ala Thr Ala Lys Asn Met Ser Gly Asn Leu Pro
690 695 700

Glu Thr Ile Asn Ile Tyr Gly Tyr Glu Val Ser Gly Asn Val Arg Tyr
705 710 715 720

Lys Asn Phe Leu Gly Thr Phe Ser Val Ala Arg Ser Trp Pro Thr Ala
725 730 735

Arg Gly His Leu Leu Ala Asp Thr Tyr Ala Leu Ala Ala Thr Thr Gly
740 745 750

Asn Val Phe Ile Leu Lys Ala Asp Tyr Asp Val Arg Arg Trp Gly Leu
755 760 765

Thr Leu Thr Trp Leu Ser Arg Phe Val Thr Asn Met Tyr Tyr Glu Gly
770 775 780

Tyr Ser Ile Tyr Tyr Pro Gln Tyr Gly Leu Ile Lys Ile His Lys Pro
785 790 795 800

Gly Tyr Gly Val His Asn Val Phe Ile Asn Trp Thr Pro Pro Ser Lys
805 810 815

Lys Trp Gln Gly Leu Arg Ile Ser Ala Val Phe Asn Asn Ile Leu Asn
820 825 830

Lys Gln Tyr Val Asp Gln Thr Ser Val Phe Gln Ala Ser Ala Asp Ala
835 840 845

Pro Ala Ser Asp Met Ile Pro Lys Gly Lys Arg Met Ala Leu Pro Ala
850 855 860

Pro Gly Phe Asn Ala Arg Phe Glu Val Ser Tyr Gln Phe
865 870 875
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Met Leu Arg Asn Gln Phe Arg Ile Val Phe Val Ser Cys Ile Val Ala
1 5 10 15

Ser Asn Leu Gln Ala Gln Glu Thr Thr His Thr Leu Gly Lys Val Thr
20 25 30

Thr Lys Gly Glu Arg Thr Phe Glu Tyr Asn Asn Lys Met Tyr Ile Asp
35 40 45

Arg Lys Glu Leu Gln Gln Arg Gln Ser Asn Gln Ile Arg Asp Ile Phe
50 55 60

Arg Thr Arg Ala Asp Val Asn Val Ala Ser Gly Gly Leu Met Ala Gln
65 70 75 80

Lys Ile Tyr Val Arg Gly Ile Glu Ser Arg Leu Leu Arg Val Thr Ile
85 90 95

Asp Gly Val Ala Gln Asn Gly Asn Ile Phe His His Asp Ala Asn Thr
100 105 110

Val Ile Asp Pro Asn Met Ile Lys Glu Val Glu Val Ile Lys Gly Ala
115 120 125

Ala Asn Ala Ser Ala Gly Pro Gly Ala Val Ala Gly Lys Leu Ser Phe
130 135 140

Thr Thr Ile Asp Ala Asn Asp Phe Leu Arg Lys Asn Gln Thr Tyr Gly
145 150 155 160

Ala Lys Ala Glu Ala Ala Phe Tyr Thr Asn Phe Gly Tyr Arg Met Asn
165 170 175

Ala Thr Ala Ala Tyr Arg Gly Lys Asn Trp Asp Ile Leu Ala Tyr Tyr
180 185 190

Asn His Gln Asn Ile Phe Tyr Tyr Arg Asp Gly Asn Asn Ala Phe Arg
195 200 205

Asn Val Phe His Pro Asn Tyr Asp Leu Gln Asp Pro Ser Asn Ser Asp
210 215 220

Met Ser Val Gly Thr Pro Ser Glu Val Asn Ser Val Leu Ala Lys Ile
225 230 235 240

Asn Gly Tyr Ile Asn Glu Thr Asp Ser Ile Ser Val Ser Tyr Asn Leu
245 250 255

Thr Arg Asp Asn Ser Thr Arg Leu Leu Arg Pro Asn Thr Thr Ser Ala
260 265 270

Leu Ser Lys Ala Asn Asp Pro Gly Ser Gln Pro Ala Pro Phe Val Ile
275 280 285

Asp Phe Gly Lys Glu Leu Ala His Thr Ile Asn Phe Asn His Asn Leu
290 295 300

Ser Leu Lys Tyr Lys His Glu Gly Gly Pro Asn Phe Asn Gln Pro Arg
305 310 315 320

Val Glu Ser Thr Ala Phe Leu Gly Val Arg Gly Gly Asn Tyr Asn Pro
325 330 335

Val Val Asn Pro Phe Ala Tyr Asn Ser Asn Glu Pro Ala Asn Pro Asp
340 345 350

Tyr Ile Pro Glu Val Lys Glu Trp Cys Asn Asn Pro Asp Asn Ile Ser
355 360 365

Gln Cys Thr Gln Gly Ala Ile Arg Pro Ser Asn Gly Gly Tyr Gln Ile
370 375 380

Gly Tyr Gly Thr Pro Asn Ser Ile Asn Trp Gln Gly Thr Ser Asp Ser
385 390 395 400

Ser Gly Gly Ala Gln Ala Gly Tyr Gly Gln Leu Asn Ala Ile Ser Thr
405 410 415

Ser Ala Asn Val Tyr His Gly Leu Val Pro Lys Asn Pro Asp Tyr Asp
420 425 430

Met Thr Pro Pro Asn Ala Gln Asn Pro Ser Ala Asn Asp Trp Thr Leu
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AGG TAT AAG AAT TTC TTA GGG ACT TTC TCA GTG GCT CGC TCT TGG CCA 2270
Arg Tyr Lys Asn Phe Leu Gly Thr Phe Ser Val Ala Arg Ser Trp Pro
720 725 730

ACG GCT AGG GGG CAT TTA TTA GCG GAC ACT TAC GCT CTA GCT GCA ACG 2318
Thr Ala Arg Gly His Leu Leu Ala Asp Thr Tyr Ala Leu Ala Ala Thr
735 740 745 750

ACT GGG AAT GTG TTT ATT TTA AAA GCC GAT TAT GAT GTT CGC AGG TGG 2366
Thr Gly Asn Val Phe Ile Leu Lys Ala Asp Tyr Asp Val Arg Arg Trp
755 760 765

GGG CTT ACT TTA ACC TGG CTC TCG CGC TTT GTA ACT AAC ATG TAT TAT 2414
Gly Leu Thr Leu Thr Trp Leu Ser Arg Phe Val Thr Asn Met Tyr Tyr
770 775 780

GAG GGC TAT TCT ATC TAT TAC CCG CAA TAC GGC TTG ATC AAA ATC CAT 2462
Glu Gly Tyr Ser Ile Tyr Tyr Pro Gln Tyr Gly Leu Ile Lys Ile His
785 790 795

AAA CCC GGG TAT GGC GTG CAT AAT GTC TTT ATC AAC TGG ACT CCG CCT 2510
Lys Pro Gly Tyr Gly Val His Asn Val Phe Ile Asn Trp Thr Pro Pro
800 805 810

TCT AAA AAA TGG CAG GGT TTA AGG ATT TCA GCC GTG TTT AAT AAT ATC 2558
Ser Lys Lys Trp Gln Gly Leu Arg Ile Ser Ala Val Phe Asn Asn Ile
815 820 825 830

TTA AAC AAG CAA TAT GTG GAT CAA ACT TCT GTG TTT CAA GCG AGC GCG 2606
Leu Asn Lys Gln Tyr Val Asp Gln Thr Ser Val Phe Gln Ala Ser Ala
835 840 845

GAC GCT CCA GCG AGC GAT ATG ATC CCT AAA GGT AAG CGC ATG GCG CTC 2654
Asp Ala Pro Ala Ser Asp Met Ile Pro Lys Gly Lys Arg Met Ala Leu
850 855 860

CCG GCT CCT GGA TTT AAC GCG CGT TTT GAG GTA TCC TAT CAG TTC TAAAA 2704
Pro Ala Pro Gly Phe Asn Ala Arg Phe Glu Val Ser Tyr Gln Phe
865 870 875

TGAAAGGAAT CTTAGGATTT CTTTTTGAAT TTTGAACATG GAAACA 2750

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 877 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
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GTT TTC TCT GGA TTG GAT AAA GGC AAA AAC GGC TAT TAT AAA AAC AAC 1598
Val Phe Ser Gly Leu Asp Lys Gly Lys Asn Gly Tyr Tyr Lys Asn Asn
495 500 505 510

ATT GAT CCT AAC GAC CCT AAC GGG CCG GGC TTG CCT TAC CGC CAT TAC 1646
Ile Asp Pro Asn Asp Pro Asn Gly Pro Gly Leu Pro Tyr Arg His Tyr
515 520 525

TAC ACC GAT CAA AGC TCC CAA TAC CCC CAA AAT CTC AAC ACC CCT AAC 1694
Tyr Thr Asp Gln Ser Ser Gln Tyr Pro Gln Asn Leu Asn Thr Pro Asn
530 535 540

CCG CTC TAT CGT AAC ATG CCC CAA AAT TCG CAT GCG ATC GGC AAT ATC 1742
Pro Leu Tyr Arg Asn Met Pro Gln Asn Ser His Ala Ile Gly Asn Ile
545 550 555

ATC GGA GGG TTT ATG CAA GCA AAC TAC AAC ATT TTA AGC AAT GTG ATC 1790
Ile Gly Gly Phe Met Gln Ala Asn Tyr Asn Ile Leu Ser Asn Val Ile
560 565 570

GTG GGT GCG GGA ACT CGT TAT GAT ATT TAC ACC TTG CTA GAC AAA AAC 1838
Val Gly Ala Gly Thr Arg Tyr Asp Ile Tyr Thr Leu Leu Asp Lys Asn
575 580 585 590

GGC CGC ACG CAT GTA ACT TCT GGT TTC TCG CCT TCT GCA ACC GTG CTT 1886
Gly Arg Thr His Val Thr Ser Gly Phe Ser Pro Ser Ala Thr Val Leu
595 600 605

TAT AAC CCC ATT GAA AGC ATT GGC TTG AAA GTG AGT TAT GCG TAT GTA 1934
Tyr Asn Pro Ile Glu Ser Ile Gly Leu Lys Val Ser Tyr Ala Tyr Val
610 615 620

ACT AAG GGG GCT TTG CCT GGC GAT GGC GTT TTG ATG CGC GAT CCT ACG 1982
Thr Lys Gly Ala Leu Pro Gly Asp Gly Val Leu Met Arg Asp Pro Thr
625 630 635

GTG ATT TAT CAA AGG AAT TTG CGC CCT GCG ATC GGT CAA AAT GTG GAA 2030
Val Ile Tyr Gln Arg Asn Leu Arg Pro Ala Ile Gly Gln Asn Val Glu
640 645 650

TTT AAT GTG GAT TTC AAC AGC AAG TAT TTC AAT GTG CGC GGG GCA GCG 2078
Phe Asn Val Asp Phe Asn Ser Lys Tyr Phe Asn Val Arg Gly Ala Ala
655 660 665 670

TTC TAT CAA GTC ATC AAT AAT TTC ATC AAC AGC TAC GGG CAA GAC ACT 2126
Phe Tyr Gln Val Ile Asn Asn Phe Ile Asn Ser Tyr Gly Gln Asp Thr
675 680 685

TCT AAA AAT GGA GGG GGT AAC GCA ACC GCA AAA AAC ATG TCA GGG AAT 2174
Ser Lys Asn Gly Gly Gly Asn Ala Thr Ala Lys Asn Met Ser Gly Asn
690 695 700

TTA CCC GAA ACC ATT AAC ATT TAT GGT TAT GAA GTT TCA GGG AAT GTG 2222
Leu Pro Glu Thr Ile Asn Ile Tyr Gly Tyr Glu Val Ser Gly Asn Val
705 710 715
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TCA GCC CTC TCT AAA GCC AAT GAC CCA GGA AGC CAG CCA GCC CCC TTT 926
Ser Ala Leu Ser Lys Ala Asn Asp Pro Gly Ser Gln Pro Ala Pro Phe
275 280 285

GTG ATT GAC TTT GGG AAA GAA TTA GCC CAT ACG ATC AAC TTC AAC CAC 974
Val Ile Asp Phe Gly Lys Glu Leu Ala His Thr Ile Asn Phe Asn His
290 295 300

AAT TTG AGC TTG AAA TAC AAG CAT GAA GGC GGC CCT AAT TTT AAC CAG 1022
Asn Leu Ser Leu Lys Tyr Lys His Glu Gly Gly Pro Asn Phe Asn Gln
305 310 315

CCG CGC GTT GAA TCC ACC GCC TTT TTA GGG GTA AGG GGG GGC AAT TAT 1070
Pro Arg Val Glu Ser Thr Ala Phe Leu Gly Val Arg Gly Gly Asn Tyr
320 325 330

AAC CCT GTG GTG AAT CCT TTC GCT TAC AAT TCT AAC GAG CCG GCT AAC 1118
Asn Pro Val Val Asn Pro Phe Ala Tyr Asn Ser Asn Glu Pro Ala Asn
335 340 345 350

CCA GAT TAT ATC CCT GAA GTG AAA GAG TGG TGT AAT AAC CCA GAT AAT 1166
Pro Asp Tyr Ile Pro Glu Val Lys Glu Trp Cys Asn Asn Pro Asp Asn
355 360 365

ATC AGC CAG TGC ACG CAA GGG GCT ATC AGG CCT TCT AAT GGA GGC TAT 1214
Ile Ser Gln Cys Thr Gln Gly Ala Ile Arg Pro Ser Asn Gly Gly Tyr
370 375 380

CAA ATA GGC TAT GGC ACG CCT AAT TCT ATT AAT TGG CAA GGG ACT AGC 1262
Gln Ile Gly Tyr Gly Thr Pro Asn Ser Ile Asn Trp Gln Gly Thr Ser
385 390 395

GAT TCT AGT GGA GGG GCG CAA GCA GGG TAT GGG CAG CTT AAC GCT ATT 1310
Asp Ser Ser Gly Gly Ala Gln Ala Gly Tyr Gly Gln Leu Asn Ala Ile
400 405 410

TCT ACA AGC GCG AAC GTT TAT CAT GGG CTT GTC CCT AAA AAT CCT GAT 1358
Ser Thr Ser Ala Asn Val Tyr His Gly Leu Val Pro Lys Asn Pro Asp
415 420 425 430

TAT GAC ATG ACC CCC CCT AAC GCT CAA AAC CCT AGC GCA AAC GAT TGG 1406
Tyr Asp Met Thr Pro Pro Asn Ala Gln Asn Pro Ser Ala Asn Asp Trp
435 440 445

ACT TTA GGG AAT GCG GAC GCT GAG GGG ACT TTA GCC AGA AGG ATT TTT 1454
Thr Leu Gly Asn Ala Asp Ala Glu Gly Thr Leu Ala Arg Arg Ile Phe
450 455 460

TTA ATC AAC TCG GGC GTT AAT TTT AAA GTA ACC CAC CCC ATT AGC GAA 1502
Leu Ile Asn Ser Gly Val Asn Phe Lys Val Thr His Pro Ile Ser Glu
465 470 475

GAT TAT GGG AAT GTG TTT GAA TAC GGC ATG ATT TAT CAA AAC CTG AGC 1550
Asp Tyr Gly Asn Val Phe Glu Tyr Gly Met Ile Tyr Gln Asn Leu Ser
480 485 490
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ATT GAC AGA AAA GAG CTC CAA CAA CGC CAA AGT AAC CAA ATC CGT GAT 254
Ile Asp Arg Lys Glu Leu Gln Gln Arg Gln Ser Asn Gln Ile Arg Asp
50 55 60

ATT TTT AGG ACT AGA GCG GAT GTG AAT GTG GCC AGT GGG GGC TTG ATG 302
Ile Phe Arg Thr Arg Ala Asp Val Asn Val Ala Ser Gly Gly Leu Met
65 70 75

GCG CAA AAG ATC TAT GTT AGG GGG ATT GAG AGC CGT CTC TTA AGG GTA 350
Ala Gln Lys Ile Tyr Val Arg Gly Ile Glu Ser Arg Leu Leu Arg Val
80 85 90

ACA ATA GAT GGC GTC GCC CAA AAT GGT AAC ATT TTC CAC CAT GAC GCT 398
Thr Ile Asp Gly Val Ala Gln Asn Gly Asn Ile Phe His His Asp Ala
95 100 105 110

AAC ACC GTG ATC GAT CCT AAC ATG ATT AAA GAA GTG GAA GTG ATC AAG 446
Asn Thr Val Ile Asp Pro Asn Met Ile Lys Glu Val Glu Val Ile Lys
115 120 125

GGG GCG GCG AAC GCT TCA GCA GGC CCA GGT GCG GTG GCG GGT AAA TTG 494
Gly Ala Ala Asn Ala Ser Ala Gly Pro Gly Ala Val Ala Gly Lys Leu
130 135 140

TCT TTC ACC ACG ATT GAC GCT AAC GAC TTC TTA AGA AAG AAT CAA ACT 542
Ser Phe Thr Thr Ile Asp Ala Asn Asp Phe Leu Arg Lys Asn Gln Thr
145 150 155

TAT GGC GCT AAA GCG GAA GCG GCC TTT TAT ACC AAC TTC GGG TAT CGC 590
Tyr Gly Ala Lys Ala Glu Ala Ala Phe Tyr Thr Asn Phe Gly Tyr Arg
160 165 170

ATG AAC GCC ACT GCG GCT TAC CGG GGG AAA AAC TGG GAC ATC CTA GCC 638
Met Asn Ala Thr Ala Ala Tyr Arg Gly Lys Asn Trp Asp Ile Leu Ala
175 180 185 190

TAT TAC AAC CAT CAA AAT ATT TTT TAC TAC AGA GAC GGG AAC AAC GCT 686
Tyr Tyr Asn His Gln Asn Ile Phe Tyr Tyr Arg Asp Gly Asn Asn Ala
195 200 205

TTT AGG AAT GTC TTC CAC CCT AAC TAC GAT TTA CAA GAT CCA AGC AAT 734
Phe Arg Asn Val Phe His Pro Asn Tyr Asp Leu Gln Asp Pro Ser Asn
210 215 220

AGC GAT ATG AGC GTA GGG ACT CCC AGT GAA GTC AAT AGT GTT TTA GCT 782
Ser Asp Met Ser Val Gly Thr Pro Ser Glu Val Asn Ser Val Leu Ala
225 230 235

AAA ATT AAT GGC TAT ATC AAC GAA ACA GAC AGC ATT AGC GTG AGC TAC 830
Lys Ile Asn Gly Tyr Ile Asn Glu Thr Asp Ser Ile Ser Val Ser Tyr
240 245 250

AAC CTC ACA CGA GAC AAT TCT ACA AGG CTT TTA CGC CCT AAC ACC ACT 878
Asn Leu Thr Arg Asp Asn Ser Thr Arg Leu Leu Arg Pro Asn Thr Thr
255 260 265 270
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85 90 95

Pro Val Gly Asp Ser Gly Phe Ile Tyr Asn Ser Phe Ser Phe Gly Gly
100 105 110

Asn Thr Leu Thr Glu Arg Asp Ser Tyr Gln Gly Gln Tyr Tyr Val Asn
115 120 125

Leu Phe Thr Tyr Gly Val Gly Leu Asp Thr Leu Trp Asn Phe Val Asn
130 135 140

Lys Glu Asn Met Val Phe Gly Phe Val Val Gly Ile Gln Leu Ala Gly
145 150 155 160

Asp Ser Trp Ala Thr Ser Ile Ser Lys Glu Ile Ala His Tyr Ala Lys
165 170 175

His His Ser Asn Ser Ser Tyr Ser Pro Ala Asn Phe Gln Phe Leu Trp
180 185 190

Lys Phe Gly Val Arg Thr His Ile Ala Lys His Asn Ser Leu Glu Leu
195 200 205

Gly Ile Lys Val Pro Thr Ile Thr His Gln Leu Phe Ser Leu Thr Asn
210 215 220

Glu Lys Gly Tyr Thr Leu Gln Ala Asp Val Arg Arg Val Tyr Ala Phe
225 230 235 240

Gln Ile Ser Tyr Leu Arg Asp Phe
245

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2750 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 69... 2699 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TAATAATAAC CTTAGGTTTA AAACTTGACT AAATTTTTAG AAAAAAGTAA ATAAAAAGGC 60 
TAAAAGAA ATG CTT AGA AAT CAA TTT CGT ATC GTG TTT GTC TCT TGT ATT 110 
Met Leu Arg Asn Gln Phe Arg Ile Val Phe Val Ser Cys Ile
1 5 10

GTC GCT AGC AAT TTG CAA GCT CAA GAA ACA ACC CAC ACT TTG GGT AAG 158
Val Ala Ser Asn Leu Gln Ala Gln Glu Thr Thr His Thr Leu Gly Lys
15 20 25 30

GTA ACC ACT AAG GGT GAA AGG ACT TTT GAA TAC AAC AAT AAA ATG TAT 206
Val Thr Thr Lys Gly Glu Arg Thr Phe Glu Tyr Asn Asn Lys Met Tyr
35 40 45
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AAA GAA AAC ATG GTT TTT GGT TTT GTG GTG GGG ATC CAA TTA GCG GGG 538
Lys Glu Asn Met Val Phe Gly Phe Val Val Gly Ile Gln Leu Ala Gly
145 150 155 160

GAT AGT TGG GCA ACG AGC ATC AGT AAA GAA ATC GCT CAT TAT GCA AAA 586
Asp Ser Trp Ala Thr Ser Ile Ser Lys Glu Ile Ala His Tyr Ala Lys
165 170 175

CAC CAC AGC AAT TCC AGT TAT AGC CCG GCC AAT TTC CAG TTT TTA TGG 634
His His Ser Asn Ser Ser Tyr Ser Pro Ala Asn Phe Gln Phe Leu Trp
180 185 190

AAG TTT GGG GTC CGC ACC CAT ATC GCT APJK CAC AAT AGC CTA GAA TTA 682
Lys Phe Gly Val Arg Thr His Ile Ala Lys His Asn Ser Leu Glu Leu
195 200 205

GGG ATT AAA GTG CCT ACG ATC ACA CAC CAG CTT TTC TCT CTT ACC AAC 730
Gly Ile Lys Val Pro Thr Ile Thr His Gln Leu Phe Ser Leu Thr Asn
210 215 220

GAA AAG GGA TAC ACC TTA CAG GCT GAT GTG CGT AGA GTT TAT GCG TTT 778
Glu Lys Gly Tyr Thr Leu Gln Ala Asp Val Arg Arg Val Tyr Ala Phe
225 230 235 240

CAA ATC AGT TAC TTG AGG GAT TTT TAACCCCTTT TTAGATACAA TCACGCCTGA AA 834
Gln Ile Ser Tyr Leu Arg Asp Phe
245

CTATCCATTT AAAGGTGTGA AA 856
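
Each nucleotide entry in this listing gives a coding-sequence LOCATION, and the paired protein entry gives a LENGTH in amino acids. As a reader's aid only, the sketch below checks that the two agree, assuming the LOCATION range is 1-based, inclusive, and excludes the stop codon. The function name `coding_region_codons` and the `entries` table are illustrative and are not part of the listing.

```python
import re


def coding_region_codons(location: str) -> int:
    """Return the number of codons spanned by a 'start... end' LOCATION field.

    Assumes a 1-based, inclusive range that does not include the stop codon.
    """
    start, end = (int(n) for n in re.findall(r"\d+", location))
    length = end - start + 1  # inclusive range, so add one
    if length % 3 != 0:
        raise ValueError(f"{location!r} spans {length} bases, not a multiple of 3")
    return length // 3


# (nucleotide SEQ ID NO, LOCATION as printed, LENGTH of the paired protein entry)
entries = [
    (23, "51... 401", 117),
    (25, "59... 802", 248),
    (27, "69... 2699", 877),
    (29, "51... 317", 89),
    (31, "51... 305", 85),
]

for seq_id, location, protein_length in entries:
    codons = coding_region_codons(location)
    status = "consistent" if codons == protein_length else "MISMATCH"
    print(f"SEQ ID NO: {seq_id}: {codons} codons vs {protein_length} residues ({status})")
```

A mismatch reported by such a check would point to a transcription error in either the LOCATION field or the LENGTH field rather than to a property of the sequence itself.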



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Met Leu Lys Arg Met Ile Leu Leu Gly Ala Leu Gly Val Leu Ala Ser
1 5 10 15

Ala Glu Glu Ser Ala Ala Phe Val Gly Val Asn Tyr Gln Val Ser Met
20 25 30

Ile Gln Asn Gln Thr Lys Met Val Asn Asp Asn Gly Leu Gln Lys Pro
35 40 45

Leu Ile Lys Phe Pro Pro Tyr Ala Gly Ala Gly Phe Glu Val Gly Tyr
50 55 60

Lys Gln Phe Phe Gly Lys Lys Lys Trp Phe Gly Met Arg Tyr Tyr Gly
65 70 75 80

Phe Phe Asp Tyr Ala His Asn Arg Phe Gly Val Met Lys Lys Gly Ile
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(A) LENGTH: 856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 59... 802 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

GCCATTTTAA GCTAATATAA TATAGAGCGA TTATCAAAAA ATAAAGGGAA AAGACTGA 58
ATG TTG AAA AGA ATG ATA TTA TTA GGG GCT TTG GGT GTT TTA GCG AGC 106
Met Leu Lys Arg Met Ile Leu Leu Gly Ala Leu Gly Val Leu Ala Ser
1 5 10 15

GCT GAA GAG AGT GCG GCT TTT GTG GGA GTC AAT TAC CAG GTG AGC ATG 154
Ala Glu Glu Ser Ala Ala Phe Val Gly Val Asn Tyr Gln Val Ser Met
20 25 30

ATA CAA AAT CAG ACT AAA ATG GTG AAT GAC AAC GGC TTG CAA AAG CCT 202
Ile Gln Asn Gln Thr Lys Met Val Asn Asp Asn Gly Leu Gln Lys Pro
35 40 45

TTG ATA AAG TTT CCG CCT TAC GCA GGA GCG GGT TTT GAA GTG GGC TAT 250
Leu Ile Lys Phe Pro Pro Tyr Ala Gly Ala Gly Phe Glu Val Gly Tyr
50 55 60

AAG CAA TTT TTT GGT AAG AAA AAA TGG TTT GGC ATG CGT TAT TAT GGG 298
Lys Gln Phe Phe Gly Lys Lys Lys Trp Phe Gly Met Arg Tyr Tyr Gly
65 70 75 80

TTT TTT GAC TAC GCG CAC AAC CGC TTT GGC GTG ATG AAA AAG GGC ATT 346
Phe Phe Asp Tyr Ala His Asn Arg Phe Gly Val Met Lys Lys Gly Ile
85 90 95

CCG GTG GGC GAT AGT GGG TTT ATT TAC AAT AGT TTT AGT TTT GGA GGG 394
Pro Val Gly Asp Ser Gly Phe Ile Tyr Asn Ser Phe Ser Phe Gly Gly
100 105 110

AAC ACT TTA ACG GAA AGG GAT TCC TAT CAG GGG CAA TAC TAT GTC AAT 442
Asn Thr Leu Thr Glu Arg Asp Ser Tyr Gln Gly Gln Tyr Tyr Val Asn
115 120 125

TTA TTC ACT TAT GGC GTG GGG TTA GAT ACG CTG TGG AAT TTT GTG AAT 490
Leu Phe Thr Tyr Gly Val Gly Leu Asp Thr Leu Trp Asn Phe Val Asn
130 135 140
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GGG GGT GGT GTT GGT GGT GCA GTG GGG GGA GCT GTT GGT GGT CCT GCG 248
Gly Gly Gly Val Gly Gly Ala Val Gly Gly Ala Val Gly Gly Pro Ala
55 60 65

GGT GGT TGG GCT GGC AGA TTA GTT GGT GGT TCT GTG GGG AGA GAG TTT 296
Gly Gly Trp Ala Gly Arg Leu Val Gly Gly Ser Val Gly Arg Glu Phe
70 75 80

GGT CGG GAA ATA GGC GAT AGG GTA GAA GAT TAC ATC CGT GGC GTT GAT 344
Gly Arg Glu Ile Gly Asp Arg Val Glu Asp Tyr Ile Arg Gly Val Asp
85 90 95

AGA GAG CCA CAA GCC CCA AGA GAA CCC ACC TAT GAT CGT CAT TTC GTG 392
Arg Glu Pro Gln Ala Pro Arg Glu Pro Thr Tyr Asp Arg His Phe Val
100 105 110

TAT GAC AGG TAGCTTTGGG CGAGAAAGGA GAGAGCATGA ATGTCAAAAA TCGTTTGAG 450
Tyr Asp Arg
115

CGAT 454

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Lys Gly Phe Val Met Ser Gly Leu Lys Ala Phe Ser Cys Val Val
1 5 10 15

Val Leu Cys Gly Ala Met Ala Asn Thr Ala Ile Ala Gly Pro Lys Ile
20 25 30

Glu Ala Arg Gly Glu Phe Gly Arg Phe Trp Gly Gly Ala Val Gly Gly
35 40 45

Ala Ile Gly Gly Gly Val Gly Gly Ala Val Gly Gly Ala Val Gly Gly
50 55 60

Pro Ala Gly Gly Trp Ala Gly Arg Leu Val Gly Gly Ser Val Gly Arg
65 70 75 80

Glu Phe Gly Arg Glu Ile Gly Asp Arg Val Glu Asp Tyr Ile Arg Gly
85 90 95

Val Asp Arg Glu Pro Gln Ala Pro Arg Glu Pro Thr Tyr Asp Arg His
100 105 110

Phe Val Tyr Asp Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 
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515 520 525

Tyr Asp Tyr Gln Gly Arg Leu Tyr Asp Ser Val Trp Asn Pro Ser Ser
530 535 540

Ile Leu Pro Ser Asn Ala Ser Asn Lys Thr Val Asp Leu Thr Leu Thr
545 550 555 560

Gln Tyr Leu Tyr Gly Leu Gly Gly Gln Glu Leu Leu Tyr Phe Lys Ile
565 570 575

Ser Gln Leu Ile Asn Leu Asp Asp Lys Val Ser Pro Phe Arg Met Pro
580 585 590

Leu Glu Ser Lys Ile Gly Phe Ser Pro Leu Thr Gly Leu Asn Ile Phe
595 600 605

Gly Asn Val Phe Tyr Ser Phe Tyr Gln Asn Arg Leu Glu Glu Ile Ser
610 615 620

Val Asn Ala Asn Tyr Gln Arg Lys Phe Leu Ser Phe Asn Leu Ser Tyr
625 630 635 640

Phe Leu Lys Asn Asn Phe Ser Ser Gly Ile Asn Ser Ile Val Glu Asn
645 650 655

Leu Arg Ile Ile
660



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 401 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TCAAGGTGTG CCAAACATGC CTTGAAACTC AATTTTTGAA TCTCAATTTT ATG AAA 56 

Met Lys 
1 

GGA TTT GTT ATG AGT GGA TTA AAA GCA TTT AGT TGT GTA GTG GTT TTA 104 
Gly Phe Val Met Ser Gly Leu Lys Ala Phe Ser Cys Val Val Val Leu 
5 10 15 

TGC GGT GCA ATG GCT AAT ACG GCT ATA GCT GGT CCT AAA ATA GAA GCA 152 
Cys Gly Ala Met Ala Asn Thr Ala Ile Ala Gly Pro Lys Ile Glu Ala
20 25 30 

AGG GGT GAG TTT GGC AGA TTT TGG GGG GGA GCT GTT GGT GGT GCA ATT 200 
Arg Gly Glu Phe Gly Arg Phe Trp Gly Gly Ala Val Gly Gly Ala Ile
35 40 45 50 
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Gly Asn Ile Lys Val Tyr Arg Gly Glu Gly Leu Leu Val Lys Thr Asp
85 90 95

Tyr Val Lys Leu Ser Leu Asn Glu Lys Tyr Glu Ile Ile Phe Pro Phe
100 105 110

Tyr Val Gln Asp Ser Val Ser Gly Ile Trp Val Ser Ala Asp Ile Ala
115 120 125

Ser Gly Lys Asp Gln Lys Tyr Lys Val Lys Asn Met Ser Thr Ser Gly
130 135 140

Cys Ser Ile Asp Asn Pro Ile Trp His Val Asn Ala Thr Ser Gly Ser
145 150 155 160

Phe Asn Met Gln Lys Ser His Leu Ser Met Trp Asn Pro Lys Ile Tyr
165 170 175

Val Gly Asp Ile Pro Val Leu Tyr Leu Pro Tyr Ile Phe Met Ser Thr
180 185 190

Ser Asn Lys Arg Thr Thr Gly Phe Leu Tyr Pro Glu Phe Gly Thr Ser
195 200 205

Asn Leu Asp Gly Phe Ile Tyr Leu Gln Pro Phe Tyr Leu Ala Pro Lys
210 215 220

Asn Ser Trp Asp Met Thr Phe Thr Pro Gln Ile Arg Tyr Lys Arg Gly
225 230 235 240

Phe Gly Leu Asn Phe Glu Ala Arg Tyr Ile Asn Ser Lys Asn Asp Arg
245 250 255

Phe Leu Phe Asn Ala Arg Tyr Phe Arg Asn Tyr Thr Gln Tyr Val Lys
260 265 270

Arg Tyr Asp Leu Arg Asn Gln Asn Ile Tyr Gly Phe Glu Phe Leu Ser
275 280 285

Ser Ser Arg Asp Thr Leu Gln Lys Tyr Phe His Leu Lys Ser Asn Ile
290 295 300

Asp Asn Gly His Tyr Ile Asp Phe Leu Tyr Met Asn Asp Leu Asp Tyr
305 310 315 320

Val Arg Phe Glu Lys Val Asn Lys Arg Ile Thr Asp Ala Thr His Met
325 330 335

Ser Arg Ala Asn Tyr Tyr Leu Gln Thr Glu Asn Asn Tyr Tyr Gly Leu
340 345 350

Asn Ile Lys Tyr Phe Leu Asn Leu Asn Lys Ile Asn Asn Asn Arg Thr
355 360 365

Phe Gln Ser Val Pro Asn Leu Gln Tyr His Lys Tyr Leu Asn Ser Leu
370 375 380

Tyr Phe Arg Asn Leu Leu Tyr Ser Val Asp Tyr Gln Phe Arg Asn Thr
385 390 395 400

Ala Arg Glu Ile Gly Tyr Gly Tyr Val Gln Asn Ala Leu Asn Val Pro
405 410 415

Val Gly Leu Gln Phe Ser Leu Phe Lys Lys Tyr Leu Ser Leu Gly Leu
420 425 430

Trp Asn Asp Leu Gln Leu Ser Asn Val Ala Leu Met Gln Ser Lys Asn
435 440 445

Ser Phe Val Pro Thr Ile Pro Asn Glu Ser Arg Glu Phe Gly Asn Phe
450 455 460

Val Ser Ser Asn Phe Ser Met Tyr Val Asn Thr Asp Leu Ala Arg Glu
465 470 475 480

Tyr Asn Lys Leu Phe His Thr Ile Gln Leu Glu Ala Ile Phe Asn Ile
485 490 495

Pro Tyr Tyr Thr Phe Lys Asn Gly Leu Phe Ser Gln Asn Met Tyr Ala
500 505 510

Leu Ser Ala Gln Ala Leu Asn Ser Tyr Thr Ser Pro Leu Leu Arg Asp
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CAA TAC CTT TAT GGC TTA GGG GGG CAA GAG TTA TTG TAT TTT AAA ATA 1787
Gln Tyr Leu Tyr Gly Leu Gly Gly Gln Glu Leu Leu Tyr Phe Lys Ile
565 570 575

TCG CAA CTC ATC AAT CTT GAC GAT AAA GTT TCG CCC TTT AGA ATG CCA 1835
Ser Gln Leu Ile Asn Leu Asp Asp Lys Val Ser Pro Phe Arg Met Pro
580 585 590

CTA GAG AGC AAG ATC GGG TTT TCG CCC TTA ACG GGA TTG AAC ATC TTT 1883
Leu Glu Ser Lys Ile Gly Phe Ser Pro Leu Thr Gly Leu Asn Ile Phe
595 600 605

GGG AAT GTC TTT TAT TCG TTT TAT CAA AAC CGC TTA GAA GAA ATC TCT 1931
Gly Asn Val Phe Tyr Ser Phe Tyr Gln Asn Arg Leu Glu Glu Ile Ser
610 615 620

GTG AAC GCC AAT TAC CAA CGC AAG TTT TTA AGC TTT AAC CTC TCT TAT 1979
Val Asn Ala Asn Tyr Gln Arg Lys Phe Leu Ser Phe Asn Leu Ser Tyr
625 630 635 640

TTT TTA AAA AAC AAT TTT AGC AGT GGG ATT AAT AGC ATT GTA GAA AAT 2027
Phe Leu Lys Asn Asn Phe Ser Ser Gly Ile Asn Ser Ile Val Glu Asn
645 650 655

CTG CGG ATT ATT TAAAGGCGGG TTTTAGCAAC GACTTTGGCT ATTTTTCCAT GAGCGC 2085
Leu Arg Ile Ile
660

GGATGTGGGT TATGATATTA GAAACAATGT GGTTTTAAAT TGGAATGTGG GGATTTATAA 2145

AAAAATCCGT TGTTTTGGGA TTGG 2169

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Ile Tyr Trp Leu Tyr Leu Ala Val Phe Phe Leu Leu Ser Ala Leu
1 5 10 15

Asp Ala Lys Glu Ile Ala Met Gln Arg Phe Asp Lys Gln Asn His Lys
20 25 30

Ile Phe Glu Ile Leu Ala Asp Lys Val Ser Ala Lys Asp Asn Val Ile
35 40 45

Thr Ala Ser Gly Asn Ala Ile Leu Leu Asn Tyr Asp Val Tyr Ile Leu
50 55 60

Ala Asp Lys Val Arg Tyr Asp Thr Lys Thr Lys Glu Ala Leu Leu Glu 
65 70 75 80 
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TCT AGG GCG AAT TAC TAT TTG CAA ACA GAA AAC AAT TAT TAC GGC TTG 1115
Ser Arg Ala Asn Tyr Tyr Leu Gln Thr Glu Asn Asn Tyr Tyr Gly Leu
340 345 350

AAT ATC AAG TAT TTT TTA AAC CTG AAT AAA ATC AAC AAT AAC CGC ACT 1163
Asn Ile Lys Tyr Phe Leu Asn Leu Asn Lys Ile Asn Asn Asn Arg Thr
355 360 365

TTC CAA TCT GTC CCT AAT TTG CAA TAC CAT AAA TAT TTA AAT TCT TTG 1211
Phe Gln Ser Val Pro Asn Leu Gln Tyr His Lys Tyr Leu Asn Ser Leu
370 375 380

TAT TTT AGA AAT TTG TTG TAT TCG GTG GAT TAT CAG TTT AGA AAC ACC 1259
Tyr Phe Arg Asn Leu Leu Tyr Ser Val Asp Tyr Gln Phe Arg Asn Thr
385 390 395 400

GCA AGA GAG ATT GGT TAT GGC TAT GTG CAA AAC GCT TTG AAT GTG CCG 1307
Ala Arg Glu Ile Gly Tyr Gly Tyr Val Gln Asn Ala Leu Asn Val Pro
405 410 415

GTG GGC TTG CAA TTT TCT TTG TTT AAA AAG TAT TTG TCT TTA GGG CTT 1355
Val Gly Leu Gln Phe Ser Leu Phe Lys Lys Tyr Leu Ser Leu Gly Leu
420 425 430

TGG AAT GAT CTC CAA CTA TCT AAT GTG GCT TTA ATG CAA TCT AAA AAT 1403
Trp Asn Asp Leu Gln Leu Ser Asn Val Ala Leu Met Gln Ser Lys Asn
435 440 445

TCC TTC GTG CCT ACG ATC CCT AAT GAA TCA AGG GAA TTT GGG AAT TTT 1451
Ser Phe Val Pro Thr Ile Pro Asn Glu Ser Arg Glu Phe Gly Asn Phe
450 455 460

GTG TCT TCA AAT TTT TCC ATG TAT GTC AAT ACG GAT TTG GCT AGA GAA 1499
Val Ser Ser Asn Phe Ser Met Tyr Val Asn Thr Asp Leu Ala Arg Glu
465 470 475 480

TAC AAC AAG CTT TTC CAC ACG ATC CAA CTA GAA GCG ATT TTC AAC ATC 1547
Tyr Asn Lys Leu Phe His Thr Ile Gln Leu Glu Ala Ile Phe Asn Ile
485 490 495

CCT TAT TAC ACC TTT AAA AAC GGC TTA TTT TCT CAA AAC ATG TAT GCT 1595
Pro Tyr Tyr Thr Phe Lys Asn Gly Leu Phe Ser Gln Asn Met Tyr Ala
500 505 510

TTA AGC GCG CAA GCC TTA AAC AGC TAC ACT TCG CCT TTA TTG AGA GAT 1643
Leu Ser Ala Gln Ala Leu Asn Ser Tyr Thr Ser Pro Leu Leu Arg Asp
515 520 525

TAT GAT TAT CAA GGG CGT TTG TAT GAC TCG GTG TGG AAT CCT AGC AGT 1691
Tyr Asp Tyr Gln Gly Arg Leu Tyr Asp Ser Val Trp Asn Pro Ser Ser
530 535 540

ATT TTA CCT AGC AAT GCG AGC AAC AAG ACG GTG GAT TTA ACC CTA ACG 1739
Ile Leu Pro Ser Asn Ala Ser Asn Lys Thr Val Asp Leu Thr Leu Thr
545 550 555 560
Tyr Val Gln Asp Ser Val Ser Gly Ile Trp Val Ser Ala Asp Ile Ala
115 120 125

AGC GGA AAG GAT CAA AAA TAT AAG GTT AAA AAC ATG AGC ACT TCA GGG 491
Ser Gly Lys Asp Gln Lys Tyr Lys Val Lys Asn Met Ser Thr Ser Gly
130 135 140

TGC AGC ATT GAT AAC CCC ATT TGG CAT GTC AAT GCG ACT TCA GGC TCA 539
Cys Ser Ile Asp Asn Pro Ile Trp His Val Asn Ala Thr Ser Gly Ser
145 150 155 160

TTC AAC ATG CAA AAA TCG CAT TTG TCT ATG TGG AAT CCT AAG ATC TAT 587
Phe Asn Met Gln Lys Ser His Leu Ser Met Trp Asn Pro Lys Ile Tyr
165 170 175

GTC GGT GAT ATT CCT GTA TTG TAT TTG CCC TAT ATT TTC ATG TCC ACG 635
Val Gly Asp Ile Pro Val Leu Tyr Leu Pro Tyr Ile Phe Met Ser Thr
180 185 190

AGC AAT AAA AGA ACT ACT GGG TTT TTA TAC CCT GAG TTT GGC ACT TCC 683
Ser Asn Lys Arg Thr Thr Gly Phe Leu Tyr Pro Glu Phe Gly Thr Ser
195 200 205

AAC TTA GAC GGC TTT ATT TAT TTG CAA CCC TTT TAT TTA GCC CCC AAA 731
Asn Leu Asp Gly Phe Ile Tyr Leu Gln Pro Phe Tyr Leu Ala Pro Lys
210 215 220

AAC TCA TGG GAT ATG ACC TTT ACC CCA CAA ATC CGC TAT AAA AGG GGT 779
Asn Ser Trp Asp Met Thr Phe Thr Pro Gln Ile Arg Tyr Lys Arg Gly
225 230 235 240

TTT GGC TTG AAT TTT GAA GCG CGC TAC ATT AAC TCT AAA AAC GAC AGG 827
Phe Gly Leu Asn Phe Glu Ala Arg Tyr Ile Asn Ser Lys Asn Asp Arg
245 250 255

TTT TTA TTC AAC GCG CGC TAT TTT AGG AAT TAC ACC CAA TAT GTC AAA 875
Phe Leu Phe Asn Ala Arg Tyr Phe Arg Asn Tyr Thr Gln Tyr Val Lys
260 265 270

CGC TAC GAT TTG AGG AAT CAA AAT ATC TAC GGG TTT GAA TTT TTA AGC 923
Arg Tyr Asp Leu Arg Asn Gln Asn Ile Tyr Gly Phe Glu Phe Leu Ser
275 280 285

TCT AGC AGG GAC ACT TTA CAA AAA TAC TTC CAC CTT AAG TCT AAT ATT 971
Ser Ser Arg Asp Thr Leu Gln Lys Tyr Phe His Leu Lys Ser Asn Ile
290 295 300

GAC AAC GGG CAT TAC ATT GAC TTT TTA TAC ATG AAC GAT TTG GAC TAT 1019
Asp Asn Gly His Tyr Ile Asp Phe Leu Tyr Met Asn Asp Leu Asp Tyr
305 310 315 320

GTG CGT TTT GAA AAG GTT AAT AAG CGT ATC ACA GAC GCC ACG CAC ATG 1067
Val Arg Phe Glu Lys Val Asn Lys Arg Ile Thr Asp Ala Thr His Met
325 330 335
Lys Leu Ser Asp Thr Tyr Glu Gln Leu Ser Arg Leu Leu Thr Asn Asp
50 55 60

Asn Gly Thr Asn Ser Lys Thr Ser Ala Gln Xaa Gln Pro Ser Gly
65 70 75

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 60... 2039 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CCCTATCATA GGGCGTGGCA TGAAGAAAAA AGCAAAAGTC TTTTGGTATT GTTTTAATC 59 



ATG ATT TAT TGG TTG TAT TTG GCG GTC TTT TTT TTG TTG AGC GCA TTA 107
Met Ile Tyr Trp Leu Tyr Leu Ala Val Phe Phe Leu Leu Ser Ala Leu
1 5 10 15

GAC GCT AAA GAA ATC GCT ATG CAA CGA TTT GAC AAA CAA AAC CAT AAG 155
Asp Ala Lys Glu Ile Ala Met Gln Arg Phe Asp Lys Gln Asn His Lys
20 25 30

ATT TTT GAA ATC CTT GCG GAT AAA GTG AGC GCT AAA GAC AAT GTG ATA 203
Ile Phe Glu Ile Leu Ala Asp Lys Val Ser Ala Lys Asp Asn Val Ile
35 40 45

ACC GCA TCA GGG AAT GCG ATC TTA TTG AAT TAT GAT GTG TAT ATT CTA 251
Thr Ala Ser Gly Asn Ala Ile Leu Leu Asn Tyr Asp Val Tyr Ile Leu
50 55 60

GCG GAC AAG GTG CGT TAT GAC ACT AAA ACC AAA GAA GCG TTA TTA GAG 299
Ala Asp Lys Val Arg Tyr Asp Thr Lys Thr Lys Glu Ala Leu Leu Glu
65 70 75 80

GGG AAT ATC AAG GTT TAT AGG GGC GAG GGT TTG CTC GTT AAA ACC GAT 347
Gly Asn Ile Lys Val Tyr Arg Gly Glu Gly Leu Leu Val Lys Thr Asp
85 90 95

TAC GTG AAA TTG AGT TTG AAT GAA AAA TAT GAA ATC ATT TTC CCC TTT 395
Tyr Val Lys Leu Ser Leu Asn Glu Lys Tyr Glu Ile Ile Phe Pro Phe
100 105 110

TAT GTC CAA GAC AGC GTG AGC GGG ATT TGG GTG AGC GCG GAT ATT GCC 443
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 87...323
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

AAATTTATGT TATAATTAAA CGCATTGTAA ATAAATTCTC ATTTTGATAC ATTTTTACAA 60 
TAAAACATTA CTTTAAGGAA CATCTT ATG AAA AAA ACG AAA AAA ACG ATT CTG 113 

Met Lys Lys Thr Lys Lys Thr Ile Leu
1 5

CTT TCT CTA ACT CTC GCG GCG TCA TTG CTC CAT GCT GAA GAC AAC GGC 161
Leu Ser Leu Thr Leu Ala Ala Ser Leu Leu His Ala Glu Asp Asn Gly
10 15 20 25

GTT TTT TTA AGC GTG GGT TAT CAA ATC GGT GAA GCG GTT CAA AAA GTG 209
Val Phe Leu Ser Val Gly Tyr Gln Ile Gly Glu Ala Val Gln Lys Val
30 35 40

AAA AAC GCC GAC AAG GTG CAA AAA CTT TCA GAC ACT TAT GAA CAA TTA 257
Lys Asn Ala Asp Lys Val Gln Lys Leu Ser Asp Thr Tyr Glu Gln Leu
45 50 55

AGC CGG CTT TTA ACC AAC GAT AAT GGC ACA AAC TCA AAG ACA AGC GCG 305
Ser Arg Leu Leu Thr Asn Asp Asn Gly Thr Asn Ser Lys Thr Ser Ala
60 65 70

CAA NAT CAA CCA AGC GGT TAATAATTTG AACGAACGCG CAAAAACTTT AGCCGGTG 361
Gln Xaa Gln Pro Ser Gly
75

GGACAACCAA TTCCCC 377

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Lys Lys Thr Lys Lys Thr Ile Leu Leu Ser Leu Thr Leu Ala Ala
1 5 10 15

Ser Leu Leu His Ala Glu Asp Asn Gly Val Phe Leu Ser Val Gly Tyr
20 25 30

Gln Ile Gly Glu Ala Val Gln Lys Val Lys Asn Ala Asp Lys Val Gln
35 40 45
130 135 140

Tyr Leu Glu Tyr Ala Tyr Lys Asn Tyr Phe Glu Ile Lys Ala Gly Arg
145 150 155 160

Tyr Leu Ser Thr Met Pro Tyr Lys Ser Gly Gln Thr Gln Gly Phe Gln
165 170 175

Ile Ser Gly Gln Tyr Lys Lys Ala Arg Leu Thr Trp Phe Ser Ser Phe
180 185 190

Gly Arg Ala Phe Ala Tyr Gly Ser Phe Leu Met Asp Trp Phe Ala Ala
195 200 205

Arg Thr Thr Tyr Ser Gly Gly Phe Thr Lys Asn Asp Lys Gly Gly Tyr
210 215 220

Asp Ser His Gly Arg Lys Val Leu Tyr Gly Thr His Ala Val Gln Leu
225 230 235 240

Thr Tyr Lys Pro His Arg Phe Leu Ile Glu Gly Phe Tyr Tyr Leu Ser
245 250 255

Pro Gln Ile Phe Asn Ala Pro Gly Val Lys Ile Gly Trp Asp Ser Asn
260 265 270

Pro Asn Phe Ser Gly Thr Gly Phe Arg Ser Asp Thr Ala Ile Ile Gly
275 280 285

Phe Phe Pro Ile Tyr Tyr Pro Trp Met Ile Val Lys Ser Asn Gly Ser
290 295 300

Pro Val Tyr Lys Tyr Asp Thr Pro Ala Thr Gln Asn Gly Gln Asn Leu
305 310 315 320

Ile Ile Leu Gln Arg Phe Asp Ile Asn Asn Tyr Asn Val Ser Ile Ala
325 330 335

Phe Tyr Lys Val Phe Gln Asn Ala Asn Gly Trp Ile Gly Asn Met Gly
340 345 350

Asn Pro Ser Gly Val Ile Met Gly Ser Asn Ser Val Tyr Ala Gly Phe
355 360 365

Thr Gly Thr Ala Leu Lys Arg Asp Ala Ala Thr Ile Phe Leu Ser Cys
370 375 380

Gly Gly Thr His Phe Ala Lys Lys Phe Thr Trp Lys Phe Ala Thr Gln
385 390 395 400

Tyr Ser Asn Ser Val Val Ser Trp Glu Ala Arg Ala Met Ile Ser Leu
405 410 415

Gly Tyr Lys Phe Thr Glu Tyr Leu Ser Gly Ser Val Asp Leu Ala Tyr
420 425 430

Tyr Gly Val Tyr Thr Asn Lys Gly Phe Lys Pro Gly Glu Asn Gly Pro
435 440 445

Val Pro Lys Asp Phe Pro Ala Leu Tyr Ser Asp Arg Ser Ala Leu Tyr
450 455 460

Thr Ala Leu Val Ala Ser Phe
465 470



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
Thr His Phe Ala Lys Lys Phe Thr Trp Lys Phe Ala Thr Gln Tyr Ser
390 395 400

AAT TCA GTG GTT TCT TGG GAA GCG AGA GCG ATG ATC TCT TTA GGT TAT 1304
Asn Ser Val Val Ser Trp Glu Ala Arg Ala Met Ile Ser Leu Gly Tyr
405 410 415

AAA TTC ACT GAA TAC TTG AGC GGT AGC GTG GAT CTT GCA TAT TAT GGC 1352
Lys Phe Thr Glu Tyr Leu Ser Gly Ser Val Asp Leu Ala Tyr Tyr Gly
420 425 430

GTG TAT ACT AAC AAA GGA TTT AAA CCG GGT GAA AAC GGG CCT GTG CCT 1400
Val Tyr Thr Asn Lys Gly Phe Lys Pro Gly Glu Asn Gly Pro Val Pro
435 440 445 450

AAA GAC TTC CCC GCC CTT TAT TCT GAC AGG AGC GCG TTA TAC ACG GCT 1448
Lys Asp Phe Pro Ala Leu Tyr Ser Asp Arg Ser Ala Leu Tyr Thr Ala
455 460 465

CTA GTA GCA TCT TTT TGATGCTACC CTATGATTAT GGTGGGCGTC TTTTGATGCT G 1504 
Leu Val Ala Ser Phe 
470 

TTTCTCTAGT CT 1516 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Lys Lys Ala Ser Gln Val Leu Phe Phe Gly Ala Phe Leu Ser Ser
1 5 10 15

Ser Leu Gln Gly Phe Glu Ala Lys Leu Asn Gly Phe Val Asp Gln Ser
20 25 30

Ser Thr Ile Gly Phe Asn Gln His Lys Ile Asn Lys Glu Arg Gly Ile
35 40 45

Tyr Pro Met Gln Gln Phe Ala Thr Ile Ala Gly Tyr Leu Gly Leu Gly
50 55 60

Phe Ser Leu Leu Pro Lys Lys Val Ser Asp His Val Leu Lys Gly Lys
65 70 75 80

Ile Gly Gly Met Val Gly Ser Ile Phe Tyr Asp Gly Thr Lys Lys Phe
85 90 95

Glu Asp Ser Ser Val Ala Tyr Asn Leu Phe Gly Tyr Tyr Asp Gly Phe
100 105 110

Met Gly Gly Tyr Thr Asn Ile Leu Gln Ser Asp Asp Leu Ala Thr Gln
115 120 125

Asn Met Lys Tyr Asn Lys Asn Val Arg Asn Tyr Val Phe Ser Asp Ala
165 170 175

GGG CAA TAC AAG AAA GCG CGC TTG ACT TGG TTT AGC TCT TTT GGG AGG 632
Gly Gln Tyr Lys Lys Ala Arg Leu Thr Trp Phe Ser Ser Phe Gly Arg
180 185 190

GCG TTC GCT TAC GGC TCG TTT TTG ATG GAT TGG TTT GCC GCT AGG ACC 680
Ala Phe Ala Tyr Gly Ser Phe Leu Met Asp Trp Phe Ala Ala Arg Thr
195 200 205 210

ACT TAT AGC GGA GGT TTT ACC AAA AAC GAT AAG GGA GGT TAT GAT AGC 728
Thr Tyr Ser Gly Gly Phe Thr Lys Asn Asp Lys Gly Gly Tyr Asp Ser
215 220 225

CAT GGG CGA AAG GTG CTT TAT GGC ACG CAT GCG GTG CAA CTC ACC TAT 776
His Gly Arg Lys Val Leu Tyr Gly Thr His Ala Val Gln Leu Thr Tyr
230 235 240

AAA CCT CAT CGT TTC CTC ATA GAA GGC TTT TAT TAC CTT TCG CCT CAA 824
Lys Pro His Arg Phe Leu Ile Glu Gly Phe Tyr Tyr Leu Ser Pro Gln
245 250 255

ATC TTT AAC GCT CCG GGC GTT AAG ATT GGT TGG GAT TCT AAC CCT AAT 872
Ile Phe Asn Ala Pro Gly Val Lys Ile Gly Trp Asp Ser Asn Pro Asn
260 265 270

TTT AGC GGC ACA GGC TTT CGC TCT GAT ACA GCT ATC ATA GGG TTT TTC 920
Phe Ser Gly Thr Gly Phe Arg Ser Asp Thr Ala Ile Ile Gly Phe Phe
275 280 285 290

CCC ATT TAC TAC CCT TGG ATG ATC GTT AAA TCC AAT GGA AGC CCG GTC 968
Pro Ile Tyr Tyr Pro Trp Met Ile Val Lys Ser Asn Gly Ser Pro Val
295 300 305

TAT AAA TAC GAC ACG CCT GCC ACT CAA AAT GGG CAA AAC CTC ATT ATC 1016
Tyr Lys Tyr Asp Thr Pro Ala Thr Gln Asn Gly Gln Asn Leu Ile Ile
310 315 320

CTC CAA CGC TTT GAC ATC AAC AAT TAC AAT GTT TCC ATC GCT TTT TAT 1064
Leu Gln Arg Phe Asp Ile Asn Asn Tyr Asn Val Ser Ile Ala Phe Tyr
325 330 335

AAA GTC TTT CAA AAC GCT AAT GGT TGG ATA GGC AAC ATG GGG AAT CCA 1112
Lys Val Phe Gln Asn Ala Asn Gly Trp Ile Gly Asn Met Gly Asn Pro
340 345 350

AGC GGT GTG ATC ATG GGG AGT AAC AGC GTC TAT GCG GGT TTT ACA GGC 1160
Ser Gly Val Ile Met Gly Ser Asn Ser Val Tyr Ala Gly Phe Thr Gly
355 360 365 370

ACA GCC CTT AAA AGA GAT GCC GCT ACC ATT TTC CTT TCT TGT GGC GGC 1208
Thr Ala Leu Lys Arg Asp Ala Ala Thr Ile Phe Leu Ser Cys Gly Gly
375 380 385

ACT CAT TTT GCC AAA AAA TTC ACA TGG AAA TTC GCC ACG CAA TAC TCC 1256
(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1463
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TTTATCTTTA AAAGTATTTG CATTTATCAA TCTCATTTTA GGAGGCATGC ATG AAA 56 

Met Lys 
1 

AAG GCA AGT CAG GTT TTA TTC TTT GGG GCA TTT TTA AGC TCT TCT TTG 104
Lys Ala Ser Gln Val Leu Phe Phe Gly Ala Phe Leu Ser Ser Ser Leu
5 10 15

CAA GGT TTT GAA GCT AAG CTC AAC GGC TTT GTG GAT CAA TCC AGC ACT 152
Gln Gly Phe Glu Ala Lys Leu Asn Gly Phe Val Asp Gln Ser Ser Thr
20 25 30

ATC GGT TTT AAC CAG CAT AAA ATC AAT AAA GAA AGA GGT ATC TAC CCT 200
Ile Gly Phe Asn Gln His Lys Ile Asn Lys Glu Arg Gly Ile Tyr Pro
35 40 45 50

ATG CAG CAA TTC GCA ACG ATT GCG GGC TAT TTA GGG CTT GGT TTT AGC 248
Met Gln Gln Phe Ala Thr Ile Ala Gly Tyr Leu Gly Leu Gly Phe Ser
55 60 65

CTG TTA CCC AAA AAG GTT TCA GAC CAT GTT CTA AAA GGC AAA ATA GGA 296
Leu Leu Pro Lys Lys Val Ser Asp His Val Leu Lys Gly Lys Ile Gly
70 75 80

GGC ATG GTG GGA TCT ATT TTC TAT GAT GGC ACG AAG AAG TTT GAA GAC 344
Gly Met Val Gly Ser Ile Phe Tyr Asp Gly Thr Lys Lys Phe Glu Asp
85 90 95

AGC TCT GTA GCT TAC AAC CTC TTT GGT TAT TAT GAT GGG TTC ATG GGG 392
Ser Ser Val Ala Tyr Asn Leu Phe Gly Tyr Tyr Asp Gly Phe Met Gly
100 105 110

GGT TAT ACA AAC ATC TTA CAA AGC GAT GAT TTA GCG ACA CAA AAC ATG 440
Gly Tyr Thr Asn Ile Leu Gln Ser Asp Asp Leu Ala Thr Gln Asn Met
115 120 125 130

AAA TAC AAT AAA AAT GTC CGC AAC TAT GTC TTT AGC GAC GCG TAT TTA 488
Lys Tyr Asn Lys Asn Val Arg Asn Tyr Val Phe Ser Asp Ala Tyr Leu
135 140 145

GAA TAC GCT TAT AAG AAT TAT TTT GAA ATA AAA GCC GGG CGC TAT TTA 536
Glu Tyr Ala Tyr Lys Asn Tyr Phe Glu Ile Lys Ala Gly Arg Tyr Leu
150 155 160

TCC ACT ATG CCT TAT AAA AGC GGT CAA ACG CAA GGC TTT CAA ATT TCT 584
Ser Thr Met Pro Tyr Lys Ser Gly Gln Thr Gln Gly Phe Gln Ile Ser
(2) INFORMATION FOR SEQ ID NO: 16:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Asn Lys Thr Thr Val Lys Ile Leu Met Gly Met Ala Leu Leu Ser
1 5 10 15

Ser Leu Gln Ala Ala Glu Ala Glu Leu Asp Glu Lys Ser Lys Lys Pro
20 25 30

Lys Phe Ala Asp Arg Asn Thr Phe Tyr Leu Gly Val Gly Tyr Gln Leu
35 40 45

Ser Ala Ile Asn Thr Ser Phe Ser Thr Glu Ser Val Asp Lys Ser Tyr
50 55 60

Phe Met Thr Gly Asn Gly Phe Gly Val Val Leu Gly Gly Lys Phe Val
65 70 75 80

Ala Lys Thr Gln Ala Val Glu His Val Gly Phe Arg Tyr Gly Leu Phe
85 90 95

Tyr Asp Gln Thr Phe Ser Ser His Lys Ser Tyr Ile Ser Thr Tyr Gly
100 105 110

Leu Glu Phe Ser Gly Leu Trp Asp Ala Phe Asn Ser Pro Lys Met Phe
115 120 125

Leu Gly Leu Glu Phe Gly Leu Gly Ile Ala Gly Ala Thr Tyr Met Pro
130 135 140

Gly Gly Ala Met His Gly Ile Ile Ala Gln Asn Leu Gly Lys Glu Asn
145 150 155 160

Ser Leu Phe Gln Leu Leu Val Lys Val Gly Phe Arg Phe Gly Phe Leu
165 170 175

His Asn Glu Ile Thr Phe Gly Leu Lys Phe Pro Val Ile Pro Asn Lys
180 185 190

Arg Thr Glu Ile Ile Asp Gly Leu Ser Thr Thr Thr Leu Trp His Arg
195 200 205

Leu Pro Val Ala Tyr Phe Asn Tyr Ile Tyr Asn Phe
210 215 220



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 
Lys Ile Leu Met Gly Met Ala Leu Leu Ser Ser Leu Gln Ala Ala Glu
10 15 20

GCA GAG CTT GAT GAA AAA TCA AAA AAA CCT AAA TTT GCG GAC AGG AAT 211
Ala Glu Leu Asp Glu Lys Ser Lys Lys Pro Lys Phe Ala Asp Arg Asn
25 30 35

ACA TTT TAT TTA GGG GTT GGG TAT CAA CTT AGT GCG ATC AAC ACA TCT 259
Thr Phe Tyr Leu Gly Val Gly Tyr Gln Leu Ser Ala Ile Asn Thr Ser
40 45 50

TTT AGC ACC GAG TCT GTA GAT AAA TCG TAT TTT ATG ACC GGC AAT GGC 307
Phe Ser Thr Glu Ser Val Asp Lys Ser Tyr Phe Met Thr Gly Asn Gly
55 60 65 70

TTT GGT GTG GTG TTA GGG GGG AAA TTT GTG GCT AAA ACG CAA GCT GTA 355
Phe Gly Val Val Leu Gly Gly Lys Phe Val Ala Lys Thr Gln Ala Val
75 80 85

GAG CAT GTG GGT TTC CGT TAC GGG TTG TTT TAT GAT CAG ACC TTT TCT 403
Glu His Val Gly Phe Arg Tyr Gly Leu Phe Tyr Asp Gln Thr Phe Ser
90 95 100

TCT CAC AAA TCC TAT ATT TCT ACC TAT GGT TTA GAA TTT AGC GGT TTG 451
Ser His Lys Ser Tyr Ile Ser Thr Tyr Gly Leu Glu Phe Ser Gly Leu
105 110 115

TGG GAC GCT TTC AAT TCG CCA AAG ATG TTT TTA GGG TTA GAG TTT GGC 499
Trp Asp Ala Phe Asn Ser Pro Lys Met Phe Leu Gly Leu Glu Phe Gly
120 125 130

TTA GGC ATC GCT GGG GCG ACT TAT ATG CCA GGA GGG GCT ATG CAT GGG 547
Leu Gly Ile Ala Gly Ala Thr Tyr Met Pro Gly Gly Ala Met His Gly
135 140 145 150

ATT ATC GCT CAA AAT TTA GGC AAA GAA AAT TCG CTT TTC CAA TTG CTT 595
Ile Ile Ala Gln Asn Leu Gly Lys Glu Asn Ser Leu Phe Gln Leu Leu
155 160 165

GTG AAA GTG GGT TTT CGT TTT GGC TTT TTG CAC AAT GAA ATC ACT TTC 643
Val Lys Val Gly Phe Arg Phe Gly Phe Leu His Asn Glu Ile Thr Phe
170 175 180

GGG TTG AAA TTC CCT GTC ATT CCT AAC AAA AGA ACG GAA ATC ATT GAT 691
Gly Leu Lys Phe Pro Val Ile Pro Asn Lys Arg Thr Glu Ile Ile Asp
185 190 195

GGC TTG AGC ACG ACT ACT TTA TGG CAC CGC TTA CCG GTA GCT TAT TTC 739
Gly Leu Ser Thr Thr Thr Leu Trp His Arg Leu Pro Val Ala Tyr Phe
200 205 210

AAT TAT ATC TAT AAT TTT TAGATATGGT TATTTAGAGG TTTTAGATTT GACAAAAT 795
Asn Tyr Ile Tyr Asn Phe
215 220
Tyr Glu Arg Lys Gly Val Ser Val Ser Pro Phe Phe Gln Phe Ser Pro
260 265 270

Gly Thr Tyr Tyr Ser Pro Gly Val Val Val Gly Tyr Asp Ser Asn Pro
275 280 285

Asn Phe Asn Gly Val Gly Phe Arg Ser Glu Thr Lys Ala Tyr Ile Leu
290 295 300

Leu Pro Val His Asp Pro Leu Arg Arg Asp Thr Tyr Arg Tyr Ala Ile
305 310 315 320

Lys Ala Gly Thr Ala Gly Gln Ser Leu Leu Ile Arg Gln Arg Phe Asp
325 330 335

Tyr Asn Glu Phe Asn Phe Gly Gly Ala Phe Tyr Lys Val Trp Lys Asn
340 345 350

Ala Asn Ala Tyr Ile Gly Thr Thr Gly Asn Pro Leu Gly Ile Asp Phe
355 360 365

Trp Thr Asn Ser Val Tyr Asp Ile Gly Gln Ala Leu Ser His Val Val
370 375 380

Thr Ala Asp Ala Val Ser Gly Trp Val Phe Gly Gly Gly Val His Lys
385 390 395 400

Lys Trp Leu Trp Gly Thr Leu Trp Arg Trp Thr Ser Gly Thr Leu Ala
405 410 415

Asn Glu Ala Ser Ala Ala Val Asn Val Gly Tyr Lys Ile Ser Lys Ser
420 425 430

Leu Thr Ala Ser Val Lys Leu Glu Tyr Leu Gly Val Met Thr His Ala
435 440 445

Gly Phe Thr Val Gly Ser Tyr Arg Pro Thr Pro Gly Ser Lys Ala Leu
450 455 460

Tyr Ser Asp Arg Ser His Leu Met Thr Thr Leu Ser Ala Lys Phe
465 470 475

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 98...757
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

CTAATATTTA TTTTAAACTT TGTTATTATT TAAGGTGTGA TTTGATTTTA GTCTGTATGG 60 
GGCAAGTGTG GGGCAGGATA ACATAAGGAA TTGGGTT ATG AAT AAA ACA ACG GTT 115 

Met Asn Lys Thr Thr Val 
1 5 

AAA ATA TTA ATG GGC ATG GCG TTA TTA TCA TCG CTT CAA GCC GCA GAG 163 
AAA GCG CTT TAT TCA GAC AGG AGT CAT TTG ATG ACA ACT CTT AGC GCT 1503
Lys Ala Leu Tyr Ser Asp Arg Ser His Leu Met Thr Thr Leu Ser Ala
465 470 475

AAA TTC TAACCAATCG CTTTAAGCTG TTTATTAAAG CGTTAAAAAT CCCTTAATAA AA 1561 
Lys Phe 



A 1562 
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Lys Asn Ser Thr Pro Leu Lys Asn Gln Val Phe Cys Gly Leu Tyr
1 5 10 15

Val Leu Ser Leu Ser Ala Ser Leu Gln Ala Phe Asp Tyr Lys Ile Glu
20 25 30

Val Ser Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Lys Lys Lys Ile
35 40 45

Asp Ile Ala Arg Gly Ile Tyr Pro Thr Glu Thr Phe Val Thr Ala Val
50 55 60

Gly Gln Gly Asn Ile Tyr Ala Asp Phe Leu Pro Lys Gly Leu Lys Asp
65 70 75 80

Gln Gly His Val Leu Glu Gly Lys Ile Gly Gly Thr Leu Gly Gly Val
85 90 95

Ala Tyr Asp Ser Thr Lys Phe Asn Gln Gly Gly Ser Val Ile Tyr Asn
100 105 110

Tyr Ile Gly Tyr Trp Asp Gly Tyr Leu Gly Gly Lys Arg Ala Leu Leu
115 120 125

Asp Gly Thr Ser Ile His Glu Cys Ala Leu Gly Ser Asp Gly Lys Val
130 135 140

Ile Asp Ser Ile Ala Cys Gly Asn Ala Arg Ala Asn Lys Ile Arg Arg
145 150 155 160

Asn Tyr Leu Met Asn Asn Ala Phe Leu Glu Tyr Arg Tyr Lys Asp Ile
165 170 175

Phe Leu Ala Lys Gly Gly Arg Tyr Gln Ser Asn Ala Pro Tyr Met Ser
180 185 190

Gly Tyr Thr Gln Gly Phe Glu Ile Ser Ala Lys Val Lys Asp Lys Asn
195 200 205

Glu Gly Ile His Lys Leu Trp Trp Phe Ser Ser Trp Gly Arg Ala Phe
210 215 220

Ala Tyr Gly Glu Trp Ile Tyr Asp Phe Tyr Ser Pro Arg Thr Val Val
225 230 235 240

Lys Asn Gly Arg Thr Leu Asn Tyr Gly Ile His Leu Val Asn Tyr Thr
245 250 255
ACC GTG GTT AAA AAC GGG CGC ACT TTG AAT TAT GGT ATC CAT TTA GTG 831
Thr Val Val Lys Asn Gly Arg Thr Leu Asn Tyr Gly Ile His Leu Val
240 245 250

AAT TAT ACT TAT GAA AGA AAA GGG GTT AGC GTT AGC CCT TTT TTC CAA 879
Asn Tyr Thr Tyr Glu Arg Lys Gly Val Ser Val Ser Pro Phe Phe Gln
255 260 265

TTT TCG CCT GGG ACT TAT TAT AGC CCT GGG GTG GTT GTA GGC TAT GAT 927
Phe Ser Pro Gly Thr Tyr Tyr Ser Pro Gly Val Val Val Gly Tyr Asp
270 275 280 285

AGT AAC CCT AAT TTT AAC GGC GTT GGC TTT AGA TCC GAA ACA AAA GCT 975
Ser Asn Pro Asn Phe Asn Gly Val Gly Phe Arg Ser Glu Thr Lys Ala
290 295 300

TAT ATT TTG CTC CCT GTC CAT GAC CCC TTA AGA AGG GAT ACT TAT CGT 1023
Tyr Ile Leu Leu Pro Val His Asp Pro Leu Arg Arg Asp Thr Tyr Arg
305 310 315

TAC GCT ATA AAG GCT GGC ACT GCC GGG CAA AGC TTG CTC ATT AGG CAA 1071
Tyr Ala Ile Lys Ala Gly Thr Ala Gly Gln Ser Leu Leu Ile Arg Gln
320 325 330

CGA TTT GAT TAC AAT GAA TTT AAT TTT GGG GGA GCG TTT TAT AAA GTA 1119
Arg Phe Asp Tyr Asn Glu Phe Asn Phe Gly Gly Ala Phe Tyr Lys Val
335 340 345

TGG AAA AAC GCA AAC GCT TAC ATC GGC ACG ACA GGA AAC CCT TTA GGC 1167
Trp Lys Asn Ala Asn Ala Tyr Ile Gly Thr Thr Gly Asn Pro Leu Gly
350 355 360 365

ATT GAT TTT TGG ACC AAT AGC GTT TAT GAT ATA GGG CAA GCT TTA AGC 1215
Ile Asp Phe Trp Thr Asn Ser Val Tyr Asp Ile Gly Gln Ala Leu Ser
370 375 380

CAT GTG GTA ACC GCT GAT GCC GTC TCT GGT TGG GTT TTT GGT GGG GGC 1263
His Val Val Thr Ala Asp Ala Val Ser Gly Trp Val Phe Gly Gly Gly
385 390 395

GTG CAT AAA AAG TGG CTG TGG GGG ACT TTA TGG CGT TGG ACT AGC GGC 1311
Val His Lys Lys Trp Leu Trp Gly Thr Leu Trp Arg Trp Thr Ser Gly
400 405 410

ACT TTA GCC AAT GAA GCG AGT GCG GCT GTT AAT GTG GGC TAT AAG ATC 1359
Thr Leu Ala Asn Glu Ala Ser Ala Ala Val Asn Val Gly Tyr Lys Ile
415 420 425

AGT AAG AGT TTG ACA GCG AGC GTG AAA TTA GAA TAT TTG GGC GTG ATG 1407
Ser Lys Ser Leu Thr Ala Ser Val Lys Leu Glu Tyr Leu Gly Val Met
430 435 440 445

ACG CAT GCA GGC TTT ACG GTA GGG AGT TAC AGG CCC ACG CCC GGC TCT 1455
Thr His Ala Gly Phe Thr Val Gly Ser Tyr Arg Pro Thr Pro Gly Ser
450 455 460
Gly Leu Tyr Val Leu Ser Leu Ser Ala Ser Leu Gln Ala Phe Asp Tyr
15 20 25

AAA ATT GAA GTT TCA GCG GAG TCC TTT TCT AAA GTT GGC TTT AAT AAA 207
Lys Ile Glu Val Ser Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Lys
30 35 40 45

AAA AAG ATT GAT ATA GCT AGG GGG ATT TAT CCT ACA GAG ACT TTT GTA 255
Lys Lys Ile Asp Ile Ala Arg Gly Ile Tyr Pro Thr Glu Thr Phe Val
50 55 60

ACC GCT GTA GGG CAG GGC AAT ATC TAT GCG GAT TTT TTA CCC AAA GGC 303
Thr Ala Val Gly Gln Gly Asn Ile Tyr Ala Asp Phe Leu Pro Lys Gly
65 70 75

CTT AAA GAT CAA GGG CAT GTT TTA GAG GGA AAA ATC GGT GGC ACG CTA 351
Leu Lys Asp Gln Gly His Val Leu Glu Gly Lys Ile Gly Gly Thr Leu
80 85 90

GGA GGG GTC GCT TAT GAT AGC ACG AAA TTC AAT CAA GGC GGA TCG GTT 399
Gly Gly Val Ala Tyr Asp Ser Thr Lys Phe Asn Gln Gly Gly Ser Val
95 100 105

ATT TAT AAC TAC ATC GGT TAT TGG GAT GGC TAT TTA GGG GGT AAA AGA 447
Ile Tyr Asn Tyr Ile Gly Tyr Trp Asp Gly Tyr Leu Gly Gly Lys Arg
110 115 120 125

GCC TTG CTT GAT GGC ACG AGT ATC CAT GAG TGC GCG CTT GGA TCT GAT 495
Ala Leu Leu Asp Gly Thr Ser Ile His Glu Cys Ala Leu Gly Ser Asp
130 135 140

GGC AAG GTG ATT GAT TCT ATA GCG TGC GGG AAC GCT AGG GCC AAT AAA 543
Gly Lys Val Ile Asp Ser Ile Ala Cys Gly Asn Ala Arg Ala Asn Lys
145 150 155

ATC CGC CGT AAT TAC TTG ATG AAT AAC GCT TTT TTA GAA TAC CGC TAT 591
Ile Arg Arg Asn Tyr Leu Met Asn Asn Ala Phe Leu Glu Tyr Arg Tyr
160 165 170

AAA GAT ATT TTT TTA GCT AAG GGA GGG CGT TAT CAA TCC AAT GCT CCT 639
Lys Asp Ile Phe Leu Ala Lys Gly Gly Arg Tyr Gln Ser Asn Ala Pro
175 180 185

TAT ATG AGC GGT TAC ACG CAA GGC TTT GAA ATC AGC GCT AAA GTC AAG 687
Tyr Met Ser Gly Tyr Thr Gln Gly Phe Glu Ile Ser Ala Lys Val Lys
190 195 200 205

GAT AAA AAT GAA GGA ATC CAC AAA TTA TGG TGG TTT AGC TCA TGG GGT 735
Asp Lys Asn Glu Gly Ile His Lys Leu Trp Trp Phe Ser Ser Trp Gly
210 215 220

AGG GCG TTC GCT TAT GGG GAG TGG ATT TAT GAT TTT TAT TCT CCA AGA 783
Arg Ala Phe Ala Tyr Gly Glu Trp Ile Tyr Asp Phe Tyr Ser Pro Arg
225 230 235
Ser Tyr Asp Pro Val Ser Lys Arg Leu Tyr Val Gly Gly Glu Val Phe
195 200 205

Val Leu Gly Ala Glu Tyr Arg His Glu Asn Leu Lys Val Val Pro Phe
210 215 220

Ile Leu Thr Asp Thr Arg Leu Pro Leu Ser Thr Gln Asn Val Leu Val
225 230 235 240

Gln Val Gly Gly Lys Leu Glu Tyr Asp Ala Ser Leu Ala Lys Gly Phe
245 250 255

Thr Ser His Thr Leu Val His Gly Met Tyr Gln Tyr Gly Asn Thr Asp
260 265 270

Ala Ala Thr Ser Val Lys Asn Ala Gly Leu Phe Leu Ile Asp Gln Thr
275 280 285

Phe Lys Tyr Lys Ile Phe Asn Phe Gly Thr Gly Phe Tyr Ile Val Pro
290 295 300

Ala Arg Asn Asn Lys Gly Tyr Leu Trp Thr Phe Asn Asp Arg Thr Lys
305 310 315 320

Phe Tyr Gly Arg Gly Ile Asn Ala Pro Gly Val Pro Ala Ile Tyr Phe
325 330 335

Ala Asn Ser Ser Ile Ser Gly Tyr Val Phe Leu Gly Leu Lys Thr Lys
340 345 350

Arg Val Arg Leu Asp Ala Met Val Ala Phe Gly Asp Tyr Gln Glu Tyr
355 360 365

Ser Leu Met Ser Ser Phe Arg Val Trp Thr Tyr Arg Ser Leu Ser Phe
370 375 380

Asp Met Gly Gly Gly Tyr Val Tyr Ala Tyr Asn Ser Lys Ala Thr Arg
385 390 395 400

Lys Ser Leu Gly Asn Ser Ser Phe Val Phe Phe Gly Lys Phe Leu Phe
405 410 415



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1562 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 73...1509
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TTATGCTTCT TTGTTTTTAG ATCAGTTAAG AATTGTAGTC TTTAAGATGT ATTGGCTATT 60 

AAAAGGAAAA AA ATG AAA AAT AGC ACG CCT TTA AAG AAT CAA GTT TTT TGT 111
Met Lys Asn Ser Thr Pro Leu Lys Asn Gln Val Phe Cys
1 5 10

GGG TTA TAT GTT TTA AGT TTG AGC GCT TCT TTG CAA GCG TTT GAT TAT 159
GTG CGT TTA GAC GCG ATG GTG GCT TTT GGG GAT TAC CAA GAA TAT TCT 1164
Val Arg Leu Asp Ala Met Val Ala Phe Gly Asp Tyr Gln Glu Tyr Ser
355 360 365

TTA ATG AGC AGT TTT AGG GTT TGG ACT TAT AGG AGT TTG TCT TTT GAT 1212
Leu Met Ser Ser Phe Arg Val Trp Thr Tyr Arg Ser Leu Ser Phe Asp
370 375 380 385

ATG GGT GGG GGG TAT GTG TAT GCT TAC AAT TCT AAA GCC ACG AGA AAA 1260
Met Gly Gly Gly Tyr Val Tyr Ala Tyr Asn Ser Lys Ala Thr Arg Lys
390 395 400

AGT CTT GGA AAT AGT TCT TTT GTC TTT TTT GGG AAG TTT TTG TTT TAAAA 1310 
Ser Leu Gly Asn Ser Ser Phe Val Phe Phe Gly Lys Phe Leu Phe 
405 410 415 

AATACCATTT CTACAATCAA TAGTGAAGAG TTTGCAATAA AGTAAGC 1357 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Gln Tyr Lys Lys Asn Lys Lys Arg Tyr Tyr Tyr Leu Ala Leu Gly
1 5 10 15

Ile Phe Phe Leu Asn Gly Leu Ser Leu Lys Ala Leu Glu Ile Ala Val
20 25 30

Lys Pro Phe Gly Tyr Leu Gly Leu Leu Tyr Asn Gln Gly Ala Gln Lys
35 40 45

Asn Pro His Ser Tyr Val Gly Ala Leu Ala Arg Leu Gly Val Asp Phe
50 55 60

Ser Tyr Ser Asn Gly Trp Ser Phe Gly Ile Gly Ala Ile Gly Ala Trp
65 70 75 80

Asn Ile Tyr Asn Lys Gln Arg Leu Ala Asn Leu Tyr Ile Ser Leu Gly
85 90 95

Asn Phe Phe Gly Ser Ser Lys Asn Val Lys Pro Tyr Leu Ser Ala Gly
100 105 110

Asp Val Ser Asp Ala Tyr Val Gln Tyr Thr Asn Gln Arg Phe Lys Ile
115 120 125

Ala Leu Gly Arg Phe Asn Thr Asp Phe Val Asp Phe Asp Trp Ile Gly
130 135 140

Gly Asn Ile Gln Gly Val Ser Val Ala Phe Lys Gln Asn Ser Met Arg
145 150 155 160

Tyr Phe Gly Ile Phe Met Asp Ser Met Leu Tyr Asn Gly His Gln Ile
165 170 175

Asn Lys Glu Gln Gly Asn Arg Ile Ala Thr Ser Leu Asn Ala Leu Ala
180 185 190
Leu Gly Arg Phe Asn Thr Asp Phe Val Asp Phe Asp Trp Ile Gly Gly
130 135 140 145

AAT ATT CAA GGG GTT TCT GTA GCT TTT AAG CAA AAT TCC ATG CGT TAT 540
Asn Ile Gln Gly Val Ser Val Ala Phe Lys Gln Asn Ser Met Arg Tyr
150 155 160

TTT GGG ATT TTT ATG GAT AGC ATG CTT TAT AAT GGG CAT CAA ATC AAC 588
Phe Gly Ile Phe Met Asp Ser Met Leu Tyr Asn Gly His Gln Ile Asn
165 170 175

AAA GAG CAA GGG AAT CGG ATC GCT ACT TCC CTA AAC GCT CTA GCG TCT 636
Lys Glu Gln Gly Asn Arg Ile Ala Thr Ser Leu Asn Ala Leu Ala Ser
180 185 190

TAT GAC CCT GTG TCT AAA CGC TTG TAT GTG GGG GGG GAA GTG TTT GTT 684
Tyr Asp Pro Val Ser Lys Arg Leu Tyr Val Gly Gly Glu Val Phe Val
195 200 205

TTA GGT GCA GAA TAC AGG CAT GAA AAT CTT AAA GTG GTG CCT TTT ATT 732
Leu Gly Ala Glu Tyr Arg His Glu Asn Leu Lys Val Val Pro Phe Ile
210 215 220 225

TTA ACG GAC ACC CGC TTG CCT TTA TCC ACC CAA AAT GTT TTA GTG CAA 780
Leu Thr Asp Thr Arg Leu Pro Leu Ser Thr Gln Asn Val Leu Val Gln
230 235 240

GTG GGG GGT AAG TTG GAG TAT GAC GCT TCT TTA GCT AAG GGT TTC ACT 828
Val Gly Gly Lys Leu Glu Tyr Asp Ala Ser Leu Ala Lys Gly Phe Thr
245 250 255

TCG CAC ACT CTA GTG CAT GGC ATG TAT CAA TAC GGC AAC ACT GAT GCG 876
Ser His Thr Leu Val His Gly Met Tyr Gln Tyr Gly Asn Thr Asp Ala
260 265 270

GCT ACA AGC GTT AAA AAT GCC GGC TTG TTT TTG ATC GAT CAA ACT TTT 924
Ala Thr Ser Val Lys Asn Ala Gly Leu Phe Leu Ile Asp Gln Thr Phe
275 280 285

AAA TAC AAA ATT TTT AAT TTT GGA ACG GGT TTT TAT ATC GTT CCG GCA 972
Lys Tyr Lys Ile Phe Asn Phe Gly Thr Gly Phe Tyr Ile Val Pro Ala
290 295 300 305

AGA AAC AAT AAG GGC TAT CTA TGG ACT TTT AAT GAC AGG ACT AAA TTC 1020
Arg Asn Asn Lys Gly Tyr Leu Trp Thr Phe Asn Asp Arg Thr Lys Phe
310 315 320

TAT GGC CGT GGG ATC AAT GCG CCC GGC GTG CCA GCG ATT TAT TTT GCA 1068
Tyr Gly Arg Gly Ile Asn Ala Pro Gly Val Pro Ala Ile Tyr Phe Ala
325 330 335

AAC TCT AGC ATT TCA GGC TAT GTT TTT TTA GGG CTT AAG ACT AAA AGG 1116
Asn Ser Ser Ile Ser Gly Tyr Val Phe Leu Gly Leu Lys Thr Lys Arg
340 345 350
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 58... 1305
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CGCAATTAAA AGGAATTTTA ACTAAAATAT TGAGTTTAAA TCCACGATGA GTTTTTA ATG 60 

Met 

1 

CAA TAT AAG AAA AAT AAG AAA AGA TAT TAT TAT TTA GCG TTA GGG ATC 108
Gln Tyr Lys Lys Asn Lys Lys Arg Tyr Tyr Tyr Leu Ala Leu Gly Ile
5 10 15

TTT TTT TTA AAT GGT CTG TCT TTG AAA GCT TTA GAA ATC GCC GTC AAA 156
Phe Phe Leu Asn Gly Leu Ser Leu Lys Ala Leu Glu Ile Ala Val Lys
20 25 30

CCT TTT GGC TAT CTG GGG CTA TTA TAT AAT CAA GGG GCG CAA AAA AAC 204
Pro Phe Gly Tyr Leu Gly Leu Leu Tyr Asn Gln Gly Ala Gln Lys Asn
35 40 45

CCT CAC AGC TAT GTG GGG GCT TTA GCG CGT CTT GGG GTG GAT TTT TCT 252
Pro His Ser Tyr Val Gly Ala Leu Ala Arg Leu Gly Val Asp Phe Ser
50 55 60 65

TAT AGC AAC GGG TGG TCC TTT GGT ATT GGA GCG ATT GGG GCT TGG AAT 300
Tyr Ser Asn Gly Trp Ser Phe Gly Ile Gly Ala Ile Gly Ala Trp Asn
70 75 80

ATT TAT AAC AAA CAG CGT TTG GCT AAC CTT TAT ATC AGT CTA GGG AAT 348
Ile Tyr Asn Lys Gln Arg Leu Ala Asn Leu Tyr Ile Ser Leu Gly Asn
85 90 95

TTT TTT GGT AGT TCT AAA AAT GTT AAA CCT TAT TTG AGC GCT GGC GAT 396
Phe Phe Gly Ser Ser Lys Asn Val Lys Pro Tyr Leu Ser Ala Gly Asp
100 105 110

GTT TCT GAT GCG TAT GTT CAA TAC ACT AAC CAG CGT TTT AAA ATC GCT 444
Val Ser Asp Ala Tyr Val Gln Tyr Thr Asn Gln Arg Phe Lys Ile Ala
115 120 125

TTA GGG CGT TTC AAT ACC GAT TTT GTG GAT TTT GAT TGG ATA GGG GGC 492
Gln Tyr Ser Leu Tyr Val Gly Tyr Arg Tyr Asn Phe
260 265 270 

TTTTTCTCTA ATTGAATTTT CAATTAGAGT TTTC 910 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys Lys Phe Val Val Phe Lys Thr Leu Cys Leu Ser Val Val Leu
1 5 10 15

Gly Asn Ser Leu Val Ala Ala Glu Gly Ser Thr Glu Val Gln Lys Gln
20 25 30

Leu Glu Lys Pro Lys Glu Tyr Lys Ala Val Lys Gly Glu Lys Asn Ala
35 40 45

Trp Tyr Leu Gly Ile Ser Tyr Gln Val Gly Gln Ala Ser Gln Ser Val
50 55 60

Lys Asn Pro Pro Lys Ser Ser Glu Phe Asn Tyr Pro Lys Phe Pro Val
65 70 75 80

Gly Lys Thr Asp Tyr Leu Ala Val Met Gln Gly Leu Gly Leu Thr Val
85 90 95

Gly Tyr Lys Gln Phe Phe Gly Glu Lys Arg Trp Phe Gly Ala Arg Tyr
100 105 110

Tyr Gly Phe Met Asp Tyr Gly His Ala Val Phe Gly Ala Asn Ala Leu
115 120 125

Thr Ser Asp Asn Gly Gly Val Cys Glu Leu His Gln Pro Cys Ala Thr
130 135 140

Lys Val Gly Thr Met Gly Asn Leu Ser Asp Met Phe Thr Tyr Gly Val
145 150 155 160

Gly Ile Asp Thr Leu Tyr Asn Val Ile Asn Lys Glu Asp Ala Ser Phe
165 170 175

Gly Phe Phe Phe Gly Ala Gln Ile Ala Gly Asn Ser Trp Gly Asn Thr
180 185 190

Thr Gly Ala Phe Leu Glu Thr Lys Ser Pro Tyr Lys His Thr Ser Tyr
195 200 205

Ser Leu Asp Pro Ala Ile Phe Gln Phe Leu Phe Asn Leu Gly Ile Arg
210 215 220

Thr His Ile Gly Arg His Gln Glu Phe Asp Phe Gly Val Lys Ile Pro
225 230 235 240

Thr Ile Asn Val Tyr Tyr Phe Asn His Gly Asn Leu Ser Phe Thr Tyr
245 250 255

Arg Arg Gln Tyr Ser Leu Tyr Val Gly Tyr Arg Tyr Asn Phe
260 265 270

(2) INFORMATION FOR SEQ ID NO: 11: 
35 40 45 50

TTG GGG ATT AGC TAT CAA GTC GGT CAG GCT TCG CAA AGC GTT AAA AAC 248
Leu Gly Ile Ser Tyr Gln Val Gly Gln Ala Ser Gln Ser Val Lys Asn
55 60 65

CCC CCC AAA AGC AGT GAA TTT AAC TAC CCT AAG TTC CCT GTG GGT AAA 296
Pro Pro Lys Ser Ser Glu Phe Asn Tyr Pro Lys Phe Pro Val Gly Lys
70 75 80

ACC GAC TAT CTG GCC GTT ATG CAA GGC TTA GGG CTT ACT GTG GGT TAT 344
Thr Asp Tyr Leu Ala Val Met Gln Gly Leu Gly Leu Thr Val Gly Tyr
85 90 95

AAG CAG TTT TTC GGG GAA AAG AGA TGG TTT GGT GCA CGC TAT TAC GGC 392
Lys Gln Phe Phe Gly Glu Lys Arg Trp Phe Gly Ala Arg Tyr Tyr Gly
100 105 110

TTC ATG GAT TAT GGG CAT GCC GTA TTT GGA GCG AAC GCT TTA ACA TCG 440
Phe Met Asp Tyr Gly His Ala Val Phe Gly Ala Asn Ala Leu Thr Ser
115 120 125 130

GAT AAT GGT GGG GTG TGT GAG CTT CAC CAA CCA TGT GCG ACC AAA GTA 488
Asp Asn Gly Gly Val Cys Glu Leu His Gln Pro Cys Ala Thr Lys Val
135 140 145

GGG ACA ATG GGC AAT CTG TCT GAC ATG TTC ACT TAT GGT GTG GGT ATT 536
Gly Thr Met Gly Asn Leu Ser Asp Met Phe Thr Tyr Gly Val Gly Ile
150 155 160

GAC ACT TTA TAC AAT GTC ATC AAT AAA GAA GAT GCG AGT TTT GGT TTC 584
Asp Thr Leu Tyr Asn Val Ile Asn Lys Glu Asp Ala Ser Phe Gly Phe
165 170 175

TTT TTT GGG GCT CAA ATC GCG GGT AAC TCT TGG GGT AAT ACG ACA GGG 632
Phe Phe Gly Ala Gln Ile Ala Gly Asn Ser Trp Gly Asn Thr Thr Gly
180 185 190

GCC TTT TTG GAA ACT AAA AGC CCT TAT AAG CAC ACT TCC TAT AGC CTT 680
Ala Phe Leu Glu Thr Lys Ser Pro Tyr Lys His Thr Ser Tyr Ser Leu
195 200 205 210

GAT CCG GCG ATT TTC CAG TTC CTT TTT AAT TTA GGG ATC CGC ACC CAT 728
Asp Pro Ala Ile Phe Gln Phe Leu Phe Asn Leu Gly Ile Arg Thr His
215 220 225

ATT GGC CGG CAT CAA GAA TTT GAC TTT GGC GTG AAG ATT CCC ACT ATC 776
Ile Gly Arg His Gln Glu Phe Asp Phe Gly Val Lys Ile Pro Thr Ile
230 235 240

AAT GTT TAT TAT TTT AAC CAT GGG AAT TTG AGC TTC ACT TAC CGC CGT 824
Asn Val Tyr Tyr Phe Asn His Gly Asn Leu Ser Phe Thr Tyr Arg Arg
245 250 255

CAA TAC AGC CTT TAT GTG GGG TAT CGT TAC AAT TTC TGATTTAAAA CGCTTG 876
Met His Gln Asn Asn Lys Thr Phe Leu Pro Ser Gln Ser Ala His Leu

1 5 10 15

Ser Lys Ile Ile Leu Phe Leu Asn Thr Gly Phe Leu Ala Tyr Leu Leu
20 25 30

Ser Ala Cys Gly Ala Asn Val Pro Ile Glu Glu Val Leu Val Lys Asp
35 40 45

Pro Lys Glu Thr Lys Ala Gln Glu Val Ala Arg Glu Glu Lys Ala Ile
50 55 60

Gln Gln Glu Asn Ala Thr Ile Asp Ala Arg Thr Thr Pro Leu Ile Asn
65 70 75 80

Arg Phe Thr Asn Tyr Ser Ala Tyr Gly Ser Leu Asn Gly Phe Tyr Asn
85 90 95

Ser Val Asp Asn Leu Asn Ser Pro Met Gln Asn Gly Met Tyr Gly Gly
100 105 110

Tyr Tyr Met Pro Tyr Tyr Tyr Met Pro Tyr Gly Phe Met Pro Tyr Gly
115 120 125

Ser Gly Leu Met Pro Tyr Gly Pro Tyr Gly Tyr Gly Ala Pro Gly Tyr
130 135 140

Phe Pro Tyr Ala Phe Tyr 
145 150 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 860 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATTATGTTAT TATTATACGA AATGTAGACT TTTAGAAGGA AAAATGTTGT ATG AAA 56 

Met Lys 
1 

AAG TTT GTA GTG TTT AAA ACG CTC TGT TTA TCG GTA GTG TTA GGT AAT 104 
Lys Phe Val Val Phe Lys Thr Leu Cys Leu Ser Val Val Leu Gly Asn 
5 10 15 

AGT CTT GTG GCA GCA GAA GGC AGC ACA GAA GTG CAA AAG CAA TTG GAA 152 
Ser Leu Val Ala Ala Glu Gly Ser Thr Glu Val Gln Lys Gln Leu Glu
20 25 30

AAG CCA AAA GAG TAT AAA GCA GTG AAA GGC GAG AAA AAC GCT TGG TAT 200
Lys Pro Lys Glu Tyr Lys Ala Val Lys Gly Glu Lys Asn Ala Trp Tyr
CCACAAAAGC TTAATGAATA TTAAATTAAA AACATGTTAA TCTTTAGTTA TTTTTAAAAT 60
TTAGGAAATC CC ATG CAT CAA AAC AAT AAA ACT TTT TTA CCC AGC CAA TCC 111 
Met His Gln Asn Asn Lys Thr Phe Leu Pro Ser Gln Ser
1 5 10

GCT CAC CTC TCT AAA ATC ATT CTT TTT TTA AAC ACC GGC TTT TTA GCC 159
Ala His Leu Ser Lys Ile Ile Leu Phe Leu Asn Thr Gly Phe Leu Ala
15 20 25

TAT CTG TTA AGC GCT TGT GGG GCG AAT GTG CCT ATA GAA GAA GTG TTG 207
Tyr Leu Leu Ser Ala Cys Gly Ala Asn Val Pro Ile Glu Glu Val Leu
30 35 40 45

GTT AAA GAT CCT AAA GAG ACC AAA GCC CAA GAA GTC GCC AGA GAA GAA 255
Val Lys Asp Pro Lys Glu Thr Lys Ala Gln Glu Val Ala Arg Glu Glu
50 55 60

AAG GCT ATC CAG CAA GAA AAC GCC ACT ATT GAT GCG CGC ACC ACG CCT 303
Lys Ala Ile Gln Gln Glu Asn Ala Thr Ile Asp Ala Arg Thr Thr Pro
65 70 75

TTA ATC AAT CGT TTC ACT AAT TAT AGC GCT TAT GGC TCT TTA AAC GGC 351
Leu Ile Asn Arg Phe Thr Asn Tyr Ser Ala Tyr Gly Ser Leu Asn Gly
80 85 90

TTT TAC AAT TCA GTG GAT AAT CTC AAT TCG CCC ATG CAA AAC GGG ATG 399
Phe Tyr Asn Ser Val Asp Asn Leu Asn Ser Pro Met Gln Asn Gly Met
95 100 105

TAT GGA GGC TAT TAC ATG CCT TAT TAT TAC ATG CCC TAT GGT TTC ATG 447
Tyr Gly Gly Tyr Tyr Met Pro Tyr Tyr Tyr Met Pro Tyr Gly Phe Met
110 115 120 125

CCT TAT GGG TCA GGT CTT ATG CCT TAT GGG CCT TAT GGG TAT GGA GCG 495
Pro Tyr Gly Ser Gly Leu Met Pro Tyr Gly Pro Tyr Gly Tyr Gly Ala
130 135 140

CCT GGA TAC TTC CCT TAC GCT TTT TAT TGATTGAGTG GCTTTAGAAA GCGTGGT 549
Pro Gly Tyr Phe Pro Tyr Ala Phe Tyr 
145 150 

GGTGTTGGTG TTTTTACTCA AACACG 575 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
85 90 95

GCG GCC GGG TGT TTT GCA AGT GGG GGC GAT GTA TGC TAATGGGGTA GGGATC 396
Ala Ala Gly Cys Phe Ala Ser Gly Gly Asp Val Cys 
100 105 110 

CAAACCAACA GATTAAAAGC CGCTCGCTAT TATGAATG 434 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Leu Lys Lys Ser Leu Leu Leu Leu Val Phe Leu Val Leu Gln Leu
1 5 10 15

Ser Gly Ala Glu Glu Asn Asn Gln Ala Pro Lys Asn Thr Pro Pro Glu
20 25 30

Leu Asn Pro Ala Asn Ala Lys Gly Ala Pro Asn Ser Asn Thr Gln Ile
35 40 45

Thr Pro Lys Asn Asp Asn Ser Asn Leu Leu Asp Lys Leu Gly Ser Pro
50 55 60

Glu Asn Ala Gln Thr Glu Leu Ser Ala Gly Ile Asp Leu Ala Lys Lys
65 70 75 80

Gly Asp Tyr Gln Gly Ala Phe Lys Leu Phe Ser Gln Ser Cys Asp Asn
85 90 95

Gly Asn Ala Ala Gly Cys Phe Ala Ser Gly Gly Asp Val Cys 
100 105 110 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 73... 522 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
Met Lys Asn Ser Ser Phe Met Gly Tyr Asn Gly Leu Phe Asn Val Gly
130 135 140

Leu Gly Ser Thr Ile Asp Arg His His Arg Ile Glu Leu Gly Ala Lys
145 150 155 160

Ile Pro Phe Ser Lys Thr Arg Asn Ser Phe Lys Asn Pro Tyr Phe Leu
165 170 175

Glu Ser Val Phe Ile His Ala Thr Tyr Ser Tyr Met Phe
180 185



(2) INFORMATION FOR SEQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 380 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

AGCGTGAAAA AAATTGAGTT GAATCAAAAC CTGCATTAAG GATTAAAAGA ATG CTC 56
Met Leu
1

AAA AAA AGT TTG TTA TTG CTT GTT TTT TTA GTC TTA CAG CTT AGC GGC 104
Lys Lys Ser Leu Leu Leu Leu Val Phe Leu Val Leu Gln Leu Ser Gly
5 10 15

GCT GAA GAA AAC AAT CAA GCC CCA AAA AAC ACG CCC CCT GAA TTA AAC 152
Ala Glu Glu Asn Asn Gln Ala Pro Lys Asn Thr Pro Pro Glu Leu Asn
20 25 30

CCC GCT AAC GCT AAG GGC GCG CCA AAC TCT AAC ACC CAG ATC ACC CCT 200
Pro Ala Asn Ala Lys Gly Ala Pro Asn Ser Asn Thr Gln Ile Thr Pro
35 40 45 50

AAA AAC GAT AAC TCT AAC CTG TTA GAC AAA TTA GGT TCG CCT GAA AAC 248
Lys Asn Asp Asn Ser Asn Leu Leu Asp Lys Leu Gly Ser Pro Glu Asn
55 60 65

GCT CAA ACC GAG CTT TCT GCC GGT ATT GAT TTG GCT AAA AAG GGC GAT 296
Ala Gln Thr Glu Leu Ser Ala Gly Ile Asp Leu Ala Lys Lys Gly Asp
70 75 80

TAT CAA GGG GCT TTC AAG CTT TTT TCC CAA TCG TGC GAT AAT GGT AAT 344
Tyr Gln Gly Ala Phe Lys Leu Phe Ser Gln Ser Cys Asp Asn Gly Asn
TTA AGC GAT TTT TAC ACT TAC AAA AAA TAC AGC TTT GGG GTG TAT GGG 392

Leu Ser Asp Phe Tyr Thr Tyr Lys Lys Tyr Ser Phe Gly Val Tyr Gly 
100 105 110 

GGG CTT GGG ATA GGG TAT TTT TAT CAA AGC AAC CAT TTA GGC ATG AAA 440
Gly Leu Gly Ile Gly Tyr Phe Tyr Gln Ser Asn His Leu Gly Met Lys
115 120 125 130

AAT AGT TCG TTT ATG GGT TAT AAC GGC TTG TTT AAT GTG GGG CTT GGC 488
Asn Ser Ser Phe Met Gly Tyr Asn Gly Leu Phe Asn Val Gly Leu Gly
135 140 145

AGC ACG ATC GAT CGC CAC CAC CGC ATA GAG CTT GGG GCT AAA ATC CCT 536
Ser Thr Ile Asp Arg His His Arg Ile Glu Leu Gly Ala Lys Ile Pro
150 155 160

TTT TCA AAG ACT AGA AAT TCT TTT AAA AAT CCT TAT TTT TTA GAG AGC 584
Phe Ser Lys Thr Arg Asn Ser Phe Lys Asn Pro Tyr Phe Leu Glu Ser
165 170 175

GTT TTT ATC CAT GCG ACT TAT AGC TAT ATG TTT TAAGAGAGAA TAGCCTATTA 637
Val Phe Ile His Ala Thr Tyr Ser Tyr Met Phe
180 185

GTGGTCGTTA TCAATAAGAT AAGATCCTTA ATG 670

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Lys Lys Ile Ala Phe Ile Leu Ala Leu Trp Val Gly Leu Leu Gly
1 5 10 15

Ala Phe Glu Pro Lys Lys Ser His Ile Tyr Phe Gly Ala Met Val Gly
20 25 30

Leu Ala Pro Ile Lys Ile Thr Pro Lys Pro Ala Ser Asp Ser Ser Tyr
35 40 45

Thr Ala Phe Leu Trp Gly Ala Lys Gly Gly Tyr Gln Phe Ala Phe Phe
50 55 60

Lys Ala Leu Ala Leu Arg Gly Glu Phe Ser Tyr Leu Met Ala Ile Lys
65 70 75 80

Pro Thr Ala Leu His Thr Ile Asn Thr Ser Leu Leu Ser Leu Asn Ile
85 90 95

Asp Val Leu Ser Asp Phe Tyr Thr Tyr Lys Lys Tyr Ser Phe Gly Val
100 105 110

Tyr Gly Gly Leu Gly Ile Gly Tyr Phe Tyr Gln Ser Asn His Leu Gly
115 120 125
Met Glu Phe Leu Gly Leu Ile Leu Ser Leu Ala Ala Ile Leu Ile Ala

1 5 10 15

Phe Lys Lys Pro Glu Lys Glu Asn Trp Ala Phe Gly Ile Leu Met Val
20 25 30

Val Trp Leu Val Glu Leu Ile Ile Phe Ile Ala His Ser Ser Ser Val
35 40 45

Leu Pro Asn Met Asn Leu 
50 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51... 617 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CCCATAGACG ACAAAATCAA GCGGTTTTAT CAAAACCAAA AAACTTTAGA ATG AAA 56 

Met Lys 
1 

AAA ATT GCT TTC ATT TTG GCT TTA TGG GTG GGC TTG TTA GGG GCG TTT 104
Lys Ile Ala Phe Ile Leu Ala Leu Trp Val Gly Leu Leu Gly Ala Phe
5 10 15

GAG CCT AAA AAA AGT CAT ATT TAT TTT GGG GCT ATG GTG GGT TTA GCT 152
Glu Pro Lys Lys Ser His Ile Tyr Phe Gly Ala Met Val Gly Leu Ala
20 25 30

CCT ATT AAA ATA ACC CCA AAA CCG GCT AGT GAT TCT TCT TAT ACG GCT 200
Pro Ile Lys Ile Thr Pro Lys Pro Ala Ser Asp Ser Ser Tyr Thr Ala
35 40 45 50

TTT TTA TGG GGG GCT AAA GGA GGG TAT CAA TTC GCT TTT TTT AAA GCT 248
Phe Leu Trp Gly Ala Lys Gly Gly Tyr Gln Phe Ala Phe Phe Lys Ala
55 60 65

CTA GCG TTA AGG GGT GAA TTT TCC TAC CTT ATG GCA ATC AAA CCC ACC 296
Leu Ala Leu Arg Gly Glu Phe Ser Tyr Leu Met Ala Ile Lys Pro Thr
70 75 80

GCA CTG CAC ACG ATT AAC ACT TCT TTA TTG AGC TTA AAT ATT GAT GTG 344
Ala Leu His Thr Ile Asn Thr Ser Leu Leu Ser Leu Asn Ile Asp Val
85 90 95
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 51... 212 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TTTTTTAGTT TGTTTTTGAG TATAATCCTA CGAAAATTTT AAGGAACGGC ATG GAG 56
Met Glu
1

TTT TTG GGA CTG ATT TTA AGT CTG GCC GCT ATT TTG ATA GCG TTT AAA 104
Phe Leu Gly Leu Ile Leu Ser Leu Ala Ala Ile Leu Ile Ala Phe Lys
5 10 15

AAG CCT GAA AAA GAA AAT TGG GCG TTT GGG ATT TTG ATG GTG GTG TGG 152
Lys Pro Glu Lys Glu Asn Trp Ala Phe Gly Ile Leu Met Val Val Trp
20 25 30

TTA GTG GAG CTT ATT ATT TTT ATA GCC CAC AGC TCT AGC GTT TTG CCT 200
Leu Val Glu Leu Ile Ile Phe Ile Ala His Ser Ser Ser Val Leu Pro
35 40 45 50

AAC ATG AAT CTA TAAGGGGGAT GCATGGATAA AGAAACCCGA TTTTACAACC TTTTT 257
Asn Met Asn Leu

TCTTTGGC 265
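
As a consistency check on the listing, the coding sequence annotated at positions 51 to 212 of SEQ ID NO: 1 can be translated and compared with the 54-residue polypeptide of SEQ ID NO: 2. The following is only a minimal Python sketch of such a check: the names `CODON_TABLE`, `SEQ_ID_1`, and `translate_cds` are illustrative and not part of the filing, and the use of the standard genetic code is an assumption.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# SEQ ID NO: 1 as printed in the listing; whitespace is stripped below.
SEQ_ID_1 = "".join("""
TTTTTTAGTT TGTTTTTGAG TATAATCCTA CGAAAATTTT AAGGAACGGC ATG GAG
TTT TTG GGA CTG ATT TTA AGT CTG GCC GCT ATT TTG ATA GCG TTT AAA
AAG CCT GAA AAA GAA AAT TGG GCG TTT GGG ATT TTG ATG GTG GTG TGG
TTA GTG GAG CTT ATT ATT TTT ATA GCC CAC AGC TCT AGC GTT TTG CCT
AAC ATG AAT CTA TAAGGGGGAT GCATGGATAA AGAAACCCGA TTTTACAACC TTTTT
TCTTTGGC
""".split())


def translate_cds(seq, start, end):
    """Translate the 1-based, inclusive interval [start, end] of seq."""
    cds = seq[start - 1:end]
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


protein = translate_cds(SEQ_ID_1, 51, 212)
print(len(SEQ_ID_1), len(protein), protein)
```

Run on the sequence above, the sketch reports 265 bases and a 54-residue translation, in agreement with the LENGTH fields of SEQ ID NO: 1 and SEQ ID NO: 2; the codon immediately after position 212 (TAA) is a stop codon.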



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
SEQUENCE LISTING

(1) GENERAL INFORMATION 

(i) APPLICANT: MERIEUX ORAVAX SOCIETE EN NOM COLLECTIF PASTEUR 
MERIEUX SERUMS ET VACCINS S.A. HUMAN GENOME 
SCIENCES, INC. 

(ii) TITLE OF THE INVENTION: Identification of Polynucleotides 
Encoding Novel Helicobacter Polypeptides in the Helicobacter 
Genome 

(iii) NUMBER OF SEQUENCES: 1376 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Clark & Elbing LLP 

(B) STREET: 176 Federal Street 

(C) CITY: Boston 
(F) ZIP: 02110

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 
(B) FILING DATE: 01-APR-98

(C) CLASSIFICATION: 
(A) NAME: Clark, Paul T.

(B) REGISTRATION NUMBER: 30,162

(C) REFERENCE/DOCKET NUMBER: 06132/041WO1 

(ix) TELECOMMUNICATION INFORMATION: 
phosphate buffer, containing 500 mM NaCl (pH 7.5). The gel is then
transferred to a tube, mixed with 100 mM ethanolamine (pH 7.5) for 4 hours at 
room temperature, and washed twice with 2 column volumes of PBS. The gel 
is then stored in 1/10,000 PBS/merthiolate. The amount of IgG antibodies 
coupled to the gel is determined by measuring the optical density (OD) at 280 
nm of the IgG solution and the direct eluate, plus washings.
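
The coupling yield described above is a mass balance: the IgG bound to the gel is the IgG offered minus the IgG recovered in the direct eluate and washings. A minimal Python sketch follows. The absorbance coefficient of 1.4 for a 1 mg/ml IgG solution at 280 nm (1 cm path) is a commonly used approximation and an assumption here, and the function names and example readings are hypothetical.

```python
# Assumed A280 of a 1 mg/ml IgG solution in a 1 cm cuvette (approximation).
A280_PER_MG_ML = 1.4


def igg_mass_mg(a280, volume_ml, coefficient=A280_PER_MG_ML):
    """Estimate the IgG mass (mg) in a solution from its A280 and volume."""
    return a280 / coefficient * volume_ml


def coupled_igg_mg(a280_input, input_ml, a280_unbound, unbound_ml):
    """IgG coupled to the gel = IgG offered - IgG in eluate plus washings."""
    return igg_mass_mg(a280_input, input_ml) - igg_mass_mg(a280_unbound, unbound_ml)


# Hypothetical readings: 10 ml of IgG at A280 = 4.2 (about 3 mg/ml) offered,
# 20 ml of pooled eluate and washings at A280 = 0.14 recovered.
coupled = coupled_igg_mg(4.2, 10.0, 0.14, 20.0)
print(f"{coupled:.1f} mg IgG coupled")
```

Dividing the result by the hydrated gel volume gives the coupling density in mg IgG per ml of gel, which can be compared with the 5 to 10 mg/ml capacity quoted for the gel.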



4.C. Adsorption and elution of the antigen

An antigen solution in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, for
example, the supernatant or the solubilized pellet obtained using the methods
described in 3.E., after centrifugation and filtration through a 0.45 µm
membrane, is applied to a column equilibrated with 50 mM Tris-HCl (pH 8.0),
2 mM EDTA, at a flow rate of about 10 ml/hour. The column is then washed
with 20 volumes of 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Alternatively,
adsorption can be achieved by mixing overnight at 5±3°C.

The adsorbed gel is washed with 2 to 6 volumes of 10 mM sodium
phosphate buffer (pH 6.8) and the antigen is eluted with 100 mM glycine buffer
(pH 2.5). The eluate is recovered in 3 ml fractions, to each of which is added
150 µl of 1 M sodium phosphate buffer (pH 8.0). Absorbance is measured at
280 nm for each fraction; those fractions containing the antigen are pooled and
stored at -20°C.
EXAMPLE 4: Purification of the polypeptides of the invention by
immunoaffinity 

4.A. Purification of specific IgGs

An immune serum, prepared as described in section 2.G., is applied
to a protein A Sepharose Fast Flow column (Pharmacia) equilibrated in 100 
mM Tris-HCl (pH 8.0). The resin is washed by applying 10 column volumes 
of 100 mM Tris-HCl and 10 volumes of 10 mM Tris-HCl (pH 8.0) to the 
column. IgG antibodies are eluted with 0.1 M glycine buffer (pH 3.0) and are 
collected as 5 ml fractions to each of which is added 0.25 ml 1 M Tris-HCl 
(pH 8.0). The optical density of the eluate is measured at 280 nm and fractions 
containing the IgG antibodies are pooled, dialyzed against 50 mM Tris-HCl 
(pH 8.0), and, if necessary, stored frozen at -70°C. 

4.B. Preparation of the column

An appropriate amount of CNBr-activated Sepharose 4B gel (1 g of
dried gel provides for approximately 3.5 ml of hydrated gel; gel capacity is
from 5 to 10 mg coupled IgG/ml of gel) manufactured by Pharmacia
(17-0430-01) is suspended in 1 mM HCl buffer and washed on a Büchner
funnel by adding small quantities of 1 mM HCl buffer. The total volume of
buffer is 200 ml per gram of gel.
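
The figures in the preceding paragraph (3.5 ml of hydrated gel per gram of dried gel, 5 to 10 mg of IgG per ml of gel, 200 ml of 1 mM HCl per gram) can be combined to size a column for a given amount of antibody. The Python sketch below is illustrative only; the function name and the choice of the lower capacity bound as a conservative default are assumptions.

```python
HYDRATED_ML_PER_G = 3.5   # ml of hydrated gel per gram of dried gel
WASH_ML_PER_G = 200.0     # ml of 1 mM HCl used per gram of dried gel


def gel_requirements(igg_mg, capacity_mg_per_ml=5.0):
    """Return (dried gel in g, hydrated gel in ml, 1 mM HCl wash in ml)."""
    hydrated_ml = igg_mg / capacity_mg_per_ml
    dried_g = hydrated_ml / HYDRATED_ML_PER_G
    return dried_g, hydrated_ml, dried_g * WASH_ML_PER_G


# Example: coupling 35 mg of IgG at the conservative 5 mg/ml capacity.
dried_g, hydrated_ml, wash_ml = gel_requirements(35.0)
print(f"{dried_g:.1f} g dried gel, {hydrated_ml:.1f} ml hydrated, {wash_ml:.0f} ml wash")
```

Using the upper capacity bound of 10 mg/ml instead halves all three quantities.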

Purified IgG antibodies are dialyzed for 4 hours at 20±5°C against 
50 volumes of 500 mM sodium phosphate buffer (pH 7.5). The antibodies are 
then diluted in 500 mM phosphate buffer (pH 7.5) to a final concentration of 3 
mg/ml. 

IgG antibodies are mixed with the gel overnight at 5±3°C. The gel is
packed into a chromatography column and is washed with 2 column volumes of 
500 mM phosphate buffer (pH 7.5), and 1 column volume of 50 mM sodium 
The N-terminal primers are designed to include the ribosome binding
site of the target gene, the ATG start site, and any signal sequence and cleavage 
site. The N-terminal primers can include a 5' clamp and a restriction 
endonuclease recognition site, such as that for BamHI (GGATCC), which
facilitates subsequent cloning. Similarly, the C-terminal primers can include a
restriction endonuclease recognition site, such as that for XhoI (CTCGAG),
which can be used in subsequent cloning, and a TAA stop codon. 
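
The primer layout described above can be sketched in Python. This is an illustration under stated assumptions rather than the primers actually used: the 5' clamp `CGC` and the function names are hypothetical, and the annealing regions are simply taken from SEQ ID NO: 1 (positions 41 to 62 and 192 to 212) for the sake of the example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(upstream_anneal, downstream_anneal, clamp="CGC",
                   n_site="GGATCC", c_site="CTCGAG", stop="TAA"):
    """Build N- and C-terminal primers, both written 5' to 3'.

    upstream_anneal: sense strand covering the ribosome binding site and ATG.
    downstream_anneal: sense strand of the last codons, without a stop codon.
    """
    forward = clamp + n_site + upstream_anneal
    # The C-terminal primer anneals to the sense strand, so the gene end,
    # the added stop codon, and the restriction site are reverse complemented.
    reverse = clamp + reverse_complement(downstream_anneal + stop + c_site)
    return forward, reverse


forward, reverse = design_primers(
    "AAGGAACGGCATGGAGTTTTTG",   # SEQ ID NO: 1, positions 41-62
    "GTTTTGCCTAACATGAATCTA",    # SEQ ID NO: 1, positions 192-212
)
print(forward)
print(reverse)
```

Because CTCGAG is palindromic, the C-terminal primer begins with the clamp, the XhoI site, and TTA (the reverse complement of the TAA stop codon), followed by the reverse complement of the end of the gene.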

Amplification of genes encoding the polypeptides of the invention can 
be carried out using Thermalase DNA Polymerase under the conditions 
described above in Example 2. Alternatively, Vent DNA polymerase (New 
England Biolabs), Pwo DNA polymerase (Boehringer Mannheim), or Taq 
DNA polymerase (Appligene) can be used, according to instructions provided 
by the manufacturers. 

A single PCR product for each clone is amplified and cloned into
appropriately cleaved pET 24 (e.g., BamHI-XhoI cleaved pET 24), resulting in
the construction of a transcriptional fusion that permits expression of the 
proteins without His-tags. The expressed products can be purified as denatured 
proteins that are refolded by dialysis into 1 M arginine. 

Cloning into pET 24 allows transcription of the genes from the T7 
promoter, which is supplied by the vector, but relies upon binding of the RNA- 
specific DNA polymerase to the intrinsic ribosome binding sites of the genes, 
and thereby expression of the complete ORF. The amplification, digestion, and 
cloning protocols that can be used in this method are as described above for 
constructing translational fusions. 
same way. Fifteen days after the last injection, animal serum is recovered,
decomplemented, and filtered through a 0.45 µm membrane.

2.G.2. Mouse hyperimmune ascites fluid

Ten mice are injected subcutaneously with 10-50 µg of a purified fusion
polypeptide as obtained using the methods described in 2.E.1. or 2.E.2., in the
presence of Freund's complete adjuvant and in a volume of approximately 200
µl. Seven and 14 days after the initial injection, booster doses, which are
identical to the priming doses, except that Freund's incomplete adjuvant is used,
are administered in the same way. Twenty one and 28 days after the initial
injection, mice receive 50 µg of the antigen alone intraperitoneally. On day 21,
mice are also injected intraperitoneally with sarcoma 180/TG cells CM26684
(Lennette et al., Diagnostic Procedures for Viral, Rickettsial, and Chlamydial
Infections, 5th Ed. Washington DC, American Public Health Association,
1979). Ascites fluid is collected 10-13 days after the last injection.

EXAMPLE 3: Methods for producing transcriptional fusions lacking His- 
tags 

Methods for amplification and cloning of DNA encoding the
polypeptides of the invention as transcriptional fusions lacking His-tags are
described as follows. Two PCR primers for each clone are designed based
upon the sequences of the polynucleotides that encode them (see the attached
sequence listing, odd numbers, up to SEQ ID NO: 1363). These primers can be
used to amplify DNA encoding the polypeptides of the invention from any H.
pylori strain, including, for example, ORV2001 and the strain deposited as
ATCC deposit number 43579, as well as from other Helicobacter species.
β-Mercaptoethanol is added to the eluted protein to a final concentration of 1
mM, and then the eluted protein is passed through a Sephadex G-25 column 
equilibrated in 0.1 M acetic acid. Protein eluted from the column is slowly 
added to 4 volumes of 50 mM phosphate buffer (pH 7.0), and the protein 
remains in solution. 

2.F. Evaluation of the protective activity of the purified protein

Groups of 10 OF1 mice (IFFA Credo) are immunized rectally with 25
µg of the purified recombinant protein, admixed with 1 µg of cholera toxin
(Berna) in physiological buffer. Mice are immunized on days 0, 7, 14, and 21.
Fourteen days after the last immunization, the mice are challenged with H.
pylori strain ORV2001, grown in liquid media (the cells are grown on agar
plates, as described in 2.A., and, after harvest, are resuspended in Brucella
broth; the flasks are then incubated overnight at 37°C). Fourteen days after
challenge, the mice are sacrificed and their stomachs are removed. The amount
of H. pylori is determined by measuring the urease activity in the stomach and
by culture.

2.G. Production of monospecific polyclonal antibodies
2.G.1. Hyperimmune rabbit antiserum
New Zealand rabbits are injected both subcutaneously and
intramuscularly with 100 µg of a purified fusion polypeptide, as obtained using
the methods described in 2.E.1. or 2.E.2., in the presence of Freund's complete
adjuvant and in a total volume of approximately 2 ml. Twenty one and 42 days
after the initial injection, booster doses, which are identical to the priming
doses, except that Freund's incomplete adjuvant is used, are administered in the
The protein fractions eluted from the IMAC column with buffer D
(buffer C containing 500 mM imidazole) are pooled. Arginine is added to the 
solution to a final concentration of 0.5 M, and the mixture is dialyzed against 
PBS containing 0.5 M arginine and various concentrations of urea (4 M, 3 M, 2 
M, 1 M, and 0.5 M) to progressively decrease the concentration of urea. The 
final dialysate is filtered through a 0.22 µm membrane and stored at -45°C.

Alternatively, when the above-described purification process is not
sufficiently efficient, two other processes can be used and are described as
follows. A first alternative involves the use of a mild denaturant, N-octyl
glucoside (NOG). Briefly, a pellet obtained as is described in 2.E. is
homogenized in a solution of 5 mM imidazole, 500 mM sodium chloride, and
20 mM Tris-HCl (pH 7.9) by microfluidization at a pressure of 15,000 psi, and
is clarified by centrifugation at 4,000-5,000 x g. The pellet is recovered,
resuspended in 50 mM NaPO4 (pH 7.5) containing 1-2% weight/volume NOG,
and homogenized. The NOG-soluble impurities are removed by centrifugation.
The pellet is extracted once more by repeating the preceding extraction step.
The pellet is dissolved in 8 M urea, 50 mM Tris (pH 8.0). The urea-solubilized
protein is diluted with an equal volume of 2 M arginine, 50 mM Tris (pH 8.0),
and is dialyzed against 1 M arginine for 24-48 hours to remove the urea. The
final dialysate is filtered through a 0.22 µm membrane and stored at -45°C.

A second alternative involves the use of a strong denaturant, such as
guanidine hydrochloride. Briefly, a pellet obtained as is described in 2.E. is
homogenized in a solution of 5 mM imidazole, 500 mM sodium chloride, and
20 mM Tris-HCl (pH 7.9) by microfluidization at a pressure of 15,000 psi, and
is clarified by centrifugation at 4,000-5,000 x g. The pellet is recovered,
resuspended in 6 M guanidine hydrochloride, and passed through an IMAC
column charged with Ni2+. The bound antigen is eluted with 8 M urea (pH 8.5).
location of the target protein in the soluble or insoluble fractions, as is
described further below. 

2.E.1. Soluble fraction

If the target protein is produced in a soluble form (i.e., in the supernatant
obtained using the methods described in 2.E.), NaCl and imidazole are added to
the supernatant to final concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M
NaCl, and 10 mM imidazole (buffer A). The mixture is filtered through a 0.45
µm membrane and loaded onto an IMAC column (Pharmacia HiTrap chelating
Sepharose; 1 ml), which has been charged with nickel ions according to the
manufacturer's recommendations. After loading, the column is washed with 50
column volumes of buffer A and the recombinant protein is eluted with 5 ml of
buffer B (50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 500 mM imidazole).
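
Buffers A and B differ only in their imidazole concentration, so both can be made up from the same concentrated stocks using C1 x V1 = C2 x V2. The Python sketch below illustrates the arithmetic; the stock concentrations (1 M Tris-HCl, 5 M NaCl, 1 M imidazole) and all names are assumptions, since the text specifies only the final concentrations.

```python
def stock_volumes(final_mM, stock_mM, total_ml):
    """Return ml of each stock, plus water, needed for total_ml of buffer."""
    volumes = {name: final_mM[name] * total_ml / stock_mM[name] for name in final_mM}
    volumes["water"] = total_ml - sum(volumes.values())
    return volumes


# Hypothetical stock concentrations (mM); not stated in the text.
STOCKS = {"Tris-HCl pH 8.0": 1000, "NaCl": 5000, "imidazole": 1000}

# Final concentrations (mM) taken from the text.
BUFFER_A = {"Tris-HCl pH 8.0": 50, "NaCl": 500, "imidazole": 10}
BUFFER_B = {"Tris-HCl pH 8.0": 50, "NaCl": 500, "imidazole": 500}

buffer_a = stock_volumes(BUFFER_A, STOCKS, 1000)
buffer_b = stock_volumes(BUFFER_B, STOCKS, 1000)
for name, recipe in (("A", buffer_a), ("B", buffer_b)):
    print(name, {k: round(v, 1) for k, v in recipe.items()})
```

With these assumed stocks, one litre of buffer A takes 50 ml of Tris-HCl, 100 ml of NaCl, 10 ml of imidazole, and 840 ml of water; buffer B takes 500 ml of imidazole stock and correspondingly less water.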

The elution profile is monitored by measuring the absorbance of the 
fractions at 280 nm. Fractions corresponding to the protein peak are pooled, 
dialyzed against PBS containing 0.5 M arginine, filtered through a 0.22 µm
membrane, and stored at -45°C. 

2.E.2. Insoluble fraction

If the target protein is expressed in the insoluble fraction (pellets
obtained using the methods described in 2.E.), purification is conducted under
denaturing conditions. NaCl, imidazole, and urea are added to the resuspended
pellet to final concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 10 mM
imidazole, and 6 M urea (buffer C). After complete solubilization, the mixture
is filtered through a 0.45 µm membrane and loaded onto an IMAC column.

The purification procedures on the IMAC column are the same as are
described in 2.E.1., except that 6 M urea is included in all of the buffers used
and 10 column volumes of buffer C are used to wash the column after protein
loading, instead of 50 column volumes.
absorbance at 600 nm (OD600) reaches 0.4-1.0. The culture is stopped from
growing by placing the flask at 4°C overnight. The following day, 10 ml of the
overnight culture is used to inoculate 240 ml LB medium containing
kanamycin (25 µg/ml), with the initial OD600 being about 0.02-0.04. Four
flasks are inoculated for each ORF. The cells are grown to an OD600 of 1.0
(about 2 hours at 37°C), a 1 ml sample is harvested by centrifugation, and the
sample is analyzed by SDS-PAGE to detect any leaky expression. The
remaining culture is induced with 1 mM IPTG and the induced cultures are
grown for an additional 2 hours at 37°C.

The final OD500 reading is taken and the cells are harvested by 
10 centrifiigation at 5,000 x g for 15 minutes at 4°C. The supernatant is discarded 

and the pellets are resuspended in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. 
Two hundred and fifty ml of buffer are used for each 1 L of culture and the 
cells are recovered by centrifiigation at 12,000 x g for 20 minutes. The 
supernatant is discarded and the pellets are stored at -45°C. 
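
The inoculation and resuspension volumes above can be checked with a short calculation. The Python sketch below assumes that OD600 scales linearly with dilution (a simplification) and that each of the four flasks holds 10 ml of inoculum plus 240 ml of medium; the function names are introduced here for illustration only.

```python
def diluted_od(inoculum_od, inoculum_ml, medium_ml):
    """OD600 immediately after mixing, assuming linear scaling with dilution."""
    return inoculum_od * inoculum_ml / (inoculum_ml + medium_ml)


def resuspension_buffer_ml(culture_ml, buffer_ml_per_l=250.0):
    """Buffer volume at 250 ml of buffer per 1 L of culture."""
    return culture_ml * buffer_ml_per_l / 1000.0


# Expected starting OD600 for a few overnight readings.
for overnight_od in (0.4, 0.5, 1.0):
    start_od = diluted_od(overnight_od, 10, 240)
    print(f"overnight OD600 {overnight_od}: starting OD600 {start_od:.3f}")

# Four flasks of 250 ml each are pooled for one ORF.
total_culture_ml = 4 * (10 + 240)
print(f"resuspension buffer: {resuspension_buffer_ml(total_culture_ml):.0f} ml")
```

Under these assumptions, a 25-fold dilution of an overnight culture at OD600 0.5-1.0 gives the stated starting range of about 0.02-0.04.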

2.E. Protein purification

Pellets obtained using the methods described in 2.D. are thawed and
resuspended in 95 ml of 50 mM Tris-HCl (pH 8.0). Pefabloc and lysozyme are
added to final concentrations of 100 µM and 100 µg/ml, respectively. The
mixture is homogenized with magnetic stirring at 5°C for 30 minutes.
Benzonase (Merck) is added to a final concentration of 1 U/ml, in the presence
of 10 mM MgCl2, to ensure total digestion of the DNA. The suspension is
sonicated (Branson Sonifier 450) for 3 cycles of 2 minutes each at maximum
output. The homogenate is centrifuged at 19,000 x g for 15 minutes and both
the supernatant and the pellet are analyzed by SDS-PAGE to detect the cellular
presence of histidine residues at the N-terminus of the fusion protein, which are
encoded by the vector. 

The ligation reaction (20 µl) is carried out at 14°C overnight and then is
used to transform 100 µl fresh E. coli XL1-blue competent cells (Novagen).
The cells are incubated on ice for 2 hours, heat-shocked at 42°C for
30 seconds, and returned to ice for 90 seconds. The samples are then added to
1 ml LB broth in the absence of selection and grown at 37°C for 2 hours. The
cells are plated out on LB agar containing kanamycin (50 µg/ml) at a 10x and
neat dilution and incubated overnight at 37°C. The following day, 50 colonies
are picked, plated onto secondary plates, and incubated at 37°C overnight.

Five colonies are picked and grown overnight at 37°C in 3 ml LB broth
supplemented with kanamycin (100 µg/ml). Plasmid DNA is
extracted using the Qiagen mini-prep method and is quantitated by agarose gel
electrophoresis.

PCR is performed with the gene-specific primers under the conditions
set forth above and transformant DNA is confirmed to contain the desired
insert. If PCR-positive, one of the five plasmid DNA samples (500 ng)
extracted from the E. coli XL1-blue cells is used to transform competent BL21
(λDE3) E. coli cells (Novagen; as described previously).
Transformants (10) are picked, plated onto selective LB agar plates
containing kanamycin (50 µg/ml), and stored as a research stock in LB containing
50% glycerol.

2.D. Purification of recombinant proteins

One ml of frozen glycerol stock prepared as described in 2.C. is used to
inoculate 50 ml of LB medium containing 25 µg/ml kanamycin in a 250 ml
Erlenmeyer flask. The flask is incubated at 37°C for 2 hours or until the
dNTPs mix                     200 µM
10x ThermoPol buffer          10 µl
primers                       300 nM each
DNA template                  50 ng
Heat-stable DNA polymerase    2 units
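
For illustration, the reaction mixture above can be converted into pipetting volumes for a 100 µl reaction. The Python sketch below is a minimal example; the stock concentrations (10 mM dNTPs, 10 µM primers, 50 ng/µl template, 2 U/µl polymerase) are assumptions chosen for the example and are not given in the text.

```python
FINAL_VOLUME_UL = 100.0


def from_concentration(final_conc, stock_conc, final_volume_ul=FINAL_VOLUME_UL):
    """Stock volume needed to reach final_conc (same units as stock_conc)."""
    return final_conc * final_volume_ul / stock_conc


def from_amount(amount, stock_per_ul):
    """Stock volume that delivers a fixed amount (ng or units)."""
    return amount / stock_per_ul


mix_ul = {
    "dNTPs mix": from_concentration(200.0, 10_000.0),       # µM; assumed 10 mM stock
    "10x ThermoPol buffer": 10.0,                           # stated directly
    "forward primer": from_concentration(300.0, 10_000.0),  # nM; assumed 10 µM stock
    "reverse primer": from_concentration(300.0, 10_000.0),  # nM; assumed 10 µM stock
    "DNA template": from_amount(50.0, 50.0),                # ng; assumed 50 ng/µl
    "DNA polymerase": from_amount(2.0, 2.0),                # units; assumed 2 U/µl
}
# Distilled water brings the reaction to its final volume.
mix_ul["distilled water"] = FINAL_VOLUME_UL - sum(mix_ul.values())

for component, volume in mix_ul.items():
    print(f"{component:22s} {volume:5.1f} µl")
```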



Appropriate amplification reaction conditions can readily be determined by one 
skilled in the art. For example, the following conditions can be used for 
amplification of DNA encoding GHPO 615 using the primers set forth above: 
initial denaturation at 94°C for 5 minutes, 25 cycles of denaturation at 97°C for
30 seconds, hybridization at 55°C for 1 minute, and elongation at 72°C for 2
minutes, using Vent DNA polymerase. In the case of amplifying DNA
encoding GHPO 1282 with the primers set forth above, the following
conditions can be used: initial denaturation at 94°C for 5 minutes, 25 cycles of
denaturation at 94°C for 30 seconds, hybridization at 45°C for 30 seconds, and
elongation at 72°C for 30 seconds, followed by a final elongation at 72°C for 7
minutes, using Vent DNA polymerase. The following conditions can be used
for amplification of DNA encoding GHPO 840 using the primers set forth
above: 25 cycles of denaturation at 97°C for 30 seconds, hybridization at 55°C
for 1 minute, and elongation at 72°C for 2 minutes using Vent DNA 
polymerase. Table 1 sets forth conditions for using the primers listed therein. 
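
The cycling conditions given above for GHPO 615 and GHPO 1282 can be written down as data, which makes it easy to compare programs or to estimate run time. The following Python sketch is one possible representation; the class and function names are hypothetical, and the totals cover hold times only (temperature ramping is not included).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    seconds: int


@dataclass(frozen=True)
class Program:
    initial: tuple
    cycle: tuple
    cycles: int
    final: tuple = ()


def hold_seconds(program):
    """Total programmed hold time, ignoring ramping between temperatures."""
    per_cycle = sum(step.seconds for step in program.cycle)
    fixed = sum(step.seconds for step in program.initial + program.final)
    return fixed + program.cycles * per_cycle


GHPO_615 = Program(
    initial=(Step("initial denaturation", 94, 300),),
    cycle=(
        Step("denaturation", 97, 30),
        Step("hybridization", 55, 60),
        Step("elongation", 72, 120),
    ),
    cycles=25,
)

GHPO_1282 = Program(
    initial=(Step("initial denaturation", 94, 300),),
    cycle=(
        Step("denaturation", 94, 30),
        Step("hybridization", 45, 30),
        Step("elongation", 72, 30),
    ),
    cycles=25,
    final=(Step("final elongation", 72, 420),),
)

for label, program in (("GHPO 615", GHPO_615), ("GHPO 1282", GHPO_1282)):
    print(f"{label}: {hold_seconds(program) / 60:.1f} minutes of hold time")
```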

2.C. Transformation and selection of transformants 

A single PCR product is thus amplified and then is digested at 37°C for
2 hours with BamHI and XhoI together in a 20 µl reaction volume. The
digested product is ligated to similarly cleaved pET28a (Novagen) that is 
dephosphorylated prior to the ligation by treatment with Calf Intestinal 
Alkaline Phosphatase (CIP). The gene fusion constructed in this manner allows 
one-step affinity purification of the resulting fusion protein because of the 
NO: 1367), and
5'-CCGCTCGAGCTAAAAGTTTTGCAAAATCAC-3' (SEQ ID NO: 1368).

GHPO 961: 5'-CGCGGATCCGATTTTACTTGAAAAATTTAAAC-3' (SEQ ID NO: 1369), and
5'-CCGCTCGAGTTAGAAAGTGTAGTTCAAATAC-3' (SEQ ID NO: 1370).

GHPO 1282: 5'-GCGGATCCTTTTCTTCAATGTTTG-3' (SEQ ID NO: 1371), and
5'-CCGCTCGAGTCAAAGTTTTAAACAAATTC-3' (SEQ ID NO: 1372).

GHPO 296: 5'-CCGAATTCGGTTATAAAGCCCCT-3' (SEQ ID NO: 1373), and
5'-CCGCTCGAGTTAAGGCTGATTTAA-3' (SEQ ID NO: 1374).

GHPO 840: 5'-CGCGGATCCGAGGAAATAGCATGTTAATAACC-3' (SEQ ID NO: 1375), and
5'-CCGCTCGAGTCACTGCTTGCATGACTTATTCCA-3' (SEQ ID NO: 1376).

The N-terminal and C-terminal primers for each clone can each include 
a 5' clamp and a restriction enzyme recognition sequence for cloning purposes 
(for example, BamHI (GGATCC) and XhoI (CTCGAG) recognition
sequences). 
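
The primer layout described above (a short 5' clamp followed by a restriction enzyme recognition sequence) can be checked programmatically. The Python sketch below scans a primer for the BamHI and XhoI sites named in the text; the EcoRI site (GAATTC) is added here because some of the listed N-terminal primers contain it, and the function name is hypothetical.

```python
# Recognition sequences; BamHI and XhoI are named in the text, EcoRI is added
# here because GAATTC occurs in some of the listed N-terminal primers.
SITES = {"BamHI": "GGATCC", "XhoI": "CTCGAG", "EcoRI": "GAATTC"}


def find_site(primer):
    """Return (enzyme, clamp) for the 5'-most recognition site, or None."""
    hits = [(primer.find(seq), name) for name, seq in SITES.items() if seq in primer]
    if not hits:
        return None
    index, name = min(hits)
    return name, primer[:index]


primers = {
    "GHPO 961 N-terminal": "CGCGGATCCGATTTTACTTGAAAAATTTAAAC",
    "GHPO 961 C-terminal": "CCGCTCGAGTTAGAAAGTGTAGTTCAAATAC",
    "GHPO 1282 N-terminal": "GCGGATCCTTTTCTTCAATGTTTG",
    "GHPO 296 N-terminal": "CCGAATTCGGTTATAAAGCCCCT",
    "GHPO 296 C-terminal": "CCGCTCGAGTTAAGGCTGATTTAA",
}

for label, sequence in primers.items():
    print(label, find_site(sequence))
```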

Amplification of gene-specific DNA is carried out using Vent DNA 
Polymerase (New England Biolabs) or Taq DNA polymerase (Appligene), 
according to the manufacturer's instructions. The reaction mixture, which is 
brought to a final volume of 100 µl with distilled water, is as follows:
associated with insertion of signal sequences in the membrane of the host strain
carrying the hybrid construct are avoided. 
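
The primer design rule of section 1.E. (start the gene-specific portion at the first codon after the cleavage site, or at the second codon after it for lipoproteins) can be sketched in Python as follows. The ORF is a made-up toy sequence, and the clamp, the BamHI site, and the length of the gene-specific portion are illustrative defaults rather than values prescribed by the text.

```python
def forward_primer(orf, signal_residues, lipoprotein=False,
                   gene_specific_nt=18, clamp="CGC", site="GGATCC"):
    """Build an N-terminal primer that skips the signal sequence.

    signal_residues is the number of residues removed by cleavage, so the
    +1 residue is codon index signal_residues (0-based). For lipoproteins
    the primer starts one codon later, after the modifiable +1 residue.
    """
    first_codon = signal_residues + (1 if lipoprotein else 0)
    start = 3 * first_codon
    gene_part = orf[start:start + gene_specific_nt]
    if len(gene_part) < gene_specific_nt:
        raise ValueError("ORF too short for the requested primer length")
    return clamp + site + gene_part


# Hypothetical ORF: 5-residue "signal" (M K L L A), then Cys at +1.
TOY_ORF = "ATGAAACTGCTGGCTTGCGATGAAACCGGTTATAAAGCC"

print(forward_primer(TOY_ORF, 5, gene_specific_nt=9))
print(forward_primer(TOY_ORF, 5, lipoprotein=True, gene_specific_nt=9))
```

The first call starts at the TGC (Cys) codon; the lipoprotein call starts at the codon that follows it.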

EXAMPLE 2: Preparation of isolated DNA encoding the polypeptides of
the invention, and production of these polypeptides as histidine-tagged
fusion proteins

2.A. Preparation of genomic DNA from Helicobacter pylori 

H. pylori strain ORV2001, stored in LB medium containing 50%
glycerol at -70°C, is grown on Columbia agar containing 7% sheep blood for
48 hours under microaerophilic conditions (8-10% CO2, 5-7% O2, 85-87% N2).
Cells are harvested, washed with phosphate buffer saline (PBS) (pH 7.2), and
DNA is then extracted from the cells using the Rapid Prep Genomic DNA
Isolation kit (Pharmacia Biotech).

2.B. PCR amplification

DNA molecules encoding the polypeptides of the invention are
amplified from genomic DNA, as can be prepared as is described above, by the
Polymerase Chain Reaction (PCR) using primers that can readily be designed
by one skilled in the art. Specific examples of primers that can be used in the
invention are shown in Table 1. As specific examples, to amplify genes
encoding GHPO 147, GHPO 615, GHPO 961, GHPO 1282, GHPO 296, and
GHPO 840 the following primers can be used:

GHPO 147: 5'-CTGAATTCGAATGAAAAGAATTTTAGTCTCT-3' (SEQ ID NO: 1365), and
5'-CCGCTCGAGTTAAAACTCATAATTCAAAT-3' (SEQ ID NO: 1366).

GHPO 615: 5'-CGCGGATCCGAAGACATGTGCAACCGATG-3' (SEQ ID
1.D. Identification of signal sequences

The deduced protein encoded by a target gene sequence is analyzed 
using the PROTEAN software package (DNAStar, Inc.). This analysis predicts 
those areas of the protein that are hydrophobic by using the Kyte-Doolittle 
algorithm, and identifies any potential polar residues preceding the 
hydrophobic core region, which is typical for many signal sequences. For 
confirmation, the target protein is then searched against a PROSITE database 
(DNAStar, Inc.) consisting of motifs and signatures. Characteristic of many 
signal sequences and hydrophobic regions in general, is the identification of 
predicted prokaryotic lipid attachment sites. Where confirmation between the 
two approaches is apparent at the N-terminus of any protein, putative cleavage 
sites are sought. Specifically, this includes the presence of either an Alanine 
(A), Serine (S), or Glycine (G) residue immediately after the core hydrophobic 
region. In the case of lipoproteins, a Cysteine (C) residue would be identified 
as the +1 residue, post-cleavage. 
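
The hydrophobicity analysis described above can be illustrated with a sliding-window Kyte-Doolittle calculation. The Python sketch below is not the PROTEAN implementation; the window size, the function names, and the toy peptide are assumptions made for the example, while the scale values are the published Kyte-Doolittle hydropathy indices.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=9):
    """Mean hydropathy of every window of the given width."""
    scores = [KYTE_DOOLITTLE[aa] for aa in protein.upper()]
    if len(scores) < window:
        raise ValueError("protein is shorter than the window")
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]


def most_hydrophobic_window(protein, window=9):
    """Return (start index, mean hydropathy) of the most hydrophobic window."""
    profile = hydropathy_profile(protein, window)
    start = max(range(len(profile)), key=profile.__getitem__)
    return start, profile[start]


def residue_after_core(protein, window=9):
    """Residue immediately after the most hydrophobic window, if any."""
    start, _ = most_hydrophobic_window(protein, window)
    end = start + window
    return protein[end] if end < len(protein) else None


toy = "KKLLLLLAK"  # hypothetical peptide, not a sequence from the text
print(most_hydrophobic_window(toy, window=5))
print(residue_after_core(toy, window=5))
```

A residue of A, S, or G returned by `residue_after_core` would be consistent with the putative cleavage-site pattern described above, and C with the lipoprotein case; a real analysis would also need the motif search.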

1.E. Rational design of PCR primers based on the identification of signal
sequences

In order to clone gene sequences as N-terminus translational fusions for 
the generation of recombinant proteins with N-terminal Histidine tags, the gene 
sequence that specifies the signal sequence is omitted. The 5'-end of the gene- 
specific portion of the N-terminal primer is designed to start at the first codon 
beyond the cleavage site. In the case of lipoproteins, the 5'-end of the N-
terminal primer begins at the second codon, immediately after the modifiable
residue at position +1 post-cleavage. The omission of the signal sequence from
the recombinant allows for one-step purification, and potential problems 
by the H. pylori genome for both the plus and minus strands, and was given the
designation "O." Each ORF was assigned a number indicating its location on
the genome and its position relative to other genes. No manipulation of the text 
file was required. 

1.B. Searching the H. pylori databases

The databases constructed as is described above were searched using the
program FASTA (Pearson et al., Proc. Natl. Acad. Sci. USA 85:2444-2448,
1988). FASTA was used for searching either a DNA sequence against either of 
the gene databases ("H" and/or "N"), or a peptide sequence against the ORF 
library ("O"). TFASTX was used to search a peptide sequence against all 
possible reading frames of a DNA database ("H" and/or "N" libraries). 
Potential frameshifts also being resolved, FASTX was used for searching the 
translated reading frames of a DNA sequence against either a DNA database, or 
a peptide sequence against the protein database. 

1.C. Isolation of DNA sequences from the H. pylori genome

The FASTA searches against the constructed DNA databases identified
exact nucleotide coordinates on one or more of the isolated contigs, and 
therefore the location of the target DNA. Once the exact location of the target 
sequence was known, the contig identified to carry the gene was exported into 
the software package MapDraw (DNAStar, Inc.) and the gene was isolated. 
Gene sequences with flanking DNA were then excised and copied into the
EditSeq software package (DNAStar, Inc.) for further analysis.
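
The excision step described above was carried out with MapDraw and EditSeq; as a rough illustration of the same idea, the Python sketch below cuts a gene out of a contig together with flanking DNA, given 0-based coordinates. The function name, the coordinate convention, and the flank length are assumptions made for the example.

```python
def excise_with_flanks(contig, start, end, flank=200):
    """Return contig[start:end] plus up to `flank` nucleotides on each side.

    Coordinates are 0-based and end-exclusive; flanks are clipped at the
    contig boundaries.
    """
    if not 0 <= start < end <= len(contig):
        raise ValueError("coordinates fall outside the contig")
    left = max(0, start - flank)
    right = min(len(contig), end + flank)
    return contig[left:right]


toy_contig = "A" * 10 + "CCCC" + "T" * 10  # hypothetical contig
print(excise_with_flanks(toy_contig, 10, 14, flank=3))
```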
the invention so that they can be produced without his-tags, and Example 4
describes methods for purifying recombinantly produced polypeptides of the 
invention. 



EXAMPLE 1: Identification of genes in the H. pylori genome,
identification of signal sequences, and primer design for amplification of
genes lacking signal sequences

1.A. Creating H. pylori genomic databases

The H. pylori genome was provided as a text file containing a single
contiguous string of nucleotides that had been determined to be 1.76
Megabases in length. The complete genome was split into 17 separate files
using the program SPLIT (Creativity in Action), giving rise to 16 contigs, each
containing 100,000 nucleotides, and a 17th contig containing the remaining
76,000 nucleotides. A header was added to each of the 17 files using the
format: >hpg0.txt (representing contig 1), >hpg1.txt (representing contig 2), etc.
The resulting 17 files, named hpg0 through hpg16, were then copied together to
form one file that represented the plus strand of the complete H. pylori genome.
The constructed database was given the designation "H." A negative strand
database of the H. pylori genome was created similarly by first creating a
reverse complement of the positive strand using the program SeqPup (D.G.
Gilbert, Indiana University Biology Department) and then performing the same
procedure as described above for the plus strand. This database was given the
designation "N."
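
The database construction described above (splitting the genome into 100,000-nucleotide contigs with headers, and repeating the procedure on the reverse complement) can be sketched in Python as follows. This is an illustration only, not the SPLIT or SeqPup programs; the function names are hypothetical, and the placeholder genome simply has the length implied by the stated contig sizes (16 x 100,000 + 76,000 nucleotides).

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence):
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return sequence.translate(COMPLEMENT)[::-1]


def split_into_contigs(genome, size=100_000, prefix="hpg"):
    """Return a list of (header, contig) pairs, e.g. ('>hpg0.txt', ...)."""
    return [
        (f">{prefix}{index}.txt", genome[start:start + size])
        for index, start in enumerate(range(0, len(genome), size))
    ]


def build_database(genome):
    """Concatenate header and contig lines into one searchable text."""
    lines = []
    for header, contig in split_into_contigs(genome):
        lines.append(header)
        lines.append(contig)
    return "\n".join(lines)


# Placeholder sequence of 1,676,000 nucleotides; not real genome data.
placeholder_genome = "ACGT" * 419_000
plus_contigs = split_into_contigs(placeholder_genome)
minus_contigs = split_into_contigs(reverse_complement(placeholder_genome))
print(len(plus_contigs), plus_contigs[0][0], plus_contigs[-1][0])
print(len(plus_contigs[-1][1]))
```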

The regions predicted to encode open reading frames (ORFs) were
defined for the complete H. pylori genome using the program GENEMARK™
(Borodovsky et al., Comp. Chem. 17:123, 1993). A database was created from
a text file containing an annotated version of all ORFs predicted to be encoded
pepto-bismol (see, e.g., Goodwin et al., Helicobacter pylori: Biology and
Clinical Practice, CRC Press, Boca Raton, FL, pp 366-395, 1993; Physicians'
Desk Reference, 49th edn., Medical Economics Data Production Company,
Montvale, New Jersey, 1995). In addition, compounds containing more than
one of the above-listed components coupled together, e.g., ranitidine coupled to
bismuth subcitrate, can be used. The invention also includes compositions for
carrying out these methods, i.e., compositions containing a Helicobacter
antigen (or antigens) of the invention, an adjuvant, and one or more of the
above-listed compounds, in a pharmaceutically acceptable carrier or diluent.
Amounts of the above-listed compounds used in the methods and
compositions of the invention can readily be determined by one skilled in the
art. In addition, one skilled in the art can readily design
treatment/immunization schedules. For example, the non-vaccine components
can be administered on days 1-14, and the vaccine antigen + adjuvant can be
administered on days 7, 14, 21, and 28.

Methods and pharmaceutical compositions of the invention can be used
to treat or to prevent Helicobacter infections and, accordingly, gastroduodenal
diseases associated with these infections, including acute, chronic, and atrophic
gastritis, and peptic ulcer diseases, e.g., gastric and duodenal ulcers.

The invention is further illustrated by the following examples. Example
1 describes identification of genes, such as genes that encode the polypeptides
of the invention, in the Helicobacter genome, as well as identification of signal
sequences, and primer design for amplification of genes lacking signal
sequences. Example 2 describes cloning of DNA molecules encoding
polypeptides of the invention into a vector that provides a histidine tag, and
production and purification of the resulting his-tagged fusion proteins.
Example 3 describes methods for cloning DNA encoding the polypeptides of
with a pharmaceutically acceptable diluent or carrier, e.g., water or a saline
solution, such as phosphate buffer saline, optionally, including a bicarbonate 
salt, such as sodium bicarbonate, e.g., 0.1 to 0.5 M. Bicarbonate can 
advantageously be added to compositions intended for oral or intragastric 
administration. In general, a diluent or carrier can be selected on the basis of 
the mode and route of administration, and standard pharmaceutical practice. 
Suitable pharmaceutical carriers and diluents, as well as pharmaceutical 
necessities for their use in pharmaceutical formulations, are described in 
Remington 's Pharmaceutical Sciences, a standard reference text in this field 
and in the USP/NF. 

The invention also includes methods in which gastroduodenal infections, 
such as Helicobacter infection, are treated by oral administration of a 
Helicobacter polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antisecretory agent, a bismuth salt, an 
antacid, sucralfate, or a combination thereof. Examples of such compounds 
that can be administered with the vaccine antigen and an adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, β-lactams,
aminoglycosides, quinolones, penicillins, and derivatives thereof (specific
examples of antibiotics that can be used in the invention include, e.g.,
amoxicillin, clarithromycin, tetracycline, metronidazole, erythromycin, and
cefuroxime); antisecretory agents, including, e.g.,
H2-receptor antagonists (e.g., cimetidine, ranitidine, famotidine, nizatidine, and
roxatidine), proton pump inhibitors (e.g., omeprazole, lansoprazole, and
pantoprazole), prostaglandin analogs (e.g., misoprostol and enprostil), and
anticholinergic agents (e.g., pirenzepine, telenzepine, carbenoxolone, and 
proglumide); and bismuth salts, including colloidal bismuth subcitrate, 
tripotassium dicitrate bismuthate, bismuth subsalicylate, bicitropeptide, and 
Adjuvants that can be used in any of the vaccine compositions described
above are described as follows. Adjuvants for parenteral administration
include, for example, aluminum compounds, such as aluminum hydroxide, 
aluminum phosphate, and aluminum hydroxy phosphate. The antigen can be 
precipitated with, or adsorbed onto, the aluminum compound using standard 
methods. Other adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can 
also be used in parenteral administration. 

Adjuvants that can be used for mucosal administration include, for 
example, bacterial toxins, e.g., the cholera toxin (CT), the E. coli heat-labile 
toxin (LT), the Clostridium difficile toxin A, the pertussis toxin (PT), and 
combinations, subunits, toxoids, or mutants thereof. For example, a purified 
preparation of native cholera toxin subunit B (CTB) can be used. Fragments, 
homologs, derivatives, and fusions to any of these toxins can also be used, 
provided that they retain adjuvant activity. Preferably, a mutant having 
reduced toxicity is used. Suitable mutants are described, e.g., in WO 95/17211
(Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT mutant), and WO 
95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT mutants 
that can be used in the methods and compositions of the invention include, e.g., 
Ser-63-Lys, Ala-69-Gly, Glu-110-Asp, and Glu-112-Asp mutants. Other
adjuvants, such as the bacterial monophosphoryl lipid A (MPLA) of, e.g., E.
coli, Salmonella minnesota, Salmonella typhimurium, or Shigella flexneri,
saponins, and polylactide glycolide (PLGA) microspheres, can also be used in
mucosal administration. Adjuvants useful for both mucosal and parenteral 
administrations, such as polyphosphazene (WO 95/2415), can also be used. 

Any pharmaceutical composition of the invention, containing a 
polynucleotide, polypeptide, polypeptide derivative, or antibody of the 
invention, can be manufactured using standard methods. It can be formulated 
least one additional monospecific antibody specific for a different Helicobacter
polypeptide. The amount of antibody and the particular regimen used can be 
readily determined by one skilled in the art. For example, daily administration 
of about 100 to 1,000 mg of antibody over one week, or three doses per day of 
about 100 to 1,000 mg of antibody over two or three days, can be effective 
regimens for most purposes. 

Therapeutic or prophylactic efficacy can be evaluated using standard 
methods in the art, e.g., by measuring induction of a mucosal immune response 
or induction of protective and/or therapeutic immunity, using, e.g., the H. felis
mouse model and the procedures described by Lee et al. (Eur. J.
Gastroenterology & Hepatology 7:303, 1995) or Lee et al. (J. Infect. Dis.
172:161, 1995). Those skilled in the art will recognize that the H. felis strain of
the model can be replaced with another Helicobacter strain. For example, the
efficacy of polynucleotide molecules and polypeptides from H. pylori is,
preferably, evaluated in a mouse model using an H. pylori strain. Protection
can be determined by comparing the degree of Helicobacter infection in the 
gastric tissue assessed by, for example, urease activity, bacterial counts, or 
gastritis, to that of a control group. Protection is shown when infection is 
reduced by comparison to the control group. Such an evaluation can be made 
for polynucleotides, vaccine vectors, polypeptides, and polypeptide derivatives, 
as well as for antibodies of the invention. 

For example, various doses of an antibody of the invention can be 
administered to the gastric mucosa of mice previously challenged with an H.
pylori strain, as described, e.g., by Lee et al. (supra). Then, after an
appropriate period of time, the bacterial load of the mucosa can be estimated by 
assessing urease activity, as compared to a control. Reduced urease activity 
indicates that the antibody is therapeutically effective. 
allowed to adsorb onto the material. The chromatography material, such as a
gel or a resin coupled to an antibody of the invention, can be in batch form or in 
a column. The unbound components are washed off and the antigen is eluted 
with an appropriate elution buffer, such as a glycine buffer, a buffer containing 
a chaotropic agent, e.g., guanidine HCl, or a buffer having high salt 
concentration (e.g., 3 M MgCl2). Eluted fractions are recovered and the
presence of the antigen is detected, e.g., by measuring the absorbance at 280 
nm. 

An antibody of the invention can be screened for therapeutic efficacy as 
follows. According to an eleventh aspect of the invention, there is provided (i) 
a composition of matter containing a monospecific antibody of the invention, 
together with a diluent or carrier; (ii) a pharmaceutical composition containing 
a therapeutically or prophylactically effective amount of a monospecific 
antibody of the invention, and (iii) a method for treating or preventing 
Helicobacter (e.g., H. pylori, H. felis, H. mustelae, or H. heilmannii) infection,
by administering a therapeutic or prophylactic amount of a monospecific 
antibody of the invention to an individual in need of such treatment. In 
addition, the eleventh aspect of the invention includes the use of a 
monospecific antibody of the invention in the preparation of a medicament for 
treating or preventing Helicobacter infection. 

The monospecific antibody can be polyclonal or monoclonal, and is, 
preferably, predominantly of the IgA isotype. In passive immunization 
methods, the antibody is administered to a mucosal surface of a mammal, e.g., 
the gastric mucosa, e.g., orally or intragastrically, optionally, in the presence of 
a bicarbonate buffer. Alternatively, systemic administration, not requiring a
bicarbonate buffer, can be carried out. A monospecific antibody of the 
invention can be administered as a single active agent or as a mixture with at 
that interacts with the reagent is first attached to the solid support. For
example, if a polypeptide reagent is used, an antibody that binds to it can serve 
as an anti-reagent, provided that it binds to an epitope that is not involved in 
recognition of antibodies in biological samples. Indirect means can also 
employ a ligand-receptor system, for example, a molecule, such as a vitamin, 
can be grafted onto the polypeptide reagent and the corresponding receptor can 
be immobilized on the solid phase. This concept is illustrated by the well 
known biotin-streptavidin system. Alternatively, indirect means can be used,
e.g., by adding to the reagent a peptide tail, chemically or by genetic
engineering, and immobilizing the grafted or fused product by passive
adsorption or covalent linkage of the peptide tail. 

According to a tenth aspect of the invention, there is provided a process 
for purifying, from a biological sample, a polypeptide or polypeptide derivative 
of the invention, which involves carrying out antibody-based affinity 
chromatography with the biological sample, wherein the antibody is a 
monospecific antibody of the invention. 

For use in a purification process of the invention, the antibody can be 
polyclonal or monospecific, and preferably is of the IgG type. Purified IgGs 
can be prepared from an antiserum using standard methods (see, e.g., Coligan 
et al., supra). Conventional chromatography supports, as well as standard 
methods for grafting antibodies, are described, for example, by Harlow et al. 
(Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York, 1988). 

Briefly, a biological sample, such as an H. pylori extract, preferably in a 
buffer solution, is applied to a chromatography material, which is, preferably, 
equilibrated with the buffer used to dilute the biological sample, so that the 
polypeptide or polypeptide derivative of the invention (i.e., the antigen) is
methods to detect the presence of Helicobacter antigens in a sample, such as a
biological sample. The antibodies can also be used in affinity chromatography 
methods for purifying a polypeptide or polypeptide derivative of the invention. 
As is discussed further below, the antibodies can also be used in prophylactic 
and therapeutic passive immunization methods. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for 
detecting the presence of Helicobacter in a biological sample that contains an 
antibody, polypeptide, or polypeptide derivative of the invention; and (ii) a 
diagnostic method for detecting the presence of Helicobacter in a biological 
sample, by contacting the biological sample with an antibody, a polypeptide, or 
a polypeptide derivative of the invention, so that an immune complex is 
formed, and detecting the complex as an indication of the presence of 
Helicobacter in the sample or the organism from which the sample was 
derived. The immune complex is formed between a component of the sample
and the antibody, polypeptide, or polypeptide derivative, and any unbound
material can be removed prior to detecting the complex. A polypeptide reagent
can be used for detecting the presence of anti-Helicobacter antibodies in a
sample, e.g., a blood sample, while an antibody of the invention can be used for
screening a sample, such as a gastric extract or biopsy sample, for the presence 
of Helicobacter polypeptides. 

For use in diagnostic methods, the reagent (e.g., the antibody,
polypeptide, or polypeptide derivative of the invention) can be in a free state or 
can be immobilized on a solid support, such as, for example, on the interior 
surface of a tube or on the surface, or within pores, of a bead. Immobilization 
can be achieved using direct or indirect means. Direct means include passive 
adsorption (i.e., non-covalent binding) or covalent binding between the support 
and the reagent. By "indirect means" is meant that an anti-reagent compound 
unlabeled, depending upon the diagnostic method. Diagnostic methods
involving such a reagent are described below. 

Upon expression of a polynucleotide molecule of the invention, a 
polypeptide or polypeptide derivative is produced and can be purified using 
known methods. For example, the polypeptide or polypeptide derivative can be 
produced as a fusion protein containing a fused tail that facilitates purification.
The fusion product can be used to immunize a small mammal, e.g., a mouse or 
a rabbit, in order to raise monospecific antibodies against the polypeptide or 
polypeptide derivative. The eighth aspect of the invention thus provides a 
monospecific antibody that binds to a polypeptide or polypeptide derivative of 
the invention. 

By "monospecific antibody" is meant an antibody that is capable of 
reacting with a unique, naturally-occurring Helicobacter polypeptide. An 
antibody of the invention can be polyclonal or monoclonal. Monospecific 
antibodies can be recombinant, e.g., chimeric (e.g., consisting of a variable 
region of murine origin and a human constant region), humanized (e.g., a 
human immunoglobulin constant region and a variable region of animal, e.g., 
murine, origin), and/or single chain. Both polyclonal and monospecific 
antibodies can also be in the form of immunoglobulin fragments, e.g., F(ab')2
or Fab fragments. The antibodies of the invention can be of any isotype, e.g., 
IgG or IgA, and polyclonal antibodies can be of a single isotype or can contain 
a mixture of isotypes. 

The antibodies of the invention, which can be raised to a polypeptide or
polypeptide derivative of the invention, can be produced and identified using
standard immunological assays, e.g., Western blot assays, dot blot assays, or
ELISA (see, e.g., Coligan et al., Current Protocols in Immunology, John Wiley
& Sons, Inc., New York, NY, 1994). The antibodies can be used in diagnostic 
Administration can be achieved in a single dose or repeated as necessary
at intervals that can be determined by one skilled in the art. For example, a 
priming dose can be followed by three booster doses at weekly or monthly 
intervals. An appropriate dose depends on various parameters, including the 
nature of the recipient (e.g., whether the recipient is an adult or an infant), the 
particular vaccine antigen, the route and frequency of administration, the 
presence/absence or type of adjuvant, and the desired effect (e.g., protection 
and/or treatment), and can be readily determined by one skilled in the art. In 
general, a vaccine antigen of the invention can be administered mucosally in an 
amount ranging from about 10 µg to about 500 mg, preferably from about 1 mg
to about 200 mg. For a parenteral route of administration, the dose usually
should not exceed about 1 mg, and is, preferably, about 100 µg.

When used as components of a vaccine, the polynucleotides and 
polypeptides of the invention can be used sequentially as part of a multi-step 
immunization process. For example, a mammal can be initially primed with a 
vaccine vector of the invention, such as a pox virus, e.g., via a parenteral route, 
and then boosted twice with a polypeptide encoded by the vaccine vector, e.g., 
via the mucosal route. In another example, liposomes associated with a 
polypeptide or polypeptide derivative of the invention can be used for priming, 
with boosting being carried out mucosally using a soluble polypeptide or 
polypeptide derivative of the invention, in combination with a mucosal 
adjuvant (e.g., LT). 

Polypeptides and polypeptide derivatives of the invention can also be 
used as diagnostic reagents for detecting the presence of anti-Helicobacter 
antibodies, e.g., in blood samples. Such polypeptides can be about 5 to about 
80, preferably, about 10 to about 50 amino acids in length and can be labeled or 
the invention in the preparation of a medicament for preventing and/or treating
Helicobacter infection.

The immunogenic compositions of the invention can be administered by 
any conventional route in use in the vaccine field, for example, to a mucosal 
(e.g., ocular, intranasal, pulmonary, oral, gastric, intestinal, rectal, vaginal, or 
urinary tract) surface or via a parenteral (e.g., subcutaneous, intradermal, 
intramuscular, intravenous, or intraperitoneal) route. The choice of the 
administration route depends upon a number of parameters, such as the 
adjuvant used. For example, if a mucosal adjuvant is used, the intranasal or 
oral route will be preferred, and if a lipid formulation or an aluminum 
compound is used, a parenteral route will be preferred. In the latter case, the 
subcutaneous or intramuscular route is most preferred. The choice of 
administration route can also depend upon the nature of the vaccine agent. For 
example, a polypeptide of the invention fused to CTB or to LTB will be best 
administered to a mucosal surface. 

A composition of the invention can contain one or several polypeptides 
or derivatives of the invention. It can also contain at least one additional 
Helicobacter antigen, such as the urease apoenzyme, or a subunit, fragment, 
homolog, mutant, or derivative thereof. 

For use in a composition of the invention, a polypeptide or polypeptide 
derivative can be formulated into or with liposomes, such as neutral or anionic 
liposomes, microspheres, ISCOMS, or virus-like particles (VLPs), to facilitate 
delivery and/or enhance the immune response. These compounds are readily 
available to those skilled in the art; for example, see Liposomes: A Practical 
Approach (supra). Adjuvants other than liposomes can also be used in the 
invention and are well known in the art (see, for example, the list provided 
below). 
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addition of 50 mM Tris-HCl (pH 7.5) and the membrane is air-dried. The 
membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5), 0.15 M 
NaCl, 10 g/L skim milk) and incubated with an antiserum diluted from about 
1:50 to about 1:5000, preferably about 1:500. The reaction is detected using 
standard methods. For example, a goat anti-rabbit peroxidase conjugate can be 
added to the wells when rabbit antibodies are used. Incubation is carried out 
for about 90 minutes at 37 °C and the blot is washed. The reaction is developed 
with the appropriate substrate and stopped. The reaction is then measured 
visually by the appearance of a colored spot, e.g., by colorimetry. Under these 
experimental conditions, a positive reaction is associated with detection of a 
colored spot for reactions carried out with a dilution of at least about 1:50, 
preferably, of at least about 1:500. Therapeutic or prophylactic efficacy of a 
polypeptide or polypeptide derivative of the invention can be evaluated as 
described below. 

According to a seventh aspect of the invention, there is provided (i) a 
composition of matter containing a polypeptide of the invention together with a 
diluent or carrier; (ii) a pharmaceutical composition containing a 
therapeutically or prophylactically effective amount of a polypeptide of the 
invention; (iii) a method for inducing an immune response against Helicobacter 
in a mammal by administering to the mammal an immunogenically effective 
amount of a polypeptide of the invention to elicit an immune response, e.g., a 
protective immune response to Helicobacter; and (iv) a method for preventing 
and/or treating a Helicobacter (e.g., H. pylori, H. felis, H. mustelae, or H. 
heilmanii) infection, by administering a prophylactic or therapeutic amount of a 
polypeptide of the invention to an individual in need of such treatment. 
Additionally, this aspect of the invention includes the use of a polypeptide of 
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corresponding to the product exhibits reactivity at any of the dilutions in the 
range. 

In an ELISA assay, the product to be screened can be used as the coating 
antigen. A purified preparation is preferred, but a whole cell extract can also be 
used. Briefly, about 100 µl of a preparation of about 10 µg protein/ml is 
distributed into wells of a 96-well ELISA plate. The plate is incubated for 
about 2 hours at 37°C, then overnight at 4°C. The plate is washed with 
phosphate-buffered saline (PBS) containing 0.05% Tween 20 (PBS/Tween buffer) 
and the wells are saturated with 250 µl PBS containing 1% bovine serum 
albumin (BSA), to prevent non-specific antibody binding. After 1 hour of 
incubation at 37°C, the plate is washed with PBS/Tween buffer. The antiserum 
is serially diluted in PBS/Tween buffer containing 0.5% BSA, and 100 µl 
of each dilution is added per well. The plate is incubated for 90 minutes at 37°C, 
washed, and evaluated using standard methods. For example, a goat anti-rabbit 
peroxidase conjugate can be added to the wells when the specific antibodies 
used were raised in rabbits. Incubation is carried out for about 90 minutes at 
37°C and the plate is washed. The reaction is developed with the appropriate 
substrate and the reaction is measured by colorimetry (absorbance measured 
spectrophotometrically). Under these experimental conditions, a positive 
reaction is shown once an O.D. value of 1.0 is detected with a dilution of at 
least about 1:50, preferably of at least about 1:500. 
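
The positivity criterion above can be illustrated with a minimal Python sketch. It assumes one possible reading of the text: a product is scored positive when an O.D. of at least 1.0 is still observed at an antiserum dilution factor of 50 or more, and is in the preferred class at 500 or more. The function names, the data layout (pairs of dilution factor and O.D.), and the class labels are hypothetical and are not part of the specification.

```python
from typing import Optional, Sequence, Tuple

OD_CUTOFF = 1.0            # O.D. value stated in the text
MIN_DILUTION = 50          # 1:50, minimum dilution stated in the text
PREFERRED_DILUTION = 500   # 1:500, preferred dilution stated in the text


def endpoint_dilution(readings: Sequence[Tuple[int, float]],
                      cutoff: float = OD_CUTOFF) -> Optional[int]:
    """Return the highest dilution factor whose O.D. reaches the cutoff."""
    positive = [dilution for dilution, od in readings if od >= cutoff]
    return max(positive) if positive else None


def classify(readings: Sequence[Tuple[int, float]]) -> str:
    """Score a serial-dilution series (assumed interpretation, see lead-in)."""
    top = endpoint_dilution(readings)
    if top is None or top < MIN_DILUTION:
        return "negative"
    if top >= PREFERRED_DILUTION:
        return "positive (preferred)"
    return "positive"


if __name__ == "__main__":
    # Illustrative readings only: (dilution factor, measured O.D.)
    series = [(50, 1.8), (500, 1.2), (5000, 0.4)]
    print(classify(series))
```

The thresholds are kept as named constants so that the "about" in the text can be accommodated by adjusting them rather than the logic.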

In a dot blot assay, a purified product is preferred, although a whole cell 
extract can be used. Briefly, a solution of the product at a concentration of 
about 100 µg/ml is serially diluted two-fold with 50 mM Tris-HCl (pH 7.5). 
One hundred µl of each dilution is applied to a filter, such as a 0.45 µm 
nitrocellulose membrane, set in a 96-well dot blot apparatus (Biorad). The 
buffer is removed by applying vacuum to the system. Wells are washed by 
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A "substantially purified polypeptide" is defined as a polypeptide that is 
separated from the environment in which it naturally occurs and/or a 
polypeptide that is free of most of the other polypeptides that are present in the 
environment in which it was synthesized. The polypeptides of the invention 
can be purified from a natural source, such as a Helicobacter strain, or can be 
produced using recombinant methods. 

Homologous polypeptides or polypeptide derivatives encoded by 
polynucleotides of the invention can be screened for specific antigenicity by 
testing cross-reactivity with an antiserum raised against a polypeptide having 
an amino acid sequence as shown in the sequence listing (even numbers, up to 
SEQ ID NO: 1364). Briefly, a monospecific hyperimmune antiserum can be 
raised against a purified reference polypeptide as such or as a fusion 
polypeptide, for example, an expression product of MBP, GST, or His-tag 
systems, or a synthetic peptide predicted to be antigenic. The homologous 
polypeptide or derivative that is screened for specific antigenicity can be 
produced as such or as a fusion polypeptide. In the latter case, and if the 
antiserum is also raised against a fusion polypeptide, two different fusion 
systems are employed. Specific antigenicity can be determined using a number 
of methods, including Western blot (Towbin et al., Proc. Natl. Acad. Sci. USA 
76:4350, 1979), dot blot, and ELISA methods, as described below. 

In a Western blot assay, the product to be screened, either as a purified 
preparation or a total E. coli extract, is fractionated by SDS-PAGE, as 
described, for example, by Laemmli (Nature 227:680, 1970). After being 
transferred to a filter, such as a nitrocellulose membrane, the material is 
incubated with the monospecific hyperimmune antiserum, which is diluted in a 
range of dilutions from about 1:50 to about 1:5000, preferably from about 
1:100 to about 1:500. Specific antigenicity is shown once a band 
exception that RNA is used as a target), or a sandwich method (Dunn et al., 
Cell 12:23, 1977). As is known in the art, the latter technique involves the use 
of a specific capture probe and a specific detection probe that have nucleotide 
sequences that are at least partially different from each other. 

Primers used in the invention usually contain about 10 to 40 nucleotides 
and are used to initiate enzymatic polymerization of DNA in an amplification 
process (e.g., PCR), an elongation process, or a reverse transcription method. 
In a diagnostic method involving PCR, the primers can be labeled. 

Thus, the invention also encompasses (i) a reagent containing a probe of 
the invention for detecting and/or identifying the presence of Helicobacter in a 
biological material; (ii) a method for detecting and/or identifying the presence 
of Helicobacter in a biological material, in which (a) a sample is recovered or 
derived from the biological material, (b) DNA or RNA is extracted from the 
material and denatured, and (c) the sample is exposed to a probe of the 
invention, for example, a capture probe, a detection probe, or both, under 
stringent hybridization conditions, so that hybridization is detected; and (iii) a 
method for detecting and/or identifying the presence of Helicobacter in a 
biological material, in which (a) a sample is recovered or derived from the 
biological material, (b) DNA is extracted therefrom, (c) the extracted DNA is 
contacted with at least one, or, preferably two, primers of the invention, and 
amplified by the polymerase chain reaction, and (d) an amplified DNA 
molecule is produced. 

As mentioned above, polypeptides that can be produced by expression 
of the polynucleotides of the invention can be used as vaccine antigens. 
Accordingly, a sixth aspect of the invention features a substantially purified 
polypeptide or polypeptide derivative having an amino acid sequence encoded 
by a polynucleotide of the invention. 
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listing. For example, they can contain from about 5 to about 100, preferably 
from about 10 to about 80 nucleotides. In particular, probes have sequences 
that are at least 75%, preferably at least 85%, more preferably 95% 
homologous to a portion of a sequence as shown in the sequence listing (odd 
numbers, up to SEQ ID NO: 1363), or a sequence complementary to any of 
such sequences. 
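
As an illustration of the homology thresholds above, the following Python sketch scores a probe against the best-matching portion of a reference sequence. It assumes that "homologous to a portion of a sequence" can be approximated by ungapped percent identity over a window of the probe's length; that simplification, the function names, and the example sequences are assumptions made for illustration and do not come from the sequence listing.

```python
def best_window_identity(probe: str, target: str) -> float:
    """Percent identity of the probe against its best ungapped window in target."""
    probe, target = probe.upper(), target.upper()
    n = len(probe)
    if n == 0 or n > len(target):
        raise ValueError("probe must be non-empty and no longer than the target")
    best = 0
    # Slide the probe along the target and keep the best match count.
    for start in range(len(target) - n + 1):
        window = target[start:start + n]
        matches = sum(1 for a, b in zip(probe, window) if a == b)
        best = max(best, matches)
    return 100.0 * best / n


def homology_tier(percent: float) -> str:
    """Map percent identity to the 75% / 85% / 95% tiers named in the text."""
    if percent >= 95:
        return "more preferred (>=95%)"
    if percent >= 85:
        return "preferred (>=85%)"
    if percent >= 75:
        return "acceptable (>=75%)"
    return "below threshold"


if __name__ == "__main__":
    # Hypothetical sequences, for illustration only.
    reference = "TTACGTACGTACTT"
    probe = "ACGTACGTAA"
    pct = best_window_identity(probe, reference)
    print(pct, homology_tier(pct))
```

A real comparison would normally use a gapped alignment and would also test the complementary strand; the sliding window is used here only to keep the sketch short.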

Probes can contain modified bases, such as inosine, methyl-5-deoxycytidine, 
deoxyuridine, dimethylamino-5-deoxyuridine, or diamino-2,6-purine. 
Sugar or phosphate residues can also be modified or substituted. For 
example, a deoxyribose residue can be replaced by a polyamide (Nielsen et al., 
Science 254:1497, 1991) and phosphate residues can be replaced by ester 
groups such as diphosphate, alkyl, arylphosphonate, and phosphorothioate 
esters. In addition, the 2'-hydroxyl group on ribonucleotides can be modified 
by addition of, e.g., alkyl groups. 

Probes of the invention can be used in diagnostic tests, or as capture or 
detection probes. Such capture probes can be immobilized on solid supports, 
directly or indirectly, by covalent means or by passive adsorption. A detection 
probe can be labeled by a detectable label, for example a label selected from 
radioactive isotopes; enzymes, such as peroxidase and alkaline phosphatase; 
enzymes that are able to hydrolyze a chromogenic, fluorogenic, or luminescent 
substrate; compounds that are chromogenic, fluorogenic, or luminescent; 
nucleotide base analogs; and biotin. 

Probes of the invention can be used in any conventional hybridization 
method, such as in dot blot methods (Maniatis et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York, 1982), Southern blot methods (Southern, J. Mol. Biol. 
98:503, 1975), northern blot methods (identical to Southern blot to the 
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intestinal, rectal, vaginal, or urinary tract surface, or via a parenteral route, e.g., 
by an intravenous, subcutaneous, intraperitoneal, intradermal, intraepidermal, 
or intramuscular route. The choice of administration route will depend on, e.g., 
the formulation that is selected. A polynucleotide formulated in association 
with bupivacaine is advantageously administered into muscle. When a neutral 
or anionic liposome or a cationic lipid, such as DOTMA, is used, the 
formulation can be advantageously injected via intravenous, intranasal (for 
example, by aerosolization), intramuscular, intradermal, and subcutaneous 
routes. A polynucleotide in a naked form can advantageously be administered 
via the intramuscular, intradermal, or subcutaneous routes. Although not 
absolutely required, such a composition can also contain an adjuvant. A 
systemic adjuvant that does not require concomitant administration in order to 
exhibit an adjuvant effect is preferable. 

The sequence information provided in the present application enables 
the design of specific nucleotide probes and primers that can be used in 
diagnostic methods. Accordingly, in a fifth aspect of the invention, there is 
provided a nucleotide probe or primer having a sequence found in, or derived 
by degeneracy of the genetic code from, a sequence shown in the sequence 
listing (odd numbers, up to SEQ ID NO: 1363). 

The term "probe" as used in the present application refers to DNA 
(preferably single stranded) or RNA molecules (or modifications or 
combinations thereof) that hybridize under the stringent conditions, as defined 
above, to polynucleotide molecules having sequences homologous to any of 
those shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363), or 
to a complementary or anti-sense sequence of any of those shown in the 
sequence listing (odd numbers, up to SEQ ID NO: 1363). Generally, probes are 
significantly shorter than the full-length sequences shown in the sequence 
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for gene delivery are preferably used in association with a neutral lipid such as 
DOPE (dioleyl phosphatidylethanolamine; WO 90/11092). Other 
transfection-facilitating compounds can be added to a formulation containing 
cationic liposomes. A number of them are described in, e.g., WO 93/18759, 
WO 93/19768, WO 94/25608, and WO 95/2397. They include, e.g., spermine 
derivatives useful for facilitating the transport of DNA through the nuclear 
membrane (see, for example, WO 93/18759) and membrane-permeabilizing 
compounds such as GALA, Gramicidin S, and cationic bile salts (see, for 
example, WO 93/19768). 

Gold or tungsten microparticles can also be used for gene delivery, as 
described in WO 91/359, WO 93/17706, and by Tang et al. (Nature 356:152, 
1992). In this case, the polynucleotide-coated microparticles can be injected 
via intradermal or intraepidermal routes using a needleless injection device 
("gene gun"), such as those described in U.S. Patent No. 4,945,050, U.S. Patent 
No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient depends, e.g., on 
the strength of the promoter used in the DNA construct, the immunogenicity of 
the expressed gene product, the condition of the mammal intended for 
administration (e.g., the weight, age, and general health of the mammal), the 
mode of administration, and the type of formulation. In general, a 
therapeutically or prophylactically effective dose from about 1 µg to about 
1 mg, preferably, from about 10 µg to about 800 µg, and, more preferably, from 
about 25 µg to about 250 µg, can be administered to human adults. The 
administration can be achieved in a single dose or repeated at intervals. 
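
The three nested dose ranges stated above can be written out as a small lookup, shown here as a Python sketch. The ranges are taken from the paragraph (1 µg to 1 mg, 10 µg to 800 µg, 25 µg to 250 µg); the labels, the function name, and the treatment of the "about" bounds as exact inclusive limits are assumptions made for illustration, and the sketch is not dosing guidance.

```python
# Ranges in micrograms, narrowest first so the most specific tier is reported.
DOSE_RANGES_UG = [
    ("more preferred", 25.0, 250.0),
    ("preferred", 10.0, 800.0),
    ("effective", 1.0, 1000.0),   # 1 mg expressed as 1000 µg
]


def dose_tier(dose_ug: float) -> str:
    """Return the narrowest stated range that contains the given DNA dose."""
    for label, low, high in DOSE_RANGES_UG:
        if low <= dose_ug <= high:
            return label
    return "outside stated range"


if __name__ == "__main__":
    for dose in (5, 100, 500, 2000):
        print(dose, "µg:", dose_tier(dose))
```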

The route of administration can be any conventional route used in the 
vaccine field. As general guidance, a polynucleotide of the invention can be 
administered via a mucosal surface, e.g., an ocular, intranasal, pulmonary, oral, 




Standard methods can be used in the preparation of therapeutic 
polynucleotides of the invention. For example, a polynucleotide can be used in 
a naked form, free of any delivery vehicles, such as anionic liposomes, cationic 
lipids, microparticles, e.g., gold microparticles, precipitating agents, e.g., 
calcium phosphate, or any other transfection-facilitating agent. In this case, the 
polynucleotide can be simply diluted in a physiologically acceptable solution, 
such as sterile saline or sterile buffered saline, with or without a carrier. When 
present, the carrier preferably is isotonic, hypotonic, or weakly hypertonic, and 
has a relatively low ionic strength, such as provided by a sucrose solution, e.g., 
a solution containing 20% sucrose. 

Alternatively, a polynucleotide can be associated with agents that assist 
in cellular uptake. It can be, e.g., (i) complemented with a chemical agent that 
modifies cellular permeability, such as bupivacaine (see, e.g., WO 94/16737), 
(ii) encapsulated into liposomes, or (iii) associated with cationic lipids or silica, 
gold, or tungsten microparticles. 

Anionic and neutral liposomes are well-known in the art (see, e.g., 
Liposomes: A Practical Approach, RPC New Ed, IRL Press, 1990, for a 
detailed description of methods for making liposomes) and are useful for 
delivering a large range of products, including polynucleotides. 

Cationic lipids can also be used for gene delivery. Such lipids include, 
for example, Lipofectin™, which is also known as DOTMA 
(N-[1-(2,3-dioleyloxy)propyl]-N,N,N-trimethylammonium chloride), DOTAP 
(1,2-bis(oleyloxy)-3-(trimethylammonio)propane), DDAB 
(dimethyldioctadecylammonium bromide), DOGS (dioctadecylamidologlycyl 
spermine), and cholesterol derivatives. A description of these cationic lipids 
can be found in EP 187,702, WO 90/11092, U.S. Patent No. 5,283,185, 
WO 91/15501, WO 95/26356, and U.S. Patent No. 5,527,928. Cationic lipids 
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unable to replicate in a mammalian cell and unable to integrate into the 
mammalian genome. Typically, a DNA molecule is placed under the control of 
a promoter suitable for expression in a mammalian cell. The promoter can 
function ubiquitously or tissue-specifically. Examples of non-tissue specific 
promoters include the early Cytomegalovirus (CMV) promoter (U.S. Patent 
No. 4,168,062) and the Rous Sarcoma Virus promoter (Norton et al., Molec. 
Cell Biol. 5:281, 1985). The desmin promoter (Li et al., Gene 78:243, 1989; Li 
et al., J. Biol. Chem. 266:6562, 1991; Li et al., J. Biol. Chem. 268:10403, 
1993) is tissue-specific and drives expression in muscle cells. More generally, 
useful promoters and vectors are described, e.g., in WO 94/21797 and by 
Hartikka et al. (Human Gene Therapy 7:1205, 1996). 

For DNA/RNA vaccination, the polynucleotide of the invention can 
encode a precursor or a mature form of a polypeptide of the invention. When it 
encodes a precursor form, the precursor sequence can be homologous or 
heterologous. In the latter case, a eucaryotic leader sequence can be used, such 
as the leader sequence of the tissue-type plasminogen activator (tPA). 

A composition of the invention can contain one or several 
polynucleotides of the invention. It can also contain at least one additional 
polynucleotide encoding another Helicobacter antigen, such as urease subunit 
A, B, or both, or a fragment, derivative, mutant, or analog thereof. A 
polynucleotide encoding a cytokine, such as interleukin-2 (IL-2) or interleukin- 
12 (IL-12), can also be added to the composition so that the immune response 
is enhanced. These additional polynucleotides are placed under appropriate 
control for expression. Advantageously, DNA molecules of the invention 
and/or additional DNA molecules to be included in the same composition are 
carried in the same plasmid. 
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invention can be inserted into the bacterial genome or it can remain in a free 
state, for example, carried on a plasmid. 

An adjuvant can also be added to a composition containing a bacterial 
vector vaccine. A number of adjuvants that can be used are known to those 
skilled in the art. For example, preferred adjuvants can be selected from the list 
provided below. 

According to a fourth aspect of the invention, there is also provided (i) a 
composition of matter containing a polynucleotide of the invention, together 
with a diluent or carrier; (ii) a pharmaceutical composition containing a 
therapeutically or prophylactically effective amount of a polynucleotide of the 
invention; (iii) a method for inducing an immune response against 
Helicobacter in a mammal, by administering to the mammal an 
immunogenically effective amount of a polynucleotide of the invention to elicit 
an immune response, e.g., a protective immune response to Helicobacter; and 
(iv) a method for preventing and/or treating a Helicobacter (e.g., H. pylori, H. 
felis, H. mustelae, or H. heilmanii) infection, by administering a prophylactic or 
therapeutic amount of a polynucleotide of the invention to an individual in need 
of such treatment. Additionally, the fourth aspect of the invention encompasses 
the use of a polynucleotide of the invention in the preparation of a medicament 
for preventing and/or treating Helicobacter infection. The fourth aspect of the 
invention preferably includes the use of a polynucleotide molecule placed 
under conditions for expression in a mammalian cell, e.g., in a plasmid that is 
unable to replicate in mammalian cells and to substantially integrate into a 
mammalian genome. 

Polynucleotides (for example, DNA or RNA molecules) of the invention 
can also be administered as such to a mammal as a vaccine. When a DNA 
molecule of the invention is used, it can be in the form of a plasmid that is 
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Non-toxicogenic Vibrio cholerae mutant strains that can be used in live 
oral vaccines are described by Mekalanos et al. (Nature 306:551, 1983) and in 
U.S. Patent No. 4,882,278 (strain in which a substantial amount of the coding 
sequence of each of the two ctxA alleles has been deleted so that no functional 
cholera toxin is produced); WO 92/11354 (strain in which the irgA locus is 
inactivated by mutation; this mutation can be combined in a single strain with 
ctxA mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and 
attRS1 DNA sequences). These strains can be genetically engineered to 
express heterologous antigens, as described in WO 94/19482. An effective 
vaccine dose of a V. cholerae strain capable of expressing a polypeptide or 
polypeptide derivative encoded by a polynucleotide molecule of the invention 
can contain, e.g., about 1x10^ to about 1x10^, preferably about 1x10^ to about 
1x10^ viable bacteria in an appropriate volume for the selected route of 
administration. Preferred routes of administration include all mucosal routes, 
but, most preferably, these vectors are administered intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically engineered for 
recombinant expression of heterologous antigens, and their use as oral 
vaccines, are described by Nakayama et al. (Bio/Technology 6:693, 1988) and 
in WO 92/11361. Preferred routes of administration for these vectors include 
all mucosal routes. Most preferably, the vectors are administered intranasally 
or orally. 

Other bacterial strains useful as vaccine vectors are described by High 
et al. (EMBO 11:1991, 1992) and Sizemore et al. (Science 270:299, 1995; 
Shigella flexneri); Medaglini et al. (Proc. Natl. Acad. Sci. USA 92:6868, 1995; 
Streptococcus gordonii); Flynn (Cell. Mol. Biol. 40 (suppl. I):31, 1994), and in 
WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO 92/21376 
(Bacille Calmette Guerin). In bacterial vectors, a polynucleotide of the 
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intervals. The appropriate dosage depends on various parameters that are 
understood by those skilled in the art, such as the nature of the vaccine vector 
itself, the route of administration, and the condition of the mammal to be 
vaccinated (e.g., the weight, age, and general health of the mammal). 

Live vaccine vectors that can be used in the invention include viral 
vectors, such as adenoviruses and poxviruses, as well as bacterial vectors, e.g., 
Shigella, Salmonella, Vibrio cholerae, Lactobacillus, Bacille bilié de 
Calmette-Guérin (BCG), and Streptococcus. An example of an adenovirus vector, as 
well as a method for constructing an adenovirus vector capable of expressing a 
polynucleotide molecule of the invention, is described in U.S. Patent No. 
4,920,209. Poxvirus vectors that can be used in the invention include, e.g., 
vaccinia and canary pox viruses, which are described in U.S. Patent No. 
4,722,848 and U.S. Patent No. 5,364,773, respectively (also see, e.g., Tartaglia 
et al., Virology 188:217, 1992, for a description of a vaccinia virus vector, and 
Taylor et al., Vaccine 13:539, 1995, for a description of a canary poxvirus 
vector). Poxvirus vectors capable of expressing a polynucleotide of the 
invention can be obtained by homologous recombination, as described in Kieny 
et al. (Nature 312:163, 1984) so that the polynucleotide of the invention is 
inserted in the viral genome under appropriate conditions for expression in 
mammalian cells. Generally, the dose of viral vector vaccine, for therapeutic 
or prophylactic use, can be from about 1x10"^ to about 1x10^*, advantageously 
from about 1x10^ to about 1x10'^, or, preferably, from about 1x10^ to about 
1x10^ plaque-forming units per kilogram. Preferably, viral vectors are 
administered parenterally, for example, in 3 doses that are 4 weeks apart. 
Those skilled in the art will recognize that it is preferable to avoid adding a 
chemical adjuvant to a composition containing a viral vector of the invention, 
thereby minimizing the immune response to the viral vector itself. 
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administering a prophylactic or therapeutic amount of a vaccine vector of the 
invention to an individual in need. Additionally, the third aspect of the 
invention encompasses the use of a vaccine vector of the invention in the 
preparation of a medicament for preventing and/or treating Helicobacter 
infection. 

A vaccine vector of the invention can express one or several 
polypeptides or derivatives of the invention, as well as at least one additional 
Helicobacter antigen such as a urease apoenzyme or a subunit, fragment, 
homolog, mutant, or derivative thereof. In addition, it can express a cytokine, 
such as interleukin-2 (IL-2) or interleukin-12 (IL-12), that enhances the 
immune response. Thus, a vaccine vector can include additional 
polynucleotide molecules encoding, e.g., urease subunit A, B, or both, or a 
cytokine, placed under the control of elements required for expression in a 
mammalian cell. 

Alternatively, a composition of the invention can include several vaccine 
vectors, each of which is capable of expressing a polypeptide or derivative 
of the invention. A composition can also contain a vaccine vector capable of 
expressing an additional Helicobacter antigen such as urease apoenzyme, a 
subunit, fragment, homolog, mutant, or derivative thereof, or a cytokine such as 
IL-2 or IL-12. 

In vaccination methods for treating or preventing infection in a mammal, 
a vaccine vector of the invention can be administered by any conventional route 
in use in the vaccine field, for example, to a mucosal (e.g., ocular, intranasal, 
oral, gastric, pulmonary, intestinal, rectal, vaginal, or urinary tract) surface or 
via a parenteral (e.g., subcutaneous, intradermal, intramuscular, intravenous, or 
intraperitoneal) route. Preferred routes depend upon the choice of the vaccine 
vector. The administration can be achieved in a single dose or repeated at 
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from the supernatant after centrifugation of the cell culture. Typically, the 
recombinant polypeptide can be purified by antibody-based affinity purification 
or by any other method known to a person skilled in the art, such as by genetic 
fusion to a small affinity-binding domain. Antibody-based affinity purification 
methods are also available for purifying a polypeptide of the invention 
extracted from a Helicobacter strain. Antibodies useful for immunoaffinity 
purification of the polypeptides of the invention can be obtained using methods 
described below. 

Polynucleotides of the invention can also be used in DNA vaccination 
methods, using either a viral or bacterial host as gene delivery vehicle (live 
vaccine vector) or administering the gene in a free form, e.g., inserted into a 
plasmid. Therapeutic or prophylactic efficacy of a polynucleotide of the 
invention can be evaluated as is described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a 
vaccine vector such as a poxvirus, containing a polynucleotide molecule of the 
invention placed under the control of elements required for expression; (ii) a 
composition of matter containing a vaccine vector of the invention, together 
with a diluent or carrier; (iii) a pharmaceutical composition containing a 
therapeutically or prophylactically effective amount of a vaccine vector of the 
invention; (iv) a method for inducing an immune response against Helicobacter 
in a mammal (e.g., a human; alternatively, the method can be used in veterinary 
applications for treating or preventing Helicobacter infection of animals, e.g., 
cats or birds), which involves administering to the mammal an 
immunogenically effective amount of a vaccine vector of the invention to elicit 
an immune response, e.g., a protective or therapeutic immune response to 
Helicobacter; and (v) a method for preventing and/or treating a Helicobacter 
(e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii) infection, which involves 
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be homologous or heterologous to the polynucleotide molecule encoding the 
mature polypeptide and it can be specific to the secretion apparatus of the host 
used for expression. The open reading frame constituted by the polynucleotide 
molecule of the invention, alone or together with the signal peptide, is placed 
under the control of the promoter so that transcription and translation occur in 
the host system. Promoters and signal peptide-encoding regions are widely 
known and available to those skilled in the art and include, for example, the 
promoter of Salmonella typhimurium (and derivatives) that is inducible by 
arabinose (promoter araB) and is functional in Gram-negative bacteria such as 
E. coli (U.S. Patent No. 5,028,530; Cagnon et al., Protein Engineering 
4(7):843, 1991); the promoter of the bacteriophage T7 RNA polymerase gene, 
which is functional in a number of E. coli strains expressing T7 polymerase 
(U.S. Patent No. 4,952,496); the OspA lipidation signal peptide; and RlpB 
lipidation signal peptide (Takase et al., J. Bact. 169:5692, 1987). 

The expression cassette is typically part of an expression vector, which 
is selected for its ability to replicate in the chosen expression system. 
Expression vectors (e.g., plasmids or viral vectors) can be chosen from, for 
example, those described in Pouwels et al. (Cloning Vectors: A Laboratory 
Manual, 1985, Supp. 1987) and can be purchased from various commercial 
sources. Methods for transforming or transfecting host cells with expression 
vectors are well known in the art and will depend on the host system selected, 
as described in Ausubel et al. (supra). 

Upon expression, a recombinant polypeptide of the invention (or a 
polypeptide derivative) is produced and remains in the intracellular 
compartment, is secreted/excreted in the extracellular medium or in the 
periplasmic space, or is embedded in the cellular membrane. The polypeptide 
can then be recovered in a substantially purified form from the cell extract or 
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invention; (iii) a procaryotic or eucaryotic cell transformed or transfected with 
an expression cassette and/or vector of the invention, as well as (iv) a process 
for producing a polypeptide or polypeptide derivative encoded by a 
polynucleotide of the invention, which involves culturing a procaryotic or 
eucaryotic cell transformed or transfected with an expression cassette and/or 
vector of the invention, under conditions that allow expression of the 
polynucleotide molecule of the invention, and recovering the encoded 
polypeptide or polypeptide derivative from the cell culture. 

A recombinant expression system can be selected from procaryotic and 
eucaryotic hosts. Eucaryotic hosts include, for example, yeast cells (e.g., 
Saccharomyces cerevisiae or Pichia pastoris), mammalian cells (e.g., COS1,
NIH3T3, or JEG3 cells), arthropod cells (e.g., Spodoptera frugiperda (Sf9)
cells), and plant cells. Preferably, a procaryotic host such as E. coli is used.
Bacterial and eucaryotic cells are available from a number of different sources 
that are known to those skilled in the art, e.g., the American Type Culture 
Collection (ATCC; Rockville, Maryland). 

The choice of the expression cassette will depend on the host system 
selected, as well as the features desired for the expressed polypeptide. For 
example, it may be useful to produce a polypeptide of the invention in a
particular lipidated form or any other form. Typically, an expression cassette 
includes a constitutive or inducible promoter that is functional in the selected 
host system; a ribosome binding site; a start codon (ATG); if necessary, a 
region encoding a signal peptide, e.g., a lipidation signal peptide; a 
polynucleotide molecule of the invention; a stop codon; and, optionally, a 3' 
terminal region (translation and/or transcription terminator). The signal 
peptide-encoding region is adjacent to the polynucleotide of the invention and 
is placed in the proper reading frame. The signal peptide-encoding region can 

calculated Tm. Those skilled in the art will understand that optimal
temperature and salt conditions can be readily determined empirically in 
preliminary experiments using conventional procedures. For example, 
stringent conditions can be achieved, both for pre-hybridizing and hybridizing 
incubations, (i) within 4-16 hours at 42 °C, in 6 x SSC containing 
50% formamide or (ii) within 4-16 hours at 65 °C in an aqueous 6 x SSC 
solution (1 M NaCl, 0.1 M sodium citrate (pH 7.0)). For polynucleotides 
containing 30 to 600 nucleotides, the above formula is used and then is 
corrected by subtracting (600/polynucleotide size in base pairs). Stringency 
conditions are defined by a Th that is 5 to 10 °C below Tm. 
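
The melting-temperature formula referred to in this paragraph (Casey et al.), together with the 600/size correction for polynucleotides of 30 to 600 nucleotides, can be sketched as follows. This is a minimal illustration only: the coefficients are those printed in this text, the logarithm is assumed to be base 10 with the ion concentration in mol/L, and the function names are hypothetical.

```python
import math


def melting_temperature(gc_percent, cation_molar, formamide_percent=0.0, length_bp=None):
    """Estimate Tm in degrees C with the coefficients given in the text.

    Assumptions (not stated in the source): log is base 10 and the
    positive ion concentration is expressed in mol/L.
    """
    tm = (81.5
          + 0.5 * gc_percent
          + 1.6 * math.log10(cation_molar)
          - 0.6 * formamide_percent)
    # Correction described for polynucleotides of 30 to 600 nucleotides.
    if length_bp is not None and 30 <= length_bp <= 600:
        tm -= 600.0 / length_bp
    return tm


def hybridization_window(tm, low=5.0, high=10.0):
    """Return the (min, max) Th range lying 5 to 10 degrees C below Tm."""
    return (tm - high, tm - low)


if __name__ == "__main__":
    tm = melting_temperature(gc_percent=50, cation_molar=1.0, length_bp=300)
    print(tm, hybridization_window(tm))
```

Passing length_bp=None leaves the value uncorrected, which matches the text's use of the plain formula for longer polynucleotides.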

Hybridization conditions with oligonucleotides shorter than 20-30 bases 
do not precisely follow the rules set forth above. In such cases, the formula for 
calculating the Tm is as follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 
18 nucleotide fragment of 50% G+C would have an approximate Tm of 54 °C.
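
The short-oligonucleotide rule above, Tm = 4 x (G+C) + 2 x (A+T), can be checked with a few lines of code. The sketch below is illustrative only; the function name and the example sequence are hypothetical and are not taken from the sequence listing.

```python
def short_oligo_tm(sequence):
    """Approximate Tm (degrees C) of a short oligonucleotide: 4*(G+C) + 2*(A+T)."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at != len(seq):
        raise ValueError("sequence must contain only A, C, G, and T")
    return 4 * gc + 2 * at


if __name__ == "__main__":
    # Hypothetical 18-mer with 9 G/C and 9 A/T residues (50% G+C).
    print(short_oligo_tm("GCGCGCGCGATATATATA"))
```

For the 18-nucleotide, 50% G+C case described in the text, the calculation is 4 x 9 + 2 x 9 = 54.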

A polynucleotide molecule of the invention, containing RNA, DNA, or 
modifications or combinations thereof, can have various applications. For 
example, a polynucleotide molecule can be used (i) in a process for producing 
the encoded polypeptide in a recombinant host system, (ii) in the construction 
of vaccine vectors such as poxviruses, which are further used in methods and
compositions for preventing and/or treating Helicobacter infection, (iii) as a
vaccine agent, in a naked form or formulated with a delivery vehicle, and (iv)
in the construction of attenuated Helicobacter strains that can over-express a 
polynucleotide of the invention or express it in a non-toxic, mutated form. 

According to a second aspect of the invention, there is therefore 
provided (i) an expression cassette containing a polynucleotide molecule of the 
invention placed under the control of elements (e.g., a promoter) required for 
expression; (ii) an expression vector containing an expression cassette of the 

toxin or E. coli heat-labile toxin. Several possibilities can be used for
producing such fusion proteins. First, the polypeptide of the invention can be 
fused to the N-terminal end or, preferably, to the C-terminal end of the 
polypeptide having adjuvant activity. Second, a polypeptide fragment of the 
invention can be fused within the amino acid sequence of the polypeptide 
having adjuvant activity. Spacer sequences can also be included, if desired. 

As stated above, the polynucleotides of the invention encode 
Helicobacter polypeptides in precursor or mature form. They can also encode 
hybrid precursors containing heterologous signal peptides, which can mature 
into polypeptides of the invention. By "heterologous signal peptide" is meant a 
signal peptide that is not found in the naturally-occurring precursor of a 
polypeptide of the invention. 

A polynucleotide of the invention hybridizes, preferably under stringent 
conditions, to a polynucleotide having a sequence as shown in the sequence 
listing (odd numbers, up to SEQ ID NO: 1363). Hybridization procedures are, 
e.g., described by Ausubel et al. (supra); Silhavy et al. (Experiments with Gene
Fusions, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New
York, 1984); and Davis et al. (A Manual for Genetic Engineering: Advanced
Bacterial Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
New York, 1980). Important parameters that can be considered for optimizing
hybridization conditions are reflected in the following formula, which
facilitates calculation of the melting temperature (Tm), which is the
temperature above which two complementary DNA strands separate from one
another (Casey et al., Nucl. Acid Res. 4:1539, 1977): Tm = 81.5 + 0.5 x (%
G+C) + 1.6 log (positive ion concentration) - 0.6 x (% formamide). Under
appropriate stringency conditions, hybridization temperature (Th) is
approximately 20 to 40 °C, 20 to 25 °C, or, preferably, 30 to 40 °C below the

as murine mammary tumor virus (peptide containing 11 amino acids; Dion et
al., Virology 179:474-477, 1990), Semliki Forest virus (peptide containing 16
amino acids; Snijders et al., J. Gen. Virol. 72:557-565, 1991), and canine
parvovirus (2 overlapping peptides, each containing 15 amino acids; Langeveld
et al., Vaccine 12(15):1473-1480, 1994) have been shown to be effective
vaccine antigens against their respective pathogens.

Polynucleotides encoding polypeptide fragments and polypeptides
having large internal deletions can be constructed using standard methods (see,
e.g., Ausubel et al., Current Protocols in Molecular Biology, John Wiley &
Sons Inc., 1994), for example, by PCR, including inverse PCR, by restriction
enzyme treatment of the cloned DNA molecules, or by the method of Kunkel et
al. (Proc. Natl. Acad. Sci. USA 82:448, 1985; biological material available at
Stratagene).

A polypeptide derivative can also be produced as a fusion polypeptide
that contains a polypeptide or a polypeptide derivative of the invention fused,
e.g., at the N- or C-terminal end, to any other polypeptide (hereinafter referred
to as a peptide tail). Such a product can be easily obtained by translation of a
genetic fusion, i.e., a hybrid gene. Vectors for expressing fusion polypeptides
are commercially available, and include the pMal-c2 or pMal-p2 systems of
New England Biolabs, in which the peptide tail is a maltose binding protein,
the glutathione-S-transferase system of Pharmacia, or the His-Tag system
available from Novagen. These and other expression systems provide
convenient means for further purification of polypeptides and derivatives of the
invention.

Another particular example of fusion polypeptides included in the invention
includes a polypeptide or polypeptide derivative of the invention fused to a
polypeptide having adjuvant activity, such as, e.g., subunit B of either cholera

at least 20 amino acids, preferably at least 50 amino acids, more preferably at
least 75 amino acids, and most preferably at least 100 amino acids in length. 

Useful polypeptide derivatives, e.g., polypeptide fragments, can be 
designed using computer-assisted analysis of amino acid sequences in order to 
identify sites in protein antigens having potential as surface-exposed, antigenic 
regions (Hughes et al., Infect. Immun. 60(9):3497, 1992). For example, the
Laser Gene Program from DNA Star can be used to obtain hydrophilicity, 
antigenic index, and intensity index plots for the polypeptides of the invention. 
This program can also be used to obtain information about homologies of the 
polypeptides with known protein motifs. One skilled in the art can readily use 
the information provided in such plots to select peptide fragments for use as 
vaccine antigens. For example, fragments spanning regions of the plots in 
which the antigenic index is relatively high can be selected. One can also 
select fragments spanning regions in which both the antigenic index and the 
intensity plots are relatively high. Fragments containing conserved sequences, 
particularly hydrophilic conserved sequences, can also be selected. 

Polypeptide fragments and polypeptides having large internal deletions 
can be used for revealing epitopes that are otherwise masked in the parent 
polypeptide and that may be of importance for inducing a protective T cell- 
dependent immune response. Deletions can also remove immunodominant 
regions of high variability among strains. 

It is an accepted practice in the field of immunology to use fragments 
and variants of protein immunogens as vaccines, as all that is required to induce 
an immune response to a protein is a small (e.g., 8 to 10 amino acids) 
immunogenic region of the protein. This has been done for a number of
vaccines against pathogens other than Helicobacter. For example, short 
synthetic peptides corresponding to surface-exposed antigens of pathogens such 

Those skilled in the art can readily design primers that can be used to isolate
the polynucleotides of the invention from different Helicobacter strains.
Experimental conditions for carrying out PCR can readily be determined by
one skilled in the art, and an illustration of carrying out PCR is provided in
Example 2. As is well known in the art, restriction endonuclease recognition
sites that contain, typically, 4 to 6 nucleotides (for example, the sequences
5'-GGATCC-3' (BamHI) or 5'-CTCGAG-3' (XhoI)), can be included on the 5'
ends of the primers. Restriction sites can be selected by those skilled in the art
so that the amplified DNA can be conveniently cloned into an appropriately
digested vector, such as a plasmid.
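
As a hedged illustration of placing a restriction site on the 5' end of a primer, the sketch below prepends the BamHI (GGATCC) or XhoI (CTCGAG) recognition sequence to a gene-specific primer. The function name and the example primer are hypothetical; the example is not drawn from the sequence listing or from Example 2.

```python
# Recognition sequences (5'->3') for the two enzymes named in the text.
RESTRICTION_SITES = {
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
}


def add_restriction_site(primer, enzyme):
    """Return the primer with the enzyme's recognition site added at its 5' end."""
    site = RESTRICTION_SITES[enzyme]  # KeyError for enzymes not listed above
    return site + primer.upper()


if __name__ == "__main__":
    # Hypothetical gene-specific primer.
    print(add_restriction_site("ATGAAAGCTCTGACC", "BamHI"))
```

In practice, a few extra bases are often added upstream of the site so the enzyme cuts efficiently near the end of the amplified DNA; that refinement is not described in the text and is left out here.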

Useful homologs that do not occur naturally can be designed using 
known methods for identifying regions of an antigen that are likely to be 
tolerant of amino acid sequence changes and/or deletions. For example, 
sequences of the antigen from different species can be compared to identify 
conserved sequences.

Polypeptide derivatives that are encoded by polynucleotides of the
invention include, e.g., fragments, polypeptides having large internal deletions 
derived from full-length polypeptides, and fusion proteins. Polypeptide 
fragments of the invention can be derived from a polypeptide having a 
sequence homologous to any of the sequences of the sequence listing (even 
numbers, up to SEQ ID NO: 1364), to the extent that the fragments retain the 
substantial antigenicity of the parent polypeptide (specific antigenicity). 
Polypeptide derivatives can also be constructed by large internal deletions that
remove a substantial part of the parent polypeptide, while retaining specific
antigenicity. Generally, polypeptide derivatives should be at least about
12 amino acids in length to maintain antigenicity. Advantageously, they can be

UreA + UreB apoenzyme expressed from pORV214 (UreA and UreB
sequences differ from H. pylori strain CPM630 by one and two amino acid
changes, respectively; Lee et al., supra, 1995); a
UreA-glutathione-S-transferase fusion protein (UreA sequence from H. pylori
strain ATCC 43504; Thomas et al., Acta Gastro-Enterologica Belgica 56:54,
1993); UreA + UreB holoenzyme purified from H. pylori strain NCTC 11637
(Marchetti et al., Science 267:1655, 1995); a UreA-MBP fusion protein (UreA
from H. pylori strain 85P; Ferrero et al., Infection and Immunity 62:4981,
1994); a UreB-MBP fusion protein (UreB from H. pylori strain 85P; Ferrero et
al., supra); a UreA-MBP fusion protein (UreA from H. felis strain ATCC 49179;
Ferrero et al., supra); a UreB-MBP fusion protein (UreB from H. felis strain
ATCC 49179; Ferrero et al., supra); and a 37 kDa fragment of UreB containing
amino acids 220-569 (Dore-Davin et al., "A 37 kD fragment of UreB is
sufficient to confer protection against Helicobacter felis infection in mice").
Finally, Thomas et al. (supra) showed that oral immunization of mice with
crude sonicates of H. pylori protected mice from subsequent challenge with
H. felis.

Polynucleotides, e.g., DNA molecules, encoding allelic variants can 
easily be obtained by polymerase chain reaction (PCR) amplification of 
genomic bacterial DNA extracted by conventional methods. This involves the 
use of synthetic oligonucleotide primers matching sequences that are upstream 
and downstream of the 5' and 3' ends of the coding region. Suitable primers 
can be designed based on the nucleotide sequence information provided in the 
sequence listing (odd numbers, up to SEQ ID NO: 1363). Typically, a primer
consists of 10 to 40, preferably 15 to 25 nucleotides. It can also be 
advantageous to select primers containing C and G nucleotides in proportions 
sufficient to ensure efficient hybridization, e.g., an amount of C and G 
nucleotides of at least 40%, preferably 50%, of the total nucleotide amount. 
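
The primer guidelines above (10 to 40 nucleotides, preferably 15 to 25, with at least 40% C and G) lend themselves to a simple screening check. The following sketch is illustrative; the function name, the returned dictionary keys, and the example primer are hypothetical.

```python
def check_primer(primer, min_len=10, max_len=40, min_gc_fraction=0.40):
    """Check a candidate primer against the length and G+C guidelines in the text."""
    seq = primer.upper()
    if not seq:
        raise ValueError("primer must not be empty")
    gc_fraction = (seq.count("G") + seq.count("C")) / len(seq)
    return {
        "length_ok": min_len <= len(seq) <= max_len,
        "preferred_length": 15 <= len(seq) <= 25,  # preferred range in the text
        "gc_fraction": gc_fraction,
        "gc_ok": gc_fraction >= min_gc_fraction,
    }


if __name__ == "__main__":
    # Hypothetical 20-nucleotide primer with 11 G/C residues.
    print(check_primer("ATGGCTAAAGGCCTGACCGA"))
```

A primer that fails either check is not necessarily unusable; the thresholds only encode the proportions the text describes as advantageous.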

Allelic variants are very common in nature. For example, a bacterial
species, e.g., H. pylori, is usually represented by a variety of strains that differ
from each other by minor allelic variations. Indeed, a polypeptide that fulfills 
the same biological function in different strains can have an amino acid 
sequence that is not identical in each of the strains. Such an allelic variation 
can be equally reflected at the polynucleotide level. 

Support for the use of allelic variants of polypeptide antigens comes 
from, e.g., studies of the Helicobacter urease antigen. The amino acid 
sequence of Helicobacter urease varies widely from species to species, yet 
cross-species protection occurs, indicating that the urease molecule, when used 
as an immunogen, is highly tolerant of amino acid variations. Even among 
different strains of the single species H. pylori, there are amino acid sequence 
variations. 

For example, although the amino acid sequences of the UreA and UreB
subunits of H. pylori and H. felis ureases differ from one another by 26.5% and
11.8%, respectively (Ferrero et al., Molecular Microbiology 9(2):323-333,
1993), it has been shown that H. pylori urease protects mice from H. felis
infection (Michetti et al., Gastroenterology 107:1002, 1994). In addition, it has
been shown that the individual structural subunits of urease, UreA and UreB,
which contain distinct amino acid sequences, are both protective antigens
against Helicobacter infection (Michetti et al., supra). Similarly, Cuenca et al.
(Gastroenterology 110:1770, 1996) showed that therapeutic immunization of
H. mustelae-infected ferrets with H. pylori urease was effective at eradicating
H. mustelae infection. Further, several urease variants have been reported to be
effective vaccine antigens, including, e.g., recombinant UreA + UreB
apoenzyme expressed from pORV142 (UreA and UreB sequences derived from
H. pylori strain CPM630; Lee et al., J. Infect. Dis. 172:161, 1995); recombinant

University of Wisconsin Biotechnology Center, 1710 University Avenue,
Madison, WI 53705). Similar amino acid sequences are aligned to obtain the
maximum degree of homology (i.e., identity). To this end, it may be necessary
to artificially introduce gaps into the sequence. Once the optimal alignment has
been set up, the degree of homology (i.e., identity) is established by recording
all of the positions in which the amino acids of both sequences are identical, 
relative to the total number of positions. 
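
The identity calculation described above can be sketched for two sequences that have already been aligned, with gaps written as "-". This is a minimal illustration under stated assumptions: the alignment itself is taken as given (no aligner is implemented), "total number of positions" is assumed to mean every column of the alignment including gapped ones, and the function name and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment must not be empty")
    # A column counts as identical only if the residues match and are not gaps.
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap)
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Hypothetical 7-column alignment with one gap and one mismatch.
    print(round(percent_identity("MKT-LLV", "MKTALIV"), 1))
```

Other conventions divide by the length of the shorter sequence or exclude gapped columns; the choice of denominator changes the reported percentage and should be stated alongside any threshold.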

Homologous polynucleotide sequences are defined in a similar way. 
Preferably, a homologous sequence is one that is at least 45%, more preferably 
at least 60%, and most preferably at least 85% identical to a coding sequence of 
any of the nucleotide sequences set forth in the sequence listing (odd numbers, 
up to SEQ ID NO: 1363). 

Polypeptides having a sequence homologous to any one of the sequences 
shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), include 
naturally-occurring allelic variants, as well as mutants or any other non- 
naturally occurring variants that are analogous in terms of antigenicity, to a 
polypeptide having a sequence as shown in the sequence listing (even numbers, 
up to SEQ ID NO: 1364). 

As is known in the art, an allelic variant is an alternate form of a 
polypeptide that is characterized as having a substitution, deletion, or addition 
of one or more amino acids that does not alter the biological function of the
polypeptide. By "biological function" is meant a function of the polypeptide in 
the cells in which it naturally occurs, even if the function is not necessary for 
the growth or survival of the cells. For example, the biological function of a 
porin is to allow the entry into cells of compounds present in the extracellular 
medium. The biological function is distinct from the antigenic function. A 
polypeptide can have more than one biological function. 

By "homologous amino acid sequence" is meant an amino acid sequence
that differs from an amino acid sequence shown in the sequence listing (even
numbers, up to SEQ ID NO: 1364), or an amino acid sequence encoded by a
nucleotide sequence shown in the sequence listing (odd numbers, up to SEQ ID
NO: 1363), by one or more non-conservative amino acid substitutions,
deletions, or additions located at positions at which they do not destroy the
specific antigenicity of the polypeptide. Preferably, such a sequence is at least
75%, more preferably at least 80%, and most preferably at least 90% identical
to an amino acid sequence shown in the sequence listing (even numbers, up to
SEQ ID NO: 1364). Homologous amino acid sequences include sequences that
are identical or substantially identical to an amino acid sequence as shown in
the sequence listing (even numbers, up to SEQ ID NO: 1364). By "amino acid
sequence that is substantially identical" is meant a sequence that is at least
90%, preferably at least 95%, more preferably at least 97%, and most
preferably at least 99% identical to an amino acid sequence of reference and
that differs from the sequence of reference, if at all, by a majority of
conservative amino acid substitutions.

Conservative amino acid substitutions typically include substitutions
among amino acids of the same class. These classes include, for example,
amino acids having uncharged polar side chains, such as asparagine, glutamine,
serine, threonine, and tyrosine; amino acids having basic side chains, such as
lysine, arginine, and histidine; amino acids having acidic side chains, such as
aspartic acid and glutamic acid; and amino acids having nonpolar side chains,
such as glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine,
methionine, tryptophan, and cysteine.
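
The four side-chain classes listed above can be encoded directly to test whether a given substitution is conservative in the sense used here. The sketch below uses one-letter amino acid codes; the names SIDE_CHAIN_CLASSES, side_chain_class, and is_conservative are hypothetical.

```python
# Side-chain classes as listed in the text, in one-letter code.
SIDE_CHAIN_CLASSES = {
    "uncharged polar": set("NQSTY"),   # Asn, Gln, Ser, Thr, Tyr
    "basic": set("KRH"),               # Lys, Arg, His
    "acidic": set("DE"),               # Asp, Glu
    "nonpolar": set("GAVLIPFMWC"),     # Gly, Ala, Val, Leu, Ile, Pro, Phe, Met, Trp, Cys
}


def side_chain_class(residue):
    """Return the class name for a one-letter amino acid code."""
    for name, members in SIDE_CHAIN_CLASSES.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"unknown amino acid: {residue!r}")


def is_conservative(original, replacement):
    """True if both residues fall in the same side-chain class."""
    return side_chain_class(original) == side_chain_class(replacement)


if __name__ == "__main__":
    print(is_conservative("K", "R"), is_conservative("D", "K"))
```

Note that an unchanged residue also returns True, so a caller counting substitutions should first skip positions where the two residues are identical.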

Homology can be measured using sequence analysis software (e.g.,
Sequence Analysis Software Package of the Genetics Computer Group,

naturally-occurring DNA molecule present in the genome of a living bacterium or
as part of a gene bank is not isolated, but the same molecule, separated from the
remaining part of the bacterial genome, as a result of, e.g., a cloning event
(amplification), is "isolated." Typically, an isolated DNA molecule is free
from DNA regions (e.g., coding regions) with which it is immediately
contiguous, at the 5' or 3' ends, in the naturally occurring genome. Such
isolated polynucleotides can be part of a vector or a composition and still be
isolated, as such a vector or composition is not part of its natural environment.

A polynucleotide of the invention can consist of RNA or DNA (e.g., 
cDNA, genomic DNA, or synthetic DNA), or modifications or combinations of 
RNA or DNA. The polynucleotide can be double-stranded or single-stranded 
and, if single-stranded, can be the coding (sense) strand or the non-coding (anti- 
sense) strand. The sequences that encode polypeptides of the invention, as 
shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), can be 
(a) the coding sequence as shown in any of the nucleotide sequences of the 
sequence listing (odd numbers, up to SEQ ID NO: 1363); (b) a ribonucleotide 
sequence derived by transcription of (a); or (c) a different coding sequence that, 
as a result of the redundancy or degeneracy of the genetic code, encodes the 
same polypeptides as the polynucleotide molecules having the sequences 
illustrated in any of the nucleotide sequences of the sequence listing (odd 
numbers, up to SEQ ID NO: 1363). The polypeptide can be one that is naturally 
secreted or excreted by, e.g., H. felis, H. mustelae, H. heilmannii, or H. pylori.
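
Item (c) above relies on the degeneracy of the genetic code: different coding sequences can encode the same polypeptide. The sketch below illustrates this with the standard genetic code; the two short DNA sequences are hypothetical and are not taken from the sequence listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; '*' marks stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence, read in frame from its first base."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Two different (hypothetical) coding sequences for the same tripeptide.
    print(translate("ATGAAAGCT"), translate("ATGAAGGCC"))
```

Both sequences translate to Met-Lys-Ala because AAA/AAG both encode lysine and GCT/GCC both encode alanine.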

By "polypeptide" or "protein" is meant any chain of amino acids, 
regardless of length or post-translational modification (e.g., glycosylation or 
phosphorylation). Both terms are used interchangeably in the present 
application. 

NO:1299), GHPO 1613 (SEQ ID NO:1301), GHPO 1614 (SEQ ID NO:1303),
GHPO 1626 (SEQ ID NO:1305), GHPO 1628 (SEQ ID NO:1307), GHPO
1639 (SEQ ID NO:1309), GHPO 1640 (SEQ ID NO:1311), GHPO 1641 (SEQ
ID NO:1313), GHPO 1646 (SEQ ID NO:1315), GHPO 1662 (SEQ ID
NO:1317), GHPO 1667 (SEQ ID NO:1319), GHPO 1668 (SEQ ID NO:1321),
GHPO 1670 (SEQ ID NO:1323), GHPO 1671 (SEQ ID NO:1325), GHPO
1672 (SEQ ID NO:1327), GHPO 1678 (SEQ ID NO:1329), GHPO 1684 (SEQ
ID NO:1331), GHPO 1695 (SEQ ID NO:1333), GHPO 1697 (SEQ ID
NO:1335), GHPO 1701 (SEQ ID NO:1337), GHPO 1719 (SEQ ID NO:1339),
GHPO 1723 (SEQ ID NO:1341), GHPO 1732 (SEQ ID NO:1343), GHPO
1739 (SEQ ID NO:1345), GHPO 1741 (SEQ ID NO:1347), GHPO 1747 (SEQ
ID NO:1349), GHPO 1749 (SEQ ID NO:1351), GHPO 1750 (SEQ ID
NO:1353), GHPO 1751 (SEQ ID NO:1355), GHPO 1755 (SEQ ID NO:1357),
GHPO 1771 (SEQ ID NO:1359), GHPO 1786 (SEQ ID NO:1361), and GHPO
1789 (SEQ ID NO:1363).

An isolated polynucleotide of the invention encodes (i) a polypeptide
having an amino acid sequence that is homologous to a Helicobacter amino
acid sequence of a polypeptide, the Helicobacter amino acid sequence being
selected from the group consisting of the amino acid sequences shown in the
sequence listing (even numbers, up to SEQ ID NO: 1364), or (ii) a derivative of
the polypeptide.

In addition to the full-length polypeptides encoded by the 
polynucleotides of the invention, as set forth above, polynucleotides included in 
the invention can also encode polypeptides that lack signal sequences, as well 
as other polypeptide or peptide fragments of the full-length polypeptides. 

The term "isolated polynucleotide" is defined as a polynucleotide that is
removed from the environment in which it naturally occurs. For example, a

GHPO 1240 (SEQ ID NO:1179), GHPO 1246 (SEQ ID NO:1181), GHPO
1251 (SEQ ID NO:1183), GHPO 1252 (SEQ ID NO:1185), GHPO 1261 (SEQ
ID NO:1187), GHPO 1265 (SEQ ID NO:1189), GHPO 1267 (SEQ ID
NO:1191), GHPO 1278 (SEQ ID NO:1193), GHPO 1282 (SEQ ID NO:1195),
GHPO 1283 (SEQ ID NO:1197), GHPO 1287 (SEQ ID NO:1199), GHPO
1292 (SEQ ID NO:1201), GHPO 1293 (SEQ ID NO:1203), GHPO 1302 (SEQ
ID NO:1205), GHPO 1309 (SEQ ID NO:1207), GHPO 1317 (SEQ ID
NO:1209), GHPO 1318 (SEQ ID NO:1211), GHPO 1321 (SEQ ID NO:1213),
GHPO 1325 (SEQ ID NO:1215), GHPO 1341 (SEQ ID NO:1217), GHPO
1351 (SEQ ID NO:1219), GHPO 1354 (SEQ ID NO:1221), GHPO 1363 (SEQ
ID NO:1223), GHPO 1371 (SEQ ID NO:1225), GHPO 1381 (SEQ ID
NO:1227), GHPO 1401 (SEQ ID NO:1229), GHPO 1402 (SEQ ID NO:1231),
GHPO 1403 (SEQ ID NO:1233), GHPO 1408 (SEQ ID NO:1235), GHPO
1416 (SEQ ID NO:1237), GHPO 1420 (SEQ ID NO:1239), GHPO 1428 (SEQ
ID NO:1241), GHPO 1437 (SEQ ID NO:1243), GHPO 1439 (SEQ ID
NO:1245), GHPO 1460 (SEQ ID NO:1247), GHPO 1463 (SEQ ID NO:1249),
GHPO 1472 (SEQ ID NO:1251), GHPO 1474 (SEQ ID NO:1253), GHPO
1484 (SEQ ID NO:1255), GHPO 1489 (SEQ ID NO:1257), GHPO 1494 (SEQ
ID NO:1259), GHPO 1495 (SEQ ID NO:1261), GHPO 1498 (SEQ ID
NO:1263), GHPO 1499 (SEQ ID NO:1265), GHPO 1500 (SEQ ID NO:1267),
GHPO 1503 (SEQ ID NO:1269), GHPO 1504 (SEQ ID NO:1271), GHPO
1510 (SEQ ID NO:1273), GHPO 1518 (SEQ ID NO:1275), GHPO 1533 (SEQ
ID NO:1277), GHPO 1541 (SEQ ID NO:1279), GHPO 1544 (SEQ ID
NO:1281), GHPO 1548 (SEQ ID NO:1283), GHPO 1565 (SEQ ID NO:1285),
GHPO 1575 (SEQ ID NO:1287), GHPO 1582 (SEQ ID NO:1289), GHPO
1595 (SEQ ID NO:1291), GHPO 1597 (SEQ ID NO:1293), GHPO 1599 (SEQ
ID NO:1295), GHPO 1601 (SEQ ID NO:1297), GHPO 1609 (SEQ ID

NO:1055), GHPO 904 (SEQ ID NO:1057), GHPO 906 (SEQ ID NO:1059),
GHPO 908 (SEQ ID NO:1061), GHPO 921 (SEQ ID NO:1063), GHPO 923
(SEQ ID NO:1065), GHPO 926 (SEQ ID NO:1067), GHPO 933 (SEQ ID
NO:1069), GHPO 939 (SEQ ID NO:1071), GHPO 940 (SEQ ID NO:1073),
GHPO 943 (SEQ ID NO:1075), GHPO 951 (SEQ ID NO:1077), GHPO 961
(SEQ ID NO:1079), GHPO 965 (SEQ ID NO:1081), GHPO 990 (SEQ ID
NO:1083), GHPO 991 (SEQ ID NO:1085), GHPO 998 (SEQ ID NO:1087),
GHPO 1001 (SEQ ID NO:1089), GHPO 1005 (SEQ ID NO:1091), GHPO
1033 (SEQ ID NO:1093), GHPO 1039 (SEQ ID NO:1095), GHPO 1041 (SEQ
ID NO:1097), GHPO 1043 (SEQ ID NO:1099), GHPO 1044 (SEQ ID
NO:1101), GHPO 1051 (SEQ ID NO:1103), GHPO 1058 (SEQ ID NO:1105),
GHPO 1060 (SEQ ID NO:1107), GHPO 1075 (SEQ ID NO:1109), GHPO
1077 (SEQ ID NO:1111), GHPO 1082 (SEQ ID NO:1113), GHPO 1083 (SEQ
ID NO:1115), GHPO 1086 (SEQ ID NO:1117), GHPO 1087 (SEQ ID
NO:1119), GHPO 1090 (SEQ ID NO:1121), GHPO 1097 (SEQ ID NO:1123),
GHPO 1098 (SEQ ID NO:1125), GHPO 1103 (SEQ ID NO:1127), GHPO
1113 (SEQ ID NO:1129), GHPO 1116 (SEQ ID NO:1131), GHPO 1123 (SEQ
ID NO:1133), GHPO 1125 (SEQ ID NO:1135), GHPO 1129 (SEQ ID
NO:1137), GHPO 1130 (SEQ ID NO:1139), GHPO 1134 (SEQ ID NO:1141),
GHPO 1161 (SEQ ID NO:1143), GHPO 1166 (SEQ ID NO:1145), GHPO
1170 (SEQ ID NO:1147), GHPO 1175 (SEQ ID NO:1149), GHPO 1181 (SEQ
ID NO:1151), GHPO 1186 (SEQ ID NO:1153), GHPO 1188 (SEQ ID
NO:1155), GHPO 1191 (SEQ ID NO:1157), GHPO 1193 (SEQ ID NO:1159),
GHPO 1196 (SEQ ID NO:1161), GHPO 1204 (SEQ ID NO:1163), GHPO
1210 (SEQ ID NO:1165), GHPO 1211 (SEQ ID NO:1167), GHPO 1216 (SEQ
ID NO:1169), GHPO 1218 (SEQ ID NO:1171), GHPO 1220 (SEQ ID
NO:1173), GHPO 1223 (SEQ ID NO:1175), GHPO 1226 (SEQ ID NO:1177),

487 (SEQ ID NO:925), GHPO 488 (SEQ ID NO:927), GHPO 489 (SEQ ID
NO:929), GHPO 498 (SEQ ID NO:931), GHPO 501 (SEQ ID NO:933), GHPO
504 (SEQ ID NO:935), GHPO 512 (SEQ ID NO:937), GHPO 517 (SEQ ID
NO:939), GHPO 520 (SEQ ID NO:941), GHPO 528 (SEQ ID NO:943), GHPO
530 (SEQ ID NO:945), GHPO 532 (SEQ ID NO:947), GHPO 548 (SEQ ID
NO:949), GHPO 561 (SEQ ID NO:951), GHPO 564 (SEQ ID NO:953), GHPO
572 (SEQ ID NO:955), GHPO 573 (SEQ ID NO:957), GHPO 574 (SEQ ID
NO:959), GHPO 577 (SEQ ID NO:961), GHPO 579 (SEQ ID NO:963), GHPO
583 (SEQ ID NO:965), GHPO 588 (SEQ ID NO:967), GHPO 593 (SEQ ID
NO:969), GHPO 597 (SEQ ID NO:971), GHPO 598 (SEQ ID NO:973), GHPO
604 (SEQ ID NO:975), GHPO 606 (SEQ ID NO:977), GHPO 611 (SEQ ID
NO:979), GHPO 612 (SEQ ID NO:981), GHPO 615 (SEQ ID NO:983), GHPO
632 (SEQ ID NO:985), GHPO 633 (SEQ ID NO:987), GHPO 637 (SEQ ID
NO:989), GHPO 651 (SEQ ID NO:991), GHPO 663 (SEQ ID NO:993), GHPO
686 (SEQ ID NO:995), GHPO 693 (SEQ ID NO:997), GHPO 698 (SEQ ID
NO:999), GHPO 703 (SEQ ID NO:1001), GHPO 704 (SEQ ID NO:1003),
GHPO 705 (SEQ ID NO:1005), GHPO 707 (SEQ ID NO:1007), GHPO 721
(SEQ ID NO:1009), GHPO 727 (SEQ ID NO:1011), GHPO 728 (SEQ ID
NO:1013), GHPO 733 (SEQ ID NO:1015), GHPO 758 (SEQ ID NO:1017),
GHPO 763 (SEQ ID NO:1019), GHPO 771 (SEQ ID NO:1021), GHPO 774
(SEQ ID NO:1023), GHPO 776 (SEQ ID NO:1025), GHPO 783 (SEQ ID
NO:1027), GHPO 800 (SEQ ID NO:1029), GHPO 806 (SEQ ID NO:1031),
GHPO 807 (SEQ ID NO:1033), GHPO 808 (SEQ ID NO:1035), GHPO 809
(SEQ ID NO:1037), GHPO 811 (SEQ ID NO:1039), GHPO 815 (SEQ ID
NO:1041), GHPO 819 (SEQ ID NO:1043), GHPO 841 (SEQ ID NO:1045),
GHPO 843 (SEQ ID NO:1047), GHPO 846 (SEQ ID NO:1049), GHPO 875
(SEQ ID NO:1051), GHPO 892 (SEQ ID NO:1053), GHPO 902 (SEQ ID

NO:789), GHPO 123 (SEQ ID NO:791), GHPO 124 (SEQ ID NO:793), GHPO
126 (SEQ ID NO:795), GHPO 127 (SEQ ID NO:797), GHPO 128 (SEQ ID
NO:799), GHPO 131 (SEQ ID NO:801), GHPO 133 (SEQ ID NO:803), GHPO
140 (SEQ ID NO:805), GHPO 141 (SEQ ID NO:807), GHPO 145 (SEQ ID
NO:809), GHPO 147 (SEQ ID NO:811), GHPO 166 (SEQ ID NO:813), GHPO
181 (SEQ ID NO:815), GHPO 187 (SEQ ID NO:817), GHPO 188 (SEQ ID
NO:819), GHPO 192 (SEQ ID NO:821), GHPO 202 (SEQ ID NO:823), GHPO
204 (SEQ ID NO:825), GHPO 205 (SEQ ID NO:827), GHPO 212 (SEQ ID
NO:829), GHPO 218 (SEQ ID NO:831), GHPO 226 (SEQ ID NO:833), GHPO
231 (SEQ ID NO:835), GHPO 236 (SEQ ID NO:837), GHPO 239 (SEQ ID
NO:839), GHPO 245 (SEQ ID NO:841), GHPO 246 (SEQ ID NO:843), GHPO
248 (SEQ ID NO:845), GHPO 253 (SEQ ID NO:847), GHPO 265 (SEQ ID
NO:849), GHPO 266 (SEQ ID NO:851), GHPO 271 (SEQ ID NO:853), GHPO
272 (SEQ ID NO:855), GHPO 286 (SEQ ID NO:857), GHPO 291 (SEQ ID
NO:859), GHPO 292 (SEQ ID NO:861), GHPO 297 (SEQ ID NO:863), GHPO
304 (SEQ ID NO:865), GHPO 307 (SEQ ID NO:867), GHPO 324 (SEQ ID
NO:869), GHPO 326 (SEQ ID NO:871), GHPO 331 (SEQ ID NO:873), GHPO
343 (SEQ ID NO:875), GHPO 345 (SEQ ID NO:877), GHPO 346 (SEQ ID
NO:879), GHPO 352 (SEQ ID NO:881), GHPO 355 (SEQ ID NO:883), GHPO
363 (SEQ ID NO:885), GHPO 369 (SEQ ID NO:887), GHPO 376 (SEQ ID
NO:889), GHPO 378 (SEQ ID NO:891), GHPO 388 (SEQ ID NO:893), GHPO
396 (SEQ ID NO:895), GHPO 403 (SEQ ID NO:897), GHPO 410 (SEQ ID
NO:899), GHPO 415 (SEQ ID NO:901), GHPO 421 (SEQ ID NO:903), GHPO
439 (SEQ ID NO:905), GHPO 441 (SEQ ID NO:907), GHPO 443 (SEQ ID
NO:909), GHPO 453 (SEQ ID NO:911), GHPO 455 (SEQ ID NO:913), GHPO
464 (SEQ ID NO:915), GHPO 467 (SEQ ID NO:917), GHPO 468 (SEQ ID
NO:919), GHPO 470 (SEQ ID NO:921), GHPO 486 (SEQ ID NO:923), GHPO

NO:659), GHPO 1444 (SEQ ID NO:661), GHPO 1449 (SEQ ID NO:663),
GHPO 1465 (SEQ ID NO:665), GHPO 1475 (SEQ ID NO:667), GHPO 1479 
(SEQ ID NO:669), GHPO 1483 (SEQ ID NO:671), GHPO 1488 (SEQ ID 
NO:673), GHPO 1496 (SEQ ID NO:675), GHPO 1524 (SEQ ID NO:677), 
GHPO 1536 (SEQ ID NO:679), GHPO 1539 (SEQ ID NO:681), GHPO 1540 
(SEQ ID NO:683), GHPO 1542 (SEQ ID NO:685), GHPO 1555 (SEQ ID 
NO:687), GHPO 1560 (SEQ ID NO:689), GHPO 1564 (SEQ ID NO:691), 
GHPO 1570 (SEQ ID NO:693), GHPO 1588 (SEQ ID NO:695), GHPO 1604 
(SEQ ID NO:697), GHPO 1605 (SEQ ID NO:699), GHPO 1619 (SEQ ID 
NO:701), GHPO 1629 (SEQ ID NO:703), GHPO 1642 (SEQ ID NO:705), 
GHPO 1654 (SEQ ID NO:707), GHPO 1661 (SEQ ID NO:709), GHPO 1673 
(SEQ ID NO:711), GHPO 1687 (SEQ ID NO:713), GHPO 1692 (SEQ ID 
NO:715), GHPO 1693 (SEQ ID NO:717), GHPO 1699 (SEQ ID NO:719),
GHPO 1738 (SEQ ID NO:721), GHPO 1745 (SEQ ID NO:723), GHPO 1746 
(SEQ ID NO:725), GHPO 1754 (SEQ ID NO:727), GHPO 1792 (SEQ ID 
NO:729), GHPO 1795 (SEQ ID NO:731), GHPO 1796 (SEQ ID NO:733), 
GHPO 7 (SEQ ID NO:735), GHPO 8 (SEQ ID NO:737), GHPO 9 (SEQ ID 
NO:739), GHPO 10 (SEQ ID NO:741), GHPO 12 (SEQ ID NO:743), GHPO 
25 (SEQ ID NO:745), GHPO 27 (SEQ ID NO:747), GHPO 29 (SEQ ID 
NO:749), GHPO 30 (SEQ ID NO:751), GHPO 37 (SEQ ID NO:753), GHPO 
49 (SEQ ID NO:755), GHPO 51 (SEQ ID NO:757), GHPO 54 (SEQ ID 
NO:759), GHPO 65 (SEQ ID NO:761), GHPO 66 (SEQ ID NO:763), GHPO 
68 (SEQ ID NO:765), GHPO 70 (SEQ ID NO:767), GHPO 77 (SEQ ID 
NO:769), GHPO 83 (SEQ ID NO:771), GHPO 85 (SEQ ID NO:773), GHPO 
87 (SEQ ID NO:775), GHPO 91 (SEQ ID NO:777), GHPO 92 (SEQ ID 
NO:779), GHPO 96 (SEQ ID NO:781), GHPO 97 (SEQ ID NO:783), GHPO 
111 (SEQ ID NO:785), GHPO 115 (SEQ ID NO:787), GHPO 117 (SEQ ID

782 (SEQ ID NO:531), GHPO 786 (SEQ ID NO:533), GHPO 792 (SEQ ID
NO:535), GHPO 797 (SEQ ID NO:537), GHPO 816 (SEQ ID NO:539), GHPO 
828 (SEQ ID NO:541), GHPO 839 (SEQ ID NO:543), GHPO 840 (SEQ ID 
NO:545), GHPO 842 (SEQ ID NO:547), GHPO 885 (SEQ ID NO:549), GHPO 
889 (SEQ ID NO:551), GHPO 903 (SEQ ID NO:553), GHPO 912 (SEQ ID 
NO:555), GHPO 946 (SEQ ID NO:557), GHPO 958 (SEQ ID NO:559), GHPO

968 (SEQ ID NO:561), GHPO 987 (SEQ ID NO:563), GHPO 992 (SEQ ID 
NO:565), GHPO 996 (SEQ ID NO:567), GHPO 997 (SEQ ID NO:569), GHPO 
1002 (SEQ ID NO:571), GHPO 1026 (SEQ ID NO:573), GHPO 1028 (SEQ ID 
NO:575), GHPO 1034 (SEQ ID NO:577), GHPO 1038 (SEQ ID NO:579), 

GHPO 1059 (SEQ ID NO:581), GHPO 1065 (SEQ ID NO:583), GHPO 1072

(SEQ ID NO:585), GHPO 1073 (SEQ ID NO:587), GHPO 1088 (SEQ ID 
NO:589), GHPO 1091 (SEQ ID NO:591), GHPO 1105 (SEQ ID NO:593),
GHPO 1115 (SEQ ID NO:595), GHPO 1159 (SEQ ID NO:597), GHPO 1177
(SEQ ID NO:599), GHPO 1187 (SEQ ID NO:601), GHPO 1192 (SEQ ID
NO:603), GHPO 1195 (SEQ ID NO:605), GHPO 1224 (SEQ ID NO:607),
GHPO 1225 (SEQ ID NO:609), GHPO 1228 (SEQ ID NO:611), GHPO 1229
(SEQ ID NO:613), GHPO 1231 (SEQ ID NO:615), GHPO 1236 (SEQ ID 
NO:617), GHPO 1242 (SEQ ID NO:619), GHPO 1248 (SEQ ID NO:621), 
GHPO 1270 (SEQ ID NO:623), GHPO 1271 (SEQ ID NO:625), GHPO 1298 

(SEQ ID NO:627), GHPO 1301 (SEQ ID NO:629), GHPO 1304 (SEQ ID

NO:631), GHPO 1315 (SEQ ID NO:633), GHPO 1319 (SEQ ID NO:635), 
GHPO 1323 (SEQ ID NO:637), GHPO 1331 (SEQ ID NO:639), GHPO 1332
(SEQ ID NO:641), GHPO 1347 (SEQ ID NO:643), GHPO 1373 (SEQ ID 
NO:645), GHPO 1376 (SEQ ID NO:647), GHPO 1380 (SEQ ID NO:649), 

GHPO 1394 (SEQ ID NO:651), GHPO 1407 (SEQ ID NO:653), GHPO 1415
(SEQ ID NO:655), GHPO 1425 (SEQ ID NO:657), GHPO 1427 (SEQ ID

NO:395), GHPO 122 (SEQ ID NO:397), GHPO 128 (SEQ ID NO:399), GHPO
138 (SEQ ID NO:401), GHPO 153 (SEQ ID NO:403), GHPO 160 (SEQ ID 
NO:405), GHPO 168 (SEQ ID NO:407), GHPO 179 (SEQ ID NO:409), GHPO 
189 (SEQ ID NO:411), GHPO 229 (SEQ ID NO:413), GHPO 243 (SEQ ID
NO:415), GHPO 244 (SEQ ID NO:417), GHPO 251 (SEQ ID NO:419), GHPO
267 (SEQ ID NO:421), GHPO 269 (SEQ ID NO:423), GHPO 279 (SEQ ID

NO:425), GHPO 284 (SEQ ID NO:427), GHPO 296 (SEQ ID NO:429), GHPO 
300 (SEQ ID NO:431), GHPO 305 (SEQ ID NO:433), GHPO 319 (SEQ ID 
NO:435), GHPO 330 (SEQ ID NO:437), GHPO 340 (SEQ ID NO:439), GHPO 
342 (SEQ ID NO:441), GHPO 344 (SEQ ID NO:443), GHPO 358 (SEQ ID 

NO:445), GHPO 373 (SEQ ID NO:447), GHPO 382 (SEQ ID NO:449), GHPO

384 (SEQ ID NO:451), GHPO 398 (SEQ ID NO:453), GHPO 409 (SEQ ID 
NO:455), GHPO 422 (SEQ ID NO:457), GHPO 430 (SEQ ID NO:459), GHPO 
446 (SEQ ID NO:461), GHPO 447 (SEQ ID NO:463), GHPO 450 (SEQ ID 
NO:465), GHPO 451 (SEQ ID NO:467), GHPO 452 (SEQ ID NO:469), GHPO 

456 (SEQ ID NO:471), GHPO 461 (SEQ ID NO:473), GHPO 476 (SEQ ID

NO:475), GHPO 478 (SEQ ID NO:477), GHPO 491 (SEQ ID NO:479), GHPO 
511 (SEQ ID NO:481), GHPO 519 (SEQ ID NO:483), GHPO 526 (SEQ ID 
NO:485), GHPO 534 (SEQ ID NO:487), GHPO 536 (SEQ ID NO:489), GHPO 
542 (SEQ ID NO:491), GHPO 544 (SEQ ID NO:493), GHPO 576 (SEQ ID 

NO:495), GHPO 578 (SEQ ID NO:497), GHPO 580 (SEQ ID NO:499), GHPO

585 (SEQ ID NO:501), GHPO 599 (SEQ ID NO:503), GHPO 639 (SEQ ID 
NO:505), GHPO 642 (SEQ ID NO:507), GHPO 647 (SEQ ID NO:509), GHPO 
654 (SEQ ID NO:511), GHPO 669 (SEQ ID NO:513), GHPO 710 (SEQ ID
NO:515), GHPO 713 (SEQ ID NO:517), GHPO 716 (SEQ ID NO:519), GHPO
718 (SEQ ID NO:521), GHPO 726 (SEQ ID NO:523), GHPO 734 (SEQ ID
NO:525), GHPO 740 (SEQ ID NO:527), GHPO 770 (SEQ ID NO:529), GHPO

GHPO 1689 (SEQ ID NO:265), GHPO 1733 (SEQ ID NO:267), GHPO 18
(SEQ ID NO:269), GHPO 139 (SEQ ID NO:271), GHPO 142 (SEQ ID 
NO:273), GHPO 250 (SEQ ID NO:275), GHPO 257 (SEQ ID NO:277), GHPO 
325 (SEQ ID NO:279), GHPO 355 (SEQ ID NO:281), GHPO 357 (SEQ ID 
NO:283), GHPO 454 (SEQ ID NO:285), GHPO 475 (SEQ ID NO:287), GHPO 

515 (SEQ ID NO:289), GHPO 527 (SEQ ID NO:291), GHPO 551 (SEQ ID

NO:293), GHPO 602 (SEQ ID NO:295), GHPO 626 (SEQ ID NO:297), GHPO 
646 (SEQ ID NO:299), GHPO 653 (SEQ ID NO:301), GHPO 655 (SEQ ID 
NO:303), GHPO 670 (SEQ ID NO:305), GHPO 739 (SEQ ID NO:307), GHPO 
798 (SEQ ID NO:309), GHPO 1102 (SEQ ID NO:311), GHPO 1114 (SEQ ID
NO:313), GHPO 1152 (SEQ ID NO:315), GHPO 1272 (SEQ ID NO:317),

GHPO 1345 (SEQ ID NO:319), GHPO 1377 (SEQ ID NO:321), GHPO 1424 
(SEQ ID NO:323), GHPO 1430 (SEQ ID NO:325), GHPO 1502 (SEQ ID 
NO:327), GHPO 1600 (SEQ ID NO:329), GHPO 1714 (SEQ ID NO:331), 
GHPO 359 (SEQ ID NO:333), GHPO 678 (SEQ ID NO:335), GHPO 708 

(SEQ ID NO:337), GHPO 759 (SEQ ID NO:339), GHPO 847 (SEQ ID
NO:341), GHPO 1050 (SEQ ID NO:343), GHPO 1101 (SEQ ID NO:345),
GHPO 1120 (SEQ ID NO:347), GHPO 1138 (SEQ ID NO:349), GHPO 1310
(SEQ ID NO:351), GHPO 1320 (SEQ ID NO:353), GHPO 1375 (SEQ ID 
NO:355), GHPO 1432 (SEQ ID NO:357), GHPO 21 (SEQ ID NO:359), GHPO 

282 (SEQ ID NO:361), GHPO 1089 (SEQ ID NO:363), GHPO 1141 (SEQ ID

NO:365), GHPO 1280 (SEQ ID NO:367), GHPO 1608 (SEQ ID NO:369), 
GHPO 15 (SEQ ID NO:371), GHPO 16 (SEQ ID NO:373), GHPO 36 (SEQ ID 
NO:375), GHPO 38 (SEQ ID NO:377), GHPO 52 (SEQ ID NO:379), GHPO 
57 (SEQ ID NO:381), GHPO 64 (SEQ ID NO:383), GHPO 79 (SEQ ID 

NO:385), GHPO 84 (SEQ ID NO:387), GHPO 86 (SEQ ID NO:389), GHPO
99 (SEQ ID NO:391), GHPO 106 (SEQ ID NO:393), GHPO 118 (SEQ ID

(SEQ ID NO:135), GHPO 374 (SEQ ID NO:137), GHPO 442 (SEQ ID
NO: 139), GHPO 480 (SEQ ID NO: 141), GHPO 523 (SEQ ID NO: 143), GHPO 
610 (SEQ ID NO: 145), GHPO 675 (SEQ ID NO: 147), GHPO 690 (SEQ ID 
NO: 149), GHPO 829 (SEQ ID NO: 151), GHPO 850 (SEQ ID NO: 153), GHPO 
876 (SEQ ID NO: 155), GHPO 984 (SEQ ID NO: 157), GHPO 989 (SEQ ID 
NO:159), GHPO 1111 (SEQ ID NO:161), GHPO 1145 (SEQ ID NO:163),

GHPO 1256 (SEQ ID NO: 165), GHPO 1264 (SEQ ID NO: 167), GHPO 1316 
(SEQ ID NO:169), GHPO 1368 (SEQ ID NO:171), GHPO 1442 (SEQ ID 
NO: 173), GHPO 1506 (SEQ ID NO: 175), GHPO 1543 (SEQ ID NO: 177), 
GHPO 1574 (SEQ ID NO:179), GHPO 1627 (SEQ ID NO:181), GHPO 1657 

(SEQ ID NO:183), GHPO 1664 (SEQ ID NO:185), GHPO 1694 (SEQ ID

NO:187), GHPO 1704 (SEQ ID NO:189), GHPO 1763 (SEQ ID NO:191), 
GHPO 616 (SEQ ID NO:193), GHPO 76 (SEQ ID NO: 195), GHPO 109 (SEQ 
ID NO:197), GHPO 163 (SEQ ID NO:199), GHPO 169 (SEQ ID NO:201), 
GHPO 208 (SEQ ID NO:203), GHPO 219 (SEQ ID NO:205), GHPO 445 

(SEQ ID NO:207), GHPO 479 (SEQ ID NO:209), GHPO 525 (SEQ ID

NO:211), GHPO 535 (SEQ ID NO:213), GHPO 731 (SEQ ID NO:215), GHPO 
836 (SEQ ID NO:217), GHPO 879 (SEQ ID NO:219), GHPO 881 (SEQ ID 
NO:221), GHPO 886 (SEQ ID NO:223), GHPO 893 (SEQ ID NO:225), GHPO 
894 (SEQ ID NO:227), GHPO 976 (SEQ ID NO:229), GHPO 1011 (SEQ ID 

NO:231), GHPO 1024 (SEQ ID NO:233), GHPO 1084 (SEQ ID NO:235),

GHPO 1329 (SEQ ID NO:237), GHPO 1330 (SEQ ID NO:239), GHPO 1346 
(SEQ ID NO:241), GHPO 1360 (SEQ ID NO:243), GHPO 1388 (SEQ ID 
NO:245), GHPO 1411 (SEQ ID NO:247), GHPO 1419 (SEQ ID NO:249), 
GHPO 1446 (SEQ ID NO:251), GHPO 1469 (SEQ ID NO:253), GHPO 1501 

(SEQ ID NO:255), GHPO 1505 (SEQ ID NO:257), GHPO 1522 (SEQ ID
NO:259), GHPO 1525 (SEQ ID NO:261), GHPO 1615 (SEQ ID NO:263),

encoding GHPO 35 (SEQ ID NO:1), GHPO 55 (SEQ ID NO:3), GHPO 78
(SEQ ID NO:5), GHPO 89 (SEQ ID NO:7), GHPO 129 (SEQ ID NO:9), 
GHPO 541 (SEQ ID NO:11), GHPO 607 (SEQ ID NO:13), GHPO 635 (SEQ
ID NO:15), GHPO 701 (SEQ ID NO:17), GHPO 712 (SEQ ID NO:19), GHPO
761 (SEQ ID NO:21), GHPO 838 (SEQ ID NO:23), GHPO 1034 (SEQ ID
NO:25), GHPO 1085 (SEQ ID NO:27), GHPO 1213 (SEQ ID NO:29), GHPO

1255 (SEQ ID NO:31), GHPO 1308 (SEQ ID NO:33), GHPO 1389 (SEQ ID 
NO:35), GHPO 1706 (SEQ ID NO:37), GHPO 234 (SEQ ID NO:39), GHPO 
314 (SEQ ID NO:41), GHPO 510 (SEQ ID NO:43), GHPO 603 (SEQ ID
NO:45), GHPO 937 (SEQ ID NO:47), GHPO 1027 (SEQ ID NO:49), GHPO
1099 (SEQ ID NO:51), GHPO 1151 (SEQ ID NO:53), GHPO 1275 (SEQ ID
NO:55), GHPO 1365 (SEQ ID NO:57), GHPO 1578 (SEQ ID NO:59), GHPO
22 (SEQ ID NO:61), GHPO 58 (SEQ ID NO:63), GHPO 200 (SEQ ID NO:65), 
GHPO 558 (SEQ ID NO:67), GHPO 563 (SEQ ID NO:69), GHPO 695 (SEQ 
ID NO:71), GHPO 699 (SEQ ID NO:73), GHPO 702 (SEQ ID NO:75), GHPO 

709 (SEQ ID NO:77), GHPO 741 (SEQ ID NO:79), GHPO 762 (SEQ ID

NO:81), GHPO 827 (SEQ ID NO:83), GHPO 852 (SEQ ID NO:85), GHPO 
1013 (SEQ ID NO:87), GHPO 1020 (SEQ ID NO:89), GHPO 1031 (SEQ ID 
NO:91), GHPO 1052 (SEQ ID NO:93), GHPO 1127 (SEQ ID NO:95), GHPO 
1149 (SEQ ID NO:97), GHPO 1176 (SEQ ID NO:99), GHPO 1250 (SEQ ID
NO:101), GHPO 1312 (SEQ ID NO:103), GHPO 1358 (SEQ ID NO:105),
GHPO 1490 (SEQ ID NO:107), GHPO 1559 (SEQ ID NO:109), GHPO 1651
(SEQ ID NO:111), GHPO 1726 (SEQ ID NO:113), GHPO 1780 (SEQ ID
NO:115), GHPO 895 (SEQ ID NO:117), GHPO 1447 (SEQ ID NO:119),
GHPO 28 (SEQ ID NO:121), GHPO 86 (SEQ ID NO:123), GHPO 155 (SEQ
ID NO:125), GHPO 157 (SEQ ID NO:127), GHPO 237 (SEQ ID NO:129),
GHPO 290 (SEQ ID NO:131), GHPO 293 (SEQ ID NO:133), GHPO 335

NO:1336), GHPO 1701 (SEQ ID NO:1338), GHPO 1719 (SEQ ID NO:1340),
GHPO 1723 (SEQ ID NO:1342), GHPO 1732 (SEQ ID NO:1344), GHPO 
1739 (SEQ ID NO: 1346), GHPO 1741 (SEQ ID NO: 1348), GHPO 1747 (SEQ 
ID NO:1350), GHPO 1749 (SEQ ID NO:1352), GHPO 1750 (SEQ ID 
NO:1354), GHPO 1751 (SEQ ID NO:1356), GHPO 1755 (SEQ ID NO:1358), 

GHPO 1771 (SEQ ID NO:1360), GHPO 1786 (SEQ ID NO:1362), and GHPO
1789 (SEQ ID NO:1364), have been identified in the H. pylori genome. These
polypeptides can be used, for example, in vaccination methods for preventing
or treating Helicobacter infection. For example, GHPO 1320, GHPO 523,
GHPO 792, GHPO 639, GHPO 669, GHPO 992, GHPO 576, GHPO 109,
GHPO 129, GHPO 234, GHPO 257, GHPO 525, GHPO 626, GHPO 1034,
GHPO 1275, GHPO 1308, GHPO 1600, GHPO 1615, GHPO 536, GHPO 66,
GHPO 1363, GHPO 1595, and GHPO 1166 have been shown to be protective
antigens that can be used in methods for preventing Helicobacter infection. By
"protective antigen" is meant an antigen that is capable of reducing the
infection level after challenge, relative to a positive control. Absolute
protection from infection, although included in the invention, is not required.

Some of the new polypeptides are secreted polypeptides that can be
produced in their mature forms (i.e., as polypeptides that have been exported
through class II or class III secretion pathways) or as precursors that include
signal peptides, which can be removed in the course of excretion/secretion by
cleavage at the N-terminal end of the mature form. (The cleavage site is
located at the C-terminal end of the signal peptide, adjacent to the mature
form.)
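
The relationship between a precursor and its mature form can be illustrated with a minimal sketch. It assumes the length of the signal peptide is already known; the function name, the ten-residue sequence, and the four-residue signal peptide below are hypothetical and are not taken from the sequence listing.

```python
def mature_form(precursor: str, signal_length: int) -> str:
    """Return the part of a precursor that remains after the N-terminal
    signal peptide is removed.

    The cleavage site lies between the last residue of the signal peptide
    (its C-terminal end) and the first residue of the mature form.
    """
    if not 0 <= signal_length <= len(precursor):
        raise ValueError("signal_length must lie within the precursor")
    return precursor[signal_length:]


# Hypothetical example: a made-up precursor with an assumed 4-residue signal peptide.
precursor = "MKKLAGSTVE"
signal_peptide = precursor[:4]
mature = mature_form(precursor, 4)
```

In this sketch the signal peptide and the mature form, joined in that order, reproduce the precursor, which is the sense in which the cleavage site is adjacent to both.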

According to a first aspect of the invention, there are provided isolated
polynucleotides that encode the precursor and mature forms of the Helicobacter
GHPO proteins listed above. Examples of such polynucleotides are those

GHPO 1325 (SEQ ID NO:1216), GHPO 1341 (SEQ ID NO:1218), GHPO
1351 (SEQ ID NO: 1220), GHPO 1354 (SEQ ID NO: 1222), GHPO 1363 (SEQ 
ID NO:1224), GHPO 1371 (SEQ ID NO:1226), GHPO 1381 (SEQ ID 
NO:1228), GHPO 1401 (SEQ ID NO:1230), GHPO 1402 (SEQ ID NO: 1232), 
GHPO 1403 (SEQ ID NO: 1234), GHPO 1408 (SEQ ID NO: 1236), GHPO 
1416 (SEQ ID NO:1238), GHPO 1420 (SEQ ID NO:1240), GHPO 1428 (SEQ

ID NO: 1242), GHPO 1437 (SEQ ID NO: 1244), GHPO 1439 (SEQ ID 
NO: 1246), GHPO 1460 (SEQ ID NO: 1248), GHPO 1463 (SEQ ID NO: 1250), 
GHPO 1472 (SEQ ID NO:1252), GHPO 1474 (SEQ ID NO: 1254), GHPO 
1484 (SEQ ID NO: 1256), GHPO 1489 (SEQ ID NO: 1258), GHPO 1494 (SEQ 

ID NO:1260), GHPO 1495 (SEQ ID NO:1262), GHPO 1498 (SEQ ID

NO:1264), GHPO 1499 (SEQ ID NO:1266), GHPO 1500 (SEQ ID NO:1268), 
GHPO 1503 (SEQ ID NO: 1270), GHPO 1504 (SEQ ID NO: 1272), GHPO 
1510 (SEQ ID NO:1274), GHPO 1518 (SEQ ID NO:1276), GHPO 1533 (SEQ
ID NO:1278), GHPO 1541 (SEQ ID NO:1280), GHPO 1544 (SEQ ID
NO:1282), GHPO 1548 (SEQ ID NO:1284), GHPO 1565 (SEQ ID NO:1286),
GHPO 1575 (SEQ ID NO:1288), GHPO 1582 (SEQ ID NO:1290), GHPO
1595 (SEQ ID NO:1292), GHPO 1597 (SEQ ID NO:1294), GHPO 1599 (SEQ 
ID NO: 1296), GHPO 1601 (SEQ ID NO: 1298), GHPO 1609 (SEQ ID 
NO:1300), GHPO 1613 (SEQ ID NO:1302), GHPO 1614 (SEQ ID NO:1304), 

GHPO 1626 (SEQ ID NO:1306), GHPO 1628 (SEQ ID NO:1308), GHPO
1639 (SEQ ID NO:1310), GHPO 1640 (SEQ ID NO:1312), GHPO 1641 (SEQ
ID NO:1314), GHPO 1646 (SEQ ID NO:1316), GHPO 1662 (SEQ ID
NO:1318), GHPO 1667 (SEQ ID NO:1320), GHPO 1668 (SEQ ID NO:1322),
GHPO 1670 (SEQ ID NO:1324), GHPO 1671 (SEQ ID NO:1326), GHPO
1672 (SEQ ID NO:1328), GHPO 1678 (SEQ ID NO:1330), GHPO 1684 (SEQ

ID NO:1332), GHPO 1695 (SEQ ID NO:1334), GHPO 1697 (SEQ ID

1033 (SEQ ID NO:1094), GHPO 1039 (SEQ ID NO:1096), GHPO 1041 (SEQ
ID NO:1098), GHPO 1043 (SEQ ID NO:1100), GHPO 1044 (SEQ ID
NO:1102), GHPO 1051 (SEQ ID NO:1104), GHPO 1058 (SEQ ID NO:1106),
GHPO 1060 (SEQ ID NO:1108), GHPO 1075 (SEQ ID NO:1110), GHPO
1077 (SEQ ID NO:1112), GHPO 1082 (SEQ ID NO:1114), GHPO 1083 (SEQ
ID NO:1116), GHPO 1086 (SEQ ID NO:1118), GHPO 1087 (SEQ ID
NO:1120), GHPO 1090 (SEQ ID NO:1122), GHPO 1097 (SEQ ID NO:1124),
GHPO 1098 (SEQ ID NO:1126), GHPO 1103 (SEQ ID NO:1128), GHPO
1113 (SEQ ID NO:1130), GHPO 1116 (SEQ ID NO:1132), GHPO 1123 (SEQ
ID NO:1134), GHPO 1125 (SEQ ID NO:1136), GHPO 1129 (SEQ ID
NO:1138), GHPO 1130 (SEQ ID NO:1140), GHPO 1134 (SEQ ID NO:1142),
GHPO 1161 (SEQ ID NO:1144), GHPO 1166 (SEQ ID NO:1146), GHPO
1170 (SEQ ID NO:1148), GHPO 1175 (SEQ ID NO:1150), GHPO 1181 (SEQ
ID NO:1152), GHPO 1186 (SEQ ID NO:1154), GHPO 1188 (SEQ ID
NO:1156), GHPO 1191 (SEQ ID NO:1158), GHPO 1193 (SEQ ID NO:1160),
GHPO 1196 (SEQ ID NO:1162), GHPO 1204 (SEQ ID NO:1164), GHPO
1210 (SEQ ID NO:1166), GHPO 1211 (SEQ ID NO:1168), GHPO 1216 (SEQ
ID NO:1170), GHPO 1218 (SEQ ID NO:1172), GHPO 1220 (SEQ ID
NO:1174), GHPO 1223 (SEQ ID NO:1176), GHPO 1226 (SEQ ID NO:1178),
GHPO 1240 (SEQ ID NO:1180), GHPO 1246 (SEQ ID NO:1182), GHPO
1251 (SEQ ID NO:1184), GHPO 1252 (SEQ ID NO:1186), GHPO 1261 (SEQ
ID NO:1188), GHPO 1265 (SEQ ID NO:1190), GHPO 1267 (SEQ ID
NO:1192), GHPO 1278 (SEQ ID NO:1194), GHPO 1282 (SEQ ID NO:1196),
GHPO 1283 (SEQ ID NO:1198), GHPO 1287 (SEQ ID NO:1200), GHPO
1292 (SEQ ID NO:1202), GHPO 1293 (SEQ ID NO:1204), GHPO 1302 (SEQ
ID NO:1206), GHPO 1309 (SEQ ID NO:1208), GHPO 1317 (SEQ ID
NO:1210), GHPO 1318 (SEQ ID NO:1212), GHPO 1321 (SEQ ID NO:1214),

583 (SEQ ID NO:966), GHPO 588 (SEQ ID NO:968), GHPO 593 (SEQ ID
NO:970), GHPO 597 (SEQ ID NO:972), GHPO 598 (SEQ ID NO:974), GHPO 
604 (SEQ ID NO:976), GHPO 606 (SEQ ID NO:978), GHPO 611 (SEQ ID
NO:980), GHPO 612 (SEQ ID NO:982), GHPO 615 (SEQ ID NO:984), GHPO
632 (SEQ ID NO:986), GHPO 633 (SEQ ID NO:988), GHPO 637 (SEQ ID
NO:990), GHPO 651 (SEQ ID NO:992), GHPO 663 (SEQ ID NO:994), GHPO

686 (SEQ ID NO:996), GHPO 693 (SEQ ID NO:998), GHPO 698 (SEQ ID 
NO: 1000), GHPO 703 (SEQ ID NO: 1002), GHPO 704 (SEQ ID NO: 1004), 
GHPO 705 (SEQ ID NO: 1006), GHPO 707 (SEQ ID NO: 1008), GHPO 721 
(SEQ ID NO: 1010), GHPO 727 (SEQ ID NO: 1012), GHPO 728 (SEQ ID 

NO:1014), GHPO 733 (SEQ ID NO:1016), GHPO 758 (SEQ ID NO:1018),

GHPO 763 (SEQ ID NO: 1020), GHPO 771 (SEQ ID NO: 1022), GHPO 774 
(SEQ ID NO: 1024), GHPO 776 (SEQ ID NO: 1026), GHPO 783 (SEQ ID 
NO: 1028), GHPO 800 (SEQ ID NO: 1030), GHPO 806 (SEQ ID NO: 1032), 
GHPO 807 (SEQ ID NO: 1034), GHPO 808 (SEQ ID NO: 1036), GHPO 809 

(SEQ ID NO:1038), GHPO 811 (SEQ ID NO:1040), GHPO 815 (SEQ ID

NO:1042), GHPO 819 (SEQ ID NO:1044), GHPO 841 (SEQ ID NO:1046), 
GHPO 843 (SEQ ID NO:1048), GHPO 846 (SEQ ID NO:1050), GHPO 875 
(SEQ ID NO: 1052), GHPO 892 (SEQ ID NO: 1054), GHPO 902 (SEQ ID 
NO: 1056), GHPO 904 (SEQ ID NO: 1058), GHPO 906 (SEQ ID NO: 1060), 

GHPO 908 (SEQ ID NO:1062), GHPO 921 (SEQ ID NO:1064), GHPO 923

(SEQ ID NO: 1066), GHPO 926 (SEQ ID NO: 1068), GHPO 933 (SEQ ID 
NO: 1070), GHPO 939 (SEQ ID NO: 1072), GHPO 940 (SEQ ID NO: 1074), 
GHPO 943 (SEQ ID NO: 1076), GHPO 951 (SEQ ID NO: 1078), GHPO 961 
(SEQ ID NO: 1080), GHPO 965 (SEQ ID NO: 1082), GHPO 990 (SEQ ID 

NO:1084), GHPO 991 (SEQ ID NO:1086), GHPO 998 (SEQ ID NO:1088),
GHPO 1001 (SEQ ID NO:1090), GHPO 1005 (SEQ ID NO:1092), GHPO

NO:830), GHPO 218 (SEQ ID NO:832), GHPO 226 (SEQ ID NO:834), GHPO
231 (SEQ ID NO:836), GHPO 236 (SEQ ID NO:838), GHPO 239 (SEQ ID 
NO:840), GHPO 245 (SEQ ID NO:842), GHPO 246 (SEQ ID NO:844), GHPO 
248 (SEQ ID NO:846), GHPO 253 (SEQ ID NO:848), GHPO 265 (SEQ ID 
NO:850), GHPO 266 (SEQ ID NO:852), GHPO 271 (SEQ ID NO:854), GHPO 
272 (SEQ ID NO:856), GHPO 286 (SEQ ID NO:858), GHPO 291 (SEQ ID

NO:860), GHPO 292 (SEQ ID NO:862), GHPO 297 (SEQ ID NO:864), GHPO 
304 (SEQ ID NO:866), GHPO 307 (SEQ ID NO:868), GHPO 324 (SEQ ID 
NO:870), GHPO 326 (SEQ ID NO:872), GHPO 331 (SEQ ID NO:874), GHPO
343 (SEQ ID NO:876), GHPO 345 (SEQ ID NO:878), GHPO 346 (SEQ ID
NO:880), GHPO 352 (SEQ ID NO:882), GHPO 355 (SEQ ID NO:884), GHPO

363 (SEQ ID NO:886), GHPO 369 (SEQ ID NO:888), GHPO 376 (SEQ ID 
NO:890), GHPO 378 (SEQ ID NO:892), GHPO 388 (SEQ ID NO:894), GHPO 
396 (SEQ ID NO:896), GHPO 403 (SEQ ID NO:898), GHPO 410 (SEQ ID 
NO:900), GHPO 415 (SEQ ID NO:902), GHPO 421 (SEQ ID NO:904), GHPO 
439 (SEQ ID NO:906), GHPO 441 (SEQ ID NO:908), GHPO 443 (SEQ ID

NO:910), GHPO 453 (SEQ ID NO:912), GHPO 455 (SEQ ID NO:914), GHPO 
464 (SEQ ID NO:916), GHPO 467 (SEQ ID NO:918), GHPO 468 (SEQ ID 
NO:920), GHPO 470 (SEQ ID NO:922), GHPO 486 (SEQ ID NO:924), GHPO 
487 (SEQ ID NO:926), GHPO 488 (SEQ ID NO:928), GHPO 489 (SEQ ID 
NO:930), GHPO 498 (SEQ ID NO:932), GHPO 501 (SEQ ID NO:934), GHPO

504 (SEQ ID NO:936), GHPO 512 (SEQ ID NO:938), GHPO 517 (SEQ ID 
NO:940), GHPO 520 (SEQ ID NO:942), GHPO 528 (SEQ ID NO:944), GHPO 
530 (SEQ ID NO:946), GHPO 532 (SEQ ID NO:948), GHPO 548 (SEQ ID 
NO:950), GHPO 561 (SEQ ID NO:952), GHPO 564 (SEQ ID NO:954), GHPO 
572 (SEQ ID NO:956), GHPO 573 (SEQ ID NO:958), GHPO 574 (SEQ ID
NO:960), GHPO 577 (SEQ ID NO:962), GHPO 579 (SEQ ID NO:964), GHPO

(SEQ ID NO:698), GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID
NO:702), GHPO 1629 (SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706), 
GHPO 1654 (SEQ ID NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673 
(SEQ ID NO:712), GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID 
NO:716), GHPO 1693 (SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720), 
GHPO 1738 (SEQ ID NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746

(SEQ ID NO:726), GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID 
NO: 730), GHPO 1795 (SEQ ID NO: 732), GHPO 1796 (SEQ ID NO: 734), 
GHPO 7 (SEQ ID NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID 
NO:740), GHPO 10 (SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO 

25 (SEQ ID NO:746), GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID

NO:750), GHPO 30 (SEQ ID NO:752), GHPO 37 (SEQ ID NO:754), GHPO 
49 (SEQ ID NO:756), GHPO 51 (SEQ ID NO:758), GHPO 54 (SEQ ID 
NO:760), GHPO 65 (SEQ ID NO:762), GHPO 66 (SEQ ID NO:764), GHPO 
68 (SEQ ID NO:766), GHPO 70 (SEQ ID NO:768), GHPO 77 (SEQ ID 

NO:770), GHPO 83 (SEQ ID NO:772), GHPO 85 (SEQ ID NO:774), GHPO

87 (SEQ ID NO:776), GHPO 91 (SEQ ID NO:778), GHPO 92 (SEQ ID 
NO:780), GHPO 96 (SEQ ID NO:782), GHPO 97 (SEQ ID NO:784), GHPO 
111 (SEQ ID NO:786), GHPO 115 (SEQ ID NO:788), GHPO 117 (SEQ ID
NO:790), GHPO 123 (SEQ ID NO:792), GHPO 124 (SEQ ID NO:794), GHPO
126 (SEQ ID NO:796), GHPO 127 (SEQ ID NO:798), GHPO 128 (SEQ ID

NO:800), GHPO 131 (SEQ ID NO:802), GHPO 133 (SEQ ID NO:804), GHPO 
140 (SEQ ID NO: 806), GHPO 141 (SEQ ID NO: 808), GHPO 145 (SEQ ID 
NO:810), GHPO 147 (SEQ ID NO:812), GHPO 166 (SEQ ID NO:814), GHPO 
181 (SEQ ID NO:816), GHPO 187 (SEQ ID NO:818), GHPO 188 (SEQ ID 

NO:820), GHPO 192 (SEQ ID NO:822), GHPO 202 (SEQ ID NO:824), GHPO
204 (SEQ ID NO:826), GHPO 205 (SEQ ID NO:828), GHPO 212 (SEQ ID

1002 (SEQ ID NO:572), GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID
NO:576), GHPO 1034 (SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580), 
GHPO 1059 (SEQ ID NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072 
(SEQ ID NO:586), GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID 
NO:590), GHPO 1091 (SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594),
GHPO 1115 (SEQ ID NO:596), GHPO 1159 (SEQ ID NO:598), GHPO 1177
(SEQ ID NO:600), GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID
NO:604), GHPO 1195 (SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608),
GHPO 1225 (SEQ ID NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229 
(SEQ ID NO:614), GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID 

NO:618), GHPO 1242 (SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622),

GHPO 1270 (SEQ ID NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298 
(SEQ ID NO:628), GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID 
NO:632), GHPO 1315 (SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), 
GHPO 1323 (SEQ ID NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332 

(SEQ ID NO:642), GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID

NO:646), GHPO 1376 (SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650), 
GHPO 1394 (SEQ ID NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415 
(SEQ ID NO:656), GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID 
NO:660), GHPO 1444 (SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664), 

GHPO 1465 (SEQ ID NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479

(SEQ ID NO:670), GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID 
NO:674), GHPO 1496 (SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678), 
GHPO 1536 (SEQ ID NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540 
(SEQ ID NO:684), GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID 

NO:688), GHPO 1560 (SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692),
GHPO 1570 (SEQ ID NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604

NO:436), GHPO 330 (SEQ ID NO:438), GHPO 340 (SEQ ID NO:440), GHPO
342 (SEQ ID NO:442), GHPO 344 (SEQ ID NO:444), GHPO 358 (SEQ ID 
NO:446), GHPO 373 (SEQ ID NO:448), GHPO 382 (SEQ ID NO:450), GHPO 
384 (SEQ ID NO:452), GHPO 398 (SEQ ID NO:454), GHPO 409 (SEQ ID 
NO:456), GHPO 422 (SEQ ID NO:458), GHPO 430 (SEQ ID NO:460), GHPO 
446 (SEQ ID NO:462), GHPO 447 (SEQ ID NO:464), GHPO 450 (SEQ ID

NO:466), GHPO 451 (SEQ ID NO:468), GHPO 452 (SEQ ID NO:470), GHPO 
456 (SEQ ID NO:472), GHPO 461 (SEQ ID NO:474), GHPO 476 (SEQ ID 
NO:476), GHPO 478 (SEQ ID NO:478), GHPO 491 (SEQ ID NO:480), GHPO 
511 (SEQ ID NO:482), GHPO 519 (SEQ ID NO:484), GHPO 526 (SEQ ID 

NO:486), GHPO 534 (SEQ ID NO:488), GHPO 536 (SEQ ID NO:490), GHPO

542 (SEQ ID NO:492), GHPO 544 (SEQ ID NO:494), GHPO 576 (SEQ ID 
NO:496), GHPO 578 (SEQ ID NO:498), GHPO 580 (SEQ ID NO:500), GHPO 
585 (SEQ ID NO:502), GHPO 599 (SEQ ID NO:504), GHPO 639 (SEQ ID 
NO:506), GHPO 642 (SEQ ID NO:508), GHPO 647 (SEQ ID NO:510), GHPO 

654 (SEQ ID NO:512), GHPO 669 (SEQ ID NO:514), GHPO 710 (SEQ ID

NO:516), GHPO 713 (SEQ ID NO:518), GHPO 716 (SEQ ID NO:520), GHPO 
718 (SEQ ID NO:522), GHPO 726 (SEQ ID NO:524), GHPO 734 (SEQ ID 
NO:526), GHPO 740 (SEQ ID NO:528), GHPO 770 (SEQ ID NO:530), GHPO 
782 (SEQ ID NO:532), GHPO 786 (SEQ ID NO:534), GHPO 792 (SEQ ID 

NO:536), GHPO 797 (SEQ ID NO:538), GHPO 816 (SEQ ID NO:540), GHPO

828 (SEQ ID NO:542), GHPO 839 (SEQ ID NO:544), GHPO 840 (SEQ ID 
NO:546), GHPO 842 (SEQ ID NO:548), GHPO 885 (SEQ ID NO:550), GHPO 
889 (SEQ ID NO:552), GHPO 903 (SEQ ID NO:554), GHPO 912 (SEQ ID 
NO:556), GHPO 946 (SEQ ID NO:558), GHPO 958 (SEQ ID NO:560), GHPO 

968 (SEQ ID NO:562), GHPO 987 (SEQ ID NO:564), GHPO 992 (SEQ ID
NO:566), GHPO 996 (SEQ ID NO:568), GHPO 997 (SEQ ID NO:570), GHPO

NO:304), GHPO 670 (SEQ ID NO:306), GHPO 739 (SEQ ID NO:308), GHPO
798 (SEQ ID NO:310), GHPO 1102 (SEQ ID NO:312), GHPO 1114 (SEQ ID
NO:314), GHPO 1152 (SEQ ID NO:316), GHPO 1272 (SEQ ID NO:318), 
GHPO 1345 (SEQ ID NO:320), GHPO 1377 (SEQ ID NO:322), GHPO 1424 
(SEQ ID NO:324), GHPO 1430 (SEQ ID NO:326), GHPO 1502 (SEQ ID 

NO:328), GHPO 1600 (SEQ ID NO:330), GHPO 1714 (SEQ ID NO:332),

GHPO 359 (SEQ ID NO:334), GHPO 678 (SEQ ID NO:336), GHPO 708 
(SEQ ID NO:338), GHPO 759 (SEQ ID NO:340), GHPO 847 (SEQ ID 
NO:342), GHPO 1050 (SEQ ID NO:344), GHPO 1101 (SEQ ID NO:346),
GHPO 1120 (SEQ ID NO:348), GHPO 1138 (SEQ ID NO:350), GHPO 1310
(SEQ ID NO:352), GHPO 1320 (SEQ ID NO:354), GHPO 1375 (SEQ ID

NO:356), GHPO 1432 (SEQ ID NO:358), GHPO 21 (SEQ ID NO:360), GHPO 
282 (SEQ ID NO:362), GHPO 1089 (SEQ ID NO:364), GHPO 1141 (SEQ ID
NO:366), GHPO 1280 (SEQ ID NO:368), GHPO 1608 (SEQ ID NO:370), 
GHPO 15 (SEQ ID NO:372), GHPO 16 (SEQ ID NO:374), GHPO 36 (SEQ ID 

NO:376), GHPO 38 (SEQ ID NO:378), GHPO 52 (SEQ ID NO:380), GHPO

57 (SEQ ID NO:382), GHPO 64 (SEQ ID NO:384), GHPO 79 (SEQ ID 
NO:386), GHPO 84 (SEQ ID NO:388), GHPO 86 (SEQ ID NO:390), GHPO 
99 (SEQ ID NO:392), GHPO 106 (SEQ ID NO:394), GHPO 118 (SEQ ID 
NO:396), GHPO 122 (SEQ ID NO:398), GHPO 128 (SEQ ID NO:400), GHPO 

138 (SEQ ID NO:402), GHPO 153 (SEQ ID NO:404), GHPO 160 (SEQ ID

NO:406), GHPO 168 (SEQ ID NO:408), GHPO 179 (SEQ ID NO:410), GHPO 
189 (SEQ ID NO:412), GHPO 229 (SEQ ID NO:414), GHPO 243 (SEQ ID 
NO:416), GHPO 244 (SEQ ID NO:418), GHPO 251 (SEQ ID NO:420), GHPO 
267 (SEQ ID NO:422), GHPO 269 (SEQ ID NO:424), GHPO 279 (SEQ ID 

NO:426), GHPO 284 (SEQ ID NO:428), GHPO 296 (SEQ ID NO:430), GHPO
300 (SEQ ID NO:432), GHPO 305 (SEQ ID NO:434), GHPO 319 (SEQ ID

NO:174), GHPO 1506 (SEQ ID NO:176), GHPO 1543 (SEQ ID NO:178),
GHPO 1574 (SEQ ID NO: 180), GHPO 1627 (SEQ ID NO: 182), GHPO 1657 
(SEQ ID NO: 184), GHPO 1664 (SEQ ID NO: 186), GHPO 1694 (SEQ ID 
NO: 188), GHPO 1704 (SEQ ID NO: 190), GHPO 1763 (SEQ ID NO: 192), 
GHPO 616 (SEQ ID NO: 194), GHPO 76 (SEQ ID NO: 196), GHPO 109 (SEQ 
ID NO:198), GHPO 163 (SEQ ID NO:200), GHPO 169 (SEQ ID NO:202),

GHPO 208 (SEQ ID NO:204), GHPO 219 (SEQ ID NO:206), GHPO 445 
(SEQ ID NO:208), GHPO 479 (SEQ ID NO:210), GHPO 525 (SEQ ID 
NO:212), GHPO 535 (SEQ ID NO:214), GHPO 731 (SEQ ID NO:216), GHPO 
836 (SEQ ID NO:218), GHPO 879 (SEQ ID NO:220), GHPO 881 (SEQ ID 

NO:222), GHPO 886 (SEQ ID NO:224), GHPO 893 (SEQ ID NO:226), GHPO
894 (SEQ ID NO:228), GHPO 976 (SEQ ID NO:230), GHPO 1011 (SEQ ID
NO:232), GHPO 1024 (SEQ ID NO:234), GHPO 1084 (SEQ ID NO:236), 
GHPO 1329 (SEQ ID NO:238), GHPO 1330 (SEQ ID NO:240), GHPO 1346 
(SEQ ID NO:242), GHPO 1360 (SEQ ID NO:244), GHPO 1388 (SEQ ID 

NO:246), GHPO 1411 (SEQ ID NO:248), GHPO 1419 (SEQ ID NO:250),

GHPO 1446 (SEQ ID NO:252), GHPO 1469 (SEQ ID NO:254), GHPO 1501 
(SEQ ID NO:256), GHPO 1505 (SEQ ID NO:258), GHPO 1522 (SEQ ID 
NO:260), GHPO 1525 (SEQ ID NO:262), GHPO 1615 (SEQ ID NO:264), 
GHPO 1689 (SEQ ID NO:266), GHPO 1733 (SEQ ID NO:268), GHPO 18 

(SEQ ID NO:270), GHPO 139 (SEQ ID NO:272), GHPO 142 (SEQ ID

NO:274), GHPO 250 (SEQ ID NO:276), GHPO 257 (SEQ ID NO:278), GHPO 
325 (SEQ ID NO:280), GHPO 355 (SEQ ID NO:282), GHPO 357 (SEQ ID 
NO:284), GHPO 454 (SEQ ID NO:286), GHPO 475 (SEQ ID NO:288), GHPO 
515 (SEQ ID NO:290), GHPO 527 (SEQ ID NO:292), GHPO 551 (SEQ ID 

NO:294), GHPO 602 (SEQ ID NO:296), GHPO 626 (SEQ ID NO:298), GHPO
646 (SEQ ID NO:300), GHPO 653 (SEQ ID NO:302), GHPO 655 (SEQ ID

314 (SEQ ID NO:42), GHPO 510 (SEQ ID NO:44), GHPO 603 (SEQ ID
NO:46), GHPO 937 (SEQ ID NO:48), GHPO 1027 (SEQ ID NO:50), GHPO 
1099 (SEQ ID NO:52), GHPO 1151 (SEQ ID NO:54), GHPO 1275 (SEQ ID 
NO:56), GHPO 1365 (SEQ ID NO:58), GHPO 1578 (SEQ ID NO:60), GHPO 
22 (SEQ ID NO:62), GHPO 58 (SEQ ID NO:64), GHPO 200 (SEQ ID NO:66), 

GHPO 558 (SEQ ID NO:68), GHPO 563 (SEQ ID NO:70), GHPO 695 (SEQ

ID NO:72), GHPO 699 (SEQ ID NO:74), GHPO 702 (SEQ ID NO:76), GHPO 
709 (SEQ ID NO:78), GHPO 741 (SEQ ID NO:80), GHPO 762 (SEQ ID
NO:82), GHPO 827 (SEQ ID NO:84), GHPO 852 (SEQ ID NO:86), GHPO
1013 (SEQ ID NO:88), GHPO 1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID
NO:92), GHPO 1052 (SEQ ID NO:94), GHPO 1127 (SEQ ID NO:96), GHPO
1149 (SEQ ID NO:98), GHPO 1176 (SEQ ID NO:100), GHPO 1250 (SEQ ID
NO: 102), GHPO 1312 (SEQ ID NO: 104), GHPO 1358 (SEQ ID NO: 106), 
GHPO 1490 (SEQ ID NO:108), GHPO 1559 (SEQ ID NO:110), GHPO 1651 
(SEQ ID NO:112), GHPO 1726 (SEQ ID NO:114), GHPO 1780 (SEQ ID
NO:116), GHPO 895 (SEQ ID NO:118), GHPO 1447 (SEQ ID NO:120),

GHPO 28 (SEQ ID NO:122), GHPO 86 (SEQ ID NO:124), GHPO 155 (SEQ 
ID NO: 126), GHPO 157 (SEQ ID NO: 128), GHPO 237 (SEQ ID NO: 130), 
GHPO 290 (SEQ ID NO: 132), GHPO 293 (SEQ ID NO: 134), GHPO 335 
(SEQ ID NO: 136), GHPO 374 (SEQ ID NO: 138), GHPO 442 (SEQ ID 

NO:140), GHPO 480 (SEQ ID NO:142), GHPO 523 (SEQ ID NO:144), GHPO

610 (SEQ ID NO: 146), GHPO 675 (SEQ ID NO: 148), GHPO 690 (SEQ ID 
NO: 150), GHPO 829 (SEQ ID NO: 152), GHPO 850 (SEQ ID NO: 154), GHPO 
876 (SEQ ID NO: 156), GHPO 984 (SEQ ID NO: 158), GHPO 989 (SEQ ID 
NO:160), GHPO 1111 (SEQ ID NO:162), GHPO 1145 (SEQ ID NO:164),
GHPO 1256 (SEQ ID NO:166), GHPO 1264 (SEQ ID NO:168), GHPO 1316

(SEQ ID NO:170), GHPO 1368 (SEQ ID NO:172), GHPO 1442 (SEQ ID

of the invention. Accordingly, the present invention provides (i) methods for
producing polypeptides of the invention in recombinant host systems and
related expression cassettes, vectors, and transformed or transfected cells; (ii)
live vaccine vectors, such as pox virus, Salmonella typhimurium, and Vibrio
cholerae vectors, that contain polynucleotides of the invention (such vaccine
vectors being useful in, e.g., methods for preventing or treating Helicobacter
infection) in combination with a diluent or carrier, and related pharmaceutical
compositions and associated therapeutic and/or prophylactic methods; (iii)
therapeutic and/or prophylactic methods involving administration of 
polynucleotide molecules, either in a naked form or formulated with a delivery 
vehicle, polypeptides or mixtures of polypeptides, or monospecific antibodies 
of the invention, and related pharmaceutical compositions; (iv) methods for 
detecting the presence of Helicobacter in biological samples, which can 
involve the use of polynucleotide molecules, monospecific antibodies, or 
polypeptides of the invention; and (v) methods for purifying polypeptides of 
the invention by antibody-based affinity chromatography. 

Detailed Description 
Open reading frames (ORFs) encoding new polypeptides, designated 
GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), GHPO 78 (SEQ ID 
NO:6), GHPO 89 (SEQ ID NO:8), GHPO 129 (SEQ ID NO: 10), GHPO 541 
(SEQ ID NO: 12), GHPO 607 (SEQ ID NO: 14), GHPO 635 (SEQ ID NO: 16), 
GHPO 701 (SEQ ID NO:18), GHPO 712 (SEQ ID NO:20), GHPO 761 (SEQ 
ID NO:22), GHPO 838 (SEQ ID NO:24), GHPO 1034 (SEQ ID NO:26), 
GHPO 1085 (SEQ ID NO:28), GHPO 1213 (SEQ ID NO:30), GHPO 1255 
(SEQ ID NO:32), GHPO 1308 (SEQ ID NO:34), GHPO 1389 (SEQ ID 
NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 234 (SEQ ID NO:40), GHPO 
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NO:1320), GHPO 1668 (SEQ ID NO: 1322), GHPO 1670 (SEQ ID NO: 1324), 
GHPO 1671 (SEQ ID NO:1326), GHPO 1672 (SEQ ID NO:1328), GHPO 
1678 (SEQ ID NO:1330), GHPO 1684 (SEQ ID NO:1332), GHPO 1695 (SEQ 
ID NO:1334), GHPO 1697 (SEQ ID NO:1336), GHPO 1701 (SEQ ID 
NO:1338), GHPO 1719 (SEQ ID NO:1340), GHPO 1723 (SEQ ID NO:1342),
GHPO 1732 (SEQ ID NO:1344), GHPO 1739 (SEQ ID NO:1346), GHPO
1741 (SEQ ID NO:1348), GHPO 1747 (SEQ ID NO:1350), GHPO 1749 (SEQ
ID NO:1352), GHPO 1750 (SEQ ID NO:1354), GHPO 1751 (SEQ ID
NO:1356), GHPO 1755 (SEQ ID NO:1358), GHPO 1771 (SEQ ID NO:1360),
GHPO 1786 (SEQ ID NO:1362), and GHPO 1789 (SEQ ID NO:1364), which
can be used, e.g., in methods to prevent, treat, or diagnose Helicobacter
infection. The sequences of polynucleotides that encode these polypeptides are
shown in the sequence listing (odd numbers, up to SEQ ID NO: 1363). Those
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skilled in the art will understand that the invention also includes polynucleotide 
molecules that encode mutants and derivatives of these polypeptides, which can 
result from the addition, deletion, or substitution of non-essential amino acids,
as is described further below.
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In addition to the polynucleotide molecules described above, the 
invention includes the corresponding polypeptides (i.e., polypeptides encoded
by the polynucleotide molecules of the invention, or fragments thereof), and 
monospecific antibodies that specifically bind to these polypeptides. The 
polypeptides of the invention include those having the amino acid sequences 
shown in the sequence listing (even numbers, up to SEQ ID NO: 1364), as well
as mature forms of proteins having sequences shown in the sequence listing in 
their unprocessed forms, and fragments thereof. 
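
The numbering convention described above (polynucleotides under odd SEQ ID numbers, polypeptides under even SEQ ID numbers) can be illustrated with a minimal sketch. The pairing rule used here, that each polypeptide is encoded by the polynucleotide with the immediately preceding odd number, is an assumption inferred from that convention and is not stated explicitly in the text. The names parse_designations and polynucleotide_seq_id are hypothetical helpers introduced only for illustration.

```python
import re

# Matches designations such as "GHPO 35 (SEQ ID NO:2)" or "GHPO 1789 (SEQ ID NO: 1364)".
ENTRY = re.compile(r"GHPO\s+(\d+)\s+\(SEQ ID NO:\s*(\d+)\)")


def parse_designations(text):
    """Return (GHPO number, polypeptide SEQ ID NO) pairs found in text."""
    return [(int(ghpo), int(seq_id)) for ghpo, seq_id in ENTRY.findall(text)]


def polynucleotide_seq_id(polypeptide_seq_id):
    """Map an even polypeptide SEQ ID NO to the odd SEQ ID NO just before it.

    ASSUMPTION: sequences are listed as adjacent (odd, even) pairs; the text
    only says that polynucleotides are odd and polypeptides are even.
    """
    if polypeptide_seq_id < 2 or polypeptide_seq_id % 2 != 0:
        raise ValueError("polypeptide SEQ ID numbers are expected to be even")
    return polypeptide_seq_id - 1


sample = "GHPO 35 (SEQ ID NO:2), GHPO 55 (SEQ ID NO:4), and GHPO 1789 (SEQ ID NO: 1364)"
pairs = parse_designations(sample)
for ghpo, seq_id in pairs:
    print(f"GHPO {ghpo}: polypeptide SEQ ID NO:{seq_id}, "
          f"polynucleotide SEQ ID NO:{polynucleotide_seq_id(seq_id)}")
```

Under the stated assumption, the last polypeptide in the list (SEQ ID NO:1364) pairs with polynucleotide SEQ ID NO:1363, which matches the upper bound given for the odd-numbered sequences.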






The present invention has many applications and includes expression 
cassettes, vectors, and cells transformed or transfected with the polynucleotides
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GHPO 1287 (SEQ ID NO: 1200), GHPO 1292 (SEQ ID NO: 1202), GHPO 
1293 (SEQ ID NO: 1204), GHPO 1302 (SEQ ID NO: 1206), GHPO 1309 (SEQ 
ID NO: 1208), GHPO 1317 (SEQ ID NO:1210), GHPO 1318 (SEQ ID 
NO:1212), GHPO 1321 (SEQ ID NO:1214), GHPO 1325 (SEQ ID NO:1216), 
GHPO 1341 (SEQ ID NO:1218), GHPO 1351 (SEQ ID NO:1220), GHPO 
1354 (SEQ ID NO:1222), GHPO 1363 (SEQ ID NO:1224), GHPO 1371 (SEQ
ID NO:1226), GHPO 1381 (SEQ ID NO:1228), GHPO 1401 (SEQ ID
NO:1230), GHPO 1402 (SEQ ID NO:1232), GHPO 1403 (SEQ ID NO:1234), 
GHPO 1408 (SEQ ID NO: 1236), GHPO 1416 (SEQ ID NO: 1238), GHPO 
1420 (SEQ ID NO:1240), GHPO 1428 (SEQ ID NO:1242), GHPO 1437 (SEQ
ID NO:1244), GHPO 1439 (SEQ ID NO:1246), GHPO 1460 (SEQ ID
NO:1248), GHPO 1463 (SEQ ID NO:1250), GHPO 1472 (SEQ ID NO:1252),
GHPO 1474 (SEQ ID NO: 1254), GHPO 1484 (SEQ ID NO: 1256), GHPO 
1489 (SEQ ID NO:1258), GHPO 1494 (SEQ ID NO:1260), GHPO 1495 (SEQ 
ID NO:1262), GHPO 1498 (SEQ ID NO:1264), GHPO 1499 (SEQ ID
NO:1266), GHPO 1500 (SEQ ID NO:1268), GHPO 1503 (SEQ ID NO:1270),
GHPO 1504 (SEQ ID NO:1272), GHPO 1510 (SEQ ID NO:1274), GHPO
1518 (SEQ ID NO:1276), GHPO 1533 (SEQ ID NO:1278), GHPO 1541 (SEQ 
ID NO: 1280), GHPO 1544 (SEQ ID NO: 1282), GHPO 1548 (SEQ ID 
NO:1284), GHPO 1565 (SEQ ID NO:1286), GHPO 1575 (SEQ ID NO:1288),
GHPO 1582 (SEQ ID NO:1290), GHPO 1595 (SEQ ID NO:1292), GHPO
1597 (SEQ ID NO:1294), GHPO 1599 (SEQ ID NO:1296), GHPO 1601 (SEQ
ID NO:1298), GHPO 1609 (SEQ ID NO:1300), GHPO 1613 (SEQ ID
NO:1302), GHPO 1614 (SEQ ID NO:1304), GHPO 1626 (SEQ ID NO:1306),
GHPO 1628 (SEQ ID NO:1308), GHPO 1639 (SEQ ID NO:1310), GHPO
1640 (SEQ ID NO:1312), GHPO 1641 (SEQ ID NO:1314), GHPO 1646 (SEQ
ID NO:1316), GHPO 1662 (SEQ ID NO:1318), GHPO 1667 (SEQ ID
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GHPO 951 (SEQ ID NO:1078), GHPO 961 (SEQ ID NO:1080), GHPO 965 
(SEQ ID NO: 1082), GHPO 990 (SEQ ID NO: 1084), GHPO 991 (SEQ ID 
NO:1086), GHPO 998 (SEQ ID NO:1088), GHPO 1001 (SEQ ID NO:1090), 
GHPO 1005 (SEQ ID NO: 1092), GHPO 1033 (SEQ ID NO: 1094), GHPO 
1039 (SEQ ID NO:1096), GHPO 1041 (SEQ ID NO:1098), GHPO 1043 (SEQ 
ID NO:1100), GHPO 1044 (SEQ ID NO:1102), GHPO 1051 (SEQ ID
NO:1104), GHPO 1058 (SEQ ID NO:1106), GHPO 1060 (SEQ ID NO:1108),
GHPO 1075 (SEQ ID NO:1110), GHPO 1077 (SEQ ID NO:1112), GHPO
1082 (SEQ ID NO:1114), GHPO 1083 (SEQ ID NO:1116), GHPO 1086 (SEQ
ID NO:1118), GHPO 1087 (SEQ ID NO:1120), GHPO 1090 (SEQ ID
NO:1122), GHPO 1097 (SEQ ID NO:1124), GHPO 1098 (SEQ ID NO:1126),
GHPO 1103 (SEQ ID NO:1128), GHPO 1113 (SEQ ID NO:1130), GHPO
1116 (SEQ ID NO:1132), GHPO 1123 (SEQ ID NO:1134), GHPO 1125 (SEQ
ID NO:1136), GHPO 1129 (SEQ ID NO:1138), GHPO 1130 (SEQ ID
NO:1140), GHPO 1134 (SEQ ID NO:1142), GHPO 1161 (SEQ ID NO:1144),
GHPO 1166 (SEQ ID NO:1146), GHPO 1170 (SEQ ID NO:1148), GHPO
1175 (SEQ ID NO:1150), GHPO 1181 (SEQ ID NO:1152), GHPO 1186 (SEQ
ID NO:1154), GHPO 1188 (SEQ ID NO:1156), GHPO 1191 (SEQ ID
NO:1158), GHPO 1193 (SEQ ID NO:1160), GHPO 1196 (SEQ ID NO:1162),
GHPO 1204 (SEQ ID NO:1164), GHPO 1210 (SEQ ID NO:1166), GHPO
1211 (SEQ ID NO:1168), GHPO 1216 (SEQ ID NO:1170), GHPO 1218 (SEQ
ID NO:1172), GHPO 1220 (SEQ ID NO:1174), GHPO 1223 (SEQ ID
NO:1176), GHPO 1226 (SEQ ID NO:1178), GHPO 1240 (SEQ ID NO:1180),
GHPO 1246 (SEQ ID NO:1182), GHPO 1251 (SEQ ID NO:1184), GHPO
1252 (SEQ ID NO:1186), GHPO 1261 (SEQ ID NO:1188), GHPO 1265 (SEQ
ID NO:1190), GHPO 1267 (SEQ ID NO:1192), GHPO 1278 (SEQ ID
NO:1194), GHPO 1282 (SEQ ID NO:1196), GHPO 1283 (SEQ ID NO:1198),
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532 (SEQ ID NO:948), GHPO 548 (SEQ ID NO:950), GHPO 561 (SEQ ID 
NO:952), GHPO 564 (SEQ ID NO:954), GHPO 572 (SEQ ID NO:956), GHPO 
573 (SEQ ID NO:958), GHPO 574 (SEQ ID NO:960), GHPO 577 (SEQ ID 
NO:962), GHPO 579 (SEQ ID NO:964), GHPO 583 (SEQ ID NO:966), GHPO 
588 (SEQ ID NO:968), GHPO 593 (SEQ ID NO:970), GHPO 597 (SEQ ID 
NO:972), GHPO 598 (SEQ ID NO:974), GHPO 604 (SEQ ID NO:976), GHPO
606 (SEQ ID NO:978), GHPO 611 (SEQ ID NO:980), GHPO 612 (SEQ ID
NO:982), GHPO 615 (SEQ ID NO:984), GHPO 632 (SEQ ID NO:986), GHPO 
633 (SEQ ID NO:988), GHPO 637 (SEQ ID NO:990), GHPO 651 (SEQ ID 
NO:992), GHPO 663 (SEQ ID NO:994), GHPO 686 (SEQ ID NO:996), GHPO 
693 (SEQ ID NO:998), GHPO 698 (SEQ ID NO:1000), GHPO 703 (SEQ ID
NO:1002), GHPO 704 (SEQ ID NO:1004), GHPO 705 (SEQ ID NO:1006),
GHPO 707 (SEQ ID NO: 1008), GHPO 721 (SEQ ID NO: 1010), GHPO 727 
(SEQ ID NO: 1012), GHPO 728 (SEQ ID NO:1014), GHPO 733 (SEQ ID 
NO:1016), GHPO 758 (SEQ ID NO:1018), GHPO 763 (SEQ ID NO:1020), 
GHPO 771 (SEQ ID NO:1022), GHPO 774 (SEQ ID NO:1024), GHPO 776
(SEQ ID NO:1026), GHPO 783 (SEQ ID NO:1028), GHPO 800 (SEQ ID
NO: 1030), GHPO 806 (SEQ ID NO: 1032), GHPO 807 (SEQ ID NO: 1034), 
GHPO 808 (SEQ ID NO:1036), GHPO 809 (SEQ ID NO:1038), GHPO 811
(SEQ ID NO:1040), GHPO 815 (SEQ ID NO:1042), GHPO 819 (SEQ ID
NO:1044), GHPO 841 (SEQ ID NO:1046), GHPO 843 (SEQ ID NO:1048),
GHPO 846 (SEQ ID NO:1050), GHPO 875 (SEQ ID NO:1052), GHPO 892
(SEQ ID NO: 1054), GHPO 902 (SEQ ID NO:1056), GHPO 904 (SEQ ID 
NO:1058), GHPO 906 (SEQ ID NO:1060), GHPO 908 (SEQ ID NO:1062), 
GHPO 921 (SEQ ID NO: 1064), GHPO 923 (SEQ ID NO: 1066), GHPO 926 
(SEQ ID NO:1068), GHPO 933 (SEQ ID NO:1070), GHPO 939 (SEQ ID
NO:1072), GHPO 940 (SEQ ID NO:1074), GHPO 943 (SEQ ID NO:1076),
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NO:812), GHPO 166 (SEQ ID NO:814), GHPO 181 (SEQ ID NO:816), GHPO 
187 (SEQ ID NO:818), GHPO 188 (SEQ ID NO: 820), GHPO 192 (SEQ ID 
NO:822), GHPO 202 (SEQ ID NO:824), GHPO 204 (SEQ ID NO:826), GHPO 
205 (SEQ ID NO:828), GHPO 212 (SEQ ID NO:830), GHPO 218 (SEQ ID 
NO:832), GHPO 226 (SEQ ID NO:834), GHPO 231 (SEQ ID NO:836), GHPO 
236 (SEQ ID NO:838), GHPO 239 (SEQ ID NO:840), GHPO 245 (SEQ ID
NO:842), GHPO 246 (SEQ ID NO:844), GHPO 248 (SEQ ID NO:846), GHPO
253 (SEQ ID NO:848), GHPO 265 (SEQ ID NO:850), GHPO 266 (SEQ ID 
NO:852), GHPO 271 (SEQ ID NO:854), GHPO 272 (SEQ ID NO:856), GHPO 
286 (SEQ ID NO:858), GHPO 291 (SEQ ID NO:860), GHPO 292 (SEQ ID 
NO:862), GHPO 297 (SEQ ID NO:864), GHPO 304 (SEQ ID NO:866), GHPO
307 (SEQ ID NO:868), GHPO 324 (SEQ ID NO:870), GHPO 326 (SEQ ID
NO:872), GHPO 331 (SEQ ID NO:874), GHPO 343 (SEQ ID NO:876), GHPO 
345 (SEQ ID NO:878), GHPO 346 (SEQ ID NO:880), GHPO 352 (SEQ ID 
NO:882), GHPO 355 (SEQ ID NO:884), GHPO 363 (SEQ ID NO:886), GHPO
369 (SEQ ID NO:888), GHPO 376 (SEQ ID NO:890), GHPO 378 (SEQ ID
NO:892), GHPO 388 (SEQ ID NO:894), GHPO 396 (SEQ ID NO:896), GHPO
403 (SEQ ID NO:898), GHPO 410 (SEQ ID NO:900), GHPO 415 (SEQ ID 
NO:902), GHPO 421 (SEQ ID NO:904), GHPO 439 (SEQ ID NO:906), GHPO 
441 (SEQ ID NO:908), GHPO 443 (SEQ ID NO:910), GHPO 453 (SEQ ID 
NO:912), GHPO 455 (SEQ ID NO:914), GHPO 464 (SEQ ID NO:916), GHPO
467 (SEQ ID NO:918), GHPO 468 (SEQ ID NO:920), GHPO 470 (SEQ ID
NO:922), GHPO 486 (SEQ ID NO:924), GHPO 487 (SEQ ID NO:926), GHPO 
488 (SEQ ID NO:928), GHPO 489 (SEQ ID NO:930), GHPO 498 (SEQ ID 
NO:932), GHPO 501 (SEQ ID NO:934), GHPO 504 (SEQ ID NO:936), GHPO 
512 (SEQ ID NO:938), GHPO 517 (SEQ ID NO:940), GHPO 520 (SEQ ID
NO:942), GHPO 528 (SEQ ID NO:944), GHPO 530 (SEQ ID NO:946), GHPO
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NO:680), GHPO 1539 (SEQ ID NO:682), GHPO 1540 (SEQ ID NO:684), 
GHPO 1542 (SEQ ID NO:686), GHPO 1555 (SEQ ID NO:688), GHPO 1560 
(SEQ ID NO:690), GHPO 1564 (SEQ ID NO:692), GHPO 1570 (SEQ ID 
NO:694), GHPO 1588 (SEQ ID NO:696), GHPO 1604 (SEQ ID NO:698), 
GHPO 1605 (SEQ ID NO:700), GHPO 1619 (SEQ ID NO:702), GHPO 1629
(SEQ ID NO:704), GHPO 1642 (SEQ ID NO:706), GHPO 1654 (SEQ ID 
NO:708), GHPO 1661 (SEQ ID NO:710), GHPO 1673 (SEQ ID NO:712), 
GHPO 1687 (SEQ ID NO:714), GHPO 1692 (SEQ ID NO:716), GHPO 1693 
(SEQ ID NO:718), GHPO 1699 (SEQ ID NO:720), GHPO 1738 (SEQ ID 
NO:722), GHPO 1745 (SEQ ID NO:724), GHPO 1746 (SEQ ID NO:726), 
GHPO 1754 (SEQ ID NO:728), GHPO 1792 (SEQ ID NO:730), GHPO 1795 
(SEQ ID NO:732), GHPO 1796 (SEQ ID NO:734), GHPO 7 (SEQ ID 
NO:736), GHPO 8 (SEQ ID NO:738), GHPO 9 (SEQ ID NO:740), GHPO 10 
(SEQ ID NO:742), GHPO 12 (SEQ ID NO:744), GHPO 25 (SEQ ID NO:746), 
GHPO 27 (SEQ ID NO:748), GHPO 29 (SEQ ID NO:750), GHPO 30 (SEQ ID 
NO:752), GHPO 37 (SEQ ID NO:754), GHPO 49 (SEQ ID NO:756), GHPO 
51 (SEQ ID NO:758), GHPO 54 (SEQ ID NO:760), GHPO 65 (SEQ ID 
NO:762), GHPO 66 (SEQ ID NO:764), GHPO 68 (SEQ ID NO:766), GHPO 
70 (SEQ ID NO:768), GHPO 77 (SEQ ID NO:770), GHPO 83 (SEQ ID 
NO:772), GHPO 85 (SEQ ID NO:774), GHPO 87 (SEQ ID NO:776), GHPO 
91 (SEQ ID NO:778), GHPO 92 (SEQ ID NO:780), GHPO 96 (SEQ ID 
NO:782), GHPO 97 (SEQ ID NO:784), GHPO 111 (SEQ ID NO:786), GHPO
115 (SEQ ID NO:788), GHPO 117 (SEQ ID NO:790), GHPO 123 (SEQ ID 
NO:792), GHPO 124 (SEQ ID NO:794), GHPO 126 (SEQ ID NO:796), GHPO 
127 (SEQ ID NO:798), GHPO 128 (SEQ ID NO:800), GHPO 131 (SEQ ID 
NO:802), GHPO 133 (SEQ ID NO:804), GHPO 140 (SEQ ID NO:806), GHPO 
141 (SEQ ID NO:808), GHPO 145 (SEQ ID NO:810), GHPO 147 (SEQ ID 
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903 (SEQ ID NO:554), GHPO 912 (SEQ ID NO:556), GHPO 946 (SEQ ID 
NO:558), GHPO 958 (SEQ ID NO:560), GHPO 968 (SEQ ID NO:562), GHPO 
987 (SEQ ID NO:564), GHPO 992 (SEQ ID NO:566), GHPO 996 (SEQ ID 
NO:568), GHPO 997 (SEQ ID NO:570), GHPO 1002 (SEQ ID NO:572), 
GHPO 1026 (SEQ ID NO:574), GHPO 1028 (SEQ ID NO:576), GHPO 1034 
(SEQ ID NO:578), GHPO 1038 (SEQ ID NO:580), GHPO 1059 (SEQ ID 
NO:582), GHPO 1065 (SEQ ID NO:584), GHPO 1072 (SEQ ID NO:586),
GHPO 1073 (SEQ ID NO:588), GHPO 1088 (SEQ ID NO:590), GHPO 1091 
(SEQ ID NO:592), GHPO 1105 (SEQ ID NO:594), GHPO 1115 (SEQ ID
NO:596), GHPO 1159 (SEQ ID NO:598), GHPO 1177 (SEQ ID NO:600),
GHPO 1187 (SEQ ID NO:602), GHPO 1192 (SEQ ID NO:604), GHPO 1195
(SEQ ID NO:606), GHPO 1224 (SEQ ID NO:608), GHPO 1225 (SEQ ID 
NO:610), GHPO 1228 (SEQ ID NO:612), GHPO 1229 (SEQ ID NO:614), 
GHPO 1231 (SEQ ID NO:616), GHPO 1236 (SEQ ID NO:618), GHPO 1242 
(SEQ ID NO:620), GHPO 1248 (SEQ ID NO:622), GHPO 1270 (SEQ ID 
NO:624), GHPO 1271 (SEQ ID NO:626), GHPO 1298 (SEQ ID NO:628), 
GHPO 1301 (SEQ ID NO:630), GHPO 1304 (SEQ ID NO:632), GHPO 1315 
(SEQ ID NO:634), GHPO 1319 (SEQ ID NO:636), GHPO 1323 (SEQ ID 
NO:638), GHPO 1331 (SEQ ID NO:640), GHPO 1332 (SEQ ID NO:642), 
GHPO 1347 (SEQ ID NO:644), GHPO 1373 (SEQ ID NO:646), GHPO 1376 
(SEQ ID NO:648), GHPO 1380 (SEQ ID NO:650), GHPO 1394 (SEQ ID 
NO:652), GHPO 1407 (SEQ ID NO:654), GHPO 1415 (SEQ ID NO:656), 
GHPO 1425 (SEQ ID NO:658), GHPO 1427 (SEQ ID NO:660), GHPO 1444 
(SEQ ID NO:662), GHPO 1449 (SEQ ID NO:664), GHPO 1465 (SEQ ID 
NO:666), GHPO 1475 (SEQ ID NO:668), GHPO 1479 (SEQ ID NO:670), 
GHPO 1483 (SEQ ID NO:672), GHPO 1488 (SEQ ID NO:674), GHPO 1496 
(SEQ ID NO:676), GHPO 1524 (SEQ ID NO:678), GHPO 1536 (SEQ ID 
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NO:418), GHPO 251 (SEQ ID NO:420), GHPO 267 (SEQ ID NO:422), GHPO 
269 (SEQ ID NO:424), GHPO 279 (SEQ ID NO:426), GHPO 284 (SEQ ID 
NO:428), GHPO 296 (SEQ ID NO:430), GHPO 300 (SEQ ID NO:432), GHPO 
305 (SEQ ID NO:434), GHPO 319 (SEQ ID NO:436), GHPO 330 (SEQ ID 
NO:438), GHPO 340 (SEQ ID NO:440), GHPO 342 (SEQ ID NO:442), GHPO 
344 (SEQ ID NO:444), GHPO 358 (SEQ ID NO:446), GHPO 373 (SEQ ID
NO:448), GHPO 382 (SEQ ID NO:450), GHPO 384 (SEQ ID NO:452), GHPO
398 (SEQ ID NO:454), GHPO 409 (SEQ ID NO:456), GHPO 422 (SEQ ID 
NO:458), GHPO 430 (SEQ ID NO:460), GHPO 446 (SEQ ID NO:462), GHPO 
447 (SEQ ID NO:464), GHPO 450 (SEQ ID NO:466), GHPO 451 (SEQ ID
NO:468), GHPO 452 (SEQ ID NO:470), GHPO 456 (SEQ ID NO:472), GHPO
461 (SEQ ID NO:474), GHPO 476 (SEQ ID NO:476), GHPO 478 (SEQ ID
NO:478), GHPO 491 (SEQ ID NO:480), GHPO 511 (SEQ ID NO:482), GHPO 
519 (SEQ ID NO:484), GHPO 526 (SEQ ID NO:486), GHPO 534 (SEQ ID 
NO:488), GHPO 536 (SEQ ID NO:490), GHPO 542 (SEQ ID NO:492), GHPO
544 (SEQ ID NO:494), GHPO 576 (SEQ ID NO:496), GHPO 578 (SEQ ID
NO:498), GHPO 580 (SEQ ID NO:500), GHPO 585 (SEQ ID NO:502), GHPO
599 (SEQ ID NO:504), GHPO 639 (SEQ ID NO:506), GHPO 642 (SEQ ID 
NO:508), GHPO 647 (SEQ ID NO:510), GHPO 654 (SEQ ID NO:512), GHPO 
669 (SEQ ID NO:514), GHPO 710 (SEQ ID NO:516), GHPO 713 (SEQ ID
NO:518), GHPO 716 (SEQ ID NO:520), GHPO 718 (SEQ ID NO:522), GHPO
726 (SEQ ID NO:524), GHPO 734 (SEQ ID NO:526), GHPO 740 (SEQ ID
NO:528), GHPO 770 (SEQ ID NO:530), GHPO 782 (SEQ ID NO:532), GHPO 
786 (SEQ ID NO:534), GHPO 792 (SEQ ID NO:536), GHPO 797 (SEQ ID 
NO:538), GHPO 816 (SEQ ID NO:540), GHPO 828 (SEQ ID NO:542), GHPO
839 (SEQ ID NO:544), GHPO 840 (SEQ ID NO:546), GHPO 842 (SEQ ID
NO:548), GHPO 885 (SEQ ID NO:550), GHPO 889 (SEQ ID NO:552), GHPO
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475 (SEQ ID NO:288), GHPO 515 (SEQ ID NO:290), GHPO 527 (SEQ ID 
NO:292), GHPO 551 (SEQ ID NO:294), GHPO 602 (SEQ ID NO:296), GHPO 
626 (SEQ ID NO:298), GHPO 646 (SEQ ID NO:300), GHPO 653 (SEQ ID 
NO:302), GHPO 655 (SEQ ID NO:304), GHPO 670 (SEQ ID NO:306), GHPO 
739 (SEQ ID NO:308), GHPO 798 (SEQ ID NO:310), GHPO 1102 (SEQ ID
NO:312), GHPO 1114 (SEQ ID NO:314), GHPO 1152 (SEQ ID NO:316),
GHPO 1272 (SEQ ID NO:318), GHPO 1345 (SEQ ID NO:320), GHPO 1377
(SEQ ID NO:322), GHPO 1424 (SEQ ID NO:324), GHPO 1430 (SEQ ID 
NO:326), GHPO 1502 (SEQ ID NO:328), GHPO 1600 (SEQ ID NO:330), 
GHPO 1714 (SEQ ID NO:332), GHPO 359 (SEQ ID NO:334), GHPO 678 
(SEQ ID NO:336), GHPO 708 (SEQ ID NO:338), GHPO 759 (SEQ ID
NO:340), GHPO 847 (SEQ ID NO:342), GHPO 1050 (SEQ ID NO:344),
GHPO 1101 (SEQ ID NO:346), GHPO 1120 (SEQ ID NO:348), GHPO 1138
(SEQ ID NO:350), GHPO 1310 (SEQ ID NO:352), GHPO 1320 (SEQ ID
NO:354), GHPO 1375 (SEQ ID NO:356), GHPO 1432 (SEQ ID NO:358),
GHPO 21 (SEQ ID NO:360), GHPO 282 (SEQ ID NO:362), GHPO 1089
(SEQ ID NO:364), GHPO 1141 (SEQ ID NO:366), GHPO 1280 (SEQ ID
NO:368), GHPO 1608 (SEQ ID NO:370), GHPO 15 (SEQ ID NO:372), GHPO 
16 (SEQ ID NO:374), GHPO 36 (SEQ ID NO:376), GHPO 38 (SEQ ID 
NO:378), GHPO 52 (SEQ ID NO:380), GHPO 57 (SEQ ID NO:382), GHPO 
64 (SEQ ID NO:384), GHPO 79 (SEQ ID NO:386), GHPO 84 (SEQ ID
NO:388), GHPO 86 (SEQ ID NO:390), GHPO 99 (SEQ ID NO:392), GHPO
106 (SEQ ID NO:394), GHPO 118 (SEQ ID NO:396), GHPO 122 (SEQ ID 
NO:398), GHPO 128 (SEQ ID NO:400), GHPO 138 (SEQ ID NO:402), GHPO 
153 (SEQ ID NO:404), GHPO 160 (SEQ ID NO:406), GHPO 168 (SEQ ID 
NO:408), GHPO 179 (SEQ ID NO:410), GHPO 189 (SEQ ID NO:412), GHPO
229 (SEQ ID NO:414), GHPO 243 (SEQ ID NO:416), GHPO 244 (SEQ ID
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NO:158), GHPO 989 (SEQ ID NO:160), GHPO 1111 (SEQ ID NO: 162), 
GHPO 1145 (SEQ ID NO:164), GHPO 1256 (SEQ ID NO:166), GHPO 1264
(SEQ ID NO:168), GHPO 1316 (SEQ ID NO:170), GHPO 1368 (SEQ ID
NO:172), GHPO 1442 (SEQ ID NO:174), GHPO 1506 (SEQ ID NO:176),
GHPO 1543 (SEQ ID NO:178), GHPO 1574 (SEQ ID NO:180), GHPO 1627
(SEQ ID NO:182), GHPO 1657 (SEQ ID NO:184), GHPO 1664 (SEQ ID
NO:186), GHPO 1694 (SEQ ID NO:188), GHPO 1704 (SEQ ID NO:190),
GHPO 1763 (SEQ ID NO: 192), GHPO 616 (SEQ ID NO: 194), GHPO 76 
(SEQ ID NO: 196), GHPO 109 (SEQ ID NO: 198), GHPO 163 (SEQ ID 
NO:200), GHPO 169 (SEQ ID NO:202), GHPO 208 (SEQ ID NO:204), GHPO
219 (SEQ ID NO:206), GHPO 445 (SEQ ID NO:208), GHPO 479 (SEQ ID
NO:210), GHPO 525 (SEQ ID NO:212), GHPO 535 (SEQ ID NO:214), GHPO
731 (SEQ ID NO:216), GHPO 836 (SEQ ID NO:218), GHPO 879 (SEQ ID 
NO:220), GHPO 881 (SEQ ID NO:222), GHPO 886 (SEQ ID NO:224), GHPO 
893 (SEQ ID NO:226), GHPO 894 (SEQ ID NO:228), GHPO 976 (SEQ ID
NO:230), GHPO 1011 (SEQ ID NO:232), GHPO 1024 (SEQ ID NO:234),
GHPO 1084 (SEQ ID NO:236), GHPO 1329 (SEQ ID NO:238), GHPO 1330
(SEQ ID NO:240), GHPO 1346 (SEQ ID NO:242), GHPO 1360 (SEQ ID
NO:244), GHPO 1388 (SEQ ID NO:246), GHPO 1411 (SEQ ID NO:248),
GHPO 1419 (SEQ ID NO:250), GHPO 1446 (SEQ ID NO:252), GHPO 1469
(SEQ ID NO:254), GHPO 1501 (SEQ ID NO:256), GHPO 1505 (SEQ ID
NO:258), GHPO 1522 (SEQ ID NO:260), GHPO 1525 (SEQ ID NO:262),
GHPO 1615 (SEQ ID NO:264), GHPO 1689 (SEQ ID NO:266), GHPO 1733 
(SEQ ID NO:268), GHPO 18 (SEQ ID NO:270), GHPO 139 (SEQ ID 
NO:272), GHPO 142 (SEQ ID NO:274), GHPO 250 (SEQ ID NO:276), GHPO
257 (SEQ ID NO:278), GHPO 325 (SEQ ID NO:280), GHPO 355 (SEQ ID
NO:282), GHPO 357 (SEQ ID NO:284), GHPO 454 (SEQ ID NO:286), GHPO
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NO:24), GHPO 1034 (SEQ ID NO:26), GHPO 1085 (SEQ ID NO:28), GHPO 
1213 (SEQ ID NO:30), GHPO 1255 (SEQ ID NO:32), GHPO 1308 (SEQ ID 
NO:34), GHPO 1389 (SEQ ID NO:36), GHPO 1706 (SEQ ID NO:38), GHPO 
234 (SEQ ID NO:40), GHPO 314 (SEQ ID NO:42), GHPO 510 (SEQ ID 
NO:44), GHPO 603 (SEQ ID NO:46), GHPO 937 (SEQ ID NO:48), GHPO 
1027 (SEQ ID NO:50), GHPO 1099 (SEQ ID NO:52), GHPO 1151 (SEQ ID
NO:54), GHPO 1275 (SEQ ID NO:56), GHPO 1365 (SEQ ID NO:58), GHPO
1578 (SEQ ID NO:60), GHPO 22 (SEQ ID NO:62), GHPO 58 (SEQ ID 
NO:64), GHPO 200 (SEQ ID NO:66), GHPO 558 (SEQ ID NO:68), GHPO 
563 (SEQ ID NO:70), GHPO 695 (SEQ ID NO:72), GHPO 699 (SEQ ID 
NO:74), GHPO 702 (SEQ ID NO:76), GHPO 709 (SEQ ID NO:78), GHPO
741 (SEQ ID NO:80), GHPO 762 (SEQ ID NO:82), GHPO 827 (SEQ ID
NO:84), GHPO 852 (SEQ ID NO:86), GHPO 1013 (SEQ ID NO:88), GHPO 
1020 (SEQ ID NO:90), GHPO 1031 (SEQ ID NO:92), GHPO 1052 (SEQ ID 
NO:94), GHPO 1127 (SEQ ID NO:96), GHPO 1149 (SEQ ID NO:98), GHPO
1176 (SEQ ID NO:100), GHPO 1250 (SEQ ID NO:102), GHPO 1312 (SEQ ID
NO:104), GHPO 1358 (SEQ ID NO:106), GHPO 1490 (SEQ ID NO:108),
GHPO 1559 (SEQ ID NO:110), GHPO 1651 (SEQ ID NO:112), GHPO 1726
(SEQ ID NO:114), GHPO 1780 (SEQ ID NO:116), GHPO 895 (SEQ ID
NO:118), GHPO 1447 (SEQ ID NO:120), GHPO 28 (SEQ ID NO:122), GHPO
86 (SEQ ID NO:124), GHPO 155 (SEQ ID NO:126), GHPO 157 (SEQ ID
NO:128), GHPO 237 (SEQ ID NO:130), GHPO 290 (SEQ ID NO:132), GHPO
293 (SEQ ID NO:134), GHPO 335 (SEQ ID NO:136), GHPO 374 (SEQ ID
NO:138), GHPO 442 (SEQ ID NO:140), GHPO 480 (SEQ ID NO:142), GHPO
523 (SEQ ID NO:144), GHPO 610 (SEQ ID NO:146), GHPO 675 (SEQ ID
NO:148), GHPO 690 (SEQ ID NO:150), GHPO 829 (SEQ ID NO:152), GHPO
850 (SEQ ID NO:154), GHPO 876 (SEQ ID NO:156), GHPO 984 (SEQ ID
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al., J. Biol. Chem. 265:9464, 1990; Evans et al., Microbial Pathogenesis 10:15,
1991; Labigne et al., J. Bact. 173:1920, 1991); the 87 kDa vacuolar cytotoxin
(VacA) (Cover et al., J. Biol. Chem. 267:10570, 1992; Phadnis et al., Infect.
Immun. 62:1557, 1994; WO 93/18150); a 128 kDa immunodominant antigen
associated with the cytotoxin (CagA, also called TagA; WO 93/18150; U.S.
Patent No. 5,403,924); 13 and 58 kDa heat shock proteins HspA and HspB
(Suerbaum et al., Mol. Microbiol. 14:959, 1994; WO 93/18150); a 54 kDa
catalase (Hazell et al., J. Gen. Microbiol. 137:57, 1991); a 15 kDa histidine-rich
protein (Hpn) (Gilbert et al., Infect. Immun. 63:2682, 1995); a 20 kDa
membrane-associated lipoprotein (Kostrzynska et al., J. Bact. 176:5938, 1994);
a 30 kDa outer membrane protein (Bolin et al., J. Clin. Microbiol. 33:381,
1995); a lactoferrin receptor (FR 2,724,936); and several porins, designated
HopA, HopB, HopC, HopD, and HopE, which have molecular weights of
48-67 kDa (Exner et al., Infect. Immun. 63:1567, 1995; Doig et al., J. Bact.
177:5447, 1995). Some of these proteins have been proposed as potential
vaccine antigens. In particular, urease is believed to be a vaccine candidate
(WO 94/9823; WO 95/22987; WO 95/3824; Michetti et al., Gastroenterology
107:1002, 1994). Nevertheless, it is thought that several antigens may
ultimately be necessary in a vaccine.

Summary of the Invention 
The invention provides polynucleotide molecules that encode 
Helicobacter polypeptides, designated GHPO 35 (SEQ ID NO:2), GHPO 55
(SEQ ID NO:4), GHPO 78 (SEQ ID NO:6), GHPO 89 (SEQ ID NO:8), GHPO
129 (SEQ ID NO:10), GHPO 541 (SEQ ID NO:12), GHPO 607 (SEQ ID
NO:14), GHPO 635 (SEQ ID NO:16), GHPO 701 (SEQ ID NO:18), GHPO 
712 (SEQ ID NO:20), GHPO 761 (SEQ ID NO:22), GHPO 838 (SEQ ID 
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IDENTIFICATION OF POLYNUCLEOTIDES ENCODING NOVEL
HELICOBACTER POLYPEPTIDES IN THE HELICOBACTER GENOME 
The invention relates to Helicobacter antigens and corresponding 
polynucleotide molecules that can be used in methods to prevent or treat 
Helicobacter infection in mammals, such as humans. 

Background of the Invention 
Helicobacter is a genus of spiral, gram-negative bacteria that colonize 
the gastrointestinal tracts of mammals. Several species colonize the stomach, 
most notably H. pylori, H. heilmannii, H. felis, and H. mustelae. Although H.
pylori is the species most commonly associated with human infection, H.
heilmannii and H. felis have also been isolated from humans, but at lower
frequencies than H. pylori. Helicobacter infects over 50% of adult populations
in developed countries and nearly 100% in developing countries and some 
Pacific rim countries, making it one of the most prevalent infections 
worldwide. 

Helicobacter is routinely recovered from gastric biopsies of humans 
with histological evidence of gastritis and peptic ulceration. Indeed, H. pylori 
is now recognized as an important pathogen of humans, in that the chronic 
gastritis it causes is a risk factor for the development of peptic ulcer diseases 
and gastric carcinoma. It is thus highly desirable to develop safe and effective 
vaccines for preventing and treating Helicobacter infection. 

A number of Helicobacter antigens have been characterized or isolated. 
These include urease, which is composed of two structural subunits of 
approximately 30 and 67 kDa (Hu et al., Infect. Immun. 58:992, 1990; Dunn et
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